Science.gov

Sample records for affymetrix ath1 genome

  1. Statistical evaluation of transcriptomic data generated using the Affymetrix one-cycle, two-cycle and IVT-Express RNA labelling protocols with the Arabidopsis ATH1 microarray

    PubMed Central

    2010-01-01

    Background Microarrays are a powerful tool used for the determination of global RNA expression. There is an increasing requirement to focus on profiling gene expression in tissues where it is difficult to obtain large quantities of material, for example individual tissues within organs such as the root, or individual isolated cells. From such samples, it is difficult to produce the amount of RNA required for labelling and hybridisation in microarray experiments, thus a process of amplification is usually adopted. Despite the increasing use of two-cycle amplification for transcriptomic analyses on the Affymetrix ATH1 array, there has been no report investigating any potential bias in gene representation that may occur as a result. Results Here we compare transcriptomic data generated using Affymetrix one-cycle (standard labelling protocol), two-cycle (small-sample protocol) and IVT-Express protocols with the Affymetrix ATH1 array using Arabidopsis root samples. Results obtained with each protocol are broadly similar. However, we show that there are 35 probe sets (of a total of 22810) that are misrepresented in the two-cycle data sets. Of these, 33 probe sets were classed as mis-amplified when comparisons of two independent publicly available data sets were undertaken. Conclusions Given the unreliable nature of the highlighted probes, we caution against using data associated with the corresponding genes in analyses involving transcriptomic data generated with two-cycle amplification protocols. We have shown that the Affymetrix IVT-E labelling protocol produces data with less associated bias than the two-cycle protocol, and as such, would recommend this kit for new experiments that involve small samples. PMID:20230623

  2. X:Map: annotation and visualization of genome structure for Affymetrix exon array analysis

    PubMed Central

    Yates, Tim; Okoniewski, Michał J.; Miller, Crispin J.

    2008-01-01

    Affymetrix exon arrays aim to target every known and predicted exon in the human, mouse or rat genomes, and have reporters that extend beyond protein coding regions to other areas of the transcribed genome. This combination of increased coverage and precision is important because a substantial proportion of protein coding genes are predicted to be alternatively spliced, and because many non-coding genes are known also to be of biological significance. In order to fully exploit these arrays, it is necessary to associate each reporter on the array with the features of the genome it is targeting, and to relate these to gene and genome structure. X:Map is a genome annotation database that provides this information. Data can be browsed using a novel Google-maps based interface, and analysed and further visualized through an associated BioConductor package. The database can be found at http://xmap.picr.man.ac.uk. PMID:17932061

  3. ATH1 and KNAT2 proteins act together in regulation of plant inflorescence architecture.

    PubMed

    Li, Yang; Pi, Limin; Huang, Hai; Xu, Lin

    2012-02-01

    The inflorescence of flowering plants is a highly organized structure, not only contributing to plant reproductive processes, but also constituting an important part of the entire plant morphology. Previous studies have revealed that the class-I KNOTTED1-like homeobox (KNOX) genes BREVIPEDICELLUS (BP or KNAT1), KNAT2, and KNAT6 play essential roles in inflorescence architecture. Pedicel morphology is known to contribute greatly to inflorescence architecture, and BP negatively regulates KNAT2 and KNAT6 to ensure that pedicels have a normal upward-pointing orientation. These findings indicate that a genetic network exists in controlling pedicel orientation, but how this network functions in the developmental process remains elusive. Here it is reported that the ARABIDOPSIS THALIANA HOMEOBOX GENE1 (ATH1) gene, which belongs to the BELL1-like homeodomain gene family, is a new member participating in regulating pedicel orientation in the class-I KNOX network. In a genetic screening for suppressors of isoginchaku-2D, a gain-of-function ASYMMETRIC LEAVES2 mutant that displays downward-pointing pedicels, a suppressor mutant was obtained. Characterization of this mutant revealed that the mutation corresponds to ATH1. Genetic analysis indicated that ATH1 acts mainly in the KNAT2 pathway. Yeast two-hybrid and bimolecular fluorescence complementation assays demonstrated that ATH1 physically interacts with KNAT2. The data indicate that the ATH1-KNAT2 complex acts redundantly with KNAT6, both of which are negatively regulated by BP during pedicel development.

  4. Secretion of the acid trehalase encoded by the CgATH1 gene allows trehalose fermentation by Candida glabrata.

    PubMed

    Zilli, D M W; Lopes, R G; Alves, S L; Barros, L M; Miletti, L C; Stambuk, B U

    2015-10-01

    The emergent pathogen Candida glabrata differs from other yeasts because it assimilates only two sugars, glucose and the disaccharide trehalose. Since rapid identification tests are based on the ability of this yeast to rapidly hydrolyze trehalose, in this work a biochemical and molecular characterization of trehalose catabolism by this yeast was performed. Our results show that C. glabrata consumes and ferments trehalose, with parameters similar to those observed during glucose fermentation. The presence of glucose in the medium during exponential growth on trehalose revealed extracellular hydrolysis of the sugar by a cell surface acid trehalase with a pH optimum of 4.4. Approximately ∼30% of the total enzymatic activity is secreted into the medium during growth on trehalose or glycerol. The secreted enzyme shows an apparent molecular mass of 275 kDa in its native form, but denaturant gel electrophoresis revealed a protein with ∼130 kDa, which due to its migration pattern and strong binding to concanavalin A, indicates that it is probably a dimeric glycoprotein. The secreted acid trehalase shows high affinity and activity for trehalose, with Km and Vmax values of 3.4 mM and 80 U (mg protein)(-1), respectively. Cloning of the CgATH1 gene (CAGLOK05137g) from de C. glabrata genome, a gene showing high homology to fungal acid trehalases, allowed trehalose fermentation after heterologous expression in Saccharomyces cerevisiae.

  5. VIZARD: analysis of Affymetrix Arabidopsis GeneChip data

    NASA Technical Reports Server (NTRS)

    Moseyko, Nick; Feldman, Lewis J.

    2002-01-01

    SUMMARY: The Affymetrix GeneChip Arabidopsis genome array has proved to be a very powerful tool for the analysis of gene expression in Arabidopsis thaliana, the most commonly studied plant model organism. VIZARD is a Java program created at the University of California, Berkeley, to facilitate analysis of Arabidopsis GeneChip data. It includes several integrated tools for filtering, sorting, clustering and visualization of gene expression data as well as tools for the discovery of regulatory motifs in upstream sequences. VIZARD also includes annotation and upstream sequence databases for the majority of genes represented on the Affymetrix Arabidopsis GeneChip array. AVAILABILITY: VIZARD is available free of charge for educational, research, and not-for-profit purposes, and can be downloaded at http://www.anm.f2s.com/research/vizard/ CONTACT: moseyko@uclink4.berkeley.edu.

  6. New insights into trehalose metabolism by Saccharomyces cerevisiae: NTH2 encodes a functional cytosolic trehalase, and deletion of TPS1 reveals Ath1p-dependent trehalose mobilization.

    PubMed

    Jules, Matthieu; Beltran, Gemma; François, Jean; Parrou, Jean Luc

    2008-02-01

    In the yeast Saccharomyces cerevisiae, the synthesis of endogenous trehalose is catalyzed by a trehalose synthase complex, TPS, and its hydrolysis relies on a cytosolic/neutral trehalase encoded by NTH1. In this work, we showed that NTH2, a paralog of NTH1, encodes a functional trehalase that is implicated in trehalose mobilization. Yeast is also endowed with an acid trehalase encoded by ATH1 and an H+/trehalose transporter encoded by AGT1, which can together sustain assimilation of exogenous trehalose. We showed that a tps1 mutant defective in the TPS catalytic subunit cultivated on trehalose, or on a dual source of carbon made of galactose and trehalose, accumulated high levels of intracellular trehalose by its Agt1p-mediated transport. The accumulated disaccharide was mobilized as soon as cells entered the stationary phase by a process requiring a coupling between its export and immediate extracellular hydrolysis by Ath1p. Compared to what is seen for classical growth conditions on glucose, this mobilization was rather unique, since it took place prior to that of glycogen, which was postponed until the late stationary phase. However, when the Ath1p-dependent mobilization of trehalose identified in this study was impaired, glycogen was mobilized earlier and faster, indicating a fine-tuning control in carbon storage management during periods of carbon and energy restriction.

  7. Qualitative assessment of gene expression in affymetrix genechip arrays

    NASA Astrophysics Data System (ADS)

    Nagarajan, Radhakrishnan; Upreti, Meenakshi

    2007-01-01

    Affymetrix Genechip microarrays are used widely to determine the simultaneous expression of genes in a given biological paradigm. Probes on the Genechip array are atomic entities which by definition are randomly distributed across the array and in turn govern the gene expression. In the present study, we make several interesting observations. We show that there is considerable correlation between the probe intensities across the array which defy the independence assumption. While the mechanism behind such correlations is unclear, we show that scaling behavior and the profiles of perfect match (PM) as well as mismatch (MM) probes are similar and immune-to-background subtraction. We believe that the observed correlations are possibly an outcome of inherent non-stationarities or patchiness in the array devoid of biological significance. This is demonstrated by inspecting their scaling behavior and profiles of the PM and MM probe intensities obtained from publicly available Genechip arrays from three eukaryotic genomes, namely: Drosophila melanogaster (fruit fly), Homo sapiens (humans) and Mus musculus (house mouse) across distinct biological paradigms and across laboratories, with and without background subtraction. The fluctuation functions were estimated using detrended fluctuation analysis (DFA) with fourth-order polynomial detrending. The results presented in this study provide new insights into correlation signatures of PM and MM probe intensities and suggests the choice of DFA as a tool for qualitative assessment of Affymetrix Genechip microarrays prior to their analysis. A more detailed investigation is necessary in order to understand the source of these correlations.

  8. Discovery and mapping of single feature polymorphisms in wheat using affymetrix arrays

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Single feature polymorphisms (SFPs) can be a rich source of markers for gene mapping and function studies. To explore the feasibility of using the Affymetrix GeneChip to discover and map SFPs in the large hexaploid wheat genome, six wheat varieties of diverse origins were analyzed for significant pr...

  9. BLADE-ON-PETIOLE1 and 2 regulate Arabidopsis inflorescence architecture in conjunction with homeobox genes KNAT6 and ATH1.

    PubMed

    Khan, Madiha; Tabb, Paul; Hepworth, Shelley R

    2012-07-01

    Inflorescence architecture varies widely among flowering plants, serving to optimize the display of flowers for reproductive success. In Arabidopsis thaliana, internode elongation begins at the floral transition, generating a regular spiral arrangement of upwardly-oriented flowers on the primary stem. Post-elongation, differentiation of lignified interfascicular fibers in the stem provides mechanical support. Correct inflorescence patterning requires two interacting homeodomain transcription factors: the KNOTTED1-like protein BREVIPEDICELLUS (BP) and its BEL1-like interaction partner PENNYWISE (PNY). Mutations in BP and PNY cause short internodes, irregular spacing and/or orientation of lateral organs, and altered lignin deposition in stems. Recently, we showed that these defects are caused by the misexpression of lateral organ boundary genes, BLADE-ON-PETIOLE1 (BOP1) and BOP2, which function downstream of BP-PNY in an antagonistic fashion. BOP1/2 gain-of-function in stems promotes expression of the boundary gene KNOTTED1-LIKE FROM ARABIDOPSIS THALIANA6 (KNAT6) and shown here, ARABIDOPSIS THALIANA HOMEOBOX GENE1 (ATH1), providing KNAT6 with a BEL1-like co-factor. Our further analyses show that defects caused by BOP1/2 gain-of-function require both KNAT6 and ATH1. These data reveal how BOP1/2-dependent activation of a boundary module in stems exerts changes in inflorescence architecture.

  10. Rawcopy: Improved copy number analysis with Affymetrix arrays

    PubMed Central

    Mayrhofer, Markus; Viklund, Björn; Isaksson, Anders

    2016-01-01

    Microarray data is subject to noise and systematic variation that negatively affects the resolution of copy number analysis. We describe Rawcopy, an R package for processing of Affymetrix CytoScan HD, CytoScan 750k and SNP 6.0 microarray raw intensities (CEL files). Noise characteristics of a large number of reference samples are used to estimate log ratio and B-allele frequency for total and allele-specific copy number analysis. Rawcopy achieves better signal-to-noise ratio and higher proportion of validated alterations than commonly used free and proprietary alternatives. In addition, Rawcopy visualizes each microarray sample for assessment of technical quality, patient identity and genome-wide absolute copy number states. Software and instructions are available at http://rawcopy.org. PMID:27796336

  11. An annotation infrastructure for the analysis and interpretation of Affymetrix exon array data.

    PubMed

    Okoniewski, Michał J; Yates, Tim; Dibben, Siân; Miller, Crispin J

    2007-01-01

    Affymetrix exon arrays contain probesets intended to target every known and predicted exon in the entire genome, posing significant challenges for high-throughput genome-wide data analysis. X:MAP http://xmap.picr.man.ac.uk, an annotation database, and exonmap http://www.bioconductor.org/packages/2.0/bioc/html/exonmap.html, a BioConductor/R package, are designed to support fine-grained analysis of exon array data. The system supports the application of standard statistical techniques, prior to the use of genome scale annotation to provide gene-, transcript- and exon-level summaries and visualization tools.

  12. An annotation infrastructure for the analysis and interpretation of Affymetrix exon array data

    PubMed Central

    Okoniewski, Michał J; Yates, Tim; Dibben, Siân; Miller, Crispin J

    2007-01-01

    Affymetrix exon arrays contain probesets intended to target every known and predicted exon in the entire genome, posing significant challenges for high-throughput genome-wide data analysis. X:MAP , an annotation database, and exonmap , a BioConductor/R package, are designed to support fine-grained analysis of exon array data. The system supports the application of standard statistical techniques, prior to the use of genome scale annotation to provide gene-, transcript- and exon-level summaries and visualization tools. PMID:17498294

  13. Micro-Analyzer: automatic preprocessing of Affymetrix microarray data.

    PubMed

    Guzzi, Pietro Hiram; Cannataro, Mario

    2013-08-01

    A current trend in genomics is the investigation of the cell mechanism using different technologies, in order to explain the relationship among genes, molecular processes and diseases. For instance, the combined use of gene-expression arrays and genomic arrays has been demonstrated as an effective instrument in clinical practice. Consequently, in a single experiment different kind of microarrays may be used, resulting in the production of different types of binary data (images and textual raw data). The analysis of microarray data requires an initial preprocessing phase, that makes raw data suitable for use on existing analysis platforms, such as the TIGR M4 (TM4) Suite. An additional challenge to be faced by emerging data analysis platforms is the ability to treat in a combined way those different microarray formats coupled with clinical data. In fact, resulting integrated data may include both numerical and symbolic data (e.g. gene expression and SNPs regarding molecular data), as well as temporal data (e.g. the response to a drug, time to progression and survival rate), regarding clinical data. Raw data preprocessing is a crucial step in analysis but is often performed in a manual and error prone way using different software tools. Thus novel, platform independent, and possibly open source tools enabling the semi-automatic preprocessing and annotation of different microarray data are needed. The paper presents Micro-Analyzer (Microarray Analyzer), a cross-platform tool for the automatic normalization, summarization and annotation of Affymetrix gene expression and SNP binary data. It represents the evolution of the μ-CS tool, extending the preprocessing to SNP arrays that were not allowed in μ-CS. The Micro-Analyzer is provided as a Java standalone tool and enables users to read, preprocess and analyse binary microarray data (gene expression and SNPs) by invoking TM4 platform. It avoids: (i) the manual invocation of external tools (e.g. the Affymetrix Power

  14. Celsius: a community resource for Affymetrix microarray data.

    PubMed

    Day, Allen; Carlson, Marc R J; Dong, Jun; O'Connor, Brian D; Nelson, Stanley F

    2007-01-01

    Celsius is a data warehousing system to aggregate Affymetrix CEL files and associated metadata. It provides mechanisms for importing, storing, querying, and exporting large volumes of primary and pre-processed microarray data. Celsius contains ten billion assay measurements and affiliated metadata. It is the largest publicly available source of Affymetrix microarray data, and through sheer volume it allows a sophisticated, broad view of transcription that has not previously been possible.

  15. Microarray Data Processing Techniques for Genome-Scale Network Inference from Large Public Repositories.

    PubMed

    Chockalingam, Sriram; Aluru, Maneesha; Aluru, Srinivas

    2016-09-19

    Pre-processing of microarray data is a well-studied problem. Furthermore, all popular platforms come with their own recommended best practices for differential analysis of genes. However, for genome-scale network inference using microarray data collected from large public repositories, these methods filter out a considerable number of genes. This is primarily due to the effects of aggregating a diverse array of experiments with different technical and biological scenarios. Here we introduce a pre-processing pipeline suitable for inferring genome-scale gene networks from large microarray datasets. We show that partitioning of the available microarray datasets according to biological relevance into tissue- and process-specific categories significantly extends the limits of downstream network construction. We demonstrate the effectiveness of our pre-processing pipeline by inferring genome-scale networks for the model plant Arabidopsis thaliana using two different construction methods and a collection of 11,760 Affymetrix ATH1 microarray chips. Our pre-processing pipeline and the datasets used in this paper are made available at http://alurulab.cc.gatech.edu/microarray-pp.

  16. Arabidopsis transcriptional responses differentiating closely related chemicals (herbicides) and cross-species extrapolation to Brassica

    EPA Science Inventory

    Using whole genome Affymetrix ATH1 GeneChips we characterized the transcriptional response of Arabidopsis thaliana Columbia 24 hours after treatment with five different herbicides. Four of them (chloransulam, imazapyr, primisulfuron, sulfometuron) inhibit acetolactate synthase (A...

  17. Reverse engineering and analysis of large genome-scale gene networks.

    PubMed

    Aluru, Maneesha; Zola, Jaroslaw; Nettleton, Dan; Aluru, Srinivas

    2013-01-07

    Reverse engineering the whole-genome networks of complex multicellular organisms continues to remain a challenge. While simpler models easily scale to large number of genes and gene expression datasets, more accurate models are compute intensive limiting their scale of applicability. To enable fast and accurate reconstruction of large networks, we developed Tool for Inferring Network of Genes (TINGe), a parallel mutual information (MI)-based program. The novel features of our approach include: (i) B-spline-based formulation for linear-time computation of MI, (ii) a novel algorithm for direct permutation testing and (iii) development of parallel algorithms to reduce run-time and facilitate construction of large networks. We assess the quality of our method by comparison with ARACNe (Algorithm for the Reconstruction of Accurate Cellular Networks) and GeneNet and demonstrate its unique capability by reverse engineering the whole-genome network of Arabidopsis thaliana from 3137 Affymetrix ATH1 GeneChips in just 9 min on a 1024-core cluster. We further report on the development of a new software Gene Network Analyzer (GeNA) for extracting context-specific subnetworks from a given set of seed genes. Using TINGe and GeNA, we performed analysis of 241 Arabidopsis AraCyc 8.0 pathways, and the results are made available through the web.

  18. Exon array data analysis using Affymetrix power tools and R statistical software

    PubMed Central

    2011-01-01

    The use of microarray technology to measure gene expression on a genome-wide scale has been well established for more than a decade. Methods to process and analyse the vast quantity of expression data generated by a typical microarray experiment are similarly well-established. The Affymetrix Exon 1.0 ST array is a relatively new type of array, which has the capability to assess expression at the individual exon level. This allows a more comprehensive analysis of the transcriptome, and in particular enables the study of alternative splicing, a gene regulation mechanism important in both normal conditions and in diseases. Some aspects of exon array data analysis are shared with those for standard gene expression data but others present new challenges that have required development of novel tools. Here, I will introduce the exon array and present a detailed example tutorial for analysis of data generated using this platform. PMID:21498550

  19. Exon array data analysis using Affymetrix power tools and R statistical software.

    PubMed

    Lockstone, Helen E

    2011-11-01

    The use of microarray technology to measure gene expression on a genome-wide scale has been well established for more than a decade. Methods to process and analyse the vast quantity of expression data generated by a typical microarray experiment are similarly well-established. The Affymetrix Exon 1.0 ST array is a relatively new type of array, which has the capability to assess expression at the individual exon level. This allows a more comprehensive analysis of the transcriptome, and in particular enables the study of alternative splicing, a gene regulation mechanism important in both normal conditions and in diseases. Some aspects of exon array data analysis are shared with those for standard gene expression data but others present new challenges that have required development of novel tools. Here, I will introduce the exon array and present a detailed example tutorial for analysis of data generated using this platform.

  20. Genetic and genomic analysis of Rhizoctonia solani interactions with Arabidopsis; evidence of resistance mediated through NADPH oxidases.

    PubMed

    Foley, Rhonda C; Gleason, Cynthia A; Anderson, Jonathan P; Hamann, Thorsten; Singh, Karam B

    2013-01-01

    Rhizoctonia solani is an important soil-borne necrotrophic fungal pathogen, with a broad host range and little effective resistance in crop plants. Arabidopsis is resistant to R. solani AG8 but susceptible to R. solani AG2-1. A screen of 36 Arabidopsis ecotypes and mutants affected in the auxin, camalexin, salicylic acid, abscisic acid and ethylene/jasmonic acid pathways did not reveal any variation in response to R. solani and demonstrated that resistance to AG8 was independent of these defense pathways. The Arabidopsis Affymetrix ATH1 Genome array was used to assess global gene expression changes in plants infected with AG8 and AG2-1 at seven days post-infection. While there was considerable overlap in the response, some gene families were differentially affected by AG8 or AG2-1 and included those involved in oxidative stress, cell wall associated proteins, transcription factors and heat shock protein genes. Since a substantial proportion of the gene expression changes were associated with oxidative stress responses, we analysed the role of NADPH oxidases in resistance. While single NADPH oxidase mutants had no effect, a NADPH oxidase double mutant atrbohf atrbohd resulted in an almost complete loss of resistance to AG8, suggesting that reactive oxidative species play an important role in Arabidopsis's resistance to R. solani.

  1. CEL_INTERROGATOR: A FREE AND OPEN SOURCE PACKAGE FOR AFFYMETRIX CEL FILE PARSING

    Technology Transfer Automated Retrieval System (TEKTRAN)

    CEL_Interrogator Package is a suite of programs designed to extract the average probe intensity and other information for each probe sequence from an Affymetrix GeneChip CEL file and unite them with their human-readable Affymetrix consensus sequence names. The resulting text file is suitable for di...

  2. High correspondence between Affymetrix exon and standard expression arrays.

    PubMed

    Okoniewski, Michał J; Hey, Yvonne; Pepper, Stuart D; Miller, Crispin J

    2007-02-01

    Exon arrays aim to provide comprehensive gene expression data at the level of individual exons, similar to that provided on a per-gene basis by existing expression arrays. This report describes the performance of Affymetrix GeneChip Human Exon 1.0 ST array by using replicated RNA samples from two human cell lines, MCF7 and MCF10A, hybridized both to Exon 1.0 ST and to HG-U133 Plus2 arrays. Cross-comparison between array types requires an appropriate mapping to be found between individual probe sets. Three possible mappings were considered, reflecting different strategies for dealing with probe sets that target different parts of the same transcript. Irrespective of the mapping used, Exon 1.0 ST and HG-U133 Plus2 arrays show a high degree of correspondence. More than 80% of HG-U133 Plus2 probe sets may be mapped to the Exon chip, and fold changes are found well preserved for over 96% of those probe sets detected present. Since HG-U133 Plus2 arrays have already been extensively validated, these results lend a significant degree of confidence to exon arrays.

  3. SFP Genotyping from Affymetrix Arrays is Robust but Largely Detects Cis-acting Expression Regulators

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The recent development of Affymetrix chips designed from assembled EST sequences has spawned considerable interest in identifying single-feature polymorphisms (SFPs) from transcriptome data. SFPs are valuable genetic markers that potentially offer a physical link to the structural genes themselves....

  4. Genome-Wide Analysis of Hydrogen Peroxide-Regulated Gene Expression in Arabidopsis Reveals a High Light-Induced Transcriptional Cluster Involved in Anthocyanin Biosynthesis1[w

    PubMed Central

    Vanderauwera, Sandy; Zimmermann, Philip; Rombauts, Stéphane; Vandenabeele, Steven; Langebartels, Christian; Gruissem, Wilhelm; Inzé, Dirk; Van Breusegem, Frank

    2005-01-01

    In plants, reactive oxygen species and, more particularly, hydrogen peroxide (H2O2) play a dual role as toxic by-products of normal cell metabolism and as regulatory molecules in stress perception and signal transduction. Peroxisomal catalases are an important sink for photorespiratory H2O2. Using ATH1 Affymetrix microarrays, expression profiles were compared between control and catalase-deficient Arabidopsis (Arabidopsis thaliana) plants. Reduced catalase levels already provoked differences in nuclear gene expression under ambient growth conditions, and these effects were amplified by high light exposure in a sun simulator for 3 and 8 h. This genome-wide expression analysis allowed us to reveal the expression characteristics of complete pathways and functional categories during H2O2 stress. In total, 349 transcripts were significantly up-regulated by high light in catalase-deficient plants and 88 were down-regulated. From this data set, H2O2 was inferred to play a key role in the transcriptional up-regulation of small heat shock proteins during high light stress. In addition, several transcription factors and candidate regulatory genes involved in H2O2 transcriptional gene networks were identified. Comparisons with other publicly available transcriptome data sets of abiotically stressed Arabidopsis revealed an important intersection with H2O2-deregulated genes, positioning elevated H2O2 levels as an important signal within abiotic stress-induced gene expression. Finally, analysis of transcriptional changes in a combination of a genetic (catalase deficiency) and an environmental (high light) perturbation identified a transcriptional cluster that was strongly and rapidly induced by high light in control plants, but impaired in catalase-deficient plants. This cluster comprises the complete known anthocyanin regulatory and biosynthetic pathway, together with genes encoding unknown proteins. PMID:16183842

  5. Whole genome analysis of gene expression reveals coordinated activation of signaling and metabolic pathways during pollen-pistil interactions in Arabidopsis.

    PubMed

    Boavida, Leonor C; Borges, Filipe; Becker, Jörg D; Feijó, José A

    2011-04-01

    Plant reproduction depends on the concerted activation of many genes to ensure correct communication between pollen and pistil. Here, we queried the whole transcriptome of Arabidopsis (Arabidopsis thaliana) in order to identify genes with specific reproductive functions. We used the Affymetrix ATH1 whole genome array to profile wild-type unpollinated pistils and unfertilized ovules. By comparing the expression profile of pistils at 0.5, 3.5, and 8.0 h after pollination and applying a number of statistical and bioinformatics criteria, we found 1,373 genes differentially regulated during pollen-pistil interactions. Robust clustering analysis grouped these genes in 16 time-course clusters representing distinct patterns of regulation. Coregulation within each cluster suggests the presence of distinct genetic pathways, which might be under the control of specific transcriptional regulators. A total of 78% of the regulated genes were expressed initially in unpollinated pistil and/or ovules, 15% were initially detected in the pollen data sets as enriched or preferentially expressed, and 7% were induced upon pollination. Among those, we found a particular enrichment for unknown transcripts predicted to encode secreted proteins or representing signaling and cell wall-related proteins, which may function by remodeling the extracellular matrix or as extracellular signaling molecules. A strict regulatory control in various metabolic pathways suggests that fine-tuning of the biochemical and physiological cellular environment is crucial for reproductive success. Our study provides a unique and detailed temporal and spatial gene expression profile of in vivo pollen-pistil interactions, providing a framework to better understand the basis of the molecular mechanisms operating during the reproductive process in higher plants.

  6. An orthologous transcriptional signature differentiates responses towards closely related chemicals in Arabidopsis thaliana and brassica napus

    EPA Science Inventory

    Herbicides are structurally diverse chemicals that inhibit plant-specific targets, however their off-target and potentially differentiating side-effects are less well defined. In this study, genome-wide expression profiling based on Affymetrix AtH1 arrays was used to identify dis...

  7. A composite transcriptional signature differentiates responses towards closely related herbicides in Arabidopsis thaliana and brassica napus

    EPA Science Inventory

    In this study, genome-wide expression profiling based on Affymetrix ATH1 arrays was used to identify discriminating responses of Arabidopsis thaliana to five herbicides, which contain active ingredients targeting two different branches of amino acid biosynthesis. One herbicide co...

  8. Improvements to previous algorithms to predict gene structure and isoform concentrations using Affymetrix Exon arrays

    PubMed Central

    2010-01-01

    Background Exon arrays provide a way to measure the expression of different isoforms of genes in an organism. Most of the procedures to deal with these arrays are focused on gene expression or on exon expression. Although the only biological analytes that can be properly assigned a concentration are transcripts, there are very few algorithms that focus on them. The reason is that previously developed summarization methods do not work well if applied to transcripts. In addition, gene structure prediction, i.e., the correspondence between probes and novel isoforms, is a field which is still unexplored. Results We have modified and adapted a previous algorithm to take advantage of the special characteristics of the Affymetrix exon arrays. The structure and concentration of transcripts -some of them possibly unknown- in microarray experiments were predicted using this algorithm. Simulations showed that the suggested modifications improved both specificity (SP) and sensitivity (ST) of the predictions. The algorithm was also applied to different real datasets showing its effectiveness and the concordance with PCR validated results. Conclusions The proposed algorithm shows a substantial improvement in the performance over the previous version. This improvement is mainly due to the exploitation of the redundancy of the Affymetrix exon arrays. An R-Package of SPACE with the updated algorithms have been developed and is freely available. PMID:21110835

  9. Understanding the physics of oligonucleotide microarrays: the Affymetrix spike-in data reanalysed

    NASA Astrophysics Data System (ADS)

    Burden, Conrad J.

    2008-03-01

    The Affymetrix U95 and U133 Latin-Square spike-in datasets are reanalysed, together with a dataset from a version of the U95 spike-in experiment without a complex non-specific background. The approach uses a physico-chemical model which includes the effects of the specific and non-specific hybridization and probe folding at the microarray surface, target folding and hybridization in the bulk RNA target solution and duplex dissociation during the post-hybridization washing phase. The model predicts a three-parameter hyperbolic response function that fits well with fluorescence intensity data from all the three datasets. The importance of the various hybridization and washing effects in determining each of the three parameters is examined, and some guidance is given as to how a practical algorithm for determining specific target concentrations might be developed.

  10. MAAMD: a workflow to standardize meta-analyses and comparison of affymetrix microarray data

    PubMed Central

    2014-01-01

    Background Mandatory deposit of raw microarray data files for public access, prior to study publication, provides significant opportunities to conduct new bioinformatics analyses within and across multiple datasets. Analysis of raw microarray data files (e.g. Affymetrix CEL files) can be time consuming, complex, and requires fundamental computational and bioinformatics skills. The development of analytical workflows to automate these tasks simplifies the processing of, improves the efficiency of, and serves to standardize multiple and sequential analyses. Once installed, workflows facilitate the tedious steps required to run rapid intra- and inter-dataset comparisons. Results We developed a workflow to facilitate and standardize Meta-Analysis of Affymetrix Microarray Data analysis (MAAMD) in Kepler. Two freely available stand-alone software tools, R and AltAnalyze were embedded in MAAMD. The inputs of MAAMD are user-editable csv files, which contain sample information and parameters describing the locations of input files and required tools. MAAMD was tested by analyzing 4 different GEO datasets from mice and drosophila. MAAMD automates data downloading, data organization, data quality control assesment, differential gene expression analysis, clustering analysis, pathway visualization, gene-set enrichment analysis, and cross-species orthologous-gene comparisons. MAAMD was utilized to identify gene orthologues responding to hypoxia or hyperoxia in both mice and drosophila. The entire set of analyses for 4 datasets (34 total microarrays) finished in ~ one hour. Conclusions MAAMD saves time, minimizes the required computer skills, and offers a standardized procedure for users to analyze microarray datasets and make new intra- and inter-dataset comparisons. PMID:24621103

  11. A Single-Array-Based Method for Detecting Copy Number Variants Using Affymetrix High Density SNP Arrays and its Application to Breast Cancer

    PubMed Central

    Li, Ming; Wen, Yalu; Fu, Wenjiang

    2014-01-01

    Cumulative evidence has shown that structural variations, due to insertions, deletions, and inversions of DNA, may contribute considerably to the development of complex human diseases, such as breast cancer. High-throughput genotyping technologies, such as Affymetrix high density single-nucleotide polymorphism (SNP) arrays, have produced large amounts of genetic data for genome-wide SNP genotype calling and copy number estimation. Meanwhile, there is a great need for accurate and efficient statistical methods to detect copy number variants. In this article, we introduce a hidden-Markov-model (HMM)-based method, referred to as the PICR-CNV, for copy number inference. The proposed method first estimates copy number abundance for each single SNP on a single array based on the raw fluorescence values, and then standardizes the estimated copy number abundance to achieve equal footing among multiple arrays. This method requires no between-array normalization, and thus, maintains data integrity and independence of samples among individual subjects. In addition to our efforts to apply new statistical technology to raw fluorescence values, the HMM has been applied to the standardized copy number abundance in order to reduce experimental noise. Through simulations, we show our refined method is able to infer copy number variants accurately. Application of the proposed method to a breast cancer dataset helps to identify genomic regions significantly associated with the disease. PMID:26279618

  12. Identifying the impact of G-quadruplexes on Affymetrix 3' arrays using cloud computing.

    PubMed

    Memon, Farhat N; Owen, Anne M; Sanchez-Graillet, Olivia; Upton, Graham J G; Harrison, Andrew P

    2010-01-15

    A tetramer quadruplex structure is formed by four parallel strands of DNA/ RNA containing runs of guanine. These quadruplexes are able to form because guanine can Hoogsteen hydrogen bond to other guanines, and a tetrad of guanines can form a stable arrangement. Recently we have discovered that probes on Affymetrix GeneChips that contain runs of guanine do not measure gene expression reliably. We associate this finding with the likelihood that quadruplexes are forming on the surface of GeneChips. In order to cope with the rapidly expanding size of GeneChip array datasets in the public domain, we are exploring the use of cloud computing to replicate our experiments on 3' arrays to look at the effect of the location of G-spots (runs of guanines). Cloud computing is a recently introduced high-performance solution that takes advantage of the computational infrastructure of large organisations such as Amazon and Google. We expect that cloud computing will become widely adopted because it enables bioinformaticians to avoid capital expenditure on expensive computing resources and to only pay a cloud computing provider for what is used. Moreover, as well as financial efficiency, cloud computing is an ecologically-friendly technology, it enables efficient data-sharing and we expect it to be faster for development purposes. Here we propose the advantageous use of cloud computing to perform a large data-mining analysis of public domain 3' arrays.

  13. The Affymetrix DMET Plus Platform Reveals Unique Distribution of ADME-Related Variants in Ethnic Arabs

    PubMed Central

    Wakil, Salma M.; Nguyen, Cao; Muiya, Nzioka P.; Andres, Editha; Lykowska-Tarnowska, Agnieszka; Baz, Batoul; Meyer, Brian F.; Morahan, Grant

    2015-01-01

    Background. The Affymetrix Drug Metabolizing Enzymes and Transporters (DMET) Plus Premier Pack has been designed to genotype 1936 gene variants thought to be essential for screening patients in personalized drug therapy. These variants include the cytochrome P450s (CYP450s), the key metabolizing enzymes, many other enzymes involved in phase I and phase II pharmacokinetic reactions, and signaling mediators associated with variability in clinical response to numerous drugs not only among individuals, but also between ethnic populations. Materials and Methods. We genotyped 600 Saudi individuals for 1936 variants on the DMET platform to evaluate their clinical potential in personalized medicine in ethnic Arabs. Results. Approximately 49% each of the 437 CYP450 variants, 56% of the 581 transporters, 56% of 419 transferases, 48% of the 104 dehydrogenases, and 58% of the remaining 390 variants were detected. Several variants, such as rs3740071, rs6193, rs258751, rs6199, rs11568421, and rs8187797, exhibited significantly either higher or lower minor allele frequencies (MAFs) than those in other ethnic groups. Discussion. The present study revealed some unique distribution trends for several variants in Arabs, which displayed partly inverse allelic prevalence compared to other ethnic populations. The results point therefore to the need to verify and ascertain the prevalence of a variant as a prerequisite for engaging it in clinical routine screening in personalized medicine in any given population. PMID:25802476

  14. A sequence-based identification of the genes detected by probesets on the Affymetrix U133 plus 2.0 array.

    PubMed

    Harbig, Jeremy; Sprinkle, Robert; Enkemann, Steven A

    2005-02-18

    One of the biggest problems facing microarray experiments is the difficulty of translating results into other microarray formats or comparing microarray results to other biochemical methods. We believe that this is largely the result of poor gene identification. We re-identified the probesets on the Affymetrix U133 plus 2.0 GeneChip array. This identification was based on the sequence of the probes and the sequence of the human genome. Using the BLAST program, we matched probes with documented and postulated human transcripts. This resulted in the redefinition of approximately 37% of the probes on the U133 plus 2.0 array. This updated identification specifically points out where the identification is complicated by cross-hybridization from splice variants or closely related genes. More than 5000 probesets detect multiple transcripts and therefore the exact protein affected cannot be readily concluded from the performance of one probeset alone. This makes naming difficult and impacts any downstream analysis such as associating gene ontologies, mapping affected pathways or simply validating expression changes. We have now automated the sequence-based identification and can more appropriately annotate any array where the sequence on each spot is known.

  15. Evaluating the performance of Affymetrix SNP Array 6.0 platform with 400 Japanese individuals

    PubMed Central

    Nishida, Nao; Koike, Asako; Tajima, Atsushi; Ogasawara, Yuko; Ishibashi, Yoshimi; Uehara, Yasuka; Inoue, Ituro; Tokunaga, Katsushi

    2008-01-01

    Background With improvements in genotyping technologies, genome-wide association studies with hundreds of thousands of SNPs allow the identification of candidate genetic loci for multifactorial diseases in different populations. However, genotyping errors caused by genotyping platforms or genotype calling algorithms may lead to inflation of false associations between markers and phenotypes. In addition, the number of SNPs available for genome-wide association studies in the Japanese population has been investigated using only 45 samples in the HapMap project, which could lead to an inaccurate estimation of the number of SNPs with low minor allele frequencies. We genotyped 400 Japanese samples in order to estimate the number of SNPs available for genome-wide association studies in the Japanese population and to examine the performance of the current SNP Array 6.0 platform and the genotype calling algorithm "Birdseed". Results About 20% of the 909,622 SNP markers on the array were revealed to be monomorphic in the Japanese population. Consequently, 661,599 SNPs were available for genome-wide association studies in the Japanese population, after excluding the poorly behaving SNPs. The Birdseed algorithm accurately determined the genotype calls of each sample with a high overall call rate of over 99.5% and a high concordance rate of over 99.8% using more than 48 samples after removing low-quality samples by adjusting QC criteria. Conclusion Our results confirmed that the SNP Array 6.0 platform reached the level reported by the manufacturer, and thus genome-wide association studies using the SNP Array 6.0 platform have considerable potential to identify candidate susceptibility or resistance genetic factors for multifactorial diseases in the Japanese population, as well as in other populations. PMID:18803882

  16. Affymetrix Whole-Transcript Human Gene 1.0 ST array is highly concordant with standard 3' expression arrays.

    PubMed

    Pradervand, Sylvain; Paillusson, Alexandra; Thomas, Jérôme; Weber, Johann; Wirapati, Pratyaksha; Hagenbüchle, Otto; Harshman, Keith

    2008-05-01

    The recently released Affymetrix Human Gene 1.0 ST array has two major differences compared with standard 3' based arrays: (i) it interrogates the entire mRNA transcript, and (ii) it uses DNA targets. To assess the impact of these differences on array performance, we performed a series of comparative hybridizations between the Human Gene 1.0 ST and the Affymetrix HG-U133 Plus 2.0 and the Illumina HumanRef-8 BeadChip arrays. Additionally, both RNA and DNA targets were hybridized on HG-U133 Plus 2.0 arrays. The results show that the overall reproducibility of the Gene 1.0 ST array is best. When looking only at the high intensity probes, the reproducibility of the Gene 1.0 ST array and the Illumina BeadChip array is equally good. Concordance of array results was assessed using different inter-platform mappings. Agreements are best between the two labeling protocols using HG-U133 Plus 2.0 array. The Gene 1.0 ST array is most concordant with the HG-U133 array hybridized with cDNA targets. This may reflect the impact of the target type. Overall, the high degree of correspondence provides strong evidence for the reliability of the Gene 1.0 ST array.

  17. Mining Affymetrix microarray data for long non-coding RNAs: altered expression in the nucleus accumbens of heroin abusers.

    PubMed

    Michelhaugh, Sharon K; Lipovich, Leonard; Blythe, Jason; Jia, Hui; Kapatos, Gregory; Bannon, Michael J

    2011-02-01

    Although recent data suggest that some long non-coding RNAs (lncRNAs) exert widespread effects on gene expression and organelle formation, lncRNAs as a group constitute a sizable but poorly characterized fraction of the human transcriptome. We investigated whether some human lncRNA sequences were fortuitously represented on commonly used microarrays, then used this annotation to assess lncRNA expression in human brain. A computational and annotation pipeline was developed to identify lncRNA transcripts represented on Affymetrix U133 arrays. A previously published dataset derived from human nucleus accumbens was then examined for potential lncRNA expression. Twenty-three lncRNAs were determined to be represented on U133 arrays. Of these, dataset analysis revealed that five lncRNAs were consistently detected in samples of human nucleus accumbens. Strikingly, the abundance of these lncRNAs was up-regulated in human heroin abusers compared to matched drug-free control subjects, a finding confirmed by quantitative PCR. This study presents a paradigm for examining existing Affymetrix datasets for the detection and potential regulation of lncRNA expression, including changes associated with human disease. The finding that all detected lncRNAs were up-regulated in heroin abusers is consonant with the proposed role of lncRNAs as mediators of widespread changes in gene expression as occur in drug abuse.

  18. Genome-Wide Association Mapping for Intelligence in Military Working Dogs: Canine Cohort, Canine Intelligence Assessment Regimen, Genome-Wide Single Nucleotide Polymorphism (SNP) Typing, and Unsupervised Classification Algorithm for Genome-Wide Association Data Analysis

    DTIC Science & Technology

    2011-09-01

    were down-selected and successfully genotyped for whole genome (WG) single nucleotide polymorphism (SNP) markers by means of the Affymetrix Canine...SUBJECT TERMS Military working dog genome-wide association study genetic marker intelligence... marker , intelligence, Canine Intelligence Testing Protocol, classification technique, clustering analysis Technical Report: September 2011 2

  19. EzArray: A web-based highly automated Affymetrix expression array data management and analysis system

    PubMed Central

    Zhu, Yuerong; Zhu, Yuelin; Xu, Wei

    2008-01-01

    Background Though microarray experiments are very popular in life science research, managing and analyzing microarray data are still challenging tasks for many biologists. Most microarray programs require users to have sophisticated knowledge of mathematics, statistics and computer skills for usage. With accumulating microarray data deposited in public databases, easy-to-use programs to re-analyze previously published microarray data are in high demand. Results EzArray is a web-based Affymetrix expression array data management and analysis system for researchers who need to organize microarray data efficiently and get data analyzed instantly. EzArray organizes microarray data into projects that can be analyzed online with predefined or custom procedures. EzArray performs data preprocessing and detection of differentially expressed genes with statistical methods. All analysis procedures are optimized and highly automated so that even novice users with limited pre-knowledge of microarray data analysis can complete initial analysis quickly. Since all input files, analysis parameters, and executed scripts can be downloaded, EzArray provides maximum reproducibility for each analysis. In addition, EzArray integrates with Gene Expression Omnibus (GEO) and allows instantaneous re-analysis of published array data. Conclusion EzArray is a novel Affymetrix expression array data analysis and sharing system. EzArray provides easy-to-use tools for re-analyzing published microarray data and will help both novice and experienced users perform initial analysis of their microarray data from the location of data storage. We believe EzArray will be a useful system for facilities with microarray services and laboratories with multiple members involved in microarray data analysis. EzArray is freely available from . PMID:18218103

  20. FULL-GENOME ANALYSIS OF ALTERNATIVE SPLICING IN MOUSE LIVER AFTER HEPATOTOXICANT EXPOSURE

    EPA Science Inventory

    Alternative splicing plays a role in determining gene function and protein diversity. We have employed whole genome exon profiling using Affymetrix Mouse Exon 1.0 ST arrays to understand the significance of alternative splicing on a genome-wide scale in response to multiple toxic...

  1. Acquisition of biologically relevant gene expression data by Affymetrix microarray analysis of archival formalin-fixed paraffin-embedded tumours

    PubMed Central

    Linton, K M; Hey, Y; Saunders, E; Jeziorska, M; Denton, J; Wilson, C L; Swindell, R; Dibben, S; Miller, C J; Pepper, S D; Radford, J A; Freemont, A J

    2008-01-01

    Robust protocols for microarray gene expression profiling of archival formalin-fixed paraffin-embedded tissue (FFPET) are needed to facilitate research when availability of fresh-frozen tissue is limited. Recent reports attest to the feasibility of this approach, but the clinical value of these data is poorly understood. We employed state-of-the-art RNA extraction and Affymetrix microarray technology to examine 34 archival FFPET primary extremity soft tissue sarcomas. Nineteen arrays met stringent QC criteria and were used to model prognostic signatures for metastatic recurrence. Arrays from two paired frozen and FFPET samples were compared: although FFPET sensitivity was low (∼50%), high specificity (95%) and positive predictive value (92%) suggest that transcript detection is reliable. Good agreement between arrays and real time (RT)–PCR was confirmed, especially for abundant transcripts, and RT–PCR validated the regulation pattern for 19 of 24 candidate genes (overall R2=0.4662). RT–PCR and immunohistochemistry on independent cases validated prognostic significance for several genes including RECQL4, FRRS1, CFH and MET – whose combined expression carried greater prognostic value than tumour grade – and cmet and TRKB proteins. These molecules warrant further evaluation in larger series. Reliable clinically relevant data can be obtained from archival FFPET, but protocol amendments are needed to improve the sensitivity and broad application of this approach. PMID:18382428

  2. Acquisition of biologically relevant gene expression data by Affymetrix microarray analysis of archival formalin-fixed paraffin-embedded tumours.

    PubMed

    Linton, K M; Hey, Y; Saunders, E; Jeziorska, M; Denton, J; Wilson, C L; Swindell, R; Dibben, S; Miller, C J; Pepper, S D; Radford, J A; Freemont, A J

    2008-04-22

    Robust protocols for microarray gene expression profiling of archival formalin-fixed paraffin-embedded tissue (FFPET) are needed to facilitate research when availability of fresh-frozen tissue is limited. Recent reports attest to the feasibility of this approach, but the clinical value of these data is poorly understood. We employed state-of-the-art RNA extraction and Affymetrix microarray technology to examine 34 archival FFPET primary extremity soft tissue sarcomas. Nineteen arrays met stringent QC criteria and were used to model prognostic signatures for metastatic recurrence. Arrays from two paired frozen and FFPET samples were compared: although FFPET sensitivity was low ( approximately 50%), high specificity (95%) and positive predictive value (92%) suggest that transcript detection is reliable. Good agreement between arrays and real time (RT)-PCR was confirmed, especially for abundant transcripts, and RT-PCR validated the regulation pattern for 19 of 24 candidate genes (overall R(2)=0.4662). RT-PCR and immunohistochemistry on independent cases validated prognostic significance for several genes including RECQL4, FRRS1, CFH and MET - whose combined expression carried greater prognostic value than tumour grade - and cmet and TRKB proteins. These molecules warrant further evaluation in larger series. Reliable clinically relevant data can be obtained from archival FFPET, but protocol amendments are needed to improve the sensitivity and broad application of this approach.

  3. A Microarray Analysis for Differential Gene Expression in the Soybean Genome Using Bioconductor and R

    Technology Transfer Automated Retrieval System (TEKTRAN)

    This paper describes specific procedures for conducting quality assessment of Affymetrix GeneChip® soybean genome data and performing analyses to determine differential gene expression using the open-source R language and environment in conjunction with the open-source Bioconductor package. Procedu...

  4. Methods comparison for high-resolution transcriptional analysis of archival material on Affymetrix Plus 2.0 and Exon 1.0 microarrays.

    PubMed

    Linton, Kim; Hey, Yvonne; Dibben, Sian; Miller, Crispin; Freemont, Anthony; Radford, John; Pepper, Stuart

    2009-07-01

    Microarray gene expression profiling of formalin-fixed paraffin-embedded (FFPE) tissues is a new and evolving technique. This report compares transcript detection rates on Affymetrix U133 Plus 2.0 and Human Exon 1.0 ST GeneChips across several RNA extraction and target labeling protocols, using routinely collected archival FFPE samples. All RNA extraction protocols tested (Ambion-Optimum, Ambion-RecoverAll, and Qiagen-RNeasy FFPE) provided extracts suitable for microarray hybridization. Compared with Affymetrix One-Cycle labeled extracts, NuGEN system protocols utilizing oligo(dT) and random hexamer primers, and cDNA target preparations instead of cRNA, achieved percent present rates up to 55% on Plus 2.0 arrays. Based on two paired-sample analyses, at 90% specificity this equalled an average 30 percentage-point increase (from 50% to 80%) in FFPE transcript sensitivity relative to fresh frozen tissues, which we have assumed to have 100% sensitivity and specificity. The high content of Exon arrays, with multiple probe sets per exon, improved FFPE sensitivity to 92% at 96% specificity, corresponding to an absolute increase of ~600 genes over Plus 2.0 arrays. While larger series are needed to confirm high correspondence between fresh-frozen and FFPE expression patterns, these data suggest that both Plus 2.0 and Exon arrays are suitable platforms for FFPE microarray expression analyses.

  5. Computational Integration of Structural and Functional Genomics Data Across Species to Develop Information on Porcine Inflammatory Gene Regulatory Pathway

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Comparative integration of structural and functional genomic data across species holds great promise in finding genes controlling disease resistance. We are investigating the porcine gut immune response to infection through gene expression profiling. We have collected porcine Affymetrix GeneChip da...

  6. Gene Expression Analysis of Cultured Rat-Endothelial Cells after Nd:YAG Laser Irradiation by Affymetrix GeneChip Array

    PubMed Central

    MASUDA, YOSHIKO; YOKOSE, SATOSHI; SAKAGAMI, HIROSHI

    2017-01-01

    Endothelial cells and dental pulp cells enhance osteo-/odontogenic and angiogenic differentiation. In our previous study, rat pulp cells migrated to Nd:YAG laser-irradiated endothelial cells in an insert cell culture system. The purpose of this study was to examine the possible changes in the gene expression of cultured rat aortic endothelial cells after Nd:YAG laser irradiation using affymetrix GeneChip Array. Total RNA was extracted from the cells at 5 h after laser irradiation. Gene expressions were evaluated by DNA array chip. Up-regulated genes were related to cell migration and cell structure (membrane stretch, actin regulation and junctional complexes), neurotransmission and inflammation. Heat-shock 70 kDa protein (Hsp70) was related to the development of tooth germ. This study offers candidate genes for understanding the relationship between the laser-stimulated endothelial cells and dental pulp cells. PMID:28064220

  7. Comparative transcriptomic profiling of Vitis vinifera under high light using a custom-made array and the Affymetrix GeneChip.

    PubMed

    Carvalho, Luísa C; Vilela, Belmiro J; Mullineaux, Phil M; Amâncio, Sara

    2011-11-01

    Understanding abiotic stress responses is one of the most important issues in plant research nowadays. Abiotic stress, including excess light, can promote the onset of oxidative stress through the accumulation of reactive oxygen species. Oxidative stress also arises when in vitro propagated plants are exposed to high light upon transfer to ex vitro. To determine whether the underlying pathways activated at the transfer of in vitro grapevine to ex vitro conditions reflect the processes occurring upon light stress, we used Vitis vinifera Affymetrix GeneChip (VvGA) and a custom array of genes responsive to light stress (LSCA) detected by real-time reverse transcriptase PCR (qRT-PCR). When gene-expression profiles were compared, 'protein metabolism and modification', 'signaling', and 'anti-oxidative' genes were more represented in LSCA, while, in VvGA, 'cell wall metabolism' and 'secondary metabolism' were the categories in which gene expression varied more significantly. The above functional categories confirm previous studies involving other types of abiotic stresses, enhancing the common attributes of abiotic stress defense pathways. The LSCA analysis of our experimental system detected strong response of heat shock genes, particularly the protein rescuing mechanism involving the cooperation of two ATP-dependent chaperone systems, Hsp100 and Hsp70, which showed an unusually late response during the recovery period, of extreme relevance to remove non-functional, potentially harmful polypeptides arising from misfolding, denaturation, or aggregation brought about by stress. The success of LSCA also proves the feasibility of a custom-made qRT-PCR approach, particularly for species for which no GeneChip is available and for researchers dealing with a specific and focused problem.

  8. BEAT: Bioinformatics Exon Array Tool to store, analyze and visualize Affymetrix GeneChip Human Exon Array data from disease experiments

    PubMed Central

    2012-01-01

    Background It is known from recent studies that more than 90% of human multi-exon genes are subject to Alternative Splicing (AS), a key molecular mechanism in which multiple transcripts may be generated from a single gene. It is widely recognized that a breakdown in AS mechanisms plays an important role in cellular differentiation and pathologies. Polymerase Chain Reactions, microarrays and sequencing technologies have been applied to the study of transcript diversity arising from alternative expression. Last generation Affymetrix GeneChip Human Exon 1.0 ST Arrays offer a more detailed view of the gene expression profile providing information on the AS patterns. The exon array technology, with more than five million data points, can detect approximately one million exons, and it allows performing analyses at both gene and exon level. In this paper we describe BEAT, an integrated user-friendly bioinformatics framework to store, analyze and visualize exon arrays datasets. It combines a data warehouse approach with some rigorous statistical methods for assessing the AS of genes involved in diseases. Meta statistics are proposed as a novel approach to explore the analysis results. BEAT is available at http://beat.ba.itb.cnr.it. Results BEAT is a web tool which allows uploading and analyzing exon array datasets using standard statistical methods and an easy-to-use graphical web front-end. BEAT has been tested on a dataset with 173 samples and tuned using new datasets of exon array experiments from 28 colorectal cancer and 26 renal cell cancer samples produced at the Medical Genetics Unit of IRCCS Casa Sollievo della Sofferenza. To highlight all possible AS events, alternative names, accession Ids, Gene Ontology terms and biochemical pathways annotations are integrated with exon and gene level expression plots. The user can customize the results choosing custom thresholds for the statistical parameters and exploiting the available clinical data of the samples for a

  9. Analysis of copy number variations in Mexican Holstein cattle using axiom genome-wide Bos 1 array

    PubMed Central

    Salomon-Torres, Ricardo; Villa-Angulo, Rafael; Villa-Angulo, Carlos

    2015-01-01

    Recently, for copy number variation (CNV) analysis, bovine researchers have focused mainly on the use of genome-wide SNP genotyping arrays. One of the highest densities commercially available SNPchips for cattle is the Affymetrix axiom genome-wide Bos 1, which assays 648,315 informative SNPs across the whole bovine genome. Here, we describe the microarray data, quality controls and validation implemented in a study published in Genetics and Molecular Research Journal in 2015 [1]. The microarray raw data has been deposited into Gene Expression Omnibus under accession #GSE54813. PMID:26981375

  10. Genome-wide analysis links NFATC2 with asparaginase hypersensitivity

    PubMed Central

    Fernandez, Christian A.; Smith, Colton; Yang, Wenjian; Mullighan, Charles G.; Qu, Chunxu; Larsen, Eric; Bowman, W. Paul; Liu, Chengcheng; Ramsey, Laura B.; Chang, Tamara; Karol, Seth E.; Loh, Mignon L.; Raetz, Elizabeth A.; Winick, Naomi J.; Hunger, Stephen P.; Carroll, William L.; Jeha, Sima; Pui, Ching-Hon; Evans, William E.; Devidas, Meenakshi

    2015-01-01

    Asparaginase is used to treat acute lymphoblastic leukemia (ALL); however, hypersensitivity reactions can lead to suboptimal asparaginase exposure. Our objective was to use a genome-wide approach to identify loci associated with asparaginase hypersensitivity in children with ALL enrolled on St. Jude Children’s Research Hospital (SJCRH) protocols Total XIIIA (n = 154), Total XV (n = 498), and Total XVI (n = 271), or Children’s Oncology Group protocols POG 9906 (n = 222) and AALL0232 (n = 2163). Germline DNA was genotyped using the Affymetrix 500K, Affymetrix 6.0, or the Illumina Exome BeadChip array. In multivariate logistic regression, the intronic rs6021191 variant in nuclear factor of activated T cells 2 (NFATC2) had the strongest association with hypersensitivity (P = 4.1 × 10−8; odds ratio [OR] = 3.11). RNA-seq data available from 65 SJCRH ALL tumor samples and 52 Yoruba HapMap samples showed that samples carrying the rs6021191 variant had higher NFATC2 expression compared with noncarriers (P = 1.1 × 10−3 and 0.03, respectively). The top ranked nonsynonymous polymorphism was rs17885382 in HLA-DRB1 (P = 3.2 × 10−6; OR = 1.63), which is in near complete linkage disequilibrium with the HLA-DRB1*07:01 allele we previously observed in a candidate gene study. The strongest risk factors for asparaginase allergy are variants within genes regulating the immune response. PMID:25987655

  11. An experimental validation of genomic selection in octoploid strawberry

    PubMed Central

    Gezan, Salvador A; Osorio, Luis F; Verma, Sujeet; Whitaker, Vance M

    2017-01-01

    The primary goal of genomic selection is to increase genetic gains for complex traits by predicting performance of individuals for which phenotypic data are not available. The objective of this study was to experimentally evaluate the potential of genomic selection in strawberry breeding and to define a strategy for its implementation. Four clonally replicated field trials, two in each of 2 years comprised of a total of 1628 individuals, were established in 2013–2014 and 2014–2015. Five complex yield and fruit quality traits with moderate to low heritability were assessed in each trial. High-density genotyping was performed with the Affymetrix Axiom IStraw90 single-nucleotide polymorphism array, and 17 479 polymorphic markers were chosen for analysis. Several methods were compared, including Genomic BLUP, Bayes B, Bayes C, Bayesian LASSO Regression, Bayesian Ridge Regression and Reproducing Kernel Hilbert Spaces. Cross-validation within training populations resulted in higher values than for true validations across trials. For true validations, Bayes B gave the highest predictive abilities on average and also the highest selection efficiencies, particularly for yield traits that were the lowest heritability traits. Selection efficiencies using Bayes B for parent selection ranged from 74% for average fruit weight to 34% for early marketable yield. A breeding strategy is proposed in which advanced selection trials are utilized as training populations and in which genomic selection can reduce the breeding cycle from 3 to 2 years for a subset of untested parents based on their predicted genomic breeding values. PMID:28090334

  12. Genome-wide analysis correlates Ayurveda Prakriti

    PubMed Central

    Govindaraj, Periyasamy; Nizamuddin, Sheikh; Sharath, Anugula; Jyothi, Vuskamalla; Rotti, Harish; Raval, Ritu; Nayak, Jayakrishna; Bhat, Balakrishna K.; Prasanna, B. V.; Shintre, Pooja; Sule, Mayura; Joshi, Kalpana S.; Dedge, Amrish P.; Bharadwaj, Ramachandra; Gangadharan, G. G.; Nair, Sreekumaran; Gopinath, Puthiya M.; Patwardhan, Bhushan; Kondaiah, Paturu; Satyamoorthy, Kapaettu; Valiathan, Marthanda Varma Sankaran; Thangaraj, Kumarasamy

    2015-01-01

    The practice of Ayurveda, the traditional medicine of India, is based on the concept of three major constitutional types (Vata, Pitta and Kapha) defined as “Prakriti”. To the best of our knowledge, no study has convincingly correlated genomic variations with the classification of Prakriti. In the present study, we performed genome-wide SNP (single nucleotide polymorphism) analysis (Affymetrix, 6.0) of 262 well-classified male individuals (after screening 3416 subjects) belonging to three Prakritis. We found 52 SNPs (p ≤ 1 × 10−5) were significantly different between Prakritis, without any confounding effect of stratification, after 106 permutations. Principal component analysis (PCA) of these SNPs classified 262 individuals into their respective groups (Vata, Pitta and Kapha) irrespective of their ancestry, which represent its power in categorization. We further validated our finding with 297 Indian population samples with known ancestry. Subsequently, we found that PGM1 correlates with phenotype of Pitta as described in the ancient text of Caraka Samhita, suggesting that the phenotypic classification of India’s traditional medicine has a genetic basis; and its Prakriti-based practice in vogue for many centuries resonates with personalized medicine. PMID:26511157

  13. A genomic approach to myoblast fusion in Drosophila

    PubMed Central

    Estrada, Beatriz; Michelson, Alan M.

    2009-01-01

    Summary We have developed an integrated genetic, genomic and computational approach to identify and characterize genes involved in myoblast fusion in Drosophila. We first used fluorescence activated cell sorting to purify mesodermal cells both from wild-type embryos and from twelve variant genotypes in which muscle development is perturbed in known ways. Then, we obtained gene expression profiles for the purified cells by hybridizing isolated mesodermal RNA to Affymetrix GeneChip arrays. These data were subsequently compounded into a statistical meta-analysis that predicts myoblast subtype-specific gene expression signatures that were later validated by in situ hybridization experiments. Finally, we analyzed the myogenic functions of a subset of these myoblast genes using a double-stranded RNA interference assay in living embryos expressing green fluorescent protein under control of a muscle-specific promoter. This experimental strategy led to the identification of several previously uncharacterized genes required for myoblast fusion in Drosophila. PMID:18979251

  14. SEARCH FOR GENOMIC ALTERATIONS IN MONOZYGOTIC TWINS DISCORDANT FOR CLEFT LIP AND/OR PALATE

    PubMed Central

    Kimani, Jane W.; Yoshiura, Koh-ichiro; Shi, Min; Jugessur, Astanand; Moretti-Ferreira, Danilo; Christensen, Kaare; Murray, Jeffrey C.

    2010-01-01

    Phenotypically discordant monozygotic twins offer the possibility of gene discovery through delineation of molecular abnormalities in one member of the twin pair. One proposed mechanism of discordance is postzygotically occurring genomic alterations resulting from mitotic recombination and other somatic changes. Detection of altered genomic fragments can reveal candidate gene loci that can be verified through additional analyses. We investigated this hypothesis using array comparative genomic hybridization; the 50K and 250K Affymetrix GeneChip® SNP arrays and an Illumina custom array consisting of 1,536 SNPs, to scan for genomic alterations in a sample of monozygotic twin pairs with discordant cleft lip and/or palate phenotypes. Paired analysis for deletions, amplifications and loss of heterozygosity, along with sequence verification of SNPs with discordant genotype calls did not reveal any genomic discordance between twin pairs in lymphocyte DNA samples. Our results demonstrate that postzygotic genomic alterations are not a common cause of monozygotic twin discordance for isolated cleft lip and/or palate. However, rare or balanced genomic alterations, tissue-specific events and small aberrations beyond the detection level of our experimental approach cannot be ruled out. The stability of genomes we observed in our study samples also suggests that detection of discordant events in other monozygotic twin pairs would be remarkable and of potential disease significance. PMID:19803774

  15. An Affymetrix Microarray Design for Microbial Genotyping

    DTIC Science & Technology

    2009-10-01

    Clostridium botulinum APRT Okra 5 Clostridium botulinum A str. ATCC 19397 5 Clostridium botulinum ATCC 3502 40 Clostridium botulinum B str. Eklund 17B 5...Clostridium botulinum SNP B1 str. Okra plasmid pCLD 20 Clostridium botulinum B1 str. Okra plasmid pCLD 5 Clostridium botulinum Bf 5 Clostridium...botulinum HPT Eklund 17B 10 Clostridium botulinum HPT Loch Maree 20 Clostridium botulinum HPT Okra 5 Clostridium botulinum A3 str. Loch Maree 5

  16. Preterm Birth Genome Project (PGP) -- validation of resources for preterm birth genome-wide studies.

    PubMed

    Pennell, Craig E; Vadillo-Ortega, Felipe; Olson, David M; Ha, Eun-Hee; Williams, Scott; Frayling, Tim M; Dolan, Siobhan; Katz, Michael; Merialdi, Mario; Menon, Ramkumar

    2013-01-01

    We determined a series of quality control (QC) analyses to assess the usability of DNA collected and processed from different countries utilizing different DNA extraction techniques prior to genome-wide association studies (GWAS). The quality of DNA collected utilizing four different DNA extraction techniques and the impact of shipping DNA at different temperatures on array performance were evaluated. Fifteen maternal-fetal pairs were used from four countries. DNA was extracted using four approaches: whole blood, blood spots with whole genome amplification (WGA), saliva and buccal swab. Samples were sent to a genotyping facility, either on dry ice or at room temperature and genotyped using Affymetrix SNP array 6.0. QC measured included extraction techniques, effect of shipping temperatures, accuracy and Mendelian concordance. Significantly fewer (50 % ) single nucleotide polymorphisms (SNPs) passed QC metrics for buccal swab DNA (P < 0.0001) due to missing genotype data (P < 0.0001). Whole blood or saliva DNA had the highest call rates (99.2 0.4 % and 99.3 0.2 % , respectively) and Mendelian concordance. Shipment temperature had no effect. DNA from blood or saliva had the highest call rate accuracy, and buccal swabs had the lowest. DNA extracted from blood, saliva and blood spots were found suitable for GWAS in our study.

  17. Microarray-based genomic profiling reveals novel genomic aberrations in follicular lymphoma which associate with patient survival and gene expression status.

    PubMed

    Schwaenen, Carsten; Viardot, Andreas; Berger, Hilmar; Barth, Thomas F E; Bentink, Stefan; Döhner, Hartmut; Enz, Martina; Feller, Alfred C; Hansmann, Martin-Leo; Hummel, Michael; Kestler, Hans A; Klapper, Wolfram; Kreuz, Markus; Lenze, Dido; Loeffler, Markus; Möller, Peter; Müller-Hermelink, Hans-Konrad; Ott, German; Rosolowski, Maciej; Rosenwald, Andreas; Ruf, Sandra; Siebert, Reiner; Spang, Rainer; Stein, Harald; Truemper, Lorenz; Lichter, Peter; Bentz, Martin; Wessendorf, Swen

    2009-01-01

    Follicular lymphoma (FL) is characterized by a large number of chromosomal aberrations. However, their exact genomic extension and involved target genes remain to be determined. For this purpose, we used array-based intermediate-high resolution genomic profiling in combination with Affymetrix gene expression analysis. Tumor specimens from 128 FL patients were analyzed for the presence of genomic aberrations and the results were correlated to clinical data sets and mRNA expression levels. In 114 (89%) of the 128 analyzed cases, a total of 688 genomic aberrations (384 gains/amplifications and 304 losses) were detected. Frequent genomic aberrations were: -1p36 (18%), +2p15 (24%), -3q (14%), -6q (25%), +7p (19%), +7q (23%), +8q (14%), -9p (16%), -11q (15%), +12q (20%), -13q (11%), -17p (16%), +18p (18%), and +18q (28%). Critical segments of these imbalances were delineated to genomic fragments with a minimum size down to 0.2 Mb. By comparison of these with mRNA gene expression data, putative candidate genes were identified. Moreover, we found that deletions affecting the tumor suppressor gene CDKN2A/B on 9p21 were detected in nontransformed FL grade I-II. For this aberration as well as for -6q25 and -6q26, an association with inferior survival was observed.

  18. Arabidopsis transcriptional responses differentiate between O3 and herbicides

    EPA Science Inventory

    Using published data based on Affymetrix ATH1 Gene-Chips we characterized the transcriptional response of Arabidopsis thaliana Columbia to O3 and a few other major environmental stresses including oxidative stress . A set of 101 markers could be extracted which provided a compo...

  19. Genetics and genomics of Drosophila mating behavior

    PubMed Central

    Mackay, Trudy F. C.; Heinsohn, Stefanie L.; Lyman, Richard F.; Moehring, Amanda J.; Morgan, Theodore J.; Rollmann, Stephanie M.

    2005-01-01

    The first steps of animal speciation are thought to be the development of sexual isolating mechanisms. In contrast to recent progress in understanding the genetic basis of postzygotic isolating mechanisms, little is known about the genetic architecture of sexual isolation. Here, we have subjected Drosophila melanogaster to 29 generations of replicated divergent artificial selection for mating speed. The phenotypic response to selection was highly asymmetrical in the direction of reduced mating speed, with estimates of realized heritability averaging 7%. The selection response was largely attributable to a reduction in female receptivity. We assessed the whole genome transcriptional response to selection for mating speed using Affymetrix GeneChips and a rigorous statistical analysis. Remarkably, >3,700 probe sets (21% of the array elements) exhibited a divergence in message levels between the Fast and Slow replicate lines. Genes with altered transcriptional abundance in response to selection fell into many different biological process and molecular function Gene Ontology categories, indicating substantial pleiotropy for this complex behavior. Future functional studies are necessary to test the extent to which transcript profiling of divergent selection lines accurately predicts genes that directly affect the selected trait. PMID:15851659

  20. A Pooled Genome-Wide Association Study of Asperger Syndrome.

    PubMed

    Warrier, Varun; Chakrabarti, Bhismadev; Murphy, Laura; Chan, Allen; Craig, Ian; Mallya, Uma; Lakatošová, Silvia; Rehnstrom, Karola; Peltonen, Leena; Wheelwright, Sally; Allison, Carrie; Fisher, Simon E; Baron-Cohen, Simon

    2015-01-01

    Asperger Syndrome (AS) is a neurodevelopmental condition characterized by impairments in social interaction and communication, alongside the presence of unusually repetitive, restricted interests and stereotyped behaviour. Individuals with AS have no delay in cognitive and language development. It is a subset of Autism Spectrum Conditions (ASC), which are highly heritable and has a population prevalence of approximately 1%. Few studies have investigated the genetic basis of AS. To address this gap in the literature, we performed a genome-wide pooled DNA association study to identify candidate loci in 612 individuals (294 cases and 318 controls) of Caucasian ancestry, using the Affymetrix GeneChip Human Mapping version 6.0 array. We identified 11 SNPs that had a p-value below 1x10-5. These SNPs were independently genotyped in the same sample. Three of the SNPs (rs1268055, rs7785891 and rs2782448) were nominally significant, though none remained significant after Bonferroni correction. Two of our top three SNPs (rs7785891 and rs2782448) lie in loci previously implicated in ASC. However, investigation of the three SNPs in the ASC genome-wide association dataset from the Psychiatric Genomics Consortium indicated that these three SNPs were not significantly associated with ASC. The effect sizes of the variants were modest, indicating that our study was not sufficiently powered to identify causal variants with precision.

  1. A Pooled Genome-Wide Association Study of Asperger Syndrome

    PubMed Central

    Warrier, Varun; Chakrabarti, Bhismadev; Murphy, Laura; Chan, Allen; Craig, Ian; Mallya, Uma; Lakatošová, Silvia; Rehnstrom, Karola; Wheelwright, Sally; Allison, Carrie; Fisher, Simon E.; Baron-Cohen, Simon

    2015-01-01

    Asperger Syndrome (AS) is a neurodevelopmental condition characterized by impairments in social interaction and communication, alongside the presence of unusually repetitive, restricted interests and stereotyped behaviour. Individuals with AS have no delay in cognitive and language development. It is a subset of Autism Spectrum Conditions (ASC), which are highly heritable and has a population prevalence of approximately 1%. Few studies have investigated the genetic basis of AS. To address this gap in the literature, we performed a genome-wide pooled DNA association study to identify candidate loci in 612 individuals (294 cases and 318 controls) of Caucasian ancestry, using the Affymetrix GeneChip Human Mapping version 6.0 array. We identified 11 SNPs that had a p-value below 1x10-5. These SNPs were independently genotyped in the same sample. Three of the SNPs (rs1268055, rs7785891 and rs2782448) were nominally significant, though none remained significant after Bonferroni correction. Two of our top three SNPs (rs7785891 and rs2782448) lie in loci previously implicated in ASC. However, investigation of the three SNPs in the ASC genome-wide association dataset from the Psychiatric Genomics Consortium indicated that these three SNPs were not significantly associated with ASC. The effect sizes of the variants were modest, indicating that our study was not sufficiently powered to identify causal variants with precision. PMID:26176695

  2. Antarctic Genomics

    PubMed Central

    Clarke, Andrew; Cockell, Charles S.; Convey, Peter; Detrich III, H. William; Fraser, Keiron P. P.; Johnston, Ian A.; Methe, Barbara A.; Murray, Alison E.; Peck, Lloyd S.; Römisch, Karin; Rogers, Alex D.

    2004-01-01

    With the development of genomic science and its battery of technologies, polar biology stands on the threshold of a revolution, one that will enable the investigation of important questions of unprecedented scope and with extraordinary depth and precision. The exotic organisms of polar ecosystems are ideal candidates for genomic analysis. Through such analyses, it will be possible to learn not only the novel features that enable polar organisms to survive, and indeed thrive, in their extreme environments, but also fundamental biological principles that are common to most, if not all, organisms. This article aims to review recent developments in Antarctic genomics and to demonstrate the global context of such studies. PMID:18629155

  3. Genomic Testing

    MedlinePlus

    ... Services released a report identifying gaps in the regulation, oversight, and usefulness of genetic testing. They expressed ... December 20, 2016 Content source: Center for Surveillance, Epidemiology and Laboratory Services (CSELS) , Public Health Genomics Email ...

  4. Genome-Wide Association Study of Copy Number Variations in Patients with Familial Neurocardiogenic Syncope.

    PubMed

    Demir, Emre; Hasdemir, Can; Ak, Handan; Atay, Sevcan; Aydin, Hikmet Hakan

    2016-08-01

    Neurocardiogenic syncope (NCS) is the most frequent type of syncope characterized by a self-limited episode of systemic hypotension. In this study, we conducted the first genome-wide association study testing copy number variations for association with NCS. Study population consisted of 107 consecutive patients with recurrent syncope and positive head-up tilt table testing. Four families with NCS were selected for CNV analysis. Affymetrix GeneChip(®) SNP 6.0 array was used for CNV analysis. Data and statistical analysis were performed with Affymetrix genotyping console 4.0 and GraphPad Prism v6. Positive family history of NCS was present in 19.6 % (n = 21) in our study population (n = 107). Twenty-six CNV regions were found to be significantly altered in families with NCS (P < 0.05). Several CNVs were identified in families with NCS. Further studies comprising wider study population are required to determine the effect of these variations on NCS development.

  5. Genome Sequencing.

    PubMed

    Verma, Mansi; Kulshrestha, Samarth; Puri, Ayush

    2017-01-01

    Genome sequencing is an important step toward correlating genotypes with phenotypic characters. Sequencing technologies are important in many fields in the life sciences, including functional genomics, transcriptomics, oncology, evolutionary biology, forensic sciences, and many more. The era of sequencing has been divided into three generations. First generation sequencing involved sequencing by synthesis (Sanger sequencing) and sequencing by cleavage (Maxam-Gilbert sequencing). Sanger sequencing led to the completion of various genome sequences (including human) and provided the foundation for development of other sequencing technologies. Since then, various techniques have been developed which can overcome some of the limitations of Sanger sequencing. These techniques are collectively known as "Next-generation sequencing" (NGS), and are further classified into second and third generation technologies. Although NGS methods have many advantages in terms of speed, cost, and parallelism, the accuracy and read length of Sanger sequencing is still superior and has confined the use of NGS mainly to resequencing genomes. Consequently, there is a continuing need to develop improved real time sequencing techniques. This chapter reviews some of the options currently available and provides a generic workflow for sequencing a genome.

  6. Genome databases

    SciTech Connect

    Courteau, J.

    1991-10-11

    Since the Genome Project began several years ago, a plethora of databases have been developed or are in the works. They range from the massive Genome Data Base at Johns Hopkins University, the central repository of all gene mapping information, to small databases focusing on single chromosomes or organisms. Some are publicly available, others are essentially private electronic lab notebooks. Still others limit access to a consortium of researchers working on, say, a single human chromosome. An increasing number incorporate sophisticated search and analytical software, while others operate as little more than data lists. In consultation with numerous experts in the field, a list has been compiled of some key genome-related databases. The list was not limited to map and sequence databases but also included the tools investigators use to interpret and elucidate genetic data, such as protein sequence and protein structure databases. Because a major goal of the Genome Project is to map and sequence the genomes of several experimental animals, including E. coli, yeast, fruit fly, nematode, and mouse, the available databases for those organisms are listed as well. The author also includes several databases that are still under development - including some ambitious efforts that go beyond data compilation to create what are being called electronic research communities, enabling many users, rather than just one or a few curators, to add or edit the data and tag it as raw or confirmed.

  7. Listeria Genomics

    NASA Astrophysics Data System (ADS)

    Cabanes, Didier; Sousa, Sandra; Cossart, Pascale

    The opportunistic intracellular foodborne pathogen Listeria monocytogenes has become a paradigm for the study of host-pathogen interactions and bacterial adaptation to mammalian hosts. Analysis of L. monocytogenes infection has provided considerable insight into how bacteria invade cells, move intracellularly, and disseminate in tissues, as well as tools to address fundamental processes in cell biology. Moreover, the vast amount of knowledge that has been gathered through in-depth comparative genomic analyses and in vivo studies makes L. monocytogenes one of the most well-studied bacterial pathogens. This chapter provides an overview of progress in the exploration of genomic, transcriptomic, and proteomic data in Listeria spp. to understand genome evolution and diversity, as well as physiological aspects of metabolism used by bacteria when growing in diverse environments, in particular in infected hosts.

  8. Impact of copy number variations burden on coding genome in humans using integrated high resolution arrays.

    PubMed

    Veerappa, Avinash M; Lingaiah, Kusuma; Vishweswaraiah, Sangeetha; Murthy, Megha N; Suresh, Raviraj V; Manjegowda, Dinesh S; Ramachandra, Nallur B

    2014-12-16

    Copy number variations (CNVs) alter the transcriptional and translational levels of genes by disrupting the coding structure and this burden of CNVs seems to be a significant contributor to phenotypic variations. Therefore it was necessary to assess the complexities of CNV burden on the coding genome. A total of 1715 individuals from 12 populations were used for CNV analysis in the present investigation. Analysis was performed using Affymetrix Genome-Wide Human SNP Array 6·0 chip and CytoScan High-Density arrays. CNVs were more frequently observed in the coding region than in the non-coding region. CNVs were observed vastly more frequently in the coding region than the non-coding region. CNVs were found to be enriched in the regions containing functional genes (83-96%) compared with the regions containing pseudogenes (4-17%). CNVs across the genome of an individual showed multiple hits across many genes, whose proteins interact physically and function under the same pathway. We identified varying numbers of proteins and degrees of interactions within protein complexes of single individual genomes. This study represents the first draft of a population-specific CNV genes map as well as a cross-populational map. The complex relationship of CNVs on genes and their physically interacting partners unravels many complexities involved in phenotype expression. This study identifies four mechanisms contributing to the complexities caused by the presence of multiple CNVs across many genes in the coding part of the genome.

  9. Admixture mapping identifies introgressed genomic regions in North American canids.

    PubMed

    vonHoldt, Bridgett M; Kays, Roland; Pollinger, John P; Wayne, Robert K

    2016-06-01

    Hybrid zones typically contain novel gene combinations that can be tested by natural selection in a unique genetic context. Parental haplotypes that increase fitness can introgress beyond the hybrid zone, into the range of parental species. We used the Affymetrix canine SNP genotyping array to identify genomic regions tagged by multiple ancestry informative markers that are more frequent in an admixed population than expected. We surveyed a hybrid zone formed in the last 100 years as coyotes expanded their range into eastern North America. Concomitant with expansion, coyotes hybridized with wolves and some populations became more wolflike, such that coyotes in the northeast have the largest body size of any coyote population. Using a set of 3102 ancestry informative markers, we identified 60 differentially introgressed regions in 44 canines across this admixture zone. These regions are characterized by an excess of exogenous ancestry and, in northeastern coyotes, are enriched for genes affecting body size and skeletal proportions. Further, introgressed wolf-derived alleles have penetrated into Southern US coyote populations. Because no wolves currently exist in this area, these alleles are unlikely to have originated from recent hybridization. Instead, they probably originated from intraspecific gene flow or ancient admixture. We show that grey wolf and coyote admixture has far-reaching effects and, in addition to phenotypically transforming admixed populations, allows for the differential movement of alleles from different parental species to be tested in new genomic backgrounds.

  10. Genome mapping

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genome maps can be thought of much like road maps except that, instead of traversing across land, they traverse across the chromosomes of an organism. Genetic markers serve as landmarks along the chromosome and provide researchers information as to how close they may be to a gene or region of inter...

  11. Comparison of Comparative Genomic Hybridization Technologies across Microarray Platforms

    EPA Science Inventory

    In the 2007 Association of Biomolecular Resource Facilities (ABRF) Microarray Research Group (MARG) project, we analyzed HL-60 DNA with five platforms: Agilent, Affymetrix 500K, Affymetrix U133 Plus 2.0, Illumina, and RPCI 19K BAC arrays. Copy number variation (CNV) was analyzed ...

  12. Genome cartography: charting the apicomplexan genome.

    PubMed

    Kissinger, Jessica C; DeBarry, Jeremy

    2011-08-01

    Genes reside in particular genomic contexts that can be mapped at many levels. Historically, 'genetic maps' were used primarily to locate genes. Recent technological advances in the determination of genome sequences have made the analysis and comparison of whole genomes possible and increasingly tractable. What do we see if we shift our focus from gene content (the 'inventory' of genes contained within a genome) to the composition and organization of a genome? This review examines what has been learned about the evolution of the apicomplexan genome as well as the significance and impact of genomic location on our understanding of the eukaryotic genome and parasite biology.

  13. Whole-genome linkage analysis in mapping alcoholism genes using single-nucleotide polymorphisms and microsatellites.

    PubMed

    Wang, Shuang; Huang, Song; Liu, Nianjun; Chen, Liang; Oh, Cheongeun; Zhao, Hongyu

    2005-12-30

    There is currently a great interest in using single-nucleotide polymorphisms (SNPs) in genetic linkage and association studies because of the abundance of SNPs as well as the availability of high-throughput genotyping technologies. In this study, we compared the performance of whole-genome scans using SNPs with microsatellites on 143 pedigrees from the Collaborative Studies on Genetics of Alcoholism provided by Genetic Analysis Workshop 14. A total of 315 microsatellites and 10,081 SNPs from Affymetrix on 22 autosomal chromosomes were used in our analyses. We found that the results from the two scans had good overall concordance. One region on chromosome 2 and two regions on chromosome 7 showed significant linkage signals (i.e., NPL >or= 2) for alcoholism from both the SNP and microsatellite scans. The different results observed between the two scans may be explained by the difference observed in information content between the SNPs and the microsatellites.

  14. Personal genomics services: whose genomes?

    PubMed Central

    Gurwitz, David; Bregman-Eschet, Yael

    2009-01-01

    New companies offering personal whole-genome information services over the internet are dynamic and highly visible players in the personal genomics field. For fees currently ranging from US$399 to US$2500 and a vial of saliva, individuals can now purchase online access to their individual genetic information regarding susceptibility to a range of chronic diseases and phenotypic traits based on a genome-wide SNP scan. Most of the companies offering such services are based in the United States, but their clients may come from nearly anywhere in the world. Although the scientific validity, clinical utility and potential future implications of such services are being hotly debated, several ethical and regulatory questions related to direct-to-consumer (DTC) marketing strategies of genetic tests have not yet received sufficient attention. For example, how can we minimize the risk of unauthorized third parties from submitting other people's DNA for testing? Another pressing question concerns the ownership of (genotypic and phenotypic) information, as well as the unclear legal status of customers regarding their own personal information. Current legislation in the US and Europe falls short of providing clear answers to these questions. Until the regulation of personal genomics services catches up with the technology, we call upon commercial providers to self-regulate and coordinate their activities to minimize potential risks to individual privacy. We also point out some specific steps, along the trustee model, that providers of DTC personal genomics services as well as regulators and policy makers could consider for addressing some of the concerns raised below. PMID:19259127

  15. Citrus Genomics

    PubMed Central

    Talon, Manuel; Gmitter Jr., Fred G.

    2008-01-01

    Citrus is one of the most widespread fruit crops globally, with great economic and health value. It is among the most difficult plants to improve through traditional breeding approaches. Currently, there is risk of devastation by diseases threatening to limit production and future availability to the human population. As technologies rapidly advance in genomic science, they are quickly adapted to address the biological challenges of the citrus plant system and the world's industries. The historical developments of linkage mapping, markers and breeding, EST projects, physical mapping, an international citrus genome sequencing project, and critical functional analysis are described. Despite the challenges of working with citrus, there has been substantial progress. Citrus researchers engaged in international collaborations provide optimism about future productivity and contributions to the benefit of citrus industries worldwide and to the human population who can rely on future widespread availability of this health-promoting and aesthetically pleasing fruit crop. PMID:18509486

  16. Ancient genomics

    PubMed Central

    Der Sarkissian, Clio; Allentoft, Morten E.; Ávila-Arcos, María C.; Barnett, Ross; Campos, Paula F.; Cappellini, Enrico; Ermini, Luca; Fernández, Ruth; da Fonseca, Rute; Ginolhac, Aurélien; Hansen, Anders J.; Jónsson, Hákon; Korneliussen, Thorfinn; Margaryan, Ashot; Martin, Michael D.; Moreno-Mayar, J. Víctor; Raghavan, Maanasa; Rasmussen, Morten; Velasco, Marcela Sandoval; Schroeder, Hannes; Schubert, Mikkel; Seguin-Orlando, Andaine; Wales, Nathan; Gilbert, M. Thomas P.; Willerslev, Eske; Orlando, Ludovic

    2015-01-01

    The past decade has witnessed a revolution in ancient DNA (aDNA) research. Although the field's focus was previously limited to mitochondrial DNA and a few nuclear markers, whole genome sequences from the deep past can now be retrieved. This breakthrough is tightly connected to the massive sequence throughput of next generation sequencing platforms and the ability to target short and degraded DNA molecules. Many ancient specimens previously unsuitable for DNA analyses because of extensive degradation can now successfully be used as source materials. Additionally, the analytical power obtained by increasing the number of sequence reads to billions effectively means that contamination issues that have haunted aDNA research for decades, particularly in human studies, can now be efficiently and confidently quantified. At present, whole genomes have been sequenced from ancient anatomically modern humans, archaic hominins, ancient pathogens and megafaunal species. Those have revealed important functional and phenotypic information, as well as unexpected adaptation, migration and admixture patterns. As such, the field of aDNA has entered the new era of genomics and has provided valuable information when testing specific hypotheses related to the past. PMID:25487338

  17. Ancient genomics.

    PubMed

    Der Sarkissian, Clio; Allentoft, Morten E; Ávila-Arcos, María C; Barnett, Ross; Campos, Paula F; Cappellini, Enrico; Ermini, Luca; Fernández, Ruth; da Fonseca, Rute; Ginolhac, Aurélien; Hansen, Anders J; Jónsson, Hákon; Korneliussen, Thorfinn; Margaryan, Ashot; Martin, Michael D; Moreno-Mayar, J Víctor; Raghavan, Maanasa; Rasmussen, Morten; Velasco, Marcela Sandoval; Schroeder, Hannes; Schubert, Mikkel; Seguin-Orlando, Andaine; Wales, Nathan; Gilbert, M Thomas P; Willerslev, Eske; Orlando, Ludovic

    2015-01-19

    The past decade has witnessed a revolution in ancient DNA (aDNA) research. Although the field's focus was previously limited to mitochondrial DNA and a few nuclear markers, whole genome sequences from the deep past can now be retrieved. This breakthrough is tightly connected to the massive sequence throughput of next generation sequencing platforms and the ability to target short and degraded DNA molecules. Many ancient specimens previously unsuitable for DNA analyses because of extensive degradation can now successfully be used as source materials. Additionally, the analytical power obtained by increasing the number of sequence reads to billions effectively means that contamination issues that have haunted aDNA research for decades, particularly in human studies, can now be efficiently and confidently quantified. At present, whole genomes have been sequenced from ancient anatomically modern humans, archaic hominins, ancient pathogens and megafaunal species. Those have revealed important functional and phenotypic information, as well as unexpected adaptation, migration and admixture patterns. As such, the field of aDNA has entered the new era of genomics and has provided valuable information when testing specific hypotheses related to the past.

  18. A Fast Implementation of a Scan Statistic for Identifying Chromosomal Patterns of Genome Wide Association Studies.

    PubMed

    Sun, Yan V; Jacobsen, Douglas M; Turner, Stephen T; Boerwinkle, Eric; Kardia, Sharon L R

    2009-03-15

    In order to take into account the complex genomic distribution of SNP variations when identifying chromosomal regions with significant SNP effects, a single nucleotide polymorphism (SNP) association scan statistic was developed. To address the computational needs of genome wide association (GWA) studies, a fast Java application, which combines single-locus SNP tests and a scan statistic for identifying chromosomal regions with significant clusters of significant SNP effects, was developed and implemented. To illustrate this application, SNP associations were analyzed in a pharmacogenomic study of the blood pressure lowering effect of thiazide-diuretics (N=195) using the Affymetrix Human Mapping 100K Set. 55,335 tagSNPs (pair-wise linkage disequilibrium R(2)<0.5) were selected to reduce the frequency correlation between SNPs. A typical workstation can complete the whole genome scan including 10,000 permutation tests within 3 hours. The most significant regions locate on chromosome 3, 6, 13 and 16, two of which contain candidate genes that may be involved in the underlying drug response mechanism. The computational performance of ChromoScan-GWA and its scalability were tested with up to 1,000,000 SNPs and up to 4,000 subjects. Using 10,000 permutations, the computation time grew linearly in these datasets. This scan statistic application provides a robust statistical and computational foundation for identifying genomic regions associated with disease and provides a method to compare GWA results even across different platforms.

  19. Genomic Islands of Speciation in Anopheles gambiae

    PubMed Central

    Hahn, Matthew W; Nuzhdin, Sergey V

    2005-01-01

    The African malaria mosquito, Anopheles gambiae sensu stricto (A. gambiae), provides a unique opportunity to study the evolution of reproductive isolation because it is divided into two sympatric, partially isolated subtaxa known as M form and S form. With the annotated genome of this species now available, high-throughput techniques can be applied to locate and characterize the genomic regions contributing to reproductive isolation. In order to quantify patterns of differentiation within A. gambiae, we hybridized population samples of genomic DNA from each form to Affymetrix GeneChip microarrays. We found that three regions, together encompassing less than 2.8 Mb, are the only locations where the M and S forms are significantly differentiated. Two of these regions are adjacent to centromeres, on Chromosomes 2L and X, and contain 50 and 12 predicted genes, respectively. Sequenced loci in these regions contain fixed differences between forms and no shared polymorphisms, while no fixed differences were found at nearby control loci. The third region, on Chromosome 2R, contains only five predicted genes; fixed differences in this region were also verified by direct sequencing. These “speciation islands” remain differentiated despite considerable gene flow, and are therefore expected to contain the genes responsible for reproductive isolation. Much effort has recently been applied to locating the genes and genetic changes responsible for reproductive isolation between species. Though much can be inferred about speciation by studying taxa that have diverged for millions of years, studying differentiation between taxa that are in the early stages of isolation will lead to a clearer view of the number and size of regions involved in the genetics of speciation. Despite appreciable levels of gene flow between the M and S forms of A. gambiae, we were able to isolate three small regions of differentiation where genes responsible for ecological and behavioral isolation are

  20. Genomic arrays in chronic lymphocytic leukemia routine clinical practice: are we ready to substitute conventional cytogenetics and fluorescence in situ hybridization techniques?

    PubMed

    Puiggros, Anna; Puigdecanet, Eulàlia; Salido, Marta; Ferrer, Ana; Abella, Eugènia; Gimeno, Eva; Nonell, Lara; Herranz, María José; Galván, Ana Belén; Rodríguez-Rivera, María; Melero, Carme; Pairet, Silvia; Bellosillo, Beatriz; Serrano, Sergi; Florensa, Lourdes; Solé, Francesc; Espinet, Blanca

    2013-05-01

    Chronic lymphocytic leukemia (CLL) is characterized by a highly variable clinical course. Del(11q) and del(17p), routinely studied by conventional G-banding cytogenetics (CGC) and fluorescence in situ hybridization (FISH), have been related to progression and shorter overall survival. Recently, array-based karyotyping has gained acceptance as a high-resolution new tool for detecting genomic imbalances. The aim of the present study was to compare genomic arrays with CGC and FISH to ascertain whether the current techniques could be substituted in routine procedures. We analyzed 70 patients with CLL using the Cytogenetics Whole-Genome 2.7M Array and CytoScan HD Array (Affymetrix), CGC and FISH with the classical CLL panel. Whereas 31.4% and 68.6% of patients presented abnormalities when studied by CGC and FISH, respectively, these rates increased when arrays were also analyzed (78.6% and 80%). Although abnormality detection is higher when arrays are applied, one case with del(11q) and three with del(17p) were missed by genomic arrays due to their limited sensitivity. We consider that the complete substitution of CGC and FISH by genomic arrays in routine laboratories could negatively affect the management of some patients harboring 11q or 17p deletions. In conclusion, genomic arrays are valid to detect known and novel genomic imbalances in CLL, but should be maintained as a complementary tool to the current techniques.

  1. The platypus genome unraveled.

    PubMed

    O'Brien, Stephen J

    2008-06-13

    The genome of the platypus has been sequenced, assembled, and annotated by an international genomics team. Like the animal itself the platypus genome contains an amalgam of mammal, reptile, and bird-like features.

  2. Genome evolution: the dynamics of static genomes.

    PubMed

    Stechmann, Alexandra

    2004-06-22

    A random survey of a microsporidian genome has revealed some striking features. Although the genomes of microsporidians are among the smallest known for eukaryotes, their organisation appears to be well conserved.

  3. Plant Genome Duplication Database.

    PubMed

    Lee, Tae-Ho; Kim, Junah; Robertson, Jon S; Paterson, Andrew H

    2017-01-01

    Genome duplication, widespread in flowering plants, is a driving force in evolution. Genome alignments between/within genomes facilitate identification of homologous regions and individual genes to investigate evolutionary consequences of genome duplication. PGDD (the Plant Genome Duplication Database), a public web service database, provides intra- or interplant genome alignment information. At present, PGDD contains information for 47 plants whose genome sequences have been released. Here, we describe methods for identification and estimation of dates of genome duplication and speciation by functions of PGDD.The database is freely available at http://chibba.agtec.uga.edu/duplication/.

  4. Rice-arsenate interactions in hydroponics: whole genome transcriptional analysis.

    PubMed

    Norton, Gareth J; Lou-Hing, Daniel E; Meharg, Andrew A; Price, Adam H

    2008-01-01

    Rice (Oryza sativa) varieties that are arsenate-tolerant (Bala) and -sensitive (Azucena) were used to conduct a transcriptome analysis of the response of rice seedlings to sodium arsenate (AsV) in hydroponic solution. RNA extracted from the roots of three replicate experiments of plants grown for 1 week in phosphate-free nutrient with or without 13.3 muM AsV was used to challenge the Affymetrix (52K) GeneChip Rice Genome array. A total of 576 probe sets were significantly up-regulated at least 2-fold in both varieties, whereas 622 were down-regulated. Ontological classification is presented. As expected, a large number of transcription factors, stress proteins, and transporters demonstrated differential expression. Striking is the lack of response of classic oxidative stress-responsive genes or phytochelatin synthases/synthatases. However, the large number of responses from genes involved in glutathione synthesis, metabolism, and transport suggests that glutathione conjugation and arsenate methylation may be important biochemical responses to arsenate challenge. In this report, no attempt is made to dissect differences in the response of the tolerant and sensitive variety, but analysis in a companion article will link gene expression to the known tolerance loci available in the BalaxAzucena mapping population.

  5. Genomic analysis of gum disease and hypertrichosis in foxes.

    PubMed

    Clark, J-A B J; Whalen, D; Marshall, H D

    2016-05-20

    Since the 1940s, a proliferative gingival disease called hereditary hyperplastic gingivitis (HHG) has been described in the farmed silver fox, Vulpes vulpes (Dyrendahl and Henricson 1960). HHG displays an autosomal recessive transmission and has a pleiotropic relationship with superior fur quality in terms of length and thickness of guard hairs. An analogous human disease, hereditary gingival fibromatosis (HGF), is characterized by a predominantly autosomal dominant transmission and a complex etiology, occurring either as an isolated condition or as a part of a syndrome. Similar to HHG, the symptom most commonly associated with syndromic HGF is hypertrichosis. Here we explore potential mechanisms involved in HHG by comparison to known genetic information about hypertrichosis co-occurring with HGF, using an Affymetrix canine genome microarray platform, quantitative PCR, and candidate gene sequencing. We conclude that the mitogen-activated protein kinase pathway is involved in HHG, however despite involvement of the mitogen-activated protein kinase kinase 6 gene in congenital hypertrichosis with gingival fibromatosis in humans, this gene did not contain any fixed mutations in exons or exon-intron boundaries in HHG-affected foxes, suggesting that it is not causative of HHG in the farmed silver fox population. Differential up-regulation of MAP2K6 gene in HHG-affected foxes does implicate this gene in the HHG phenotype.

  6. Brain Perihematoma Genomic Profile Following Spontaneous Human Intracerebral Hemorrhage

    PubMed Central

    Rosell, Anna; Vilalta, Anna; García-Berrocoso, Teresa; Fernández-Cadenas, Israel; Domingues-Montanari, Sophie; Cuadrado, Eloy; Delgado, Pilar; Ribó, Marc; Martínez-Sáez, Elena; Ortega-Aznar, Arantxa; Montaner, Joan

    2011-01-01

    Background Spontaneous intracerebral hemorrhage (ICH) represents about 15% of all strokes and is associated with high mortality rates. Our aim was to identify the gene expression changes and biological pathways altered in the brain following ICH. Methodology/Principal Findings Twelve brain samples were obtained from four deceased patients who suffered an ICH including perihematomal tissue (PH) and the corresponding contralateral white (CW) and grey (CG) matter. Affymetrix GeneChip platform for analysis of over 47,000 transcripts was conducted. Microarray Analysis Suite 5.0 was used to process array images and the Ingenuity Pathway Analysis System was used to analyze biological mechanisms and functions of the genes. We identified 468 genes in the PH areas displaying a different expression pattern with a fold change between −3.74 and +5.16 when compared to the contralateral areas (291 overexpressed and 177 underexpressed). The top genes which appeared most significantly overexpressed in the PH areas codify for cytokines, chemokines, coagulation factors, cell growth and proliferation factors while the underexpressed codify for proteins involved in cell cycle or neurotrophins. Validation and replication studies at gene and protein level in brain samples confirmed microarray results. Conclusions The genomic responses identified in this study provide valuable information about potential biomarkers and target molecules altered in the perihematomal regions. PMID:21311749

  7. Ensembl genomes 2016: more genomes, more complexity

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent...

  8. Ensembl Genomes 2016: more genomes, more complexity

    PubMed Central

    Kersey, Paul Julian; Allen, James E.; Armean, Irina; Boddu, Sanjay; Bolt, Bruce J.; Carvalho-Silva, Denise; Christensen, Mikkel; Davis, Paul; Falin, Lee J.; Grabmueller, Christoph; Humphrey, Jay; Kerhornou, Arnaud; Khobova, Julia; Aranganathan, Naveen K.; Langridge, Nicholas; Lowy, Ernesto; McDowall, Mark D.; Maheswari, Uma; Nuhn, Michael; Ong, Chuang Kee; Overduin, Bert; Paulini, Michael; Pedro, Helder; Perry, Emily; Spudich, Giulietta; Tapanari, Electra; Walts, Brandon; Williams, Gareth; Tello–Ruiz, Marcela; Stein, Joshua; Wei, Sharon; Ware, Doreen; Bolser, Daniel M.; Howe, Kevin L.; Kulesha, Eugene; Lawson, Daniel; Maslen, Gareth; Staines, Daniel M.

    2016-01-01

    Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of programmatic and interactive interfaces to a rich range of data including reference sequence, gene models, transcriptional data, genetic variation and comparative analysis. This paper provides an update to the previous publications about the resource, with a focus on recent developments. These include the development of new analyses and views to represent polyploid genomes (of which bread wheat is the primary exemplar); and the continued up-scaling of the resource, which now includes over 23 000 bacterial genomes, 400 fungal genomes and 100 protist genomes, in addition to 55 genomes from invertebrate metazoa and 39 genomes from plants. This dramatic increase in the number of included genomes is one part of a broader effort to automate the integration of archival data (genome sequence, but also associated RNA sequence data and variant calls) within the context of reference genomes and make it available through the Ensembl user interfaces. PMID:26578574

  9. Ensembl Genomes 2016: more genomes, more complexity.

    PubMed

    Kersey, Paul Julian; Allen, James E; Armean, Irina; Boddu, Sanjay; Bolt, Bruce J; Carvalho-Silva, Denise; Christensen, Mikkel; Davis, Paul; Falin, Lee J; Grabmueller, Christoph; Humphrey, Jay; Kerhornou, Arnaud; Khobova, Julia; Aranganathan, Naveen K; Langridge, Nicholas; Lowy, Ernesto; McDowall, Mark D; Maheswari, Uma; Nuhn, Michael; Ong, Chuang Kee; Overduin, Bert; Paulini, Michael; Pedro, Helder; Perry, Emily; Spudich, Giulietta; Tapanari, Electra; Walts, Brandon; Williams, Gareth; Tello-Ruiz, Marcela; Stein, Joshua; Wei, Sharon; Ware, Doreen; Bolser, Daniel M; Howe, Kevin L; Kulesha, Eugene; Lawson, Daniel; Maslen, Gareth; Staines, Daniel M

    2016-01-04

    Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of programmatic and interactive interfaces to a rich range of data including reference sequence, gene models, transcriptional data, genetic variation and comparative analysis. This paper provides an update to the previous publications about the resource, with a focus on recent developments. These include the development of new analyses and views to represent polyploid genomes (of which bread wheat is the primary exemplar); and the continued up-scaling of the resource, which now includes over 23 000 bacterial genomes, 400 fungal genomes and 100 protist genomes, in addition to 55 genomes from invertebrate metazoa and 39 genomes from plants. This dramatic increase in the number of included genomes is one part of a broader effort to automate the integration of archival data (genome sequence, but also associated RNA sequence data and variant calls) within the context of reference genomes and make it available through the Ensembl user interfaces.

  10. Soybean Knowledge Base (SoyKB): a Web Resource for Soybean Translational Genomics

    SciTech Connect

    Joshi, Trupti; Patil, Kapil; Fitzpatrick, Michael R.; Franklin, Levi D.; Yao, Qiuming; Cook, Jeffrey R.; Wang, Zhem; Libault, Marc; Brechenmacher, Laurent; Valliyodan, Babu; Wu, Xiaolei; Cheng, Jianlin; Stacey, Gary; Nguyen, Henry T.; Xu, Dong

    2012-01-17

    Background: Soybean Knowledge Base (SoyKB) is a comprehensive all-inclusive web resource for soybean translational genomics. SoyKB is designed to handle the management and integration of soybean genomics, transcriptomics, proteomics and metabolomics data along with annotation of gene function and biological pathway. It contains information on four entities, namely genes, microRNAs, metabolites and single nucleotide polymorphisms (SNPs). Methods: SoyKB has many useful tools such as Affymetrix probe ID search, gene family search, multiple gene/ metabolite search supporting co-expression analysis, and protein 3D structure viewer as well as download and upload capacity for experimental data and annotations. It has four tiers of registration, which control different levels of access to public and private data. It allows users of certain levels to share their expertise by adding comments to the data. It has a user-friendly web interface together with genome browser and pathway viewer, which display data in an intuitive manner to the soybean researchers, producers and consumers. Conclusions: SoyKB addresses the increasing need of the soybean research community to have a one-stop-shop functional and translational omics web resource for information retrieval and analysis in a user-friendly way. SoyKB can be publicly accessed at http://soykb.org/.

  11. Rapid Identification of Potential Drugs for Diabetic Nephropathy Using Whole-Genome Expression Profiles of Glomeruli

    PubMed Central

    Shi, Jingsong; Jiang, Song; Qiu, Dandan; Le, Weibo; Wang, Xiao; Lu, Yinhui; Liu, Zhihong

    2016-01-01

    Objective. To investigate potential drugs for diabetic nephropathy (DN) using whole-genome expression profiles and the Connectivity Map (CMAP). Methodology. Eighteen Chinese Han DN patients and six normal controls were included in this study. Whole-genome expression profiles of microdissected glomeruli were measured using the Affymetrix human U133 plus 2.0 chip. Differentially expressed genes (DEGs) between late stage and early stage DN samples and the CMAP database were used to identify potential drugs for DN using bioinformatics methods. Results. (1) A total of 1065 DEGs (FDR < 0.05 and fold change > 1.5) were found in late stage DN patients compared with early stage DN patients. (2) Piperlongumine, 15d-PGJ2 (15-delta prostaglandin J2), vorinostat, and trichostatin A were predicted to be the most promising potential drugs for DN, acting as NF-κB inhibitors, histone deacetylase inhibitors (HDACIs), PI3K pathway inhibitors, or PPARγ agonists, respectively. Conclusion. Using whole-genome expression profiles and the CMAP database, we rapidly predicted potential DN drugs, and therapeutic potential was confirmed by previously published studies. Animal experiments and clinical trials are needed to confirm both the safety and efficacy of these drugs in the treatment of DN. PMID:27069916

  12. Genome-wide patterns of population structure and admixture in West Africans and African Americans.

    PubMed

    Bryc, Katarzyna; Auton, Adam; Nelson, Matthew R; Oksenberg, Jorge R; Hauser, Stephen L; Williams, Scott; Froment, Alain; Bodo, Jean-Marie; Wambebe, Charles; Tishkoff, Sarah A; Bustamante, Carlos D

    2010-01-12

    Quantifying patterns of population structure in Africans and African Americans illuminates the history of human populations and is critical for undertaking medical genomic studies on a global scale. To obtain a fine-scale genome-wide perspective of ancestry, we analyze Affymetrix GeneChip 500K genotype data from African Americans (n = 365) and individuals with ancestry from West Africa (n = 203 from 12 populations) and Europe (n = 400 from 42 countries). We find that population structure within the West African sample reflects primarily language and secondarily geographical distance, echoing the Bantu expansion. Among African Americans, analysis of genomic admixture by a principal component-based approach indicates that the median proportion of European ancestry is 18.5% (25th-75th percentiles: 11.6-27.7%), with very large variation among individuals. In the African-American sample as a whole, few autosomal regions showed exceptionally high or low mean African ancestry, but the X chromosome showed elevated levels of African ancestry, consistent with a sex-biased pattern of gene flow with an excess of European male and African female ancestry. We also find that genomic profiles of individual African Americans afford personalized ancestry reconstructions differentiating ancient vs. recent European and African ancestry. Finally, patterns of genetic similarity among inferred African segments of African-American genomes and genomes of contemporary African populations included in this study suggest African ancestry is most similar to non-Bantu Niger-Kordofanian-speaking populations, consistent with historical documents of the African Diaspora and trans-Atlantic slave trade.

  13. High-resolution genomic profiling of chronic lymphocytic leukemia reveals new recurrent genomic alterations.

    PubMed

    Edelmann, Jennifer; Holzmann, Karlheinz; Miller, Florian; Winkler, Dirk; Bühler, Andreas; Zenz, Thorsten; Bullinger, Lars; Kühn, Michael W M; Gerhardinger, Andreas; Bloehdorn, Johannes; Radtke, Ina; Su, Xiaoping; Ma, Jing; Pounds, Stanley; Hallek, Michael; Lichter, Peter; Korbel, Jan; Busch, Raymonde; Mertens, Daniel; Downing, James R; Stilgenbauer, Stephan; Döhner, Hartmut

    2012-12-06

    To identify genomic alterations in chronic lymphocytic leukemia (CLL), we performed single-nucleotide polymorphism-array analysis using Affymetrix Version 6.0 on 353 samples from untreated patients entered in the CLL8 treatment trial. Based on paired-sample analysis (n = 144), a mean of 1.8 copy number alterations per patient were identified; approximately 60% of patients carried no copy number alterations other than those detected by fluorescence in situ hybridization analysis. Copy-neutral loss-of-heterozygosity was detected in 6% of CLL patients and was found most frequently on 13q, 17p, and 11q. Minimally deleted regions were refined on 13q14 (deleted in 61% of patients) to the DLEU1 and DLEU2 genes, on 11q22.3 (27% of patients) to ATM, on 2p16.1-2p15 (gained in 7% of patients) to a 1.9-Mb fragment containing 9 genes, and on 8q24.21 (5% of patients) to a segment 486 kb proximal to the MYC locus. 13q deletions exhibited proximal and distal breakpoint cluster regions. Among the most common novel lesions were deletions at 15q15.1 (4% of patients), with the smallest deletion (70.48 kb) found in the MGA locus. Sequence analysis of MGA in 59 samples revealed a truncating mutation in one CLL patient lacking a 15q deletion. MNT at 17p13.3, which in addition to MGA and MYC encodes for the network of MAX-interacting proteins, was also deleted recurrently.

  14. Funding Opportunity: Genomic Data Centers

    Cancer.gov

    Funding Opportunity CCG, Funding Opportunity Center for Cancer Genomics, CCG, Center for Cancer Genomics, CCG RFA, Center for cancer genomics rfa, genomic data analysis network, genomic data analysis network centers,

  15. Ontology for Genome Comparison and Genomic Rearrangements

    PubMed Central

    Flanagan, Keith; Stevens, Robert; Pocock, Matthew; Lee, Pete

    2004-01-01

    We present an ontology for describing genomes, genome comparisons, their evolution and biological function. This ontology will support the development of novel genome comparison algorithms and aid the community in discussing genomic evolution. It provides a framework for communication about comparative genomics, and a basis upon which further automated analysis can be built. The nomenclature defined by the ontology will foster clearer communication between biologists, and also standardize terms used by data publishers in the results of analysis programs. The overriding aim of this ontology is the facilitation of consistent annotation of genomes through computational methods, rather than human annotators. To this end, the ontology includes definitions that support computer analysis and automated transfer of annotations between genomes, rather than relying upon human mediation. PMID:18629137

  16. Enabling functional genomics with genome engineering.

    PubMed

    Hilton, Isaac B; Gersbach, Charles A

    2015-10-01

    Advances in genome engineering technologies have made the precise control over genome sequence and regulation possible across a variety of disciplines. These tools can expand our understanding of fundamental biological processes and create new opportunities for therapeutic designs. The rapid evolution of these methods has also catalyzed a new era of genomics that includes multiple approaches to functionally characterize and manipulate the regulation of genomic information. Here, we review the recent advances of the most widely adopted genome engineering platforms and their application to functional genomics. This includes engineered zinc finger proteins, TALEs/TALENs, and the CRISPR/Cas9 system as nucleases for genome editing, transcription factors for epigenome editing, and other emerging applications. We also present current and potential future applications of these tools, as well as their current limitations and areas for future advances.

  17. Enabling functional genomics with genome engineering

    PubMed Central

    Hilton, Isaac B.; Gersbach, Charles A.

    2015-01-01

    Advances in genome engineering technologies have made the precise control over genome sequence and regulation possible across a variety of disciplines. These tools can expand our understanding of fundamental biological processes and create new opportunities for therapeutic designs. The rapid evolution of these methods has also catalyzed a new era of genomics that includes multiple approaches to functionally characterize and manipulate the regulation of genomic information. Here, we review the recent advances of the most widely adopted genome engineering platforms and their application to functional genomics. This includes engineered zinc finger proteins, TALEs/TALENs, and the CRISPR/Cas9 system as nucleases for genome editing, transcription factors for epigenome editing, and other emerging applications. We also present current and potential future applications of these tools, as well as their current limitations and areas for future advances. PMID:26430154

  18. Navigating yeast genome maintenance with functional genomics.

    PubMed

    Measday, Vivien; Stirling, Peter C

    2016-03-01

    Maintenance of genome integrity is a fundamental requirement of all organisms. To address this, organisms have evolved extremely faithful modes of replication, DNA repair and chromosome segregation to combat the deleterious effects of an unstable genome. Nonetheless, a small amount of genome instability is the driver of evolutionary change and adaptation, and thus a low level of instability is permitted in populations. While defects in genome maintenance almost invariably reduce fitness in the short term, they can create an environment where beneficial mutations are more likely to occur. The importance of this fact is clearest in the development of human cancer, where genome instability is a well-established enabling characteristic of carcinogenesis. This raises the crucial question: what are the cellular pathways that promote genome maintenance and what are their mechanisms? Work in model organisms, in particular the yeast Saccharomyces cerevisiae, has provided the global foundations of genome maintenance mechanisms in eukaryotes. The development of pioneering genomic tools inS. cerevisiae, such as the systematic creation of mutants in all nonessential and essential genes, has enabled whole-genome approaches to identifying genes with roles in genome maintenance. Here, we review the extensive whole-genome approaches taken in yeast, with an emphasis on functional genomic screens, to understand the genetic basis of genome instability, highlighting a range of genetic and cytological screening modalities. By revealing the biological pathways and processes regulating genome integrity, these analyses contribute to the systems-level map of the yeast cell and inform studies of human disease, especially cancer.

  19. Culex genome is not just another genome for comparative genomics.

    PubMed

    Reddy, B P Niranjan; Labbé, Pierrick; Corbel, Vincent

    2012-03-30

    Formal publication of the Culex genome sequence has closed the human disease vector triangle by meeting the Anopheles gambiae and Aedes aegypti genome sequences. Compared to these other mosquitoes, Culex quinquefasciatus possesses many specific hallmark characteristics, and may thus provide different angles for research which ultimately leads to a practical solution for controlling the ever increasing burden of insect-vector-borne diseases around the globe. We argue the special importance of the cosmopolitan species- Culex genome sequence by invoking many interesting questions and the possible of potential of the Culex genome to answer those.

  20. Exploring Other Genomes: Bacteria.

    ERIC Educational Resources Information Center

    Flannery, Maura C.

    2001-01-01

    Points out the importance of genomes other than the human genome project and provides information on the identified bacterial genomes Pseudomonas aeuroginosa, Leprosy, Cholera, Meningitis, Tuberculosis, Bubonic Plague, and plant pathogens. Considers the computer's use in genome studies. (Contains 14 references.) (YDS)

  1. Diversity in global gene expression and morphology across a watercress (Nasturtium officinale R. Br.) germplasm collection: first steps to breeding.

    PubMed

    Payne, Adrienne C; Clarkson, Graham J J; Rothwell, Steve; Taylor, Gail

    2015-01-01

    Watercress (Nasturtium officinale R. Br.) is a nutrient intense, leafy crop that is consumed raw or in soups across the globe, but for which, currently no genomic resources or breeding programme exists. Promising morphological, biochemical and functional genomic variation was identified for the first time in a newly established watercress germplasm collection, consisting of 48 watercress accessions sourced from contrasting global locations. Stem length, stem diameter and anti-oxidant (AO) potential varied across the accessions. This variation was used to identify three extreme contrasting accessions for further analysis. Variation in global gene expression was investigated using an Affymetrix Arabidopsis ATH1 microarray gene chip, using the commercial control (C), an accession selected for dwarf phenotype with a high AO potential (dwarfAO, called 'Boldrewood') and one with high AO potential alone. A set of transcripts significantly differentially expressed between these three accessions, were identified, including transcripts involved in the regulation of growth and development and those involved in secondary metabolism. In particular, when differential gene expression was compared between C and dwarfAO, the dwarfAO was characterised by increased expression of genes encoding glucosinolates, which are known precursors of phenethyl isothiocyanate, linked to the anti-carcinogenic effects well-documented in watercress. This study provides the first analysis of natural variation across the watercress genome and has identified important underpinning information for future breeding for enhanced anti-carcinogenic properties and morphology traits in this nutrient-intense crop.

  2. Diversity in global gene expression and morphology across a watercress (Nasturtium officinale R. Br.) germplasm collection: first steps to breeding

    PubMed Central

    Payne, Adrienne C.; Clarkson, Graham J.J.; Rothwell, Steve; Taylor, Gail

    2015-01-01

    Watercress (Nasturtium officinale R. Br.) is a nutrient intense, leafy crop that is consumed raw or in soups across the globe, but for which, currently no genomic resources or breeding programme exists. Promising morphological, biochemical and functional genomic variation was identified for the first time in a newly established watercress germplasm collection, consisting of 48 watercress accessions sourced from contrasting global locations. Stem length, stem diameter and anti-oxidant (AO) potential varied across the accessions. This variation was used to identify three extreme contrasting accessions for further analysis. Variation in global gene expression was investigated using an Affymetrix Arabidopsis ATH1 microarray gene chip, using the commercial control (C), an accession selected for dwarf phenotype with a high AO potential (dwarfAO, called ‘Boldrewood’) and one with high AO potential alone. A set of transcripts significantly differentially expressed between these three accessions, were identified, including transcripts involved in the regulation of growth and development and those involved in secondary metabolism. In particular, when differential gene expression was compared between C and dwarfAO, the dwarfAO was characterised by increased expression of genes encoding glucosinolates, which are known precursors of phenethyl isothiocyanate, linked to the anti-carcinogenic effects well-documented in watercress. This study provides the first analysis of natural variation across the watercress genome and has identified important underpinning information for future breeding for enhanced anti-carcinogenic properties and morphology traits in this nutrient-intense crop. PMID:26504575

  3. Exploiting the Genome

    DTIC Science & Technology

    1998-09-11

    complete human genome sequence . 14. SUBJECT TERMS 15. NUMBER OF PAGES 16. PRICE CODE 17. SECURITY CLASSIFICATION OF REPORT Unclassified 18. SECURITY...goal of the project is to ob- tain the complete sequence of the human genome by the year 2005. The genome contains approximately 3.3 Gb (billion base...and second, to consider possible roles for the DOE in the "post- genomic " era, following acquisition of the complete human genome

  4. Genome Maps, a new generation genome browser

    PubMed Central

    Medina, Ignacio; Salavert, Francisco; Sanchez, Rubén; de Maria, Alejandro; Alonso, Roberto; Escobar, Pablo; Bleda, Marta; Dopazo, Joaquín

    2013-01-01

    Genome browsers have gained importance as more genomes and related genomic information become available. However, the increase of information brought about by new generation sequencing technologies is, at the same time, causing a subtle but continuous decrease in the efficiency of conventional genome browsers. Here, we present Genome Maps, a genome browser that implements an innovative model of data transfer and management. The program uses highly efficient technologies from the new HTML5 standard, such as scalable vector graphics, that optimize workloads at both server and client sides and ensure future scalability. Thus, data management and representation are entirely carried out by the browser, without the need of any Java Applet, Flash or other plug-in technology installation. Relevant biological data on genes, transcripts, exons, regulatory features, single-nucleotide polymorphisms, karyotype and so forth, are imported from web services and are available as tracks. In addition, several DAS servers are already included in Genome Maps. As a novelty, this web-based genome browser allows the local upload of huge genomic data files (e.g. VCF or BAM) that can be dynamically visualized in real time at the client side, thus facilitating the management of medical data affected by privacy restrictions. Finally, Genome Maps can easily be integrated in any web application by including only a few lines of code. Genome Maps is an open source collaborative initiative available in the GitHub repository (https://github.com/compbio-bigdata-viz/genome-maps). Genome Maps is available at: http://www.genomemaps.org. PMID:23748955

  5. Genome Maps, a new generation genome browser.

    PubMed

    Medina, Ignacio; Salavert, Francisco; Sanchez, Rubén; de Maria, Alejandro; Alonso, Roberto; Escobar, Pablo; Bleda, Marta; Dopazo, Joaquín

    2013-07-01

    Genome browsers have gained importance as more genomes and related genomic information become available. However, the increase of information brought about by new generation sequencing technologies is, at the same time, causing a subtle but continuous decrease in the efficiency of conventional genome browsers. Here, we present Genome Maps, a genome browser that implements an innovative model of data transfer and management. The program uses highly efficient technologies from the new HTML5 standard, such as scalable vector graphics, that optimize workloads at both server and client sides and ensure future scalability. Thus, data management and representation are entirely carried out by the browser, without the need of any Java Applet, Flash or other plug-in technology installation. Relevant biological data on genes, transcripts, exons, regulatory features, single-nucleotide polymorphisms, karyotype and so forth, are imported from web services and are available as tracks. In addition, several DAS servers are already included in Genome Maps. As a novelty, this web-based genome browser allows the local upload of huge genomic data files (e.g. VCF or BAM) that can be dynamically visualized in real time at the client side, thus facilitating the management of medical data affected by privacy restrictions. Finally, Genome Maps can easily be integrated in any web application by including only a few lines of code. Genome Maps is an open source collaborative initiative available in the GitHub repository (https://github.com/compbio-bigdata-viz/genome-maps). Genome Maps is available at: http://www.genomemaps.org.

  6. Genome-Wide Association Studies for Comb Traits in Chickens

    PubMed Central

    Ma, Meng; Dou, Taocun; Lu, Jian; Guo, Jun; Hu, Yuping; Yi, Guoqiang; Yuan, Jingwei; Sun, Congjiao; Wang, Kehua; Yang, Ning

    2016-01-01

    The comb, as a secondary sexual character, is an important trait in chicken. Indicators of comb length (CL), comb height (CH), and comb weight (CW) are often selected in production. DNA-based marker-assisted selection could help chicken breeders to accelerate genetic improvement for comb or related economic characters by early selection. Although a number of quantitative trait loci (QTL) and candidate genes have been identified with advances in molecular genetics, candidate genes underlying comb traits are limited. The aim of the study was to use genome-wide association (GWA) studies by 600 K Affymetrix chicken SNP arrays to detect genes that are related to comb, using an F2 resource population. For all comb characters, comb exhibited high SNP-based heritability estimates (0.61–0.69). Chromosome 1 explained 20.80% genetic variance, while chromosome 4 explained 6.89%. Independent univariate genome-wide screens for each character identified 127, 197, and 268 novel significant SNPs with CL, CH, and CW, respectively. Three candidate genes, VPS36, AR, and WNT11B, were determined to have a plausible function in all comb characters. These genes are important to the initiation of follicle development, gonadal growth, and dermal development, respectively. The current study provides the first GWA analysis for comb traits. Identification of the genetic basis as well as promising candidate genes will help us understand the underlying genetic architecture of comb development and has practical significance in breeding programs for the selection of comb as an index for sexual maturity or reproduction. PMID:27427764

  7. A Whole Genome Association Study on Meat Palatability in Hanwoo

    PubMed Central

    Hyeong, K.-E.; Lee, Y.-M.; Kim, Y.-S.; Nam, K. C.; Jo, C.; Lee, K.-H.; Lee, J.-E.; Kim, J.-J.

    2014-01-01

    A whole genome association (WGA) study was carried out to find quantitative trait loci (QTL) for sensory evaluation traits in Hanwoo. Carcass samples of 250 Hanwoo steers were collected from National Agricultural Cooperative Livestock Research Institute, Ansung, Gyeonggi province, Korea, between 2011 and 2012 and genotyped with the Affymetrix Bovine Axiom Array 640K single nucleotide polymorphism (SNP) chip. Among the SNPs in the chip, a total of 322,160 SNPs were chosen after quality control tests. After adjusting for the effects of age, slaughter-year-season, and polygenic effects using genome relationship matrix, the corrected phenotypes for the sensory evaluation measurements were regressed on each SNP using a simple linear regression additive based model. A total of 1,631 SNPs were detected for color, aroma, tenderness, juiciness and palatability at 0.1% comparison-wise level. Among the significant SNPs, the best set of 52 SNP markers were chosen using a forward regression procedure at 0.05 level, among which the sets of 8, 14, 11, 10, and 9 SNPs were determined for the respectively sensory evaluation traits. The sets of significant SNPs explained 18% to 31% of phenotypic variance. Three SNPs were pleiotropic, i.e. AX-26703353 and AX-26742891 that were located at 101 and 110 Mb of BTA6, respectively, influencing tenderness, juiciness and palatability, while AX-18624743 at 3 Mb of BTA10 affected tenderness and palatability. Our results suggest that some QTL for sensory measures are segregating in a Hanwoo steer population. Additional WGA studies on fatty acid and nutritional components as well as the sensory panels are in process to characterize genetic architecture of meat quality and palatability in Hanwoo. PMID:25178363

  8. Genomic Analysis of Stress Response against Arsenic in Caenorhabditis elegans

    PubMed Central

    Sahu, Surasri N.; Lewis, Jada; Patel, Isha; Bozdag, Serdar; Lee, Jeong H.; Sprando, Robert; Cinar, Hediye Nese

    2013-01-01

    Arsenic, a known human carcinogen, is widely distributed around the world and found in particularly high concentrations in certain regions including Southwestern US, Eastern Europe, India, China, Taiwan and Mexico. Chronic arsenic poisoning affects millions of people worldwide and is associated with increased risk of many diseases including arthrosclerosis, diabetes and cancer. In this study, we explored genome level global responses to high and low levels of arsenic exposure in Caenorhabditis elegans using Affymetrix expression microarrays. This experimental design allows us to do microarray analysis of dose-response relationships of global gene expression patterns. High dose (0.03%) exposure caused stronger global gene expression changes in comparison with low dose (0.003%) exposure, suggesting a positive dose-response correlation. Biological processes such as oxidative stress, and iron metabolism, which were previously reported to be involved in arsenic toxicity studies using cultured cells, experimental animals, and humans, were found to be affected in C. elegans. We performed genome-wide gene expression comparisons between our microarray data and publicly available C. elegans microarray datasets of cadmium, and sediment exposure samples of German rivers Rhine and Elbe. Bioinformatics analysis of arsenic-responsive regulatory networks were done using FastMEDUSA program. FastMEDUSA analysis identified cancer-related genes, particularly genes associated with leukemia, such as dnj-11, which encodes a protein orthologous to the mammalian ZRF1/MIDA1/MPP11/DNAJC2 family of ribosome-associated molecular chaperones. We analyzed the protective functions of several of the identified genes using RNAi. Our study indicates that C. elegans could be a substitute model to study the mechanism of metal toxicity using high-throughput expression data and bioinformatics tools such as FastMEDUSA. PMID:23894281

  9. "Replicated" genome wide association for dependence on illegal substances: genomic regions identified by overlapping clusters of nominally positive SNPs.

    PubMed

    Drgon, Tomas; Johnson, Catherine A; Nino, Michelle; Drgonova, Jana; Walther, Donna M; Uhl, George R

    2011-03-01

    Declaring "replication" from results of genome wide association (GWA) studies is straightforward when major gene effects provide genome-wide significance for association of the same allele of the same SNP in each of multiple independent samples. However, such unambiguous replication may be unlikely when phenotypes display polygenic genetic architecture, allelic heterogeneity, locus heterogeneity, and when different samples display linkage disequilibria with different fine structures. We seek chromosomal regions that are tagged by clustered SNPs that display nominally significant association in each of several independent samples. This approach provides one "nontemplate" approach to identifying overall replication of groups of GWA results in the face of difficult genetic architectures. We apply this strategy to 1 million (1M) SNP Affymetrix and Illumina GWA results for dependence on illegal substances. This approach provides high confidence in rejecting the null hypothesis that chance alone accounts for the extent to which clustered, nominally significant SNPs from samples of the same racial/ethnic background identify the same chromosomal regions. There is more modest confidence in: (a) identification of individual chromosomal regions and genes and (b) overlap between results from samples of different racial/ethnic backgrounds. The strong overlap identified among the samples with similar racial/ethnic backgrounds, together with prior work that identified overlapping results in samples of different racial/ethnic backgrounds, support contributions to individual differences in vulnerability to addictions that come from both relatively older allelic variants that are common in many current human populations and newer allelic variants that are common in fewer current human populations.

  10. Integrated analysis of copy number variation and genome-wide expression profiling in colorectal cancer tissues.

    PubMed

    Ali Hassan, Nur Zarina; Mokhtar, Norfilza Mohd; Kok Sin, Teow; Mohamed Rose, Isa; Sagap, Ismail; Harun, Roslan; Jamal, Rahman

    2014-01-01

    Integrative analyses of multiple genomic datasets for selected samples can provide better insight into the overall data and can enhance our knowledge of cancer. The objective of this study was to elucidate the association between copy number variation (CNV) and gene expression in colorectal cancer (CRC) samples and their corresponding non-cancerous tissues. Sixty-four paired CRC samples from the same patients were subjected to CNV profiling using the Illumina HumanOmni1-Quad assay, and validation was performed using multiplex ligation probe amplification method. Genome-wide expression profiling was performed on 15 paired samples from the same group of patients using the Affymetrix Human Gene 1.0 ST array. Significant genes obtained from both array results were then overlapped. To identify molecular pathways, the data were mapped to the KEGG database. Whole genome CNV analysis that compared primary tumor and non-cancerous epithelium revealed gains in 1638 genes and losses in 36 genes. Significant gains were mostly found in chromosome 20 at position 20q12 with a frequency of 45.31% in tumor samples. Examples of genes that were associated at this cytoband were PTPRT, EMILIN3 and CHD6. The highest number of losses was detected at chromosome 8, position 8p23.2 with 17.19% occurrence in all tumor samples. Among the genes found at this cytoband were CSMD1 and DLC1. Genome-wide expression profiling showed 709 genes to be up-regulated and 699 genes to be down-regulated in CRC compared to non-cancerous samples. Integration of these two datasets identified 56 overlapping genes, which were located in chromosomes 8, 20 and 22. MLPA confirmed that the CRC samples had the highest gains in chromosome 20 compared to the reference samples. Interpretation of the CNV data in the context of the transcriptome via integrative analyses may provide more in-depth knowledge of the genomic landscape of CRC.

  11. The First Pilot Genome-Wide Gene-Environment Study of Depression in the Japanese Population

    PubMed Central

    Otowa, Takeshi; Kawamura, Yoshiya; Tsutsumi, Akizumi; Kawakami, Norito; Kan, Chiemi; Shimada, Takafumi; Umekage, Tadashi; Kasai, Kiyoto; Tokunaga, Katsushi; Sasaki, Tsukasa

    2016-01-01

    Stressful events have been identified as a risk factor for depression. Although gene–environment (G × E) interaction in a limited number of candidate genes has been explored, no genome-wide search has been reported. The aim of the present study is to identify genes that influence the association of stressful events with depression. Therefore, we performed a genome-wide G × E interaction analysis in the Japanese population. A genome-wide screen with 320 subjects was performed using the Affymetrix Genome-Wide Human Array 6.0. Stressful life events were assessed using the Social Readjustment Rating Scale (SRRS) and depression symptoms were assessed with self-rating questionnaires using the Center for Epidemiologic Studies Depression (CES-D) scale. The p values for interactions between single nucleotide polymorphisms (SNPs) and stressful events were calculated using the linear regression model adjusted for sex and age. After quality control of genotype data, a total of 534,848 SNPs on autosomal chromosomes were further analyzed. Although none surpassed the level of the genome-wide significance, a marginal significant association of interaction between SRRS and rs10510057 with depression were found (p = 4.5 × 10−8). The SNP is located on 10q26 near Regulators of G-protein signaling 10 (RGS10), which encodes a regulatory molecule involved in stress response. When we investigated a similar G × E interaction between depression (K6 scale) and work-related stress in an independent sample (n = 439), a significant G × E effect on depression was observed (p = 0.015). Our findings suggest that rs10510057, interacting with stressors, may be involved in depression risk. Incorporating G × E interaction into GWAS can contribute to find susceptibility locus that are potentially missed by conventional GWAS. PMID:27529621

  12. JGI Fungal Genomics Program

    SciTech Connect

    Grigoriev, Igor V.

    2011-03-14

    Genomes of energy and environment fungi are in focus of the Fungal Genomic Program at the US Department of Energy Joint Genome Institute (JGI). Its key project, the Genomics Encyclopedia of Fungi, targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts), and explores fungal diversity by means of genome sequencing and analysis. Over 50 fungal genomes have been sequenced by JGI to date and released through MycoCosm (www.jgi.doe.gov/fungi), a fungal web-portal, which integrates sequence and functional data with genome analysis tools for user community. Sequence analysis supported by functional genomics leads to developing parts list for complex systems ranging from ecosystems of biofuel crops to biorefineries. Recent examples of such 'parts' suggested by comparative genomics and functional analysis in these areas are presented here

  13. Genomic Encyclopedia of Fungi

    SciTech Connect

    Grigoriev, Igor

    2012-08-10

    Genomes of fungi relevant to energy and environment are in focus of the Fungal Genomic Program at the US Department of Energy Joint Genome Institute (JGI). Its key project, the Genomics Encyclopedia of Fungi, targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts), and explores fungal diversity by means of genome sequencing and analysis. Over 150 fungal genomes have been sequenced by JGI to date and released through MycoCosm (www.jgi.doe.gov/fungi), a fungal web-portal, which integrates sequence and functional data with genome analysis tools for user community. Sequence analysis supported by functional genomics leads to developing parts list for complex systems ranging from ecosystems of biofuel crops to biorefineries. Recent examples of such parts suggested by comparative genomics and functional analysis in these areas are presented here.

  14. Plant genomics: an overview.

    PubMed

    Campos-de Quiroz, Hugo

    2002-01-01

    Recent technological advancements have substantially expanded our ability to analyze and understand plant genomes and to reduce the gap existing between genotype and phenotype. The fast evolving field of genomics allows scientists to analyze thousand of genes in parallel, to understand the genetic architecture of plant genomes and also to isolate the genes responsible for mutations. Furthermore, whole genomes can now be sequenced. This review addresses these issues and also discusses ways to extract biological meaning from DNA data. Although genomic issuesare addressed from a plant perspective, this review provides insights into the genomic analyses of other organisms.

  15. Integrating sequence, evolution and functional genomics in regulatory genomics

    PubMed Central

    Vingron, Martin; Brazma, Alvis; Coulson, Richard; van Helden, Jacques; Manke, Thomas; Palin, Kimmo; Sand, Olivier; Ukkonen, Esko

    2009-01-01

    With genome analysis expanding from the study of genes to the study of gene regulation, 'regulatory genomics' utilizes sequence information, evolution and functional genomics measurements to unravel how regulatory information is encoded in the genome. PMID:19226437

  16. Genomic Data Commons | Office of Cancer Genomics

    Cancer.gov

    The NCI’s Center for Cancer Genomics launches the Genomic Data Commons (GDC), a unified data sharing platform for the cancer research community. The mission of the GDC is to enable data sharing across the entire cancer research community, to ultimately support precision medicine in oncology.

  17. Directed genome engineering for genome optimization.

    PubMed

    D'Halluin, Kathleen; Ruiter, Rene

    2013-01-01

    The ability to develop nucleases with tailor-made activities for targeted DNA double-strand break induction at will at any desired position in the genome has been a major breakthrough to make targeted genome optimization feasible in plants. The development of site specific nucleases for precise genome modification has expanded the repertoire of tools for the development and optimization of traits, already including mutation breeding, molecular breeding and transgenesis.Through directed genome engineering technology, the huge amount of information provided by genomics and systems biology can now more effectively be used for the creation of plants with improved or new traits, and for the dissection of gene functions. Although still in an early phase of deployment, its utility has been demonstrated for engineering disease resistance, herbicide tolerance, altered metabolite profiles, and for molecular trait stacking to allow linked transmission of transgenes. In this article, we will briefly review the different approaches for directed genome engineering with the emphasis on double strand break (DSB)-mediated engineering to-wards genome optimization for crop improvement and towards the acceleration of functional genomics.

  18. GENOMICS AND ENVIRONMENTAL RESEARCH

    EPA Science Inventory

    The impact of recently developed and emerging genomics technologies on environmental sciences has significant implications for human and ecological risk assessment issues. The linkage of data generated from genomics, transcriptomics, proteomics, metabalomics, and ecology can be ...

  19. Genomic Data Commons launches

    Cancer.gov

    The Genomic Data Commons (GDC), a unified data system that promotes sharing of genomic and clinical data between researchers, launched today with a visit from Vice President Joe Biden to the operations center at the University of Chicago.

  20. Genome-wide association studies for multiple diseases of the German Shepherd Dog

    PubMed Central

    Tsai, Kate L.; Noorai, Rooksana E.; Starr-Moss, Alison N.; Quignon, Pascale; Rinz, Caitlin J.; Ostrander, Elaine A.; Steiner, Jörg M.; Murphy, Keith E.

    2012-01-01

    The German Shepherd Dog (GSD) is a popular working and companion breed for which over 50 hereditary diseases have been documented. Herein, SNP profiles for 197 GSDs were generated using the Affymetrix v2 canine SNP array for a genome-wide association study to identify loci associated with four diseases: pituitary dwarfism, degenerative myelopathy (DM), congenital megaesophagus (ME), and pancreatic acinar atrophy (PAA). A locus on Chr 9 is strongly associated with pituitary dwarfism and is proximal to a plausible candidate gene, LHX3. Results for DM confirm a major locus encompassing SOD1, in which an associated point mutation was previously identified, but do not suggest modifier loci. Several SNPs on Chr 12 are associated with ME and a 4.7 Mb haplotype block is present in affected dogs. Analysis of additional ME cases for a SNP within the haplotype provides further support for this association. Results for PAA indicate more complex genetic underpinnings. Several regions on multiple chromosomes reach genome-wide significance. However, no major locus is apparent and only two associated haplotype blocks, on Chrs 7 and 12 are observed. These data suggest that PAA may be governed by multiple loci with small effects, or it may be a heterogeneous disorder. PMID:22105877

  1. Genome-wide and fine-resolution association analysis of malaria in West Africa.

    PubMed

    Jallow, Muminatou; Teo, Yik Ying; Small, Kerrin S; Rockett, Kirk A; Deloukas, Panos; Clark, Taane G; Kivinen, Katja; Bojang, Kalifa A; Conway, David J; Pinder, Margaret; Sirugo, Giorgio; Sisay-Joof, Fatou; Usen, Stanley; Auburn, Sarah; Bumpstead, Suzannah J; Campino, Susana; Coffey, Alison; Dunham, Andrew; Fry, Andrew E; Green, Angela; Gwilliam, Rhian; Hunt, Sarah E; Inouye, Michael; Jeffreys, Anna E; Mendy, Alieu; Palotie, Aarno; Potter, Simon; Ragoussis, Jiannis; Rogers, Jane; Rowlands, Kate; Somaskantharajah, Elilan; Whittaker, Pamela; Widden, Claire; Donnelly, Peter; Howie, Bryan; Marchini, Jonathan; Morris, Andrew; SanJoaquin, Miguel; Achidi, Eric Akum; Agbenyega, Tsiri; Allen, Angela; Amodu, Olukemi; Corran, Patrick; Djimde, Abdoulaye; Dolo, Amagana; Doumbo, Ogobara K; Drakeley, Chris; Dunstan, Sarah; Evans, Jennifer; Farrar, Jeremy; Fernando, Deepika; Hien, Tran Tinh; Horstmann, Rolf D; Ibrahim, Muntaser; Karunaweera, Nadira; Kokwaro, Gilbert; Koram, Kwadwo A; Lemnge, Martha; Makani, Julie; Marsh, Kevin; Michon, Pascal; Modiano, David; Molyneux, Malcolm E; Mueller, Ivo; Parker, Michael; Peshu, Norbert; Plowe, Christopher V; Puijalon, Odile; Reeder, John; Reyburn, Hugh; Riley, Eleanor M; Sakuntabhai, Anavaj; Singhasivanon, Pratap; Sirima, Sodiomon; Tall, Adama; Taylor, Terrie E; Thera, Mahamadou; Troye-Blomberg, Marita; Williams, Thomas N; Wilson, Michael; Kwiatkowski, Dominic P

    2009-06-01

    We report a genome-wide association (GWA) study of severe malaria in The Gambia. The initial GWA scan included 2,500 children genotyped on the Affymetrix 500K GeneChip, and a replication study included 3,400 children. We used this to examine the performance of GWA methods in Africa. We found considerable population stratification, and also that signals of association at known malaria resistance loci were greatly attenuated owing to weak linkage disequilibrium (LD). To investigate possible solutions to the problem of low LD, we focused on the HbS locus, sequencing this region of the genome in 62 Gambian individuals and then using these data to conduct multipoint imputation in the GWA samples. This increased the signal of association, from P = 4 × 10(-7) to P = 4 × 10(-14), with the peak of the signal located precisely at the HbS causal variant. Our findings provide proof of principle that fine-resolution multipoint imputation, based on population-specific sequencing data, can substantially boost authentic GWA signals and enable fine mapping of causal variants in African populations.

  2. Whole-genome transcriptional analysis of heavy metal stresses inCaulobacter crescentus

    SciTech Connect

    Hu, Ping; Brodie, Eoin L.; Suzuki, Yohey; McAdams, Harley H.; Andersen, Gary L.

    2005-09-21

    The bacterium Caulobacter crescentus and related stalkbacterial species are known for their distinctive ability to live in lownutrient environments, a characteristic of most heavy metal contaminatedsites. Caulobacter crescentus is a model organism for studying cell cycleregulation with well developed genetics. We have identified the pathwaysresponding to heavy metal toxicity in C. crescentus to provide insightsfor possible application of Caulobacter to environmental restoration. Weexposed C. crescentus cells to four heavy metals (chromium, cadmium,selenium and uranium) and analyzed genome wide transcriptional activitiespost exposure using a Affymetrix GeneChip microarray. C. crescentusshowed surprisingly high tolerance to uranium, a possible mechanism forwhich may be formation of extracellular calcium-uranium-phosphateprecipitates. The principal response to these metals was protectionagainst oxidative stress (up-regulation of manganese-dependent superoxidedismutase, sodA). Glutathione S-transferase, thioredoxin, glutaredoxinsand DNA repair enzymes responded most strongly to cadmium and chromate.The cadmium and chromium stress response also focused on reducing theintracellular metal concentration, with multiple efflux pumps employed toremove cadmium while a sulfate transporter was down-regulated to reducenon-specific uptake of chromium. Membrane proteins were also up-regulatedin response to most of the metals tested. A two-component signaltransduction system involved in the uranium response was identified.Several differentially regulated transcripts from regions previously notknown to encode proteins were identified, demonstrating the advantage ofevaluating the transcriptome using whole genome microarrays.

  3. Genome-Wide Association Study of a Varroa-Specific Defense Behavior in Honeybees (Apis mellifera).

    PubMed

    Spötter, Andreas; Gupta, Pooja; Mayer, Manfred; Reinsch, Norbert; Bienefeld, Kaspar

    2016-05-01

    Honey bees are exposed to many damaging pathogens and parasites. The most devastating is Varroa destructor, which mainly affects the brood. A promising approach for preventing its spread is to breed Varroa-resistant honey bees. One trait that has been shown to provide significant resistance against the Varroa mite is hygienic behavior, which is a behavioral response of honeybee workers to brood diseases in general. Here, we report the use of an Affymetrix 44K SNP array to analyze SNPs associated with detection and uncapping of Varroa-parasitized brood by individual worker bees (Apis mellifera). For this study, 22 000 individually labeled bees were video-monitored and a sample of 122 cases and 122 controls was collected and analyzed to determine the dependence/independence of SNP genotypes from hygienic and nonhygienic behavior on a genome-wide scale. After false-discovery rate correction of the P values, 6 SNP markers had highly significant associations with the trait investigated (α < 0.01). Inspection of the genomic regions around these SNPs led to the discovery of putative candidate genes.

  4. Risk Prediction Using Genome-Wide Association Studies on Type 2 Diabetes

    PubMed Central

    Choi, Sungkyoung; Bae, Sunghwan

    2016-01-01

    The success of genome-wide association studies (GWASs) has enabled us to improve risk assessment and provide novel genetic variants for diagnosis, prevention, and treatment. However, most variants discovered by GWASs have been reported to have very small effect sizes on complex human diseases, which has been a big hurdle in building risk prediction models. Recently, many statistical approaches based on penalized regression have been developed to solve the “large p and small n” problem. In this report, we evaluated the performance of several statistical methods for predicting a binary trait: stepwise logistic regression (SLR), least absolute shrinkage and selection operator (LASSO), and Elastic-Net (EN). We first built a prediction model by combining variable selection and prediction methods for type 2 diabetes using Affymetrix Genome-Wide Human SNP Array 5.0 from the Korean Association Resource project. We assessed the risk prediction performance using area under the receiver operating characteristic curve (AUC) for the internal and external validation datasets. In the internal validation, SLR-LASSO and SLR-EN tended to yield more accurate predictions than other combinations. During the external validation, the SLR-SLR and SLR-EN combinations achieved the highest AUC of 0.726. We propose these combinations as a potentially powerful risk prediction model for type 2 diabetes. PMID:28154504

  5. Transient Genome-Wide Transcriptional Response to Low-Dose Ionizing Radiation In Vivo in Humans

    SciTech Connect

    Berglund, Susanne R.; Rocke, David M.; Dai Jian; Schwietert, Chad W.; Santana, Alison; Stern, Robin L.; Lehmann, Joerg; Hartmann Siantar, Christine L.; Goldberg, Zelanna

    2008-01-01

    Purpose: The in vivo effects of low-dose low linear energy transfer ionizing radiation on healthy human skin are largely unknown. Using a patient-based tissue acquisition protocol, we have performed a series of genomic analyses on the temporal dynamics over a 24-hour period to determine the radiation response after a single exposure of 10 cGy. Methods and Materials: RNA from each patient tissue sample was hybridized to an Affymetrix Human Genome U133 Plus 2.0 array. Data analysis was performed on selected gene groups and pathways. Results: Nineteen gene groups and seven gene pathways that had been shown to be radiation responsive were analyzed. Of these, nine gene groups showed significant transient transcriptional changes in the human tissue samples, which returned to baseline by 24 hours postexposure. Conclusions: Low doses of ionizing radiation on full-thickness human skin produce a definable temporal response out to 24 hours postexposure. Genes involved in DNA and tissue remodeling, cell cycle transition, and inflammation show statistically significant changes in expression, despite variability between patients. These data serve as a reference for the temporal dynamics of ionizing radiation response following low-dose exposure in healthy full-thickness human skin.

  6. Whole-genome patenting.

    PubMed

    O'Malley, Maureen A; Bostanci, Adam; Calvert, Jane

    2005-06-01

    Gene patenting is now a familiar commercial practice, but there is little awareness that several patents claim ownership of the complete genome sequence of a prokaryote or virus. When these patents are analysed and compared to those for other biological entities, it becomes clear that genome patents seek to exploit the genome as an information base and are part of a broader shift towards intangible intellectual property in genomics.

  7. Exploiting the genome

    SciTech Connect

    Block, S.; Cornwall, J.; Dyson, F.; Koonin, S.; Lewis, N.; Schwitters, R.

    1998-09-11

    In 1997, JASON conducted a DOE-sponsored study of the human genome project with special emphasis on the areas of technology, quality assurance and quality control, and informatics. The present study has two aims: first, to update the 1997 Report in light of recent developments in genome sequencing technology, and second, to consider possible roles for the DOE in the ''post-genomic" era, following acquisition of the complete human genome sequence.

  8. Office of Cancer Genomics |

    Cancer.gov

    The mission of the NCI’s Office of Cancer Genomics (OCG) is to enhance the understanding of the molecular mechanisms of cancer, advance and accelerate genomics science and technology development, and efficiently translate the genomics data to improve cancer research, prevention, early detection, diagnosis and treatment.

  9. Genome-wide association for smoking cessation success: participants in the Patch in Practice trial of nicotine replacement

    PubMed Central

    Uhl, George R; Drgon, Tomas; Johnson, Catherine; Walther, Donna; Aveyard, Paul; Murphy, Michael; Johnstone, Elaine C; Munafò, Marcus R

    2011-01-01

    Aims To confirm and extend to primary care settings prior genome-wide association results that distinguish smokers who successfully quit from individuals who were not able to quit smoking in clinical trials. Materials & methods Affymetrix® 6.0 Arrays were used to study DNA from successful quitters and matched individuals who did not quit from the Patch in Practice study of 925 smokers in 26 UK general practices who were provided with 15 mg/16 h nicotine-replacement therapy and varying degrees of behavioral support. Results Only a few SNPs provided results near ‘genome-wide’ levels of significance. Nominally significant (p < 0.01) SNP results identify the same chromosomal regions identified by prior genome-wide association studies to a much greater extent than expected by chance. Conclusion Ability to change smoking behavior in a general practice setting appears to share substantial underlying genetics with the ability to change this behavior in clinical trials, though the modest sample sizes available for these studies provides some caution to these conclusions. PMID:20235792

  10. Family based genome-wide copy number scan identifies complex rearrangements at 17q21.31 in dyslexics.

    PubMed

    Veerappa, Avinash M; Saldanha, Marita; Padakannaya, Prakash; Ramachandra, Nallur B

    2014-10-01

    Developmental dyslexia (DD) is a complex heritable disorder with unexpected difficulty in learning to read and spell despite adequate intelligence, education, environment, and normal senses. We performed genome-wide screening for copy number variations (CNVs) in 10 large Indian dyslexic families using Affymetrix Genome-Wide Human SNP Array 6.0. Results revealed the complex genomic rearrangements due to one non-contiguous deletion and five contiguous micro duplications and micro deletions at 17q21.31 region in three dyslexic families. CNVs in this region harbor the genes KIAA1267, LRRC37A, ARL17A/B, NSFP1, and NSF. The CNVs in case 1 and case 2 at this locus were found to be in homozygous state and case 3 was a de novo CNV. These CNVs were found with at least one CNV having a common break and end points in the parents. This cluster of genes containing NSF is implicated in learning, cognition, and memory, though not formally associated with dyslexia. Molecular network analysis of these and other dyslexia related module genes suggests NSF and other genes to be associated with cellular/vesicular membrane fusion and synaptic transmission. Thus, we suggest that NSF in this cluster would be the nearest gene responsible for the learning disability phenotype.

  11. Genome-Wide Screening of Alpha-Tocopherol Sensitive Genes in Heart Tissue from Alpha-Tocopherol Transfer Protein Null Mice (ATTP−/−)

    PubMed Central

    Vasu, Vihas T.; Hobson, Brad; Gohil, Kishorchandra; Cross, Carroll E.

    2009-01-01

    Alpha tocopherol transfer protein (ATTP) null mice (ATTP−/−) have a systemic deficiency of alpha-tocopherol (AT). The heart AT levels of ATTP−/− are <10% of those in ATTP+/+ mice. The genomic responses of heart to AT deficiency were determined in 3 months old male ATTP−/− mice and compared with their ATTP+/+ littermate controls using Affymetrix 430A 2.0 high density oligonucleotide arrays. Differential analysis of ~13,000 genes identified repression of genes related to immune system and activation of genes related to lipid metabolism and inflammation with no significant change in the expression of classical antioxidant genes (catalase, superoxide dismutase, glutathione peroxidase) in ATTP−/− as compared to ATTP+/+ mice. The present data identifies novel classes of AT sensitive genes in heart tissue. PMID:17382327

  12. Genome Wide Association for Addiction: Replicated Results and Comparisons of Two Analytic Approaches

    PubMed Central

    Drgon, Tomas; Zhang, Ping-Wu; Johnson, Catherine; Walther, Donna; Hess, Judith; Nino, Michelle; Uhl, George R.

    2010-01-01

    Background Vulnerabilities to dependence on addictive substances are substantially heritable complex disorders whose underlying genetic architecture is likely to be polygenic, with modest contributions from variants in many individual genes. “Nontemplate” genome wide association (GWA) approaches can identity groups of chromosomal regions and genes that, taken together, are much more likely to contain allelic variants that alter vulnerability to substance dependence than expected by chance. Methodology/Principal Findings We report pooled “nontemplate” genome-wide association studies of two independent samples of substance dependent vs control research volunteers (n = 1620), one European-American and the other African-American using 1 million SNP (single nucleotide polymorphism) Affymetrix genotyping arrays. We assess convergence between results from these two samples using two related methods that seek clustering of nominally-positive results and assess significance levels with Monte Carlo and permutation approaches. Both “converge then cluster” and “cluster then converge” analyses document convergence between the results obtained from these two independent datasets in ways that are virtually never found by chance. The genes identified in this fashion are also identified by individually-genotyped dbGAP data that compare allele frequencies in cocaine dependent vs control individuals. Conclusions/Significance These overlapping results identify small chromosomal regions that are also identified by genome wide data from studies of other relevant samples to extents much greater than chance. These chromosomal regions contain more genes related to “cell adhesion” processes than expected by chance. They also contain a number of genes that encode potential targets for anti-addiction pharmacotherapeutics. “Nontemplate” GWA approaches that seek chromosomal regions in which nominally-positive associations are found in multiple independent samples are

  13. Genome-Wide Association for Smoking Cessation Success in a Trial of Precessation Nicotine Replacement

    PubMed Central

    Uhl, George R; Drgon, Tomas; Johnson, Catherine; Ramoni, Marco F; Behm, Frederique M; Rose, Jed E

    2010-01-01

    Abilities to successfully quit smoking display substantial evidence for heritability in classic and molecular genetic studies. Genome-wide association (GWA) studies have demonstrated single-nucleotide polymorphisms (SNPs) and haplotypes that distinguish successful quitters from individuals who were unable to quit smoking in clinical trial participants and in community samples. Many of the subjects in these clinical trial samples were aided by nicotine replacement therapy (NRT). We now report novel GWA results from participants in a clinical trial that sought dose/response relationships for “precessation” NRT. In this trial, 369 European-American smokers were randomized to 21 or 42 mg NRT, initiated 2 wks before target quit dates. Ten-week continuous smoking abstinence was assessed on the basis of self-reports and carbon monoxide levels. SNP genotyping used Affymetrix 6.0 arrays. GWA results for smoking cessation success provided no P value that reached “genome-wide” significance. Compared with chance, these results do identify (a) more clustering of nominally positive results within small genomic regions, (b) more overlap between these genomic regions and those identified in six prior successful smoking cessation GWA studies and (c) sets of genes that fall into gene ontology categories that appear to be biologically relevant. The 1,000 SNPs with the strongest associations form a plausible Bayesian network; no such network is formed by randomly selected sets of SNPs. The data provide independent support, based on individual genotyping, for many loci previously nominated on the basis of data from genotyping in pooled DNA samples. These results provide further support for the idea that aid for smoking cessation may be personalized on the basis of genetic predictors of outcome. PMID:20811658

  14. Genome-wide transcriptional profiling reveals molecular signatures of secondary xylem differentiation in Populus tomentosa.

    PubMed

    Yang, X H; Li, X G; Li, B L; Zhang, D Q

    2014-11-11

    Wood formation occurs via cell division, primary cell wall and secondary wall formation, and programmed cell death in the vascular cambium. Transcriptional profiling of secondary xylem differentiation is essential for understanding the molecular mechanisms underlying wood formation. Differential gene expression in secondary xylem differentiation of Populus has been previously investigated using cDNA microarray analysis. However, little is known about the molecular mechanisms from a genome-wide perspective. In this study, the Affymetrix poplar genome chips containing 61,413 probes were used to investigate the changes in the transcriptome during secondary xylem differentiation in Chinese white poplar (Populus tomentosa). Two xylem tissues (newly formed and lignified) were sampled for genome-wide transcriptional profiling. In total, 6843 genes (~11%) were identified with differential expression in the two xylem tissues. Many genes involved in cell division, primary wall modification, and cellulose synthesis were preferentially expressed in the newly formed xylem. In contrast, many genes, including 4-coumarate:cinnamate-4-hydroxylase (C4H), 4-coumarate:CoA ligase (4CL), cinnamyl alcohol dehydrogenase (CAD), and caffeoyl CoA 3-O-methyltransferase (CCoAOMT), associated with lignin biosynthesis were more transcribed in the lignified xylem. The two xylem tissues also showed differential expression of genes related to various hormones; thus, the secondary xylem differentiation could be regulated by hormone signaling. Furthermore, many transcription factor genes were preferentially expressed in the lignified xylem, suggesting that wood lignification involves extensive transcription regulation. The genome-wide transcriptional profiling of secondary xylem differentiation could provide additional insights into the molecular basis of wood formation in poplar species.

  15. The Bluejay genome browser.

    PubMed

    Soh, Jung; Gordon, Paul M K; Sensen, Christoph W

    2012-03-01

    The Bluejay genome browser is a stand-alone visualization tool for the multi-scale viewing of annotated genomes and other genomic elements. Bluejay allows users to customize display features to suit their needs, and produces publication-quality graphics. Bluejay provides a multitude of ways to interrelate biological data at the genome scale. Users can load gene expression data into a genome display for expression visualization in context. Multiple genomes can be compared concurrently, including time series expression data, based on Gene Ontology labels. External, context-sensitive biological Web Services are linked to the displayed genomic elements ad hoc for in-depth genomic data analysis and interpretation. Users can mark multiple points of interest in a genome by creating waypoints, and exploit them for easy navigation of single or multiple genomes. Using this comprehensive visual environment, users can study a gene not just in relation to its genome, but also its transcriptome and evolutionary origins. Written in Java, Bluejay is platform-independent and is freely available from http://bluejay.ucalgary.ca.

  16. Bacterial Genome Instability

    PubMed Central

    Darmon, Elise

    2014-01-01

    SUMMARY Bacterial genomes are remarkably stable from one generation to the next but are plastic on an evolutionary time scale, substantially shaped by horizontal gene transfer, genome rearrangement, and the activities of mobile DNA elements. This implies the existence of a delicate balance between the maintenance of genome stability and the tolerance of genome instability. In this review, we describe the specialized genetic elements and the endogenous processes that contribute to genome instability. We then discuss the consequences of genome instability at the physiological level, where cells have harnessed instability to mediate phase and antigenic variation, and at the evolutionary level, where horizontal gene transfer has played an important role. Indeed, this ability to share DNA sequences has played a major part in the evolution of life on Earth. The evolutionary plasticity of bacterial genomes, coupled with the vast numbers of bacteria on the planet, substantially limits our ability to control disease. PMID:24600039

  17. UCSC genome browser tutorial.

    PubMed

    Zweig, Ann S; Karolchik, Donna; Kuhn, Robert M; Haussler, David; Kent, W James

    2008-08-01

    The University of California Santa Cruz (UCSC) Genome Bioinformatics website consists of a suite of free, open-source, on-line tools that can be used to browse, analyze, and query genomic data. These tools are available to anyone who has an Internet browser and an interest in genomics. The website provides a quick and easy-to-use visual display of genomic data. It places annotation tracks beneath genome coordinate positions, allowing rapid visual correlation of different types of information. Many of the annotation tracks are submitted by scientists worldwide; the others are computed by the UCSC Genome Bioinformatics group from publicly available sequence data. It also allows users to upload and display their own experimental results or annotation sets by creating a custom track. The suite of tools, downloadable data files, and links to documentation and other information can be found at http://genome.ucsc.edu/.

  18. Enabling responsible public genomics.

    PubMed

    Conley, John M; Doerr, Adam K; Vorhaus, Daniel B

    2010-01-01

    As scientific understandings of genetics advance, researchers require increasingly rich datasets that combine genomic data from large numbers of individuals with medical and other personal information. Linking individuals' genetic data and personal information precludes anonymity and produces medically significant information--a result not contemplated by the established legal and ethical conventions governing human genomic research. To pursue the next generation of human genomic research and commerce in a responsible fashion, scientists, lawyers, and regulators must address substantial new issues, including researchers' duties with respect to clinically significant data, the challenges to privacy presented by genomic data, the boundary between genomic research and commerce, and the practice of medicine. This Article presents a new model for understanding and addressing these new challenges--a "public genomics" premised on the idea that ethically, legally, and socially responsible genomics research requires openness, not privacy, as its organizing principle. Responsible public genomics combines the data contributed by informed and fully consenting information altruists and the research potential of rich datasets in a genomic commons that is freely and globally available. This Article examines the risks and benefits of this public genomics model in the context of an ambitious genetic research project currently under way--the Personal Genome Project. This Article also (i) demonstrates that large-scale genomic projects are desirable, (ii) evaluates the risks and challenges presented by public genomics research, and (iii) determines that the current legal and regulatory regimes restrict beneficial and responsible scientific inquiry while failing to adequately protect participants. The Article concludes by proposing a modified normative and legal framework that embraces and enables a future of responsible public genomics.

  19. Whole-exome/genome sequencing and genomics.

    PubMed

    Grody, Wayne W; Thompson, Barry H; Hudgins, Louanne

    2013-12-01

    As medical genetics has progressed from a descriptive entity to one focused on the functional relationship between genes and clinical disorders, emphasis has been placed on genomics. Genomics, a subelement of genetics, is the study of the genome, the sum total of all the genes of an organism. The human genome, which is contained in the 23 pairs of nuclear chromosomes and in the mitochondrial DNA of each cell, comprises >6 billion nucleotides of genetic code. There are some 23,000 protein-coding genes, a surprisingly small fraction of the total genetic material, with the remainder composed of noncoding DNA, regulatory sequences, and introns. The Human Genome Project, launched in 1990, produced a draft of the genome in 2001 and then a finished sequence in 2003, on the 50th anniversary of the initial publication of Watson and Crick's paper on the double-helical structure of DNA. Since then, this mass of genetic information has been translated at an ever-increasing pace into useable knowledge applicable to clinical medicine. The recent advent of massively parallel DNA sequencing (also known as shotgun, high-throughput, and next-generation sequencing) has brought whole-genome analysis into the clinic for the first time, and most of the current applications are directed at children with congenital conditions that are undiagnosable by using standard genetic tests for single-gene disorders. Thus, pediatricians must become familiar with this technology, what it can and cannot offer, and its technical and ethical challenges. Here, we address the concepts of human genomic analysis and its clinical applicability for primary care providers.

  20. Genome-wide association study identifies COL2A1 locus involved in the hand development failure of Kashin-Beck disease

    PubMed Central

    Hao, Jingcan; Wang, Wenyu; Wen, Yan; Xiao, Xiao; He, Awen; Wu, Cuiyan; Wang, Sen; Guo, Xiong; Zhang, Feng

    2017-01-01

    Kashin-Beck disease (KBD) is a chronic osteochondropathy. The pathogenesis of growth and development failure of hand of KBD remains elusive now. In this study, we conducted a two-stage genome-wide association study (GWAS) of palmar length-width ratio (LWR) of KBD, totally including 493 study subjects. Affymetrix Genome Wide Human SNP Array 6.0 was applied for genome-wide SNP genotyping of 90 KBD patients. Association analysis was conducted by PLINK. Imputation analysis was performed by IMPUTE against the reference panel of the 1000 genome project. Two SNPs were selected for replication in an independent validation sample of 403 KBD patients. In the discovery GWAS, significant association was observed between palmar LWR and rs2071358 of COL2A1 gene (P value = 4.68 × 10−8). In addition, GWAS detected suggestive association signal at rs4760608 of COL2A1 gene (P value = 1.76 × 10−4). Imputation analysis of COL2A1 further identified 2 SNPs with association evidence for palmar LWR. Replication study observed significant association signals at both rs2071358 (P value = 0.017) and rs4760608 (P value = 0.002) of COL2A1 gene. Based on previous and our study results, we suggest that COL2A1 was a likely susceptibility gene involved in the hand development failure of KBD. PMID:28059113

  1. Multicentric Genome-Wide Association Study for Primary Spontaneous Pneumothorax

    PubMed Central

    Abrantes, Patrícia; Francisco, Vânia; Teixeira, Gilberto; Monteiro, Marta; Neves, João; Norte, Ana; Robalo Cordeiro, Carlos; Moura e Sá, João; Reis, Ernestina; Santos, Patrícia; Oliveira, Manuela; Sousa, Susana; Fradinho, Marta; Malheiro, Filipa; Negrão, Luís

    2016-01-01

    Despite elevated incidence and recurrence rates for Primary Spontaneous Pneumothorax (PSP), little is known about its etiology, and the genetics of idiopathic PSP remains unexplored. To identify genetic variants contributing to sporadic PSP risk, we conducted the first PSP genome-wide association study. Two replicate pools of 92 Portuguese PSP cases and of 129 age- and sex-matched controls were allelotyped in triplicate on the Affymetrix Human SNP Array 6.0 arrays. Markers passing quality control were ranked by relative allele score difference between cases and controls (|RASdiff|), by a novel cluster method and by a combined Z-test. 101 single nucleotide polymorphisms (SNPs) were selected using these three approaches for technical validation by individual genotyping in the discovery dataset. 87 out of 94 successfully tested SNPs were nominally associated in the discovery dataset. Replication of the 87 technically validated SNPs was then carried out in an independent replication dataset of 100 Portuguese cases and 425 controls. The intergenic rs4733649 SNP in chromosome 8 (between LINC00824 and LINC00977) was associated with PSP in the discovery (P = 4.07E-03, ORC[95% CI] = 1.88[1.22–2.89]), replication (P = 1.50E-02, ORC[95% CI] = 1.50[1.08–2.09]) and combined datasets (P = 8.61E-05, ORC[95% CI] = 1.65[1.29–2.13]). This study identified for the first time one genetic risk factor for sporadic PSP, but future studies are warranted to further confirm this finding in other populations and uncover its functional role in PSP pathogenesis. PMID:27203581

  2. Genome-wide SNP typing reveals signatures of population history.

    PubMed

    Hughes, Austin L; Welch, Robert; Puri, Vinita; Matthews, Casey; Haque, Kashif; Chanock, Stephen J; Yeager, Meredith

    2008-07-01

    Single-nucleotide polymorphism (SNP) arrays have become a popular technology for disease-association studies, but they also have potential for studying the genetic differentiation of human populations. Application of the Affymetrix GeneChip Human Mapping 500K Array Set to a population of 102 individuals representing the major ethnic groups in the United States (African, Asian, European, and Hispanic) revealed patterns of gene diversity and genetic distance that reflected population history. We analyzed allelic frequencies at 388,654 autosomal SNP sites that showed some variation in our study population and 10% or fewer missing values. Despite the small size (23-31 individuals) of each subpopulation, there were no fixed differences at any site between any two subpopulations. As expected from the African origin of modern humans, greater gene diversity was seen in Africans than in either Asians or Europeans, and the genetic distance between the Asian and the European populations was significantly lower than that between either of these two populations and Africans. Principal components analysis applied to a correlation matrix among individuals was able to separate completely the major continental groups of humans (Africans, Asians, and Europeans), while Hispanics overlapped all three of these groups. Genes containing two or more markers with extraordinarily high genetic distance between subpopulations were identified as candidate genes for health differences between subpopulations. The results show that, even with modest sample sizes, genome-wide SNP genotyping technologies have great promise for capturing signatures of gene frequency difference between human subpopulations, with applications in areas as diverse as forensics and the study of ethnic health disparities.

  3. Biology of breast cancer during pregnancy using genomic profiling.

    PubMed

    Azim, Hatem A; Brohée, Sylvain; Peccatori, Fedro A; Desmedt, Christine; Loi, Sherene; Lambrechts, Diether; Dell'Orto, Patrizia; Majjaj, Samira; Jose, Vinu; Rotmensz, Nicole; Ignatiadis, Michail; Pruneri, Giancarlo; Piccart, Martine; Viale, Giuseppe; Sotiriou, Christos

    2014-08-01

    Breast cancer during pregnancy is rare and is associated with relatively poor prognosis. No information is available on its biological features at the genomic level. Using a dataset of 54 pregnant and 113 non-pregnant breast cancer patients, we evaluated the pattern of hot spot somatic mutations and did transcriptomic profiling using Sequenom and Affymetrix respectively. We performed gene set enrichment analysis to evaluate the pathways associated with diagnosis during pregnancy. We also evaluated the expression of selected cancer-related genes in pregnant and non-pregnant patients and correlated the results with changes occurring in the normal breast using a pregnant murine model. We finally investigated aberrations associated with disease-free survival (DFS). No significant differences in mutations were observed. Of the total number of patients, 18.6% of pregnant and 23% of non-pregnant patients had a PIK3CA mutation. Around 30% of tumors were basal, with no differences in the distribution of breast cancer molecular subtypes between pregnant and non-pregnant patients. Two pathways were enriched in tumors diagnosed during pregnancy: the G protein-coupled receptor pathway and the serotonin receptor pathway (FDR <0.0001). Tumors diagnosed during pregnancy had higher expression of PD1 (PDCD1; P=0.015), PDL1 (CD274; P=0.014), and gene sets related to SRC (P=0.004), IGF1 (P=0.032), and β-catenin (P=0.019). Their expression increased almost linearly throughout gestation when evaluated on the normal breast using a pregnant mouse model underscoring the potential effect of the breast microenvironment on tumor phenotype. No genes were associated with DFS in a multivariate model, which could be due to low statistical power. Diagnosis during pregnancy impacts the breast cancer transcriptome including potential cancer targets.

  4. MicroRNA expression analysis using the Affymetrix Platform.

    PubMed

    Dee, Suzanne; Getts, Robert C

    2012-01-01

    Microarrays have been used extensively for messenger RNA expression monitoring. Recently, microarrays have been designed to interrogate expression levels of noncoding RNAs. Here, we describe methods for RNA labeling and the use of a miRNA array to identify and measure microRNA present in RNA samples.

  5. State of cat genomics.

    PubMed

    O'Brien, Stephen J; Johnson, Warren; Driscoll, Carlos; Pontius, Joan; Pecon-Slattery, Jill; Menotti-Raymond, Marilyn

    2008-06-01

    Our knowledge of cat family biology was recently expanded to include a genomics perspective with the completion of a draft whole genome sequence of an Abyssinian cat. The utility of the new genome information has been demonstrated by applications ranging from disease gene discovery and comparative genomics to species conservation. Patterns of genomic organization among cats and inbred domestic cat breeds have illuminated our view of domestication, revealing linkage disequilibrium tracks consequent of breed formation, defining chromosome exchanges that punctuated major lineages of mammals and suggesting ancestral continental migration events that led to 37 modern species of Felidae. We review these recent advances here. As the genome resources develop, the cat is poised to make a major contribution to many areas in genetics and biology.

  6. Querying genomic databases

    SciTech Connect

    Baehr, A.; Hagstrom, R.; Joerg, D.; Overbeek, R.

    1991-09-01

    A natural-language interface has been developed that retrieves genomic information by using a simple subset of English. The interface spares the biologist from the task of learning database-specific query languages and computer programming. Currently, the interface deals with the E. coli genome. It can, however, be readily extended and shows promise as a means of easy access to other sequenced genomic databases as well.

  7. A genome-wide association study identifies novel single nucleotide polymorphisms associated with dermal shank pigmentation in chickens.

    PubMed

    Li, Guangqi; Li, Dongfeng; Yang, Ning; Qu, Lujiang; Hou, Zhuocheng; Zheng, Jiangxia; Xu, Guiyun; Chen, Sirui

    2014-12-01

    Shank color of domestic chickens varies from black to blue, green, yellow, or white, which is controlled by the combination of melanin and xanthophylls in dermis and epidermis. Dermal shank pigmentation of chickens is determined by sex-linked inhibitor of dermal melanin (Id), which is located on the distal end of the long arm of Z chromosome, through controlling dermal melanin pigmentation. Although previous studies have focused on the identification of Id and the linear relationship with barring and recessive white skin, no causal mutations have yet been identified in relation to the mutant dermal pigment inhibiting allele at the Id locus. In this study, we first used the 600K Affymetrix Axiom HD genotyping array, which includes ~580,961 SNP of which 26,642 SNP were on the Z chromosome to perform a genome-wide association study on pure lines of 19 Tibetan hens with dermal pigmentation shank and 21 Tibetan hens with yellow shank to refine the Id location. Association analysis was conducted by the PLINK software using the standard chi-squared test, and then Bonferroni correction was used to adjust multiple testing. The genome-wide study revealed that 3 SNP located at 78.5 to 79.2 Mb on the Z chromosome in the current assembly of chicken genome (galGal4) were significantly associated with dermal shank pigmentation of chickens, but none of them were located in known genes. The interval we refined was partly converged with previous results, suggesting that the Id gene is in or near our refined genome region. However, the genomic context of this region was complex. There were only 15 SNP markers developed by the genotyping array within the interval region, in which only 1 SNP marker passed quality control. Additionally, there were about 5.8-Mb gaps on both sides of the refined interval. The follow-up replication studies may be needed to further confirm the functional significance for these newly identified SNP.

  8. [Landscape and ecological genomics].

    PubMed

    Tetushkin, E Ia

    2013-10-01

    Landscape genomics is the modern version of landscape genetics, a discipline that arose approximately 10 years ago as a combination of population genetics, landscape ecology, and spatial statistics. It studies the effects of environmental variables on gene flow and other microevolutionary processes that determine genetic connectivity and variations in populations. In contrast to population genetics, it operates at the level of individual specimens rather than at the level of population samples. Another important difference between landscape genetics and genomics and population genetics is that, in the former, the analysis of gene flow and local adaptations takes quantitative account of landforms and features of the matrix, i.e., hostile spaces that separate species habitats. Landscape genomics is a part of population ecogenomics, which, along with community genomics, is a major part of ecological genomics. One of the principal purposes of landscape genomics is the identification and differentiation of various genome-wide and locus-specific effects. The approaches and computation tools developed for combined analysis of genomic and landscape variables make it possible to detect adaptation-related genome fragments, which facilitates the planning of conservation efforts and the prediction of species' fate in response to expected changes in the environment.

  9. Genomics of Clostridium tetani.

    PubMed

    Brüggemann, Holger; Brzuszkiewicz, Elzbieta; Chapeton-Montes, Diana; Plourde, Lucile; Speck, Denis; Popoff, Michel R

    2015-05-01

    Genomic information about Clostridium tetani, the causative agent of the tetanus disease, is scarce. The genome of strain E88, a strain used in vaccine production, was sequenced about 10 years ago. One additional genome (strain 12124569) has recently been released. Here we report three new genomes of C. tetani and describe major differences among all five C. tetani genomes. They all harbor tetanus-toxin-encoding plasmids that contain highly conserved genes for TeNT (tetanus toxin), TetR (transcriptional regulator of TeNT) and ColT (collagenase), but substantially differ in other plasmid regions. The chromosomes share a large core genome that contains about 85% of all genes of a given chromosome. The non-core chromosome comprises mainly prophage-like genomic regions and genes encoding environmental interaction and defense functions (e.g. surface proteins, restriction-modification systems, toxin-antitoxin systems, CRISPR/Cas systems) and other fitness functions (e.g. transport systems, metabolic activities). This new genome information will help to assess the level of genome plasticity of the species C. tetani and provide the basis for detailed comparative studies.

  10. Between Two Fern Genomes

    PubMed Central

    2014-01-01

    Ferns are the only major lineage of vascular plants not represented by a sequenced nuclear genome. This lack of genome sequence information significantly impedes our ability to understand and reconstruct genome evolution not only in ferns, but across all land plants. Azolla and Ceratopteris are ideal and complementary candidates to be the first ferns to have their nuclear genomes sequenced. They differ dramatically in genome size, life history, and habit, and thus represent the immense diversity of extant ferns. Together, this pair of genomes will facilitate myriad large-scale comparative analyses across ferns and all land plants. Here we review the unique biological characteristics of ferns and describe a number of outstanding questions in plant biology that will benefit from the addition of ferns to the set of taxa with sequenced nuclear genomes. We explain why the fern clade is pivotal for understanding genome evolution across land plants, and we provide a rationale for how knowledge of fern genomes will enable progress in research beyond the ferns themselves. PMID:25324969

  11. Between two fern genomes.

    PubMed

    Sessa, Emily B; Banks, Jo Ann; Barker, Michael S; Der, Joshua P; Duffy, Aaron M; Graham, Sean W; Hasebe, Mitsuyasu; Langdale, Jane; Li, Fay-Wei; Marchant, D Blaine; Pryer, Kathleen M; Rothfels, Carl J; Roux, Stanley J; Salmi, Mari L; Sigel, Erin M; Soltis, Douglas E; Soltis, Pamela S; Stevenson, Dennis W; Wolf, Paul G

    2014-01-01

    Ferns are the only major lineage of vascular plants not represented by a sequenced nuclear genome. This lack of genome sequence information significantly impedes our ability to understand and reconstruct genome evolution not only in ferns, but across all land plants. Azolla and Ceratopteris are ideal and complementary candidates to be the first ferns to have their nuclear genomes sequenced. They differ dramatically in genome size, life history, and habit, and thus represent the immense diversity of extant ferns. Together, this pair of genomes will facilitate myriad large-scale comparative analyses across ferns and all land plants. Here we review the unique biological characteristics of ferns and describe a number of outstanding questions in plant biology that will benefit from the addition of ferns to the set of taxa with sequenced nuclear genomes. We explain why the fern clade is pivotal for understanding genome evolution across land plants, and we provide a rationale for how knowledge of fern genomes will enable progress in research beyond the ferns themselves.

  12. Fungal Genomics Program

    SciTech Connect

    Grigoriev, Igor

    2012-03-12

    The JGI Fungal Genomics Program aims to scale up sequencing and analysis of fungal genomes to explore the diversity of fungi important for energy and the environment, and to promote functional studies on a system level. Combining new sequencing technologies and comparative genomics tools, JGI is now leading the world in fungal genome sequencing and analysis. Over 120 sequenced fungal genomes with analytical tools are available via MycoCosm (www.jgi.doe.gov/fungi), a web-portal for fungal biologists. Our model of interacting with user communities, unique among other sequencing centers, helps organize these communities, improves genome annotation and analysis work, and facilitates new larger-scale genomic projects. This resulted in 20 high-profile papers published in 2011 alone and contributing to the Genomics Encyclopedia of Fungi, which targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts). Our next grand challenges include larger scale exploration of fungal diversity (1000 fungal genomes), developing molecular tools for DOE-relevant model organisms, and analysis of complex systems and metagenomes.

  13. MIPS plant genome information resources.

    PubMed

    Spannagl, Manuel; Haberer, Georg; Ernst, Rebecca; Schoof, Heiko; Mayer, Klaus F X

    2007-01-01

    The Munich Institute for Protein Sequences (MIPS) has been involved in maintaining plant genome databases since the Arabidopsis thaliana genome project. Genome databases and analysis resources have focused on individual genomes and aim to provide flexible and maintainable data sets for model plant genomes as a backbone against which experimental data, for example from high-throughput functional genomics, can be organized and evaluated. In addition, model genomes also form a scaffold for comparative genomics, and much can be learned from genome-wide evolutionary studies.

  14. Home - The Cancer Genome Atlas - Cancer Genome - TCGA

    Cancer.gov

    The Cancer Genome Atlas (TCGA) is a comprehensive and coordinated effort to accelerate our understanding of the molecular basis of cancer through the application of genome analysis technologies, including large-scale genome sequencing.

  15. Genomic analysis of circulating cell-free DNA infers breast cancer dormancy

    PubMed Central

    Shaw, Jacqueline A.; Page, Karen; Blighe, Kevin; Hava, Natasha; Guttery, David; Ward, Becky; Brown, James; Ruangpratheep, Chetana; Stebbing, Justin; Payne, Rachel; Palmieri, Carlo; Cleator, Suzy; Walker, Rosemary A.; Coombes, R. Charles

    2012-01-01

    Biomarkers in breast cancer to monitor minimal residual disease have remained elusive. We hypothesized that genomic analysis of circulating free DNA (cfDNA) isolated from plasma may form the basis for a means of detecting and monitoring breast cancer. We profiled 251 genomes using Affymetrix SNP 6.0 arrays to determine copy number variations (CNVs) and loss of heterozygosity (LOH), comparing 138 cfDNA samples with matched primary tumor and normal leukocyte DNA in 65 breast cancer patients and eight healthy female controls. Concordance of SNP genotype calls in paired cfDNA and leukocyte DNA samples distinguished between breast cancer patients and healthy female controls (P < 0.0001) and between preoperative patients and patients on follow-up who had surgery and treatment (P = 0.0016). Principal component analyses of cfDNA SNP/copy number results also separated presurgical breast cancer patients from the healthy controls, suggesting specific CNVs in cfDNA have clinical significance. We identified focal high-level DNA amplification in paired tumor and cfDNA clustered in a number of chromosome arms, some of which harbor genes with oncogenic potential, including USP17L2 (DUB3), BRF1, MTA1, and JAG2. Remarkably, in 50 patients on follow-up, specific CNVs were detected in cfDNA, mirroring the primary tumor, up to 12 yr after diagnosis despite no other evidence of disease. These data demonstrate the potential of SNP/CNV analysis of cfDNA to distinguish between patients with breast cancer and healthy controls during routine follow-up. The genomic profiles of cfDNA infer dormancy/minimal residual disease in the majority of patients on follow-up. PMID:21990379

  16. Genomic analysis of circulating cell-free DNA infers breast cancer dormancy.

    PubMed

    Shaw, Jacqueline A; Page, Karen; Blighe, Kevin; Hava, Natasha; Guttery, David; Ward, Becky; Brown, James; Ruangpratheep, Chetana; Stebbing, Justin; Payne, Rachel; Palmieri, Carlo; Cleator, Suzy; Walker, Rosemary A; Coombes, R Charles

    2012-02-01

    Biomarkers in breast cancer to monitor minimal residual disease have remained elusive. We hypothesized that genomic analysis of circulating free DNA (cfDNA) isolated from plasma may form the basis for a means of detecting and monitoring breast cancer. We profiled 251 genomes using Affymetrix SNP 6.0 arrays to determine copy number variations (CNVs) and loss of heterozygosity (LOH), comparing 138 cfDNA samples with matched primary tumor and normal leukocyte DNA in 65 breast cancer patients and eight healthy female controls. Concordance of SNP genotype calls in paired cfDNA and leukocyte DNA samples distinguished between breast cancer patients and healthy female controls (P < 0.0001) and between preoperative patients and patients on follow-up who had surgery and treatment (P = 0.0016). Principal component analyses of cfDNA SNP/copy number results also separated presurgical breast cancer patients from the healthy controls, suggesting specific CNVs in cfDNA have clinical significance. We identified focal high-level DNA amplification in paired tumor and cfDNA clustered in a number of chromosome arms, some of which harbor genes with oncogenic potential, including USP17L2 (DUB3), BRF1, MTA1, and JAG2. Remarkably, in 50 patients on follow-up, specific CNVs were detected in cfDNA, mirroring the primary tumor, up to 12 yr after diagnosis despite no other evidence of disease. These data demonstrate the potential of SNP/CNV analysis of cfDNA to distinguish between patients with breast cancer and healthy controls during routine follow-up. The genomic profiles of cfDNA infer dormancy/minimal residual disease in the majority of patients on follow-up.

  17. A GENOME WIDE ASSOCIATION STUDY FOR DIABETIC NEPHROPATHY GENES IN AFRICAN AMERICANS

    PubMed Central

    McDonough, Caitrin W.; Palmer, Nicholette D.; Hicks, Pamela J.; Roh, Bong H.; An, S. Sandy; Cooke, Jessica N.; Hester, Jessica M.; Wing, Maria R.; Bostrom, Meredith A.; Rudock, Megan E.; Lewis, Joshua P.; Talbert, Matthew E.; Blevins, Rebecca A.; Lu, Lingyi; Ng, Maggie C.Y.; Sale, Michele M.; Divers, Jasmin; Langefeld, Carl D.; Freedman, Barry I.; Bowden, Donald W.

    2011-01-01

    A genome-wide association study was performed using the Affymetrix 6.0 chip to identify genes associated with diabetic nephropathy in African Americans. Association analysis was performed adjusting for admixture in 965 type 2 diabetic African American patients with end-stage renal disease (ESRD) and in 1029 African Americans without type 2 diabetes or kidney disease as controls. The top 724 single nucleotide polymorphisms (SNPs) with evidence of association to diabetic nephropathy were then genotyped in a replication sample of an additional 709 type 2 diabetes-ESRD patients and 690 controls. SNPs with evidence of association in both the original and replication studies were tested in additional African American cohorts consisting of 1246 patients with type 2 diabetes without kidney disease and 1216 with non-diabetic ESRD to differentiate candidate loci for type 2 diabetes-ESRD, type 2 diabetes, and/or all-cause ESRD. Twenty-five SNPs were significantly associated with type 2 diabetes-ESRD in the genome-wide association and initial replication. Although genome-wide significance with type 2 diabetes was not found for any of these 25 SNPs, several genes, including RPS12, LIMK2, and SFI1 are strong candidates for diabetic nephropathy. A combined analysis of all 2890 patients with ESRD showed significant association SNPs in LIMK2 and SFI1 suggesting that they also contribute to all-cause ESRD. Thus, our results suggest that multiple loci underlie susceptibility to kidney disease in African Americans with type 2 diabetes and some may also contribute to all-cause ESRD. PMID:21150874

  18. Genome-Wide Transcriptional Analysis of Genes Associated with Acute Desiccation Stress in Anopheles gambiae

    PubMed Central

    Wang, Mei-Hui; Marinotti, Osvaldo; Vardo-Zalik, Anne; Boparai, Rajni; Yan, Guiyun

    2011-01-01

    Malaria transmission in sub-Saharan Africa varies seasonally in intensity. Outbreaks of malaria occur after the beginning of the rainy season, whereas, during the dry season, reports of the disease are less frequent. Anopheles gambiae mosquitoes, the main malaria vector, are observed all year long but their densities are low during the dry season that generally lasts several months. Aestivation, seasonal migration, and local adaptation have been suggested as mechanisms that enable mosquito populations to persist through the dry season. Studies of chromosomal inversions have shown that inversions 2La, 2Rb, 2Rc, 2Rd, and 2Ru are associated with various physiological changes that confer aridity resistance. However, little is known about how phenotypic plasticity responds to seasonally dry conditions. This study examined the effects of desiccation stress on transcriptional regulation in An. gambiae. We exposed female An. gambiae G3 mosquitoes to acute desiccation and conducted a genome-wide analysis of their transcriptomes using the Affymetrix Plasmodium/Anopheles Genome Array. The transcription of 248 genes (1.7% of all transcripts) was significantly affected in all experimental conditions, including 96 with increased expression and 152 with decreased expression. In general, the data indicate a reduction in the metabolic rate of mosquitoes exposed to desiccation. Transcripts accumulated at higher levels during desiccation are associated with oxygen radical detoxification, DNA repair and stress responses. The proportion of transcripts within 2La and 2Rs (2Rb, 2Rc, 2Rd, and 2Ru) (67/248, or 27%) is similar to the percentage of transcripts located within these inversions (31%). These data may be useful in efforts to elucidate the role of chromosomal inversions in aridity tolerance. The scope of application of the anopheline genome demonstrates that examining transcriptional activity in relation to genotypic adaptations greatly expands the number of candidate regions

  19. Evaluation of Genome Wide Association Study Associated Type 2 Diabetes Susceptibility Loci in Sub Saharan Africans

    PubMed Central

    Adeyemo, Adebowale A.; Tekola-Ayele, Fasil; Doumatey, Ayo P.; Bentley, Amy R.; Chen, Guanjie; Huang, Hanxia; Zhou, Jie; Shriner, Daniel; Fasanmade, Olufemi; Okafor, Godfrey; Eghan, Benjamin; Agyenim-Boateng, Kofi; Adeleye, Jokotade; Balogun, Williams; Elkahloun, Abdel; Chandrasekharappa, Settara; Owusu, Samuel; Amoah, Albert; Acheampong, Joseph; Johnson, Thomas; Oli, Johnnie; Adebamowo, Clement; Collins, Francis; Dunston, Georgia; Rotimi, Charles N.

    2015-01-01

    Genome wide association studies (GWAS) for type 2 diabetes (T2D) undertaken in European and Asian ancestry populations have yielded dozens of robustly associated loci. However, the genomics of T2D remains largely understudied in sub-Saharan Africa (SSA), where rates of T2D are increasing dramatically and where the environmental background is quite different than in these previous studies. Here, we evaluate 106 reported T2D GWAS loci in continental Africans. We tested each of these SNPs, and SNPs in linkage disequilibrium (LD) with these index SNPs, for an association with T2D in order to assess transferability and to fine map the loci leveraging the generally reduced LD of African genomes. The study included 1775 unrelated Africans (1035 T2D cases, 740 controls; mean age 54 years; 59% female) enrolled in Nigeria, Ghana, and Kenya as part of the Africa America Diabetes Mellitus (AADM) study. All samples were genotyped on the Affymetrix Axiom PanAFR SNP array. Forty-one of the tested loci showed transferability to this African sample (p < 0.05, same direction of effect), 11 at the exact reported SNP and 30 others at SNPs in LD with the reported SNP (after adjustment for the number of tested SNPs). TCF7L2 SNP rs7903146 was the most significant locus in this study (p = 1.61 × 10−8). Most of the loci that showed transferability were successfully fine-mapped, i.e., localized to smaller haplotypes than in the original reports. The findings indicate that the genetic architecture of T2D in SSA is characterized by several risk loci shared with non-African ancestral populations and that data from African populations may facilitate fine mapping of risk loci. The study provides an important resource for meta-analysis of African ancestry populations and transferability of novel loci. PMID:26635871

  20. Transcriptome response analysis of Arabidopsis thaliana to leafminer (Liriomyza huidobrensis)

    PubMed Central

    2012-01-01

    Background Plants have evolved a complicated resistance system and exhibit a variety of defense patterns in response to different attackers. Previous studies have shown that responses of plants to chewing insects and phloem-feeding insects are significantly different. Less is known, however, regarding molecular responses to leafminer insects. To investigate plant transcriptome response to leafminers, we selected the leafminer Liriomyza huidobrensis, which has a special feeding pattern more similar to pathogen damage than that of chewing insects, as a model insect, and Arabidopsis thaliana as a response plant. Results We first investigated local and systemic responses of A. thaliana to leafminer feeding using an Affymetrix ATH1 genome array. Genes related to metabolic processes and stimulus responses were highly regulated. Most systemically-induced genes formed a subset of the local response genes. We then downloaded gene expression data from online databases and used hierarchical clustering to explore relationships among gene expression patterns in A. thaliana damaged by different attackers. Conclusions Our results demonstrate that plant response patterns are strongly coupled to damage patterns of attackers. PMID:23231622

  1. Arabidopsis gene expression patterns during spaceflight

    NASA Astrophysics Data System (ADS)

    Paul, A.-L.; Ferl, R. J.

    The exposure of Arabidopsis thaliana (Arabidopsis) plants to spaceflight environments resulted in the differential expression of hundreds of genes. A 5 day mission on orbiter Columbia in 1999 (STS-93) carried transgenic Arabidopsis plants engineered with a transgene composed of the alcohol dehydrogenase (Adh) gene promoter linked to the β -Glucuronidase (GUS) reporter gene. The plants were used to evaluate the effects of spaceflight on two fronts. First, expression patterns visualized with the Adh/GUS transgene were used to address specifically the possibility that spaceflight induces a hypoxic stress response, and to assess whether any spaceflight response was similar to control terrestrial hypoxia-induced gene expression patterns. (Paul et al., Plant Physiol. 2001, 126:613). Second, genome-wide patterns of native gene expression were evaluated utilizing the Affymetrix ATH1 GeneChip? array of 8,000 Arabidopsis genes. As a control for the veracity of the array analyses, a selection of genes identified with the arrays was further characterized with quantitative Real-Time RT PCR (ABI - TaqmanTM). Comparison of the patterns of expression for arrays of hybridized with RNA isolated from plants exposed to spaceflight compared to the control arrays revealed hundreds of genes that were differentially expressed in response to spaceflight, yet most genes that are hallmarks of hypoxic stress were unaffected. These results will be discussed in light of current models for plant responses to the spaceflight environment, and with regard to potential future flight opportunities.

  2. Genomics of Disease

    Technology Transfer Automated Retrieval System (TEKTRAN)

    This edited book represents the 23rd symposium in the Stadler Genetics Symposia series, and the general theme of this conference was "The Genomics of Disease." The 24 national and international speakers were invited to discuss their world-class research into the advances that genomics has made on c...

  3. Genetics and Genomics

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Good progress is being made on genetics and genomics of sugar beet, however it is in process and the tools are now being generated and some results are being analyzed. The GABI BeetSeq project released a first draft of the sugar beet genome of KWS2320, a dihaploid (see http://bvseq.molgen.mpg.de/Gen...

  4. Automated Microbial Genome Annotation

    SciTech Connect

    Land, Miriam

    2009-05-29

    Miriam Land of the DOE Joint Genome Institute at Oak Ridge National Laboratory gives a talk on the current state and future challenges of moving toward automated microbial genome annotation at the "Sequencing, Finishing, Analysis in the Future" meeting in Santa Fe, NM

  5. Genomics for Weed Science

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Numerous genomic-based studies have provided insight to the physiological and evolutionary processes involved in developmental and environmental processes of model plants such as arabidopsis and rice. However, far fewer efforts have been attempted to use genomic resources to study physiological and ...

  6. Unlocking the bovine genome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The draft genome sequence of cattle (Bos taurus) has now been analyzed by the Bovine Genome Sequencing and Analysis Consortium and the Bovine HapMap Consortium, which together represent an extensive collaboration involving more than 300 scientists from 25 different countries. ...

  7. Breeding-assisted genomics.

    PubMed

    Poland, Jesse

    2015-04-01

    The revolution of inexpensive sequencing has ushered in an unprecedented age of genomics. The promise of using this technology to accelerate plant breeding is being realized with a vision of genomics-assisted breeding that will lead to rapid genetic gain for expensive and difficult traits. The reality is now that robust phenotypic data is an increasing limiting resource to complement the current wealth of genomic information. While genomics has been hailed as the discipline to fundamentally change the scope of plant breeding, a more symbiotic relationship is likely to emerge. In the context of developing and evaluating large populations needed for functional genomics, none excel in this area more than plant breeders. While genetic studies have long relied on dedicated, well-structured populations, the resources dedicated to these populations in the context of readily available, inexpensive genotyping is making this philosophy less tractable relative to directly focusing functional genomics on material in breeding programs. Through shifting effort for basic genomic studies from dedicated structured populations, to capturing the entire scope of genetic determinants in breeding lines, we can move towards not only furthering our understanding of functional genomics in plants, but also rapidly improving crops for increased food security, availability and nutrition.

  8. The Future of Microbial Genomics

    SciTech Connect

    Kyrpides, Nikos

    2010-06-02

    Nikos Kyrpides, head of the Genome Biology group at the DOE Joint Genome Institute discusses current challenges in the field of microbial genomics on June 2, 2010 at the "Sequencing, Finishing, Analysis in the Future" meeting in Santa Fe, NM

  9. The UCSC Genome Browser

    PubMed Central

    Karolchik, Donna; Hinrichs, Angie S.; Kent, W. James

    2011-01-01

    The University of California Santa Cruz (UCSC) Genome Browser is a popular Web-based tool for quickly displaying a requested portion of a genome at any scale, accompanied by a series of aligned annotation “tracks.” The annotations generated by the UCSC Genome Bioinformatics Group and external collaborators include gene predictions, mRNA and expressed sequence tag alignments, simple nucleotide polymorphisms, expression and regulatory data, phenotype and variation data, and pairwise and multiple-species comparative genomics data. All information relevant to a region is presented in one window, facilitating biological analysis and interpretation. The database tables underlying the Genome Browser tracks can be viewed, downloaded, and manipulated using another Web-based application, the UCSC Table Browser. Users can upload personal datasets in a wide variety of formats as custom annotation tracks in both browsers for research or educational purposes. PMID:21975940

  10. AutoGenomics, Inc.

    PubMed

    Vairavan, Ram

    2004-07-01

    AutoGenomics has created an automated multiplexing microarray platform to make genomic and proteomic analyses routine and efficient for clinical and research laboratories. While the emergence of microarrays has advanced genomic analyses, a number of underlying issues, such as cross-hybridization, poor spot morphology and intrinsic fluorescence of the solid substrate, have yet to be fully resolved. Current methods use discrete instrumentation, are manual and require highly skilled labor, which leads to inconsistent results. AutoGenomics' automated platform uses a three-dimensional BioFilmChip microarray to circumvent these issues, providing optimal spot morphology and utilizing solution-based hybridization with allele-specific primer extension to improve single-base discrimination. AutoGenomics is developing applications for the early detection and management of complex disease states in oncology, cardiology, and mental disorders. Customers include clinical reference laboratories, hospitals, academic institutions, and pharmaceutical and biotech companies. Founded in 1999, the company is headquartered in Carlsbad, California, USA.

  11. Microbial Genomes Multiply

    NASA Technical Reports Server (NTRS)

    Doolittle, Russell F.

    2002-01-01

    The publication of the first complete sequence of a bacterial genome in 1995 was a signal event, underscored by the fact that the article has been cited more than 2,100 times during the intervening seven years. It was a marvelous technical achievement, made possible by automatic DNA-sequencing machines. The feat is the more impressive in that complete genome sequencing has now been adopted in many different laboratories around the world. Four years ago in these columns I examined the situation after a dozen microbial genomes had been completed. Now, with upwards of 60 microbial genome sequences determined and twice that many in progress, it seems reasonable to assess just what is being learned. Are new concepts emerging about how cells work? Have there been practical benefits in the fields of medicine and agriculture? Is it feasible to determine the genomic sequence of every bacterial species on Earth? The answers to these questions maybe Yes, Perhaps, and No, respectively.

  12. Comparative genomics of nematodes.

    PubMed

    Mitreva, Makedonka; Blaxter, Mark L; Bird, David M; McCarter, James P

    2005-10-01

    Recent transcriptome and genome projects have dramatically expanded the biological data available across the phylum Nematoda. Here we summarize analyses of these sequences, which have revealed multiple unexpected results. Despite a uniform body plan, nematodes are more diverse at the molecular level than was previously recognized, with many species- and group-specific novel genes. In the genus Caenorhabditis, changes in chromosome arrangement, particularly local inversions, are also rapid, with breakpoints occurring at 50-fold the rate in vertebrates. Tylenchid plant parasitic nematode genomes contain several genes closely related to genes in bacteria, implicating horizontal gene transfer events in the origins of plant parasitism. Functional genomics techniques are also moving from Caenorhabditis elegans to application throughout the phylum. Soon, eight more draft nematode genome sequences will be available. This unique resource will underpin both molecular understanding of these most abundant metazoan organisms and aid in the examination of the dynamics of genome evolution in animals.

  13. Genome-wide expression profiling reveals distinct clusters of transcriptional regulation during bovine preimplantation development in vivo.

    PubMed

    Kues, W A; Sudheer, S; Herrmann, D; Carnwath, J W; Havlicek, V; Besenfelder, U; Lehrach, H; Adjaye, J; Niemann, H

    2008-12-16

    Bovine embryos can be generated by in vitro fertilization or somatic nuclear transfer; however, these differ from their in vivo counterparts in many aspects and exhibit a higher proportion of developmental abnormalities. Here, we determined for the first time the transcriptomes of bovine metaphase II oocytes and all stages of preimplantation embryos developing in vivo up to the blastocyst using the Affymetrix GeneChip Bovine Genome Array which examines approximately 23,000 transcripts. The data show that bovine oocytes and embryos transcribed a significantly higher number of genes than somatic cells. Several hundred genes were transcribed well before the 8-cell stage, at which the major activation of the bovine genome expression occurs. Importantly, stage-specific expression patterns in 2-cell, 4-cell, and 8-cell stages, and in morulae and blastocysts, were detected, indicating dynamic changes in the embryonic transcriptome and in groups of transiently active genes. Pathway analysis revealed >120 biochemical pathways that are operative in early preimplantation bovine development. Significant differences were observed between the mRNA expression profiles of in vivo and in vitro matured oocytes, highlighting the need to include in vivo derived oocytes/embryos in studies evaluating assisted reproductive techniques. This study provides the first comprehensive analysis of gene expression and transcriptome dynamics of in vivo developing bovine embryos and will serve as a basis for improving assisted reproductive technology.

  14. Phytozome Comparative Plant Genomics Portal

    SciTech Connect

    Goodstein, David; Batra, Sajeev; Carlson, Joseph; Hayes, Richard; Phillips, Jeremy; Shu, Shengqiang; Schmutz, Jeremy; Rokhsar, Daniel

    2014-09-09

    The Dept. of Energy Joint Genome Institute is a genomics user facility supporting DOE mission science in the areas of Bioenergy, Carbon Cycling, and Biogeochemistry. The Plant Program at the JGI applies genomic, analytical, computational and informatics platforms and methods to: 1. Understand and accelerate the improvement (domestication) of bioenergy crops 2. Characterize and moderate plant response to climate change 3. Use comparative genomics to identify constrained elements and infer gene function 4. Build high quality genomic resource platforms of JGI Plant Flagship genomes for functional and experimental work 5. Expand functional genomic resources for Plant Flagship genomes

  15. Genome-enabled hitchhiking mapping identifies QTLs for stress resistance in natural Drosophila.

    PubMed

    Nuzhdin, S V; Harshman, L G; Zhou, M; Harmon, K

    2007-09-01

    Identification of genes underlying complex traits is an important problem. Quantitative trait loci (QTL) are mapped using marker-trait co-segregation in large panels of recombinant genotypes. Most frequently, recombinant inbred lines derived from two isogenic parents are used. Segregation patterns are also studied in pedigrees from multiple families. Great advances have been made through creative use of these techniques, but narrow sampling and inadequate power represent strong limitations. Here, we propose an approach combining the strengths of both techniques. We established a mapping population from a sample of natural genotypes, and applied artificial selection for a complex character. Selection changed the frequencies of alleles in QTLs contributing to the selection response. We infer QTLs with dense genotyping microarrays by identifying blocks of linked markers undergoing selective changes in allele frequency. We demonstrated this approach with an experimental population composed from 20 isogenic strains. Selection for starvation survival was executed in three replicated populations with three control non-selected populations. Three individuals per population were genotyped using Affymetrix GeneChips. Two regions of the genome, one each on the left arms of the second and third chromosomes, showed significant divergence between control and selected populations. For the former region, we inferred allele frequencies in selected and control populations by pyrosequencing. We conclude that the allele frequency difference, averaging approximately 40% between selected and control lines, contributed to selection response. Our approach can contribute to the fine scale decomposition of the genetics of direct and indirect selection responses, and genotype by environment interactions.

  16. A genome wide association study of pulmonary tuberculosis susceptibility in Indonesians

    PubMed Central

    2012-01-01

    Background There is reason to expect strong genetic influences on the risk of developing active pulmonary tuberculosis (TB) among latently infected individuals. Many of the genome wide linkage and association studies (GWAS) to date have been conducted on African populations. In order to identify additional targets in genetically dissimilar populations, and to enhance our understanding of this disease, we performed a multi-stage GWAS in a Southeast Asian cohort from Indonesia. Methods In stage 1, we used the Affymetrix 100 K SNP GeneChip marker set to genotype 259 Indonesian samples. After quality control filtering, 108 cases and 115 controls were analyzed for association of 95,207 SNPs. In stage 2, we attempted validation of 2,453 SNPs with promising associations from the first stage, in 1,189 individuals from the same Indonesian cohort, and finally in stage 3 we selected 251 SNPs from this stage to test TB association in an independent Caucasian cohort (n = 3,760) from Russia. Results Our study suggests evidence of association (P = 0.0004-0.0067) for 8 independent loci (nominal significance P < 0.05), which are located within or near the following genes involved in immune signaling: JAG1, DYNLRB2, EBF1, TMEFF2, CCL17, HAUS6, PENK and TXNDC4. Conclusions Mechanisms of immune defense suggested by some of the identified genes exhibit biological plausibility and may suggest novel pathways involved in the host containment of infection with TB. PMID:22239941

  17. A genome-wide approach to identify genetic variants that contribute to etoposide-induced cytotoxicity.

    PubMed

    Huang, R Stephanie; Duan, Shiwei; Bleibel, Wasim K; Kistner, Emily O; Zhang, Wei; Clark, Tyson A; Chen, Tina X; Schweitzer, Anthony C; Blume, John E; Cox, Nancy J; Dolan, M Eileen

    2007-06-05

    Large interindividual variance has been observed in sensitivity to drugs. To comprehensively decipher the genetic contribution to these variations in drug susceptibility, we present a genome-wide model using human lymphoblastoid cell lines from the International HapMap consortium, of which extensive genotypic information is available, to identify genetic variants that contribute to chemotherapeutic agent-induced cytotoxicity. Our model integrated genotype, gene expression, and sensitivity of HapMap cell lines to drugs. Cell lines derived from 30 trios of European descent (Center d'Etude du Polymorphisme Humain population) and 30 trios of African descent (Yoruban population) were used. Cell growth inhibition at increasing concentrations of etoposide for 72 h was determined by using alamarBlue assay. Gene expression on 176 HapMap cell lines (87 Center d'Etude du Polymorphisme Humain population and 89 Yoruban population) was determined by using the Affymetrix GeneChip Human Exon 1.0ST Array. We evaluated associations between genotype and cytotoxicity, genotype and gene expression and correlated gene expression of the identified candidates with cytotoxicity. The analysis identified 63 genetic variants that contribute to etoposide-induced toxicity through their effect on gene expression. These include genes that may play a role in cancer (AGPAT2, IL1B, and WNT5B) and genes not yet known to be associated with sensitivity to etoposide. This unbiased method can be used to elucidate genetic variants contributing to a wide range of cellular phenotypes induced by chemotherapeutic agents.

  18. Integrated genome-based studies of Shewanella ecophysiology

    SciTech Connect

    Segre Daniel; Beg Qasim

    2012-02-14

    This project was a component of the Shewanella Federation and, as such, contributed to the overall goal of applying the genomic tools to better understand eco-physiology and speciation of respiratory-versatile members of Shewanella genus. Our role at Boston University was to perform bioreactor and high throughput gene expression microarrays, and combine dynamic flux balance modeling with experimentally obtained transcriptional and gene expression datasets from different growth conditions. In the first part of project, we designed the S. oneidensis microarray probes for Affymetrix Inc. (based in California), then we identified the pathways of carbon utilization in the metal-reducing marine bacterium Shewanella oneidensis MR-1, using our newly designed high-density oligonucleotide Affymetrix microarray on Shewanella cells grown with various carbon sources. Next, using a combination of experimental and computational approaches, we built algorithm and methods to integrate the transcriptional and metabolic regulatory networks of S. oneidensis. Specifically, we combined mRNA microarray and metabolite measurements with statistical inference and dynamic flux balance analysis (dFBA) to study the transcriptional response of S. oneidensis MR-1 as it passes through exponential, stationary, and transition phases. By measuring time-dependent mRNA expression levels during batch growth of S. oneidensis MR-1 under two radically different nutrient compositions (minimal lactate and nutritionally rich LB medium), we obtain detailed snapshots of the regulatory strategies used by this bacterium to cope with gradually changing nutrient availability. In addition to traditional clustering, which provides a first indication of major regulatory trends and transcription factors activities, we developed and implemented a new computational approach for Dynamic Detection of Transcriptional Triggers (D2T2). This new method allows us to infer a putative topology of transcriptional dependencies

  19. NCBI viral genomes resource.

    PubMed

    Brister, J Rodney; Ako-Adjei, Danso; Bao, Yiming; Blinkova, Olga

    2015-01-01

    Recent technological innovations have ignited an explosion in virus genome sequencing that promises to fundamentally alter our understanding of viral biology and profoundly impact public health policy. Yet, any potential benefits from the billowing cloud of next generation sequence data hinge upon well implemented reference resources that facilitate the identification of sequences, aid in the assembly of sequence reads and provide reference annotation sources. The NCBI Viral Genomes Resource is a reference resource designed to bring order to this sequence shockwave and improve usability of viral sequence data. The resource can be accessed at http://www.ncbi.nlm.nih.gov/genome/viruses/ and catalogs all publicly available virus genome sequences and curates reference genome sequences. As the number of genome sequences has grown, so too have the difficulties in annotating and maintaining reference sequences. The rapid expansion of the viral sequence universe has forced a recalibration of the data model to better provide extant sequence representation and enhanced reference sequence products to serve the needs of the various viral communities. This, in turn, has placed increased emphasis on leveraging the knowledge of individual scientific communities to identify important viral sequences and develop well annotated reference virus genome sets.

  20. The banana genome hub.

    PubMed

    Droc, Gaëtan; Larivière, Delphine; Guignon, Valentin; Yahiaoui, Nabila; This, Dominique; Garsmeur, Olivier; Dereeper, Alexis; Hamelin, Chantal; Argout, Xavier; Dufayard, Jean-François; Lengelle, Juliette; Baurens, Franc-Christophe; Cenci, Alberto; Pitollat, Bertrand; D'Hont, Angélique; Ruiz, Manuel; Rouard, Mathieu; Bocs, Stéphanie

    2013-01-01

    Banana is one of the world's favorite fruits and one of the most important crops for developing countries. The banana reference genome sequence (Musa acuminata) was recently released. Given the taxonomic position of Musa, the completed genomic sequence has particular comparative value to provide fresh insights about the evolution of the monocotyledons. The study of the banana genome has been enhanced by a number of tools and resources that allows harnessing its sequence. First, we set up essential tools such as a Community Annotation System, phylogenomics resources and metabolic pathways. Then, to support post-genomic efforts, we improved banana existing systems (e.g. web front end, query builder), we integrated available Musa data into generic systems (e.g. markers and genetic maps, synteny blocks), we have made interoperable with the banana hub, other existing systems containing Musa data (e.g. transcriptomics, rice reference genome, workflow manager) and finally, we generated new results from sequence analyses (e.g. SNP and polymorphism analysis). Several uses cases illustrate how the Banana Genome Hub can be used to study gene families. Overall, with this collaborative effort, we discuss the importance of the interoperability toward data integration between existing information systems. Database URL: http://banana-genome.cirad.fr/

  1. Genomic Insights into Bifidobacteria

    PubMed Central

    Lee, Ju-Hoon; O'Sullivan, Daniel J.

    2010-01-01

    Summary: Since the discovery in 1899 of bifidobacteria as numerically dominant microbes in the feces of breast-fed infants, there have been numerous studies addressing their role in modulating gut microflora as well as their other potential health benefits. Because of this, they are frequently incorporated into foods as probiotic cultures. An understanding of their full interactions with intestinal microbes and the host is needed to scientifically validate any health benefits they may afford. Recently, the genome sequences of nine strains representing four species of Bifidobacterium became available. A comparative genome analysis of these genomes reveals a likely efficient capacity to adapt to their habitats, with B. longum subsp. infantis exhibiting more genomic potential to utilize human milk oligosaccharides, consistent with its habitat in the infant gut. Conversely, B. longum subsp. longum exhibits a higher genomic potential for utilization of plant-derived complex carbohydrates and polyols, consistent with its habitat in an adult gut. An intriguing observation is the loss of much of this genome potential when strains are adapted to pure culture environments, as highlighted by the genomes of B. animalis subsp. lactis strains, which exhibit the least potential for a gut habitat and are believed to have evolved from the B. animalis species during adaptation to dairy fermentation environments. PMID:20805404

  2. Who are the Okinawans? Ancestry, genome diversity, and implications for the genetic study of human longevity from a geographically isolated population.

    PubMed

    Bendjilali, Nasrine; Hsueh, Wen-Chi; He, Qimei; Willcox, D Craig; Nievergelt, Caroline M; Donlon, Timothy A; Kwok, Pui-Yan; Suzuki, Makoto; Willcox, Bradley J

    2014-12-01

    Isolated populations have advantages for genetic studies of longevity from decreased haplotype diversity and long-range linkage disequilibrium. This permits smaller sample sizes without loss of power, among other utilities. Little is known about the genome of the Okinawans, a potential population isolate, recognized for longevity. Therefore, we assessed genetic diversity, structure, and admixture in Okinawans, and compared this with Caucasians, Chinese, Japanese, and Africans from HapMap II, genotyped on the same Affymetrix GeneChip Human Mapping 500K array. Principal component analysis, haplotype coverage, and linkage disequilibrium decay revealed a distinct Okinawan genome-more homogeneity, less haplotype diversity, and longer range linkage disequilibrium. Population structure and admixture analyses utilizing 52 global reference populations from the Human Genome Diversity Cell Line Panel demonstrated that Okinawans clustered almost exclusively with East Asians. Sibling relative risk (λs) analysis revealed that siblings of Okinawan centenarians have 3.11 times (females) and 3.77 times (males) more likelihood of centenarianism. These findings suggest that Okinawans are genetically distinct and share several characteristics of a population isolate, which are prone to develop extreme phenotypes (eg, longevity) from genetic drift, natural selection, and population bottlenecks. These data support further exploration of genetic influence on longevity in the Okinawans.

  3. Ensembl comparative genomics resources.

    PubMed

    Herrero, Javier; Muffato, Matthieu; Beal, Kathryn; Fitzgerald, Stephen; Gordon, Leo; Pignatelli, Miguel; Vilella, Albert J; Searle, Stephen M J; Amode, Ridwan; Brent, Simon; Spooner, William; Kulesha, Eugene; Yates, Andrew; Flicek, Paul

    2016-01-01

    Evolution provides the unifying framework with which to understand biology. The coherent investigation of genic and genomic data often requires comparative genomics analyses based on whole-genome alignments, sets of homologous genes and other relevant datasets in order to evaluate and answer evolutionary-related questions. However, the complexity and computational requirements of producing such data are substantial: this has led to only a small number of reference resources that are used for most comparative analyses. The Ensembl comparative genomics resources are one such reference set that facilitates comprehensive and reproducible analysis of chordate genome data. Ensembl computes pairwise and multiple whole-genome alignments from which large-scale synteny, per-base conservation scores and constrained elements are obtained. Gene alignments are used to define Ensembl Protein Families, GeneTrees and homologies for both protein-coding and non-coding RNA genes. These resources are updated frequently and have a consistent informatics infrastructure and data presentation across all supported species. Specialized web-based visualizations are also available including synteny displays, collapsible gene tree plots, a gene family locator and different alignment views. The Ensembl comparative genomics infrastructure is extensively reused for the analysis of non-vertebrate species by other projects including Ensembl Genomes and Gramene and much of the information here is relevant to these projects. The consistency of the annotation across species and the focus on vertebrates makes Ensembl an ideal system to perform and support vertebrate comparative genomic analyses. We use robust software and pipelines to produce reference comparative data and make it freely available. Database URL: http://www.ensembl.org.

  4. What Is a Genome?

    PubMed Central

    Goldman, Aaron David; Landweber, Laura F.

    2016-01-01

    The genome is often described as the information repository of an organism. Whether millions or billions of letters of DNA, its transmission across generations confers the principal medium for inheritance of organismal traits. Several emerging areas of research demonstrate that this definition is an oversimplification. Here, we explore ways in which a deeper understanding of genomic diversity and cell physiology is challenging the concepts of physical permanence attached to the genome as well as its role as the sole information source for an organism. PMID:27442251

  5. Human Genome Program

    SciTech Connect

    Not Available

    1993-01-01

    The DOE Human Genome program has grown tremendously, as shown by the marked increase in the number of genome-funded projects since the last workshop held in 1991. The abstracts in this book describe the genome research of DOE-funded grantees and contractors and invited guests, and all projects are represented at the workshop by posters. The 3-day meeting includes plenary sessions on ethical, legal, and social issues pertaining to the availability of genetic data; sequencing techniques, informatics support; and chromosome and cDNA mapping and sequencing.

  6. Genetics and genomic medicine.

    PubMed

    Bogaard, Kali; Johnson, Marlene

    2009-01-01

    Genetics is playing an increasingly important role in the diagnosis, monitoring and treatment of diseases, and the expansion of genetics into health care has generated the field of genomic medicine. Health care delivery is shifting away from general diagnostic evaluation toward a generation of therapeutics based on a patient's genetic makeup. Meanwhile, the scientific community debates how best to incorporate genetics and genomic medicine into practice. While obstacles remain, the ultimate goal is to use information generated from the study of human genetics to improve disease treatment, cure and prevention. As the use of genetics in medical diagnosis and treatment increases, health care workers will require an understanding of genetics and genomic medicine.

  7. Genomic variation in maize

    SciTech Connect

    Rivin, C.J.

    1990-01-01

    We have endeavored to learn to learn how different DNA sequences and sequence arrangements contribute to genome plasticity in maize. We describe quantitative variation among maize inbred lines for tandemly arrayed and dispersed repeated DNA sequences and gene families, and qualitative variation for sequences homologous to the Mutator family of transposons. The potential of these sequences to undergo unequal crossing over, non-allelic (ectopic) recombination and transposition makes them a source of genome instability. We have found examples of rapid genomic change involving these sequences in F1 hybrids, tissue culture cells and regenerated plants.

  8. Human Genome Project

    SciTech Connect

    Block, S.; Cornwall, J.; Dally, W.; Dyson, F.; Fortson, N.; Joyce, G.; Kimble, H. J.; Lewis, N.; Max, C.; Prince, T.; Schwitters, R.; Weinberger, P.; Woodin, W. H.

    1998-01-04

    The study reviews Department of Energy supported aspects of the United States Human Genome Project, the joint National Institutes of Health/Department of Energy program to characterize all human genetic material, to discover the set of human genes, and to render them accessible for further biological study. The study concentrates on issues of technology, quality assurance/control, and informatics relevant to current effort on the genome project and needs beyond it. Recommendations are presented on areas of the genome program that are of particular interest to and supported by the Department of Energy.

  9. Genomic Grade Index (GGI): Feasibility in Routine Practice and Impact on Treatment Decisions in Early Breast Cancer

    PubMed Central

    Metzger-Filho, Otto; Catteau, Aurélie; Michiels, Stefan; Buyse, Marc; Ignatiadis, Michail; Saini, Kamal S.; de Azambuja, Evandro; Fasolo, Virginie; Naji, Sihem; Canon, Jean Luc; Delrée, Paul; Coibion, Michel; Cusumano, Pino; Jossa, Veronique; Kains, Jean Pierre; Larsimont, Denis; Richard, Vincent; Faverly, Daniel; Cornez, Nathalie; Vuylsteke, Peter; Vanderschueren, Brigitte; Peyro-Saint-Paul, Hélène; Piccart, Martine; Sotiriou, Christos

    2013-01-01

    Purpose Genomic Grade Index (GGI) is a 97-gene signature that improves histologic grade (HG) classification in invasive breast carcinoma. In this prospective study we sought to evaluate the feasibility of performing GGI in routine clinical practice and its impact on treatment recommendations. Methods Patients with pT1pT2 or operable pT3, N0-3 invasive breast carcinoma were recruited from 8 centers in Belgium. Fresh surgical samples were sent at room temperature in the MapQuant Dx™ PathKit for centralized genomic analysis. Genomic profiles were determined using Affymetrix U133 Plus 2.0 and GGI calculated using the MapQuant Dx® protocol, which defines tumors as low or high Genomic Grade (GG-1 and GG-3 respectively). Results 180 pts were recruited and 155 were eligible. The MapQuant test was performed in 142 cases and GGI was obtained in 78% of cases (n=111). Reasons for failures were 15 samples with <30% of invasive tumor cells (11%), 15 with insufficient RNA quality (10%), and 1 failed hybridization (<1%). For tumors with an available representative sample (≥ 30% inv. tumor cells) (n=127), the success rate was 87.5%. GGI reclassified 69% of the 54 HG2 tumors as GG-1 (54%) or GG-3 (46%). Changes in treatment recommendations occurred mainly in the subset of HG2 tumors reclassified into GG-3, with increased use of chemotherapy in this subset. Conclusion The use of GGI is feasible in routine clinical practice and impacts treatment decisions in early-stage breast cancer. Trial Registration ClinicalTrials.gov NCT01916837, http://clinicaltrials.gov/ct2/show/NCT01916837 PMID:23990869

  10. Genome-Wide Association Study of Lp-PLA2 Activity and Mass in the Framingham Heart Study

    PubMed Central

    Suchindran, Sunil; Rivedal, David; Guyton, John R.; Milledge, Tom; Gao, Xiaoyi; Benjamin, Ashlee; Rowell, Jennifer; Ginsburg, Geoffrey S.; McCarthy, Jeanette J.

    2010-01-01

    Lipoprotein-associated phospholipase A2 (Lp-PLA2) is an emerging risk factor and therapeutic target for cardiovascular disease. The activity and mass of this enzyme are heritable traits, but major genetic determinants have not been explored in a systematic, genome-wide fashion. We carried out a genome-wide association study of Lp-PLA2 activity and mass in 6,668 Caucasian subjects from the population-based Framingham Heart Study. Clinical data and genotypes from the Affymetrix 550K SNP array were obtained from the open-access Framingham SHARe project. Each polymorphism that passed quality control was tested for associations with Lp-PLA2 activity and mass using linear mixed models implemented in the R statistical package, accounting for familial correlations, and controlling for age, sex, smoking, lipid-lowering-medication use, and cohort. For Lp-PLA2 activity, polymorphisms at four independent loci reached genome-wide significance, including the APOE/APOC1 region on chromosome 19 (p = 6×10−24); CELSR2/PSRC1 on chromosome 1 (p = 3×10−15); SCARB1 on chromosome 12 (p = 1×10−8) and ZNF259/BUD13 in the APOA5/APOA1 gene region on chromosome 11 (p = 4×10−8). All of these remained significant after accounting for associations with LDL cholesterol, HDL cholesterol, or triglycerides. For Lp-PLA2 mass, 12 SNPs achieved genome-wide significance, all clustering in a region on chromosome 6p12.3 near the PLA2G7 gene. Our analyses demonstrate that genetic polymorphisms may contribute to inter-individual variation in Lp-PLA2 activity and mass. PMID:20442857

  11. Center for Cancer Genomics | Office of Cancer Genomics

    Cancer.gov

    The Center for Cancer Genomics (CCG) was established to unify the National Cancer Institute's activities in cancer genomics, with the goal of advancing genomics research and translating findings into the clinic to improve the precise diagnosis and treatment of cancers. In addition to promoting genomic sequencing app

  12. Genomic libraries: I. Construction and screening of fosmid genomic libraries.

    PubMed

    Quail, Mike A; Matthews, Lucy; Sims, Sarah; Lloyd, Christine; Beasley, Helen; Baxter, Simon W

    2011-01-01

    Large insert genome libraries have been a core resource required to sequence genomes, analyze haplotypes, and aid gene discovery. While next generation sequencing technologies are revolutionizing the field of genomics, traditional genome libraries will still be required for accurate genome assembly. Their utility is also being extended to functional studies for understanding DNA regulatory elements. Here, we present a detailed method for constructing genomic fosmid libraries, testing for common contaminants, gridding the library to nylon membranes, then hybridizing the library membranes with a radiolabeled probe to identify corresponding genomic clones. While this chapter focuses on fosmid libraries, many of these steps can also be applied to bacterial artificial chromosome libraries.

  13. Comparative primate genomics: emerging patterns of genome content and dynamics

    PubMed Central

    Rogers, Jeffrey; Gibbs, Richard A.

    2014-01-01

    Preface Advances in genome sequencing technologies have created new opportunities for comparative primate genomics. Genome assemblies have been published for several primates, with analyses of several others underway. Whole genome assemblies for the great apes provide remarkable new information about the evolutionary origins of the human genome and the processes involved. Genomic data for macaques and other nonhuman primates provide valuable insight into genetic similarities and differences among species used as models for disease-related research. This review summarizes current knowledge regarding primate genome content and dynamics and offers a series of goals for the near future. PMID:24709753

  14. Hymenoptera Genome Database: integrating genome annotations in HymenopteraMine

    PubMed Central

    Elsik, Christine G.; Tayal, Aditi; Diesh, Colin M.; Unni, Deepak R.; Emery, Marianne L.; Nguyen, Hung N.; Hagen, Darren E.

    2016-01-01

    We report an update of the Hymenoptera Genome Database (HGD) (http://HymenopteraGenome.org), a model organism database for insect species of the order Hymenoptera (ants, bees and wasps). HGD maintains genomic data for 9 bee species, 10 ant species and 1 wasp, including the versions of genome and annotation data sets published by the genome sequencing consortiums and those provided by NCBI. A new data-mining warehouse, HymenopteraMine, based on the InterMine data warehousing system, integrates the genome data with data from external sources and facilitates cross-species analyses based on orthology. New genome browsers and annotation tools based on JBrowse/WebApollo provide easy genome navigation, and viewing of high throughput sequence data sets and can be used for collaborative genome annotation. All of the genomes and annotation data sets are combined into a single BLAST server that allows users to select and combine sequence data sets to search. PMID:26578564

  15. Hymenoptera Genome Database: integrating genome annotations in HymenopteraMine.

    PubMed

    Elsik, Christine G; Tayal, Aditi; Diesh, Colin M; Unni, Deepak R; Emery, Marianne L; Nguyen, Hung N; Hagen, Darren E

    2016-01-04

    We report an update of the Hymenoptera Genome Database (HGD) (http://HymenopteraGenome.org), a model organism database for insect species of the order Hymenoptera (ants, bees and wasps). HGD maintains genomic data for 9 bee species, 10 ant species and 1 wasp, including the versions of genome and annotation data sets published by the genome sequencing consortiums and those provided by NCBI. A new data-mining warehouse, HymenopteraMine, based on the InterMine data warehousing system, integrates the genome data with data from external sources and facilitates cross-species analyses based on orthology. New genome browsers and annotation tools based on JBrowse/WebApollo provide easy genome navigation, and viewing of high throughput sequence data sets and can be used for collaborative genome annotation. All of the genomes and annotation data sets are combined into a single BLAST server that allows users to select and combine sequence data sets to search.

  16. Genomic imprinting and reproduction.

    PubMed

    Swales, A K E; Spears, N

    2005-10-01

    Genomic imprinting is the parent-of-origin specific gene expression which is a vital mechanism through both development and adult life. One of the key elements of the imprinting mechanism is DNA methylation, controlled by DNA methyltransferase enzymes. Germ cells undergo reprogramming to ensure that sex-specific genomic imprinting is initiated, thus allowing normal embryo development to progress after fertilisation. In some cases, errors in genomic imprinting are embryo lethal while in others they lead to developmental disorders and disease. Recent studies have suggested a link between the use of assisted reproductive techniques and an increase in normally rare imprinting disorders. A greater understanding of the mechanisms of genomic imprinting and the factors that influence them are important in assessing the safety of these techniques.

  17. Rubicon Genomics, Inc.

    PubMed

    Langmore, John P

    2002-07-01

    Rubicon Genomics, Inc. is a leader in development and application of effective methods to analyze human DNA for genome-wide genotyping and haplotyping. The company is developing its proprietary OmniPlex technology as an integrated platform for archiving, amplifying and analyzing patient DNA for drug target discovery, pharmacogenomics and diagnostics. Single-site, multiple-site or whole genome amplification can be done using small samples of DNA that have been archived as OmniPlex DNA. Rubicon technology will make genome-wide SNP scoring faster, more accurate, more robust and less expensive. Rubicon will partner with pharmaceutical and diagnostic companies, as well as the makers of instruments and reagents to bring OmniPlex technology to the widest market - increasing the pipeline of more effective and safer drugs and ushering in the practice of gene-based medicine.

  18. Mouse genome database 2016

    PubMed Central

    Bult, Carol J.; Eppig, Janan T.; Blake, Judith A.; Kadin, James A.; Richardson, Joel E.

    2016-01-01

    The Mouse Genome Database (MGD; http://www.informatics.jax.org) is the primary community model organism database for the laboratory mouse and serves as the source for key biological reference data related to mouse genes, gene functions, phenotypes and disease models with a strong emphasis on the relationship of these data to human biology and disease. As the cost of genome-scale sequencing continues to decrease and new technologies for genome editing become widely adopted, the laboratory mouse is more important than ever as a model system for understanding the biological significance of human genetic variation and for advancing the basic research needed to support the emergence of genome-guided precision medicine. Recent enhancements to MGD include new graphical summaries of biological annotations for mouse genes, support for mobile access to the database, tools to support the annotation and analysis of sets of genes, and expanded support for comparative biology through the expansion of homology data. PMID:26578600

  19. The rise of genomics.

    PubMed

    Weissenbach, Jean

    2016-01-01

    A brief history of the development of genomics is provided. Complete sequencing of genomes of uni- and multicellular organisms is based on important progress in sequencing and bioinformatics. Evolution of these methods is ongoing and has triggered an explosion in data production and analysis. Initial analyses focused on the inventory of genes encoding proteins. Completeness and quality of gene prediction remains crucial. Genome analyses profoundly modified our views on evolution, biodiversity and contributed to the detection of new functions, yet to be fully elucidated, such as those fulfilled by non-coding RNAs. Genomics has become the basis for the study of biology and provides the molecular support for a bunch of large-scale studies, the omics.

  20. Mouse genome database 2016.

    PubMed

    Bult, Carol J; Eppig, Janan T; Blake, Judith A; Kadin, James A; Richardson, Joel E

    2016-01-04

    The Mouse Genome Database (MGD; http://www.informatics.jax.org) is the primary community model organism database for the laboratory mouse and serves as the source for key biological reference data related to mouse genes, gene functions, phenotypes and disease models with a strong emphasis on the relationship of these data to human biology and disease. As the cost of genome-scale sequencing continues to decrease and new technologies for genome editing become widely adopted, the laboratory mouse is more important than ever as a model system for understanding the biological significance of human genetic variation and for advancing the basic research needed to support the emergence of genome-guided precision medicine. Recent enhancements to MGD include new graphical summaries of biological annotations for mouse genes, support for mobile access to the database, tools to support the annotation and analysis of sets of genes, and expanded support for comparative biology through the expansion of homology data.

  1. Human genomic variation

    PubMed Central

    Disotell, Todd R

    2000-01-01

    The recent completion and assembly of the first draft of the human genome, which combines samples from several ethnically diverse males and females, provides preliminary data on the extent of human genetic variation. PMID:11178257

  2. Genomic definition of species

    SciTech Connect

    Crkvenjakov, R.; Drmanac, R.

    1991-07-01

    The subject of this paper is the definition of species based on the assumption that genome is the fundamental level for the origin and maintenance of biological diversity. For this view to be logically consistent it is necessary to assume the existence and operation of the new law which we call genome law. For this reason the genome law is included in the explanation of species phenomenon presented here even if its precise formulation and elaboration are left for the future. The intellectual underpinnings of this definition can be traced to Goldschmidt. We wish to explore some philosophical aspects of the definition of species in terms of the genome. The point of proposing the definition on these grounds is that any real advance in evolutionary theory has to be correct in both its philosophy and its science.

  3. Lophotrochozoan mitochondrial genomes

    SciTech Connect

    Valles, Yvonne; Boore, Jeffrey L.

    2005-10-01

    Progress in both molecular techniques and phylogeneticmethods has challenged many of the interpretations of traditionaltaxonomy. One example is in the recognition of the animal superphylumLophotrochozoa (annelids, mollusks, echiurans, platyhelminthes,brachiopods, and other phyla), although the relationships within thisgroup and the inclusion of some phyla remain uncertain. While much ofthis progress in phylogenetic reconstruction has been based on comparingsingle gene sequences, we are beginning to see the potential of comparinglarge-scale features of genomes, such as the relative order of genes.Even though tremendous progress is being made on the sequencedetermination of whole nuclear genomes, the dataset of choice forgenome-level characters for many animals across a broad taxonomic rangeremains mitochondrial genomes. We review here what is known aboutmitochondrial genomes of the lophotrochozoans and discuss the promisethat this dataset will enable insight into theirrelationships.

  4. Platyzoan mitochondrial genomes.

    PubMed

    Wey-Fabrizius, Alexandra R; Podsiadlowski, Lars; Herlyn, Holger; Hankeln, Thomas

    2013-11-01

    Platyzoa is a putative lophotrochozoan (spiralian) subtaxon within the protostome clade of Metazoa, comprising a range of biologically diverse, mostly small worm-shaped animals. The monophyly of Platyzoa, the relationships between the putative subgroups Platyhelminthes, Gastrotricha and Gnathifera (the latter comprising at least Gnathostomulida, "Rotifera" and Acanthocephala) as well as some aspects of the internal phylogenies of these subgroups are highly debated. Here we review how complete mitochondrial (mt) genome data contribute to these debates. We highlight special features of the mt genomes and discuss problems in mtDNA phylogenies of the clade. Mitochondrial genome data seem to be insufficient to resolve the position of the platyzoan clade within the Spiralia but can help to address internal phylogenetic questions. The present review includes a tabular survey of all published platyzoan mt genomes.

  5. Epidemiology & Genomics Research Program

    Cancer.gov

    The Epidemiology and Genomics Research Program, in the National Cancer Institute's Division of Cancer Control and Population Sciences, funds research in human populations to understand the determinants of cancer occurrence and outcomes.

  6. Biobanks for Genomics and Genomics for Biobanks

    PubMed Central

    Ducournau, Pascal; Gourraud, Pierre-Antoine; Pontille, David

    2003-01-01

    Biobanks include biological samples and attached databases. Human biobanks occur in research, technological development and medical activities. Population genomics is highly dependent on the availability of large biobanks. Ethical issues must be considered: protecting the rights of those people whose samples or data are in biobanks (information, autonomy, confidentiality, protection of private life), assuring the non-commercial use of human body elements and the optimal use of samples and data. They balance other issues, such as protecting the rights of researchers and companies, allowing long-term use of biobanks while detailed information on future uses is not available. At the level of populations, the traditional form of informed consent is challenged. Other dimensions relate to the rights of a group as such, in addition to individual rights. Conditions of return of results and/or benefit to a population need to be defined. With ‘large-scale biobanking’ a marked trend in genomics, new societal dimensions appear, regarding communication, debate, regulation, societal control and valorization of such large biobanks. Exploring how genomics can help health sector biobanks to become more rationally constituted and exploited is an interesting perspective. For example, evaluating how genomic approaches can help in optimizing haematopoietic stem cell donor registries using new markers and high-throughput techniques to increase immunogenetic variability in such registries is a challenge currently being addressed. Ethical issues in such contexts are important, as not only individual decisions or projects are concerned, but also national policies in the international arena and organization of democratic debate about science, medicine and society. PMID:18629026

  7. An Introduction to Genome Annotation.

    PubMed

    Campbell, Michael S; Yandell, Mark

    2015-12-17

    Genome projects have evolved from large international undertakings to tractable endeavors for a single lab. Accurate genome annotation is critical for successful genomic, genetic, and molecular biology experiments. These annotations can be generated using a number of approaches and available software tools. This unit describes methods for genome annotation and a number of software tools commonly used in gene annotation.

  8. Molluscan Evolutionary Genomics

    SciTech Connect

    Simison, W. Brian; Boore, Jeffrey L.

    2005-12-01

    In the last 20 years there have been dramatic advances in techniques of high-throughput DNA sequencing, most recently accelerated by the Human Genome Project, a program that has determined the three billion base pair code on which we are based. Now this tremendous capability is being directed at other genome targets that are being sampled across the broad range of life. This opens up opportunities as never before for evolutionary and organismal biologists to address questions of both processes and patterns of organismal change. We stand at the dawn of a new 'modern synthesis' period, paralleling that of the early 20th century when the fledgling field of genetics first identified the underlying basis for Darwin's theory. We must now unite the efforts of systematists, paleontologists, mathematicians, computer programmers, molecular biologists, developmental biologists, and others in the pursuit of discovering what genomics can teach us about the diversity of life. Genome-level sampling for mollusks to date has mostly been limited to mitochondrial genomes and it is likely that these will continue to provide the best targets for broad phylogenetic sampling in the near future. However, we are just beginning to see an inroad into complete nuclear genome sequencing, with several mollusks and other eutrochozoans having been selected for work about to begin. Here, we provide an overview of the state of molluscan mitochondrial genomics, highlight a few of the discoveries from this research, outline the promise of broadening this dataset, describe upcoming projects to sequence whole mollusk nuclear genomes, and challenge the community to prepare for making the best use of these data.

  9. Automated Microfluidics for Genomics

    DTIC Science & Technology

    2007-11-02

    the automation of it, see [4]. In the Genomation Laboratory at the Univ. of Washington (http://rcs.ee.washington.edu/GNL/genomation.html) and with Orca ...reproducible biology without contamination . The high throughput capability is competitive with large scale robotic batch processing. III. INSTRUMENTATION...essentially arbitrary low volume, and without any contact that might cause contamination . A. ACAPELLA-5K Core Processor The ACAPELLA-5K was designed with

  10. Bacteriophage T4 genome.

    PubMed

    Miller, Eric S; Kutter, Elizabeth; Mosig, Gisela; Arisaka, Fumio; Kunisawa, Takashi; Rüger, Wolfgang

    2003-03-01

    Phage T4 has provided countless contributions to the paradigms of genetics and biochemistry. Its complete genome sequence of 168,903 bp encodes about 300 gene products. T4 biology and its genomic sequence provide the best-understood model for modern functional genomics and proteomics. Variations on gene expression, including overlapping genes, internal translation initiation, spliced genes, translational bypassing, and RNA processing, alert us to the caveats of purely computational methods. The T4 transcriptional pattern reflects its dependence on the host RNA polymerase and the use of phage-encoded proteins that sequentially modify RNA polymerase; transcriptional activator proteins, a phage sigma factor, anti-sigma, and sigma decoy proteins also act to specify early, middle, and late promoter recognition. Posttranscriptional controls by T4 provide excellent systems for the study of RNA-dependent processes, particularly at the structural level. The redundancy of DNA replication and recombination systems of T4 reveals how phage and other genomes are stably replicated and repaired in different environments, providing insight into genome evolution and adaptations to new hosts and growth environments. Moreover, genomic sequence analysis has provided new insights into tail fiber variation, lysis, gene duplications, and membrane localization of proteins, while high-resolution structural determination of the "cell-puncturing device," combined with the three-dimensional image reconstruction of the baseplate, has revealed the mechanism of penetration during infection. Despite these advances, nearly 130 potential T4 genes remain uncharacterized. Current phage-sequencing initiatives are now revealing the similarities and differences among members of the T4 family, including those that infect bacteria other than Escherichia coli. T4 functional genomics will aid in the interpretation of these newly sequenced T4-related genomes and in broadening our understanding of the complex

  11. National Plant Genome Initiative

    DTIC Science & Technology

    2005-01-01

    Genomics” was held to bring together researchers working on legumes such as Medicago, alfalfa, soybean, bean, lotus, cowpea , and chickpea to discuss... Cowpea and Pigeonpea for India and Africa Chickpea, cowpea , and pigeonpea are staple crops in India and Africa yet lack a critical mass of genomic tools...Team in the fi eld; The NSF Potato Genome Project Page 14 - Cowpea and Chickpea images; Dr. Jane Silverthorne, NSF Page 15 - CCGI Logo; Jennifer Foltz

  12. Ebolavirus comparative genomics

    DOE PAGES

    Jun, Se-Ran; Leuze, Michael R.; Nookaew, Intawat; ...

    2015-07-14

    The 2014 Ebola outbreak in West Africa is the largest documented for this virus. We examine the dynamics of this genome, comparing more than one hundred currently available ebolavirus genomes to each other and to other viral genomes. Based on oligomer frequency analysis, the family Filoviridae forms a distinct group from all other sequenced viral genomes. All filovirus genomes sequenced to date encode proteins with similar functions and gene order, although there is considerable divergence in sequences between the three genera Ebolavirus, Cuevavirus, and Marburgvirus within the family Filoviridae. Whereas all ebolavirus genomes are quite similar (multiple sequences of themore » same strain are often identical), variation is most common in the intergenic regions and within specific areas of the genes encoding the glycoprotein (GP), nucleoprotein (NP), and polymerase (L). We predict regions that could contain epitope-binding sites, which might be good vaccine targets. In conclusion, this information, combined with glycosylation sites and experimentally determined epitopes, can identify the most promising regions for the development of therapeutic strategies.« less

  13. Genomic Instability in Cancer

    PubMed Central

    Abbas, Tarek; Keaton, Mignon A.; Dutta, Anindya

    2013-01-01

    One of the fundamental challenges facing the cell is to accurately copy its genetic material to daughter cells. When this process goes awry, genomic instability ensues in which genetic alterations ranging from nucleotide changes to chromosomal translocations and aneuploidy occur. Organisms have developed multiple mechanisms that can be classified into two major classes to ensure the fidelity of DNA replication. The first class includes mechanisms that prevent premature initiation of DNA replication and ensure that the genome is fully replicated once and only once during each division cycle. These include cyclin-dependent kinase (CDK)-dependent mechanisms and CDK-independent mechanisms. Although CDK-dependent mechanisms are largely conserved in eukaryotes, higher eukaryotes have evolved additional mechanisms that seem to play a larger role in preventing aberrant DNA replication and genome instability. The second class ensures that cells are able to respond to various cues that continuously threaten the integrity of the genome by initiating DNA-damage-dependent “checkpoints” and coordinating DNA damage repair mechanisms. Defects in the ability to safeguard against aberrant DNA replication and to respond to DNA damage contribute to genomic instability and the development of human malignancy. In this article, we summarize our current knowledge of how genomic instability arises, with a particular emphasis on how the DNA replication process can give rise to such instability. PMID:23335075

  14. Human Genome Annotation

    NASA Astrophysics Data System (ADS)

    Gerstein, Mark

    A central problem for 21st century science is annotating the human genome and making this annotation useful for the interpretation of personal genomes. My talk will focus on annotating the 99% of the genome that does not code for canonical genes, concentrating on intergenic features such as structural variants (SVs), pseudogenes (protein fossils), binding sites, and novel transcribed RNAs (ncRNAs). In particular, I will describe how we identify regulatory sites and variable blocks (SVs) based on processing next-generation sequencing experiments. I will further explain how we cluster together groups of sites to create larger annotations. Next, I will discuss a comprehensive pseudogene identification pipeline, which has enabled us to identify >10K pseudogenes in the genome and analyze their distribution with respect to age, protein family, and chromosomal location. Throughout, I will try to introduce some of the computational algorithms and approaches that are required for genome annotation. Much of this work has been carried out in the framework of the ENCODE, modENCODE, and 1000 genomes projects.

  15. An archaeal genomic signature

    NASA Technical Reports Server (NTRS)

    Graham, D. E.; Overbeek, R.; Olsen, G. J.; Woese, C. R.

    2000-01-01

    Comparisons of complete genome sequences allow the most objective and comprehensive descriptions possible of a lineage's evolution. This communication uses the completed genomes from four major euryarchaeal taxa to define a genomic signature for the Euryarchaeota and, by extension, the Archaea as a whole. The signature is defined in terms of the set of protein-encoding genes found in at least two diverse members of the euryarchaeal taxa that function uniquely within the Archaea; most signature proteins have no recognizable bacterial or eukaryal homologs. By this definition, 351 clusters of signature proteins have been identified. Functions of most proteins in this signature set are currently unknown. At least 70% of the clusters that contain proteins from all the euryarchaeal genomes also have crenarchaeal homologs. This conservative set, which appears refractory to horizontal gene transfer to the Bacteria or the Eukarya, would seem to reflect the significant innovations that were unique and fundamental to the archaeal "design fabric." Genomic protein signature analysis methods may be extended to characterize the evolution of any phylogenetically defined lineage. The complete set of protein clusters for the archaeal genomic signature is presented as supplementary material (see the PNAS web site, www.pnas.org).

  16. Human Social Genomics

    PubMed Central

    Cole, Steven W.

    2014-01-01

    A growing literature in human social genomics has begun to analyze how everyday life circumstances influence human gene expression. Social-environmental conditions such as urbanity, low socioeconomic status, social isolation, social threat, and low or unstable social status have been found to associate with differential expression of hundreds of gene transcripts in leukocytes and diseased tissues such as metastatic cancers. In leukocytes, diverse types of social adversity evoke a common conserved transcriptional response to adversity (CTRA) characterized by increased expression of proinflammatory genes and decreased expression of genes involved in innate antiviral responses and antibody synthesis. Mechanistic analyses have mapped the neural “social signal transduction” pathways that stimulate CTRA gene expression in response to social threat and may contribute to social gradients in health. Research has also begun to analyze the functional genomics of optimal health and thriving. Two emerging opportunities now stand to revolutionize our understanding of the everyday life of the human genome: network genomics analyses examining how systems-level capabilities emerge from groups of individual socially sensitive genomes and near-real-time transcriptional biofeedback to empirically optimize individual well-being in the context of the unique genetic, geographic, historical, developmental, and social contexts that jointly shape the transcriptional realization of our innate human genomic potential for thriving. PMID:25166010

  17. How the genome folds

    NASA Astrophysics Data System (ADS)

    Lieberman Aiden, Erez

    2012-02-01

    I describe Hi-C, a novel technology for probing the three-dimensional architecture of whole genomes by coupling proximity-based ligation with massively parallel sequencing. Working with collaborators at the Broad Institute and UMass Medical School, we used Hi-C to construct spatial proximity maps of the human genome at a resolution of 1Mb. These maps confirm the presence of chromosome territories and the spatial proximity of small, gene-rich chromosomes. We identified an additional level of genome organization that is characterized by the spatial segregation of open and closed chromatin to form two genome-wide compartments. At the megabase scale, the chromatin conformation is consistent with a fractal globule, a knot-free conformation that enables maximally dense packing while preserving the ability to easily fold and unfold any genomic locus. The fractal globule is distinct from the more commonly used globular equilibrium model. Our results demonstrate the power of Hi-C to map the dynamic conformations of whole genomes.

  18. Upregulation of FOXM1 induces genomic instability in human epidermal keratinocytes

    PubMed Central

    2010-01-01

    Background The human cell cycle transcription factor FOXM1 is known to play a key role in regulating timely mitotic progression and accurate chromosomal segregation during cell division. Deregulation of FOXM1 has been linked to a majority of human cancers. We previously showed that FOXM1 was upregulated in basal cell carcinoma and recently reported that upregulation of FOXM1 precedes malignancy in a number of solid human cancer types including oral, oesophagus, lung, breast, kidney, bladder and uterus. This indicates that upregulation of FOXM1 may be an early molecular signal required for aberrant cell cycle and cancer initiation. Results The present study investigated the putative early mechanism of UVB and FOXM1 in skin cancer initiation. We have demonstrated that UVB dose-dependently increased FOXM1 protein levels through protein stabilisation and accumulation rather than de novo mRNA expression in human epidermal keratinocytes. FOXM1 upregulation in primary human keratinocytes triggered pro-apoptotic/DNA-damage checkpoint response genes such as p21, p38 MAPK, p53 and PARP, however, without causing significant cell cycle arrest or cell death. Using a high-resolution Affymetrix genome-wide single nucleotide polymorphism (SNP) mapping technique, we provided the evidence that FOXM1 upregulation in epidermal keratinocytes is sufficient to induce genomic instability, in the form of loss of heterozygosity (LOH) and copy number variations (CNV). FOXM1-induced genomic instability was significantly enhanced and accumulated with increasing cell passage and this instability was increased even further upon exposure to UVB resulting in whole chromosomal gain (7p21.3-7q36.3) and segmental LOH (6q25.1-6q25.3). Conclusion We hypothesise that prolonged and repeated UVB exposure selects for skin cells bearing stable FOXM1 protein causes aberrant cell cycle checkpoint thereby allowing ectopic cell cycle entry and subsequent genomic instability. The aberrant upregulation of FOXM1

  19. Genomic characterization of explant tumorgraft models derived from fresh patient tumor tissue

    PubMed Central

    2012-01-01

    Background There is resurgence within drug and biomarker development communities for the use of primary tumorgraft models as improved predictors of patient tumor response to novel therapeutic strategies. Despite perceived advantages over cell line derived xenograft models, there is limited data comparing the genotype and phenotype of tumorgrafts to the donor patient tumor, limiting the determination of molecular relevance of the tumorgraft model. This report directly compares the genomic characteristics of patient tumors and the derived tumorgraft models, including gene expression, and oncogenic mutation status. Methods Fresh tumor tissues from 182 cancer patients were implanted subcutaneously into immune-compromised mice for the development of primary patient tumorgraft models. Histological assessment was performed on both patient tumors and the resulting tumorgraft models. Somatic mutations in key oncogenes and gene expression levels of resulting tumorgrafts were compared to the matched patient tumors using the OncoCarta (Sequenom, San Diego, CA) and human gene microarray (Affymetrix, Santa Clara, CA) platforms respectively. The genomic stability of the established tumorgrafts was assessed across serial in vivo generations in a representative subset of models. The genomes of patient tumors that formed tumorgrafts were compared to those that did not to identify the possible molecular basis to successful engraftment or rejection. Results Fresh tumor tissues from 182 cancer patients were implanted into immune-compromised mice with forty-nine tumorgraft models that have been successfully established, exhibiting strong histological and genomic fidelity to the originating patient tumors. Comparison of the transcriptomes and oncogenic mutations between the tumorgrafts and the matched patient tumors were found to be stable across four tumorgraft generations. Not only did the various tumors retain the differentiation pattern, but supporting stromal elements were preserved

  20. WheatGenome.info: A Resource for Wheat Genomics Resource.

    PubMed

    Lai, Kaitao

    2016-01-01

    An integrated database with a variety of Web-based systems named WheatGenome.info hosting wheat genome and genomic data has been developed to support wheat research and crop improvement. The resource includes multiple Web-based applications, which are implemented as a variety of Web-based systems. These include a GBrowse2-based wheat genome viewer with BLAST search portal, TAGdb for searching wheat second generation genome sequence data, wheat autoSNPdb, links to wheat genetic maps using CMap and CMap3D, and a wheat genome Wiki to allow interaction between diverse wheat genome sequencing activities. This portal provides links to a variety of wheat genome resources hosted at other research organizations. This integrated database aims to accelerate wheat genome research and is freely accessible via the web interface at http://www.wheatgenome.info/ .

  1. A genome-wide scan in affected sibling pairs with idiopathic recurrent miscarriage suggests genetic linkage.

    PubMed

    Kolte, A M; Nielsen, H S; Moltke, I; Degn, B; Pedersen, B; Sunde, L; Nielsen, F C; Christiansen, O B

    2011-06-01

    Previously, siblings of patients with idiopathic recurrent miscarriage (IRM) have been shown to have a higher risk of miscarriage. This study comprises two parts: (i) an epidemiological part, in which we introduce data on the frequency of miscarriage among 268 siblings of 244 patients with IRM and (ii) a genetic part presenting data from a genome-wide linkage study of 38 affected sibling pairs with IRM. All IRM patients (probands) had experienced three or more miscarriages and affected siblings two or more miscarriages. The sibling pairs were genotyped by the Affymetrix GeneChip 50K XbaI platform and non-parametric linkage analysis was performed via the software package Merlin. We find that siblings of IRM patients exhibit a higher frequency of miscarriage than population controls regardless of age at the time of pregnancy. We identify chromosomal regions with LOD scores between 2.5 and 3.0 in subgroups of affected sibling pairs. Maximum LOD scores were identified in four occurrences: for rs10514716 (3p14.2) when analyzing sister-pairs only; for rs10511668 (9p22.1) and rs341048 (11q13.4) when only analyzing families where the probands have had four or more miscarriages; and for rs10485275 (6q16.3) when analyzing one sibling pair from each family only. We identify no founder mutations. Concluding, our results imply that IRM patients and their siblings share factors which increase the risk of miscarriage. In this first genome-wide linkage study of affected sibling pairs with IRM, we identify regions on chromosomes 3, 6, 9 and 11 which warrant further investigation in order to elucidate their putative roles in the genesis of IRM.

  2. Whole-Genome Transcriptional Analysis of Heavy Metal Stresses in Caulobacter crescentus†

    PubMed Central

    Hu, Ping; Brodie, Eoin L.; Suzuki, Yohey; McAdams, Harley H.; Andersen, Gary L.

    2005-01-01

    The bacterium Caulobacter crescentus and related stalk bacterial species are known for their distinctive ability to live in low-nutrient environments, a characteristic of most heavy metal-contaminated sites. Caulobacter crescentus is a model organism for studying cell cycle regulation with well-developed genetics. We have identified the pathways responding to heavy-metal toxicity in C. crescentus to provide insights for the possible application of Caulobacter to environmental restoration. We exposed C. crescentus cells to four heavy metals (chromium, cadmium, selenium, and uranium) and analyzed genome-wide transcriptional activities postexposure using an Affymetrix GeneChip microarray. C. crescentus showed surprisingly high tolerance to uranium, a possible mechanism for which may be the formation of extracellular calcium-uranium-phosphate precipitates. The principal response to these metals was protection against oxidative stress (up-regulation of manganese-dependent superoxide dismutase sodA). Glutathione S-transferase, thioredoxin, glutaredoxins, and DNA repair enzymes responded most strongly to cadmium and chromate. The cadmium and chromium stress response also focused on reducing the intracellular metal concentration, with multiple efflux pumps employed to remove cadmium, while a sulfate transporter was down-regulated to reduce nonspecific uptake of chromium. Membrane proteins were also up-regulated in response to most of the metals tested. A two-component signal transduction system involved in the uranium response was identified. Several differentially regulated transcripts from regions previously not known to encode proteins were identified, demonstrating the advantage of evaluating the transcriptome by using whole-genome microarrays. PMID:16321948

  3. Systematic, genome-wide, sex-specific linkage of cardiovascular traits in French Canadians.

    PubMed

    Seda, Ondrej; Tremblay, Johanne; Gaudet, Daniel; Brunelle, Pierre-Luc; Gurau, Alexandru; Merlo, Ettore; Pilote, Louise; Orlov, Sergei N; Boulva, Francis; Petrovich, Milan; Kotchen, Theodore A; Cowley, Allen W; Hamet, Pavel

    2008-04-01

    The sexual dimorphism of cardiovascular traits, as well as susceptibility to a variety of related diseases, has long been recognized, yet their sex-specific genomic determinants are largely unknown. We systematically assessed the sex-specific heritability and linkage of 539 hemodynamic, metabolic, anthropometric, and humoral traits in 120 French-Canadian families from the Saguenay-Lac-St-Jean region of Quebec, Canada. We performed multipoint linkage analysis using microsatellite markers followed by peak-wide linkage scan based on Affymetrix Human Mapping 50K Array Xba240 single nucleotide polymorphism genotypes in 3 settings, including the entire sample and then separately in men and women. Nearly one half of the traits were age and sex independent, one quarter were both age and sex dependent, and one eighth were exclusively age or sex dependent. Sex-specific phenotypes are most frequent in heart rate and blood pressure categories, whereas sex- and age-independent determinants are predominant among humoral and biochemical parameters. Twenty sex-specific loci passing multiple testing criteria were corroborated by 2-point single nucleotide polymorphism linkage. Several resting systolic blood pressure measurements showed significant genotype-by-sex interaction, eg, male-specific locus at chromosome 12 (male-female logarithm of odds difference: 4.16; interaction P=0.0002), which was undetectable in the entire population, even after adjustment for sex. Detailed interrogation of this locus revealed a 220-kb block overlapping parts of TAO-kinase 3 and SUDS3 genes. In summary, a large number of complex cardiovascular traits display significant sexual dimorphism, for which we have demonstrated genomic determinants at the haplotype level. Many of these would have been missed in a traditional, sex-adjusted setting.

  4. Genome-wide SNP analysis of the Systemic Capillary Leak Syndrome (Clarkson disease)

    PubMed Central

    Xie, Zhihui; Nagarajan, Vijayaraj; Sturdevant, Daniel E; Iwaki, Shoko; Chan, Eunice; Wisch, Laura; Young, Michael; Nelson, Celeste M; Porcella, Stephen F; Druey, Kirk M

    2013-01-01

    The Systemic Capillary Leak Syndrome (SCLS) is an extremely rare, orphan disease that resembles, and is frequently erroneously diagnosed as, systemic anaphylaxis. The disorder is characterized by repeated, transient, and seemingly unprovoked episodes of hypotensive shock and peripheral edema due to transient endothelial hyperpermeability. SCLS is often accompanied by a monoclonal gammopathy of unknown significance (MGUS). Using Affymetrix Single Nucleotide Polymorphism (SNP) microarrays, we performed the first genome-wide SNP analysis of SCLS in a cohort of 12 disease subjects and 18 controls. Exome capture sequencing was performed on genomic DNA from nine of these patients as validation for the SNP-chip discoveries and de novo data generation. We identified candidate susceptibility loci for SCLS, which included a region flanking CAV3 (3p25.3) as well as SNP clusters in PON1 (7q21.3), PSORS1C1 (6p21.3), and CHCHD3 (7q33). Among the most highly ranked discoveries were gene-associated SNPs in the uncharacterized LOC100130480 gene (rs6417039, rs2004296). Top case-associated SNPs were observed in BTRC (rs12355803, 3rs4436485), ARHGEF18 (rs11668246), CDH13 (rs4782779), and EDG2 (rs12552348), which encode proteins with known or suspected roles in B cell function and/or vascular integrity. 61 SNPs that were significantly associated with SCLS by microarray analysis were also detected and validated by exome deep sequencing. Functional annotation of highly ranked SNPs revealed enrichment of cell projections, cell junctions and adhesion, and molecules containing pleckstrin homology, Ras/Rho regulatory, and immunoglobulin Ig-like C2/fibronectin type III domains, all of which involve mechanistic functions that correlate with the SCLS phenotype. These results highlight SNPs with potential relevance to SCLS. PMID:24808988

  5. A genome-wide association study of early spontaneous preterm delivery.

    PubMed

    Zhang, Heping; Baldwin, Don A; Bukowski, Radek K; Parry, Samuel; Xu, Yaji; Song, Chi; Andrews, William W; Saade, George R; Esplin, M Sean; Sadovsky, Yoel; Reddy, Uma M; Ilekis, John; Varner, Michael; Biggio, Joseph R

    2015-03-01

    Preterm birth is the leading cause of infant morbidity and mortality. Despite extensive research, the genetic contributions to spontaneous preterm birth (SPTB) are not well understood. Term controls were matched with cases by race/ethnicity, maternal age, and parity prior to recruitment. Genotyping was performed using Affymetrix SNP Array 6.0 assays. Statistical analyses utilized PLINK to compare allele occurrence rates between case and control groups, and incorporated quality control and multiple-testing adjustments. We analyzed DNA samples from mother-infant pairs from early SPTB cases (20(0/7)-33(6/7) weeks, 959 women and 979 neonates) and term delivery controls (39(0/7)-41(6/7) weeks, 960 women and 985 neonates). For validation purposes, we included an independent validation cohort consisting of early SPTB cases (293 mothers and 243 infants) and term controls (200 mothers and 149 infants). Clustering analysis revealed no population stratification. Multiple maternal SNPs were identified with association P-values between 10×10(-5) and 10×10(-6). The most significant maternal SNP was rs17053026 on chromosome 3 with an odds ratio (OR) 0.44 with a P-value of 1.0×10(-6). Two neonatal SNPs reached the genome-wide significance threshold, including rs17527054 on chromosome 6p22 with a P-value of 2.7×10(-12) and rs3777722 on chromosome 6q27 with a P-value of 1.4×10(-10). However, we could not replicate these findings after adjusting for multiple comparisons in a validation cohort. This is the first report of a genome-wide case-control study to identify single nucleotide polymorphisms (SNPs) that correlate with SPTB.

  6. Genome-Wide Association Study for Autism Spectrum Disorder in Taiwanese Han Population

    PubMed Central

    Kuo, Po-Hsiu; Chuang, Li-Chung; Su, Mei-Hsin; Chen, Chia-Hsiang; Chen, Chien-Hsiun; Wu, Jer-Yuarn; Yen, Chung-Jen; Wu, Yu-Yu; Liu, Shih-Kai; Chou, Miao-Chun; Chou, Wen-Jiun; Chiu, Yen-Nan; Tsai, Wen-Che; Gau, Susan Shur-Fen

    2015-01-01

    Background Autism spectrum disorder (ASD) is a neurodevelopmental disorder with strong genetic components. Several recent genome-wide association (GWA) studies in Caucasian samples have reported a number of gene regions and loci correlated with the risk of ASD—albeit with very little consensus across studies. Methods A two-stage GWA study was employed to identify common genetic variants for ASD in the Taiwanese Han population. The discovery stage included 315 patients with ASD and 1,115 healthy controls, using the Affymetrix SNP array 6.0 platform for genotyping. Several gene regions were then selected for fine-mapping and top markers were examined in extended samples. Single marker, haplotype, gene-based, and pathway analyses were conducted for associations. Results Seven SNPs had p-values ranging from 3.4~9.9*10−6, but none reached the genome-wide significant level. Five of them were mapped to three known genes (OR2M4, STYK1, and MNT) with significant empirical gene-based p-values in OR2M4 (p = 3.4*10−5) and MNT (p = 0.0008). Results of the fine-mapping study showed single-marker associations in the GLIS1 (rs12082358 and rs12080993) and NAALADL2 (rs3914502 and rs2222447) genes, and gene-based associations for the OR2M3-OR2T5 (olfactory receptor genes, p = 0.02), and GLIPR1/KRR1 gene regions (p = 0.015). Pathway analyses revealed important pathways for ASD, such as olfactory and G protein–coupled receptors signaling pathways. Conclusions We reported Taiwanese Han specific susceptibility genes and variants for ASD. However, further replication in other Asian populations is warranted to validate our findings. Investigation in the biological functions of our reported genetic variants might also allow for better understanding on the underlying pathogenesis of autism. PMID:26398136

  7. Genome-wide Association Study of Autism Spectrum Disorder in the East Asian Populations.

    PubMed

    Liu, Xiaoxi; Shimada, Takafumi; Otowa, Takeshi; Wu, Yu-Yu; Kawamura, Yoshiya; Tochigi, Mamoru; Iwata, Yasuhide; Umekage, Tadashi; Toyota, Tomoko; Maekawa, Motoko; Iwayama, Yoshimi; Suzuki, Katsuaki; Kakiuchi, Chihiro; Kuwabara, Hitoshi; Kano, Yukiko; Nishida, Hisami; Sugiyama, Toshiro; Kato, Nobumasa; Chen, Chia-Hsiang; Mori, Norio; Yamada, Kazuo; Yoshikawa, Takeo; Kasai, Kiyoto; Tokunaga, Katsushi; Sasaki, Tsukasa; Gau, Susan Shur-Fen

    2016-03-01

    Autism spectrum disorder is a heterogeneous neurodevelopmental disorder with strong genetic basis. To identify common genetic variations conferring the risk of ASD, we performed a two-stage genome-wide association study using ASD family and healthy control samples obtained from East Asian populations. A total of 166 ASD families (n = 500) and 642 healthy controls from the Japanese population were used as the discovery cohort. Approximately 900,000 single nucleotide polymorphisms (SNPs) were genotyped using Affymetrix Genome-Wide Human SNP array 6.0 chips. In the replication stage, 205 Japanese ASD cases and 184 healthy controls, as well as 418 Chinese Han trios (n = 1,254), were genotyped by TaqMan platform. Case-control analysis, family based association test, and transmission/disequilibrium test (TDT) were then conducted to test the association. In the discovery stage, significant associations were suggested for 14 loci, including 5 known ASD candidate genes: GPC6, JARID2, YTHDC2, CNTN4, and CSMD1. In addition, significant associations were identified for several novel genes with intriguing functions, such as JPH3, PTPRD, CUX1, and RIT2. After a meta-analysis combining the Japanese replication samples, the strongest signal was found at rs16976358 (P = 6.04 × 10(-7)), which is located near the RIT2 gene. In summary, our results provide independent support to known ASD candidate genes and highlight a number of novel genes warranted to be further investigated in a larger sample set in an effort to improve our understanding of the genetic basis of ASD.

  8. Translational genomics for plant breeding with the genome sequence explosion.

    PubMed

    Kang, Yang Jae; Lee, Taeyoung; Lee, Jayern; Shim, Sangrea; Jeong, Haneul; Satyawan, Dani; Kim, Moon Young; Lee, Suk-Ha

    2016-04-01

    The use of next-generation sequencers and advanced genotyping technologies has propelled the field of plant genomics in model crops and plants and enhanced the discovery of hidden bridges between genotypes and phenotypes. The newly generated reference sequences of unstudied minor plants can be annotated by the knowledge of model plants via translational genomics approaches. Here, we reviewed the strategies of translational genomics and suggested perspectives on the current databases of genomic resources and the database structures of translated information on the new genome. As a draft picture of phenotypic annotation, translational genomics on newly sequenced plants will provide valuable assistance for breeders and researchers who are interested in genetic studies.

  9. Genomes to Proteomes

    SciTech Connect

    Panisko, Ellen A.; Grigoriev, Igor; Daly, Don S.; Webb-Robertson, Bobbie-Jo; Baker, Scott E.

    2009-03-01

    Biologists are awash with genomic sequence data. In large part, this is due to the rapid acceleration in the generation of DNA sequence that occurred as public and private research institutes raced to sequence the human genome. In parallel with the large human genome effort, mostly smaller genomes of other important model organisms were sequenced. Projects following on these initial efforts have made use of technological advances and the DNA sequencing infrastructure that was built for the human and other organism genome projects. As a result, the genome sequences of many organisms are available in high quality draft form. While in many ways this is good news, there are limitations to the biological insights that can be gleaned from DNA sequences alone; genome sequences offer only a bird's eye view of the biological processes endemic to an organism or community. Fortunately, the genome sequences now being produced at such a high rate can serve as the foundation for other global experimental platforms such as proteomics. Proteomic methods offer a snapshot of the proteins present at a point in time for a given biological sample. Current global proteomics methods combine enzymatic digestion, separations, mass spectrometry and database searching for peptide identification. One key aspect of proteomics is the prediction of peptide sequences from mass spectrometry data. Global proteomic analysis uses computational matching of experimental mass spectra with predicted spectra based on databases of gene models that are often generated computationally. Thus, the quality of gene models predicted from a genome sequence is crucial in the generation of high quality peptide identifications. Once peptides are identified they can be assigned to their parent protein. Proteins identified as expressed in a given experiment are most useful when compared to other expressed proteins in a larger biological context or biochemical pathway. In this chapter we will discuss the automatic

  10. Genome-Wide Association Study to Identify Single Nucleotide Polymorphisms (SNPs) Associated With the Development of Erectile Dysfunction in African-American Men After Radiotherapy for Prostate Cancer

    SciTech Connect

    Kerns, Sarah L.; Ostrer, Harry; Stock, Richard; Li, William; Pearlman, Alexander; Campbell, Christopher; Shao Yongzhao; Stone, Nelson; Kusnetz, Lynda; Rosenstein, Barry S.

    2010-12-01

    Purpose: To identify single nucleotide polymorphisms (SNPs) associated with erectile dysfunction (ED) among African-American prostate cancer patients treated with external beam radiation therapy. Methods and Materials: A cohort of African-American prostate cancer patients treated with external beam radiation therapy was observed for the development of ED by use of the five-item Sexual Health Inventory for Men (SHIM) questionnaire. Final analysis included 27 cases (post-treatment SHIM score {<=}7) and 52 control subjects (post-treatment SHIM score {>=}16). A genome-wide association study was performed using approximately 909,000 SNPs genotyped on Affymetrix 6.0 arrays (Affymetrix, Santa Clara, CA). Results: We identified SNP rs2268363, located in the follicle-stimulating hormone receptor (FSHR) gene, as significantly associated with ED after correcting for multiple comparisons (unadjusted p = 5.46 x 10{sup -8}, Bonferroni p = 0.028). We identified four additional SNPs that tended toward a significant association with an unadjusted p value < 10{sup -6}. Inference of population substructure showed that cases had a higher proportion of African ancestry than control subjects (77% vs. 60%, p = 0.005). A multivariate logistic regression model that incorporated estimated ancestry and four of the top-ranked SNPs was a more accurate classifier of ED than a model that included only clinical variables. Conclusions: To our knowledge, this is the first genome-wide association study to identify SNPs associated with adverse effects resulting from radiotherapy. It is important to note that the SNP that proved to be significantly associated with ED is located within a gene whose encoded product plays a role in male gonad development and function. Another key finding of this project is that the four SNPs most strongly associated with ED were specific to persons of African ancestry and would therefore not have been identified had a cohort of European ancestry been screened. This study

  11. Genomics for Weed Science

    PubMed Central

    Horvath, David

    2010-01-01

    Numerous genomic-based studies have provided insight to the physiological and evolutionary processes involved in developmental and environmental processes of model plants such as arabidopsis and rice. However, far fewer efforts have been attempted to use genomic resources to study physiological and evolutionary processes of weedy plants. Genomics-based tools such as extensive EST databases and microarrays have been developed for a limited number of weedy species, although application of information and resources developed for model plants and crops are possible and have been exploited. These tools have just begun to provide insights into the response of these weeds to herbivore and pathogen attack, survival of extreme environmental conditions, and interaction with crops. The potential of these tools to illuminate mechanisms controlling the traits that allow weeds to invade novel habitats, survive extreme environments, and that make weeds difficult to eradicate have potential for both improving crops and developing novel methods to control weeds. PMID:20808523

  12. Genes, genome and Gestalt.

    PubMed

    Grisolia, Cesar Koppe

    2005-03-31

    According to Gestalt thinking, biological systems cannot be viewed as the sum of their elements, but as processes of the whole. To understand organisms we must start from the whole, observing how the various parts are related. In genetics, we must observe the genome over and above the sum of its genes. Either loss or addition of one gene in a genome can change the function of the organism. Genomes are organized in networks of genes, which need to be well integrated. In the case of genetically modified organisms (GMOs), for example, soybeans, rats, Anopheles mosquitoes, and pigs, the insertion of an exogenous gene into a receptive organism generally causes disturbance in the networks, resulting in the breakdown of gene interactions. In these cases, genetic modification increased the genetic load of the GMO and consequently decreased its adaptability (fitness). Therefore, it is hard to claim that the production of such organisms with an increased genetic load does not have ethical implications.

  13. Genomics of Preterm Birth

    PubMed Central

    Swaggart, Kayleigh A.; Pavlicev, Mihaela; Muglia, Louis J.

    2015-01-01

    The molecular mechanisms controlling human birth timing at term, or resulting in preterm birth, have been the focus of considerable investigation, but limited insights have been gained over the past 50 years. In part, these processes have remained elusive because of divergence in reproductive strategies and physiology shown by model organisms, making extrapolation to humans uncertain. Here, we summarize the evolution of progesterone signaling and variation in pregnancy maintenance and termination. We use this comparative physiology to support the hypothesis that selective pressure on genomic loci involved in the timing of parturition have shaped human birth timing, and that these loci can be identified with comparative genomic strategies. Previous limitations imposed by divergence of mechanisms provide an important new opportunity to elucidate fundamental pathways of parturition control through increasing availability of sequenced genomes and associated reproductive physiology characteristics across diverse organisms. PMID:25646385

  14. Genomics of preterm birth.

    PubMed

    Swaggart, Kayleigh A; Pavlicev, Mihaela; Muglia, Louis J

    2015-02-02

    The molecular mechanisms controlling human birth timing at term, or resulting in preterm birth, have been the focus of considerable investigation, but limited insights have been gained over the past 50 years. In part, these processes have remained elusive because of divergence in reproductive strategies and physiology shown by model organisms, making extrapolation to humans uncertain. Here, we summarize the evolution of progesterone signaling and variation in pregnancy maintenance and termination. We use this comparative physiology to support the hypothesis that selective pressure on genomic loci involved in the timing of parturition have shaped human birth timing, and that these loci can be identified with comparative genomic strategies. Previous limitations imposed by divergence of mechanisms provide an important new opportunity to elucidate fundamental pathways of parturition control through increasing availability of sequenced genomes and associated reproductive physiology characteristics across diverse organisms.

  15. Genomics for weed science.

    PubMed

    Horvath, David

    2010-03-01

    Numerous genomic-based studies have provided insight to the physiological and evolutionary processes involved in developmental and environmental processes of model plants such as arabidopsis and rice. However, far fewer efforts have been attempted to use genomic resources to study physiological and evolutionary processes of weedy plants. Genomics-based tools such as extensive EST databases and microarrays have been developed for a limited number of weedy species, although application of information and resources developed for model plants and crops are possible and have been exploited. These tools have just begun to provide insights into the response of these weeds to herbivore and pathogen attack, survival of extreme environmental conditions, and interaction with crops. The potential of these tools to illuminate mechanisms controlling the traits that allow weeds to invade novel habitats, survive extreme environments, and that make weeds difficult to eradicate have potential for both improving crops and developing novel methods to control weeds.

  16. Genomics of Salmonella Species

    NASA Astrophysics Data System (ADS)

    Canals, Rocio; McClelland, Michael; Santiviago, Carlos A.; Andrews-Polymenis, Helene

    Progress in the study of Salmonella survival, colonization, and virulence has increased rapidly with the advent of complete genome sequencing and higher capacity assays for transcriptomic and proteomic analysis. Although many of these techniques have yet to be used to directly assay Salmonella growth on foods, these assays are currently in use to determine Salmonella factors necessary for growth in animal models including livestock animals and in in vitro conditions that mimic many different environments. As sequencing of the Salmonella genome and microarray analysis have revolutionized genomics and transcriptomics of salmonellae over the last decade, so are new high-throughput sequencing technologies currently accelerating the pace of our studies and allowing us to approach complex problems that were not previously experimentally tractable.

  17. Genomics and drug discovery.

    PubMed

    Haseltine, W A

    2001-09-01

    Genomics, the systematic study of all the genes of an organism, offers a new and much-needed source of systematic productivity for the pharmaceutical industry. The isolation of the majority of human genes in their most useful form is leading to the creation of new drugs based on human proteins, antibodies, peptides, and genes. Human Genome Sciences, Inc, was the first company to use the systematic, genomics approach to discovering drugs, and we have placed 4 of these in clinical trials. Two are described: repifermin (keratinocyte growth factor-2, KGF-2) for wound healing and treatment of mucositis caused by cancer therapy, and B lymphocyte stimulator (BLyS) for stimulation of the immune system. An anti-BLyS antibody drug is in advanced preclinical development for treatment of autoimmune diseases.

  18. Genomics of Volvocine Algae

    PubMed Central

    Umen, James G.; Olson, Bradley J.S.C.

    2015-01-01

    Volvocine algae are a group of chlorophytes that together comprise a unique model for evolutionary and developmental biology. The species Chlamydomonas reinhardtii and Volvox carteri represent extremes in morphological diversity within the Volvocine clade. Chlamydomonas is unicellular and reflects the ancestral state of the group, while Volvox is multicellular and has evolved numerous innovations including germ-soma differentiation, sexual dimorphism, and complex morphogenetic patterning. The Chlamydomonas genome sequence has shed light on several areas of eukaryotic cell biology, metabolism and evolution, while the Volvox genome sequence has enabled a comparison with Chlamydomonas that reveals some of the underlying changes that enabled its transition to multicellularity, but also underscores the subtlety of this transition. Many of the tools and resources are in place to further develop Volvocine algae as a model for evolutionary genomics. PMID:25883411

  19. Ebolavirus comparative genomics.

    PubMed

    Jun, Se-Ran; Leuze, Michael R; Nookaew, Intawat; Uberbacher, Edward C; Land, Miriam; Zhang, Qian; Wanchai, Visanu; Chai, Juanjuan; Nielsen, Morten; Trolle, Thomas; Lund, Ole; Buzard, Gregory S; Pedersen, Thomas D; Wassenaar, Trudy M; Ussery, David W

    2015-09-01

    The 2014 Ebola outbreak in West Africa is the largest documented for this virus. To examine the dynamics of this genome, we compare more than 100 currently available ebolavirus genomes to each other and to other viral genomes. Based on oligomer frequency analysis, the family Filoviridae forms a distinct group from all other sequenced viral genomes. All filovirus genomes sequenced to date encode proteins with similar functions and gene order, although there is considerable divergence in sequences between the three genera Ebolavirus, Cuevavirus and Marburgvirus within the family Filoviridae. Whereas all ebolavirus genomes are quite similar (multiple sequences of the same strain are often identical), variation is most common in the intergenic regions and within specific areas of the genes encoding the glycoprotein (GP), nucleoprotein (NP) and polymerase (L). We predict regions that could contain epitope-binding sites, which might be good vaccine targets. This information, combined with glycosylation sites and experimentally determined epitopes, can identify the most promising regions for the development of therapeutic strategies.This manuscript has been authored by UT-Battelle, LLC under Contract No. DE-AC05-00OR22725 with the U.S. Department of Energy. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes. The Department of Energy will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan).

  20. Ebolavirus comparative genomics

    PubMed Central

    Jun, Se-Ran; Leuze, Michael R.; Nookaew, Intawat; Uberbacher, Edward C.; Land, Miriam; Zhang, Qian; Wanchai, Visanu; Chai, Juanjuan; Nielsen, Morten; Trolle, Thomas; Lund, Ole; Buzard, Gregory S.; Pedersen, Thomas D.; Wassenaar, Trudy M.; Ussery, David W.

    2015-01-01

    The 2014 Ebola outbreak in West Africa is the largest documented for this virus. To examine the dynamics of this genome, we compare more than 100 currently available ebolavirus genomes to each other and to other viral genomes. Based on oligomer frequency analysis, the family Filoviridae forms a distinct group from all other sequenced viral genomes. All filovirus genomes sequenced to date encode proteins with similar functions and gene order, although there is considerable divergence in sequences between the three genera Ebolavirus, Cuevavirus and Marburgvirus within the family Filoviridae. Whereas all ebolavirus genomes are quite similar (multiple sequences of the same strain are often identical), variation is most common in the intergenic regions and within specific areas of the genes encoding the glycoprotein (GP), nucleoprotein (NP) and polymerase (L). We predict regions that could contain epitope-binding sites, which might be good vaccine targets. This information, combined with glycosylation sites and experimentally determined epitopes, can identify the most promising regions for the development of therapeutic strategies. This manuscript has been authored by UT-Battelle, LLC under Contract No. DE-AC05-00OR22725 with the U.S. Department of Energy. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes. The Department of Energy will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan). PMID:26175035

  1. Landscape evolutionary genomics.

    PubMed

    Lowry, David B

    2010-08-23

    Tremendous advances in genetic and genomic techniques have resulted in the capacity to identify genes involved in adaptive evolution across numerous biological systems. One of the next major steps in evolutionary biology will be to determine how landscape-level geographical and environmental features are involved in the distribution of this functional adaptive genetic variation. Here, I outline how an emerging synthesis of multiple disciplines has and will continue to facilitate a deeper understanding of the ways in which heterogeneity of the natural landscapes mould the genomes of organisms.

  2. The cancer genome

    PubMed Central

    Stratton, Michael R.; Campbell, Peter J.; Futreal, P. Andrew

    2010-01-01

    All cancers arise as a result of changes that have occurred in the DNA sequence of the genomes of cancer cells. Over the past quarter of a century much has been learnt about these mutations and the abnormal genes that operate in human cancers. We are now, however, moving into an era in which it will be possible to obtain the complete DNA sequence of large numbers of cancer genomes. These studies will provide us with a detailed and comprehensive perspective on how individual cancers have developed. PMID:19360079

  3. The genomics of mycobacteria.

    PubMed

    Viale, M N; Zumárraga, M J; Araújo, F R; Zarraga, A M; Cataldi, A A; Romano, M I; Bigi, F

    2016-04-01

    The species Mycobacterium bovis and Mycobacterium avium subspecies paratuberculosis are the causal agents, respectively, of tuberculosis and paratuberculosis in animals. Both mycobacteria, especially M. bovis, are also important to public health because they can infect humans. In recent years, this and the impact of tuberculosis and paratuberculosis on animal production have led to significant advances in knowledge about both pathogens and their host interactions. This article describes the contribution of genomics and functional genomics to studies of the evolution, virulence, epidemiology and diagnosis of both these pathogenic mycobacteria.

  4. Methanococcus jannaschii genome: revisited

    NASA Technical Reports Server (NTRS)

    Kyrpides, N. C.; Olsen, G. J.; Klenk, H. P.; White, O.; Woese, C. R.

    1996-01-01

    Analysis of genomic sequences is necessarily an ongoing process. Initial gene assignments tend (wisely) to be on the conservative side (Venter, 1996). The analysis of the genome then grows in an iterative fashion as additional data and more sophisticated algorithms are brought to bear on the data. The present report is an emendation of the original gene list of Methanococcus jannaschii (Bult et al., 1996). By using a somewhat more updated database and more relaxed (and operator-intensive) pattern matching methods, we were able to add significantly to, and in a few cases amend, the gene identification table originally published by Bult et al. (1996).

  5. Brief Guide to Genomics: DNA, Genes and Genomes

    MedlinePlus

    ... guía de genómica A Brief Guide to Genomics DNA, Genes and Genomes Deoxyribonucleic acid (DNA) is the ... and lead to a disease such as cancer. DNA Sequencing Sequencing simply means determining the exact order ...

  6. Visualizing Genomic Annotations with the UCSC Genome Browser.

    PubMed

    Hung, Jui-Hung; Weng, Zhiping

    2016-11-01

    Genomic data and annotations are rapidly accumulating in databases such as the UCSC Genome Browser, NCBI, and Ensembl. Given the massive scale of these genomic databases, it is important to be able to easily retrieve known data and annotations of a specified genomic locus. For example, for a newly identified cis-regulatory element bound by a transcription factor, questions that immediately come to mind include whether the element is near a transcriptional start site and, if so, the name of the corresponding gene, and whether the histones or DNA at the locus are modified. The UCSC Genome Browser organizes data and annotations (called tracks) around the reference sequences or draft assemblies of many eukaryotic genomes and presents them using a powerful web-based graphical interface. This protocol describes how to use the UCSC Genome Browser to visualize selected tracks at specified genomic regions, download the data and annotations for further analysis, and retrieve multiple sequence alignments and their conservation scores.

  7. Genome-Wide Differences in DNA Methylation Changes in Two Contrasting Rice Genotypes in Response to Drought Conditions

    PubMed Central

    Wang, Wensheng; Qin, Qiao; Sun, Fan; Wang, Yinxiao; Xu, Dandan; Li, Zhikang; Fu, Binying

    2016-01-01

    Differences in drought stress tolerance within diverse rice genotypes have been attributed to genetic diversity and epigenetic alterations. DNA methylation is an important epigenetic modification that influences diverse biological processes, but its effects on rice drought stress tolerance are poorly understood. In this study, methylated DNA immunoprecipitation sequencing and an Affymetrix GeneChip rice genome array were used to profile the DNA methylation patterns and transcriptomes of the drought-tolerant introgression line DK151 and its drought-sensitive recurrent parent IR64 under drought and control conditions. The introgression of donor genomic DNA induced genome-wide DNA methylation changes in DK151 plants. A total of 1190 differentially methylated regions (DMRs) were detected between the two genotypes under normal growth conditions, and the DMR-associated genes in DK151 plants were mainly related to stress response, programmed cell death, and nutrient reservoir activity, which are implicated to constitutive drought stress tolerance. A comparison of the DNA methylation changes in the two genotypes under drought conditions indicated that DK151 plants have a more stable methylome, with only 92 drought-induced DMRs, than IR64 plants with 506 DMRs. Gene ontology analyses of the DMR-associated genes in drought-stressed plants revealed that changes to the DNA methylation status of genotype-specific genes are associated with the epigenetic regulation of drought stress responses. Transcriptome analysis further helped to identify a set of 12 and 23 DMR-associated genes that were differentially expressed in DK151 and IR64, respectively, under drought stress compared with respective controls. Correlation analysis indicated that DNA methylation has various effects on gene expression, implying that it affects gene expression directly or indirectly through diverse regulatory pathways. Our results indicate that drought-induced alterations to DNA methylation may influence

  8. Genomic and genetic variability of six chicken populations using single nucleotide polymorphism and copy number variants as markers.

    PubMed

    Strillacci, M G; Cozzi, M C; Gorla, E; Mosca, F; Schiavini, F; Román-Ponce, S I; Ruiz López, F J; Schiavone, A; Marzoni, M; Cerolini, S; Bagnato, A

    2016-11-07

    Genomic and genetic variation among six Italian chicken native breeds (Livornese, Mericanel della Brianza, Milanino, Bionda Piemontese, Bianca di Saluzzo and Siciliana) were studied using single nucleotide polymorphism (SNP) and copy number variants (CNV) as markers. A total of 94 DNA samples genotyped with Axiom® Genome-Wide Chicken Genotyping Array (Affymetrix) were used in the analyses. The results showed the genetic and genomic variability occurring among the six Italian chicken breeds. The genetic relationship among animals was established with a principal component analysis. The genetic diversity within breeds was calculated using heterozygosity values (expected and observed) and with Wright's F-statistics. The individual-based CNV calling, based on log R ratio and B-allele frequency values, was done by the Hidden-Markov Model (HMM) of PennCNV software on autosomes. A hierarchical agglomerative clustering was applied in each population according to the absence or presence of definite CNV regions (CNV were grouped by overlapping of at least 1 bp). The CNV map was built on a total of 1003 CNV found in individual samples, after grouping by overlaps, resulting in 564 unique CNV regions (344 gains, 213 losses and 7 complex), for a total of 9.43 Mb of sequence and 1.03% of the chicken assembly autosome. All the approaches using SNP data showed that the Siciliana breed clearly differentiate from other populations, the Livornese breed separates into two distinct groups according to the feather colour (i.e. white and black) and the Bionda Piemontese and Bianca di Saluzzo breeds are closely related. The genetic variability found using SNP is comparable with that found by other authors in the same breeds using microsatellite markers. The CNV markers analysis clearly confirmed the SNP results.

  9. Whole mitochondrial genome screening in maternally inherited non-syndromic hearing impairment using a microarray resequencing mitochondrial DNA chip.

    PubMed

    Lévêque, Marianne; Marlin, Sandrine; Jonard, Laurence; Procaccio, Vincent; Reynier, Pascal; Amati-Bonneau, Patrizia; Baulande, Sylvain; Pierron, Denis; Lacombe, Didier; Duriez, Françoise; Francannet, Christine; Mom, Thierry; Journel, Hubert; Catros, Hélène; Drouin-Garraud, Valérie; Obstoy, Marie-Françoise; Dollfus, Hélène; Eliot, Marie-Madeleine; Faivre, Laurence; Duvillard, Christian; Couderc, Remy; Garabedian, Eréa-Noël; Petit, Christine; Feldmann, Delphine; Denoyelle, Françoise

    2007-11-01

    Mitochondrial DNA (mtDNA) mutations have been implicated in non-syndromic hearing loss either as primary or as predisposing factors. As only a part of the mitochondrial genome is usually explored in deafness, its prevalence is probably under-estimated. Among 1350 families with non-syndromic sensorineural hearing loss collected through a French collaborative network, we selected 29 large families with a clear maternal lineage and screened them for known mtDNA mutations in 12S rRNA, tRNASer(UCN) and tRNALeu(UUR) genes. When no mutation could be identified, a whole mitochondrial genome screening was performed, using a microarray resequencing chip: the MitoChip version 2.0 developed by Affymetrix Inc. Known mtDNA mutations was found in nine of the 29 families, which are described in the article: five with A1555G, two with the T7511C, one with 7472insC and one with A3243G mutation. In the remaining 20 families, the resequencing Mitochip detected 258 mitochondrial homoplasmic variants and 107 potentially heteroplasmic variants. Controls were made by direct sequencing on selected fragments and showed a high sensibility of the MitoChip but a low specificity, especially for heteroplasmic variations. An original analysis on the basis of species conservation, frequency and phylogenetic investigation was performed to select the more probably pathogenic variants. The entire genome analysis allowed us to identify five additional families with a putatively pathogenic mitochondrial variant: T669C, C1537T, G8078A, G12236A and G15077A. These results indicate that the new MitoChip platform is a rapid and valuable tool for identification of new mtDNA mutations in deafness.

  10. Genomics of Post-Prandial Lipidomic Phenotypes in the Genetics of Lipid Lowering Drugs and Diet Network (GOLDN) Study

    PubMed Central

    Irvin, Marguerite R.; Zhi, Degui; Aslibekyan, Stella; Claas, Steven A.; Absher, Devin M.; Ordovas, Jose M.; Tiwari, Hemant K.; Watkins, Steve; Arnett, Donna K.

    2014-01-01

    Background Increased postprandial lipid (PPL) response to dietary fat intake is a heritable risk factor for cardiovascular disease (CVD). Variability in postprandial lipids results from the complex interplay of dietary and genetic factors. We hypothesized that detailed lipid profiles (eg, sterols and fatty acids) may help elucidate specific genetic and dietary pathways contributing to the PPL response. Methods and Results We used gas chromatography mass spectrometry to quantify the change in plasma concentration of 35 fatty acids and 11 sterols between fasting and 3.5 hours after the consumption of a high-fat meal (PPL challenge) among 40 participants from the GOLDN study. Correlations between sterols, fatty acids and clinical measures were calculated. Mixed linear regression was used to evaluate associations between lipidomic profiles and genomic markers including single nucleotide polymorphisms (SNPs) and methylation markers derived from the Affymetrix 6.0 array and the Illumina Methyl450 array, respectively. After the PPL challenge, fatty acids increased as well as sterols associated with cholesterol absorption, while sterols associated with cholesterol synthesis decreased. PPL saturated fatty acids strongly correlated with triglycerides, very low-density lipoprotein, and chylomicrons. Two SNPs (rs12247017 and rs12240292) in the sorbin and SH3 domain containing 1 (SORBS1) gene were associated with b-Sitosterol after correction for multiple testing (P≤4.5*10−10). SORBS1 has been linked to obesity and insulin signaling. No other markers reached the genome-wide significance threshold, yet several other biologically relevant loci are highlighted (eg, PRIC285, a co-activator of PPARa). Conclusions Integration of lipidomic and genomic data has the potential to identify new biomarkers of CVD risk. PMID:24905834

  11. Center for Cancer Genomics | Office of Cancer Genomics

    Cancer.gov

    The Center for Cancer Genomics (CCG) was established to unify the National Cancer Institute's activities in cancer genomics, with the goal of advancing genomics research and translating findings into the clinic to improve the precise diagnosis and treatment of cancers. In addition to promoting genomic sequencing approaches, CCG aims to accelerate structural, functional and computational research to explore cancer mechanisms, discover new cancer targets, and develop new therapeutics.

  12. The tomato genome: implications for plant breeding, genomics and evolution

    PubMed Central

    2012-01-01

    The genome sequence of tomato (Solanum lycopersicum), one of the most important vegetable crops, has recently been decoded. We address implications of the tomato genome for plant breeding, genomics and evolutionary studies, and its potential to fuel future crop biology research. PMID:22943138

  13. Dynamic evolution of genomes and the concept of genome space.

    PubMed

    Bellgard, M I; Itoh, T; Watanabe, H; Imanishi, T; Gojobori, T

    1999-05-18

    A new era in the elucidation of genome evolution has been heralded with the availability of numerous genome sequences. With these data, it has been possible to study evolutionary processes at a greater level of detail in order to characterize features such as gene shuffling, genome rearrangements, base bias composition, and horizontal gene transfer. In this paper, we discuss the evolutionary implications of significant rearrangements within genomes as well as characteristic genomic regions that have been conserved across genomes. This is based on our analysis of orthologous and paralogous genes. We argue that genome plasticity has most likely contributed substantially to the dynamic evolution of genomes. We also describe the characteristic mosaic features of an archaea genome that is comprised of both bacterial and eukaryal elements. Here we investigate base compositional differences as well as the similarity of this species' genes to either bacteria or eukarya. We conclude that these features can be largely explained by the mechanism of horizontal gene transfer. Finally, we introduce the concept of genome space which is defined as the entire set of genomes of all living organisms. We explain its usefulness to describe as well as to gain deeper insight into the general features of the dynamic genomic evolutionary process.

  14. Metaplastic breast carcinomas display genomic and transcriptomic heterogeneity [corrected]. .

    PubMed

    Weigelt, Britta; Ng, Charlotte K Y; Shen, Ronglai; Popova, Tatiana; Schizas, Michail; Natrajan, Rachael; Mariani, Odette; Stern, Marc-Henri; Norton, Larry; Vincent-Salomon, Anne; Reis-Filho, Jorge S

    2015-03-01

    Metaplastic breast carcinoma is a rare and aggressive histologic type of breast cancer, preferentially displaying a triple-negative phenotype. We sought to define the transcriptomic heterogeneity of metaplastic breast cancers on the basis of current gene expression microarray-based classifiers, and to determine whether these tumors display gene copy number profiles consistent with those of BRCA1-associated breast cancers. Twenty-eight consecutive triple-negative metaplastic breast carcinomas were reviewed, and the metaplastic component present in each frozen specimen was defined (ie, spindle cell, squamous, chondroid metaplasia). RNA and DNA extracted from frozen sections with tumor cell content >60% were subjected to gene expression (Illumina HumanHT-12 v4) and copy number profiling (Affymetrix SNP 6.0), respectively. Using the best practice PAM50/claudin-low microarray-based classifier, all metaplastic breast carcinomas with spindle cell metaplasia were of claudin-low subtype, whereas those with squamous or chondroid metaplasia were preferentially of basal-like subtype. Triple-negative breast cancer subtyping using a dedicated website (http://cbc.mc.vanderbilt.edu/tnbc/) revealed that all metaplastic breast carcinomas with chondroid metaplasia were of mesenchymal-like subtype, spindle cell carcinomas preferentially of unstable or mesenchymal stem-like subtype, and those with squamous metaplasia were of multiple subtypes. None of the cases was classified as immunomodulatory or luminal androgen receptor subtype. Integrative clustering, combining gene expression and gene copy number data, revealed that metaplastic breast carcinomas with spindle cell and chondroid metaplasia were preferentially classified as of integrative clusters 4 and 9, respectively, whereas those with squamous metaplasia were classified into six different clusters. Eight of the 26 metaplastic breast cancers subjected to SNP6 analysis were classified as BRCA1-like. The diversity of histologic

  15. Genomic Data Commons launches - TCGA

    Cancer.gov

    The Genomic Data Commons (GDC), a unified data system that promotes sharing of genomic and clinical data between researchers, launched today with a visit from Vice President Joe Biden to the operations center at the University of Chicago.

  16. The Auxin Response Factor Transcription Factor Family in Soybean: Genome-Wide Identification and Expression Analyses During Development and Water Stress

    PubMed Central

    Van Ha, Chien; Le, Dung Tien; Nishiyama, Rie; Watanabe, Yasuko; Sulieman, Saad; Tran, Uyen Thi; Mochida, Keiichi; Van Dong, Nguyen; Yamaguchi-Shinozaki, Kazuko; Shinozaki, Kazuo; Tran, Lam-Son Phan

    2013-01-01

    In plants, the auxin response factor (ARF) transcription factors play important roles in regulating diverse biological processes, including development, growth, cell division and responses to environmental stimuli. An exhaustive search of soybean genome revealed 51 GmARFs, many of which were formed by genome duplications. The typical GmARFs (43 members) contain a DNA-binding domain, an ARF domain and an auxin/indole acetic acid (AUX/IAA) dimerization domain, whereas the remaining eight members lack the dimerization domain. Phylogenetic analysis of the ARFs from soybean and Arabidopsis revealed both similarity and divergence between the two ARF families, as well as enabled us to predict the functions of the GmARFs. Using quantitative real-time polymerase chain reaction (qRT-PCR) and available soybean Affymetrix array and Illumina transcriptome sequence data, a comprehensive expression atlas of GmARF genes was obtained in various organs and tissues, providing useful information about their involvement in defining the precise nature of individual tissues. Furthermore, expression profiling using qRT-PCR and microarray data revealed many water stress-responsive GmARFs in soybean, albeit with different patterns depending on types of tissues and/or developmental stages. Our systematic analysis has identified excellent tissue-specific and/or stress-responsive candidate GmARF genes for in-depth in planta functional analyses, which would lead to potential applications in the development of genetically modified soybean cultivars with enhanced drought tolerance. PMID:23810914

  17. RIKEN mouse genome encyclopedia.

    PubMed

    Hayashizaki, Yoshihide

    2003-01-01

    We have been working to establish the comprehensive mouse full-length cDNA collection and sequence database to cover as many genes as we can, named Riken mouse genome encyclopedia. Recently we are constructing higher-level annotation (Functional ANnoTation Of Mouse cDNA; FANTOM) not only with homology search based annotation but also with expression data profile, mapping information and protein-protein database. More than 1,000,000 clones prepared from 163 tissues were end-sequenced to classify into 159,789 clusters and 60,770 representative clones were fully sequenced. As a conclusion, the 60,770 sequences contained 33,409 unique. The next generation of life science is clearly based on all of the genome information and resources. Based on our cDNA clones we developed the additional system to explore gene function. We developed cDNA microarray system to print all of these cDNA clones, protein-protein interaction screening system, protein-DNA interaction screening system and so on. The integrated database of all the information is very useful not only for analysis of gene transcriptional network and for the connection of gene to phenotype to facilitate positional candidate approach. In this talk, the prospect of the application of these genome resourced should be discussed. More information is available at the web page: http://genome.gsc.riken.go.jp/.

  18. Better chocolate through genomics

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Theobroma cacao, the cacao or chocolate tree, is a tropical understory tree whose seeds are used to make chocolate. And like any important crop, cacao is the subject of much research. On September 15, 2010, scientists publicly released a preliminary sequence of the cacao genome--which contains all o...

  19. Prenatal Whole Genome Sequencing

    PubMed Central

    Donley, Greer; Hull, Sara Chandros; Berkman, Benjamin E.

    2014-01-01

    With whole genome sequencing set to become the preferred method of prenatal screening, we need to pay more attention to the massive amount of information it will deliver to parents—and the fact that we don't yet understand what most of it means. PMID:22777977

  20. The tomato genome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The tomato genome sequence was undertaken at a time when state-of-the-art sequencing methodologies were undergoing a transition to co-called next generation methodologies. The result was an international consortium undertaking a strategy merging both old and new approaches. Because biologists were...

  1. [Genomic instability in atherosclerosis].

    PubMed

    Dzhokhadze, T A; Buadze, T Zh; Gaiozishvili, M N; Kakauridze, N G; Lezhava, T A

    2014-11-01

    A comparative study of the level of genomic instability, parameters of quantitative and structural mutations of chromosomes (aberration, aneuploidy, polyploidy) in lymphocyte cultures from patients with atherosclerosis of age 80 years and older (control group - 30-35 years old) was conducted. The possibility of correction of disturbed genomic indicators by peptide bioregulators - Livagen (Lys-Glu-Asp-Ala) and cobalt ions with separate application or in combination was also studied. Control was lymphocyte culture of two healthy respective age groups. It was also shown that patients with atherosclerosis exhibit high level of genomic instability in all studied parameters, regardless of age, which may suggest that there is marked increase in chromatin condensation in atherosclerosis. It was also shown that Livagen (characterized by modifying influence on chromatin) separately and in combination with cobalt ions, promotes normalization of altered genomic indicators of atherosclerosis in both age groups. The results show that Livagen separately and in combination with cobalt ions has impact on chromatin of patients with atherosclerosis. The identified protective action of Livagen proves its efficacy in prevention of atherosclerosis.

  2. Poster: the macaque genome.

    PubMed

    2007-04-13

    The rhesus macaque (Macaca mulatta) facilitates an extraordinary range of biomedical and basic research, and the publication of the genome only makes it a more powerful model for studies of human disease; moreover, the macaque's position relative to humans and chimpanzees affords the opportunity to learn about the processes that have shaped the last 25 million years of primate evolution. To allow users to explore these themes of the macaque genome, Science has created a special interactive version of the poster published in the print edition of the 13 April 2007 issue. The interactive version includes additional text and exploration, as well as embedded video featuring seven scientists discussing the importance of the macaque and its genome sequence in studies of biomedicine and evolution. We have also created an accompanying teaching resource, including a lesson plan aimed at teachers of advanced high school life science students, for exploring what a comparison of the macaque and human genomes can tell us about human biology and evolution. These items are free to all site visitors.

  3. The Nostoc punctiforme Genome

    SciTech Connect

    John C. Meeks

    2001-12-31

    Nostoc punctiforme is a filamentous cyanobacterium with extensive phenotypic characteristics and a relatively large genome, approaching 10 Mb. The phenotypic characteristics include a photoautotrophic, diazotrophic mode of growth, but N. punctiforme is also facultatively heterotrophic; its vegetative cells have multiple development alternatives, including terminal differentiation into nitrogen-fixing heterocysts and transient differentiation into spore-like akinetes or motile filaments called hormogonia; and N. punctiforme has broad symbiotic competence with fungi and terrestrial plants, including bryophytes, gymnosperms and an angiosperm. The shotgun-sequencing phase of the N. punctiforme strain ATCC 29133 genome has been completed by the Joint Genome Institute. Annotation of an 8.9 Mb database yielded 7432 open reading frames, 45% of which encode proteins with known or probable known function and 29% of which are unique to N. punctiforme. Comparative analysis of the sequence indicates a genome that is highly plastic and in a state of flux, with numerous insertion sequences and multilocus repeats, as well as genes encoding transposases and DNA modification enzymes. The sequence also reveals the presence of genes encoding putative proteins that collectively define almost all characteristics of cyanobacteria as a group. N. punctiforme has an extensive potential to sense and respond to environmental signals as reflected by the presence of more than 400 genes encoding sensor protein kinases, response regulators and other transcriptional factors. The signal transduction systems and any of the large number of unique genes may play essential roles in the cell differentiation and symbiotic interaction properties of N. punctiforme.

  4. Ascaris suum draft genome.

    PubMed

    Jex, Aaron R; Liu, Shiping; Li, Bo; Young, Neil D; Hall, Ross S; Li, Yingrui; Yang, Linfeng; Zeng, Na; Xu, Xun; Xiong, Zijun; Chen, Fangyuan; Wu, Xuan; Zhang, Guojie; Fang, Xiaodong; Kang, Yi; Anderson, Garry A; Harris, Todd W; Campbell, Bronwyn E; Vlaminck, Johnny; Wang, Tao; Cantacessi, Cinzia; Schwarz, Erich M; Ranganathan, Shoba; Geldhof, Peter; Nejsum, Peter; Sternberg, Paul W; Yang, Huanming; Wang, Jun; Wang, Jian; Gasser, Robin B

    2011-10-26

    Parasitic diseases have a devastating, long-term impact on human health, welfare and food production worldwide. More than two billion people are infected with geohelminths, including the roundworms Ascaris (common roundworm), Necator and Ancylostoma (hookworms), and Trichuris (whipworm), mainly in developing or impoverished nations of Asia, Africa and Latin America. In humans, the diseases caused by these parasites result in about 135,000 deaths annually, with a global burden comparable with that of malaria or tuberculosis in disability-adjusted life years. Ascaris alone infects around 1.2 billion people and, in children, causes nutritional deficiency, impaired physical and cognitive development and, in severe cases, death. Ascaris also causes major production losses in pigs owing to reduced growth, failure to thrive and mortality. The Ascaris-swine model makes it possible to study the parasite, its relationship with the host, and ascariasis at the molecular level. To enable such molecular studies, we report the 273 megabase draft genome of Ascaris suum and compare it with other nematode genomes. This genome has low repeat content (4.4%) and encodes about 18,500 protein-coding genes. Notably, the A. suum secretome (about 750 molecules) is rich in peptidases linked to the penetration and degradation of host tissues, and an assemblage of molecules likely to modulate or evade host immune responses. This genome provides a comprehensive resource to the scientific community and underpins the development of new and urgently needed interventions (drugs, vaccines and diagnostic tests) against ascariasis and other nematodiases.

  5. (Genomic variation in maize)

    SciTech Connect

    Rivin, C.J.

    1991-01-01

    These studies have sought to learn how different DNA sequences and sequence arrangements contribute to genome plasticity in maize. We describe quantitative variation among maize inbred lines for tandemly arrayed and dispersed repeated DNA sequences and gene families, and qualitative variation for sequences homologous to the Mutator family of transposons. The potential of these sequences to undergo unequal crossing over, non-allelic (ectopic) recombination and transposition makes them a source of genome instability. We have found examples of rapid genomic change involving these sequences in Fl hybrids, tissue culture cells and regenerated plants. We describe the repetitive portion of the maize genome as composed primarily of sequences that vary markedly in copy number among different genetic stocks. The most highly variable is the 185 bp repeat associated with the heterochromatic chromosome knobs. Even in lines without visible knobs, there is a considerable quantity of tandemly arrayed repeats. We also found a high degree of variability for the tandemly arrayed 5S and ribosomal DNA repeats. While such variation might be expected as the result of unequal cross-over, we were surprised to find considerable variation among lower copy number, dispersed repeats as well. One highly repeated sequence that showed a complex tandem and dispersed arrangement stood out as showing no detectable variability among the maize lines. In striking contrast to the variability seen between the inbred stocks, individuals within a stock were indistinguishable with regard to their repeated sequence multiplicities.

  6. Genetics, genomics and fertility

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In order to enhance the sustainability of dairy businesses, new management tools are needed to increase the fertility of dairy cattle. Genomic selection has been successfully used by AI studs to screen potential sires and significantly decrease the generation interval of bulls. Buoyed by the success...

  7. The G4 Genome

    PubMed Central

    Maizels, Nancy; Gray, Lucas T.

    2013-01-01

    Recent experiments provide fascinating examples of how G4 DNA and G4 RNA structures—aka quadruplexes—may contribute to normal biology and to genomic pathologies. Quadruplexes are transient and therefore difficult to identify directly in living cells, which initially caused skepticism regarding not only their biological relevance but even their existence. There is now compelling evidence for functions of some G4 motifs and the corresponding quadruplexes in essential processes, including initiation of DNA replication, telomere maintenance, regulated recombination in immune evasion and the immune response, control of gene expression, and genetic and epigenetic instability. Recognition and resolution of quadruplex structures is therefore an essential component of genome biology. We propose that G4 motifs and structures that participate in key processes compose the G4 genome, analogous to the transcriptome, proteome, or metabolome. This is a new view of the genome, which sees DNA as not only a simple alphabet but also a more complex geography. The challenge for the future is to systematically identify the G4 motifs that form quadruplexes in living cells and the features that confer on specific G4 motifs the ability to function as structural elements. PMID:23637633

  8. The human genome project.

    PubMed Central

    Olson, M V

    1993-01-01

    The Human Genome Project in the United States is now well underway. Its programmatic direction was largely set by a National Research Council report issued in 1988. The broad framework supplied by this report has survived almost unchanged despite an upheaval in the technology of genome analysis. This upheaval has primarily affected physical and genetic mapping, the two dominant activities in the present phase of the project. Advances in mapping techniques have allowed good progress toward the specific goals of the project and are also providing strong corollary benefits throughout biomedical research. Actual DNA sequencing of the genomes of the human and model organisms is still at an early stage. There has been little progress in the intrinsic efficiency of DNA-sequence determination. However, refinements in experimental protocols, instrumentation, and project management have made it practical to acquire sequence data on an enlarged scale. It is also increasingly apparent that DNA-sequence data provide a potent means of relating knowledge gained from the study of model organisms to human biology. There is as yet little indication that the infusion of technology from outside biology into the Human Genome Project has been effectively stimulated. Opportunities in this area remain large, posing substantial technical and policy challenges. PMID:8506271

  9. The Human Genome Program

    SciTech Connect

    Bell, G.I.

    1989-01-01

    Early in 1986, Charles DeLisi, then head of the Office of Health and Environmental Research at the Department of Energy (DOE) requested the Los Alamos National Laboratory (LANL) to organize a workshop charged with inquiring whether the state of technology and potential payoffs in biological knowledge and medical practice were such as to justify an organized program to map and sequence the human genome. The DOE's interest arose from its mission to assess the effects of radiation and other products of energy generation on human health in general and genetic material in particular. The workshop concluded that the technology was ripe, the benefits would be great, and a national program should be promptly initiated. Later committees, reporting to DOE, to the NIH, to the Office of Technology Assessment of the US Congress, and to the National Academy of Science have reviewed these issues more deliberately and come to the same conclusion. As a consequence, there has been established in the United States, a Human Genome Program, with funding largely from the NIH and the DOE, as indicated in Table 1. Moreover, the Program has attracted international interest, and Great Britain, France, Italy, and the Soviet Union, among other countries, have been reported to be starting human genome initiatives. Coordination of these programs, clearly in the interests of each, remains to be worked out, although an international Human Genome Organization (HUGO) is considering such coordination. 5 refs., 1 fig., 2 tabs.

  10. Genomics in Cardiovascular Disease

    PubMed Central

    Roberts, Robert; Marian, A.J.; Dandona, Sonny; Stewart, Alexandre F.R.

    2013-01-01

    A paradigm shift towards biology occurred in the 1990’s subsequently catalyzed by the sequencing of the human genome in 2000. The cost of DNA sequencing has gone from millions to thousands of dollars with sequencing of one’s entire genome costing only $1,000. Rapid DNA sequencing is being embraced for single gene disorders, particularly for sporadic cases and those from small families. Transmission of lethal genes such as associated with Huntington’s disease can, through in-vitro fertilization, avoid passing it on to one’s offspring. DNA sequencing will meet the challenge of elucidating the genetic predisposition for common polygenic diseases, especially in determining the function of the novel common genetic risk variants and identifying the rare variants, which may also partially ascertain the source of the missing heritability. The challenge for DNA sequencing remains great, despite human genome sequences being 99.5% identical, the 3 million single nucleotide polymorphisms (SNPs) responsible for most of the unique features add up to 60 new mutations per person which, for 7 billion people, is 420 billion mutations. It is claimed that DNA sequencing has increased 10,000 fold while information storage and retrieval only 16 fold. The physician and health user will be challenged by the convergence of two major trends, whole genome sequencing and the storage/retrieval and integration of the data. PMID:23524054

  11. Genomic imprinting: parental influence on the genome.

    PubMed

    Reik, W; Walter, J

    2001-01-01

    Genomic imprinting affects several dozen mammalian genes and results in the expression of those genes from only one of the two parental chromosomes. This is brought about by epigenetic instructions--imprints--that are laid down in the parental germ cells. Imprinting is a particularly important genetic mechanism in mammals, and is thought to influence the transfer of nutrients to the fetus and the newborn from the mother. Consistent with this view is the fact that imprinted genes tend to affect growth in the womb and behaviour after birth. Aberrant imprinting disturbs development and is the cause of various disease syndromes. The study of imprinting also provides new insights into epigenetic gene modification during development.

  12. Plant functional genomics

    NASA Astrophysics Data System (ADS)

    Holtorf, Hauke; Guitton, Marie-Christine; Reski, Ralf

    2002-04-01

    Functional genome analysis of plants has entered the high-throughput stage. The complete genome information from key species such as Arabidopsis thaliana and rice is now available and will further boost the application of a range of new technologies to functional plant gene analysis. To broadly assign functions to unknown genes, different fast and multiparallel approaches are currently used and developed. These new technologies are based on known methods but are adapted and improved to accommodate for comprehensive, large-scale gene analysis, i.e. such techniques are novel in the sense that their design allows researchers to analyse many genes at the same time and at an unprecedented pace. Such methods allow analysis of the different constituents of the cell that help to deduce gene function, namely the transcripts, proteins and metabolites. Similarly the phenotypic variations of entire mutant collections can now be analysed in a much faster and more efficient way than before. The different methodologies have developed to form their own fields within the functional genomics technological platform and are termed transcriptomics, proteomics, metabolomics and phenomics. Gene function, however, cannot solely be inferred by using only one such approach. Rather, it is only by bringing together all the information collected by different functional genomic tools that one will be able to unequivocally assign functions to unknown plant genes. This review focuses on current technical developments and their impact on the field of plant functional genomics. The lower plant Physcomitrella is introduced as a new model system for gene function analysis, owing to its high rate of homologous recombination.

  13. TUTORIAL ON NETWORK GENOMICS.

    SciTech Connect

    Forst, C.

    2001-01-01

    With the ever-increasing genomic information pouring into the databases researchers start to look for pattern in genomes. Key questions are the identification of function. In the past function was mainly understood to be assigned to a single gene isolated from other cellular components or mechanisms. Sequence comparison fo single genes and their products (proteins) as well as of intergenic space are a consequence of a well established one-gene one-function interpretation. prediction of function solely by sequence similarity searches are powerful techniques that initiated the advent of bioinformatics and computational biology. Seminal work on sequence alignment by Temple Smith and Michael Waterman [33] and sequence searches with the BLAST algorithm by Altschul et al. [2] provide essential methods for sequence based determination of function. Similar outstanding contributions to determination of function have been archived in the area of structure prediction, molecular modeling and molecular dynamics. Techniques covering ab initio and homology modeling up to biophysical interpretation of long-run molecular dynamics simulations are mentioned ehre. With the ever-increasing number of information of different genetic/genomic origin, new aspect are looked for that deviate from the single gene at a time method. Especially with the identification of surprisingly few human genes the emerging perception in the scientific community that the concept of function has to be extended to include other sequence based as well as non-sequenced based information. A schema of determination of function by different concepts is shown in Figure 1. The tutorial is comprised of the following sections: The first two sections discuss the differences between genomic and non-genomic based context information, section three will cover combined methods. Finally, section four lsits web-resources and databases. All presented approaches extensively employ comparative methods.

  14. Towards Sequencing Cotton (Gossypium) Genomes

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Despite rapidly decreasing costs and innovative technologies, sequencing of angiosperm genomes is not yet undertaken lightly. Generating larger amounts of sequence data more quickly does not address the difficulties of sequencing and assembling complex genomes de novo. The cotton genomes represent a...

  15. From human genome to cancer genome: The first decade

    PubMed Central

    Wheeler, David A.; Wang, Linghua

    2013-01-01

    The realization that cancer progression required the participation of cellular genes provided one of several key rationales, in 1986, for embarking on the human genome project. Only with a reference genome sequence could the full spectrum of somatic changes leading to cancer be understood. Since its completion in 2003, the human reference genome sequence has fulfilled its promise as a foundational tool to illuminate the pathogenesis of cancer. Herein, we review the key historical milestones in cancer genomics since the completion of the genome, and some of the novel discoveries that are shaping our current understanding of cancer. PMID:23817046

  16. Comprehensive genome sequencing of the liver cancer genome.

    PubMed

    Nakagawa, Hidewaki; Shibata, Tatsuhiro

    2013-11-01

    Hepatocellular carcinoma (HCC) is the third leading cause of cancer-related death worldwide. Recently, comprehensive whole genome and exome sequencing analyses for HCC revealed new cancer-associated genes and a variety of genomic alterations. In particular, frequent genetic alterations of the chromatin remodeling genes were observed, suggesting a new potential therapeutic target for HCC. Sequencing analysis has further identified the molecular complexities of multicentric lesions and intratumoral heterogeneity. Detailed analyses of the somatic substitution pattern of the cancer genome and the HBV virus genome integration sites by using whole-genome sequencing will elucidate the molecular basis and diverse etiological factors involved in liver cancer development.

  17. Genome-wide analysis of the structure of the South African Coloured Population in the Western Cape.

    PubMed

    de Wit, Erika; Delport, Wayne; Rugamika, Chimusa E; Meintjes, Ayton; Möller, Marlo; van Helden, Paul D; Seoighe, Cathal; Hoal, Eileen G

    2010-08-01

    Admixed populations present unique opportunities to discover the genetic factors underlying many multifactorial diseases. The geographical position and complex history of South Africa has led to the establishment of the unique admixed population known as the South African Coloured. Not much is known about the genetic make-up of this population, and the historical record is patchy. We genotyped 959 individuals from the Western Cape area, self-identified as belonging to this population, using the Affymetrix 500k genotyping platform. This resulted in nearly 75,000 autosomal SNPs that could be compared with populations represented in the International HapMap Project and the Human Genome Diversity Project. Analysis by means of both the admixture and linkage models in STRUCTURE revealed that the major ancestral components of this population are predominantly Khoesan (32-43%), Bantu-speaking Africans (20-36%), European (21-28%) and a smaller Asian contribution (9-11%), depending on the model used. This is consistent with historical data. While of great historical and genealogical interest, this information is also essential for future admixture mapping of disease genes in this population.

  18. Whole-genome transcriptional and physiological responses of Nitrosomonas europaea to cyanide: identification of cyanide stress response genes.

    PubMed

    Park, Sunhwa; Ely, Roger L

    2009-04-15

    Nitrosomonas europaea (ATCC 19718) is one of several nitrifying species that participate in the biological removal of nitrogen from wastewater by oxidizing ammonia to nitrite, the first step in nitrification. Because nitrification is quite sensitive to cyanide, a compound often encountered in wastewater treatment plants, we characterized the physiological and transcriptional responses of N. europaea cells to cyanide. The cells were extremely sensitive to low concentrations of cyanide, with NO-(2)production and ammonia-dependent oxygen uptake rates decreasing by 50% within 30 min of exposure to 1 microM NaCN. Whole-genome transcriptional responses of cells exposed to 1 microM NaCN were examined using Affymetrix microarrays to identify stress-induced genes. The transcript levels of 35 genes increased more than 2-fold while transcript levels of 29 genes decreased more than 20-fold. A gene cluster that included moeZ (NE2353), encoding a rhodanese homologue and thought to be involved in detoxification of cyanide, showed the highest up-regulation (7-fold). The down-regulated genes included genes encoding proteins involved in the sulfate reduction pathway, signal transduction mechanisms, carbohydrate transport, energy production, coenzyme metabolism, and amino acid transport.

  19. Genome-wide association study identifies ALDH7A1 as a novel susceptibility gene for osteoporosis.

    PubMed

    Guo, Yan; Tan, Li-Jun; Lei, Shu-Feng; Yang, Tie-Lin; Chen, Xiang-Ding; Zhang, Feng; Chen, Yuan; Pan, Feng; Yan, Han; Liu, Xiaogang; Tian, Qing; Zhang, Zhi-Xin; Zhou, Qi; Qiu, Chuan; Dong, Shan-Shan; Xu, Xiang-Hong; Guo, Yan-Fang; Zhu, Xue-Zhen; Liu, Shan-Lin; Wang, Xiang-Li; Li, Xi; Luo, Yi; Zhang, Li-Shu; Li, Meng; Wang, Jin-Tang; Wen, Ting; Drees, Betty; Hamilton, James; Papasian, Christopher J; Recker, Robert R; Song, Xiao-Ping; Cheng, Jing; Deng, Hong-Wen

    2010-01-08

    Osteoporosis is a major public health problem. It is mainly characterized by low bone mineral density (BMD) and/or low-trauma osteoporotic fractures (OF), both of which have strong genetic determination. The specific genes influencing these phenotypic traits, however, are largely unknown. Using the Affymetrix 500K array set, we performed a case-control genome-wide association study (GWAS) in 700 elderly Chinese Han subjects (350 with hip OF and 350 healthy matched controls). A follow-up replication study was conducted to validate our major GWAS findings in an independent Chinese sample containing 390 cases with hip OF and 516 controls. We found that a SNP, rs13182402 within the ALDH7A1 gene on chromosome 5q31, was strongly associated with OF with evidence combined GWAS and replication studies (P = 2.08x10(-9), odds ratio = 2.25). In order to explore the target risk factors and potential mechanism underlying hip OF risk, we further examined this candidate SNP's relevance to hip BMD both in Chinese and Caucasian populations involving 9,962 additional subjects. This SNP was confirmed as consistently associated with hip BMD even across ethnic boundaries, in both Chinese and Caucasians (combined P = 6.39x10(-6)), further attesting to its potential effect on osteoporosis. ALDH7A1 degrades and detoxifies acetaldehyde, which inhibits osteoblast proliferation and results in decreased bone formation. Our findings may provide new insights into the pathogenesis of osteoporosis.

  20. Genome of Crocodilepox Virus

    PubMed Central

    Afonso, C. L.; Tulman, E. R.; Delhon, G.; Lu, Z.; Viljoen, G. J.; Wallace, D. B.; Kutish, G. F.; Rock, D. L.

    2006-01-01

    Here, we present the genome sequence, with analysis, of a poxvirus infecting Nile crocodiles (Crocodylus niloticus) (crocodilepox virus; CRV). The genome is 190,054 bp (62% G+C) and predicted to contain 173 genes encoding proteins of 53 to 1,941 amino acids. The central genomic region contains genes conserved and generally colinear with those of other chordopoxviruses (ChPVs). CRV is distinct, as the terminal 33-kbp (left) and 13-kbp (right) genomic regions are largely CRV specific, containing 48 unique genes which lack similarity to other poxvirus genes. Notably, CRV also contains 14 unique genes which disrupt ChPV gene colinearity within the central genomic region, including 7 genes encoding GyrB-like ATPase domains similar to those in cellular type IIA DNA topoisomerases, suggestive of novel ATP-dependent functions. The presence of 10 CRV proteins with similarity to components of cellular multisubunit E3 ubiquitin-protein ligase complexes, including 9 proteins containing F-box motifs and F-box-associated regions and a homologue of cellular anaphase-promoting complex subunit 11 (Apc11), suggests that modification of host ubiquitination pathways may be significant for CRV-host cell interaction. CRV encodes a novel complement of proteins potentially involved in DNA replication, including a NAD+-dependent DNA ligase and a protein with similarity to both vaccinia virus F16L and prokaryotic serine site-specific resolvase-invertases. CRV lacks genes encoding proteins for nucleotide metabolism. CRV shares notable genomic similarities with molluscum contagiosum virus, including genes found only in these two viruses. Phylogenetic analysis indicates that CRV is quite distinct from other ChPVs, representing a new genus within the subfamily Chordopoxvirinae, and it lacks recognizable homologues of most ChPV genes involved in virulence and host range, including those involving interferon response, intracellular signaling, and host immune response modulation. These data reveal

  1. Implementing genomics and pharmacogenomics in the clinic: The National Human Genome Research Institute's genomic medicine portfolio.

    PubMed

    Manolio, Teri A

    2016-10-01

    Increasing knowledge about the influence of genetic variation on human health and growing availability of reliable, cost-effective genetic testing have spurred the implementation of genomic medicine in the clinic. As defined by the National Human Genome Research Institute (NHGRI), genomic medicine uses an individual's genetic information in his or her clinical care, and has begun to be applied effectively in areas such as cancer genomics, pharmacogenomics, and rare and undiagnosed diseases. In 2011 NHGRI published its strategic vision for the future of genomic research, including an ambitious research agenda to facilitate and promote the implementation of genomic medicine. To realize this agenda, NHGRI is consulting and facilitating collaborations with the external research community through a series of "Genomic Medicine Meetings," under the guidance and leadership of the National Advisory Council on Human Genome Research. These meetings have identified and begun to address significant obstacles to implementation, such as lack of evidence of efficacy, limited availability of genomics expertise and testing, lack of standards, and difficulties in integrating genomic results into electronic medical records. The six research and dissemination initiatives comprising NHGRI's genomic research portfolio are designed to speed the evaluation and incorporation, where appropriate, of genomic technologies and findings into routine clinical care. Actual adoption of successful approaches in clinical care will depend upon the willingness, interest, and energy of professional societies, practitioners, patients, and payers to promote their responsible use and share their experiences in doing so.

  2. The perennial ryegrass GenomeZipper: targeted use of genome resources for comparative grass genomics.

    PubMed

    Pfeifer, Matthias; Martis, Mihaela; Asp, Torben; Mayer, Klaus F X; Lübberstedt, Thomas; Byrne, Stephen; Frei, Ursula; Studer, Bruno

    2013-02-01

    Whole-genome sequences established for model and major crop species constitute a key resource for advanced genomic research. For outbreeding forage and turf grass species like ryegrasses (Lolium spp.), such resources have yet to be developed. Here, we present a model of the perennial ryegrass (Lolium perenne) genome on the basis of conserved synteny to barley (Hordeum vulgare) and the model grass genome Brachypodium (Brachypodium distachyon) as well as rice (Oryza sativa) and sorghum (Sorghum bicolor). A transcriptome-based genetic linkage map of perennial ryegrass served as a scaffold to establish the chromosomal arrangement of syntenic genes from model grass species. This scaffold revealed a high degree of synteny and macrocollinearity and was then utilized to anchor a collection of perennial ryegrass genes in silico to their predicted genome positions. This resulted in the unambiguous assignment of 3,315 out of 8,876 previously unmapped genes to the respective chromosomes. In total, the GenomeZipper incorporates 4,035 conserved grass gene loci, which were used for the first genome-wide sequence divergence analysis between perennial ryegrass, barley, Brachypodium, rice, and sorghum. The perennial ryegrass GenomeZipper is an ordered, information-rich genome scaffold, facilitating map-based cloning and genome assembly in perennial ryegrass and closely related Poaceae species. It also represents a milestone in describing synteny between perennial ryegrass and fully sequenced model grass genomes, thereby increasing our understanding of genome organization and evolution in the most important temperate forage and turf grass species.

  3. Nongenetic functions of the genome.

    PubMed

    Bustin, Michael; Misteli, Tom

    2016-05-06

    The primary function of the genome is to store, propagate, and express the genetic information that gives rise to a cell's architectural and functional machinery. However, the genome is also a major structural component of the cell. Besides its genetic roles, the genome affects cellular functions by nongenetic means through its physical and structural properties, particularly by exerting mechanical forces and by serving as a scaffold for binding of cellular components. Major cellular processes affected by nongenetic functions of the genome include establishment of nuclear structure, signal transduction, mechanoresponses, cell migration, and vision in nocturnal animals. We discuss the concept, mechanisms, and implications of nongenetic functions of the genome.

  4. Genomics and the immune system.

    PubMed

    Pipkin, Matthew E; Monticelli, Silvia

    2008-05-01

    While the hereditary information encoded in the Watson-Crick base pairing of genomes is largely static within a given individual, access to this information is controlled by dynamic mechanisms. The human genome is pervasively transcribed, but the roles played by the majority of the non-protein-coding genome sequences are still largely unknown. In this review we focus on insights to gene transcriptional regulation by placing special emphasis on genome-wide approaches, and on how non-coding RNAs, which derive from global transcription of the genome, in turn control gene expression. We review recent progress in the field with highlights on the immune system.

  5. Polygenic transmission and complex neuro developmental network for attention deficit hyperactivity disorder: genome-wide association study of both common and rare variants.

    PubMed

    Yang, Li; Neale, Benjamin M; Liu, Lu; Lee, S Hong; Wray, Naomi R; Ji, Ning; Li, Haimei; Qian, Qiujin; Wang, Dongliang; Li, Jun; Faraone, Stephen V; Wang, Yufeng; Doyle, Alysa E; Reif, Andreas; Rothenberger, Aribert; Franke, Barbara; Sonuga-Barke, Edmund J S; Steinhausen, Hans-Christoph; Buitelaar, Jan K; Kuntsi, Jonna; Biederman, Joseph; Lesch, Klaus-Peter; Kent, Lindsey; Asherson, Philip; Oades, Robert D; Loo, Sandra K; Nelson, Stan F; Faraone, Stephen V; Smalley, Susan L; Banaschewski, Tobias; Arias Vasquez, Alejandro; Todorov, Alexandre; Charach, Alice; Miranda, Ana; Warnke, Andreas; Thapar, Anita; Neale, Benjamin M; Cormand, Bru; Freitag, Christine; Mick, Eric; Mulas, Fernando; Middleton, Frank; HakonarsonHakonarson, Hakon; Palmason, Haukur; Schäfer, Helmut; Roeyers, Herbert; McGough, James J; Romanos, Jasmin; Crosbie, Jennifer; Meyer, Jobst; Ramos-Quiroga, Josep Antoni; Sergeant, Joseph; Elia, Josephine; Langely, Kate; Nisenbaum, Laura; Romanos, Marcel; Daly, Mark J; Ribasés, Marta; Gill, Michael; O'Donovan, Michael; Owen, Michael; Casas, Miguel; Bayés, Mònica; Lambregts-Rommelse, Nanda; Williams, Nigel; Holmans, Peter; Anney, Richard J L; Ebstein, Richard P; Schachar, Russell; Medland, Sarah E; Ripke, Stephan; Walitza, Susanne; Nguyen, Thuy Trang; Renner, Tobias J; Hu, Xiaolan

    2013-07-01

    Attention-deficit hyperactivity disorder (ADHD) is a complex polygenic disorder. This study aimed to discover common and rare DNA variants associated with ADHD in a large homogeneous Han Chinese ADHD case-control sample. The sample comprised 1,040 cases and 963 controls. All cases met DSM-IV ADHD diagnostic criteria. We used the Affymetrix6.0 array to assay both single nucleotide polymorphisms (SNPs) and copy number variants (CNVs). Genome-wide association analyses were performed using PLINK. SNP-heritability and SNP-genetic correlations with ADHD in Caucasians were estimated with genome-wide complex trait analysis (GCTA). Pathway analyses were performed using the Interval enRICHment Test (INRICH), the Disease Association Protein-Protein Link Evaluator (DAPPLE), and the Genomic Regions Enrichment of Annotations Tool (GREAT). We did not find genome-wide significance for single SNPs but did find an increased burden of large, rare CNVs in the ADHD sample (P = 0.038). SNP-heritability was estimated to be 0.42 (standard error, 0.13, P = 0.0017) and the SNP-genetic correlation with European Ancestry ADHD samples was 0.39 (SE 0.15, P = 0.0072). The INRICH, DAPPLE, and GREAT analyses implicated several gene ontology cellular components, including neuron projections and synaptic components, which are consistent with a neurodevelopmental pathophysiology for ADHD. This study suggested the genetic architecture of ADHD comprises both common and rare variants. Some common causal variants are likely to be shared between Han Chinese and Caucasians. Complex neurodevelopmental networks may underlie ADHD's etiology.

  6. Informational laws of genome structures

    PubMed Central

    Bonnici, Vincenzo; Manca, Vincenzo

    2016-01-01

    In recent years, the analysis of genomes by means of strings of length k occurring in the genomes, called k-mers, has provided important insights into the basic mechanisms and design principles of genome structures. In the present study, we focus on the proper choice of the value of k for applying information theoretic concepts that express intrinsic aspects of genomes. The value k = lg2(n), where n is the genome length, is determined to be the best choice in the definition of some genomic informational indexes that are studied and computed for seventy genomes. These indexes, which are based on information entropies and on suitable comparisons with random genomes, suggest five informational laws, to which all of the considered genomes obey. Moreover, an informational genome complexity measure is proposed, which is a generalized logistic map that balances entropic and anti-entropic components of genomes and is related to their evolutionary dynamics. Finally, applications to computational synthetic biology are briefly outlined. PMID:27354155

  7. Advances in plant chromosome genomics.

    PubMed

    Doležel, Jaroslav; Vrána, Jan; Cápal, Petr; Kubaláková, Marie; Burešová, Veronika; Simková, Hana

    2014-01-01

    Next generation sequencing (NGS) is revolutionizing genomics and is providing novel insights into genome organization, evolution and function. The number of plant genomes targeted for sequencing is rising. For the moment, however, the acquisition of full genome sequences in large genome species remains difficult, largely because the short reads produced by NGS platforms are inadequate to cope with repeat-rich DNA, which forms a large part of these genomes. The problem of sequence redundancy is compounded in polyploids, which dominate the plant kingdom. An approach to overcoming some of these difficulties is to reduce the full nuclear genome to its individual chromosomes using flow-sorting. The DNA acquired in this way has proven to be suitable for many applications, including PCR-based physical mapping, in situ hybridization, forming DNA arrays, the development of DNA markers, the construction of BAC libraries and positional cloning. Coupling chromosome sorting with NGS offers opportunities for the study of genome organization at the single chromosomal level, for comparative analyses between related species and for the validation of whole genome assemblies. Apart from the primary aim of reducing the complexity of the template, taking a chromosome-based approach enables independent teams to work in parallel, each tasked with the analysis of a different chromosome(s). Given that the number of plant species tractable for chromosome sorting is increasing, the likelihood is that chromosome genomics - the marriage of cytology and genomics - will make a significant contribution to the field of plant genetics.

  8. Evolution of small prokaryotic genomes.

    PubMed

    Martínez-Cano, David J; Reyes-Prieto, Mariana; Martínez-Romero, Esperanza; Partida-Martínez, Laila P; Latorre, Amparo; Moya, Andrés; Delaye, Luis

    2014-01-01

    As revealed by genome sequencing, the biology of prokaryotes with reduced genomes is strikingly diverse. These include free-living prokaryotes with ∼800 genes as well as endosymbiotic bacteria with as few as ∼140 genes. Comparative genomics is revealing the evolutionary mechanisms that led to these small genomes. In the case of free-living prokaryotes, natural selection directly favored genome reduction, while in the case of endosymbiotic prokaryotes neutral processes played a more prominent role. However, new experimental data suggest that selective processes may be at operation as well for endosymbiotic prokaryotes at least during the first stages of genome reduction. Endosymbiotic prokaryotes have evolved diverse strategies for living with reduced gene sets inside a host-defined medium. These include utilization of host-encoded functions (some of them coded by genes acquired by gene transfer from the endosymbiont and/or other bacteria); metabolic complementation between co-symbionts; and forming consortiums with other bacteria within the host. Recent genome sequencing projects of intracellular mutualistic bacteria showed that previously believed universal evolutionary trends like reduced G+C content and conservation of genome synteny are not always present in highly reduced genomes. Finally, the simplified molecular machinery of some of these organisms with small genomes may be used to aid in the design of artificial minimal cells. Here we review recent genomic discoveries of the biology of prokaryotes endowed with small gene sets and discuss the evolutionary mechanisms that have been proposed to explain their peculiar nature.

  9. Informational laws of genome structures

    NASA Astrophysics Data System (ADS)

    Bonnici, Vincenzo; Manca, Vincenzo

    2016-06-01

    In recent years, the analysis of genomes by means of strings of length k occurring in the genomes, called k-mers, has provided important insights into the basic mechanisms and design principles of genome structures. In the present study, we focus on the proper choice of the value of k for applying information theoretic concepts that express intrinsic aspects of genomes. The value k = lg2(n), where n is the genome length, is determined to be the best choice in the definition of some genomic informational indexes that are studied and computed for seventy genomes. These indexes, which are based on information entropies and on suitable comparisons with random genomes, suggest five informational laws, to which all of the considered genomes obey. Moreover, an informational genome complexity measure is proposed, which is a generalized logistic map that balances entropic and anti-entropic components of genomes and is related to their evolutionary dynamics. Finally, applications to computational synthetic biology are briefly outlined.

  10. Evolution of small prokaryotic genomes

    PubMed Central

    Martínez-Cano, David J.; Reyes-Prieto, Mariana; Martínez-Romero, Esperanza; Partida-Martínez, Laila P.; Latorre, Amparo; Moya, Andrés; Delaye, Luis

    2015-01-01

    As revealed by genome sequencing, the biology of prokaryotes with reduced genomes is strikingly diverse. These include free-living prokaryotes with ∼800 genes as well as endosymbiotic bacteria with as few as ∼140 genes. Comparative genomics is revealing the evolutionary mechanisms that led to these small genomes. In the case of free-living prokaryotes, natural selection directly favored genome reduction, while in the case of endosymbiotic prokaryotes neutral processes played a more prominent role. However, new experimental data suggest that selective processes may be at operation as well for endosymbiotic prokaryotes at least during the first stages of genome reduction. Endosymbiotic prokaryotes have evolved diverse strategies for living with reduced gene sets inside a host-defined medium. These include utilization of host-encoded functions (some of them coded by genes acquired by gene transfer from the endosymbiont and/or other bacteria); metabolic complementation between co-symbionts; and forming consortiums with other bacteria within the host. Recent genome sequencing projects of intracellular mutualistic bacteria showed that previously believed universal evolutionary trends like reduced G+C content and conservation of genome synteny are not always present in highly reduced genomes. Finally, the simplified molecular machinery of some of these organisms with small genomes may be used to aid in the design of artificial minimal cells. Here we review recent genomic discoveries of the biology of prokaryotes endowed with small gene sets and discuss the evolutionary mechanisms that have been proposed to explain their peculiar nature. PMID:25610432

  11. Sequencing technologies and genome sequencing.

    PubMed

    Pareek, Chandra Shekhar; Smoczynski, Rafal; Tretyn, Andrzej

    2011-11-01

    The high-throughput - next generation sequencing (HT-NGS) technologies are currently the hottest topic in the field of human and animals genomics researches, which can produce over 100 times more data compared to the most sophisticated capillary sequencers based on the Sanger method. With the ongoing developments of high throughput sequencing machines and advancement of modern bioinformatics tools at unprecedented pace, the target goal of sequencing individual genomes of living organism at a cost of $1,000 each is seemed to be realistically feasible in the near future. In the relatively short time frame since 2005, the HT-NGS technologies are revolutionizing the human and animal genome researches by analysis of chromatin immunoprecipitation coupled to DNA microarray (ChIP-chip) or sequencing (ChIP-seq), RNA sequencing (RNA-seq), whole genome genotyping, genome wide structural variation, de novo assembling and re-assembling of genome, mutation detection and carrier screening, detection of inherited disorders and complex human diseases, DNA library preparation, paired ends and genomic captures, sequencing of mitochondrial genome and personal genomics. In this review, we addressed the important features of HT-NGS like, first generation DNA sequencers, birth of HT-NGS, second generation HT-NGS platforms, third generation HT-NGS platforms: including single molecule Heliscope™, SMRT™ and RNAP sequencers, Nanopore, Archon Genomics X PRIZE foundation, comparison of second and third HT-NGS platforms, applications, advances and future perspectives of sequencing technologies on human and animal genome research.

  12. Comparative genomics of Brassicaceae crops

    PubMed Central

    Sharma, Ashutosh; Li, Xiaonan; Lim, Yong Pyo

    2014-01-01

    The family Brassicaceae is one of the major groups of the plant kingdom and comprises diverse species of great economic, agronomic and scientific importance, including the model plant Arabidopsis. The sequencing of the Arabidopsis genome has revolutionized our knowledge in the field of plant biology and provides a foundation in genomics and comparative biology. Genomic resources have been utilized in Brassica for diversity analyses, construction of genetic maps and identification of agronomic traits. In Brassicaceae, comparative sequence analysis across the species has been utilized to understand genome structure, evolution and the detection of conserved genomic segments. In this review, we focus on the progress made in genetic resource development, genome sequencing and comparative mapping in Brassica and related species. The utilization of genomic resources and next-generation sequencing approaches in improvement of Brassica crops is also discussed. PMID:24987286

  13. Advances on Genome Duplication Distances

    NASA Astrophysics Data System (ADS)

    Gagnon, Yves; Savard, Olivier Tremblay; Bertrand, Denis; El-Mabrouk, Nadia

    Given a phylogenetic tree involving Whole Genome Duplication events, we contribute to the problem of computing the rearrangement distance on a branch of a tree linking a duplication node d to a speciation node or a leaf s. In the case of a genome G at s containing exactly two copies of each gene, the genome halving problem is to find a perfectly duplicated genome D at d minimizing the rearrangement distance with G. We generalize the existing exact linear-time algorithm for genome halving to the case of a genome G with missing gene copies. In the case of a known ancestral duplicated genome D, we develop a greedy approach for computing the distance between G and D that is shown time-efficient and very accurate for both the rearrangement and DCJ distances.

  14. Big cat genomics.

    PubMed

    O'Brien, Stephen J; Johnson, Warren E

    2005-01-01

    Advances in population and quantitative genomics, aided by the computational algorithms that employ genetic theory and practice, are now being applied to biological questions that surround free-ranging species not traditionally suitable for genetic enquiry. Here we review how applications of molecular genetic tools have been used to describe the natural history, present status, and future disposition of wild cat species. Insight into phylogenetic hierarchy, demographic contractions, geographic population substructure, behavioral ecology, and infectious diseases have revealed strategies for survival and adaptation of these fascinating predators. Conservation, stabilization, and management of the big cats are important areas that derive benefit from the genome resources expanded and applied to highly successful species, imperiled by an expanding human population.

  15. Bacterial genome annotation.

    PubMed

    Beckloff, Nicholas; Starkenburg, Shawn; Freitas, Tracey; Chain, Patrick

    2012-01-01

    Annotation of prokaryotic sequences can be separated into structural and functional annotation. Structural annotation is dependent on algorithmic interrogation of experimental evidence to discover the physical characteristics of a gene. This is done in an effort to construct accurate gene models, so understanding function or evolution of genes among organisms is not impeded. Functional annotation is dependent on sequence similarity to other known genes or proteins in an effort to assess the function of the gene. Combining structural and functional annotation across genomes in a comparative manner promotes higher levels of accurate annotation as well as an advanced understanding of genome evolution. As the availability of bacterial sequences increases and annotation methods improve, the value of comparative annotation will increase.

  16. [Genomics in medicine].

    PubMed

    Ruiz Esparza-Garrido, Ruth; Velázquez-Flores, Miguel Angel; Arenas-Aranda, Diego Julio; Salamanca-Gómez, Fabio

    2014-01-01

    The development of new fields of study in genetics, as the -omic sciences (transcriptomics, proteomics, metabolomics), has allowed the study of the regulation and expression of genomes. Therefore, nowadays it is possible to study global alterations--in the whole genome--and their effect at the protein and metabolic levels. Importantly, this new way of studying genetics has opened new areas of knowledge, and new cellular mechanisms that regulate the functioning of biological systems have been elucidated. In the clinical field, in the last years new molecular tools have been implemented. These tools are favorable to a better classification, diagnosis and prognosis of several human diseases. Additionally, in some cases best treatments, which improve the quality of life of patients, have been established. Due to the previous assertion, it is important to review and divulge changes in the study of genetics as a result of the development of the -omic sciences, which is the aim of this review.

  17. Viruses within animal genomes.

    PubMed

    De Brognier, A; Willems, L

    2016-04-01

    Viruses and their hosts can co-evolve to reach a fragile equilibrium that allows the survival of both. An excess of pathogenicity in the absence of a reservoir would be detrimental to virus survival. A significant proportion of all animal genomes has been shaped by the insertion of viruses that subsequently became 'fossilised'. Most endogenous viruses have lost the capacity to replicate via an infectious cycle and now replicate passively. The insertion of endogenous viruses has contributed to the evolution of animal genomes, for example in the reproductive biology of mammals. However, spontaneous viral integration still occasionally occurs in a number of virus-host systems. This constitutes a potential risk to host survival but also provides an opportunity for diversification and evolution.

  18. Mapping the human genome

    SciTech Connect

    Annas, G.C.; Elias, S.

    1992-01-01

    This article is a review of the book Mapping the Human Genome: Using Law and Ethics as Guides, edited by George C. Annas and Sherman Elias. The book is a collection of essays on the subject of using ethics and laws as guides to justify human gene mapping. It addresses specific issues such problems related to eugenics, patents, insurance as well as broad issues such as the societal definitions of normality.

  19. Genomic landscape of liposarcoma

    PubMed Central

    Kanojia, Deepika; Nagata, Yasunobu; Garg, Manoj; Lee, Dhong Hyun; Sato, Aiko; Yoshida, Kenichi; Sato, Yusuke; Sanada, Masashi; Mayakonda, Anand; Bartenhagen, Christoph; Klein, Hans-Ulrich; Doan, Ngan B.; Said, Jonathan W.; Mohith, S.; Gunasekar, Swetha; Shiraishi, Yuichi; Chiba, Kenichi; Tanaka, Hiroko; Miyano, Satoru; Myklebost, Ola; Yang, Henry; Dugas, Martin; Meza-Zepeda, Leonardo A.; Silberman, Allan W.; Forscher, Charles; Tyner, Jeffrey W.; Ogawa, Seishi; Koeffler, H. Phillip

    2015-01-01

    Liposarcoma (LPS) is the most common type of soft tissue sarcoma accounting for 20% of all adult sarcomas. Due to absence of clinically effective treatment options in inoperable situations and resistance to chemotherapeutics, a critical need exists to identify novel therapeutic targets. We analyzed LPS genomic landscape using SNP arrays, whole exome sequencing and targeted exome sequencing to uncover the genomic information for development of specific anti-cancer targets. SNP array analysis indicated known amplified genes (MDM2, CDK4, HMGA2) and important novel genes (UAP1, MIR557, LAMA4, CPM, IGF2, ERBB3, IGF1R). Carboxypeptidase M (CPM), recurrently amplified gene in well-differentiated/de-differentiated LPS was noted as a putative oncogene involved in the EGFR pathway. Notable deletions were found at chromosome 1p (RUNX3, ARID1A), chromosome 11q (ATM, CHEK1) and chromosome 13q14.2 (MIR15A, MIR16-1). Significantly and recurrently mutated genes (false discovery rate < 0.05) included PLEC (27%), MXRA5 (21%), FAT3 (24%), NF1 (20%), MDC1 (10%), TP53 (7%) and CHEK2 (6%). Further, in vitro and in vivo functional studies provided evidence for the tumor suppressor role for Neurofibromin 1 (NF1) gene in different subtypes of LPS. Pathway analysis of recurrent mutations demonstrated signaling through MAPK, JAK-STAT, Wnt, ErbB, axon guidance, apoptosis, DNA damage repair and cell cycle pathways were involved in liposarcomagenesis. Interestingly, we also found mutational and copy number heterogeneity within a primary LPS tumor signifying the importance of multi-region sequencing for cancer-genome guided therapy. In summary, these findings provide insight into the genomic complexity of LPS and highlight potential druggable pathways for targeted therapeutic approach. PMID:26643872

  20. Genomics of cellulosic biofuels.

    PubMed

    Rubin, Edward M

    2008-08-14

    The development of alternatives to fossil fuels as an energy source is an urgent global priority. Cellulosic biomass has the potential to contribute to meeting the demand for liquid fuel, but land-use requirements and process inefficiencies represent hurdles for large-scale deployment of biomass-to-biofuel technologies. Genomic information gathered from across the biosphere, including potential energy crops and microorganisms able to break down biomass, will be vital for improving the prospects of significant cellulosic biofuel production.

  1. Genome Wide Association Studies

    NASA Astrophysics Data System (ADS)

    Sebastiani, Paola; Solovieff, Nadia

    The availability of high throughput technology for parallel genotyping has opened the field of genetics to genome-wide association studies (GWAS). These studies generate massive amount of genetic data that challenge investigators with issues related to data management, statistical analysis of large data sets, visualization, and annotation of results. We will review the common approach to analysis of GWAS data and then discuss options to learn more from these data.

  2. Personalized Genomic Medicine with a Patchwork, Partially Owned Genome

    PubMed Central

    Mason, Christopher E.; Seringhaus, Michael R.; Sattler de Sousa e Brito, Clara

    2008-01-01

    “His book was known as the Book of Sand, because neither the book nor the sand have any beginning or end.” — Jorge Luis Borges The human genome is a three billion-letter recipe for the genesis of a human being, directing development from a single-celled embryo to the trillions of adult cells. Since the sequencing of the human genome was announced in 2001, researchers have an increased ability to discern the genetic basis for diseases. This reference genome has opened the door to genomic medicine, aimed at detecting and understanding all genetic variations of the human genome that contribute to the manifestation and progression of disease. The overarching vision of genomic (or “personalized”) medicine is to custom-tailor each treatment for maximum effectiveness in an individual patient. Detecting the variation in a patient’s deoxyribonucleic acid (DNA), ribonucleic acid (RNA), and protein structures is no longer an insurmountable hurdle. Today, the challenge for genomic medicine lies in contextualizing those myriad genetic variations in terms of their functional consequences for a person’s health and development throughout life and in terms of that patient’s susceptibility to disease and differential clinical responses to medication. Additionally, several recent developments have complicated our understanding of the nominal human genome and, thereby, altered the progression of genomic medicine. In this brief review, we shall focus on these developments and examine how they are changing our understanding of our genome. PMID:18449389

  3. Mapping the human genome

    SciTech Connect

    Cantor, Charles R.

    1989-06-01

    The following pages aim to lay a foundation for understanding the excitement surrounding the ''human genome project,'' as well as to convey a flavor of the ongoing efforts and plans at the Human Genome Center at the Lawrence Berkeley Laboratory. Our own work, of course, is only part of a broad international effort that will dramatically enhance our understanding of human molecular genetics before the end of this century. In this country, the bulk of the effort will be carried out under the auspices of the Department of Energy and the National Institutes of Health, but significant contributions have already been made both by nonprofit private foundations and by private corporation. The respective roles of the DOE and the NIH are being coordinated by an inter-agency committee, the aims of which are to emphasize the strengths of each agency, to facilitate cooperation, and to avoid unnecessary duplication of effort. The NIH, for example, will continue its crucial work in medical genetics and in mapping the genomes of nonhuman species. The DOE, on the other hand, has unique experience in managing large projects, and its national laboratories are repositories of expertise in physics, engineering, and computer science, as well as the life sciences. The tools and techniques the project will ultimately rely on are thus likely to be developed in multidisciplinary efforts at laboratories like LBL. Accordingly, we at LBL take great pride in this enterprise -- an enterprise that will eventually transform our understanding of ourselves.

  4. The canine genome.

    PubMed

    Ostrander, Elaine A; Wayne, Robert K

    2005-12-01

    The dog has emerged as a premier species for the study of morphology, behavior, and disease. The recent availability of a high-quality draft sequence lifts the dog system to a new threshold. We provide a primer to use the dog genome by first focusing on its evolutionary history. We overview the relationship of dogs to wild canids and discuss their origin and domestication. Dogs clearly originated from a substantial number of gray wolves and dog breeds define distinct genetic units that can be divided into at least four hierarchical groupings. We review evidence showing that dogs have high levels of linkage disequilibrium. Consequently, given that dog breeds express specific phenotypic traits and vary in behavior and the incidence of genetic disease, genomic-wide scans for linkage disequilibrium may allow the discovery of genes influencing breed-specific characteristics. Finally, we review studies that have utilized the dog to understand the genetic underpinning of several traits, and we summarize genomic resources that can be used to advance such studies. We suggest that given these resources and the unique characteristics of breeds, that the dog is a uniquely valuable resource for studying the genetic basis of complex traits.

  5. Whole-genome sequencing for comparative genomics and de novo genome assembly.

    PubMed

    Benjak, Andrej; Sala, Claudia; Hartkoorn, Ruben C

    2015-01-01

    Next-generation sequencing technologies for whole-genome sequencing of mycobacteria are rapidly becoming an attractive alternative to more traditional sequencing methods. In particular this technology is proving useful for genome-wide identification of mutations in mycobacteria (comparative genomics) as well as for de novo assembly of whole genomes. Next-generation sequencing however generates a vast quantity of data that can only be transformed into a usable and comprehensible form using bioinformatics. Here we describe the methodology one would use to prepare libraries for whole-genome sequencing, and the basic bioinformatics to identify mutations in a genome following Illumina HiSeq or MiSeq sequencing, as well as de novo genome assembly following sequencing using Pacific Biosciences (PacBio).

  6. An Analysis of Adenovirus Genomes Using Whole Genome Software Tools

    PubMed Central

    Mahadevan, Padmanabhan

    2016-01-01

    The evolution of sequencing technology has lead to an enormous increase in the number of genomes that have been sequenced. This is especially true in the field of virus genomics. In order to extract meaningful biological information from these genomes, whole genome data mining software tools must be utilized. Hundreds of tools have been developed to analyze biological sequence data. However, only some of these tools are user-friendly to biologists. Several of these tools that have been successfully used to analyze adenovirus genomes are described here. These include Artemis, EMBOSS, pDRAW, zPicture, CoreGenes, GeneOrder, and PipMaker. These tools provide functionalities such as visualization, restriction enzyme analysis, alignment, and proteome comparisons that are extremely useful in the bioinformatics analysis of adenovirus genomes. PMID:28293072

  7. Efficient Breeding by Genomic Mating.

    PubMed

    Akdemir, Deniz; Sánchez, Julio I

    2016-01-01

    Selection in breeding programs can be done by using phenotypes (phenotypic selection), pedigree relationship (breeding value selection) or molecular markers (marker assisted selection or genomic selection). All these methods are based on truncation selection, focusing on the best performance of parents before mating. In this article we proposed an approach to breeding, named genomic mating, which focuses on mating instead of truncation selection. Genomic mating uses information in a similar fashion to genomic selection but includes information on complementation of parents to be mated. Following the efficiency frontier surface, genomic mating uses concepts of estimated breeding values, risk (usefulness) and coefficient of ancestry to optimize mating between parents. We used a genetic algorithm to find solutions to this optimization problem and the results from our simulations comparing genomic selection, phenotypic selection and the mating approach indicate that current approach for breeding complex traits is more favorable than phenotypic and genomic selection. Genomic mating is similar to genomic selection in terms of estimating marker effects, but in genomic mating the genetic information and the estimated marker effects are used to decide which genotypes should be crossed to obtain the next breeding population.

  8. Efficient Breeding by Genomic Mating

    PubMed Central

    Akdemir, Deniz; Sánchez, Julio I.

    2016-01-01

    Selection in breeding programs can be done by using phenotypes (phenotypic selection), pedigree relationship (breeding value selection) or molecular markers (marker assisted selection or genomic selection). All these methods are based on truncation selection, focusing on the best performance of parents before mating. In this article we proposed an approach to breeding, named genomic mating, which focuses on mating instead of truncation selection. Genomic mating uses information in a similar fashion to genomic selection but includes information on complementation of parents to be mated. Following the efficiency frontier surface, genomic mating uses concepts of estimated breeding values, risk (usefulness) and coefficient of ancestry to optimize mating between parents. We used a genetic algorithm to find solutions to this optimization problem and the results from our simulations comparing genomic selection, phenotypic selection and the mating approach indicate that current approach for breeding complex traits is more favorable than phenotypic and genomic selection. Genomic mating is similar to genomic selection in terms of estimating marker effects, but in genomic mating the genetic information and the estimated marker effects are used to decide which genotypes should be crossed to obtain the next breeding population. PMID:27965707

  9. The UCSC Ebola Genome Portal

    PubMed Central

    Haeussler, Maximilian; Karolchik, Donna; Clawson, Hiram; Raney, Brian J; Rosenbloom, Kate R.; Fujita, Pauline A.; Hinrichs, Angie S.; Speir, Matthew L; Eisenhart, Chris; Zweig, Ann S.; Haussler, David; Kent, W. James

    2014-01-01

    Background: With the Ebola epidemic raging out of control in West Africa, there has been a flurry of research into the Ebola virus, resulting in the generation of much genomic data. Methods: In response to the clear need for tools that integrate multiple strands of research around molecular sequences, we have created the University of California Santa Cruz (UCSC) Ebola Genome Browser, an adaptation of our popular UCSC Genome Browser web tool, which can be used to view the Ebola virus genome sequence from GenBank and nearly 30 annotation tracks generated by mapping external data to the reference sequence. Significant annotations include a multiple alignment comprising 102 Ebola genomes from the current outbreak, 56 from previous outbreaks, and 2 Marburg genomes as an outgroup; a gene track curated by NCBI; protein annotations curated by UniProt and antibody-binding epitopes curated by IEDB. We have extended the Genome Browser’s multiple alignment color-coding scheme to distinguish mutations resulting from non-synonymous coding changes, synonymous changes, or changes in untranslated regions. Discussion: Our Ebola Genome portal at http://genome.ucsc.edu/ebolaPortal/ links to the Ebola virus Genome Browser and an aggregate of useful information, including a collection of Ebola antibodies we are curating. PMID:25685613

  10. Genome-wide association and genomic selection in animal breeding.

    PubMed

    Hayes, Ben; Goddard, Mike

    2010-11-01

    Results from genome-wide association studies in livestock, and humans, has lead to the conclusion that the effect of individual quantitative trait loci (QTL) on complex traits, such as yield, are likely to be small; therefore, a large number of QTL are necessary to explain genetic variation in these traits. Given this genetic architecture, gains from marker-assisted selection (MAS) programs using only a small number of DNA markers to trace a limited number of QTL is likely to be small. This has lead to the development of alternative technology for using the available dense single nucleotide polymorphism (SNP) information, called genomic selection. Genomic selection uses a genome-wide panel of dense markers so that all QTL are likely to be in linkage disequilibrium with at least one SNP. The genomic breeding values are predicted to be the sum of the effect of these SNPs across the entire genome. In dairy cattle breeding, the accuracy of genomic estimated breeding values (GEBV) that can be achieved and the fact that these are available early in life have lead to rapid adoption of the technology. Here, we discuss the design of experiments necessary to achieve accurate prediction of GEBV in future generations in terms of the number of markers necessary and the size of the reference population where marker effects are estimated. We also present a simple method for implementing genomic selection using a genomic relationship matrix. Future challenges discussed include using whole genome sequence data to improve the accuracy of genomic selection and management of inbreeding through genomic relationships.

  11. Analysis of genomic aberrations and gene expression profiling identifies novel lesions and pathways in myeloproliferative neoplasms

    PubMed Central

    Rice, K L; Lin, X; Wolniak, K; Ebert, B L; Berkofsky-Fessler, W; Buzzai, M; Sun, Y; Xi, C; Elkin, P; Levine, R; Golub, T; Gilliland, D G; Crispino, J D; Licht, J D; Zhang, W

    2011-01-01

    Polycythemia vera (PV), essential thrombocythemia and primary myelofibrosis, are myeloproliferative neoplasms (MPNs) with distinct clinical features and are associated with the JAK2V617F mutation. To identify genomic anomalies involved in the pathogenesis of these disorders, we profiled 87 MPN patients using Affymetrix 250K single-nucleotide polymorphism (SNP) arrays. Aberrations affecting chr9 were the most frequently observed and included 9pLOH (n=16), trisomy 9 (n=6) and amplifications of 9p13.3–23.3 (n=1), 9q33.1–34.13 (n=1) and 9q34.13 (n=6). Patients with trisomy 9 were associated with elevated JAK2V617F mutant allele burden, suggesting that gain of chr9 represents an alternative mechanism for increasing JAK2V617F dosage. Gene expression profiling of patients with and without chr9 abnormalities (+9, 9pLOH), identified genes potentially involved in disease pathogenesis including JAK2, STAT5B and MAPK14. We also observed recurrent gains of 1p36.31–36.33 (n=6), 17q21.2–q21.31 (n=5) and 17q25.1–25.3 (n=5) and deletions affecting 18p11.31–11.32 (n=8). Combined SNP and gene expression analysis identified aberrations affecting components of a non-canonical PRC2 complex (EZH1, SUZ12 and JARID2) and genes comprising a ‘HSC signature' (MLLT3, SMARCA2 and PBX1). We show that NFIB, which is amplified in 7/87 MPN patients and upregulated in PV CD34+ cells, protects cells from apoptosis induced by cytokine withdrawal. PMID:22829077

  12. Genome-wide gene expression profiling of SCID mice with T-cell-mediated Colitis.

    PubMed

    Brudzewsky, D; Pedersen, A E; Claesson, M H; Gad, M; Kristensen, N N; Lage, K; Jensen, T; Tommerup, N; Larsen, L A; Knudsen, S; Tümer, Z

    2009-05-01

    Inflammatory bowel disease (IBD) is a multifactorial disorder with an unknown aetiology. The aim of this study is to employ a murine model of IBD to identify pathways and genes, which may play a key role in the pathogenesis of IBD and could be important for discovery of new disease markers in human disease. Here, we have investigated severe combined immunodeficient (SCID) mice, which upon adoptive transfer with concanavalin A-activated CD4(+) T cells develop inflammation of the colon with predominance in rectum. Mice with increasing level of inflammation was studied. RNA from rectum of transplanted and non-transplanted SCID mice was investigated by a genome-wide gene expression analysis using the Affymetrix mouse expression array 430A (MOE430A) including 22,626 probe sets. A significant change in gene expression (P = 0.00001) is observed in 152 of the genes between the non-transplanted control mice and colitis mice, and among these genes there is an overrepresentation of genes involved in inflammatory processes. Some of the most significant genes showing higher expression encode S100A proteins and chemokines involved in trafficking of leucocytes in inflammatory areas. Classification by gene clustering based on the genes with the significantly altered gene expression corresponds to two different levels of inflammation as established by the histological scoring of the inflamed rectum. These data demonstrate that this SCID T-cell transfer model is a useful animal model for human IBD and can be used for suggesting candidate genes involved in the pathogenesis and for identifying new molecular markers of chronic inflammation in human IBD.

  13. Genome-Wide Survey of Cold Stress Regulated Alternative Splicing in Arabidopsis thaliana with Tiling Microarray

    PubMed Central

    Leviatan, Noam; Alkan, Noam; Leshkowitz, Dena; Fluhr, Robert

    2013-01-01

    Alternative splicing plays a major role in expanding the potential informational content of eukaryotic genomes. It is an important post-transcriptional regulatory mechanism that can increase protein diversity and affect mRNA stability. Alternative splicing is often regulated in a tissue-specific and stress-responsive manner. Cold stress, which adversely affects plant growth and development, regulates the transcription and splicing of plant splicing factors. This can affect the pre-mRNA processing of many genes. To identify cold regulated alternative splicing we applied Affymetrix Arabidopsis tiling arrays to survey the transcriptome under cold treatment conditions. A novel algorithm was used for detection of statistically relevant changes in intron expression within a transcript between control and cold growth conditions. A reverse transcription polymerase chain reaction (RT-PCR) analysis of a number of randomly selected genes confirmed the changes in splicing patterns under cold stress predicted by tiling array. Our analysis revealed new types of cold responsive genes. While their expression level remains relatively unchanged under cold stress their splicing pattern shows detectable changes in the relative abundance of isoforms. The majority of cold regulated alternative splicing introduced a premature termination codon (PTC) into the transcripts creating potential targets for degradation by the nonsense mediated mRNA decay (NMD) process. A number of these genes were analyzed in NMD-defective mutants by RT-PCR and shown to evade NMD. This may result in new and truncated proteins with altered functions or dominant negative effects. The results indicate that cold affects both quantitative and qualitative aspects of gene expression. PMID:23776682

  14. Genome-wide survey of cold stress regulated alternative splicing in Arabidopsis thaliana with tiling microarray.

    PubMed

    Leviatan, Noam; Alkan, Noam; Leshkowitz, Dena; Fluhr, Robert

    2013-01-01

    Alternative splicing plays a major role in expanding the potential informational content of eukaryotic genomes. It is an important post-transcriptional regulatory mechanism that can increase protein diversity and affect mRNA stability. Alternative splicing is often regulated in a tissue-specific and stress-responsive manner. Cold stress, which adversely affects plant growth and development, regulates the transcription and splicing of plant splicing factors. This can affect the pre-mRNA processing of many genes. To identify cold regulated alternative splicing we applied Affymetrix Arabidopsis tiling arrays to survey the transcriptome under cold treatment conditions. A novel algorithm was used for detection of statistically relevant changes in intron expression within a transcript between control and cold growth conditions. A reverse transcription polymerase chain reaction (RT-PCR) analysis of a number of randomly selected genes confirmed the changes in splicing patterns under cold stress predicted by tiling array. Our analysis revealed new types of cold responsive genes. While their expression level remains relatively unchanged under cold stress their splicing pattern shows detectable changes in the relative abundance of isoforms. The majority of cold regulated alternative splicing introduced a premature termination codon (PTC) into the transcripts creating potential targets for degradation by the nonsense mediated mRNA decay (NMD) process. A number of these genes were analyzed in NMD-defective mutants by RT-PCR and shown to evade NMD. This may result in new and truncated proteins with altered functions or dominant negative effects. The results indicate that cold affects both quantitative and qualitative aspects of gene expression.

  15. A genome-wide association study for age-related hearing impairment in the Saami.

    PubMed

    Van Laer, Lut; Huyghe, Jeroen R; Hannula, Samuli; Van Eyken, Els; Stephan, Dietrich A; Mäki-Torkko, Elina; Aikio, Pekka; Fransen, Erik; Lysholm-Bernacchi, Alana; Sorri, Martti; Huentelman, Matthew J; Van Camp, Guy

    2010-06-01

    This study aimed at contributing to the elucidation of the genetic basis of age-related hearing impairment (ARHI), a common multifactorial disease with an important genetic contribution as demonstrated by heritability studies. We conducted a genome-wide association study (GWAS) in the Finnish Saami, a small, ancient, genetically isolated population without evidence of demographic expansion. The choice of this study population was motivated by its anticipated higher extent of LD, potentially offering a substantial power advantage for association mapping. DNA samples and audiometric measurements were collected from 352 Finnish Saami individuals, aged between 50 and 75 years. To reduce the burden of multiple testing, we applied principal component (PC) analysis to the multivariate audiometric phenotype. The first three PCs captured 80% of the variation in hearing thresholds, while maintaining biologically important audiometric features. All subjects were genotyped with the Affymetrix 100 K chip. To account for multiple levels of relatedness among subjects, as well as for population stratification, association testing was performed using a mixed model. We summarised the top-ranking association signals for the three traits under study. The top-ranked SNP, rs457717 (P-value 3.55 x 10(-7)), was associated with PC3 and was localised in an intron of the IQ motif-containing GTPase-activating-like protein (IQGAP2). Intriguingly, the SNP rs161927 (P-value 0.000149), seventh-ranked for PC1, was positioned immediately downstream from the metabotropic glutamate receptor-7 gene (GRM7). As a previous GWAS of a European and Finnish sample set already suggested a role for GRM7 in ARHI, this study provides further evidence for the involvement of this gene.

  16. Genome-wide expression profiling in the peripheral blood of patients with fibromyalgia

    PubMed Central

    Jones, Kim D.; Gelbart, Terri; Whisenant, Thomas C.; Waalen, Jill; Mondala, Tony S.; Iklé, David N.; Salomon, Daniel R.; Bennett, Robert M.; Kurian, Sunil M.

    2016-01-01

    Objective Fibromyalgia (FM) is a common pain disorder characterised by nociceptive dysregulation. The basic biology of FM is poorly understood. Herein we have used agnostic gene expression as a potential probe for informing its underlying biology and the development of a proof-of-concept diagnostic gene expression signature. Methods We analysed RNA expression in 70 FM patients and 70 healthy controls. The isolated RNA was amplified and hybridised to Affymetrix® Human Gene 1.1 ST Peg arrays. The data was analysed using Partek Genomics Suite v. 6.6. Results Fibromyalgia patients exhibited a differential expression of 421 genes (p<0.001), several relevant to pathways for pain processing, such as glutamine/glutamate signaling and axonal development. There was also an upregulation of several inflammatory pathways and downregulation of pathways related to hypersensitivity and allergy. Using rigorous diagnostic modeling strategies, we show “locked” gene signatures discovered on Training and Test cohorts, that have a mean Area Under the Curve (AUC) of 0.81 on randomised, independent external data cohorts. Lastly, we identified a subset of 10 probesets that provided a diagnostic sensitivity for FM of 95% and a specificity of 96%. We also show that the signatures for FM were very specific to FM rather than common FM comorbidities. Conclusion These findings provide new insights relevant to the pathogenesis of FM, and provide several testable hypotheses that warrant further exploration and also establish the foundation for a first blood-based molecular signature in FM that needs to be validated in larger cohorts of patients. PMID:27157394

  17. A genome-wide association study for age-related hearing impairment in the Saami

    PubMed Central

    Van Laer, Lut; Huyghe, Jeroen R; Hannula, Samuli; Van Eyken, Els; Stephan, Dietrich A; Mäki-Torkko, Elina; Aikio, Pekka; Fransen, Erik; Lysholm-Bernacchi, Alana; Sorri, Martti; Huentelman, Matthew J; Van Camp, Guy

    2010-01-01

    This study aimed at contributing to the elucidation of the genetic basis of age-related hearing impairment (ARHI), a common multifactorial disease with an important genetic contribution as demonstrated by heritability studies. We conducted a genome-wide association study (GWAS) in the Finnish Saami, a small, ancient, genetically isolated population without evidence of demographic expansion. The choice of this study population was motivated by its anticipated higher extent of LD, potentially offering a substantial power advantage for association mapping. DNA samples and audiometric measurements were collected from 352 Finnish Saami individuals, aged between 50 and 75 years. To reduce the burden of multiple testing, we applied principal component (PC) analysis to the multivariate audiometric phenotype. The first three PCs captured 80% of the variation in hearing thresholds, while maintaining biologically important audiometric features. All subjects were genotyped with the Affymetrix 100 K chip. To account for multiple levels of relatedness among subjects, as well as for population stratification, association testing was performed using a mixed model. We summarised the top-ranking association signals for the three traits under study. The top-ranked SNP, rs457717 (P-value 3.55 × 10−7), was associated with PC3 and was localised in an intron of the IQ motif-containing GTPase-activating-like protein (IQGAP2). Intriguingly, the SNP rs161927 (P-value 0.000149), seventh-ranked for PC1, was positioned immediately downstream from the metabotropic glutamate receptor-7 gene (GRM7). As a previous GWAS of a European and Finnish sample set already suggested a role for GRM7 in ARHI, this study provides further evidence for the involvement of this gene. PMID:20068591

  18. Detection of selective sweeps in cattle using genome-wide SNP data

    PubMed Central

    2013-01-01

    Background The domestication and subsequent selection by humans to create breeds and biological types of cattle undoubtedly altered the patterning of variation within their genomes. Strong selection to fix advantageous large-effect mutations underlying domesticability, breed characteristics or productivity created selective sweeps in which variation was lost in the chromosomal region flanking the selected allele. Selective sweeps have now been identified in the genomes of many animal species including humans, dogs, horses, and chickens. Here, we attempt to identify and characterise regions of the bovine genome that have been subjected to selective sweeps. Results Two datasets were used for the discovery and validation of selective sweeps via the fixation of alleles at a series of contiguous SNP loci. BovineSNP50 data were used to identify 28 putative sweep regions among 14 diverse cattle breeds. Affymetrix BOS 1 prescreening assay data for five breeds were used to identify 85 regions and validate 5 regions identified using the BovineSNP50 data. Many genes are located within these regions and the lack of sequence data for the analysed breeds precludes the nomination of selected genes or variants and limits the prediction of the selected phenotypes. However, phenotypes that we predict to have historically been under strong selection include horned-polled, coat colour, stature, ear morphology, and behaviour. Conclusions The bias towards common SNPs in the design of the BovineSNP50 assay led to the identification of recent selective sweeps associated with breed formation and common to only a small number of breeds rather than ancient events associated with domestication which could potentially be common to all European taurines. The limited SNP density, or marker resolution, of the BovineSNP50 assay significantly impacted the rate of false discovery of selective sweeps, however, we found sweeps in common between breeds which were confirmed using an ultra

  19. A Genome-Wide Association Search for Type 2 Diabetes Genes in African Americans

    PubMed Central

    Palmer, Nicholette D.; McDonough, Caitrin W.; Hicks, Pamela J.; Roh, Bong H.; Wing, Maria R.; An, S. Sandy; Hester, Jessica M.; Cooke, Jessica N.; Bostrom, Meredith A.; Rudock, Megan E.; Talbert, Matthew E.; Lewis, Joshua P.; Ferrara, Assiamira; Lu, Lingyi; Ziegler, Julie T.; Sale, Michele M.; Divers, Jasmin; Shriner, Daniel; Adeyemo, Adebowale; Rotimi, Charles N.; Ng, Maggie C. Y.; Langefeld, Carl D.; Freedman, Barry I.; Bowden, Donald W.

    2012-01-01

    African Americans are disproportionately affected by type 2 diabetes (T2DM) yet few studies have examined T2DM using genome-wide association approaches in this ethnicity. The aim of this study was to identify genes associated with T2DM in the African American population. We performed a Genome Wide Association Study (GWAS) using the Affymetrix 6.0 array in 965 African-American cases with T2DM and end-stage renal disease (T2DM-ESRD) and 1029 population-based controls. The most significant SNPs (n = 550 independent loci) were genotyped in a replication cohort and 122 SNPs (n = 98 independent loci) were further tested through genotyping three additional validation cohorts followed by meta-analysis in all five cohorts totaling 3,132 cases and 3,317 controls. Twelve SNPs had evidence of association in the GWAS (P<0.0071), were directionally consistent in the Replication cohort and were associated with T2DM in subjects without nephropathy (P<0.05). Meta-analysis in all cases and controls revealed a single SNP reaching genome-wide significance (P<2.5×10−8). SNP rs7560163 (P = 7.0×10−9, OR (95% CI) = 0.75 (0.67–0.84)) is located intergenically between RND3 and RBM43. Four additional loci (rs7542900, rs4659485, rs2722769 and rs7107217) were associated with T2DM (P<0.05) and reached more nominal levels of significance (P<2.5×10−5) in the overall analysis and may represent novel loci that contribute to T2DM. We have identified novel T2DM-susceptibility variants in the African-American population. Notably, T2DM risk was associated with the major allele and implies an interesting genetic architecture in this population. These results suggest that multiple loci underlie T2DM susceptibility in the African-American population and that these loci are distinct from those identified in other ethnic populations. PMID:22238593

  20. Genomic Data Commons and Genomic Cloud Pilots - Google Hangout

    Cancer.gov

    Join us for a live, moderated discussion about two NCI efforts to expand access to cancer genomics data: the Genomic Data Commons and Genomic Cloud Pilots. NCI subject matters experts will include Louis M. Staudt, M.D., Ph.D., Director Center for Cancer Genomics, Warren Kibbe, Ph.D., Director, NCI Center for Biomedical Informatics and Information Technology, and moderated by Anthony Kerlavage, Ph.D., Chief, Cancer Informatics Branch, Center for Biomedical Informatics and Information Technology. We welcome your questions before and during the Hangout on Twitter using the hashtag #AskNCI.

  1. Shrinking genomes? Evidence from genome size variation in Crepis (Compositae).

    PubMed

    Enke, N; Fuchs, J; Gemeinholzer, B

    2011-01-01

    Large-scale surveys of genome size evolution in angiosperms show that the ancestral genome was most likely small, with a tendency towards an increase in DNA content during evolution. Due to polyploidisation and self-replicating DNA elements, angiosperm genomes were considered to have a 'one-way ticket to obesity' (Bennetzen & Kellogg 1997). New findings on how organisms can lose DNA challenged the hypotheses of unidirectional evolution of genome size. The present study is based on the classical work of Babcock (1947a) on karyotype evolution within Crepis and analyses karyotypic diversification within the genus in a phylogenetic context. Genome size of 21 Crepis species was estimated using flow cytometry. Additional data of 17 further species were taken from the literature. Within 30 diploid Crepis species there is a striking trend towards genome contraction. The direction of genome size evolution was analysed by reconstructing ancestral character states on a molecular phylogeny based on ITS sequence data. DNA content is correlated to distributional aspects as well as life form. Genome size is significantly higher in perennials than in annuals. Within sampled species, very small genomes are only present in Mediterranean or European species, whereas their Central and East Asian relatives have larger 1C values.

  2. Genome instability mechanisms and the structure of cancer genomes.

    PubMed

    Cassidy, Liam D; Venkitaraman, Ashok R

    2012-02-01

    Genomic instability is a hallmark of cancer cells, and arises from the aberrations that these cells exhibit in the normal biological mechanisms that repair and replicate the genome, or ensure its accurate segregation during cell division. Increasingly detailed descriptions of cancer genomes have begun to emerge from next-generation sequencing (NGS), providing snapshots of their nature and heterogeneity in different cancers at different stages in their evolution. Here, we attempt to extract from these sequencing studies insights into the role of genome instability mechanisms in carcinogenesis, and to identify challenges impeding further progress.

  3. The coffee genome hub: a resource for coffee genomes

    PubMed Central

    Dereeper, Alexis; Bocs, Stéphanie; Rouard, Mathieu; Guignon, Valentin; Ravel, Sébastien; Tranchant-Dubreuil, Christine; Poncet, Valérie; Garsmeur, Olivier; Lashermes, Philippe; Droc, Gaëtan

    2015-01-01

    The whole genome sequence of Coffea canephora, the perennial diploid species known as Robusta, has been recently released. In the context of the C. canephora genome sequencing project and to support post-genomics efforts, we developed the Coffee Genome Hub (http://coffee-genome.org/), an integrative genome information system that allows centralized access to genomics and genetics data and analysis tools to facilitate translational and applied research in coffee. We provide the complete genome sequence of C. canephora along with gene structure, gene product information, metabolism, gene families, transcriptomics, syntenic blocks, genetic markers and genetic maps. The hub relies on generic software (e.g. GMOD tools) for easy querying, visualizing and downloading research data. It includes a Genome Browser enhanced by a Community Annotation System, enabling the improvement of automatic gene annotation through an annotation editor. In addition, the hub aims at developing interoperability among other existing South Green tools managing coffee data (phylogenomics resources, SNPs) and/or supporting data analyses with the Galaxy workflow manager. PMID:25392413

  4. The Anolis Lizard Genome: An Amniote Genome without Isochores?

    PubMed Central

    Costantini, Maria; Greif, Gonzalo; Alvarez-Valin, Fernando; Bernardi, Giorgio

    2016-01-01

    Two articles published 5 years ago concluded that the genome of the lizard Anolis carolinensis is an amniote genome without isochores. This claim was apparently contradicting previous results on the general presence of an isochore organization in all vertebrate genomes tested (including Anolis). In this investigation, we demonstrate that the Anolis genome is indeed heterogeneous in base composition, since its macrochromosomes comprise isochores mainly from the L2 and H1 families (a moderately GC-poor and a moderately GC-rich family, respectively), and since the majority of the sequenced microchromosomes consists of H1 isochores. These families are associated with different features of genome structure, including gene density and compositional correlations (e.g., GC3 vs flanking sequence GC and intron GC), as in the case of mammalian and avian genomes. Moreover, the assembled Anolis chromosomes have an enormous number of gaps, which could be due to sequencing problems in GC-rich regions of the genome. In conclusion, the Anolis genome is no exception to the general rule of an isochore organization in the genomes of vertebrates (and other eukaryotes). PMID:26992416

  5. A preliminary study of the whole-genome expression profile of sporadic and monogenic early-onset Alzheimer's disease.

    PubMed

    Antonell, Anna; Lladó, Albert; Altirriba, Jordi; Botta-Orfila, Teresa; Balasa, Mircea; Fernández, Manel; Ferrer, Isidre; Sánchez-Valle, Raquel; Molinuevo, José Luis

    2013-07-01

    Alzheimer's disease (AD) is the most common neurodegenerative dementia. Approximately 10% of cases present at an age of onset before 65 years old, which in turn can be monogenic familial AD (FAD) or sporadic early-onset AD (sEOAD). Mutations in PSEN1, PSEN2, and APP genes have been linked with FAD. The aim of our study is to describe the brain whole-genome RNA expression profile of the posterior cingulate area in sEOAD and FAD caused by PSEN1 mutations (FAD-PSEN1). Fourteen patients (7 sEOAD and 7 FAD-PSEN1) and 7 neurologically healthy control subjects were selected and whole-genome expression was measured using Affymetrix Human Gene 1.1 microarrays. We identified statistically significant expression changes in sEOAD and FAD-PSEN1 brains with respect to control subjects (3183 and 3350 differentially expressed genes [DEG] respectively, false discovery rate-corrected p < 0.05). Of them, 1916 DEG were common between the 2 comparisons. We did not identify DEG between sEOAD and FAD-PSEN1. Microarray data were validated through real-time quantitative polymerase chain reaction. In silico analysis of DEG revealed an alteration in biological pathways related to intracellular signaling pathways (particularly calcium signaling), neuroactive ligand-receptor interactions, axon guidance, and long-term potentiation in both groups of patients. In conclusion, the altered biological final pathways in sEOAD and FAD-PSEN1 are mainly related with cell signaling cascades, synaptic plasticity, and learning and memory processes. We hypothesize that these 2 groups of early-onset AD with distinct etiologies and likely different could present a neurodegenerative process with potential different pathways that might converge in a common and similar final stage of the disease.

  6. Computational Systems Biology Approach Predicts Regulators and Targets of microRNAs and Their Genomic Hotspots in Apoptosis Process.

    PubMed

    Alanazi, Ibrahim O; Ebrahimie, Esmaeil

    2016-07-01

    Novel computational systems biology tools such as common targets analysis, common regulators analysis, pathway discovery, and transcriptomic-based hotspot discovery provide new opportunities in understanding of apoptosis molecular mechanisms. In this study, after measuring the global contribution of microRNAs in the course of apoptosis by Affymetrix platform, systems biology tools were utilized to obtain a comprehensive view on the role of microRNAs in apoptosis process. Network analysis and pathway discovery highlighted the crosstalk between transcription factors and microRNAs in apoptosis. Within the transcription factors, PRDM1 showed the highest upregulation during the course of apoptosis, with more than 9-fold expression increase compared to non-apoptotic condition. Within the microRNAs, MIR1208 showed the highest expression in non-apoptotic condition and downregulated by more than 6 fold during apoptosis. Common regulators algorithm showed that TNF receptor is the key upstream regulator with a high number of regulatory interactions with the differentially expressed microRNAs. BCL2 and AKT1 were the key downstream targets of differentially expressed microRNAs. Enrichment analysis of the genomic locations of differentially expressed microRNAs led us to the discovery of chromosome bands which were highly enriched (p < 0.01) with the apoptosis-related microRNAs, such as 13q31.3, 19p13.13, and Xq27.3 This study opens a new avenue in understanding regulatory mechanisms and downstream functions in the course of apoptosis as well as distinguishing genomic-enriched hotspots for apoptosis process.

  7. Genome-Wide Gene Expression Profiling Reveals Conserved and Novel Molecular Functions of the Stigma in Rice1[W

    PubMed Central

    Li, Meina; Xu, Wenying; Yang, Wenqiang; Kong, Zhaosheng; Xue, Yongbiao

    2007-01-01

    In angiosperms, the stigma provides initial nutrients and guidance cues for pollen grain germination and tube growth. However, little is known about the genes that regulate these processes in rice (Oryza sativa). Here, we generate rice stigma-specific or -preferential gene expression profiles through comparing genome-wide expression patterns of hand-dissected, unpollinated stigma at anthesis with seven tissues, including seedling shoot, seedling root, mature anther, ovary at anthesis, seeds 5 d after pollination, 10-d-old embryo, 10-d-old endosperm, and suspension-cultured cells by using both 57 K Affymetrix rice whole-genome array and 10 K rice cDNA microarray. A high reproducibility of the microarray results was detected between the two different technology platforms. In total, we identified 548 genes to be expressed specifically or predominantly in the stigma papillar cells of rice. Real-time quantitative reverse transcription-polymerase chain reaction analysis of 34 selected genes all confirmed their stigma-specific expression. The expression of five selected genes was further validated by RNA in situ hybridization. Gene Ontology analysis shows that several auxin-signaling components, transcription, and stress-related genes are significantly overrepresented in the rice stigma gene set. Interestingly, most of them also share several cis-regulatory elements with known stress-responsive genes, supporting the notion of an overlap of genetic programs regulating pollination and stress/defense responses. We also found that genes involved in cell wall metabolism and cellular communication appear to be conserved in the stigma between rice and Arabidopsis (Arabidopsis thaliana). Our results indicate that the stigmas appear to have conserved and novel molecular functions between rice and Arabidopsis. PMID:17556504

  8. Gene-environment interaction effects on lung function- a genome-wide association study within the Framingham heart study

    PubMed Central

    2013-01-01

    Background Previous studies in occupational exposure and lung function have focused only on the main effect of occupational exposure or genetics on lung function. Some disease-susceptible genes may be missed due to their low marginal effects, despite potential involvement in the disease process through interactions with the environment. Through comprehensive genome-wide gene-environment interaction studies, we can uncover these susceptibility genes. Our objective in this study was to explore gene by occupational exposure interaction effects on lung function using both the individual SNPs approach and the genetic network approach. Methods The study population comprised the Offspring Cohort and the Third Generation from the Framingham Heart Study. We used forced expiratory volume in one second (FEV1) and ratio of FEV1 to forced vital capacity (FVC) as outcomes. Occupational exposures were classified using a population-specific job exposure matrix. We performed genome-wide gene-environment interaction analysis, using the Affymetrix 550 K mapping array for genotyping. A linear regression-based generalized estimating equation was applied to account for within-family relatedness. Network analysis was conducted using results from single-nucleotide polymorphism (SNP)-level analyses and from gene expression study results. Results There were 4,785 participants in total. SNP-level analysis and network analysis identified SNP rs9931086 (Pinteraction =1.16 × 10-7) in gene SLC38A8, which may significantly modify the effects of occupational exposure on FEV1. Genes identified from the network analysis included CTLA-4, HDAC, and PPAR-alpha. Conclusions Our study implies that SNP rs9931086 in SLC38A8 and genes CTLA-4, HDAC, and PPAR-alpha, which are related to inflammatory processes, may modify the effect of occupational exposure on lung function. PMID:24289273

  9. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls.

    PubMed

    2007-06-07

    There is increasing evidence that genome-wide association (GWA) studies represent a powerful approach to the identification of genes involved in common human diseases. We describe a joint GWA study (using the Affymetrix GeneChip 500K Mapping Array Set) undertaken in the British population, which has examined approximately 2,000 individuals for each of 7 major diseases and a shared set of approximately 3,000 controls. Case-control comparisons identified 24 independent association signals at P < 5 x 10(-7): 1 in bipolar disorder, 1 in coronary artery disease, 9 in Crohn's disease, 3 in rheumatoid arthritis, 7 in type 1 diabetes and 3 in type 2 diabetes. On the basis of prior findings and replication studies thus-far completed, almost all of these signals reflect genuine susceptibility effects. We observed association at many previously identified loci, and found compelling evidence that some loci confer risk for more than one of the diseases studied. Across all diseases, we identified a large number of further signals (including 58 loci with single-point P values between 10(-5) and 5 x 10(-7)) likely to yield additional susceptibility loci. The importance of appropriately large samples was confirmed by the modest effect sizes observed at most loci identified. This study thus represents a thorough validation of the GWA approach. It has also demonstrated that careful use of a shared control group represents a safe and effective approach to GWA analyses of multiple disease phenotypes; has generated a genome-wide genotype database for future studies of common diseases in the British population; and shown that, provided individuals with non-European ancestry are excluded, the extent of population stratification in the British population is generally modest. Our findings offer new avenues for exploring the pathophysiology of these important disorders. We anticipate that our data, results and software, which will be widely available to other investigators, will provide a

  10. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls

    PubMed Central

    2009-01-01

    There is increasing evidence that genome-wide association (GWA) studies represent a powerful approach to the identification of genes involved in common human diseases. We describe a joint GWA study (using the Affymetrix GeneChip 500K Mapping Array Set) undertaken in the British population, which has examined ~2,000 individuals for each of 7 major diseases and a shared set of ~3,000 controls. Case-control comparisons identified 24 independent association signals at P<5×10-7: 1 in bipolar disorder, 1 in coronary artery disease, 9 in Crohn’s disease, 3 in rheumatoid arthritis, 7 in type 1 diabetes and 3 in type 2 diabetes. On the basis of prior findings and replication studies thus-far completed, almost all of these signals reflect genuine susceptibility effects. We observed association at many previously identified loci, and found compelling evidence that some loci confer risk for more than one of the diseases studied. Across all diseases, we identified a large number of further signals (including 58 loci with single-point P values between 10-5 and 5×10-7) likely to yield additional susceptibility loci. The importance of appropriately large samples was confirmed by the modest effect sizes observed at most loci identified. This study thus represents a thorough validation of the GWA approach. It has also demonstrated that careful use of a shared control group represents a safe and effective approach to GWA analyses of multiple disease phenotypes; has generated a genome-wide genotype database for future studies of common diseases in the British population; and shown that, provided individuals with non-European ancestry are excluded, the extent of population stratification in the British population is generally modest. Our findings offer new avenues for exploring the pathophysiology of these important disorders. We anticipate that our data, results and software, which will be widely available to other investigators, will provide a powerful resource for human genetics

  11. Identification of Susceptible Loci and Enriched Pathways for Bipolar II Disorder Using Genome-Wide Association Studies

    PubMed Central

    Kao, Chung-Feng; Chen, Hui-Wen; Chen, Hsi-Chung; Yang, Jenn-Hwai; Huang, Ming-Chyi; Chiu, Yi-Hang; Lin, Shih-Ku; Lee, Ya-Chin; Liu, Chih-Min; Chuang, Li-Chung; Chen, Chien-Hsiun; Wu, Jer-Yuarn

    2016-01-01

    Background: This study aimed to identify susceptible loci and enriched pathways for bipolar disorder subtype II. Methods: We conducted a genome-wide association scan in discovery samples with 189 bipolar disorder subtype II patients and 1773 controls, and replication samples with 283 bipolar disorder subtype II patients and 500 controls in a Taiwanese Han population using Affymetrix Axiom Genome-Wide CHB1 Array. We performed single-marker and gene-based association analyses, as well as calculated polygeneic risk scores for bipolar disorder subtype II. Pathway enrichment analyses were employed to reveal significant biological pathways. Results: Seven markers were found to be associated with bipolar disorder subtype II in meta-analysis combining both discovery and replication samples (P<5.0×10–6), including markers in or close to MYO16, HSP90AB3P, noncoding gene LOC100507632, and markers in chromosomes 4 and 10. A novel locus, ETF1, was associated with bipolar disorder subtype II (P<6.0×10–3) in gene-based association tests. Results of risk evaluation demonstrated that higher genetic risk scores were able to distinguish bipolar disorder subtype II patients from healthy controls in both discovery (P=3.9×10–4~1.0×10–3) and replication samples (2.8×10–4~1.7×10–3). Genetic variance explained by chip markers for bipolar disorder subtype II was substantial in the discovery (55.1%) and replication (60.5%) samples. Moreover, pathways related to neurodevelopmental function, signal transduction, neuronal system, and cell adhesion molecules were significantly associated with bipolar disorder subtype II. Conclusion: We reported novel susceptible loci for pure bipolar subtype II disorder that is less addressed in the literature. Future studies are needed to confirm the roles of these loci for bipolar disorder subtype II. PMID:27450446

  12. The Giardia genome project database.

    PubMed

    McArthur, A G; Morrison, H G; Nixon, J E; Passamaneck, N Q; Kim, U; Hinkle, G; Crocker, M K; Holder, M E; Farr, R; Reich, C I; Olsen, G E; Aley, S B; Adam, R D; Gillin, F D; Sogin, M L

    2000-08-15

    The Giardia genome project database provides an online resource for Giardia lamblia (WB strain, clone C6) genome sequence information. The database includes edited single-pass reads, the results of BLASTX searches, and details of progress towards sequencing the entire 12 million-bp Giardia genome. Pre-sorted BLASTX results can be retrieved based on keyword searches and BLAST searches of the high throughput Giardia data can be initiated from the web site or through NCBI. Descriptions of the genomic DNA libraries, project protocols and summary statistics are also available. Although the Giardia genome project is ongoing, new sequences are made available on a bi-monthly basis to ensure that researchers have access to information that may assist them in the search for genes and their biological function. The current URL of the Giardia genome project database is www.mbl.edu/Giardia.

  13. The genome of Eucalyptus grandis.

    PubMed

    Myburg, Alexander A; Grattapaglia, Dario; Tuskan, Gerald A; Hellsten, Uffe; Hayes, Richard D; Grimwood, Jane; Jenkins, Jerry; Lindquist, Erika; Tice, Hope; Bauer, Diane; Goodstein, David M; Dubchak, Inna; Poliakov, Alexandre; Mizrachi, Eshchar; Kullan, Anand R K; Hussey, Steven G; Pinard, Desre; van der Merwe, Karen; Singh, Pooja; van Jaarsveld, Ida; Silva-Junior, Orzenil B; Togawa, Roberto C; Pappas, Marilia R; Faria, Danielle A; Sansaloni, Carolina P; Petroli, Cesar D; Yang, Xiaohan; Ranjan, Priya; Tschaplinski, Timothy J; Ye, Chu-Yu; Li, Ting; Sterck, Lieven; Vanneste, Kevin; Murat, Florent; Soler, Marçal; Clemente, Hélène San; Saidi, Naijib; Cassan-Wang, Hua; Dunand, Christophe; Hefer, Charles A; Bornberg-Bauer, Erich; Kersting, Anna R; Vining, Kelly; Amarasinghe, Vindhya; Ranik, Martin; Naithani, Sushma; Elser, Justin; Boyd, Alexander E; Liston, Aaron; Spatafora, Joseph W; Dharmwardhana, Palitha; Raja, Rajani; Sullivan, Christopher; Romanel, Elisson; Alves-Ferreira, Marcio; Külheim, Carsten; Foley, William; Carocha, Victor; Paiva, Jorge; Kudrna, David; Brommonschenkel, Sergio H; Pasquali, Giancarlo; Byrne, Margaret; Rigault, Philippe; Tibbits, Josquin; Spokevicius, Antanas; Jones, Rebecca C; Steane, Dorothy A; Vaillancourt, René E; Potts, Brad M; Joubert, Fourie; Barry, Kerrie; Pappas, Georgios J; Strauss, Steven H; Jaiswal, Pankaj; Grima-Pettenati, Jacqueline; Salse, Jérôme; Van de Peer, Yves; Rokhsar, Daniel S; Schmutz, Jeremy

    2014-06-19

    Eucalypts are the world's most widely planted hardwood trees. Their outstanding diversity, adaptability and growth have made them a global renewable resource of fibre and energy. We sequenced and assembled >94% of the 640-megabase genome of Eucalyptus grandis. Of 36,376 predicted protein-coding genes, 34% occur in tandem duplications, the largest proportion thus far in plant genomes. Eucalyptus also shows the highest diversity of genes for specialized metabolites such as terpenes that act as chemical defence and provide unique pharmaceutical oils. Genome sequencing of the E. grandis sister species E. globulus and a set of inbred E. grandis tree genomes reveals dynamic genome evolution and hotspots of inbreeding depression. The E. grandis genome is the first reference for the eudicot order Myrtales and is placed here sister to the eurosids. This resource expands our understanding of the unique biology of large woody perennials and provides a powerful tool to accelerate comparative biology, breeding and biotechnology.

  14. Genomic Rearrangements in Prostate Cancer

    PubMed Central

    Barbieri, Christopher E.; Rubin, Mark A.

    2014-01-01

    Purpose of review Genomic instability is a fundamental feature of human cancer, leading to the activation of oncogenes and inactivation of tumor suppressors. In prostate cancer, structural genomic rearrangements, resulting in gene fusions, amplifications and deletions, are a critical mechanism effecting these alterations. Here we review recent literature regarding the importance of genomic rearrangements in the pathogenesis of prostate cancer and the potential impact on patient care. Recent findings Next generation sequencing has revealed a striking abundance, complexity, and heterogeneity of genomic rearrangements in prostate cancer. These recent studies have nominated a number of processes in predisposing prostate cancer to genomic rearrangements, including androgen-induced transcription. Summary Structural rearrangements are the critical mechanism resulting in the characteristic genomic changes associated with prostate cancer pathogenesis and progression. Future studies will determine if the impact of these events on tumor phenotypes can be translated to clinical utility for patient prognosis and choices of management strategies. PMID:25393273

  15. Phage genomics: small is beautiful.

    PubMed

    Brüssow, Harald; Hendrix, Roger W

    2002-01-11

    The Age of Genomics dawned only gradually for bacteriophages. It was 1977 when the genome of phage phi X174 was published and 1983 when the "large" genome of phage lambda hit the streets. More recently, the pace has quickened, so that we now have over 100 complete phage genomes and can expect thousands in a very few years. These sequences have been marvelously informative for the biology of the individual phages, but with the advent of high volume sequencing technology, the real excitement for phage biology is that it is now possible to analyze the sequences together and thereby address--for the first time at whole genome resolution--a set of fundamental biological questions related to populations: What is the structure of the global phage population? What are its dynamics? How do phages evolve? This is Comparative Genomics with a capital "C".

  16. Big Data: Astronomical or Genomical?

    PubMed

    Stephens, Zachary D; Lee, Skylar Y; Faghri, Faraz; Campbell, Roy H; Zhai, Chengxiang; Efron, Miles J; Iyer, Ravishankar; Schatz, Michael C; Sinha, Saurabh; Robinson, Gene E

    2015-07-01

    Genomics is a Big Data science and is going to get much bigger, very soon, but it is not known whether the needs of genomics will exceed other Big Data domains. Projecting to the year 2025, we compared genomics with three other major generators of Big Data: astronomy, YouTube, and Twitter. Our estimates show that genomics is a "four-headed beast"--it is either on par with or the most demanding of the domains analyzed here in terms of data acquisition, storage, distribution, and analysis. We discuss aspects of new technologies that will need to be developed to rise up and meet the computational challenges that genomics poses for the near future. Now is the time for concerted, community-wide planning for the "genomical" challenges of the next decade.

  17. Fungal Genome Sequencing and Bioenergy

    SciTech Connect

    Baker, Scott E.; Thykaer, Jette; Adney, William S.; Brettin, T.; Brockman, Fred J.; D'haeseleer, Patrik; Martinez, Antonio D.; Miller, R. M.; Rokhsar, Daniel S.; Schadt, Christopher W.; Torok, Tamas; Tuskan, Gerald; Bennett, Joan W.; Berka, Randy; Briggs, Steve; Heitman, Joseph; Taylor, John; Turgeon, Barbara G.; Werner-Washburne, Maggie; Himmel, Michael E.

    2008-09-30

    To date, the number of ongoing filamentous fungal genome sequencing projects is almost tenfold fewer than those of bacterial and archaeal genome projects. The fungi chosen for sequencing represent narrow kingdom diversity; most are pathogens or models. We advocate an ambitious, forward-looking phylogenetic-based genome sequencing program, designed to capture metabolic diversity within the fungal kingdom, thereby enhancing research into alternative bioenergy sources, bioremediation, and fungal-environment interactions.

  18. Programs | Office of Cancer Genomics

    Cancer.gov

    OCG facilitates cancer genomics research through a series of highly-focused programs. These programs generate and disseminate genomic data for use by the cancer research community. OCG programs also promote advances in technology-based infrastructure and create valuable experimental reagents and tools. OCG programs encourage collaboration by interconnecting with other genomics and cancer projects in order to accelerate translation of findings into the clinic. Below are OCG’s current, completed, and initiated programs:

  19. Datasets for evolutionary comparative genomics

    PubMed Central

    Liberles, David A

    2005-01-01

    Many decisions about genome sequencing projects are directed by perceived gaps in the tree of life, or towards model organisms. With the goal of a better understanding of biology through the lens of evolution, however, there are additional genomes that are worth sequencing. One such rationale for whole-genome sequencing is discussed here, along with other important strategies for understanding the phenotypic divergence of species. PMID:16086856

  20. Genomics Nursing Faculty Champion Initiative

    PubMed Central

    Jenkins, Jean; Calzone, Kathleen A.

    2016-01-01

    Nurse faculty are challenged to keep up with the emerging and fast-paced field of genomics and the mandate to prepare the nursing workforce to be able to translate genomic research advances into routine clinical care. Using Faculty Champions and other options, the initiative stimulated curriculum development and promoted genomics curriculum integration. The authors summarize this yearlong initiative for undergraduate and graduate nursing faculty. PMID:24300251

  1. Transcriptional Analysis of Arabidopsis thaliana Response to Lima Bean Volatiles

    PubMed Central

    Zhang, Sufang; Wei, Jianing; Kang, Le

    2012-01-01

    Background Exposure of plants to herbivore-induced plant volatiles (HIPVs) alters their resistance to herbivores. However, the whole-genome transcriptional responses of treated plants remain unknown, and the signal pathways that produce HIPVs are also unclear. Methodology/Principal Findings Time course patterns of the gene expression of Arabidopsis thaliana exposed to Lima bean volatiles were examined using Affymetrix ATH1 genome arrays. Results showed that A. thaliana received and responded to leafminer-induced volatiles from Lima beans through up-regulation of genes related to the ethylene (ET) and jasmonic acid pathways. Time course analysis revealed strong and partly qualitative differences in the responses between exposure at 24 and that at 48 h. Further experiments using either A. thaliana ET mutant ein2-1 or A. thaliana jasmonic acid mutant coi1-2 indicated that both pathways are involved in the volatile response process but that the ET pathway is indispensable for detecting volatiles. Moreover, transcriptional comparisons showed that plant responses to larval feeding do not merely magnify the volatile response process. Finally, (Z)-3-hexen-ol, ocimene, (3E)-4,8-dimethyl-1,3,7-nonatriene, and (3E,7E)-4,8,12-trimethyl-1,3,7,11-tridecatetraene triggered responses in A. thaliana similar to those induced by the entire suite of Lima bean volatiles after 24 and 48 h. Conclusions/Significance This study shows that the transcriptional responses of plants to HIPVs become stronger as treatment time increases and that ET signals are critical during this process. PMID:22558246

  2. Toward nanoscale genome sequencing.

    PubMed

    Ryan, Declan; Rahimi, Maryam; Lund, John; Mehta, Ranjana; Parviz, Babak A

    2007-09-01

    This article reports on the state-of-the-art technologies that sequence DNA using miniaturized devices. The article considers the miniaturization of existing technologies for sequencing DNA and the opportunities for cost reduction that 'on-chip' devices can deliver. The ability to construct nano-scale structures and perform measurements using novel nano-scale effects has provided new opportunities to identify nucleotides directly using physical, and not chemical, methods. The challenges that these technologies need to overcome to provide a US$1000-genome sequencing technology are also presented.

  3. Genomics of Bacillus Species

    NASA Astrophysics Data System (ADS)

    Økstad, Ole Andreas; Kolstø, Anne-Brit

    Members of the genus Bacillus are rod-shaped spore-forming bacteria belonging to the Firmicutes, the low G+C gram-positive bacteria. The Bacillus genus was first described and classified by Ferdinand Cohn in Cohn (1872), and Bacillus subtilis was defined as the type species (Soule, 1932). Several Bacilli may be linked to opportunistic infections. However, pathogenicity among Bacillus spp. is mainly a feature of bacteria belonging to the Bacillus cereus group, including B. cereus, Bacillus anthracis, and Bacillus thuringiensis. Here we review the genomics of B. cereus group bacteria in relation to their roles as etiological agents of two food poisoning syndromes (emetic and diarrhoeal).

  4. Genomic medicine and neurological disease.

    PubMed

    Boone, Philip M; Wiszniewski, Wojciech; Lupski, James R

    2011-07-01

    "Genomic medicine" refers to the diagnosis, optimized management, and treatment of disease--as well as screening, counseling, and disease gene identification--in the context of information provided by an individual patient's personal genome. Genomic medicine, to some extent synonymous with "personalized medicine," has been made possible by recent advances in genome technologies. Genomic medicine represents a new approach to health care and disease management that attempts to optimize the care of a patient based upon information gleaned from his or her personal genome sequence. In this review, we describe recent progress in genomic medicine as it relates to neurological disease. Many neurological disorders either segregate as Mendelian phenotypes or occur sporadically in association with a new mutation in a single gene. Heritability also contributes to other neurological conditions that appear to exhibit more complex genetics. In addition to discussing current knowledge in this field, we offer suggestions for maximizing the utility of genomic information in clinical practice as the field of genomic medicine unfolds.

  5. Advances in yeast genome engineering.

    PubMed

    David, Florian; Siewers, Verena

    2015-02-01

    Genome engineering based on homologous recombination has been applied to yeast for many years. However, the growing importance of yeast as a cell factory in metabolic engineering and chassis in synthetic biology demands methods for fast and efficient introduction of multiple targeted changes such as gene knockouts and introduction of multistep metabolic pathways. In this review, we summarize recent improvements of existing genome engineering methods, the development of novel techniques, for example for advanced genome redesign and evolution, and the importance of endonucleases as genome engineering tools.

  6. Beyond the dna: a prototype for functional genomics

    SciTech Connect

    Albala, J

    2000-03-02

    A prototype oligonucleotide ''functional chip'' has been developed to screen novel DNA repair proteins for their ability to bind or alter different forms of DNA. This chip has been developed as a functional genomics screen for analysis of protein-DNA interactions for novel proteins identified from the Human Genome Project The process of novel gene identification that has ensued as a consequence of available sequence information is remarkable. The challenge how lies in determining the function of newly identified gene products in a time-and cost-effective high-throughput manner. The functional chip is generated by the robotic application of DNA spotted in a microarray format onto a glass slide. Individual proteins are then analyzed against the different form of DNA bound to the slide. Several prototype functional chips were designed to contain various DNA fragments tethered to a glass slide for analysis of protein-DNA binding or enzymatic activity of known proteins. The technology has been developed to screen novel, putative DNA repair proteins for their ability to bind various types of DNA alone and in concert with protein partners. An additional scheme has been devised to screen putative repair enzymes for their ability to process different types of DNA molecules. Current methods to analyze gene expression primarily utilize either of two technologies. The oligonucleotide chip, pioneered by Fodor and co-workers and Affymetrix, Inc., consists of greater than 64,000 oligonucleotides attached in situ to a glass support. The oligonucleotide chip has been used primarily to identify specific mutations in a given gene by hybridization against a fluorescently-labeled substrate. The second method is the microarray, whereby DNA targets are systematically arranged on a glass slide and then hybridized with fluorescently-labeled complex targets for gene expression analysis (Jordan, 1998). By this technique, a large amount of information can be obtained examining global

  7. Genomics of apicomplexan parasites.

    PubMed

    Swapna, Lakshmipuram Seshadri; Parkinson, John

    2017-02-22

    The increasing prevalence of infections involving intracellular apicomplexan parasites such as Plasmodium, Toxoplasma, and Cryptosporidium (the causative agents of malaria, toxoplasmosis, and cryptosporidiosis, respectively) represent a significant global healthcare burden. Despite their significance, few treatments are available; a situation that is likely to deteriorate with the emergence of new resistant strains of parasites. To lay the foundation for programs of drug discovery and vaccine development, genome sequences for many of these organisms have been generated, together with large-scale expression and proteomic datasets. Comparative analyses of these datasets are beginning to identify the molecular innovations supporting both conserved processes mediating fundamental roles in parasite survival and persistence, as well as lineage-specific adaptations associated with divergent life-cycle strategies. The challenge is how best to exploit these data to derive insights into parasite virulence and identify those genes representing the most amenable targets. In this review, we outline genomic datasets currently available for apicomplexans and discuss biological insights that have emerged as a consequence of their analysis. Of particular interest are systems-based resources, focusing on areas of metabolism and host invasion that are opening up opportunities for discovering new therapeutic targets.

  8. Genomics of Myeloproliferative Neoplasms.

    PubMed

    Zoi, Katerina; Cross, Nicholas C P

    2017-03-20

    Myeloproliferative neoplasms (MPNs) are a group of related clonal hematologic disorders characterized by excess accumulation of one or more myeloid cell lineages and a tendency to transform to acute myeloid leukemia. Deregulated JAK2 signaling has emerged as the central phenotypic driver of BCR -ABL1-negative MPNs and a unifying therapeutic target. In addition, MPNs show unexpected layers of genetic complexity, with multiple abnormalities associated with disease progression, interactions between inherited factors and phenotype driver mutations, and effects related to the order in which mutations are acquired. Although morphology and clinical laboratory analysis continue to play an important role in defining these conditions, genomic analysis is providing a platform for better disease definition, more accurate diagnosis, direction of therapy, and refined prognostication. There is an emerging consensus with regard to many prognostic factors, but there is a clear need to synthesize genomic findings into robust, clinically actionable and widely accepted scoring systems as well as the need to standardize the laboratory methodologies that are used.

  9. Parsing of genomic graffiti

    SciTech Connect

    Tibbetts, C.; Golden, J. III; Torgersen, D.

    1996-12-31

    A focal point of modern biology is investigation of wide varieties of phenomena at the level of molecular genetics. The nucleotide sequences of deoxyribonucleic acid (DNA) and ribonucleic acid (RNA) define the ultimate resolution of this reductionist approach to understand the determinants of heritable traits. The structure and function of genes, their composite genomic organization, and their regulated expression have been studied in systems representing every class of organism. Many human diseases or pathogenic syndromes can be directly attributed to inherited defects in either the regulated expression, or the quality of the products of specific genes. Genetic determinants of susceptibility to infectious agents or environmental hazards are amply documented. Mapping and sequencing of the DNA molecules encoding human genes have provided powerful technology for pharmaceutical bioengineering and forensic investigations. From an alternative perspective, we may anticipate that voluminous archives of singular DNA sequences alone will not suffice to define and understand the functional determinants of genome organization, allelic diversity and evolutionary plasticity of living organisms. New insights will accumulate pertaining to human evolutionary origins and relationships of human biology to models based on other mammals. Investigators of population genetics and epidemiology now exploit the technology of molecular genetics to more powerfully probe variation within the human gene pool at the level of DNA sequences. 40 refs., 7 figs., 2 tabs.

  10. Finding the Alloy Genome

    NASA Astrophysics Data System (ADS)

    Hart, Gus L. W.; Nelson, Lance J.; Zhou, Fei; Ozolins, Vidvuds

    2012-10-01

    First-principles codes can nowadays provide hundreds of high-fidelity enthalpies on thousands of alloy systems with a modest investment of a few tens of millions of CPU hours. But a mere database of enthalpies provides only the starting point for uncovering the ``alloy genome.'' What one needs to fundamentally change alloy discovery and design are complete searches over candidate structures (not just hundreds of known experimental phases) and models that can be used to simulate both kinetics and thermodynamics. Despite more than a decade of effort by many groups, developing robust models for these simulations is still a human-time-intensive endeavor. Compressive sensing solves this problem in dramatic fashion by automatically extracting the ``sparse model'' of an alloy in only minutes. This new paradigm to model building has enabled a new framework that will uncover, automatically and in a general way across the periodic table, the important components of such models and reveal the underlying ``genome'' of alloy physics.

  11. A Taste of Algal Genomes from the Joint Genome Institute

    SciTech Connect

    Kuo, Alan; Grigoriev, Igor

    2012-06-17

    Algae play profound roles in aquatic food chains and the carbon cycle, can impose health and economic costs through toxic blooms, provide models for the study of symbiosis, photosynthesis, and eukaryotic evolution, and are candidate sources for bio-fuels; all of these research areas are part of the mission of DOE's Joint Genome Institute (JGI). To date JGI has sequenced, assembled, annotated, and released to the public the genomes of 18 species and strains of algae, sampling almost all of the major clades of photosynthetic eukaryotes. With more algal genomes currently undergoing analysis, JGI continues its commitment to driving forward basic and applied algal science. Among these ongoing projects are the pan-genome of the dominant coccolithophore Emiliania huxleyi, the interrelationships between the 4 genomes in the nucleomorph-containing Bigelowiella natans and Guillardia theta, and the search for symbiosis genes of lichens.

  12. Human Genome Program Image Gallery (from genomics.energy.gov)

    DOE Data Explorer

    This collection contains approximately 240 images from the genome programs of DOE's Office of Science. The images are divided into galleries related to biofuels research, systems biology, and basic genomics. Each image has a title, a basic citation, and a credit or source. Most of the images are original graphics created by the Genome Management Information System (GMIS). GMIS images are recognizable by their credit line. Permission to use these graphics is not needed, but please credit the U.S. Department of Energy Genome Programs and provide the website http://genomics.energy.gov. Other images were provided by third parties and not created by the U.S. Department of Energy. Users must contact the person listed in the credit line before using those images. The high-resolution images can be downloaded.

  13. Genome Wide Association Study to Identify Single Nucleotide Polymorphisms (SNPs) Associated with the Development of Erectile Dysfunction in African-American Men Following Radiotherapy for Prostate Cancer

    PubMed Central

    Kerns, Sarah L.; Ostrer, Harry; Stock, Richard; Li, William; Moore, Julian; Pearlman, Alexander; Campbell, Christopher; Shao, Yongzhao; Stone, Nelson; Kusnetz, Lynda; Rosenstein, Barry S.

    2010-01-01

    Purpose To identify single nucleotide polymorphisms (SNPs) associated with erectile dysfunction (ED) among African American prostate cancer patients treated with external beam radiation therapy (EBRT). Methods and Materials A cohort of African American prostate cancer patients treated with EBRT was followed for development of ED using the five-item Sexual Health Inventory for Men (SHIM) questionnaire. Final analysis included 27 cases (post-treatment SHIM score ≤ 7) and 52 controls (post-treatment SHIM score ≥ 16). A genome-wide association study was performed using ∼909,000 SNPs genotyped on Affymetrix 6.0 arrays. Results We identified SNP rs2268363, located in the follicle stimulating hormone receptor (FSHR) gene, as significantly associated with ED after correcting for multiple comparisons (unadjusted p-value = 5.46×10−8; Bonferroni p-value = 0.028). We identified four additional SNPs that tended toward significant association with unadjusted p-value < 10−06. Inference of population substructure revealed that cases had a higher proportion of African ancestry compared to controls (77% compared to 60%, p=0.005). A multivariate logistic regression model that incorporated estimated ancestry and four of the top-ranked SNPs was a more accurate classifier of ED than a model that included only clinical variables. Conclusions To the best of our knowledge, this is the first genome wide association study to identify SNPs associated with adverse effects resulting from radiotherapy. It is important to note that the SNP that proved significantly associated with ED is located within a gene whose encoded product plays a role in male gonad development and function. Another key finding of this project is that the four SNPs most strongly associated with ED were specific to people of African ancestry and would therefore not have been identified had a cohort of European ancestry been screened. This study demonstrates the feasibility of a genome-wide approach to investigate

  14. A 2-Stage Genome-Wide Association Study to Identify Single Nucleotide Polymorphisms Associated With Development of Erectile Dysfunction Following Radiation Therapy for Prostate Cancer

    SciTech Connect

    Kerns, Sarah L.; Stock, Richard; Stone, Nelson; Buckstein, Michael; Shao, Yongzhao; Campbell, Christopher; Rath, Lynda; De Ruysscher, Dirk; Lammering, Guido; Hixson, Rosetta; Cesaretti, Jamie; Terk, Mitchell; Ostrer, Harry; Rosenstein, Barry S.

    2013-01-01

    Purpose: To identify single nucleotide polymorphisms (SNPs) associated with development of erectile dysfunction (ED) among prostate cancer patients treated with radiation therapy. Methods and Materials: A 2-stage genome-wide association study was performed. Patients were split randomly into a stage I discovery cohort (132 cases, 103 controls) and a stage II replication cohort (128 cases, 102 controls). The discovery cohort was genotyped using Affymetrix 6.0 genome-wide arrays. The 940 top ranking SNPs selected from the discovery cohort were genotyped in the replication cohort using Illumina iSelect custom SNP arrays. Results: Twelve SNPs identified in the discovery cohort and validated in the replication cohort were associated with development of ED following radiation therapy (Fisher combined P values 2.1 Multiplication-Sign 10{sup -5} to 6.2 Multiplication-Sign 10{sup -4}). Notably, these 12 SNPs lie in or near genes involved in erectile function or other normal cellular functions (adhesion and signaling) rather than DNA damage repair. In a multivariable model including nongenetic risk factors, the odds ratios for these SNPs ranged from 1.6 to 5.6 in the pooled cohort. There was a striking relationship between the cumulative number of SNP risk alleles an individual possessed and ED status (Sommers' D P value = 1.7 Multiplication-Sign 10{sup -29}). A 1-allele increase in cumulative SNP score increased the odds for developing ED by a factor of 2.2 (P value = 2.1 Multiplication-Sign 10{sup -19}). The cumulative SNP score model had a sensitivity of 84% and specificity of 75% for prediction of developing ED at the radiation therapy planning stage. Conclusions: This genome-wide association study identified a set of SNPs that are associated with development of ED following radiation therapy. These candidate genetic predictors warrant more definitive validation in an independent cohort.

  15. OryzaGenome: Genome Diversity Database of Wild Oryza Species.

    PubMed

    Ohyanagi, Hajime; Ebata, Toshinobu; Huang, Xuehui; Gong, Hao; Fujita, Masahiro; Mochizuki, Takako; Toyoda, Atsushi; Fujiyama, Asao; Kaminuma, Eli; Nakamura, Yasukazu; Feng, Qi; Wang, Zi-Xuan; Han, Bin; Kurata, Nori

    2016-01-01

    The species in the genus Oryza, encompassing nine genome types and 23 species, are a rich genetic resource and may have applications in deeper genomic analyses aiming to understand the evolution of plant genomes. With the advancement of next-generation sequencing (NGS) technology, a flood of Oryza species reference genomes and genomic variation information has become available in recent years. This genomic information, combined with the comprehensive phenotypic information that we are accumulating in our Oryzabase, can serve as an excellent genotype-phenotype association resource for analyzing rice functional and structural evolution, and the associated diversity of the Oryza genus. Here we integrate our previous and future phenotypic/habitat information and newly determined genotype information into a united repository, named OryzaGenome, providing the variant information with hyperlinks to Oryzabase. The current version of OryzaGenome includes genotype information of 446 O. rufipogon accessions derived by imputation and of 17 accessions derived by imputation-free deep sequencing. Two variant viewers are implemented: SNP Viewer as a conventional genome browser interface and Variant Table as a text-based browser for precise inspection of each variant one by one. Portable VCF (variant call format) file or tab-delimited file download is also available. Following these SNP (single nucleotide polymorphism) data, reference pseudomolecules/scaffolds/contigs and genome-wide variation information for almost all of the closely and distantly related wild Oryza species from the NIG Wild Rice Collection will be available in future releases. All of the resources can be accessed through http://viewer.shigen.info/oryzagenome/.

  16. Venturia carpophila draft genome sequence

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Venturia carpophila causes peach scab, a disease that renders peach fruit unmarketable. We report a high-quality draft genome sequence (36.9 Mb) of V. carpophila from an isolate collected from a peach tree in central Georgia in the United States. The genome sequence described will be a useful resour...

  17. Surveying Breast Cancer's Genomic Landscape.

    PubMed

    2016-07-01

    An in-depth analysis has produced the most comprehensive portrait to date of the myriad genomic alterations involved in breast cancer. In sequencing the whole genomes of 560 breast cancers and combining this information with published data from another 772 breast tumors, the research team uncovered several new genes and mutational signatures that potentially influence this disease.

  18. Cocoa/Cotton Comparative Genomics

    Technology Transfer Automated Retrieval System (TEKTRAN)

    With genome sequence from two members of the Malvaceae family recently made available, we are exploring syntenic relationships, gene content, and evolutionary trajectories between the cacao and cotton genomes. An assembly of cacao (Theobroma cacao) using Illumina and 454 sequence technology yielded ...

  19. The Atlas Genome Assembly System

    PubMed Central

    Havlak, Paul; Chen, Rui; Durbin, K. James; Egan, Amy; Ren, Yanru; Song, Xing-Zhi; Weinstock, George M.; Gibbs, Richard A.

    2004-01-01

    Atlas is a suite of programs developed for assembly of genomes by a “combined approach” that uses DNA sequence reads from both BACs and whole-genome shotgun (WGS) libraries. The BAC clones afford advantages of localized assembly with reduced computational load, and provide a robust method for dealing with repeated sequences. Inclusion of WGS sequences facilitates use of different clone insert sizes and reduces data production costs. A core function of Atlas software is recruitment of WGS sequences into appropriate BACs based on sequence overlaps. Because construction of consensus sequences is from local assembly of these reads, only small (<0.1%) units of the genome are assembled at a time. Once assembled, each BAC is used to derive a genomic layout. This “sequence-based” growth of the genome map has greater precision than with non-sequence-based methods. Use of BACs allows correction of artifacts due to repeats at each stage of the process. This is aided by ancillary data such as BAC fingerprint, other genomic maps, and syntenic relations with other genomes. Atlas was used to assemble a draft DNA sequence of the rat genome; its major components including overlapper and split-scaffold are also being used in pure WGS projects. PMID:15060016

  20. How Can Genomics Inform Education?

    ERIC Educational Resources Information Center

    Grigorenko, Elena L.

    2007-01-01

    This article offers some thoughts on possible connections between genomics and education. Genomics is already revolutionizing the way medical care is delivered and distributed; it will inevitably affect children's developmental trajectories by introducing more pharmacological and behavioral therapies. Educators should be prepared to understand the…

  1. Genomics and proteomics in cancer.

    PubMed

    Baak, J P A; Path, F R C; Hermsen, M A J A; Meijer, G; Schmidt, J; Janssen, E A M

    2003-06-01

    Cancer development is driven by the accumulation of DNA changes in the approximately 40000 chromosomal genes. In solid tumours, chromosomal numerical/structural aberrations are common. DNA repair defects may lead to genome-wide genetic instability, which can drive further cancer progression. The genes code the actual players in the cellular processes, the 100000-10 million proteins, which in (pre)malignant cells can also be altered in a variety of ways. Over the past decade, our knowledge of the human genome and Genomics (the study of the human genome) in (pre)malignancies has increased enormously and Proteomics (the analysis of the protein complement of the genome) has taken off as well. Both will play an increasingly important role. In this article, a short description of the essential molecular biological cell processes is given. Important genomic and proteomic research methods are described and illustrated. Applications are still limited, but the evidence so far is exciting. Will genomics replace classical diagnostic or prognostic procedures? In breast cancers, the gene expression array is stronger than classical criteria, but in endometrial hyperplasia, quantitative morphological features are more cost-effective than genetic testing. It is still too early to make strong statements, the more so because it is expected that genomics and proteomics will expand rapidly. However, it is likely that they will take a central place in the understanding, diagnosis, monitoring and treatment of (pre)cancers of many different sites.

  2. Fueling Future with Algal Genomics

    SciTech Connect

    Grigoriev, Igor

    2012-07-05

    Algae constitute a major component of fundamental eukaryotic diversity, play profound roles in the carbon cycle, and are prominent candidates for biofuel production. The US Department of Energy Joint Genome Institute (JGI) is leading the world in algal genome sequencing (http://jgi.doe.gov/Algae) and contributes of the algal genome projects worldwide (GOLD database, 2012). The sequenced algal genomes offer catalogs of genes, networks, and pathways. The sequenced first of its kind genomes of a haptophyte E.huxleyii, chlorarachniophyte B.natans, and cryptophyte G.theta fill the gaps in the eukaryotic tree of life and carry unique genes and pathways as well as molecular fossils of secondary endosymbiosis. Natural adaptation to conditions critical for industrial production is encoded in algal genomes, for example, growth of A.anophagefferens at very high cell densities during the harmful algae blooms or a global distribution across diverse environments of E.huxleyii, able to live on sparse nutrients due to its expanded pan-genome. Communications and signaling pathways can be derived from simple symbiotic systems like lichens or complex marine algae metagenomes. Collectively these datasets derived from algal genomics contribute to building a comprehensive parts list essential for algal biofuel development.

  3. Crop genomics: advances and applications

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The completion of reference genome sequences for many important crops and the ability to perform high-throughput resequencing are providing opportunities for improving our understanding of the history of plant domestication and to accelerate crop improvement. Crop plant comparative genomics is being...

  4. Genome editing in cardiovascular diseases.

    PubMed

    Strong, Alanna; Musunuru, Kiran

    2017-01-01

    Genome-editing tools, which include zinc finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs), and clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated 9 (Cas9) systems, have emerged as an invaluable technology to achieve somatic and germline genomic manipulation in cells and model organisms for multiple applications, including the creation of knockout alleles, introducing desired mutations into genomic DNA, and inserting novel transgenes. Genome editing is being rapidly adopted into all fields of biomedical research, including the cardiovascular field, where it has facilitated a greater understanding of lipid metabolism, electrophysiology, cardiomyopathies, and other cardiovascular disorders, has helped to create a wider variety of cellular and animal models, and has opened the door to a new class of therapies. In this Review, we discuss the applications of genome-editing technology throughout cardiovascular disease research and the prospect of in vivo genome-editing therapies in the future. We also describe some of the existing limitations of genome-editing tools that will need to be addressed if cardiovascular genome editing is to achieve its full scientific and therapeutic potential.

  5. A Million Cancer Genome Warehouse

    DTIC Science & Technology

    2012-11-20

    Fitzpatrick, A. L., Agrawal, A., Barnes, K., Boyd, H. A., et al. (2011). Phenotype harmonization and cross‐study collaboration in GWAS consortia...Genome Warehouse is performing genome- wide association studies ( GWAS ) of both common and rare inherited single nucleotide polymorphisms (SNPs) to compare

  6. All about the Human Genome Project (HGP)

    MedlinePlus

    ... Genome Resources Access to the full human sequence All About The Human Genome Project (HGP) The Human ... an international research effort to sequence and map all of the genes - together known as the genome - ...

  7. International genomic evaluation methods for dairy cattle

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Background Genomic evaluations are rapidly replacing traditional evaluation systems used for dairy cattle selection. Economies of scale in genomics promote cooperation across country borders. Genomic information can be transferred across countries using simple conversion equations, by modifying mult...

  8. Advances in targeted genome editing.

    PubMed

    Perez-Pinera, Pablo; Ousterout, David G; Gersbach, Charles A

    2012-08-01

    New technologies have recently emerged that enable targeted editing of genomes in diverse systems. This includes precise manipulation of gene sequences in their natural chromosomal context and addition of transgenes to specific genomic loci. This progress has been facilitated by advances in engineering targeted nucleases with programmable, site-specific DNA-binding domains, including zinc finger proteins and transcription activator-like effectors (TALEs). Recent improvements have enhanced nuclease performance, accelerated nuclease assembly, and lowered the cost of genome editing. These advances are driving new approaches to many areas of biotechnology, including biopharmaceutical production, agriculture, creation of transgenic organisms and cell lines, and studies of genome structure, regulation, and function. Genome editing is also being investigated in preclinical and clinical gene therapies for many diseases.

  9. Pathophysiology of MDS: genomic aberrations.

    PubMed

    Ichikawa, Motoshi

    Myelodysplastic syndromes (MDS) are characterized by clonal proliferation of hematopoietic stem/progenitor cells and their apoptosis, and show a propensity to progress to acute myelogenous leukemia (AML). Although MDS are recognized as neoplastic diseases caused by genomic aberrations of hematopoietic cells, the details of the genetic abnormalities underlying disease development have not as yet been fully elucidated due to difficulties in analyzing chromosomal abnormalities. Recent advances in comprehensive analyses of disease genomes including whole-genome sequencing technologies have revealed the genomic abnormalities in MDS. Surprisingly, gene mutations were found in approximately 80-90% of cases with MDS, and the novel mutations discovered with these technologies included previously unknown, MDS-specific, mutations such as those of the genes in the RNA-splicing machinery. It is anticipated that these recent studies will shed new light on the pathophysiology of MDS due to genomic aberrations.

  10. Big Data: Astronomical or Genomical?

    PubMed Central

    Stephens, Zachary D.; Lee, Skylar Y.; Faghri, Faraz; Campbell, Roy H.; Zhai, Chengxiang; Efron, Miles J.; Iyer, Ravishankar; Schatz, Michael C.; Sinha, Saurabh; Robinson, Gene E.

    2015-01-01

    Genomics is a Big Data science and is going to get much bigger, very soon, but it is not known whether the needs of genomics will exceed other Big Data domains. Projecting to the year 2025, we compared genomics with three other major generators of Big Data: astronomy, YouTube, and Twitter. Our estimates show that genomics is a “four-headed beast”—it is either on par with or the most demanding of the domains analyzed here in terms of data acquisition, storage, distribution, and analysis. We discuss aspects of new technologies that will need to be developed to rise up and meet the computational challenges that genomics poses for the near future. Now is the time for concerted, community-wide planning for the “genomical” challenges of the next decade. PMID:26151137

  11. Sequencing the maize genome.

    PubMed

    Martienssen, Robert A; Rabinowicz, Pablo D; O'Shaughnessy, Andrew; McCombie, W Richard

    2004-04-01

    Sequencing of complex genomes can be accomplished by enriching shotgun libraries for genes. In maize, gene-enrichment by copy-number normalization (high C(0)t) and methylation filtration (MF) have been used to generate up to two-fold coverage of the gene-space with less than 1 million sequencing reads. Simulations using sequenced bacterial artificial chromosome (BAC) clones predict that 5x coverage of gene-rich regions, accompanied by less than 1x coverage of subclones from BAC contigs, will generate high-quality mapped sequence that meets the needs of geneticists while accommodating unusually high levels of structural polymorphism. By sequencing several inbred strains, we propose a strategy for capturing this polymorphism to investigate hybrid vigor or heterosis.

  12. Genomics in neurological disorders.

    PubMed

    Han, Guangchun; Sun, Jiya; Wang, Jiajia; Bai, Zhouxian; Song, Fuhai; Lei, Hongxing

    2014-08-01

    Neurological disorders comprise a variety of complex diseases in the central nervous system, which can be roughly classified as neurodegenerative diseases and psychiatric disorders. The basic and translational research of neurological disorders has been hindered by the difficulty in accessing the pathological center (i.e., the brain) in live patients. The rapid advancement of sequencing and array technologies has made it possible to investigate the disease mechanism and biomarkers from a systems perspective. In this review, recent progresses in the discovery of novel risk genes, treatment targets and peripheral biomarkers employing genomic technologies will be discussed. Our major focus will be on two of the most heavily investigated neurological disorders, namely Alzheimer's disease and autism spectrum disorder.

  13. Genomics of sex determination.

    PubMed

    Zhang, Jisen; Boualem, Adnane; Bendahmane, Abdelhafid; Ming, Ray

    2014-04-01

    Sex determination is a major switch in the evolutionary history of angiosperm, resulting 11% monoecious and dioecious species. The genomic sequences of papaya sex chromosomes unveiled the molecular basis of recombination suppression in the sex determination region, and candidate genes for sex determination. Identification and analyses of sex determination genes in cucurbits and maize demonstrated conservation of sex determination mechanism in one lineage and divergence between the two systems. Epigenetic control and hormonal influence of sex determination were elucidated in both plants and animals. Intensive investigation of potential sex determination genes in model species will improve our understanding of sex determination gene network. Such network will in turn accelerate the identification of sex determination genes in dioecious species with sex chromosomes, which are burdensome due to no recombination in sex determining regions. The sex determination genes in dioecious species are crucial for understanding the origin of dioecy and sex chromosomes, particularly in their early stage of evolution.

  14. Privacy in the Genomic Era.

    PubMed

    Naveed, Muhammad; Ayday, Erman; Clayton, Ellen W; Fellay, Jacques; Gunter, Carl A; Hubaux, Jean-Pierre; Malin, Bradley A; Wang, Xiaofeng

    2015-09-01

    Genome sequencing technology has advanced at a rapid pace and it is now possible to generate highly-detailed genotypes inexpensively. The collection and analysis of such data has the potential to support various applications, including personalized medical services. While the benefits of the genomics revolution are trumpeted by the biomedical community, the increased availability of such data has major implications for personal privacy; notably because the genome has certain essential features, which include (but are not limited to) (i) an association with traits and certain diseases, (ii) identification capability (e.g., forensics), and (iii) revelation of family relationships. Moreover, direct-to-consumer DNA testing increases the likelihood that genome data will be made available in less regulated environments, such as the Internet and for-profit companies. The problem of genome data privacy thus resides at the crossroads of computer science, medicine, and public policy. While the computer scientists have addressed data privacy for various data types, there has been less attention dedicated to genomic data. Thus, the goal of this paper is to provide a systematization of knowledge for the computer science community. In doing so, we address some of the (sometimes erroneous) beliefs of this field and we report on a survey we conducted about genome data privacy with biomedical specialists. Then, after characterizing the genome privacy problem, we review the state-of-the-art regarding privacy attacks on genomic data and strategies for mitigating such attacks, as well as contextualizing these attacks from the perspective of medicine and public policy. This paper concludes with an enumeration of the challenges for genome data privacy and presents a framework to systematize the analysis of threats and the design of countermeasures as the field moves forward.

  15. Recombination Drives Vertebrate Genome Contraction

    PubMed Central

    Nam, Kiwoong; Ellegren, Hans

    2012-01-01

    Selective and/or neutral processes may govern variation in DNA content and, ultimately, genome size. The observation in several organisms of a negative correlation between recombination rate and intron size could be compatible with a neutral model in which recombination is mutagenic for length changes. We used whole-genome data on small insertions and deletions within transposable elements from chicken and zebra finch to demonstrate clear links between recombination rate and a number of attributes of reduced DNA content. Recombination rate was negatively correlated with the length of introns, transposable elements, and intergenic spacer and with the rate of short insertions. Importantly, it was positively correlated with gene density, the rate of short deletions, the deletion bias, and the net change in sequence length. All these observations point at a pattern of more condensed genome structure in regions of high recombination. Based on the observed rates of small insertions and deletions and assuming that these rates are representative for the whole genome, we estimate that the genome of the most recent common ancestor of birds and lizards has lost nearly 20% of its DNA content up until the present. Expansion of transposable elements can counteract the effect of deletions in an equilibrium mutation model; however, since the activity of transposable elements has been low in the avian lineage, the deletion bias is likely to have had a significant effect on genome size evolution in dinosaurs and birds, contributing to the maintenance of a small genome. We also demonstrate that most of the observed correlations between recombination rate and genome contraction parameters are seen in the human genome, including for segregating indel polymorphisms. Our data are compatible with a neutral model in which recombination drives vertebrate genome size evolution and gives no direct support for a role of natural selection in this process. PMID:22570634

  16. Privacy in the Genomic Era

    PubMed Central

    NAVEED, MUHAMMAD; AYDAY, ERMAN; CLAYTON, ELLEN W.; FELLAY, JACQUES; GUNTER, CARL A.; HUBAUX, JEAN-PIERRE; MALIN, BRADLEY A.; WANG, XIAOFENG

    2015-01-01

    Genome sequencing technology has advanced at a rapid pace and it is now possible to generate highly-detailed genotypes inexpensively. The collection and analysis of such data has the potential to support various applications, including personalized medical services. While the benefits of the genomics revolution are trumpeted by the biomedical community, the increased availability of such data has major implications for personal privacy; notably because the genome has certain essential features, which include (but are not limited to) (i) an association with traits and certain diseases, (ii) identification capability (e.g., forensics), and (iii) revelation of family relationships. Moreover, direct-to-consumer DNA testing increases the likelihood that genome data will be made available in less regulated environments, such as the Internet and for-profit companies. The problem of genome data privacy thus resides at the crossroads of computer science, medicine, and public policy. While the computer scientists have addressed data privacy for various data types, there has been less attention dedicated to genomic data. Thus, the goal of this paper is to provide a systematization of knowledge for the computer science community. In doing so, we address some of the (sometimes erroneous) beliefs of this field and we report on a survey we conducted about genome data privacy with biomedical specialists. Then, after characterizing the genome privacy problem, we review the state-of-the-art regarding privacy attacks on genomic data and strategies for mitigating such attacks, as well as contextualizing these attacks from the perspective of medicine and public policy. This paper concludes with an enumeration of the challenges for genome data privacy and presents a framework to systematize the analysis of threats and the design of countermeasures as the field moves forward. PMID:26640318

  17. Genome-Wide Gene-Sodium Interaction Analyses on Blood Pressure: The Genetic Epidemiology Network of Salt-Sensitivity Study.

    PubMed

    Li, Changwei; He, Jiang; Chen, Jing; Zhao, Jinying; Gu, Dongfeng; Hixson, James E; Rao, Dabeeru C; Jaquish, Cashell E; Gu, Charles C; Chen, Jichun; Huang, Jianfeng; Chen, Shufeng; Kelly, Tanika N

    2016-08-01

    We performed genome-wide analyses to identify genomic loci that interact with sodium to influence blood pressure (BP) using single-marker-based (1 and 2 df joint tests) and gene-based tests among 1876 Chinese participants of the Genetic Epidemiology Network of Salt-Sensitivity (GenSalt) study. Among GenSalt participants, the average of 3 urine samples was used to estimate sodium excretion. Nine BP measurements were taken using a random zero sphygmomanometer. A total of 2.05 million single-nucleotide polymorphisms were imputed using Affymetrix 6.0 genotype data and the Chinese Han of Beijing and Japanese of Tokyo HapMap reference panel. Promising findings (P<1.00×10(-4)) from GenSalt were evaluated for replication among 775 Chinese participants of the Multi-Ethnic Study of Atherosclerosis (MESA). Single-nucleotide polymorphism and gene-based results were meta-analyzed across the GenSalt and MESA studies to determine genome-wide significance. The 1 df tests identified interactions for UST rs13211840 on diastolic BP (P=3.13×10(-9)). The 2 df tests additionally identified associations for CLGN rs2567241 (P=3.90×10(-12)) and LOC105369882 rs11104632 (P=4.51×10(-8)) with systolic BP. The CLGN variant rs2567241 was also associated with diastolic BP (P=3.11×10(-22)) and mean arterial pressure (P=2.86×10(-15)). Genome-wide gene-based analysis identified MKNK1 (P=6.70×10(-7)), C2orf80 (P<1.00×10(-12)), EPHA6 (P=2.88×10(-7)), SCOC-AS1 (P=4.35×10(-14)), SCOC (P=6.46×10(-11)), CLGN (P=3.68×10(-13)), MGAT4D (P=4.73×10(-11)), ARHGAP42 (P≤1.00×10(-12)), CASP4 (P=1.31×10(-8)), and LINC01478 (P=6.75×10(-10)) that were associated with at least 1 BP phenotype. In summary, we identified 8 novel and 1 previously reported BP loci through the examination of single-nucleotide polymorphism and gene-based interactions with sodium.

  18. Wheat Landrace Genome Diversity.

    PubMed

    Wingen, Luzie U; West, Claire; Leverington-Waite, Michelle; Collier, Sarah; Orford, Simon; Goram, Richard; Yang, Cai-Yun; King, Julie; Allen, Alexandra M; Burridge, Amanda; Edwards, Keith J; Griffiths, Simon

    2017-04-01

    Understanding the genomic complexity of bread wheat (Triticum aestivum L.) is a cornerstone in the quest to unravel the processes of domestication and the following adaptation of domesticated wheat to a wide variety of environments across the globe. Additionally, it is of importance for future improvement of the crop, particularly in the light of climate change. Focusing on the adaptation after domestication, a nested association mapping (NAM) panel of 60 segregating biparental populations was developed, mainly involving landrace accessions from the core set of the Watkins hexaploid wheat collection optimized for genetic diversity. A modern spring elite variety, "Paragon," was used as common reference parent. Genetic maps were constructed following identical rules to make them comparable. In total, 1611 linkage groups were identified, based on recombination from an estimated 126,300 crossover events over the whole NAM panel. A consensus map, named landrace consensus map (LRC), was constructed and contained 2498 genetic loci. These newly developed genetics tools were used to investigate the rules underlying genome fluidity or rigidity, e.g., by comparing marker distances and marker orders. In general, marker order was highly correlated, which provides support for strong synteny between bread wheat accessions. However, many exceptional cases of incongruent linkage groups and increased marker distances were also found. Segregation distortion was detected for many markers, sometimes as hot spots present in different populations. Furthermore, evidence for translocations in at least 36 of the maps was found. These translocations fell, in general, into many different translocation classes, but a few translocation classes were found in several accessions, the most frequent one being the well-known T5B:7B translocation. Loci involved in recombination rate, which is an interesting trait for plant breeding, were identified by QTL analyses using the crossover counts as a trait

  19. Wheat Landrace Genome Diversity

    PubMed Central

    Wingen, Luzie U.; West, Claire; Leverington-Waite, Michelle; Collier, Sarah; Orford, Simon; Goram, Richard; Yang, Cai-Yun; King, Julie; Allen, Alexandra M.; Burridge, Amanda; Edwards, Keith J.; Griffiths, Simon

    2017-01-01

    Understanding the genomic complexity of bread wheat (Triticum aestivum L.) is a cornerstone in the quest to unravel the processes of domestication and the following adaptation of domesticated wheat to a wide variety of environments across the globe. Additionally, it is of importance for future improvement of the crop, particularly in the light of climate change. Focusing on the adaptation after domestication, a nested association mapping (NAM) panel of 60 segregating biparental populations was developed, mainly involving landrace accessions from the core set of the Watkins hexaploid wheat collection optimized for genetic diversity. A modern spring elite variety, “Paragon,” was used as common reference parent. Genetic maps were constructed following identical rules to make them comparable. In total, 1611 linkage groups were identified, based on recombination from an estimated 126,300 crossover events over the whole NAM panel. A consensus map, named landrace consensus map (LRC), was constructed and contained 2498 genetic loci. These newly developed genetics tools were used to investigate the rules underlying genome fluidity or rigidity, e.g., by comparing marker distances and marker orders. In general, marker order was highly correlated, which provides support for strong synteny between bread wheat accessions. However, many exceptional cases of incongruent linkage groups and increased marker distances were also found. Segregation distortion was detected for many markers, sometimes as hot spots present in different populations. Furthermore, evidence for translocations in at least 36 of the maps was found. These translocations fell, in general, into many different translocation classes, but a few translocation classes were found in several accessions, the most frequent one being the well-known T5B:7B translocation. Loci involved in recombination rate, which is an interesting trait for plant breeding, were identified by QTL analyses using the crossover counts as a

  20. Bovine Genome Database: integrated tools for genome annotation and discovery.

    PubMed

    Childers, Christopher P; Reese, Justin T; Sundaram, Jaideep P; Vile, Donald C; Dickens, C Michael; Childs, Kevin L; Salih, Hanni; Bennett, Anna K; Hagen, Darren E; Adelson, David L; Elsik, Christine G

    2011-01-01

    The Bovine Genome Database (BGD; http://BovineGenome.org) strives to improve annotation of the bovine genome and to integrate the genome sequence with other genomics data. BGD includes GBrowse genome browsers, the Apollo Annotation Editor, a quantitative trait loci (QTL) viewer, BLAST databases and gene pages. Genome browsers, available for both scaffold and chromosome coordinate systems, display the bovine Official Gene Set (OGS), RefSeq and Ensembl gene models, non-coding RNA, repeats, pseudogenes, single-nucleotide polymorphism, markers, QTL and alignments to complementary DNAs, ESTs and protein homologs. The Bovine QTL viewer is connected to the BGD Chromosome GBrowse, allowing for the identification of candidate genes underlying QTL. The Apollo Annotation Editor connects directly to the BGD Chado database to provide researchers with remote access to gene evidence in a graphical interface that allows editing and creating new gene models. Researchers may upload their annotations to the BGD server for review and integration into the subsequent release of the OGS. Gene pages display information for individual OGS gene models, including gene structure, transcript variants, functional descriptions, gene symbols, Gene Ontology terms, annotator comments and links to National Center for Biotechnology Information and Ensembl. Each gene page is linked to a wiki page to allow input from the research community.

  1. Integrated genome browser: visual analytics platform for genomics

    PubMed Central

    Norris, David C.; Loraine, Ann E.

    2016-01-01

    Motivation: Genome browsers that support fast navigation through vast datasets and provide interactive visual analytics functions can help scientists achieve deeper insight into biological systems. Toward this end, we developed Integrated Genome Browser (IGB), a highly configurable, interactive and fast open source desktop genome browser. Results: Here we describe multiple updates to IGB, including all-new capabilities to display and interact with data from high-throughput sequencing experiments. To demonstrate, we describe example visualizations and analyses of datasets from RNA-Seq, ChIP-Seq and bisulfite sequencing experiments. Understanding results from genome-scale experiments requires viewing the data in the context of reference genome annotations and other related datasets. To facilitate this, we enhanced IGB’s ability to consume data from diverse sources, including Galaxy, Distributed Annotation and IGB-specific Quickload servers. To support future visualization needs as new genome-scale assays enter wide use, we transformed the IGB codebase into a modular, extensible platform for developers to create and deploy all-new visualizations of genomic data. Availability and implementation: IGB is open source and is freely available from http://bioviz.org/igb. Contact: aloraine@uncc.edu PMID:27153568

  2. RECORD: Reference-Assisted Genome Assembly for Closely Related Genomes.

    PubMed

    Buza, Krisztian; Wilczynski, Bartek; Dojer, Norbert

    2015-01-01

    Background. Next-generation sequencing technologies are now producing multiple times the genome size in total reads from a single experiment. This is enough information to reconstruct at least some of the differences between the individual genome studied in the experiment and the reference genome of the species. However, in most typical protocols, this information is disregarded and the reference genome is used. Results. We provide a new approach that allows researchers to reconstruct genomes very closely related to the reference genome (e.g., mutants of the same species) directly from the reads used in the experiment. Our approach applies de novo assembly software to experimental reads and so-called pseudoreads and uses the resulting contigs to generate a modified reference sequence. In this way, it can very quickly, and at no additional sequencing cost, generate new, modified reference sequence that is closer to the actual sequenced genome and has a full coverage. In this paper, we describe our approach and test its implementation called RECORD. We evaluate RECORD on both simulated and real data. We made our software publicly available on sourceforge. Conclusion. Our tests show that on closely related sequences RECORD outperforms more general assisted-assembly software.

  3. Microbial Genomics Data from the DOE Joint Genome Institute (JGI)

    DOE Data Explorer

    The JGI makes high-quality genome sequencing data freely available to the greater scientific community through its web portal. Having played a significant role in the federally funded Human Genome Project -- generating the complete sequences of Chromosomes 5, 16, and 19--the JGI has now moved on to contributing in other critical areas of genomics research. While NIH-funded genome sequencing activities continue to emphasize human biomedical targets and applications, the JGI has since shifted its focus to the non-human components of the biosphere, particularly those relevant to the science mission of the Department of Energy. With efficiencies of scale established at the PGF, and capacity now exceeding three billion bases generated on a monthly basis, the JGI has tackled scores of additional genomes. These include more than 60 microbial genomes and many important multicellular organisms and communities of microbes. In partnership with other federal institutions and universities, the JGI is in the process of sequencing a frog (Xenopus tropicalis), a green alga (Chlamydomonas reinhardtii), a diatom (Thalassiosira pseudonana) , the cottonwood tree (Populus trichocarpa), and a host of agriculturally important plants and plant pathogens. Microorganisms, for example those that thrive under extreme conditions such as high acidity, radiation, and metal contamination, are of particular interest to the DOE and JGI. Investigations by JGI and its partners are shedding light on the cellular machinery of microbes and how they can be harnessed to clean up contaminated soil or water, capture carbon from the atmosphere, and produce potentially important sources of energy such as hydrogen and methane. [Excerpt from the JGI page "Who We Are" at http://www.jgi.doe.gov/whoweare/whoweare.html] From the JGI webportal users can view a photo grid of organisims, check assemblies for status, access the Integrated Microbial Genomes (IMG) system to do comparative analysis of publicly available

  4. Exploring cancer genomic data from the cancer genome atlas project

    PubMed Central

    Lee, Ju-Seog

    2016-01-01

    The Cancer Genome Atlas (TCGA) has compiled genomic, epigenomic, and proteomic data from more than 10,000 samples derived from 33 types of cancer, aiming to improve our understanding of the molecular basis of cancer development. Availability of these genome-wide information provides an unprecedented opportunity for uncovering new key regulators of signaling pathways or new roles of pre-existing members in pathways. To take advantage of the advancement, it will be necessary to learn systematic approaches that can help to uncover novel genes reflecting genetic alterations, prognosis, or response to treatments. This minireview describes the updated status of TCGA project and explains how to use TCGA data. PMID:27530686

  5. Genome-wide analysis of DNA methylation and gene expression patterns in purified, uncultured human liver cells and activated hepatic stellate cells

    PubMed Central

    Reiner, Andrew H.; Coll, Mar; Verhulst, Stefaan; Mannaerts, Inge; Øie, Cristina I.; Smedsrød, Bård; Najimi, Mustapha; Sokal, Etienne; Luttun, Aernout; Sancho-Bru, Pau; Collas, Philippe; van Grunsven, Leo A.

    2015-01-01

    Background & Aims Liver fibrogenesis – scarring of the liver that can lead to cirrhosis and liver cancer – is characterized by hepatocyte impairment, capillarization of liver sinusoidal endothelial cells (LSECs) and hepatic stellate cell (HSC) activation. To date, the molecular determinants of a healthy human liver cell phenotype remain largely uncharacterized. Here, we assess the transcriptome and the genome-wide promoter methylome specific for purified, non-cultured human hepatocytes, LSECs and HSCs, and investigate the nature of epigenetic changes accompanying transcriptional changes associated with activation of HSCs. Material and methods Gene expression profile and promoter methylome of purified, uncultured human liver cells and culture-activated HSCs were respectively determined using Affymetrix HG-U219 genechips and by methylated DNA immunoprecipitation coupled to promoter array hybridization. Histone modification patterns were assessed at the single-gene level by chromatin immunoprecipitation and quantitative PCR. Results We unveil a DNA-methylation-based epigenetic relationship between hepatocytes, LSECs and HSCs despite their distinct ontogeny. We show that liver cell type-specific DNA methylation targets early developmental and differentiation-associated functions. Integrative analysis of promoter methylome and transcriptome reveals partial concordance between DNA methylation and transcriptional changes associated with human HSC activation. Further, we identify concordant histone methylation and acetylation changes in the promoter and putative novel enhancer elements of genes involved in liver fibrosis. Conclusions Our study provides the first epigenetic blueprint of three distinct freshly isolated, human hepatic cell types and of epigenetic changes elicited upon HSC activation. PMID:26353929

  6. The genome of Eucalyptus grandis

    SciTech Connect

    Myburg, Alexander A.; Grattapaglia, Dario; Tuskan, Gerald A.; Hellsten, Uffe; Hayes, Richard D.; Grimwood, Jane; Jenkins, Jerry; Lindquist, Erika; Tice, Hope; Bauer, Diane; Goodstein, David M.; Dubchak, Inna; Poliakov, Alexandre; Mizrachi, Eshchar; Kullan, Anand R. K.; Hussey, Steven G.; Pinard, Desre; van der Merwe, Karen; Singh, Pooja; van Jaarsveld, Ida; Silva-Junior, Orzenil B.; Togawa, Roberto C.; Pappas, Marilia R.; Faria, Danielle A.; Sansaloni, Carolina P.; Petroli, Cesar D.; Yang, Xiaohan; Ranjan, Priya; Tschaplinski, Timothy J.; Ye, Chu-Yu; Li, Ting; Sterck, Lieven; Vanneste, Kevin; Murat, Florent; Soler, Marçal; Clemente, Hélène San; Saidi, Naijib; Cassan-Wang, Hua; Dunand, Christophe; Hefer, Charles A.; Bornberg-Bauer, Erich; Kersting, Anna R.; Vining, Kelly; Amarasinghe, Vindhya; Ranik, Martin; Naithani, Sushma; Elser, Justin; Boyd, Alexander E.; Liston, Aaron; Spatafora, Joseph W.; Dharmwardhana, Palitha; Raja, Rajani; Sullivan, Christopher; Romanel, Elisson; Alves-Ferreira, Marcio; Külheim, Carsten; Foley, William; Carocha, Victor; Paiva, Jorge; Kudrna, David; Brommonschenkel, Sergio H.; Pasquali, Giancarlo; Byrne, Margaret; Rigault, Philippe; Tibbits, Josquin; Spokevicius, Antanas; Jones, Rebecca C.; Steane, Dorothy A.; Vaillancourt, René E.; Potts, Brad M.; Joubert, Fourie; Barry, Kerrie; Pappas, Georgios J.; Strauss, Steven H.; Jaiswal, Pankaj; Grima-Pettenati, Jacqueline; Salse, Jérôme; Van de Peer, Yves; Rokhsar, Daniel S.; Schmutz, Jeremy

    2014-06-11

    Eucalypts are the world s most widely planted hardwood trees. Their broad adaptability, rich species diversity, fast growth and superior multipurpose wood, have made them a global renewable resource of fiber and energy that mitigates human pressures on natural forests. We sequenced and assembled >94% of the 640 Mbp genome of Eucalyptus grandis into its 11 chromosomes. A set of 36,376 protein coding genes were predicted revealing that 34% occur in tandem duplications, the largest proportion found thus far in any plant genome. Eucalypts also show the highest diversity of genes for plant specialized metabolism that act as chemical defence against biotic agents and provide unique pharmaceutical oils. Resequencing of a set of inbred tree genomes revealed regions of strongly conserved heterozygosity, likely hotspots of inbreeding depression. The resequenced genome of the sister species E. globulus underscored the high inter-specific genome colinearity despite substantial genome size variation in the genus. The genome of E. grandis is the first reference for the early diverging Rosid order Myrtales and is placed here basal to the Eurosids. This resource expands knowledge on the unique biology of large woody perennials and provides a powerful tool to accelerate comparative biology, breeding and biotechnology.

  7. Components of Adenovirus Genome Packaging

    PubMed Central

    Ahi, Yadvinder S.; Mittal, Suresh K.

    2016-01-01

    Adenoviruses (AdVs) are icosahedral viruses with double-stranded DNA (dsDNA) genomes. Genome packaging in AdV is thought to be similar to that seen in dsDNA containing icosahedral bacteriophages and herpesviruses. Specific recognition of the AdV genome is mediated by a packaging domain located close to the left end of the viral genome and is mediated by the viral packaging machinery. Our understanding of the role of various components of the viral packaging machinery in AdV genome packaging has greatly advanced in recent years. Characterization of empty capsids assembled in the absence of one or more components involved in packaging, identification of the unique vertex, and demonstration of the role of IVa2, the putative packaging ATPase, in genome packaging have provided compelling evidence that AdVs follow a sequential assembly pathway. This review provides a detailed discussion on the functions of the various viral and cellular factors involved in AdV genome packaging. We conclude by briefly discussing the roles of the empty capsids, assembly intermediates, scaffolding proteins, portal vertex and DNA encapsidating enzymes in AdV assembly and packaging. PMID:27721809

  8. [Genome editing of industrial microorganism].

    PubMed

    Zhu, Linjiang; Li, Qi

    2015-03-01

    Genome editing is defined as highly-effective and precise modification of cellular genome in a large scale. In recent years, such genome-editing methods have been rapidly developed in the field of industrial strain improvement. The quickly-updating methods thoroughly change the old mode of inefficient genetic modification, which is "one modification, one selection marker, and one target site". Highly-effective modification mode in genome editing have been developed including simultaneous modification of multiplex genes, highly-effective insertion, replacement, and deletion of target genes in the genome scale, cut-paste of a large DNA fragment. These new tools for microbial genome editing will certainly be applied widely, and increase the efficiency of industrial strain improvement, and promote the revolution of traditional fermentation industry and rapid development of novel industrial biotechnology like production of biofuel and biomaterial. The technological principle of these genome-editing methods and their applications were summarized in this review, which can benefit engineering and construction of industrial microorganism.

  9. Functional genomics of intracellular bacteria.

    PubMed

    de Barsy, Marie; Greub, Gilbert

    2013-07-01

    During the genomic era, a large amount of whole-genome sequences accumulated, which identified many hypothetical proteins of unknown function. Rapidly, functional genomics, which is the research domain that assign a function to a given gene product, has thus been developed. Functional genomics of intracellular pathogenic bacteria exhibit specific peculiarities due to the fastidious growth of most of these intracellular micro-organisms, due to the close interaction with the host cell, due to the risk of contamination of experiments with host cell proteins and, for some strict intracellular bacteria such as Chlamydia, due to the absence of simple genetic system to manipulate the bacterial genome. To identify virulence factors of intracellular pathogenic bacteria, functional genomics often rely on bioinformatic analyses compared with model organisms such as Escherichia coli and Bacillus subtilis. The use of heterologous expression is another common approach. Given the intracellular lifestyle and the many effectors that are used by the intracellular bacteria to corrupt host cell functions, functional genomics is also often targeting the identification of new effectors such as those of the T4SS of Brucella and Legionella.

  10. Environmental genomics, the big picture?

    PubMed

    Rodríguez-Valera, Francisco

    2004-02-16

    The enormous sequencing capabilities of our times might be reaching the point of overflowing the possibilities to analyse data and allow for a feedback on where to focus the available resources. We have now a foreseeable future in which most bacterial species will have an annotated genome. However, we know also that most prokaryotic diversity would not be included there. On the one hand, there is the problem of many groups not being easily amenable to culture and hence not represented in culture-centred microbial taxonomy. On the other hand, the gene pools present in one species can be orders of magnitude larger than the genome of one strain (selected for genome sequencing). Contrasting with eukaryotic genomes, the repertoire of genes present in one prokaryotic cell genome does not correlate stringently with its taxonomic identity. Hence gene catalogues from one environment might provide more meaningful information than the classical species catalogues. Metagenomics or microbial environmental genomics provide a different tool that gravitates around the habitat rather than the species. Such a tool could be just the right way to complement "organismal genomics". Its potential to advance our understanding of microbial ecology and prokaryotic diversity and evolution is discussed.

  11. A genome-wide meta-analysis identifies novel loci associated with schizophrenia and bipolar disorder.

    PubMed

    Wang, Ke-Sheng; Liu, Xue-Feng; Aragam, Nagesh

    2010-12-01

    Schizophrenia and bipolar disorder both have strong inherited components. Recent studies have indicated that schizophrenia and bipolar disorder may share more than half of their genetic determinants. In this study, we performed a meta-analysis (combined analysis) for genome-wide association data of the Affymetrix Genome-Wide Human SNP array 6.0 to detect genetic variants influencing both schizophrenia and bipolar disorder using European-American samples (653 bipolar cases and 1034 controls, 1172 schizophrenia cases and 1379 controls). The best associated SNP rs11789399 was located at 9q33.1 (p=2.38 × 10(-6), 5.74 × 10(-4), and 5.56 × 10(-9), for schizophrenia, bipolar disorder and meta-analysis of schizophrenia and bipolar disorder, respectively), where one flanking gene, ASTN2 (220kb away) has been associated with attention deficit/hyperactivity disorder and schizophrenia. The next best SNP was rs12201676 located at 6q15 (p=2.67 × 10(-4), 2.12 × 10(-5), 3.88 × 10(-8) for schizophrenia, bipolar disorder and meta-analysis, respectively), near two flanking genes, GABRR1 and GABRR2 (15 and 17kb away, respectively). The third interesting SNP rs802568 was at 7q35 within CNTNAP2 (p=8.92 × 10(-4), 1.38 × 10(-5), and 1.62 × 10(-7) for schizophrenia, bipolar disorder and meta-analysis, respectively). Through meta-analysis, we found two additional associated genes NALCN (the top SNP is rs2044117, p=4.57 × 10(-7)) and NAP5 (the top SNP is rs10496702, p=7.15 × 10(-7)). Haplotype analyses of above five loci further supported the associations with schizophrenia and bipolar disorder. These results provide evidence of common genetic variants influencing schizophrenia and bipolar disorder. These findings will serve as a resource for replication in other populations to elucidate the potential role of these genetic variants in schizophrenia and bipolar disorder.

  12. Behavior, Brain, and Genome in Genomic Disorders: Finding the Correspondences

    PubMed Central

    Grigorenko, Elena L.; Urban, Alexander E.; Mencl, Einar

    2014-01-01

    Objective Within the last decade or so, there has been an acceleration of research attempting to connect specific genetic lesions to patterns of brain structure and activation. This article comments on observations that have been made based on these recent data and discusses their importance for the field of investigations into developmental disorders. Method In making these observations, we focus on one specific genomic lesion, the well-studied, yet still incompletely understood, 22q11.2 deletion syndrome (22q11.2DS). Results We demonstrate the degree of variability in the phenotype that occurs at both the brain and behavioral levels of genomic disorders, and describe how this variability is, upon close inspection, represented at the genomic level. Conclusion We emphasize the importance of combining genetic/genomic analyses and neuroimaging for research and for future clinical diagnostic purposes, and for the purposes of developing individualized, patient-tailored treatment and remediation approaches. PMID:20814258

  13. The bonobo genome compared with the chimpanzee and human genomes.

    PubMed

    Prüfer, Kay; Munch, Kasper; Hellmann, Ines; Akagi, Keiko; Miller, Jason R; Walenz, Brian; Koren, Sergey; Sutton, Granger; Kodira, Chinnappa; Winer, Roger; Knight, James R; Mullikin, James C; Meader, Stephen J; Ponting, Chris P; Lunter, Gerton; Higashino, Saneyuki; Hobolth, Asger; Dutheil, Julien; Karakoç, Emre; Alkan, Can; Sajjadian, Saba; Catacchio, Claudia Rita; Ventura, Mario; Marques-Bonet, Tomas; Eichler, Evan E; André, Claudine; Atencia, Rebeca; Mugisha, Lawrence; Junhold, Jörg; Patterson, Nick; Siebauer, Michael; Good, Jeffrey M; Fischer, Anne; Ptak, Susan E; Lachmann, Michael; Symer, David E; Mailund, Thomas; Schierup, Mikkel H; Andrés, Aida M; Kelso, Janet; Pääbo, Svante

    2012-06-28

    Two African apes are the closest living relatives of humans: the chimpanzee (Pan troglodytes) and the bonobo (Pan paniscus). Although they are similar in many respects, bonobos and chimpanzees differ strikingly in key social and sexual behaviours, and for some of these traits they show more similarity with humans than with each other. Here we report the sequencing and assembly of the bonobo genome to study its evolutionary relationship with the chimpanzee and human genomes. We find that more than three per cent of the human genome is more closely related to either the bonobo or the chimpanzee genome than these are to each other. These regions allow various aspects of the ancestry of the two ape species to be reconstructed. In addition, many of the regions that overlap genes may eventually help us understand the genetic basis of phenotypes that humans share with one of the two apes to the exclusion of the other.

  14. Plant genomics: homoplasy heaven in a lycophyte genome.

    PubMed

    Friedman, William E

    2011-07-26

    The recent genomic sequencing of Selaginella, a member of the lycophyte lineage of vascular plants, opens up all kinds of new opportunities to examine the patterns of evolutionary innovation and the creation of the basic bauplan of plants.

  15. The bonobo genome compared with the chimpanzee and human genomes

    PubMed Central

    Prüfer, Kay; Munch, Kasper; Hellmann, Ines; Akagi, Keiko; Miller, Jason R.; Walenz, Brian; Koren, Sergey; Sutton, Granger; Kodira, Chinnappa; Winer, Roger; Knight, James R.; Mullikin, James C.; Meader, Stephen J.; Ponting, Chris P.; Lunter, Gerton; Higashino, Saneyuki; Hobolth, Asger; Dutheil, Julien; Karakoç, Emre; Alkan, Can; Sajjadian, Saba; Catacchio, Claudia Rita; Ventura, Mario; Marques-Bonet, Tomas; Eichler, Evan E.; André, Claudine; Atencia, Rebeca; Mugisha, Lawrence; Junhold, Jörg; Patterson, Nick; Siebauer, Michael; Good, Jeffrey M.; Fischer, Anne; Ptak, Susan E.; Lachmann, Michael; Symer, David E.; Mailund, Thomas; Schierup, Mikkel H.; Andrés, Aida M.; Kelso, Janet; Pääbo, Svante

    2012-01-01

    Two African apes are the closest living relatives of humans: the chimpanzee (Pan troglodytes) and the bonobo (Pan paniscus). Although they are similar in many respects, bonobos and chimpanzees differ strikingly in key social and sexual behaviours1–4, and for some of these traits they show more similarity with humans than with each other. Here we report the sequencing and assembly of the bonobo genome to study its evolutionary relationship with the chimpanzee and human genomes. We find that more than three per cent of the human genome is more closely related to either the bonobo or the chimpanzee genome than these are to each other. These regions allow various aspects of the ancestry of the two ape species to be reconstructed. In addition, many of the regions that overlap genes may eventually help us understand the genetic basis of phenotypes that humans share with one of the two apes to the exclusion of the other. PMID:22722832

  16. Genome Modeling System: A Knowledge Management Platform for Genomics

    PubMed Central

    Griffith, Malachi; Griffith, Obi L.; Smith, Scott M.; Ramu, Avinash; Callaway, Matthew B.; Brummett, Anthony M.; Kiwala, Michael J.; Coffman, Adam C.; Regier, Allison A.; Oberkfell, Ben J.; Sanderson, Gabriel E.; Mooney, Thomas P.; Nutter, Nathaniel G.; Belter, Edward A.; Du, Feiyu; Long, Robert L.; Abbott, Travis E.; Ferguson, Ian T.; Morton, David L.; Burnett, Mark M.; Weible, James V.; Peck, Joshua B.; Dukes, Adam; McMichael, Joshua F.; Lolofie, Justin T.; Derickson, Brian R.; Hundal, Jasreet; Skidmore, Zachary L.; Ainscough, Benjamin J.; Dees, Nathan D.; Schierding, William S.; Kandoth, Cyriac; Kim, Kyung H.; Lu, Charles; Harris, Christopher C.; Maher, Nicole; Maher, Christopher A.; Magrini, Vincent J.; Abbott, Benjamin S.; Chen, Ken; Clark, Eric; Das, Indraniel; Fan, Xian; Hawkins, Amy E.; Hepler, Todd G.; Wylie, Todd N.; Leonard, Shawn M.; Schroeder, William E.; Shi, Xiaoqi; Carmichael, Lynn K.; Weil, Matthew R.; Wohlstadter, Richard W.; Stiehr, Gary; McLellan, Michael D.; Pohl, Craig S.; Miller, Christopher A.; Koboldt, Daniel C.; Walker, Jason R.; Eldred, James M.; Larson, David E.; Dooling, David J.; Ding, Li; Mardis, Elaine R.; Wilson, Richard K.

    2015-01-01

    In this work, we present the Genome Modeling System (GMS), an analysis information management system capable of executing automated genome analysis pipelines at a massive scale. The GMS framework provides detailed tracking of samples and data coupled with reliable and repeatable analysis pipelines. The GMS also serves as a platform for bioinformatics development, allowing a large team to collaborate on data analysis, or an individual researcher to leverage the work of others effectively within its data management system. Rather than separating ad-hoc analysis from rigorous, reproducible pipelines, the GMS promotes systematic integration between the two. As a demonstration of the GMS, we performed an integrated analysis of whole genome, exome and transcriptome sequencing data from a breast cancer cell line (HCC1395) and matched lymphoblastoid line (HCC1395BL). These data are available for users to test the software, complete tutorials and develop novel GMS pipeline configurations. The GMS is available at https://github.com/genome/gms. PMID:26158448

  17. [Human genome project: a federator program of genomic medicine].

    PubMed

    Sfar, S; Chouchane, L

    2008-05-01

    The Human Genome Project improves our understanding of the molecular genetics basis of the inherited and complex diseases such as diabetes, schizophrenia, and cancer. Information from the human genome sequence is essential for several antenatal and neonatal screening programmes. The new genomic tools emerging from this project have revolutionized biology and medicine and have transformed our understanding of health and the provision of healthcare. Its implications pervade all areas of medicine, from disease prediction and prevention to the diagnosis and treatment of all forms of disease. Increasingly, it will be possible to drive predisposition testing into clinical practice, to develop new treatments or to adapt available treatments more specifically to an individual's genetic make-up. This genomic information should transform the traditional medications that are effective for every members of the population to personalized medicine and personalized therapy. The pharmacogenomics could give rise to a new generation of highly effective drugs that treat causes, not just symptoms.

  18. Comparative genomics reveals insights into avian genome evolution and adaptation.

    PubMed

    Zhang, Guojie; Li, Cai; Li, Qiye; Li, Bo; Larkin, Denis M; Lee, Chul; Storz, Jay F; Antunes, Agostinho; Greenwold, Matthew J; Meredith, Robert W; Ödeen, Anders; Cui, Jie; Zhou, Qi; Xu, Luohao; Pan, Hailin; Wang, Zongji; Jin, Lijun; Zhang, Pei; Hu, Haofu; Yang, Wei; Hu, Jiang; Xiao, Jin; Yang, Zhikai; Liu, Yang; Xie, Qiaolin; Yu, Hao; Lian, Jinmin; Wen, Ping; Zhang, Fang; Li, Hui; Zeng, Yongli; Xiong, Zijun; Liu, Shiping; Zhou, Long; Huang, Zhiyong; An, Na; Wang, Jie; Zheng, Qiumei; Xiong, Yingqi; Wang, Guangbiao; Wang, Bo; Wang, Jingjing; Fan, Yu; da Fonseca, Rute R; Alfaro-Núñez, Alonzo; Schubert, Mikkel; Orlando, Ludovic; Mourier, Tobias; Howard, Jason T; Ganapathy, Ganeshkumar; Pfenning, Andreas; Whitney, Osceola; Rivas, Miriam V; Hara, Erina; Smith, Julia; Farré, Marta; Narayan, Jitendra; Slavov, Gancho; Romanov, Michael N; Borges, Rui; Machado, João Paulo; Khan, Imran; Springer, Mark S; Gatesy, John; Hoffmann, Federico G; Opazo, Juan C; Håstad, Olle; Sawyer, Roger H; Kim, Heebal; Kim, Kyu-Won; Kim, Hyeon Jeong; Cho, Seoae; Li, Ning; Huang, Yinhua; Bruford, Michael W; Zhan, Xiangjiang; Dixon, Andrew; Bertelsen, Mads F; Derryberry, Elizabeth; Warren, Wesley; Wilson, Richard K; Li, Shengbin; Ray, David A; Green, Richard E; O'Brien, Stephen J; Griffin, Darren; Johnson, Warren E; Haussler, David; Ryder, Oliver A; Willerslev, Eske; Graves, Gary R; Alström, Per; Fjeldså, Jon; Mindell, David P; Edwards, Scott V; Braun, Edward L; Rahbek, Carsten; Burt, David W; Houde, Peter; Zhang, Yong; Yang, Huanming; Wang, Jian; Jarvis, Erich D; Gilbert, M Thomas P; Wang, Jun

    2014-12-12

    Birds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size, which predominantly arose because of lineage-specific erosion of repetitive elements, large segmental deletions, and gene loss. Avian genomes furthermore show a remarkably high degree of evolutionary stasis at the levels of nucleotide sequence, gene synteny, and chromosomal structure. Despite this pattern of conservation, we detected many non-neutral evolutionary changes in protein-coding genes and noncoding regions. These analyses reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits.

  19. Comparative genomics reveals insights into avian genome evolution and adaptation

    PubMed Central

    Zhang, Guojie; Li, Cai; Li, Qiye; Li, Bo; Larkin, Denis M.; Lee, Chul; Storz, Jay F.; Antunes, Agostinho; Greenwold, Matthew J.; Meredith, Robert W.; Ödeen, Anders; Cui, Jie; Zhou, Qi; Xu, Luohao; Pan, Hailin; Wang, Zongji; Jin, Lijun; Zhang, Pei; Hu, Haofu; Yang, Wei; Hu, Jiang; Xiao, Jin; Yang, Zhikai; Liu, Yang; Xie, Qiaolin; Yu, Hao; Lian, Jinmin; Wen, Ping; Zhang, Fang; Li, Hui; Zeng, Yongli; Xiong, Zijun; Liu, Shiping; Zhou, Long; Huang, Zhiyong; An, Na; Wang, Jie; Zheng, Qiumei; Xiong, Yingqi; Wang, Guangbiao; Wang, Bo; Wang, Jingjing; Fan, Yu; da Fonseca, Rute R.; Alfaro-Núñez, Alonzo; Schubert, Mikkel; Orlando, Ludovic; Mourier, Tobias; Howard, Jason T.; Ganapathy, Ganeshkumar; Pfenning, Andreas; Whitney, Osceola; Rivas, Miriam V.; Hara, Erina; Smith, Julia; Farré, Marta; Narayan, Jitendra; Slavov, Gancho; Romanov, Michael N; Borges, Rui; Machado, João Paulo; Khan, Imran; Springer, Mark S.; Gatesy, John; Hoffmann, Federico G.; Opazo, Juan C.; Håstad, Olle; Sawyer, Roger H.; Kim, Heebal; Kim, Kyu-Won; Kim, Hyeon Jeong; Cho, Seoae; Li, Ning; Huang, Yinhua; Bruford, Michael W.; Zhan, Xiangjiang; Dixon, Andrew; Bertelsen, Mads F.; Derryberry, Elizabeth; Warren, Wesley; Wilson, Richard K; Li, Shengbin; Ray, David A.; Green, Richard E.; O’Brien, Stephen J.; Griffin, Darren; Johnson, Warren E.; Haussler, David; Ryder, Oliver A.; Willerslev, Eske; Graves, Gary R.; Alström, Per; Fjeldså, Jon; Mindell, David P.; Edwards, Scott V.; Braun, Edward L.; Rahbek, Carsten; Burt, David W.; Houde, Peter; Zhang, Yong; Yang, Huanming; Wang, Jian; Jarvis, Erich D.; Gilbert, M. Thomas P.; Wang, Jun

    2015-01-01

    Birds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size, which predominantly arose because of lineage-specific erosion of repetitive elements, large segmental deletions, and gene loss. Avian genomes furthermore show a remarkably high degree of evolutionary stasis at the levels of nucleotide sequence, gene synteny, and chromosomal structure. Despite this pattern of conservation, we detected many non-neutral evolutionary changes in protein-coding genes and noncoding regions. These analyses reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits. PMID:25504712

  20. Radiation Induced Genomic Instability

    SciTech Connect

    Morgan, William F.

    2011-03-01

    Radiation induced genomic instability can be observed in the progeny of irradiated cells multiple generations after irradiation of parental cells. The phenotype is well established both in vivo (Morgan 2003) and in vitro (Morgan 2003), and may be critical in radiation carcinogenesis (Little 2000, Huang et al. 2003). Instability can be induced by both the deposition of energy in irradiated cells as well as by signals transmitted by irradiated (targeted) cells to non-irradiated (non-targeted) cells (Kadhim et al. 1992, Lorimore et al. 1998). Thus both targeted and non-targeted cells can pass on the legacy of radiation to their progeny. However the radiation induced events and cellular processes that respond to both targeted and non-targeted radiation effects that lead to the unstable phenotype remain elusive. The cell system we have used to study radiation induced genomic instability utilizes human hamster GM10115 cells. These cells have a single copy of human chromosome 4 in a background of hamster chromosomes. Instability is evaluated in the clonal progeny of irradiated cells and a clone is considered unstable if it contains three or more metaphase sub-populations involving unique rearrangements of the human chromosome (Marder and Morgan 1993). Many of these unstable clones have been maintained in culture for many years and have been extensively characterized. As initially described by Clutton et al., (Clutton et al. 1996) many of our unstable clones exhibit persistently elevated levels of reactive oxygen species (Limoli et al. 2003), which appear to be due dysfunctional mitochondria (Kim et al. 2006, Kim et al. 2006). Interestingly, but perhaps not surprisingly, our unstable clones do not demonstrate a “mutator phenotype” (Limoli et al. 1997), but they do continue to rearrange their genomes for many years. The limiting factor with this system is the target – the human chromosome. While some clones demonstrate amplification of this chromosome and thus lend

  1. Capturing prokaryotic dark matter genomes.

    PubMed

    Gasc, Cyrielle; Ribière, Céline; Parisot, Nicolas; Beugnot, Réjane; Defois, Clémence; Petit-Biderre, Corinne; Boucher, Delphine; Peyretaillade, Eric; Peyret, Pierre

    2015-12-01

    Prokaryotes are the most diverse and abundant cellular life forms on Earth. Most of them, identified by indirect molecular approaches, belong to microbial dark matter. The advent of metagenomic and single-cell genomic approaches has highlighted the metabolic capabilities of numerous members of this dark matter through genome reconstruction. Thus, linking functions back to the species has revolutionized our understanding of how ecosystem function is sustained by the microbial world. This review will present discoveries acquired through the illumination of prokaryotic dark matter genomes by these innovative approaches.

  2. Genomic imprinting syndromes and cancer.

    PubMed

    Lim, Derek Hock Kiat; Maher, Eamonn Richard

    2010-01-01

    Genomic imprinting represents a form of epigenetic control of gene expression in which one allele of a gene is preferentially expressed according to the parent-of-origin of the allele. Genomic imprinting plays an important role in normal growth and development. Disruption of imprinting can result in a number of human imprinting syndromes and predispose to cancer. In this chapter, we describe a number of human imprinting syndromes to illustrate the concepts of genomic imprinting and how loss of imprinting of imprinted genes their relationship to human neoplasia.

  3. Human genome. 1993 Program report

    SciTech Connect

    Not Available

    1994-03-01

    The purpose of this report is to update the Human Genome 1991-92 Program Report and provide new information on the DOE genome program to researchers, program managers, other government agencies, and the interested public. This FY 1993 supplement includes abstracts of 60 new or renewed projects and listings of 112 continuing and 28 completed projects. These two reports, taken together, present the most complete published view of the DOE Human Genome Program through FY 1993. Research is progressing rapidly toward 15-year goals of mapping and sequencing the DNA of each of the 24 different human chromosomes.

  4. Processing massive datasets in genomics

    NASA Astrophysics Data System (ADS)

    Artiguenave, F.

    2011-02-01

    Life science researches have been profoundly impacted by technological advances allowing faster and cheaper DNA sequencing. Opening a wide range of applications in medical and biology, the last generation sequencing platforms raised new challenges, in particular in processing, analysing and interpreting massive data. In this talk, the growing role of bioinformatics will be illustrated by providing some figures about genome sequencing and others applications aimed at unravelling biological mechanisms. Methods to gather insights from massive amount of data will be illustrated by the genome annotation process, by which genes are identified in the genome sequence.

  5. Computational Challenges of Personal Genomics

    PubMed Central

    Bolouri, Hamid

    2008-01-01

    It is widely predicted that cost and efficiency gains in sequencing will usher in an era of personal genomics and personalized, predictive, preventive, and participatory medicine within a decade. I review the computational challenges ahead and propose general and specific directions for research and development. There is an urgent need to develop semantic ontologies that span genomics, molecular systems biology, and medical data. Although the development of such ontologies would be costly and difficult, the benefits will far outweigh the costs. I argue that availability of such ontologies would allow a revolution in web-services for personal genomics and medicine. PMID:19440448

  6. Genome-wide association study of type 2 diabetes in a sample from Mexico City and a meta-analysis of a Mexican-American sample from Starr County, Texas

    PubMed Central

    Parra, E. J.; Below, J. E.; Krithika, S.; Valladares, A.; Barta, J. L.; Cox, N. J.; Hanis, C. L.; Wacher, N.; Garcia-Mena, J.; Hu, P.; Shriver, M. D.; Kumate, J.; McKeigue, P. M.; Escobedo, J.; Cruz, M.

    2013-01-01

    Aims/hypothesis We report a genome-wide association study of type 2 diabetes in an admixed sample from Mexico City and describe the results of a meta-analysis of this study and another genome-wide scan in a Mexican-American sample from Starr County, TX, USA. The top signals observed in this meta-analysis were followed up in the Diabetes Genetics Replication and Meta-analysis Consortium (DIAGRAM) and DIAGRAM+ datasets. Methods We analysed 967 cases and 343 normoglycaemic controls. The samples were genotyped with the Affymetrix Genome-wide Human SNP array 5.0. Associations of genotyped and imputed markers with type 2 diabetes were tested using a missing data likelihood score test. A fixed-effects meta-analysis including 1,804 cases and 780 normoglycaemic controls was carried out by weighting the effect estimates by their inverse variances. Results In the meta-analysis of the two Hispanic studies, markers showing suggestive associations (p<10−5) were identified in two known diabetes genes, HNF1A and KCNQ1, as well as in several additional regions. Meta-analysis of the two Hispanic studies and the recent DIAGRAM+ dataset identified genome-wide significant signals (p<5×10−8) within or near the genes HNF1A and CDKN2A/CDKN2B, as well as suggestive associations in three additional regions, IGF2BP2, KCNQ1 and the previously unreported C14orf70. Conclusions/interpretation We observed numerous regions with suggestive associations with type 2 diabetes. Some of these signals correspond to regions described in previous studies. However, many of these regions could not be replicated in the DIAGRAM datasets. It is critical to carry out additional studies in Hispanic and American Indian populations, which have a high prevalence of type 2 diabetes. PMID:21573907

  7. The Materials Genome Project

    NASA Astrophysics Data System (ADS)

    Aourag, H.

    2008-09-01

    In the past, the search for new and improved materials was characterized mostly by the use of empirical, trial- and-error methods. This picture of materials science has been changing as the knowledge and understanding of fundamental processes governing a material's properties and performance (namely, composition, structure, history, and environment) have increased. In a number of cases, it is now possible to predict a material's properties before it has even been manufactured thus greatly reducing the time spent on testing and development. The objective of modern materials science is to tailor a material (starting with its chemical composition, constituent phases, and microstructure) in order to obtain a desired set of properties suitable for a given application. In the short term, the traditional "empirical" methods for developing new materials will be complemented to a greater degree by theoretical predictions. In some areas, computer simulation is already used by industry to weed out costly or improbable synthesis routes. Can novel materials with optimized properties be designed by computers? Advances in modelling methods at the atomic level coupled with rapid increases in computer capabilities over the last decade have led scientists to answer this question with a resounding "yes'. The ability to design new materials from quantum mechanical principles with computers is currently one of the fastest growing and most exciting areas of theoretical research in the world. The methods allow scientists to evaluate and prescreen new materials "in silico" (in vitro), rather than through time consuming experimentation. The Materials Genome Project is to pursue the theory of large scale modeling as well as powerful methods to construct new materials, with optimized properties. Indeed, it is the intimate synergy between our ability to predict accurately from quantum theory how atoms can be assembled to form new materials and our capacity to synthesize novel materials atom

  8. Genome Update. Let the consumer beware: Streptomyces genome sequence quality.

    PubMed

    Studholme, David J

    2016-01-01

    A genome sequence assembly represents a model of a genome. This article explores some tools and methods for assessing the quality of an assembly, using publicly available data for Streptomyces species as the example. There is great variability in quality of assemblies deposited in GenBank. Only in a small minority of these assemblies are the raw data available, enabling full appraisal of the assembly quality.

  9. Challenges in Whole-Genome Annotation of Pyrosequenced Eukaryotic Genomes

    SciTech Connect

    Kuo, Alan; Grigoriev, Igor

    2009-04-17

    Pyrosequencing technologies such as 454/Roche and Solexa/Illumina vastly lower the cost of nucleotide sequencing compared to the traditional Sanger method, and thus promise to greatly expand the number of sequenced eukaryotic genomes. However, the new technologies also bring new challenges such as shorter reads and new kinds and higher rates of sequencing errors, which complicate genome assembly and gene prediction. At JGI we are deploying 454 technology for the sequencing and assembly of ever-larger eukaryotic genomes. Here we describe our first whole-genome annotation of a purely 454-sequenced fungal genome that is larger than a yeast (>30 Mbp). The pezizomycotine (filamentous ascomycote) Aspergillus carbonarius belongs to the Aspergillus section Nigri species complex, members of which are significant as platforms for bioenergy and bioindustrial technology, as members of soil microbial communities and players in the global carbon cycle, and as agricultural toxigens. Application of a modified version of the standard JGI Annotation Pipeline has so far predicted ~;;10k genes. ~;;12percent of these preliminary annotations suffer a potential frameshift error, which is somewhat higher than the ~;;9percent rate in the Sanger-sequenced and conventionally assembled and annotated genome of fellow Aspergillus section Nigri member A. niger. Also,>90percent of A. niger genes have potential homologs in the A. carbonarius preliminary annotation. Weconclude, and with further annotation and comparative analysis expect to confirm, that 454 sequencing strategies provide a promising substrate for annotation of modestly sized eukaryotic genomes. We will also present results of annotation of a number of other pyrosequenced fungal genomes of bioenergy interest.

  10. Genomics, environmental genomics and the issue of microbial species.

    PubMed

    Ward, D M; Cohan, F M; Bhaya, D; Heidelberg, J F; Kühl, M; Grossman, A

    2008-02-01

    A microbial species concept is crucial for interpreting the variation detected by genomics and environmental genomics among cultivated microorganisms and within natural microbial populations. Comparative genomic analyses of prokaryotic species as they are presently described and named have led to the provocative idea that prokaryotes may not form species as we think about them for plants and animals. There are good reasons to doubt whether presently recognized prokaryotic species are truly species. To achieve a better understanding of microbial species, we believe it is necessary to (i) re-evaluate traditional approaches in light of evolutionary and ecological theory, (ii) consider that different microbial species may have evolved in different ways and (iii) integrate genomic, metagenomic and genome-wide expression approaches with ecological and evolutionary theory. Here, we outline how we are using genomic methods to (i) identify ecologically distinct populations (ecotypes) predicted by theory to be species-like fundamental units of microbial communities, and (ii) test their species-like character through in situ distribution and gene expression studies. By comparing metagenomic sequences obtained from well-studied hot spring cyanobacterial mats with genomic sequences of two cultivated cyanobacterial ecotypes, closely related to predominant native populations, we can conduct in situ population genetics studies that identify putative ecotypes and functional genes that determine the ecotypes' ecological distinctness. If individuals within microbial communities are found to be grouped into ecologically distinct, species-like populations, knowing about such populations should guide us to a better understanding of how genomic variation is linked to community function.

  11. Identification of genomic sites for CRISPR/Cas9-based genome editing in the Vitis vinifera genome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    CRISPR/Cas9 has been recently demonstrated as an effective and popular genome editing tool for modifying genomes of human, animals, microorganisms, and plants. Success of such genome editing is highly dependent on the availability of suitable target sites in the genomes to be edited. Many specific t...

  12. Invisible genomes: the genomics revolution and patenting practice.

    PubMed

    Bostanci, Adam; Calvert, Jane

    2008-03-01

    In the mid-1990s, the company Human Genome Sciences submitted three potentially revolutionary patent applications to the US Patent and Trademark Office, each of which claimed the entire genome sequence of a microorganism. The patent examiners, however, objected to these applications, and after negotiation they were eventually re-written to resemble more traditional gene patents. In this paper, which is based on a study of the patent examination files, we examine the reasons why these patent applications were unsuccessful in their original form. We show that with respect to utility and novelty, the patent attorney's case built on an understanding of the genome as a computer-related invention. The patent examiners did not object to the patenting of complete genome sequences as computer-related inventions on moral grounds or in terms of the distinction between a discovery and an invention. Instead, their objections were based on classification, rules and procedure. Rather than patent examiners having a notion of a genome that should not be patented, the notion of a 'genome', and the ways in which it may be different from a 'gene', played no role in these debates. We discuss the consequences of our findings for patenting in the biosciences.

  13. The Global Cancer Genomics Consortium: interfacing genomics and cancer medicine.

    PubMed

    2012-08-01

    The Global Cancer Genomics Consortium (GCGC) is an international collaborative platform that amalgamates cancer biologists, cutting-edge genomics, and high-throughput expertise with medical oncologists and surgical oncologists; they address the most important translational questions that are central to cancer research and treatment. The annual GCGC symposium was held at the Advanced Centre for Treatment Research and Education in Cancer, Mumbai, India, from November 9 to 11, 2011. The symposium showcased international next-generation sequencing efforts that explore cancer-specific transcriptomic changes, single-nucleotide polymorphism, and copy number variations in various types of cancers, as well as the structural genomics approach to develop new therapeutic targets and chemical probes. From the spectrum of studies presented at the symposium, it is evident that the translation of emerging cancer genomics knowledge into clinical applications can only be achieved through the integration of multidisciplinary expertise. In summary, the GCGC symposium provided practical knowledge on structural and cancer genomics approaches, as well as an exclusive platform for focused cancer genomics endeavors.

  14. Natural Genomic Design in Sinorhizobium meliloti: Novel Genomic Architectures

    PubMed Central

    Guo, Xianwu; Flores, Margarita; Mavingui, Patrick; Fuentes, Sara Isabel; Hernández, Georgina; Dávila, Guillermo; Palacios, Rafael

    2003-01-01

    The complete nucleotide sequence of the genome of Sinorhizobium meliloti, the symbiont of alfalfa, was reported in 2001 by an international consortium of laboratories. The genome comprises a chromosome of 3.65 megabases (Mb) and two megaplasmids, pSymA and pSymB, of 1.35 Mb and 1.68 Mb, respectively. Based on the nucleotide sequence of the whole genome, we designed a pathway of consecutive rearrangements leading to novel genomic architectures. In a first step we obtained derivative strains containing two replicons; in a second step we obtained a strain containing the genetic information in one single replicon of 6.68 MB. From this last architecture we isolated revertants containing two replicons, and from these we could return to the original architecture showing the three replicons. We found that the relative frequency of excision of cointegrated replicons is higher at the site used for the cointegration than at other sites. This might conciliate two apparently opposed facts: the highly dynamic state of genomic architecture in S. meliloti and the common observation that different isolates and derived cellular clones of S. meliloti usually present the architecture of one chromosome and two distinct megaplasmids. Different aspects that must be considered to obtain full advantage of the strategy of natural genomic design are discussed. PMID:12902376

  15. Comparative genomic analysis of sixty mycobacteriophage genomes: Genome clustering, gene acquisition and gene size

    PubMed Central

    Hatfull, Graham F.; Jacobs-Sera, Deborah; Lawrence, Jeffrey G.; Pope, Welkin H.; Russell, Daniel A.; Ko, Ching-Chung; Weber, Rebecca J.; Patel, Manisha C.; Germane, Katherine L.; Edgar, Robert H.; Hoyte, Natasha N.; Bowman, Charles A.; Tantoco, Anthony T.; Paladin, Elizabeth C.; Myers, Marlana S.; Smith, Alexis L.; Grace, Molly S.; Pham, Thuy T.; O'Brien, Matthew B.; Vogelsberger, Amy M.; Hryckowian, Andrew J.; Wynalek, Jessica L.; Donis-Keller, Helen; Bogel, Matt W.; Peebles, Craig L.; Cresawn, Steve G.; Hendrix, Roger W.

    2010-01-01

    Mycobacteriophages are viruses that infect mycobacterial hosts. Expansion of a collection of sequenced phage genomes to a total of sixty – all infecting a common bacterial host – provides further insight into their diversity and evolution. Of the sixty phage genomes, 55 can be grouped into nine clusters according to their nucleotide sequence similarities, five of which can be further divided into subclusters; five genomes do not cluster with other phages. The sequence diversity between genomes within a cluster varies greatly; for example, the six genomes in cluster D share more than 97.5% average nucleotide similarity with each other. In contrast, similarity between the two genomes in Cluster I is barely detectable by diagonal plot analysis. The total of 6,858 predicted ORFs have been grouped into 1523 phamilies (phams) of related sequences, 46% of which possess only a single member. Only 18.8% of the phams have sequence similarity to non-mycobacteriophage database entries and fewer than 10% of all phams can be assigned functions based on database searching or synteny. Genome clustering facilitates the identification of genes that are in greatest genetic flux and are more likely to have been exchanged horizontally in relatively recent evolutionary time. Although mycobacteriophage genes exhibit smaller average size than genes of their host (205 residues compared to 315), phage genes in higher flux average only ∼100 amino acids, suggesting that the primary units of genetic exchange correspond to single protein domains. PMID:20064525

  16. The soft genome

    PubMed Central

    Anava, Sarit; Posner, Rachel; Rechavi, Oded

    2014-01-01

    Caenorhabditis elegans (C. elegans) nematodes transmit small RNAs across generations, a process that enables transgenerational regulation of genes. In contrast to changes to the DNA sequence, transgenerational transmission of small RNA-mediated responses is reversible, and thus enables “soft” or “flexible” inheritance of acquired characteristics. Until very recently only introduction of foreign genetic material (viruses, transposons, transgenes) was shown to directly lead to inheritance of small RNAs. New discoveries however, demonstrate that starvation also triggers inheritance of endogenous small RNAs in C.elegans. Multiple generations of worms inherit starvation-responsive endogenous small RNAs, and starvation also results in heritable extension of the progeny's lifespan. In this Commentary paper we explore the intriguing possibility that large parts of the genome and many additional traits are similarly subjected to heritable small RNA-mediated regulation, and focus on the potential influence of transgenerational RNAi on the worm's physiology. While the universal relevance of this mechanism remains to be discovered, we will examine how the discoveries made in worms already challenge long held dogmas in genetics and evolution. PMID:26430554

  17. inGeno – an integrated genome and ortholog viewer for improved genome to genome comparisons

    PubMed Central

    Liang, Chunguang; Dandekar, Thomas

    2006-01-01

    Background Systematic genome comparisons are an important tool to reveal gene functions, pathogenic features, metabolic pathways and genome evolution in the era of post-genomics. Furthermore, such comparisons provide important clues for vaccines and drug development. Existing genome comparison software often lacks accurate information on orthologs, the function of similar genes identified and genome-wide reports and lists on specific functions. All these features and further analyses are provided here in the context of a modular software tool "inGeno" written in Java with Biojava subroutines. Results InGeno provides a user-friendly interactive visualization platform for sequence comparisons (comprehensive reciprocal protein – protein comparisons) between complete genome sequences and all associated annotations and features. The comparison data can be acquired from several different sequence analysis programs in flexible formats. Automatic dot-plot analysis includes output reduction, filtering, ortholog testing and linear regression, followed by smart clustering (local collinear blocks; LCBs) to reveal similar genome regions. Further, the system provides genome alignment and visualization editor, collinear relationships and strain-specific islands. Specific annotations and functions are parsed, recognized, clustered, logically concatenated and visualized and summarized in reports. Conclusion As shown in this study, inGeno can be applied to study and compare in particular prokaryotic genomes against each other (gram positive and negative as well as close and more distantly related species) and has been proven to be sensitive and accurate. This modular software is user-friendly and easily accommodates new routines to meet specific user-defined requirements. PMID:17054788

  18. Collaborators | Office of Cancer Genomics

    Cancer.gov

    The TARGET initiative is jointly managed within the National Cancer Institute (NCI) by the Office of Cancer Genomics (OCG)Opens in a New Tab and the Cancer Therapy Evaluation Program (CTEP)Opens in a New Tab.

  19. Genomic Datasets for Cancer Research

    Cancer.gov

    A variety of datasets from genome-wide association studies of cancer and other genotype-phenotype studies, including sequencing and molecular diagnostic assays, are available to approved investigators through the Extramural National Cancer Institute Data Access Committee.

  20. Genome engineering in human cells.

    PubMed

    Song, Minjung; Kim, Young-Hoon; Kim, Jin-Soo; Kim, Hyongbum

    2014-01-01

    Genome editing in human cells is of great value in research, medicine, and biotechnology. Programmable nucleases including zinc-finger nucleases, transcription activator-like effector nucleases, and RNA-guided engineered nucleases recognize a specific target sequence and make a double-strand break at that site, which can result in gene disruption, gene insertion, gene correction, or chromosomal rearrangements. The target sequence complexities of these programmable nucleases are higher than 3.2 mega base pairs, the size of the haploid human genome. Here, we briefly introduce the structure of the human genome and the characteristics of each programmable nuclease, and review their applications in human cells including pluripotent stem cells. In addition, we discuss various delivery methods for nucleases, programmable nickases, and enrichment of gene-edited human cells, all of which facilitate efficient and precise genome editing in human cells.

  1. Genomic Resources for Cancer Epidemiology

    Cancer.gov

    This page provides links to research resources, complied by the Epidemiology and Genomics Research Program, that may be of interest to genetic epidemiologists conducting cancer research, but is not exhaustive.

  2. Do Echinoderm Genomes Measure Up?

    PubMed Central

    Cameron, R. Andrew; Kudtarkar, Parul; Gordon, Susan M.; Worley, Kim C.; Gibbs, Richard A.

    2015-01-01

    Echinoderm genome sequences are a corpus of useful information about a clade of animals that serve as research models in fields ranging from marine ecology to cell and developmental biology. Genomic information from echinoids has contributed to insights into the gene interactions that drive the developmental process at the molecular level. Such insights often rely heavily on genomic information and the kinds of questions that can be asked thus depend on the quality of the sequence information. Here we describe the history of echinoderm genomic sequence assembly and present details about the quality of the data obtained. All of the sequence information discussed here is posted on the echinoderm information web system, Echinobase.org. PMID:25701080

  3. Genomic Contraindications for Heart Transplantation.

    PubMed

    Char, Danton S; Lázaro-Muñoz, Gabriel; Barnes, Aliessa; Magnus, David; Deem, Michael J; Lantos, John D

    2017-03-02

    Genome sequencing raises new ethical challenges. Decoding the genome produces new forms of diagnostic and prognostic information; however, the information is often difficult to interpret. The connection between most genetic variants and their phenotypic manifestations is not understood. This scenario is particularly true for disorders that are not associated with an autosomal genetic variant. The analytic uncertainty is compounded by moral uncertainty about how, exactly, the results of genomic testing should influence clinical decisions. In this Ethics Rounds, we present a case in which genomic findings seemed to play a role in deciding whether a patient was to be listed as a transplant candidate. We then asked experts in bioethics and cardiology to discuss the implications of such decisions.

  4. Genomic characterization of Nontuberculous Mycobacteria

    PubMed Central

    Fedrizzi, Tarcisio; Meehan, Conor J.; Grottola, Antonella; Giacobazzi, Elisabetta; Fregni Serpini, Giulia; Tagliazucchi, Sara; Fabio, Anna; Bettua, Clotilde; Bertorelli, Roberto; De Sanctis, Veronica; Rumpianesi, Fabio; Pecorari, Monica; Jousson, Olivier; Tortoli, Enrico; Segata, Nicola

    2017-01-01

    Mycobacterium tuberculosis and Mycobacterium leprae have remained, for many years, the primary species of the genus Mycobacterium of clinical and microbiological interest. The other members of the genus, referred to as nontuberculous mycobacteria (NTM), have long been underinvestigated. In the last decades, however, the number of reports linking various NTM species with human diseases has steadily increased and treatment difficulties have emerged. Despite the availability of whole genome sequencing technologies, limited effort has been devoted to the genetic characterization of NTM species. As a consequence, the taxonomic and phylogenetic structure of the genus remains unsettled and genomic information is lacking to support the identification of these organisms in a clinical setting. In this work, we widen the knowledge of NTMs by reconstructing and analyzing the genomes of 41 previously uncharacterized NTM species. We provide the first comprehensive characterization of the genomic diversity of NTMs and open new venues for the clinical identification of opportunistic pathogens from this genus. PMID:28345639

  5. Genomics and Health Impact Update

    MedlinePlus

    ... Publications Birth Defects/ Child Health Cancer Cardiovascular Diseases Chronic Disease Ethics, Policy and Law Genomics in Practice Newborn Screening Pharmacogenomics Reproductive Health Tools/ Databases AMD Clips News Concepts/ Comments Pathogenicity/ Antimicrobial Resistance Epidemiology/ ...

  6. Genomic understanding of glioblastoma expanded

    Cancer.gov

    Glioblastoma multiforme (GBM) was the first cancer type to be systematically studied by TCGA in 2008. In a new, complementary report, TCGA experts examined more than 590 GBM samples--the largest to date utilizing genomic characterization techniques and ne

  7. Eukaryotic Genomics Data from the DOE Joint Genome Institute (JGI)

    DOE Data Explorer

    The JGI makes high-quality genome sequencing data freely available to the greater scientific community through its web portal. Having played a significant role in the federally funded Human Genome Project -- generating the complete sequences of Chromosomes 5, 16, and 19--the JGI has now moved on to contributing in other critical areas of genomics research. While NIH-funded genome sequencing activities continue to emphasize human biomedical targets and applications, the JGI has since shifted its focus to the non-human components of the biosphere, particularly those relevant to the science mission of the Department of Energy. With efficiencies of scale established at the PGF, and capacity now exceeding three billion bases generated on a monthly basis, the JGI has tackled scores of additional genomes. These include more than 60 microbial genomes and many important multicellular organisms and communities of microbes. In partnership with other federal institutions and universities, the JGI is in the process of sequencing a frog (Xenopus tropicalis), a green alga (Chlamydomonas reinhardtii), a diatom (Thalassiosira pseudonana) , the cottonwood tree (Populus trichocarpa), and a host of agriculturally important plants and plant pathogens. Microorganisms, for example those that thrive under extreme conditions such as high acidity, radiation, and metal contamination, are of particular interest to the DOE and JGI. Investigations by JGI and its partners are shedding light on the cellular machinery of microbes and how they can be harnessed to clean up contaminated soil or water, capture carbon from the atmosphere, and produce potentially important sources of energy such as hydrogen and methane. [Excerpt from the JGI page "Who We Are" at http://www.jgi.doe.gov/whoweare/whoweare.html] From the JGI webportal users can choose Eukaryotic genomes from a photo list, access the JGI FTP directories to download data files, use the Tree of Life navigation tool, or choose a genome and go

  8. Draft Genome Sequence of Lactobacillus rhamnosus 2166

    PubMed Central

    Melnikov, Vyacheslav G.; Kosarev, Igor V.; Abramov, Vyacheslav M.

    2014-01-01

    In this report, we present a draft sequence of the genome of Lactobacillus rhamnosus strain 2166, a potential novel probiotic. Genome annotation and read mapping onto a reference genome of L. rhamnosus strain GG allowed for the identification of the differences and similarities in the genomic contents and gene arrangements of these strains. PMID:24558254

  9. Genomic Aspects of Research Involving Polyploid Plants

    SciTech Connect

    Yang, Xiaohan; Ye, Chuyu; Tschaplinski, Timothy J; Wullschleger, Stan D; Tuskan, Gerald A

    2011-01-01

    Almost all extant plant species have spontaneously doubled their genomes at least once in their evolutionary histories, resulting in polyploidy which provided a rich genomic resource for evolutionary processes. Moreover, superior polyploid clones have been created during the process of crop domestication. Polyploid plants generated by evolutionary processes and/or crop domestication have been the intentional or serendipitous focus of research dealing with the dynamics and consequences of genome evolution. One of the new trends in genomics research is to create synthetic polyploid plants which provide materials for studying the initial genomic changes/responses immediately after polyploid formation. Polyploid plants are also used in functional genomics research to study gene expression in a complex genomic background. In this review, we summarize the recent progress in genomics research involving ancient, young, and synthetic polyploid plants, with a focus on genome size evolution, genomics diversity, genomic rearrangement, genetic and epigenetic changes in duplicated genes, gene discovery, and comparative genomics. Implications on plant sciences including evolution, functional genomics, and plant breeding are presented. It is anticipated that polyploids will be a regular subject of genomics research in the foreseeable future as the rapid advances in DNA sequencing technology create unprecedented opportunities for discovering and monitoring genomic and transcriptomic changes in polyploid plants. The fast accumulation of knowledge on polyploid formation, maintenance, and divergence at whole-genome and subgenome levels will not only help plant biologists understand how plants have evolved and diversified, but also assist plant breeders in designing new strategies for crop improvement.

  10. 2004 Structural, Function and Evolutionary Genomics

    SciTech Connect

    Douglas L. Brutlag Nancy Ryan Gray

    2005-03-23

    This Gordon conference will cover the areas of structural, functional and evolutionary genomics. It will take a systematic approach to genomics, examining the evolution of proteins, protein functional sites, protein-protein interactions, regulatory networks, and metabolic networks. Emphasis will be placed on what we can learn from comparative genomics and entire genomes and proteomes.

  11. Upstream—News in Genomics

    PubMed Central

    2002-01-01

    This report on the literature spans from May to July, highlighting breakthroughs on several important genomes, including mouse, zebrafish, Fugu and Plasmodium. Recent papers have reported on a mechanism for genome size reduction in Arabidopsis, comparisons and verifications of large-scale protein–protein interaction datasets, developments in RNA interference approaches for mammalian systems and a solidphase peptide tagging method for proteomics. PMID:18629049

  12. Plague in the genomic area.

    PubMed

    Drancourt, M

    2012-03-01

    With plague being not only a subject of interest for historians, but still a disease of public health concern in several countries, mainly in Africa, there were hopes that analyses of the Yersinia pestis genomes would put an end to this deadly epidemic pathogen. Genomics revealed that Y. pestis isolates evolved from Yersinia pseudotuberculosis in Central Asia some millennia ago, after the acquisition of two Y. pestis-specific plasmids balanced genomic reduction parallel with the expansion of insertion sequences, illustrating the modern concept that, except for the acquisition of plasmid-borne toxin-encoding genes, the increased virulence of Y. pestis resulted from gene loss rather than gene acquisition. The telluric persistence of Y. pestis reminds us of this close relationship, and matters in terms of plague epidemiology. Whereas biotype Orientalis isolates spread worldwide, the Antiqua and Medievalis isolates showed more limited expansion. In addition to animal ectoparasites, human ectoparasites such as the body louse may have participated in this expansion and in devastating historical epidemics. The recent analysis of a Black Death genome indicated that it was more closely related to the Orientalis branch than to the Medievalis branch. Modern Y. pestis isolates grossly exhibit the same gene content, but still undergo micro-evolution in geographically limited areas by differing in the genome architecture, owing to inversions near insertion sequences and the stabilization of the YpfPhi prophage in Orientalis biotype isolates. Genomics have provided several new molecular tools for the genotyping and phylogeographical tracing of isolates and description of plague foci. However, genomics and post-genomics approaches have not yet provided new tools for the prevention, diagnosis and management of plague patients and the plague epidemics still raging in some sub-Saharan countries.

  13. Genome Exploitation and Bioinformatics Tools

    NASA Astrophysics Data System (ADS)

    de Jong, Anne; van Heel, Auke J.; Kuipers, Oscar P.

    Bioinformatic tools can greatly improve the efficiency of bacteriocin screening efforts by limiting the amount of strains. Different classes of bacteriocins can be detected in genomes by looking at different features. Finding small bacteriocins can be especially challenging due to low homology and because small open reading frames (ORFs) are often omitted from annotations. In this chapter, several bioinformatic tools/strategies to identify bacteriocins in genomes are discussed.

  14. Contact | Office of Cancer Genomics

    Cancer.gov

    For more information about the Office of Cancer Genomics, please contact: Office of Cancer Genomics National Cancer Institute 31 Center Drive, 10A07 Bethesda, Maryland 20892-2580 Phone: (301) 451-8027 Fax: (301) 480-4368 Email: ocg@mail.nih.gov *Please note that this site will not function properly in Internet Explorer unless you completely turn off the Compatibility View*

  15. Zebrafish genomics comes of age.

    PubMed

    Tan, Haihan; Zsigmond, Aron

    2013-09-01

    The ZF-HEALTH/EuFishBiomed workshop on "Genomics and High-throughput Sequencing Technologies with the Zebrafish Model" took place in December 2012 in Cambridge, United Kingdom. The organisers, Fiona Wardle and Ferenc Müller, brought together developmental biologists, geneticists, and bioinformaticians from Europe and the rest of the world to share findings and insights about the latest genomic capabilities and applications in this popular model organism.

  16. The dynamic genome of Hydra

    PubMed Central

    Chapman, Jarrod A.; Kirkness, Ewen F.; Simakov, Oleg; Hampson, Steven E.; Mitros, Therese; Weinmaier, Therese; Rattei, Thomas; Balasubramanian, Prakash G.; Borman, Jon; Busam, Dana; Disbennett, Kathryn; Pfannkoch, Cynthia; Sumin, Nadezhda; Sutton, Granger G.; Viswanathan, Lakshmi Devi; Walenz, Brian; Goodstein, David M.; Hellsten, Uffe; Kawashima, Takeshi; Prochnik, Simon E.; Putnam, Nicholas H.; Shu, Shengquiang; Blumberg, Bruce; Dana, Catherine E.; Gee, Lydia; Kibler, Dennis F.; Law, Lee; Lindgens, Dirk; Martinez, Daniel E.; Peng, Jisong; Wigge, Philip A.; Bertulat, Bianca; Guder, Corina; Nakamura, Yukio; Ozbek, Suat; Watanabe, Hiroshi; Khalturin, Konstantin; Hemmrich, Georg; Franke, André; Augustin, René; Fraune, Sebastian; Hayakawa, Eisuke; Hayakawa, Shiho; Hirose, Mamiko; Hwang, Jung Shan; Ikeo, Kazuho; Nishimiya-Fujisawa, Chiemi; Ogura, Atshushi; Takahashi, Toshio; Steinmetz, Patrick R. H.; Zhang, Xiaoming; Aufschnaiter, Roland; Eder, Marie-Kristin; Gorny, Anne-Kathrin; Salvenmoser, Willi; Heimberg, Alysha M.; Wheeler, Benjamin M.; Peterson, Kevin J.; Böttger, Angelika; Tischler, Patrick; Wolf, Alexander; Gojobori, Takashi; Remington, Karin A.; Strausberg, Robert L.; Venter, J. Craig; Technau, Ulrich; Hobmayer, Bert; Bosch, Thomas C. G.; Holstein, Thomas W.; Fujisawa, Toshitaka; Bode, Hans R.; David, Charles N.; Rokhsar, Daniel S.; Steele, Robert E.

    2015-01-01

    The freshwater cnidarian Hydra was first described in 17021 and has been the object of study for 300 years. Experimental studies of Hydra between 1736 and 1744 culminated in the discovery of asexual reproduction of an animal by budding, the first description of regeneration in an animal, and successful transplantation of tissue between animals2. Today, Hydra is an important model for studies of axial patterning3, stem cell biology4 and regeneration5. Here we report the genome of Hydra magnipapillata and compare it to the genomes of the anthozoan Nematostella vectensis6 and other animals. The Hydra genome has been shaped by bursts of transposable element expansion, horizontal gene transfer, trans-splicing, and simplification of gene structure and gene content that parallel simplification of the Hydra life cycle. We also report the sequence of the genome of a novel bacterium stably associated with H. magnipapillata. Comparisons of the Hydra genome to the genomes of other animals shed light on the evolution of epithelia, contractile tissues, developmentally regulated transcription factors, the Spemann–Mangold organizer, pluripotency genes and the neuromuscular junction. PMID:20228792

  17. Shannon Information in Complete Genomes

    NASA Astrophysics Data System (ADS)

    Hsieh, Li-Ching; Chang, Chang-Heng; Lee, Hoong-Chien

    2004-03-01

    Genomes are books of life and necessarily carry a huge amount of information. This study was first motivated by the question: "How much information do complete genomes have?" As an answer we measured a particular type of Shannon information in all prokaryotes and eukaryotes whose complete genomes have been sequenced and are available in publically assessible database. The Shannon information in complete genome sequences follow an extremely simple pattern. With the exception of one eukaryote the Shannon information in all (more than 200) complete sequences belong to a single universality class given by a simple geometric recursion formula. The data are interpreted in terms of models for genome growth and inferred to suggest that the ancestors of present day genomes began to grow, mainly by stochastic, selectively neutral, duplications and short mutations, most likely when they were not more than 300 nt long. This notion of selective neutralism independently corroborates Kimura's neutral theory of evolution which was based on the investigation of polymorphisms of genes.

  18. Comparative genomic analyses in Asparagus.

    PubMed

    Kuhl, Joseph C; Havey, Michael J; Martin, William J; Cheung, Foo; Yuan, Qiaoping; Landherr, Lena; Hu, Yi; Leebens-Mack, James; Town, Christopher D; Sink, Kenneth C

    2005-12-01

    Garden asparagus (Asparagus officinalis L.) belongs to the monocot family Asparagaceae in the order Asparagales. Onion (Allium cepa L.) and Asparagus officinalis are 2 of the most economically important plants of the core Asparagales, a well supported monophyletic group within the Asparagales. Coding regions in onion have lower GC contents than the grasses. We compared the GC content of 3374 unique expressed sequence tags (ESTs) from A. officinalis with Lycoris longituba and onion (both members of the core Asparagales), Acorus americanus (sister to all other monocots), the grasses, and Arabidopsis. Although ESTs in A. officinalis and Acorus had a higher average GC content than Arabidopsis, Lycoris, and onion, all were clearly lower than the grasses. The Asparagaceae have the smallest nuclear genomes among all plants in the core Asparagales, which typically have huge genomes. Within the Asparagaceae, European Asparagus species have approximately twice the nuclear DNA of that of southern African Asparagus species. We cloned and sequenced 20 genomic amplicons from European A. officinalis and the southern African species Asparagus plumosus and observed no clear evidence for a recent genome doubling in A. officinalis relative to A. plumosus. These results indicate that members of the genus Asparagus with smaller genomes may be useful genomic models for plants in the core Asparagales.

  19. Correlation between genome reduction and bacterial growth.

    PubMed

    Kurokawa, Masaomi; Seno, Shigeto; Matsuda, Hideo; Ying, Bei-Wen

    2016-12-01

    Genome reduction by removing dispensable genomic sequences in bacteria is commonly used in both fundamental and applied studies to determine the minimal genetic requirements for a living system or to develop highly efficient bioreactors. Nevertheless, whether and how the accumulative loss of dispensable genomic sequences disturbs bacterial growth remains unclear. To investigate the relationship between genome reduction and growth, a series of Escherichia coli strains carrying genomes reduced in a stepwise manner were used. Intensive growth analyses revealed that the accumulation of multiple genomic deletions caused decreases in the exponential growth rate and the saturated cell density in a deletion-length-dependent manner as well as gradual changes in the patterns of growth dynamics, regardless of the growth media. Accordingly, a perspective growth model linking genome evolution to genome engineering was proposed. This study provides the first demonstration of a quantitative connection between genomic sequence and bacterial growth, indicating that growth rate is potentially associated with dispensable genomic sequences.

  20. Genomics-based screening of differentially expressed genes in the brains of mice exposed to silver nanoparticles via inhalation

    NASA Astrophysics Data System (ADS)

    Lee, Hye-Young; Choi, You-Jin; Jung, Eun-Jung; Yin, Hu-Quan; Kwon, Jung-Taek; Kim, Ji-Eun; Im, Hwang-Tae; Cho, Myung-Haing; Kim, Ju-Han; Kim, Hyun-Young; Lee, Byung-Hoon

    2010-06-01

    Silver nanoparticles (AgNP) are among the fastest growing product categories in the nanotechnology industry. Despite the importance of AgNP in consumer products and clinical applications, relatively little is known regarding AgNP toxicity and its associated risks. We investigated the effects of AgNP on gene expression in the mouse brain using Affymetrix Mouse Genome Arrays. C57BL/6 mice were exposed to AgNP (geometric mean diameter, 22.18 ± 1.72 nm; 1.91 × 107 particles/cm3) for 6 h/day, 5 days/week using the nose-only exposure system for 2 weeks. Total RNA isolated from the cerebrum and cerebellum was subjected to hybridization. From over 39,000 probe sets, 468 genes in the cerebrum and 952 genes in the cerebellum were identified as AgNP-responsive (one-way analysis of variance; p < 0.05). The largest groups of gene products affected by AgNP exposure included 73 genes in the cerebrum and 144 genes in the cerebellum. AgNP exposure modulated the expression of several genes associated with motor neuron disorders, neurodegenerative disease, and immune cell function, indicating potential neurotoxicity and immunotoxicity associated with AgNP exposure. Real-time PCR data for five genes analyzed from whole blood showed good correlation with the observed changes in the brain. Following rigorous validation and substantiation, these genes may assist in the development of surrogate markers for AgNP exposure and/or toxicity.

  1. Characterization of biological pathways associated with a 1.37 Mbp genomic region protective of hypertension in Dahl S rats

    PubMed Central

    Moreno, Carol; Jacob, Howard J.; Peterson, Christine B.; Stingo, Francesco C.; Ahn, Kwang Woo; Liu, Pengyuan; Vannucci, Marina; Laud, Purushottam W.; Reddy, Prajwal; Lazar, Jozef; Evans, Louise; Yang, Chun; Kurth, Theresa; Liang, Mingyu

    2014-01-01

    The goal of the present study was to narrow a region of chromosome 13 to only several genes and then apply unbiased statistical approaches to identify molecular networks and biological pathways relevant to blood-pressure salt sensitivity in Dahl salt-sensitive (SS) rats. The analysis of 13 overlapping subcongenic strains identified a 1.37 Mbp region on chromosome 13 that influenced the mean arterial blood pressure by at least 25 mmHg in SS rats fed a high-salt diet. DNA sequencing and analysis filled genomic gaps and provided identification of five genes in this region, Rfwd2, Fam5b, Astn1, Pappa2, and Tnr. A cross-platform normalization of transcriptome data sets obtained from our previously published Affymetrix GeneChip dataset and newly acquired RNA-seq data from renal outer medullary tissue provided 90 observations for each gene. Two Bayesian methods were used to analyze the data: 1) a linear model analysis to assess 243 biological pathways for their likelihood to discriminate blood pressure levels across experimental groups and 2) a Bayesian graphical modeling of pathways to discover genes with potential relationships to the candidate genes in this region. As none of these five genes are known to be involved in hypertension, this unbiased approach has provided useful clues to be experimentally explored. Of these five genes, Rfwd2, the gene most strongly expressed in the renal outer medulla, was notably associated with pathways that can affect blood pressure via renal transcellular Na+ and K+ electrochemical gradients and tubular Na+ transport, mitochondrial TCA cycle and cell energetics, and circadian rhythms. PMID:24714719

  2. Prenatal stress-induced programming of genome-wide promoter DNA methylation in 5-HTT-deficient mice.

    PubMed

    Schraut, K G; Jakob, S B; Weidner, M T; Schmitt, A G; Scholz, C J; Strekalova, T; El Hajj, N; Eijssen, L M T; Domschke, K; Reif, A; Haaf, T; Ortega, G; Steinbusch, H W M; Lesch, K P; Van den Hove, D L

    2014-10-21

    The serotonin transporter gene (5-HTT/SLC6A4)-linked polymorphic region has been suggested to have a modulatory role in mediating effects of early-life stress exposure on psychopathology rendering carriers of the low-expression short (s)-variant more vulnerable to environmental adversity in later life. The underlying molecular mechanisms of this gene-by-environment interaction are not well understood, but epigenetic regulation including differential DNA methylation has been postulated to have a critical role. Recently, we used a maternal restraint stress paradigm of prenatal stress (PS) in 5-HTT-deficient mice and showed that the effects on behavior and gene expression were particularly marked in the hippocampus of female 5-Htt+/- offspring. Here, we examined to which extent these effects are mediated by differential methylation of DNA. For this purpose, we performed a genome-wide hippocampal DNA methylation screening using methylated-DNA immunoprecipitation (MeDIP) on Affymetrix GeneChip Mouse Promoter 1.0 R arrays. Using hippocampal DNA from the same mice as assessed before enabled us to correlate gene-specific DNA methylation, mRNA expression and behavior. We found that 5-Htt genotype, PS and their interaction differentially affected the DNA methylation signature of numerous genes, a subset of which showed overlap with the expression profiles of the corresponding transcripts. For example, a differentially methylated region in the gene encoding myelin basic protein (Mbp) was associated with its expression in a 5-Htt-, PS- and 5-Htt × PS-dependent manner. Subsequent fine-mapping of this Mbp locus linked the methylation status of two specific CpG sites to Mbp expression and anxiety-related behavior. In conclusion, hippocampal DNA methylation patterns and expression profiles of female prenatally stressed 5-Htt+/- mice suggest that distinct molecular mechanisms, some of which are promoter methylation-dependent, contribute to the behavioral effects of the 5-Htt

  3. Genomic analysis of human lung fibroblasts exposed to vanadium pentoxide to identify candidate genes for occupational bronchitis

    PubMed Central

    Ingram, Jennifer L; Antao-Menezes, Aurita; Turpin, Elizabeth A; Wallace, Duncan G; Mangum, James B; Pluta, Linda J; Thomas, Russell S; Bonner, James C

    2007-01-01

    Background Exposure to vanadium pentoxide (V2O5) is a cause of occupational bronchitis. We evaluated gene expression profiles in cultured human lung fibroblasts exposed to V2O5 in vitro in order to identify candidate genes that could play a role in inflammation, fibrosis, and repair during the pathogenesis of V2O5-induced bronchitis. Methods Normal human lung fibroblasts were exposed to V2O5 in a time course experiment. Gene expression was measured at various time points over a 24 hr period using the Affymetrix Human Genome U133A 2.0 Array. Selected genes that were significantly changed in the microarray experiment were validated by RT-PCR. Results V2O5 altered more than 1,400 genes, of which ~300 were induced while >1,100 genes were suppressed. Gene ontology categories (GO) categories unique to induced genes included inflammatory response and immune response, while GO catogories unique to suppressed genes included ubiquitin cycle and cell cycle. A dozen genes were validated by RT-PCR, including growth factors (HBEGF, VEGF, CTGF), chemokines (IL8, CXCL9, CXCL10), oxidative stress response genes (SOD2, PIPOX, OXR1), and DNA-binding proteins (GAS1, STAT1). Conclusion Our study identified a variety of genes that could play pivotal roles in inflammation, fibrosis and repair during V2O5-induced bronchitis. The induction of genes that mediate inflammation and immune responses, as well as suppression of genes involved in growth arrest appear to be important to the lung fibrotic reaction to V2O5. PMID:17459161

  4. Integrating genomics and proteomics permits identification of immunodominant antigens associated with drug resistance in human visceral leishmaniasis in India.

    PubMed

    Singh, Neeloo; Sundar, Shyam

    2017-05-01

    Resistance of human pathogens like Leishmania to drugs is a growing concern where the multidrug-resistant phenotype renders chemotherapy ineffective. The acquired resistance of Leishmania to antimony has promoted intense research on the mechanisms involved but the question has not been resolved yet. In this study we have explored host-pathogen- drug interactions leading to identification of pharmacological determinants of host macrophages that resist the sodium antimony gluconate (SAG) mediated intracellular parasite killing. mRNA profiling of mammalian host stage amastigotes of sodium antimony gluconate (SAG) 'sensitive' and 'resistant' parasite lines was carried out using Affymetrix GeneChip(®) Human Genome U133 Plus 2.0 Array. Patient sera was used to identify immunogenic proteins by two-dimensional gel analysis (2DE) and mass spectrometric analysis (LC-MS/MS). Immunofluorescence microscopy confirmed the identities on 'sensitive' and 'resistant' parasite lines. A total of nine immunogenic proteins whose intensities changed significantly and consistently in multiple experiments were detected, suggesting that a cohort of proteins are altered in expression levels in the 'resistant' parasites. Global expression profiling using microarrays revealed this regulation was not reflected by changes in the levels of the cognate mRNAs. Following identification of proteins by mass spectrometry, one such regulated protein, enolase, was chosen for more detailed analysis. Immunofluorescence microscopy employing antisera against this enzyme confirmed that its level was differentially regulated in the 'resistant' isolate. We show that high serum level of immunoreactive protein is associated with 'resistant' phenotype. Differentially expressed proteins with immunomodulatory activities were found to be associated with the 'resistant phenotype'.

  5. Genomic repeats, genome plasticity and the dynamics of Mycoplasma evolution

    PubMed Central

    Rocha, Eduardo P. C.; Blanchard, Alain

    2002-01-01

    Mycoplasmas evolved by a drastic reduction in genome size, but their genomes contain numerous repeated sequences with important roles in their evolution. We have established a bioinformatic strategy to detect the major recombination hot-spots in the genomes of Mycoplasma pneumoniae, Mycoplasma genitalium, Ureaplasma urealyticum and Mycoplasma pulmonis. This allowed the identification of large numbers of potentially variable regions, as well as a comparison of the relative recombination potentials of different genomic regions. Different trends are perceptible among mycoplasmas, probably due to different functional and structural constraints. The largest potential for illegitimate recombination in M.pulmonis is found at the vsa locus and its comparison in two different strains reveals numerous changes since divergence. On the other hand, the main M.pneumoniae and M.genitalium adhesins rely on large distant repeats and, hence, homologous recombination for variation. However, the relation between the existence of repeats and antigenic variation is not necessarily straightforward, since repeats of P1 adhesin were found to be anti-correlated with epitopes recognized by patient antibodies. These different strategies have important consequences for the structures of genomes, since large distant repeats correlate well with the major chromosomal rearrangements. Probably to avoid such events, mycoplasmas strongly avoid inverse repeats, in comparison to co-oriented repeats. PMID:11972343

  6. High Resolution Copy Number Variation Data in the NCI-60 Cancer Cell Lines from Whole Genome Microarrays Accessible through CellMiner

    PubMed Central

    Varma, Sudhir; Pommier, Yves; Sunshine, Margot; Weinstein, John N.; Reinhold, William C.

    2014-01-01

    Array-based comparative genomic hybridization (aCGH) is a powerful technique for detecting gene copy number variation. It is generally considered to be robust and convenient since it measures DNA rather than RNA. In the current study, we combine copy number estimates from four different platforms (Agilent 44 K, NimbleGen 385 K, Affymetrix 500 K and Illumina Human1Mv1_C) to compute a reliable, high-resolution, easy to understand output for the measure of copy number changes in the 60 cancer cells of the NCI-DTP (the NCI-60). We then relate the results to gene expression. We explain how to access that database using our CellMiner web-tool and provide an example of the ease of comparison with transcript expression, whole exome sequencing, microRNA expression and response to 20,000 drugs and other chemical compounds. We then demonstrate how the data can be analyzed integratively with transcript expression data for the whole genome (26,065 genes). Comparison of copy number and expression levels shows an overall medium high correlation (median r = 0.247), with significantly higher correlations (median r = 0.408) for the known tumor suppressor genes. That observation is consistent with the hypothesis that gene loss is an important mechanism for tumor suppressor inactivation. An integrated analysis of concurrent DNA copy number and gene expression change is presented. Limiting attention to focal DNA gains or losses, we identify and reveal novel candidate tumor suppressors with matching alterations in transcript level. PMID:24670534

  7. Trace levels of mitomycin C disrupt genomic integrity and lead to DNA damage response defect in long-term-cultured human embryonic stem cells.

    PubMed

    Zhou, Di; Lin, Ge; Zeng, Si-Cong; Xiong, Bo; Xie, Ping-Yuan; Cheng, De-Hua; Zheng, Qing; Ouyang, Qi; Zhou, Xiao-Ying; Tang, Wei-Ling; Sun, Yi; Lu, Guang-Ying; Lu, Guang-Xiu

    2015-01-01

    How to maintain the genetic integrity of cultured human embryonic stem (hES) cells is raising crucial concerns for future clinical use in regenerative medicine. Mitomycin C(MMC), a DNA damage agent, is widely used for preparation of feeder cells in many laboratories. However, to what extent MMC affects the karyotypic stability of hES cells is not clear. Here, we measured residual MMC using High Performance Liquid Chromatography-Mass Spectrometry/Mass Spectrometry following each step of feeder preparation and found that 2.26 ± 0.77 and 3.50 ± 0.92 ng/ml remained in mouse feeder cells and human feeder cells, respectively. In addition, different amounts of MMC caused different chromosomal aberrations in hES cells. In particular, one abnormality, dup(1)(p32p36), was the same identical to one we previously reported in another hES cell line. Using Affymetrix SNP 6.0 arrays, the copy number variation changes of the hES cells maintained on MMC-inactivated feeders (MMC-feeder) were significantly more than those cultured on γ-inactivated feeder (IR-feeder) cells. Furthermore, DNA damage response (DDR) genes were down-regulated during long-term culture in the MMC-containing system, leading to DDR defect and shortened telomeres of hES cells, a sign of genomic instability. Therefore, MMC-feeder and MMC-induced genomic variation present an important safety problem that would limit such hES from being applied for future clinic use and drug screening.

  8. Global transcriptomic profiling using small volumes of whole blood: a cost-effective method for translational genomic biomarker identification in small animals.

    PubMed

    Fricano, Meagan M; Ditewig, Amy C; Jung, Paul M; Liguori, Michael J; Blomme, Eric A G; Yang, Yi

    2011-01-01

    Blood is an ideal tissue for the identification of novel genomic biomarkers for toxicity or efficacy. However, using blood for transcriptomic profiling presents significant technical challenges due to the transcriptomic changes induced by ex vivo handling and the interference of highly abundant globin mRNA. Most whole blood RNA stabilization and isolation methods also require significant volumes of blood, limiting their effective use in small animal species, such as rodents. To overcome these challenges, a QIAzol-based RNA stabilization and isolation method (QSI) was developed to isolate sufficient amounts of high quality total RNA from 25 to 500 μL of rat whole blood. The method was compared to the standard PAXgene Blood RNA System using blood collected from rats exposed to saline or lipopolysaccharide (LPS). The QSI method yielded an average of 54 ng total RNA per μL of rat whole blood with an average RNA Integrity Number (RIN) of 9, a performance comparable with the standard PAXgene method. Total RNA samples were further processed using the NuGEN Ovation Whole Blood Solution system and cDNA was hybridized to Affymetrix Rat Genome 230 2.0 Arrays. The microarray QC parameters using RNA isolated with the QSI method were within the acceptable range for microarray analysis. The transcriptomic profiles were highly correlated with those using RNA isolated with the PAXgene method and were consistent with expected LPS-induced inflammatory responses. The present study demonstrated that the QSI method coupled with NuGEN Ovation Whole Blood Solution system is cost-effective and particularly suitable for transcriptomic profiling of minimal volumes of whole blood, typical of those obtained with small animal species.

  9. Genomic predictor of residual risk of recurrence after adjuvant chemotherapy and endocrine therapy in high risk estrogen receptor-positive breast cancers.

    PubMed

    Khan, Sabrina S; Karn, Thomas; Symmans, W Fraser; Rody, Achim; Müller, Volkmar; Holtrich, Uwe; Becker, Sven; Pusztai, Lajos; Hatzis, Christos

    2015-02-01

    A subset of early stage estrogen receptor (ER)-positive breast cancers considered "high risk" for recurrence with endocrine therapy alone by current genomic prognostic predictors, such as Oncotype DX, is no longer high risk after receiving adjuvant chemotherapy. We hypothesized that a recently described gene expression-based outcome predictor adjuvant chemotherapy and endocrine therapy sensitivity (ACES) could re-stratify these patients into high and low risk groups for relapse when treated with both chemo- and endocrine therapies. ACES involves four separate modules (endocrine sensitivity, chemotherapy sensitivity, chemotherapy resistance, and survival prediction) that yield a prediction for good or poor outcome with current standard of care multimodality therapy. ACES was applied to Affymetrix gene expression data from 2 retrospectively collected ER-positive and HER2-negative patient cohorts that were uniformly treated with adjuvant endocrine and chemotherapy (n = 250). Each sample was first risk stratified by a genomic surrogate of Oncotype DX, and the high risk patients (n = 76) were re-stratified by ACES. Recurrence-free survival (RFS) was evaluated with ACES risk categories. The Oncotype DX high risk but ACES good prognosis patients (n = 24, 32%) had an RFS of 95% compared to 76% in the poor prognosis group (n = 52; log-rank p = 0.033) at 5 years. ACES risk category remained an independent predictor in multivariate analysis after adjusting for age, T-stage, and lymph node involvement at diagnosis (hazard ratio 0.15; p = 0.072). Tertiary risk prediction that takes into account chemotherapy and endocrine sensitivity, and baseline prognosis may help identify high risk ER-positive patients who have excellent survival after chemotherapy.

  10. Genetic variation in one-carbon metabolism in relation to genome-wide DNA methylation in breast tissue from heathy women.

    PubMed

    Song, Min-Ae; Brasky, Theodore M; Marian, Catalin; Weng, Daniel Y; Taslim, Cenny; Llanos, Adana A; Dumitrescu, Ramona G; Liu, Zhenua; Mason, Joel B; Spear, Scott L; Kallakury, Bhaskar V S; Freudenheim, Jo L; Shields, Peter G

    2016-03-09

    Single nucleotide polymorphisms (SNPs) in one-carbon metabolism genes and lifestyle factors (alcohol drinking and breast folate) may be determinants of whole-genome methylation in the breast. DNA methylation profiling was performed using the Illumina Infinium HumanMethylation450 BeadChip in 81 normal breast tissues from women undergoing reduction mammoplasty and no history of cancer. ANCOVA, adjusting for age, race and BMI, was used to identify differentially-methylated (DM) CpGs. Gene expression, by the Affymetrix GeneChip Human Transcriptome Array 2.0, was correlated with DM. Biological networks of DM genes were assigned using Ingenuity Pathway Analysis. Fifty-seven CpG sites were DM in association with eight SNPs in FTHFD, MTHFD1, MTHFR, MTR, MTRR, and TYMS (P <5.0 x 10(-5)); 56% of the DM CpGs were associated with FTHFD SNPs, including DM within FTHFD. Gene expression was negatively correlated with FTHFD methylation (r=-0.25, P=0.017). Four DM CpGs identified by SNPs in MTRR, MTHFR, and FTHFD were significantly associated with alcohol consumption and/or breast folate. The top biological network of DM CpGs was associated with Energy Production, Molecular Transportation, and Nucleic Acid Metabolism. This is the first comprehensive study of the association between SNPs in one-carbon metabolism genes and genome-wide DNA methylation in normal breast tissues. These SNPs, especially FTHFD, as well as alcohol intake and folate exposure, appear to affect DM in breast tissues of healthy women. The finding that SNPs in FTHFD and MTR are associated with their own methylation is novel and highlights a role for these SNPs as cis-methylation quantitative trait loci.

  11. A genome-wide meta-analysis of nodular sclerosing Hodgkin lymphoma identifies risk loci at 6p21.32.

    PubMed

    Cozen, Wendy; Li, Dalin; Best, Timothy; Van Den Berg, David J; Gourraud, Pierre-Antoine; Cortessis, Victoria K; Skol, Andrew D; Mack, Thomas M; Glaser, Sally L; Weiss, Lawrence M; Nathwani, Bharat N; Bhatia, Smita; Schumacher, Fredrick R; Edlund, Christopher K; Hwang, Amie E; Slager, Susan L; Fredericksen, Zachary S; Strong, Louise C; Habermann, Thomas M; Link, Brian K; Cerhan, James R; Robison, Leslie L; Conti, David V; Onel, Kenan

    2012-01-12

    Nodular sclerosing Hodgkin lymphoma (NSHL) is a distinct, highly heritable Hodgkin lymphoma subtype. We undertook a genome-wide meta-analysis of 393 European-origin adolescent/young adult NSHL patients and 3315 controls using the Illumina Human610-Quad Beadchip and Affymetrix Genome-Wide Human SNP Array 6.0. We identified 3 single nucleotide polymorphisms (SNPs) on chromosome 6p21.32 that were significantly associated with NSHL risk: rs9268542 (P = 5.35 × 10(-10)), rs204999 (P = 1.44 × 10(-9)), and rs2858870 (P = 1.69 × 10(-8)). We also confirmed a previously reported association in the same region, rs6903608 (P = 3.52 × 10(-10)). rs204999 and rs2858870 were weakly correlated (r(2) = 0.257), and the remaining pairs of SNPs were not correlated (r(2) < 0.1). In an independent set of 113 NSHL cases and 214 controls, 2 SNPs were significantly associated with NSHL and a third showed a comparable odds ratio (OR). These SNPs are found on 2 haplotypes associated with NSHL risk (rs204999-rs9268528-rs9268542-rs6903608-rs2858870; AGGCT, OR = 1.7, P = 1.71 × 10(-6); GAATC, OR = 0.4, P = 1.16 × 10(-4)). All individuals with the GAATC haplotype also carried the HLA class II DRB1*0701 allele. In a separate analysis, the DRB1*0701 allele was associated with a decreased risk of NSHL (OR = 0.5, 95% confidence interval = 0.4, 0.7). These data support the importance of the HLA class II region in NSHL etiology.

  12. A comparison of whole genome gene expression profiles of HepaRG cells and HepG2 cells to primary human hepatocytes and human liver tissues.

    PubMed

    Hart, Steven N; Li, Ye; Nakamoto, Kaori; Subileau, Eva-anne; Steen, David; Zhong, Xiao-bo

    2010-06-01

    HepaRG cells, derived from a female hepatocarcinoma patient, are capable of differentiating into biliary epithelial cells and hepatocytes. More importantly, differentiated HepaRG cells are able to maintain activities of many xenobiotic-metabolizing enzymes, and expression of the metabolizing enzyme genes can be induced by xenobiotics. The ability of these cells to express and induce xenobiotic-metabolizing enzymes is in stark contrast to the frequently used HepG2 cells. The previous studies have mainly focused on a set of selected genes; therefore, it is of significant interest to know the extent of similarity of gene expression at whole genome levels in HepaRG cells and HepG2 cells compared with primary human hepatocytes and human liver tissues. To accomplish this objective, we used Affymetrix (Santa Clara, CA) U133 Plus 2.0 arrays to characterize the whole genome gene expression profiles in triplicate biological samples from HepG2 cells, HepaRG cells (undifferentiated and differentiated cells), freshly isolated primary human hepatocytes, and frozen liver tissues. After using similarity matrix, principal components, and hierarchical clustering methods, we found that HepaRG cells globally transcribe genes at levels more similar to human primary hepatocytes and human liver tissues than HepG2 cells. In particular, many genes encoding drug-processing proteins are transcribed at a more similar level in HepaRG cells than in HepG2 cells compared with primary human hepatocytes and liver samples. The transcriptomic similarity of HepaRG with primary human hepatocytes is encouraging for use of HepaRG cells in the study of xenobiotic metabolism, hepatotoxicology, and hepatocyte differentiation.

  13. GIPSy: Genomic island prediction software.

    PubMed

    Soares, Siomar C; Geyik, Hakan; Ramos, Rommel T J; de Sá, Pablo H C G; Barbosa, Eudes G V; Baumbach, Jan; Figueiredo, Henrique C P; Miyoshi, Anderson; Tauch, Andreas; Silva, Artur; Azevedo, Vasco

    2016-08-20

    Bacteria are highly diverse organisms that are able to adapt to a broad range of environments and hosts due to their high genomic plasticity. Horizontal gene transfer plays a pivotal role in this genome plasticity and in evolution by leaps through the incorporation of large blocks of genome sequences, ordinarily known as genomic islands (GEIs). GEIs may harbor genes encoding virulence, metabolism, antibiotic resistance and symbiosis-related functions, namely pathogenicity islands (PAIs), metabolic islands (MIs), resistance islands (RIs) and symbiotic islands (SIs). Although many software for the prediction of GEIs exist, they only focus on PAI prediction and present other limitations, such as complicated installation and inconvenient user interfaces. Here, we present GIPSy, the genomic island prediction software, a standalone and user-friendly software for the prediction of GEIs, built on our previously developed pathogenicity island prediction software (PIPS). We also present four application cases in which we crosslink data from literature to PAIs, MIs, RIs and SIs predicted by GIPSy. Briefly, GIPSy correctly predicted the following previously described GEIs: 13 PAIs larger than 30kb in Escherichia coli CFT073; 1 MI for Burkholderia pseudomallei K96243, which seems to be a miscellaneous island; 1 RI of Acinetobacter baumannii AYE, named AbaR1; and, 1 SI of Mesorhizobium loti MAFF303099 presenting a mosaic structure. GIPSy is the first life-style-specific genomic island prediction software to perform analyses of PAIs, MIs, RIs and SIs, opening a door for a better understanding of bacterial genome plasticity and the adaptation to new traits.

  14. GOLD: The Genomes Online Database

    DOE Data Explorer

    Kyrpides, Nikos; Liolios, Dinos; Chen, Amy; Tavernarakis, Nektarios; Hugenholtz, Philip; Markowitz, Victor; Bernal, Alex

    Since its inception in 1997, GOLD has continuously monitored genome sequencing projects worldwide and has provided the community with a unique centralized resource that integrates diverse information related to Archaea, Bacteria, Eukaryotic and more recently Metagenomic sequencing projects. As of September 2007, GOLD recorded 639 completed genome projects. These projects have their complete sequence deposited into the public archival sequence databases such as GenBank EMBL,and DDBJ. From the total of 639 complete and published genome projects as of 9/2007, 527 were bacterial, 47 were archaeal and 65 were eukaryotic. In addition to the complete projects, there were 2158 ongoing sequencing projects. 1328 of those were bacterial, 59 archaeal and 771 eukaryotic projects. Two types of metadata are provided by GOLD: (i) project metadata and (ii) organism/environment metadata. GOLD CARD pages for every project are available from the link of every GOLD_STAMP ID. The information in every one of these pages is organized into three tables: (a) Organism information, (b) Genome project information and (c) External links. [The Genomes On Line Database (GOLD) in 2007: Status of genomic and metagenomic projects and their associated metadata, Konstantinos Liolios, Konstantinos Mavromatis, Nektarios Tavernarakis and Nikos C. Kyrpides, Nucleic Acids Research Advance Access published online on November 2, 2007, Nucleic Acids Research, doi:10.1093/nar/gkm884]

    The basic tables in the GOLD database that can be browsed or searched include the following information:

    • Gold Stamp ID
    • Organism name
    • Domain
    • Links to information sources
    • Size and link to a map, when available
    • Chromosome number, Plas number, and GC content
    • A link for downloading the actual genome data
    • Institution that did the sequencing
    • Funding source
    • Database where information resides
    • Publication status and information

    • Mapping whole genome shotgun sequence and variant calling in mammalian species without their reference genomes

      Technology Transfer Automated Retrieval System (TEKTRAN)

      Genomics research in mammals has produced reference genome sequences that are essential for identifying variation associated with disease. High quality reference genome sequences are now available for humans, model species, and economically important agricultural animals. Comparisons between these s...

    • Exploring cancer genomic data from the cancer genome atlas project.

      PubMed

      Lee, Ju-Seog

      2016-11-01

      The Cancer Genome Atlas (TCGA) has compiled genomic, epigenomic, and proteomic data from more than 10,000 samples derived from 33 types of cancer, aiming to improve our understanding of the molecular basis of cancer development. Availability of these genome-wide information provides an unprecedented opportunity for uncovering new key regulators of signaling pathways or new roles of pre-existing members in pathways. To take advantage of the advancement, it will be necessary to learn systematic approaches that can help to uncover novel genes reflecting genetic alterations, prognosis, or response to treatments. This minireview describes the updated status of TCGA project and explains how to use TCGA data. [BMB Reports 2016; 49(11): 607-611].

    • Genomics made easier: an introductory tutorial to genome datamining.

      PubMed

      Schattner, Peter

      2009-03-01

      Integrated genome databases--such as the UCSC, Ensembl and NCBI MapViewer databases--and their associated data querying and visualization interfaces (e.g. the genome browsers) have transformed the way that molecular biologists, geneticists and bioinformaticists analyze genomic data. Nevertheless, because of the complexity of these tools, many researchers take advantage of only a fraction of their capabilities. In this tutorial, using examples from medical genetics and alternative splicing, I describe some of the biological questions that can be addressed with these techniques. I also show why doing so typically is more effective than using alternative methods and indicate some of the resources available for learning more about the advanced capabilities of these powerful tools.

    • Genome size: a novel genomic signature in support of Afrotheria.

      PubMed

      Redi, Carlo Alberto; Garagna, Silvia; Zuccotti, Maurizio; Capanna, Ernesto

      2007-04-01

      Molecular phylogenetic analyses suggest an emerging phylogeny for the extant Placentalia (eutherian) that radically departs from morphologically based constructions of the past. Placental mammals are partitioned into four supraordinal clades: Afrotheria, Xenarthra, Laurasiatheria, and Euarchontoglires. Afrotheria form an endemic African clade that includes elephant shrews, golden moles, tenrecs, aardvarks, hyraxes, elephants, dugongs, and manatees. Datamining databases of genome size (GS) shows that till today just one afrotherian GS has been evaluated, that of the aardvark Orycteropus afer. We show that the GSs of six selected representatives across the Afrotheria supraordinal group are among the highest for the extant Placentalia, providing a novel genomic signature of this enigmatic group. The mean GS value of Afrotheria, 5.3 +/- 0.7 pg, is the highest reported for the extant Placentalia. This should assist in planning new genome sequencing initiatives.

    • Human Genome Education Program

      SciTech Connect

      Richard Myers; Lane Conn

      2000-05-01

      The funds from the DOE Human Genome Program, for the project period 2/1/96 through 1/31/98, have provided major support for the curriculum development and field testing efforts for two high school level instructional units: Unit 1, ''Exploring Genetic Conditions: Genes, Culture and Choices''; and Unit 2, ''DNA Snapshots: Peaking at Your DNA''. In the original proposal, they requested DOE support for the partial salary and benefits of a Field Test Coordinator position to: (1) complete the field testing and revision of two high school curriculum units, and (2) initiate the education of teachers using these units. During the project period of this two-year DOE grant, a part-time Field-Test Coordinator was hired (Ms. Geraldine Horsma) and significant progress has been made in both of the original proposal objectives. Field testing for Unit 1 has occurred in over 12 schools (local and non-local sites with diverse student populations). Field testing for Unit 2 has occurred in over 15 schools (local and non-local sites) and will continue in 12-15 schools during the 96-97 school year. For both curricula, field-test sites and site teachers were selected for their interest in genetics education and in hands-on science education. Many of the site teachers had no previous experience with HGEP or the unit under development. Both of these first-year biology curriculum units, which contain genetics, biotechnology, societal, ethical and cultural issues related to HGP, are being implemented in many local and non-local schools (SF Bay Area, Southern California, Nebraska, Hawaii, and Texas) and in programs for teachers. These units will reach over 10,000 students in the SF Bay Area and continues to receive support from local corporate and private philanthropic organizations. Although HGEP unit development is nearing completion for both units, data is still being gathered and analyzed on unit effectiveness and student learning. The final field testing result from this analysis will

    • AcCNET (Accessory Genome Constellation Network): comparative genomics software for accessory genome analysis using bipartite networks.

      PubMed

      Lanza, Val F; Baquero, Fernando; de la Cruz, Fernando; Coque, Teresa M

      2017-01-15

      AcCNET (Accessory genome Constellation Network) is a Perl application that aims to compare accessory genomes of a large number of genomic units, both at qualitative and quantitative levels. Using the proteomes extracted from the analysed genomes, AcCNET creates a bipartite network compatible with standard network analysis platforms. AcCNET allows merging phylogenetic and functional information about the concerned genomes, thus improving the capability of current methods of network analysis. The AcCNET bipartite network opens a new perspective to explore the pangenome of bacterial species, focusing on the accessory genome behind the idiosyncrasy of a particular strain and/or population.

  1. Comparative genomics for biodiversity conservation

    PubMed Central

    Grueber, Catherine E.

    2015-01-01

    Genomic approaches are gathering momentum in biology and emerging opportunities lie in the creative use of comparative molecular methods for revealing the processes that influence diversity of wildlife. However, few comparative genomic studies are performed with explicit and specific objectives to aid conservation of wild populations. Here I provide a brief overview of comparative genomic approaches that offer specific benefits to biodiversity conservation. Because conservation examples are few, I draw on research from other areas to demonstrate how comparing genomic data across taxa may be used to inform the characterisation of conservation units and studies of hybridisation, as well as studies that provide conservation outcomes from a better understanding of the drivers of divergence. A comparative approach can also provide valuable insight into the threatening processes that impact rare species, such as emerging diseases and their management in conservation. In addition to these opportunities, I note areas where additional research is warranted. Overall, comparing and contrasting the genomic composition of threatened and other species provide several useful tools for helping to preserve the molecular biodiversity of the global ecosystem. PMID:26106461

  2. NCBI prokaryotic genome annotation pipeline.

    PubMed

    Tatusova, Tatiana; DiCuccio, Michael; Badretdin, Azat; Chetvernin, Vyacheslav; Nawrocki, Eric P; Zaslavsky, Leonid; Lomsadze, Alexandre; Pruitt, Kim D; Borodovsky, Mark; Ostell, James

    2016-08-19

    Recent technological advances have opened unprecedented opportunities for large-scale sequencing and analysis of populations of pathogenic species in disease outbreaks, as well as for large-scale diversity studies aimed at expanding our knowledge across the whole domain of prokaryotes. To meet the challenge of timely interpretation of structure, function and meaning of this vast genetic information, a comprehensive approach to automatic genome annotation is critically needed. In collaboration with Georgia Tech, NCBI has developed a new approach to genome annotation that combines alignment based methods with methods of predicting protein-coding and RNA genes and other functional elements directly from sequence. A new gene finding tool, GeneMarkS+, uses the combined evidence of protein and RNA placement by homology as an initial map of annotation to generate and modify ab initio gene predictions across the whole genome. Thus, the new NCBI's Prokaryotic Genome Annotation Pipeline (PGAP) relies more on sequence similarity when confident comparative data are available, while it relies more on statistical predictions in the absence of external evidence. The pipeline provides a framework for generation and analysis of annotation on the full breadth of prokaryotic taxonomy. For additional information on PGAP see https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ and the NCBI Handbook, https://www.ncbi.nlm.nih.gov/books/NBK174280/.

  3. Genomic dissection of the seed

    PubMed Central

    Becker, Michael G.; Hsu, Ssu-Wei; Harada, John J.; Belmonte, Mark F.

    2014-01-01

    Seeds play an integral role in the global food supply and account for more than 70% of the calories that we consume on a daily basis. To meet the demands of an increasing population, scientists are turning to seed genomics research to find new and innovative ways to increase food production. Seed genomics is evolving rapidly, and the information produced from seed genomics research has exploded over the past two decades. Advances in modern sequencing strategies that profile every molecule in every cell, tissue, and organ and the emergence of new model systems have provided the tools necessary to unravel many of the biological processes underlying seed development. Despite these advances, the analyses and mining of existing seed genomics data remain a monumental task for plant biologists. This review summarizes seed region and subregion genomic data that are currently available for existing and emerging oilseed models. We provide insight into the development of tools on how to analyze large-scale datasets. PMID:25309563

  4. Genome edited sheep and cattle.

    PubMed

    Proudfoot, Chris; Carlson, Daniel F; Huddart, Rachel; Long, Charles R; Pryor, Jane H; King, Tim J; Lillico, Simon G; Mileham, Alan J; McLaren, David G; Whitelaw, C Bruce A; Fahrenkrug, Scott C

    2015-02-01

    Genome editing tools enable efficient and accurate genome manipulation. An enhanced ability to modify the genomes of livestock species could be utilized to improve disease resistance, productivity or breeding capability as well as the generation of new biomedical models. To date, with respect to the direct injection of genome editor mRNA into livestock zygotes, this technology has been limited to the generation of pigs with edited genomes. To capture the far-reaching applications of gene-editing, from disease modelling to agricultural improvement, the technology must be easily applied to a number of species using a variety of approaches. In this study, we demonstrate zygote injection of TALEN mRNA can also produce gene-edited cattle and sheep. In both species we have targeted the myostatin (MSTN) gene. In addition, we report a critical innovation for application of gene-editing to the cattle industry whereby gene-edited calves can be produced with specified genetics by ovum pickup, in vitro fertilization and zygote microinjection (OPU-IVF-ZM). This provides a practical alternative to somatic cell nuclear transfer for gene knockout or introgression of desirable alleles into a target breed/genetic line.

  5. The genome of Chenopodium quinoa.

    PubMed

    Jarvis, David E; Ho, Yung Shwen; Lightfoot, Damien J; Schmöckel, Sandra M; Li, Bo; Borm, Theo J A; Ohyanagi, Hajime; Mineta, Katsuhiko; Michell, Craig T; Saber, Noha; Kharbatia, Najeh M; Rupper, Ryan R; Sharp, Aaron R; Dally, Nadine; Boughton, Berin A; Woo, Yong H; Gao, Ge; Schijlen, Elio G W M; Guo, Xiujie; Momin, Afaque A; Negrão, Sónia; Al-Babili, Salim; Gehring, Christoph; Roessner, Ute; Jung, Christian; Murphy, Kevin; Arold, Stefan T; Gojobori, Takashi; Linden, C Gerard van der; van Loo, Eibertus N; Jellen, Eric N; Maughan, Peter J; Tester, Mark

    2017-02-16

    Chenopodium quinoa (quinoa) is a highly nutritious grain identified as an important crop to improve world food security. Unfortunately, few resources are available to facilitate its genetic improvement. Here we report the assembly of a high-quality, chromosome-scale reference genome sequence for quinoa, which was produced using single-molecule real-time sequencing in combination with optical, chromosome-contact and genetic maps. We also report the sequencing of two diploids from the ancestral gene pools of quinoa, which enables the identification of sub-genomes in quinoa, and reduced-coverage genome sequences for 22 other samples of the allotetraploid goosefoot complex. The genome sequence facilitated the identification of the transcription factor likely to control the production of anti-nutritional triterpenoid saponins found in quinoa seeds, including a mutation that appears to cause alternative splicing and a premature stop codon in sweet quinoa strains. These genomic resources are an important first step towards the genetic improvement of quinoa.

  6. Expanding genomics of mycorrhizal symbiosis

    DOE PAGES

    Kuo, Alan; Kohler, Annegret; Martin, Francis M.; ...

    2014-11-04

    The mycorrhizal symbiosis between soil fungi and plant roots is a ubiquitous mutualism that plays key roles in plant nutrition, soil health, and carbon cycling. The symbiosis evolved repeatedly and independently as multiple morphotypes [e.g., arbuscular mycorrhizae (AM), ectomycorrhizal (ECM)] in multiple fungal clades (e.g., phyla Glomeromycota, Ascomycota, Basidiomycota). The accessibility and cultivability of many mycorrhizal partners make them ideal models for symbiosis studies. Alongside molecular, physiological, and ecological investigations, sequencing led to the first three mycorrhizal fungal genomes, representing two morphotypes and three phyla. The genome of the ECM basidiomycete Laccaria bicolor showed that the mycorrhizal lifestyle can evolvemore » through loss of plant cell wall-degrading enzymes (PCWDEs) and expansion of lineage-specific gene families such as short secreted protein (SSP) effectors. The genome of the ECM ascomycete Tuber melanosporum showed that the ECM type can evolve without expansion of families as in Laccaria, and thus a different set of symbiosis genes. The genome of the AM glomeromycete Rhizophagus irregularis showed that despite enormous phylogenetic distance and morphological difference from the other two fungi, symbiosis can involve similar solutions as symbiosis-induced SSPs and loss of PCWDEs. The three genomes provide a solid base for addressing fundamental questions about the nature and role of a vital mutualism.« less

  7. Expanding genomics of mycorrhizal symbiosis

    SciTech Connect

    Kuo, Alan; Kohler, Annegret; Martin, Francis M.; Grigoriev, Igor V.

    2014-11-04

    The mycorrhizal symbiosis between soil fungi and plant roots is a ubiquitous mutualism that plays key roles in plant nutrition, soil health, and carbon cycling. The symbiosis evolved repeatedly and independently as multiple morphotypes [e.g., arbuscular mycorrhizae (AM), ectomycorrhizal (ECM)] in multiple fungal clades (e.g., phyla Glomeromycota, Ascomycota, Basidiomycota). The accessibility and cultivability of many mycorrhizal partners make them ideal models for symbiosis studies. Alongside molecular, physiological, and ecological investigations, sequencing led to the first three mycorrhizal fungal genomes, representing two morphotypes and three phyla. The genome of the ECM basidiomycete Laccaria bicolor showed that the mycorrhizal lifestyle can evolve through loss of plant cell wall-degrading enzymes (PCWDEs) and expansion of lineage-specific gene families such as short secreted protein (SSP) effectors. The genome of the ECM ascomycete Tuber melanosporum showed that the ECM type can evolve without expansion of families as in Laccaria, and thus a different set of symbiosis genes. The genome of the AM glomeromycete Rhizophagus irregularis showed that despite enormous phylogenetic distance and morphological difference from the other two fungi, symbiosis can involve similar solutions as symbiosis-induced SSPs and loss of PCWDEs. The three genomes provide a solid base for addressing fundamental questions about the nature and role of a vital mutualism.

  8. Advances in Genome Biology & Technology

    SciTech Connect

    Thomas J. Albert, Jon R. Armstrong, Raymond K. Auerback, W. Brad Barbazuk, et al.

    2007-12-01

    This year's meeting focused on the latest advances in new DNA sequencing technologies and the applications of genomics to disease areas in biology and biomedicine. Daytime plenary sessions highlighted cutting-edge research in areas such as complex genetic diseases, comparative genomics, medical sequencing, massively parallel DNA sequencing, and synthetic biology. Technical approaches being developed and utilized in contemporary genomics research were presented during evening concurrent sessions. Also, as in previous years, poster sessions bridged the morning and afternoon plenary sessions. In addition, for the third year in a row, the Advances in Genome Biology and Technology (AGBT) meeting was preceded by a pre-meeting workshop that aimed to provide an introductory overview for trainees and other meeting attendees. This year, speakers at the workshop focused on next-generation sequencing technologies, including their experiences, findings, and helpful advise for others contemplating using these platforms in their research. Speakers from genome centers and core sequencing facilities were featured and the workshop ended with a roundtable discussion, during which speakers fielded questions from the audience.

  9. Evolutionary engineering by genome shuffling.

    PubMed

    Biot-Pelletier, Damien; Martin, Vincent J J

    2014-05-01

    An upsurge in the bioeconomy drives the need for engineering microorganisms with increasingly complex phenotypes. Gains in productivity of industrial microbes depend on the development of improved strains. Classical strain improvement programmes for the generation, screening and isolation of such mutant strains have existed for several decades. An alternative to traditional strain improvement methods, genome shuffling, allows the directed evolution of whole organisms via recursive recombination at the genome level. This review deals chiefly with the technical aspects of genome shuffling. It first presents the diversity of organisms and phenotypes typically evolved using this technology and then reviews available sources of genetic diversity and recombination methodologies. Analysis of the literature reveals that genome shuffling has so far been restricted to microorganisms, both prokaryotes and eukaryotes, with an overepresentation of antibiotics- and biofuel-producing microbes. Mutagenesis is the main source of genetic diversity, with few studies adopting alternative strategies. Recombination is usually done by protoplast fusion or sexual recombination, again with few exceptions. For both diversity and recombination, prospective methods that have not yet been used are also presented. Finally, the potential of genome shuffling for gaining insight into the genetic basis of complex phenotypes is also discussed.

  10. Accelerated genome engineering through multiplexing.

    PubMed

    Bao, Zehua; Cobb, Ryan E; Zhao, Huimin

    2016-01-01

    Throughout the biological sciences, the past 15 years have seen a push toward the analysis and engineering of biological systems at the organism level. Given the complexity of even the simplest organisms, though, to elicit a phenotype of interest often requires genotypic manipulation of several loci. By traditional means, sequential editing of genomic targets requires a significant investment of time and labor, as the desired editing event typically occurs at a very low frequency against an overwhelming unedited background. In recent years, the development of a suite of new techniques has greatly increased editing efficiency, opening up the possibility for multiple editing events to occur in parallel. Termed as multiplexed genome engineering, this approach to genome editing has greatly expanded the scope of possible genome manipulations in diverse hosts, ranging from bacteria to human cells. The enabling technologies for multiplexed genome engineering include oligonucleotide-based and nuclease-based methodologies, and their application has led to the great breadth of successful examples described in this review. While many technical challenges remain, there also exists a multiplicity of opportunities in this rapidly expanding field.

  11. Accelerated Genome Engineering through Multiplexing

    PubMed Central

    Zhao, Huimin

    2015-01-01

    Throughout the biological sciences, the past fifteen years have seen a push towards the analysis and engineering of biological systems at the organism level. Given the complexity of even the simplest organisms, though, to elicit a phenotype of interest often requires genotypic manipulation of several loci. By traditional means, sequential editing of genomic targets requires a significant investment of time and labor, as the desired editing event typically occurs at a very low frequency against an overwhelming unedited background. In recent years, the development of a suite of new techniques has greatly increased editing efficiency, opening up the possibility for multiple editing events to occur in parallel. Termed as multiplexed genome engineering, this approach to genome editing has greatly expanded the scope of possible genome manipulations in diverse hosts, ranging from bacteria to human cells. The enabling technologies for multiplexed genome engineering include oligonucleotide-based and nuclease-based methodologies, and their application has led to the great breadth of successful examples described in this review. While many technical challenges remain, there also exists a multiplicity of opportunities in this rapidly expanding field. PMID:26394307

  12. Expanding genomics of mycorrhizal symbiosis

    PubMed Central

    Kuo, Alan; Kohler, Annegret; Martin, Francis M.; Grigoriev, Igor V.

    2014-01-01

    The mycorrhizal symbiosis between soil fungi and plant roots is a ubiquitous mutualism that plays key roles in plant nutrition, soil health, and carbon cycling. The symbiosis evolved repeatedly and independently as multiple morphotypes [e.g., arbuscular mycorrhizae (AM), ectomycorrhizal (ECM)] in multiple fungal clades (e.g., phyla Glomeromycota, Ascomycota, Basidiomycota). The accessibility and cultivability of many mycorrhizal partners make them ideal models for symbiosis studies. Alongside molecular, physiological, and ecological investigations, sequencing led to the first three mycorrhizal fungal genomes, representing two morphotypes and three phyla. The genome of the ECM basidiomycete Laccaria bicolor showed that the mycorrhizal lifestyle can evolve through loss of plant cell wall-degrading enzymes (PCWDEs) and expansion of lineage-specific gene families such as short secreted protein (SSP) effectors. The genome of the ECM ascomycete Tuber melanosporum showed that the ECM type can evolve without expansion of families as in Laccaria, and thus a different set of symbiosis genes. The genome of the AM glomeromycete Rhizophagus irregularis showed that despite enormous phylogenetic distance and morphological difference from the other two fungi, symbiosis can involve similar solutions as symbiosis-induced SSPs and loss of PCWDEs. The three genomes provide a solid base for addressing fundamental questions about the nature and role of a vital mutualism. PMID:25408690

  13. Genomes on the Edge: Programmed Genome Instability in Ciliates

    PubMed Central

    Bracht, John R.; Fang, Wenwen; Goldman, Aaron David; Dolzhenko, Egor; Stein, Elizabeth M.; Landweber, Laura F.

    2013-01-01

    Ciliates are an ancient and diverse group of microbial eukaryotes that have emerged as powerful models for RNA-mediated epigenetic inheritance. They possess extensive sets of both tiny and long noncoding RNAs that, together with a suite of proteins that includes transposases, orchestrate a broad cascade of genome rearrangements during somatic nuclear development. This Review emphasizes three important themes: the remarkable role of RNA in shaping genome structure, recent discoveries that unify many deeply diverged ciliate genetic systems, and a surprising evolutionary “sign change” in the role of small RNAs between major species groups. PMID:23374338

  14. Freshwater bacterial lifestyles inferred from comparative genomics.

    PubMed

    Livermore, Joshua A; Emrich, Scott J; Tan, John; Jones, Stuart E

    2014-03-01

    While micro-organisms actively mediate and participate in freshwater ecosystem services, we know little about freshwater microbial genetic diversity. Genome sequences are available for many bacteria from the human microbiome and the ocean (over 800 and 200, respectively), but only two freshwater genomes are currently available: the streamlined genomes of Polynucleobacter necessarius ssp. asymbioticus and the Actinobacterium AcI-B1. Here, we sequenced and analysed draft genomes of eight phylogentically diverse freshwater bacteria exhibiting a range of lifestyle characteristics. Comparative genomics of these bacteria reveals putative freshwater bacterial lifestyles based on differences in predicted growth rate, capability to respond to environmental stimuli and diversity of useable carbon substrates. Our conceptual model based on these genomic characteristics provides a foundation on which further ecophysiological and genomic studies can be built. In addition, these genomes greatly expand the diversity of existing genomic context for future studies on the ecology and genetics of freshwater bacteria.

  15. The UCSC Genome Browser database: 2015 update.

    PubMed

    Rosenbloom, Kate R; Armstrong, Joel; Barber, Galt P; Casper, Jonathan; Clawson, Hiram; Diekhans, Mark; Dreszer, Timothy R; Fujita, Pauline A; Guruvadoo, Luvina; Haeussler, Maximilian; Harte, Rachel A; Heitner, Steve; Hickey, Glenn; Hinrichs, Angie S; Hubley, Robert; Karolchik, Donna; Learned, Katrina; Lee, Brian T; Li, Chin H; Miga, Karen H; Nguyen, Ngan; Paten, Benedict; Raney, Brian J; Smit, Arian F A; Speir, Matthew L; Zweig, Ann S; Haussler, David; Kuhn, Robert M; Kent, W James

    2015-01-01

    Launched in 2001 to showcase the draft human genome assembly, the UCSC Genome Browser database (http://genome.ucsc.edu) and associated tools continue to grow, providing a comprehensive resource of genome assemblies and annotations to scientists and students worldwide. Highlights of the past year include the release of a browser for the first new human genome reference assembly in 4 years in December 2013 (GRCh38, UCSC hg38), a watershed comparative genomics annotation (100-species multiple alignment and conservation) and a novel distribution mechanism for the browser (GBiB: Genome Browser in a Box). We created browsers for new species (Chinese hamster, elephant shark, minke whale), 'mined the web' for DNA sequences and expanded the browser display with stacked color graphs and region highlighting. As our user community increasingly adopts the UCSC track hub and assembly hub representations for sharing large-scale genomic annotation data sets and genome sequencing projects, our menu of public data hubs has tripled.

  16. Intrapopulation Genome Size Dynamics in Festuca pallens

    PubMed Central

    Šmarda, Petr; Bureš, Petr; Horová, Lucie; Rotreklová, Olga

    2008-01-01

    Background and Aims It is well known that genome size differs among species. However, information on the variation and dynamics of genome size in wild populations and on the early phase of genome size divergence between taxa is currently lacking. Genome size dynamics, heritability and phenotype effects are analysed here in a wild population of Festuca pallens (Poaceae). Methods Genome size was measured using flow cytometry with DAPI dye in 562 seedlings from 17 maternal plants varying in genome size. The repeatability of genome size measurements was verified at different seasons through the use of different standards and with propidium iodide dye; the range of variation observed was tested via analysis of double-peaks. Additionally, chromosome counts were made in selected seedlings. Key Results and Conclusions Analysis of double-peaks showed that genome size varied up to 1·188-fold within all 562 seedlings, 1·119-fold within the progeny of a single maternal plant and 1·117-fold in seedlings from grains of a single inflorescence. Generally, genome sizes of seedlings and their mothers were highly correlated. However, in maternal plants with both larger and smaller genomes, genome sizes of seedlings were shifted towards the population median. This was probably due to the frequency of available paternal genomes (pollen grains) in the population. There was a stabilizing selection on genome size during the development of seedlings into adults, which may be important for stabilizing genome size within species. Furthermore, a positive correlation was found between genome size and the development rate of seedlings. A larger genome may therefore provide a competitive advantage, perhaps explaining the higher proportion of plants with larger genomes in the population studied. The reason for the observed variation may be the recent induction of genome size variation, e.g. by activity of retrotransposons, which may be preserved in the long term by the segregation of

  17. Multiscale Representation of Genomic Signals

    PubMed Central

    Knijnenburg, Theo A.; Ramsey, Stephen A.; Berman, Benjamin P.; Kennedy, Kathleen A.; Smit, Arian F.A.; Wessels, Lodewyk F.A.; Laird, Peter W.; Aderem, Alan; Shmulevich, Ilya

    2014-01-01

    Genomic information is encoded on a wide range of distance scales, ranging from tens of base pairs to megabases. We developed a multiscale framework to analyze and visualize the information content of genomic signals. Different types of signals, such as GC content or DNA methylation, are characterized by distinct patterns of signal enrichment or depletion across scales spanning several orders of magnitude. These patterns are associated with a variety of genomic annotations, including genes, nuclear lamina associated domains, and repeat elements. By integrating the information across all scales, as compared to using any single scale, we demonstrate improved prediction of gene expression from Polymerase II chromatin immunoprecipitation sequencing (ChIP-seq) measurements and we observed that gene expression differences in colorectal cancer are not most strongly related to gene body methylation, but rather to methylation patterns that extend beyond the single-gene scale. PMID:24727652

  18. Clinical Genomics: Challenges and Opportunities.

    PubMed

    Vijay, Priyanka; McIntyre, Alexa B R; Mason, Christopher E; Greenfield, Jeffrey P; Li, Sheng

    2016-01-01

    Next-generation sequencing (NGS) approaches are highly applicable to clinical studies. We review recent advances in sequencing technologies, as well as their benefits and tradeoffs, to provide an overview of clinical genomics from study design to computational analysis. Sequencing technologies enable genomic, transcriptomic, and epigenomic evaluations. Studies that use a combination of whole genome, exome, mRNA, and bisulfite sequencing are now feasible due to decreasing sequencing costs. Single-molecule sequencing increases read length, with the MinIONTM nanopore sequencer, which offers a uniquely portable option at a lower cost. Many of the published comparisons we review here address the challenges associated with different sequencing methods. Overall, NGS techniques, coupled with continually improving analysis algorithms, are useful for clinical studies in many realms, including cancer, chronic illness, and neurobiology. We, and others in the field, anticipate the clinical use of NGS approaches will continue to grow, especially as we shift into an era of precision medicine.

  19. Enhancer Identification through Comparative Genomics

    SciTech Connect

    Visel, Axel; Bristow, James; Pennacchio, Len A.

    2006-10-01

    With the availability of genomic sequence from numerousvertebrates, a paradigm shift has occurred in the identification ofdistant-acting gene regulatory elements. In contrast to traditionalgene-centric studies in which investigators randomly scanned genomicfragments that flank genes of interest in functional assays, the modernapproach begins electronically with publicly available comparativesequence datasets that provide investigators with prioritized lists ofputative functional sequences based on their evolutionary conservation.However, although a large number of tools and resources are nowavailable, application of comparative genomic approaches remains far fromtrivial. In particular, it requires users to dynamically consider thespecies and methods for comparison depending on the specific biologicalquestion under investigation. While there is currently no single generalrule to this end, it is clear that when applied appropriately,comparative genomic approaches exponentially increase our power ingenerating biological hypotheses for subsequent experimentaltesting.

  20. Staphylococcus aureus: superbug, super genome?

    PubMed

    Lindsay, Jodi A; Holden, Matthew T G

    2004-08-01

    Staphylococcus aureus is a common cause of infection in both hospitals and the community, and it is becoming increasingly virulent and resistant to antibiotics. The recent sequencing of seven strains of S. aureus provides unprecedented information about its genome diversity. Subtle differences in core (stable) regions of the genome have been exploited by multi-locus sequence typing (MLST) to understand S. aureus population structure. Dramatic differences in the carriage and spread of accessory genes, including those involved in virulence and resistance, contribute to the emergence of new strains with healthcare implications. Understanding the differences between S. aureus genomes and the controls that govern these changes is helping to improve our knowledge of S. aureus pathogenicity and to predict the evolution of super-superbugs.

  1. How good is our genome?

    PubMed

    Weill, Jean-Claude; Radman, Miroslav

    2004-01-29

    Our genome has evolved to perpetuate itself through the maintenance of the species via an uninterrupted chain of reproductive somas. Accordingly, evolution is not concerned with diseases occurring after the soma's reproductive stage. Following Richard Dawkins, we would like to reassert that we indeed live as disposable somas, slaves of our germline genome, but could soon start rebelling against such slavery. Cancer and its relation to the TP53 gene may offer a paradigmatic example. The observation that the latency period in cancer can be prolonged in mice by increasing the number of TP53 genes in their genome, suggests that sooner or later we will have to address the question of heritable disease avoidance via the manipulation of the human germline.

  2. Bioprospecting in the genomic age.

    PubMed

    Hicks, Michael A; Prather, Kristala L J

    2014-01-01

    The genomic revolution promises great advances in the search for useful biocatalysts. Function-based metagenomic approaches have identified several enzymes with properties that make them useful candidates for a variety of bioprocesses. As DNA sequencing costs continue to decline, the volume of genomic data, along with their corresponding predicted protein sequences, will continue to increase dramatically, necessitating new approaches to leverage this information for gene-based bioprospecting efforts. Additionally, as new functions are discovered and correlated with this sequence information, the knowledge of the often complex relationship between a protein's sequence and function will improve. This in turn will lead to better gene-based bioprospecting approaches and facilitate the tailoring of desired properties through protein engineering projects. In this chapter, we discuss a number of recent advances in bioprospecting within the context of the genomic age.

  3. The genome of Theobroma cacao.

    PubMed

    Argout, Xavier; Salse, Jerome; Aury, Jean-Marc; Guiltinan, Mark J; Droc, Gaetan; Gouzy, Jerome; Allegre, Mathilde; Chaparro, Cristian; Legavre, Thierry; Maximova, Siela N; Abrouk, Michael; Murat, Florent; Fouet, Olivier; Poulain, Julie; Ruiz, Manuel; Roguet, Yolande; Rodier-Goud, Maguy; Barbosa-Neto, Jose Fernandes; Sabot, Francois; Kudrna, Dave; Ammiraju, Jetty Siva S; Schuster, Stephan C; Carlson, John E; Sallet, Erika; Schiex, Thomas; Dievart, Anne; Kramer, Melissa; Gelley, Laura; Shi, Zi; Bérard, Aurélie; Viot, Christopher; Boccara, Michel; Risterucci, Ange Marie; Guignon, Valentin; Sabau, Xavier; Axtell, Michael J; Ma, Zhaorong; Zhang, Yufan; Brown, Spencer; Bourge, Mickael; Golser, Wolfgang; Song, Xiang; Clement, Didier; Rivallan, Ronan; Tahi, Mathias; Akaza, Joseph Moroh; Pitollat, Bertrand; Gramacho, Karina; D'Hont, Angélique; Brunel, Dominique; Infante, Diogenes; Kebe, Ismael; Costet, Pierre; Wing, Rod; McCombie, W Richard; Guiderdoni, Emmanuel; Quetier, Francis; Panaud, Olivier; Wincker, Patrick; Bocs, Stephanie; Lanaud, Claire

    2011-02-01

    We sequenced and assembled the draft genome of Theobroma cacao, an economically important tropical-fruit tree crop that is the source of chocolate. This assembly corresponds to 76% of the estimated genome size and contains almost all previously described genes, with 82% of these genes anchored on the 10 T. cacao chromosomes. Analysis of this sequence information highlighted specific expansion of some gene families during evolution, for example, flavonoid-related genes. It also provides a major source of candidate genes for T. cacao improvement. Based on the inferred paleohistory of the T. cacao genome, we propose an evolutionary scenario whereby the ten T. cacao chromosomes were shaped from an ancestor through eleven chromosome fusions.

  4. Genomics of Escherichia and Shigella

    NASA Astrophysics Data System (ADS)

    Perna, Nicole T.

    The laboratory workhorse Escherichia coli K-12 is among the most intensively studied living organisms on earth, and this single strain serves as the model system behind much of our understanding of prokaryotic molecular biology. Dense genome sequencing and recent insightful comparative analyses are making the species E. coli, as a whole, an emerging system for studying prokaryotic population genetics and the relationship between system-scale, or genome-scale, molecular evolution and complex traits like host range and pathogenic potential. Genomic perspective has revealed a coherent but dynamic species united by intraspecific gene flow via homologous lateral or horizontal transfer and differentiated by content flux mediated by acquisition of DNA segments from interspecies transfers.

  5. Genome inside genome: NGS based identification and assembly of endophytic Sphingopyxis granuli and Pseudomonas aeruginosa genomes from rice genomic reads.

    PubMed

    Battu, Latha; Reddy, Mettu Madhavi; Goud, Burragoni Sravanthi; Ulaganathan, Kayalvili; Kandasamy, Ulaganathan

    2017-02-10

    The interactions between crop plants and the endophytic bacteria colonizing them are poorly understood and experimental methods were found to be inadequate to meet the complexities associated with the interaction. Moreover, research on endophytic bacteria was focused at host plant species level and not at cultivar level which is essential for understanding the role played by them on the productivity of specific crop genotype. High throughput genomics offers valuable tools for identification, characterization of endophytic bacteria and understand their interaction with host plants. In this paper we report the use of high throughput plant genomic data for identification of endophytic bacteria colonizing rice plants. Using this novel next generation sequencing based computational method Sphingopyxis granuli and Pseudomonas aeruginosa were identified as endophytes colonizing the elite indica rice cultivar RP Bio-226 and their draft genome sequences were assembled.

  6. Genome: twisting stories with DNA.

    PubMed

    Noguera-Solano, Ricardo; Ruiz-Gutierrez, Rosaura; Rodriguez-Caso, Juan Manuel

    2013-12-01

    In 1920, the German botanist Hans Winkler coined the concept of the 'genome'. This paper explores the history of a concept that has developed in parallel with advances in biology and supports novel and powerful heuristic biological research in the 21st century. From a structural interpretation (the genome as the haploid number of chromosomes), it has changed to keep pace with technological progress and new interpretations of the material of heredity. In the first place, the 'genome' was extended to include all the material in the nucleus, then the sum of all genes, and (with the discovery of the structure of DNA) the sum of the nucleotide base sequences. In the early 21st century, it has become a much more complex and central concept that has spawned the growing field of studies referred to as the 'omics'.

  7. [Comparison of mitochondrial genomes of bivalves].

    PubMed

    SONG, Wen-Tao; GAO, Xiang-Gang; LI, Yun-Feng; LIU, Wei-Dong; LIU, Ying; HE, Chong-Bo

    2009-11-01

    The structure and organization of mitochondrial genomes of 14 marine bivalves and two freshwater bivalves were analyzed using comparative genomics and bioinformatics methods. The results showed that the organization and gene order of the mitochondrial genomes of these bivalve species studied were different from each other. The size, organization, gene numbers, and gene order of mitochondrial genomes in bivalves at different taxa were different. Phylogenetic analysis using the whole mitochondrial genomes and all the coding genes showed different results-- phylogenetic analysis conducted using the whole mitochondrial genomes was consistent with the existing classification and phylogenetic analysis conducted using all coding genes not consistent with the existing classification.

  8. Genomics and the origin of species.

    PubMed

    Seehausen, Ole; Butlin, Roger K; Keller, Irene; Wagner, Catherine E; Boughman, Janette W; Hohenlohe, Paul A; Peichel, Catherine L; Saetre, Glenn-Peter; Bank, Claudia; Brännström, Ake; Brelsford, Alan; Clarkson, Chris S; Eroukhmanoff, Fabrice; Feder, Jeffrey L; Fischer, Martin C; Foote, Andrew D; Franchini, Paolo; Jiggins, Chris D; Jones, Felicity C; Lindholm, Anna K; Lucek, Kay; Maan, Martine E; Marques, David A; Martin, Simon H; Matthews, Blake; Meier, Joana I; Möst, Markus; Nachman, Michael W; Nonaka, Etsuko; Rennison, Diana J; Schwarzer, Julia; Watson, Eric T; Westram, Anja M; Widmer, Alex

    2014-03-01

    Speciation is a fundamental evolutionary process, the knowledge of which is crucial for understanding the origins of biodiversity. Genomic approaches are an increasingly important aspect of this research field. We review current understanding of genome-wide effects of accumulating reproductive isolation and of genomic properties that influence the process of speciation. Building on this work, we identify emergent trends and gaps in our understanding, propose new approaches to more fully integrate genomics into speciation research, translate speciation theory into hypotheses that are testable using genomic tools and provide an integrative definition of the field of speciation genomics.

  9. Functional genomics of pathogenic bacteria.

    PubMed Central

    Moxon, E R; Hood, D W; Saunders, N J; Schweda, E K H; Richards, J C

    2002-01-01

    Microbial diseases remain the commonest cause of global mortality and morbidity. Automated-DNA sequencing has revolutionized the investigation of pathogenic microbes by making the immense fund of information contained in their genomes available at reasonable cost. The challenge is how this information can be used to increase current understanding of the biology of commensal and virulence behaviour of pathogens with particular emphasis on in vivo function and novel approaches to prevention. One example of the application of whole-genome-sequence information is afforded by investigations of the pathogenic role of Haemophilus influenzae lipopolysaccharide and its candidacy as a vaccine. PMID:11839188

  10. The energetics of genome complexity.

    PubMed

    Lane, Nick; Martin, William

    2010-10-21

    All complex life is composed of eukaryotic (nucleated) cells. The eukaryotic cell arose from prokaryotes just once in four billion years, and otherwise prokaryotes show no tendency to evolve greater complexity. Why not? Prokaryotic genome size is constrained by bioenergetics. The endosymbiosis that gave rise to mitochondria restructured the distribution of DNA in relation to bioenergetic membranes, permitting a remarkable 200,000-fold expansion in the number of genes expressed. This vast leap in genomic capacity was strictly dependent on mitochondrial power, and prerequisite to eukaryote complexity: the key innovation en route to multicellular life.

  11. [The genome and the consumer].

    PubMed

    Christiansen, Gunna

    2014-11-10

    Consumergenetics has developed so fast that it became possible for consumers to obtain genome risk information based on single nucleotide polymorphisms data of over 250 diseases/conditions for just 99 USD. In November 2013, the American Food and Drug Administration (FDA) ordered the company 23andMe to stop returning health results because they found a lack of scientific evidence of the reposted disease risks. The ethical dilemmas associated with this are reviewed, and the recommendations are described in genome testing. Ethical dilemmas in relation direct-to-consumer testing are discussed.

  12. Biocommunication and natural genome editing

    PubMed Central

    Witzany, Guenther

    2010-01-01

    The biocommunicative approach investigates communication processes within and among cells, tissues, organs and organisms as sign-mediated interactions, and nucleotide sequences as code, i.e. language-like text, which follows in parallel three kinds of rules: combinatorial (syntactic), context-sensitive (pragmatic), and content-specific (semantic). Natural genome editing from a biocommunicative perspective is competent agent-driven generation and integration of meaningful nucleotide sequences into pre-existing genomic content arrangements and the ability to (re-)combine and (re-)regulate them according to context-dependent (i.e. adaptational) purposes of the host organism. PMID:21537469

  13. Translating genomics in cancer care.

    PubMed

    Bombard, Yvonne; Bach, Peter B; Offit, Kenneth

    2013-11-01

    There is increasing enthusiasm for genomics and its promise in advancing personalized medicine. Genomic information has been used to personalize health care for decades, spanning the fields of cardiovascular disease, infectious disease, endocrinology, metabolic medicine, and hematology. However, oncology has often been the first test bed for the clinical translation of genomics for diagnostic, prognostic, and therapeutic applications. Notable hereditary cancer examples include testing for mutations in BRCA1 or BRCA2 in unaffected women to identify those at significantly elevated risk for developing breast and ovarian cancers, and screening patients with newly diagnosed colorectal cancer for mutations in 4 mismatch repair genes to reduce morbidity and mortality in their relatives. Somatic genomic testing is also increasingly used in oncology, with gene expression profiling of breast tumors and EGFR testing to predict treatment response representing commonly used examples. Health technology assessment provides a rigorous means to inform clinical and policy decision-making through systematic assessment of the evidentiary base, along with precepts of clinical effectiveness, cost-effectiveness, and consideration of risks and benefits for health care delivery and society. Although this evaluation is a fundamental step in the translation of any new therapeutic, procedure, or diagnostic test into clinical care, emerging developments may threaten this standard. These include "direct to consumer" genomic risk assessment services and the challenges posed by incidental results generated from next-generation sequencing (NGS) technologies. This article presents a review of the evidentiary standards and knowledge base supporting the translation of key cancer genomic technologies along the continuum of validity, utility, cost-effectiveness, health service impacts, and ethical and societal issues, and offers future research considerations to guide the responsible introduction of

  14. Deafness in the genomics era.

    PubMed

    Shearer, A Eliot; Hildebrand, Michael S; Sloan, Christina M; Smith, Richard J H

    2011-12-01

    Our understanding of hereditary hearing loss has greatly improved since the discovery of the first human deafness gene. These discoveries have only accelerated due to the great strides in DNA sequencing technology since the completion of the human genome project. Here, we review the immense impact that these developments have had in both deafness research and clinical arenas. We review commonly used genomic technologies as well as the application of these technologies to the genetic diagnosis of hereditary hearing loss and to the discovery of novel deafness genes.

  15. Genome editing comes of age.

    PubMed

    Kim, Jin-Soo

    2016-09-01

    Genome editing harnesses programmable nucleases to cut and paste genetic information in a targeted manner in living cells and organisms. Here, I review the development of programmable nucleases, including zinc finger nucleases (ZFNs), TAL (transcription-activator-like) effector nucleases (TALENs) and CRISPR (cluster of regularly interspaced palindromic repeats)-Cas9 (CRISPR-associated protein 9) RNA-guided endonucleases (RGENs). I specifically highlight the key advances that set the foundation for the rapid and widespread implementation of CRISPR-Cas9 genome editing approaches that has revolutionized the field.

  16. Pfizer targets genomics through Pfizergen

    SciTech Connect

    Glaser, V.

    1995-06-01

    Recently, Pfizer (New York) formed Pfizergen to develop and commercialize genomics. For starters, Pfizergen involves investments by Pfizer of more than $115 million - excluding milestone payments and royalties on future products - in four biotech firms. Seeking a strong foothold in genomics, Pfizer is piecing together a multifaceted network of technologies. Through its alliance with Incyte, Pfizer has already accessed gene databases, high-throughput gene sequencing, and transcription analysis. Through Pfizergen, it will access expertise in microbial genetic engineering and combinatorial chemistry, as well as antiviral, antisense, and gene therapy capabilities. Future investments could target firms specializing in such products as positional cloning and bioinformatics.

  17. Delivery technologies for genome editing.

    PubMed

    Yin, Hao; Kauffman, Kevin J; Anderson, Daniel G

    2017-03-24

    With the recent development of CRISPR technology, it is becoming increasingly easy to engineer the genome. Genome-editing systems based on CRISPR, as well as transcription activator-like effector nucleases (TALENs) and zinc-finger nucleases (ZFNs), are becoming valuable tools for biomedical research, drug discovery and development, and even gene therapy. However, for each of these systems to effectively enter cells of interest and perform their function, efficient and safe delivery technologies are needed. This Review discusses the principles of biomacromolecule delivery and gene editing, examines recent advances and challenges in non-viral and viral delivery methods, and highlights the status of related clinical trials.

  18. Cancer Genome Anatomy Project (CGAP) | Office of Cancer Genomics

    Cancer.gov

    CGAP generated a wide range of genomics data on cancerous cells that are accessible through easy-to-use online tools. Researchers, educators, and students can find "in silico" answers to biological questions through the CGAP website. Request a free copy of the CGAP Website Virtual Tour CD from ocg@mail.nih.gov to learn how to navigate the website.

  19. A genome wide dosage suppressor network reveals genomic robustness

    PubMed Central

    Patra, Biranchi; Kon, Yoshiko; Yadav, Gitanjali; Sevold, Anthony W.; Frumkin, Jesse P.; Vallabhajosyula, Ravishankar R.; Hintze, Arend; Østman, Bjørn; Schossau, Jory; Bhan, Ashish; Marzolf, Bruz; Tamashiro, Jenna K.; Kaur, Amardeep; Baliga, Nitin S.; Grayhack, Elizabeth J.; Adami, Christoph; Galas, David J.; Raval, Alpan; Phizicky, Eric M.; Ray, Animesh

    2017-01-01

    Genomic robustness is the extent to which an organism has evolved to withstand the effects of deleterious mutations. We explored the extent of genomic robustness in budding yeast by genome wide dosage suppressor analysis of 53 conditional lethal mutations in cell division cycle and RNA synthesis related genes, revealing 660 suppressor interactions of which 642 are novel. This collection has several distinctive features, including high co-occurrence of mutant-suppressor pairs within protein modules, highly correlated functions between the pairs and higher diversity of functions among the co-suppressors than previously observed. Dosage suppression of essential genes encoding RNA polymerase subunits and chromosome cohesion complex suggests a surprising degree of functional plasticity of macromolecular complexes, and the existence of numerous degenerate pathways for circumventing the effects of potentially lethal mutations. These results imply that organisms and cancer are likely able to exploit the genomic robustness properties, due the persistence of cryptic gene and pathway functions, to generate variation and adapt to selective pressures. PMID:27899637

  20. Cancer Genome Anatomy Project | Office of Cancer Genomics

    Cancer.gov

    The National Cancer Institute (NCI) Cancer Genome Anatomy Project (CGAP) is an online resource designed to provide the research community access to biological tissue characterization data. Request a free copy of the CGAP Website Virtual Tour CD from ocg@mail.nih.gov.