Science.gov

Sample records for genome program contractor-grantee

  1. DOE Human Genome Program contractor-grantee workshop

    SciTech Connect

    1996-01-01

    This volume contains the proceedings for the DOE Human Genome Program`s Contractor-Grantee Workshop V held in Sante Fe, New Mexico January 28, February 1, 1996. Presentations were divided into sessions entitled Sequencing; Mapping; Informatics; Ethical, Legal, and Social Issues; and Infrastructure. Reports of individual projects described herein are separately indexed and abstracted for the database.

  2. DOE Human Genome Program: Contractor-Grantee Workshop IV, November 13--17, 1994, Santa Fe, New Mexico

    SciTech Connect

    Not Available

    1994-10-01

    This volume contains the proceedings of the fourth Contractor-Grantee Workshop for the Department of Energy (DOE) Human Genome Program. Of the 204 abstracts in this book, some 200 describe the genome research of DOE-funded grantees and contractors located at the multidisciplinary centers at Lawrence Berkeley Laboratory, Lawrence Livermore National Laboratory, and Los Alamos National Laboratory; other DOE-supported laboratories; and more than 54 universities, research organizations, and companies in the United States and abroad. Included are 16 abstracts from ongoing projects in the Ethical, Legal, and Social Issues (ELSI) component, an area that continues to attract considerable attention from a wide variety of interested parties. Three abstracts summarize work in the new Microbial Genome Initiative launched this year by the Office of Health and Environmental Research (OHER) to provide genome sequence and mapping data on industrially important microorganisms and those that live under extreme conditions. Many of the projects will be discussed at plenary sessions held throughout the workshop, and all are represented in the poster sessions.

  3. Genomics:GTL Contractor-Grantee Workshop IV and Metabolic Engineering Working Group Inter-Agency Conference on Metabolic Engineering 2006

    SciTech Connect

    Mansfield, Betty Kay; Martin, Sheryl A

    2006-02-01

    Welcome to the 2006 joint meeting of the fourth Genomics:GTL Contractor-Grantee Workshop and the six Metabolic Engineering Working Group Inter-Agency Conference. The vision and scope of the Genomics:GTL program continue to expand and encompass research and technology issues from diverse scientific disciplines, attracting broad interest and support from researchers at universities, DOE national laboratories, and industry. Metabolic engineering's vision is the targeted and purposeful alteration of metabolic pathways to improve the understanding and use of cellular pathways for chemical transformation, energy transduction, and supramolecular assembly. These two programs have much complementarity in both vision and technological approaches, as reflected in this joint workshop. GLT's challenge to the scientific community remains the further development and use of a broad array of innovative technologies and computational tools to systematically leverage the knowledge and capabilities brought to us by DNA sequencing projects. The goal is to seek a broad and predictive understanding of the functioning and control of complex systems--individual microbes, microbial communities, and plants. GTL's prominent position at the interface of the physical, computational, and biological sciences is both a strength and challenge. Microbes remain GTL's principal biological focus. In the complex 'simplicity' of microbes, they find capabilities needed by DOE and the nation for clean and secure energy, cleanup of environmental contamination, and sequestration of atmospheric carbon dioxide that contributes to global warming. An ongoing challenge for the entire GTL community is to demonstrate that the fundamental science conducted in each of your research projects brings us a step closer to biology-based solutions for these important national energy and environmental needs.

  4. JGI Fungal Genomics Program

    SciTech Connect

    Grigoriev, Igor V.

    2011-03-14

    Genomes of energy and environment fungi are in focus of the Fungal Genomic Program at the US Department of Energy Joint Genome Institute (JGI). Its key project, the Genomics Encyclopedia of Fungi, targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts), and explores fungal diversity by means of genome sequencing and analysis. Over 50 fungal genomes have been sequenced by JGI to date and released through MycoCosm (www.jgi.doe.gov/fungi), a fungal web-portal, which integrates sequence and functional data with genome analysis tools for user community. Sequence analysis supported by functional genomics leads to developing parts list for complex systems ranging from ecosystems of biofuel crops to biorefineries. Recent examples of such 'parts' suggested by comparative genomics and functional analysis in these areas are presented here

  5. Epidemiology & Genomics Research Program

    Cancer.gov

    The Epidemiology and Genomics Research Program, in the National Cancer Institute's Division of Cancer Control and Population Sciences, funds research in human populations to understand the determinants of cancer occurrence and outcomes.

  6. Fungal Genomics Program

    SciTech Connect

    Grigoriev, Igor

    2012-03-12

    The JGI Fungal Genomics Program aims to scale up sequencing and analysis of fungal genomes to explore the diversity of fungi important for energy and the environment, and to promote functional studies on a system level. Combining new sequencing technologies and comparative genomics tools, JGI is now leading the world in fungal genome sequencing and analysis. Over 120 sequenced fungal genomes with analytical tools are available via MycoCosm (www.jgi.doe.gov/fungi), a web-portal for fungal biologists. Our model of interacting with user communities, unique among other sequencing centers, helps organize these communities, improves genome annotation and analysis work, and facilitates new larger-scale genomic projects. This resulted in 20 high-profile papers published in 2011 alone and contributing to the Genomics Encyclopedia of Fungi, which targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts). Our next grand challenges include larger scale exploration of fungal diversity (1000 fungal genomes), developing molecular tools for DOE-relevant model organisms, and analysis of complex systems and metagenomes.

  7. Human Genome Program

    SciTech Connect

    Not Available

    1993-01-01

    The DOE Human Genome program has grown tremendously, as shown by the marked increase in the number of genome-funded projects since the last workshop held in 1991. The abstracts in this book describe the genome research of DOE-funded grantees and contractors and invited guests, and all projects are represented at the workshop by posters. The 3-day meeting includes plenary sessions on ethical, legal, and social issues pertaining to the availability of genetic data; sequencing techniques, informatics support; and chromosome and cDNA mapping and sequencing.

  8. The Human Genome Program

    SciTech Connect

    Bell, G.I.

    1989-01-01

    Early in 1986, Charles DeLisi, then head of the Office of Health and Environmental Research at the Department of Energy (DOE) requested the Los Alamos National Laboratory (LANL) to organize a workshop charged with inquiring whether the state of technology and potential payoffs in biological knowledge and medical practice were such as to justify an organized program to map and sequence the human genome. The DOE's interest arose from its mission to assess the effects of radiation and other products of energy generation on human health in general and genetic material in particular. The workshop concluded that the technology was ripe, the benefits would be great, and a national program should be promptly initiated. Later committees, reporting to DOE, to the NIH, to the Office of Technology Assessment of the US Congress, and to the National Academy of Science have reviewed these issues more deliberately and come to the same conclusion. As a consequence, there has been established in the United States, a Human Genome Program, with funding largely from the NIH and the DOE, as indicated in Table 1. Moreover, the Program has attracted international interest, and Great Britain, France, Italy, and the Soviet Union, among other countries, have been reported to be starting human genome initiatives. Coordination of these programs, clearly in the interests of each, remains to be worked out, although an international Human Genome Organization (HUGO) is considering such coordination. 5 refs., 1 fig., 2 tabs.

  9. Programs | Office of Cancer Genomics

    Cancer.gov

    OCG facilitates cancer genomics research through a series of highly-focused programs. These programs generate and disseminate genomic data for use by the cancer research community. OCG programs also promote advances in technology-based infrastructure and create valuable experimental reagents and tools. OCG programs encourage collaboration by interconnecting with other genomics and cancer projects in order to accelerate translation of findings into the clinic. Below are OCG’s current, completed, and initiated programs:

  10. Human genome. 1993 Program report

    SciTech Connect

    Not Available

    1994-03-01

    The purpose of this report is to update the Human Genome 1991-92 Program Report and provide new information on the DOE genome program to researchers, program managers, other government agencies, and the interested public. This FY 1993 supplement includes abstracts of 60 new or renewed projects and listings of 112 continuing and 28 completed projects. These two reports, taken together, present the most complete published view of the DOE Human Genome Program through FY 1993. Research is progressing rapidly toward 15-year goals of mapping and sequencing the DNA of each of the 24 different human chromosomes.

  11. Human Genome Education Program

    SciTech Connect

    Richard Myers; Lane Conn

    2000-05-01

    The funds from the DOE Human Genome Program, for the project period 2/1/96 through 1/31/98, have provided major support for the curriculum development and field testing efforts for two high school level instructional units: Unit 1, ''Exploring Genetic Conditions: Genes, Culture and Choices''; and Unit 2, ''DNA Snapshots: Peaking at Your DNA''. In the original proposal, they requested DOE support for the partial salary and benefits of a Field Test Coordinator position to: (1) complete the field testing and revision of two high school curriculum units, and (2) initiate the education of teachers using these units. During the project period of this two-year DOE grant, a part-time Field-Test Coordinator was hired (Ms. Geraldine Horsma) and significant progress has been made in both of the original proposal objectives. Field testing for Unit 1 has occurred in over 12 schools (local and non-local sites with diverse student populations). Field testing for Unit 2 has occurred in over 15 schools (local and non-local sites) and will continue in 12-15 schools during the 96-97 school year. For both curricula, field-test sites and site teachers were selected for their interest in genetics education and in hands-on science education. Many of the site teachers had no previous experience with HGEP or the unit under development. Both of these first-year biology curriculum units, which contain genetics, biotechnology, societal, ethical and cultural issues related to HGP, are being implemented in many local and non-local schools (SF Bay Area, Southern California, Nebraska, Hawaii, and Texas) and in programs for teachers. These units will reach over 10,000 students in the SF Bay Area and continues to receive support from local corporate and private philanthropic organizations. Although HGEP unit development is nearing completion for both units, data is still being gathered and analyzed on unit effectiveness and student learning. The final field testing result from this analysis will

  12. Mating programs including genomic relationships

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Computer mating programs have helped breeders minimize pedigree inbreeding and avoid recessive defects by mating animals with parents that have fewer common ancestors. With genomic selection, breed associations, AI organizations, and on-farm software providers could use new programs to minimize geno...

  13. Human Genome Program Image Gallery (from genomics.energy.gov)

    DOE Data Explorer

    This collection contains approximately 240 images from the genome programs of DOE's Office of Science. The images are divided into galleries related to biofuels research, systems biology, and basic genomics. Each image has a title, a basic citation, and a credit or source. Most of the images are original graphics created by the Genome Management Information System (GMIS). GMIS images are recognizable by their credit line. Permission to use these graphics is not needed, but please credit the U.S. Department of Energy Genome Programs and provide the website http://genomics.energy.gov. Other images were provided by third parties and not created by the U.S. Department of Energy. Users must contact the person listed in the credit line before using those images. The high-resolution images can be downloaded.

  14. Economic evaluation of genomic breeding programs.

    PubMed

    König, S; Simianer, H; Willam, A

    2009-01-01

    The objective of this study was to compare a conventional dairy cattle breeding program characterized by a progeny testing scheme with different scenarios of genomic breeding programs. The ultimate economic evaluation criterion was discounted profit reflecting discounted returns minus discounted costs per cow in a balanced breeding goal of production and functionality. A deterministic approach mainly based on the gene flow method and selection index calculations was used to model a conventional progeny testing program and different scenarios of genomic breeding programs. As a novel idea, the modeling of the genomic breeding program accounted for the proportion of farmers waiting for daughter records of genotyped young bulls before using them for artificial insemination. Technical and biological coefficients for modeling were chosen to correspond to a German breeding organization. The conventional breeding program for 50 test bulls per year within a population of 100,000 cows served as a base scenario. Scenarios of genomic breeding programs considered the variation of costs for genotyping, selection intensity of cow sires, proportion of farmers waiting for daughter records of genotyped young bulls, and different accuracies of genomic indices for bulls and cows. Given that the accuracies of genomic indices are greater than 0.70, a distinct economic advantage was found for all scenarios of genomic breeding programs up to factor 2.59, mainly due to the reduction in generation intervals. Costs for genotyping were negligible when focusing on a population-wide perspective and considering additional costs for herdbook registration, milk recording, or keeping of bulls, especially if there is no need for yearly recalculation of effects of single nucleotide polymorphisms. Genomic breeding programs generated a higher discounted profit than a conventional progeny testing program for all scenarios where at least 20% of the inseminations were done by genotyped young bulls without

  15. Genomic selection in animal breeding programs.

    PubMed

    van der Werf, Julius

    2013-01-01

    Genomic selection can have a major impact on animal breeding programs, especially where traits that are important in the breeding objective are hard to select for otherwise. Genomic selection provides more accurate estimates for breeding value earlier in the life of breeding animals, giving more selection accuracy and allowing lower generation intervals. From sheep to dairy cattle, the rates of genetic improvement could increase from 20 to 100 % and hard-to-measure traits can be improved more effectively.Reference populations for genomic selection need to be large, with thousands of animals measured for phenotype and genotype. The smaller the effective size of the breeding population, the larger the DNA segments they potentially share and the more accurate genomic prediction will be. The relative contribution of information from relatives in the reference population will be larger if the baseline accuracy is low, but such information is limited to closely related individuals and does not last over generations.

  16. Mating programs including genomic relationships and dominance effects

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Breed associations, artificial-insemination organizations, and on-farm software providers need new computerized mating programs for genomic selection so that genomic inbreeding could be minimized by comparing genotypes of potential mates. Efficient methods for transferring elements of the genomic re...

  17. [Human genome project: a federator program of genomic medicine].

    PubMed

    Sfar, S; Chouchane, L

    2008-05-01

    The Human Genome Project improves our understanding of the molecular genetics basis of the inherited and complex diseases such as diabetes, schizophrenia, and cancer. Information from the human genome sequence is essential for several antenatal and neonatal screening programmes. The new genomic tools emerging from this project have revolutionized biology and medicine and have transformed our understanding of health and the provision of healthcare. Its implications pervade all areas of medicine, from disease prediction and prevention to the diagnosis and treatment of all forms of disease. Increasingly, it will be possible to drive predisposition testing into clinical practice, to develop new treatments or to adapt available treatments more specifically to an individual's genetic make-up. This genomic information should transform the traditional medications that are effective for every members of the population to personalized medicine and personalized therapy. The pharmacogenomics could give rise to a new generation of highly effective drugs that treat causes, not just symptoms.

  18. Genomes on the Edge: Programmed Genome Instability in Ciliates

    PubMed Central

    Bracht, John R.; Fang, Wenwen; Goldman, Aaron David; Dolzhenko, Egor; Stein, Elizabeth M.; Landweber, Laura F.

    2013-01-01

    Ciliates are an ancient and diverse group of microbial eukaryotes that have emerged as powerful models for RNA-mediated epigenetic inheritance. They possess extensive sets of both tiny and long noncoding RNAs that, together with a suite of proteins that includes transposases, orchestrate a broad cascade of genome rearrangements during somatic nuclear development. This Review emphasizes three important themes: the remarkable role of RNA in shaping genome structure, recent discoveries that unify many deeply diverged ciliate genetic systems, and a surprising evolutionary “sign change” in the role of small RNAs between major species groups. PMID:23374338

  19. Survey of university programs in remote sensing funded under grants from the NASA University-Space Applications program

    NASA Technical Reports Server (NTRS)

    Madigan, J. A.; Earhart, R. W.

    1978-01-01

    NASA's Office of Space and Terrestrial Applications (OSTA) is currently assessing approaches to transferring NASA technology to both the public and private sectors. As part of this assessment, NASA is evaluating the effectiveness of an ongoing program in remote sensing technology transfer conducted by 20 university contractors/grantees, each supported totally or partially by NASA funds. The University-Space Applications program has as its objective the demonstration of practical benefits from the use of remote sensing technology to a broad spectrum of new users, principally in state and local governments. To evaluate the University-Space Applications program, NASA has a near-term requirement for data on each university effort including total funding, funding sources, length of program, program description, and effectiveness measures.

  20. Mating programs including genomic relationships and dominance effects

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Computer mating programs have helped breeders minimize pedigree inbreeding and avoid recessive defects by mating animals with parents that have fewer common ancestors. With genomic selection, breed associations, AI organizations, and on-farm software providers could use new programs to minimize geno...

  1. Human Genome Program Report. Part 1, Overview and Progress

    DOE R&D Accomplishments Database

    1997-11-01

    This report contains Part 1 of a two-part report to reflect research and progress in the U.S. Department of Energy Human Genome Program from 1994 through 1996, with specified updates made just before publication. Part 1 consists of the program overview and report on progress.

  2. Human genome program report. Part 1, overview and progress

    SciTech Connect

    1997-11-01

    This report contains Part 1 of a two-part report to reflect research and progress in the U.S. Department of Energy Human Genome Program from 1994 through 1996, with specified updates made just before publication. Part 1 consists of the program overview and report on progress.

  3. Primer on molecular genetics. DOE Human Genome Program

    SciTech Connect

    Not Available

    1992-04-01

    This report is taken from the April 1992 draft of the DOE Human Genome 1991--1992 Program Report, which is expected to be published in May 1992. The primer is intended to be an introduction to basic principles of molecular genetics pertaining to the genome project. The material contained herein is not final and may be incomplete. Techniques of genetic mapping and DNA sequencing are described.

  4. Primer on Molecular Genetics; DOE Human Genome Program

    DOE R&D Accomplishments Database

    1992-04-01

    This report is taken from the April 1992 draft of the DOE Human Genome 1991--1992 Program Report, which is expected to be published in May 1992. The primer is intended to be an introduction to basic principles of molecular genetics pertaining to the genome project. The material contained herein is not final and may be incomplete. Techniques of genetic mapping and DNA sequencing are described.

  5. Human Genome Program Report. Part 2, 1996 Research Abstracts

    DOE R&D Accomplishments Database

    1997-11-01

    This report contains Part 2 of a two-part report to reflect research and progress in the US Department of Energy Human Genome Program from 1994 through 1996, with specified updates made just before publication. Part 2 consists of 1996 research abstracts. Attention is focused on the following: sequencing; mapping; informatics; ethical, legal, and social issues; infrastructure; and small business innovation research.

  6. Human genome program report. Part 2, 1996 research abstracts

    SciTech Connect

    1997-11-01

    This report contains Part 2 of a two-part report to reflect research and progress in the US Department of Energy Human Genome Program from 1994 through 1996, with specified updates made just before publication. Part 2 consists of 1996 research abstracts. Attention is focused on the following: sequencing; mapping; informatics; ethical, legal, and social issues; infrastructure; and small business innovation research.

  7. Genomic Tools in Groundnut Breeding Program: Status and Perspectives

    PubMed Central

    Janila, P.; Variath, Murali T.; Pandey, Manish K.; Desmae, Haile; Motagi, Babu N.; Okori, Patrick; Manohar, Surendra S.; Rathnakumar, A. L.; Radhakrishnan, T.; Liao, Boshou; Varshney, Rajeev K.

    2016-01-01

    Groundnut, a nutrient-rich food legume, is cultivated world over. It is valued for its good quality cooking oil, energy and protein rich food, and nutrient-rich fodder. Globally, groundnut improvement programs have developed varieties to meet the preferences of farmers, traders, processors, and consumers. Enhanced yield, tolerance to biotic and abiotic stresses and quality parameters have been the target traits. Spurt in genetic information of groundnut was facilitated by development of molecular markers, genetic, and physical maps, generation of expressed sequence tags (EST), discovery of genes, and identification of quantitative trait loci (QTL) for some important biotic and abiotic stresses and quality traits. The first groundnut variety developed using marker assisted breeding (MAB) was registered in 2003. Since then, USA, China, Japan, and India have begun to use genomic tools in routine groundnut improvement programs. Introgression lines that combine foliar fungal disease resistance and early maturity were developed using MAB. Establishment of marker-trait associations (MTA) paved way to integrate genomic tools in groundnut breeding for accelerated genetic gain. Genomic Selection (GS) tools are employed to improve drought tolerance and pod yield, governed by several minor effect QTLs. Draft genome sequence and low cost genotyping tools such as genotyping by sequencing (GBS) are expected to accelerate use of genomic tools to enhance genetic gains for target traits in groundnut. PMID:27014312

  8. Genomic Tools in Groundnut Breeding Program: Status and Perspectives.

    PubMed

    Janila, P; Variath, Murali T; Pandey, Manish K; Desmae, Haile; Motagi, Babu N; Okori, Patrick; Manohar, Surendra S; Rathnakumar, A L; Radhakrishnan, T; Liao, Boshou; Varshney, Rajeev K

    2016-01-01

    Groundnut, a nutrient-rich food legume, is cultivated world over. It is valued for its good quality cooking oil, energy and protein rich food, and nutrient-rich fodder. Globally, groundnut improvement programs have developed varieties to meet the preferences of farmers, traders, processors, and consumers. Enhanced yield, tolerance to biotic and abiotic stresses and quality parameters have been the target traits. Spurt in genetic information of groundnut was facilitated by development of molecular markers, genetic, and physical maps, generation of expressed sequence tags (EST), discovery of genes, and identification of quantitative trait loci (QTL) for some important biotic and abiotic stresses and quality traits. The first groundnut variety developed using marker assisted breeding (MAB) was registered in 2003. Since then, USA, China, Japan, and India have begun to use genomic tools in routine groundnut improvement programs. Introgression lines that combine foliar fungal disease resistance and early maturity were developed using MAB. Establishment of marker-trait associations (MTA) paved way to integrate genomic tools in groundnut breeding for accelerated genetic gain. Genomic Selection (GS) tools are employed to improve drought tolerance and pod yield, governed by several minor effect QTLs. Draft genome sequence and low cost genotyping tools such as genotyping by sequencing (GBS) are expected to accelerate use of genomic tools to enhance genetic gains for target traits in groundnut.

  9. 75 FR 26846 - Genomic Medicine Program Advisory Committee; Notice of Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-05-12

    ...] [FR Doc No: 2010-11322] DEPARTMENT OF VETERANS AFFAIRS Genomic Medicine Program Advisory Committee... Advisory Committee Act) that the Genomic Medicine Program Advisory Committee will meet on May 21, 2010, at... genomic medicine delivery within VHA and proof of concept for genome-phenome associations using...

  10. Genomic prediction in CIMMYT maize and wheat breeding programs

    PubMed Central

    Crossa, J; Pérez, P; Hickey, J; Burgueño, J; Ornella, L; Cerón-Rojas, J; Zhang, X; Dreisigacker, S; Babu, R; Li, Y; Bonnett, D; Mathews, K

    2014-01-01

    Genomic selection (GS) has been implemented in animal and plant species, and is regarded as a useful tool for accelerating genetic gains. Varying levels of genomic prediction accuracy have been obtained in plants, depending on the prediction problem assessed and on several other factors, such as trait heritability, the relationship between the individuals to be predicted and those used to train the models for prediction, number of markers, sample size and genotype × environment interaction (GE). The main objective of this article is to describe the results of genomic prediction in International Maize and Wheat Improvement Center's (CIMMYT's) maize and wheat breeding programs, from the initial assessment of the predictive ability of different models using pedigree and marker information to the present, when methods for implementing GS in practical global maize and wheat breeding programs are being studied and investigated. Results show that pedigree (population structure) accounts for a sizeable proportion of the prediction accuracy when a global population is the prediction problem to be assessed. However, when the prediction uses unrelated populations to train the prediction equations, prediction accuracy becomes negligible. When genomic prediction includes modeling GE, an increase in prediction accuracy can be achieved by borrowing information from correlated environments. Several questions on how to incorporate GS into CIMMYT's maize and wheat programs remain unanswered and subject to further investigation, for example, prediction within and between related bi-parental crosses. Further research on the quantification of breeding value components for GS in plant breeding populations is required. PMID:23572121

  11. Asking complex questions of the genome without programming.

    PubMed

    Woollard, Peter M

    2010-01-01

    Increasingly, vast amounts of genomics and genetic data are available. Although much of the data is largely accessible to relatively simple web queries, in some cases, more complex queries are required. This paper reviews the hierarchy of tools for querying genetic and genomic data. For querying multiple genes, variants or regions ENSEMBL BioMart and the UCSC Table Browser offer flexible interfaces. For more complex queries, GALAXY is a sophisticated tool for building workflows over existing internet resources. For the most challenging genome scale queries, programmatic access may be required through a defined application programming interface (API) - such as the one provided by Ensembl. All these tools allow one to rapidly ask many questions that were difficult to answer a few years ago, but choosing the appropriate tool for the job is critical.

  12. Genomic resources in mungbean for future breeding programs

    PubMed Central

    Kim, Sue K.; Nair, Ramakrishnan M.; Lee, Jayern; Lee, Suk-Ha

    2015-01-01

    Among the legume family, mungbean (Vigna radiata) has become one of the important crops in Asia, showing a steady increase in global production. It provides a good source of protein and contains most notably folate and iron. Beyond the nutritional value of mungbean, certain features make it a well-suited model organism among legume plants because of its small genome size, short life-cycle, self-pollinating, and close genetic relationship to other legumes. In the past, there have been several efforts to develop molecular markers and linkage maps associated with agronomic traits for the genetic improvement of mungbean and, ultimately, breeding for cultivar development to increase the average yields of mungbean. The recent release of a reference genome of the cultivated mungbean (V. radiata var. radiata VC1973A) and an additional de novo sequencing of a wild relative mungbean (V. radiata var. sublobata) has provided a framework for mungbean genetic and genome research, that can further be used for genome-wide association and functional studies to identify genes related to specific agronomic traits. Moreover, the diverse gene pool of wild mungbean comprises valuable genetic resources of beneficial genes that may be helpful in widening the genetic diversity of cultivated mungbean. This review paper covers the research progress on molecular and genomics approaches and the current status of breeding programs that have developed to move toward the ultimate goal of mungbean improvement. PMID:26322067

  13. Programming cells by multiplex genome engineering and accelerated evolution

    PubMed Central

    Carr, Peter A.; Sun, Zachary Z.; Xu, George; Forest, Craig R.; Church, George M.

    2015-01-01

    The breadth of genomic diversity found among organisms in nature allows populations to adapt to diverse environments1,2. However, genomic diversity is difficult to generate in the laboratory and new phenotypes do not easily arise on practical timescales3. Although in vitro and directed evolution methods4–9 have created genetic variants with usefully altered phenotypes, these methods are limited to laborious and serial manipulation of single genes and are not used for parallel and continuous directed evolution of gene networks or genomes. Here, we describe multiplex automated genome engineering (MAGE) for large-scale programming and evolution of cells. MAGE simultaneously targets many locations on the chromosome for modification in a single cell or across a population of cells, thus producing combinatorial genomic diversity. Because the process is cyclical and scalable, we constructed prototype devices that automate the MAGE technology to facilitate rapid and continuous generation of a diverse set of genetic changes (mismatches, insertions, deletions). We applied MAGE to optimize the 1-deoxy-d-xylulose-5-phosphate (DXP) biosynthesis pathway in Escherichia coli to overproduce the industrially important isoprenoid lycopene. Twenty-four genetic components in the DXP pathway were modified simultaneously using a complex pool of synthetic DNA, creating over 4.3 billion combinatorial genomic variants per day. We isolated variants with more than fivefold increase in lycopene production within 3 days, a significant improvement over existing metabolic engineering techniques. Our multiplex approach embraces engineering in the context of evolution by expediting the design and evolution of organisms with new and improved properties. PMID:19633652

  14. Post-Genome Era Pedagogy: How a BS Biotechnology Program Benefits the Liberal Arts Institution

    ERIC Educational Resources Information Center

    Eden, Peter

    2005-01-01

    Genomics profoundly affects society, because genome sequence information is widely used in such areas as genetic testing, genomic medicine/vaccine development, and so forth. Therefore, a responsibility to modernize science curricula exists for "post-genome era" educators. At my university, we developed a BS biotechnology program within a…

  15. RNA-programmed genome editing in human cells.

    PubMed

    Jinek, Martin; East, Alexandra; Cheng, Aaron; Lin, Steven; Ma, Enbo; Doudna, Jennifer

    2013-01-29

    Type II CRISPR immune systems in bacteria use a dual RNA-guided DNA endonuclease, Cas9, to cleave foreign DNA at specific sites. We show here that Cas9 assembles with hybrid guide RNAs in human cells and can induce the formation of double-strand DNA breaks (DSBs) at a site complementary to the guide RNA sequence in genomic DNA. This cleavage activity requires both Cas9 and the complementary binding of the guide RNA. Experiments using extracts from transfected cells show that RNA expression and/or assembly into Cas9 is the limiting factor for Cas9-mediated DNA cleavage. In addition, we find that extension of the RNA sequence at the 3' end enhances DNA targeting activity in vivo. These results show that RNA-programmed genome editing is a facile strategy for introducing site-specific genetic changes in human cells.DOI:http://dx.doi.org/10.7554/eLife.00471.001.

  16. Assessing the integration of genomic medicine in genetic counseling training programs.

    PubMed

    Profato, Jessica; Gordon, Erynn S; Dixon, Shannan; Kwan, Andrea

    2014-08-01

    Medical genetics has entered a period of transition from genetics to genomics. Genetic counselors (GCs) may take on roles in the clinical implementation of genomics. This study explores the perspectives of program directors (PDs) on including genomic medicine in GC training programs, as well as the status of this integration. Study methods included an online survey, an optional one-on-one telephone interview, and an optional curricula content analysis. The majority of respondents (15/16) reported that it is important to include genomic medicine in program curricula. Most topics of genomic medicine are either "currently taught" or "under development" in all participating programs. Interview data from five PDs and one faculty member supported the survey data. Integrating genomics in training programs is challenging, and it is essential to develop genomics resources for curricula.

  17. 78 FR 18680 - Genomic Medicine Program Advisory Committee, Notice of Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2013-03-27

    ... AFFAIRS Genomic Medicine Program Advisory Committee, Notice of Meeting The Department of Veterans Affairs... Medicine Program Advisory Committee will meet on April 11, 2013, in Suite 1000 at the United States Access... Million Veteran Program, as well as the clinical Genomic Medicine Service. The emerging implications...

  18. Genomic Tools in Cowpea Breeding Programs: Status and Perspectives

    PubMed Central

    Boukar, Ousmane; Fatokun, Christian A.; Huynh, Bao-Lam; Roberts, Philip A.; Close, Timothy J.

    2016-01-01

    Cowpea is one of the most important grain legumes in sub-Saharan Africa (SSA). It provides strong support to the livelihood of small-scale farmers through its contributions to their nutritional security, income generation and soil fertility enhancement. Worldwide about 6.5 million metric tons of cowpea are produced annually on about 14.5 million hectares. The low productivity of cowpea is attributable to numerous abiotic and biotic constraints. The abiotic stress factors comprise drought, low soil fertility, and heat while biotic constraints include insects, diseases, parasitic weeds, and nematodes. Cowpea farmers also have limited access to quality seeds of improved varieties for planting. Some progress has been made through conventional breeding at international and national research institutions in the last three decades. Cowpea improvement could also benefit from modern breeding methods based on molecular genetic tools. A number of advances in cowpea genetic linkage maps, and quantitative trait loci associated with some desirable traits such as resistance to Striga, Macrophomina, Fusarium wilt, bacterial blight, root-knot nematodes, aphids, and foliar thrips have been reported. An improved consensus genetic linkage map has been developed and used to identify QTLs of additional traits. In order to take advantage of these developments single nucleotide polymorphism (SNP) genotyping is being streamlined to establish an efficient workflow supported by genotyping support service (GSS)-client interactions. About 1100 SNPs mapped on the cowpea genome were converted by LGC Genomics to KASP assays. Several cowpea breeding programs have been exploiting these resources to implement molecular breeding, especially for MARS and MABC, to accelerate cowpea variety improvement. The combination of conventional breeding and molecular breeding strategies, with workflow managed through the CGIAR breeding management system (BMS), promises an increase in the number of improved

  19. Genomic Tools in Cowpea Breeding Programs: Status and Perspectives.

    PubMed

    Boukar, Ousmane; Fatokun, Christian A; Huynh, Bao-Lam; Roberts, Philip A; Close, Timothy J

    2016-01-01

    Cowpea is one of the most important grain legumes in sub-Saharan Africa (SSA). It provides strong support to the livelihood of small-scale farmers through its contributions to their nutritional security, income generation and soil fertility enhancement. Worldwide about 6.5 million metric tons of cowpea are produced annually on about 14.5 million hectares. The low productivity of cowpea is attributable to numerous abiotic and biotic constraints. The abiotic stress factors comprise drought, low soil fertility, and heat while biotic constraints include insects, diseases, parasitic weeds, and nematodes. Cowpea farmers also have limited access to quality seeds of improved varieties for planting. Some progress has been made through conventional breeding at international and national research institutions in the last three decades. Cowpea improvement could also benefit from modern breeding methods based on molecular genetic tools. A number of advances in cowpea genetic linkage maps, and quantitative trait loci associated with some desirable traits such as resistance to Striga, Macrophomina, Fusarium wilt, bacterial blight, root-knot nematodes, aphids, and foliar thrips have been reported. An improved consensus genetic linkage map has been developed and used to identify QTLs of additional traits. In order to take advantage of these developments single nucleotide polymorphism (SNP) genotyping is being streamlined to establish an efficient workflow supported by genotyping support service (GSS)-client interactions. About 1100 SNPs mapped on the cowpea genome were converted by LGC Genomics to KASP assays. Several cowpea breeding programs have been exploiting these resources to implement molecular breeding, especially for MARS and MABC, to accelerate cowpea variety improvement. The combination of conventional breeding and molecular breeding strategies, with workflow managed through the CGIAR breeding management system (BMS), promises an increase in the number of improved

  20. Programming biological operating systems: genome design, assembly and activation.

    PubMed

    Gibson, Daniel G

    2014-05-01

    The DNA technologies developed over the past 20 years for reading and writing the genetic code converged when the first synthetic cell was created 4 years ago. An outcome of this work has been an extraordinary set of tools for synthesizing, assembling, engineering and transplanting whole bacterial genomes. Technical progress, options and applications for bacterial genome design, assembly and activation are discussed.

  1. BARC 2010 Annual Report for NRSP-8 National Animal Genome Research Program (NAGRP)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    BARC 2010 has contributed to the National Animal Genome Research Program (NAGRP). The NRSP8 NAGRP’s objectives are 1) Enhance and integrate genetic and physical maps of agriculturally important animals for cross species comparisons and sequence annotation; 2) Facilitate integration of genomic, trans...

  2. BARC 2009 Annual Report for NRSP-8 National Animal Genome Research Program (NAGRP)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    BARC 2009 has contributed to the National Animal Genome Research Program (NAGRP). The NRSP8 NAGRP’s objectives are 1) Enhance and integrate genetic and physical maps of agriculturally important animals for cross species comparisons and sequence annotation; 2) Facilitate integration of genomic, trans...

  3. 77 FR 58913 - Genomic Medicine Program Advisory Committee, Notice of Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-09-24

    ... From the Federal Register Online via the Government Publishing Office DEPARTMENT OF VETERANS AFFAIRS Genomic Medicine Program Advisory Committee, Notice of Meeting The Department of Veterans Affairs (VA) gives notice under Public Law 92-463 (Federal Advisory Committee Act) that the Genomic...

  4. 77 FR 16898 - Genomic Medicine Program Advisory Committee, Notice of Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-03-22

    ... From the Federal Register Online via the Government Publishing Office DEPARTMENT OF VETERANS AFFAIRS Genomic Medicine Program Advisory Committee, Notice of Meeting The Department of Veterans Affairs (VA) gives notice under Public Law 92-463 (Federal Advisory Committee Act) that the Genomic...

  5. 78 FR 58612 - Genomic Medicine Program Advisory Committee, Notice of Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2013-09-24

    ... From the Federal Register Online via the Government Publishing Office DEPARTMENT OF VETERANS AFFAIRS Genomic Medicine Program Advisory Committee, Notice of Meeting The Department of Veterans Affairs (VA) gives notice under the Federal Advisory Committee Act, 5 U.S.C. App. 2, that the Genomic...

  6. 75 FR 61861 - Genomic Medicine Program Advisory Committee; Notice of Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-10-06

    ... From the Federal Register Online via the Government Publishing Office DEPARTMENT OF VETERANS AFFAIRS Genomic Medicine Program Advisory Committee; Notice of Meeting The Department of Veterans Affairs (VA) gives notice under Public Law 92-463 (Federal Advisory Committee Act) that the Genomic...

  7. 76 FR 24573 - Genomic Medicine Program Advisory Committee; Notice of Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-05-02

    ... From the Federal Register Online via the Government Publishing Office DEPARTMENT OF VETERANS AFFAIRS Genomic Medicine Program Advisory Committee; Notice of Meeting The Department of Veterans Affairs (VA) gives notice under Public Law 92-463 (Federal Advisory Committee Act) that the Genomic...

  8. Genomic selection accuracy using multi-family prediction models in a wheat breeding program

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genomic selection (GS) uses genome-wide molecular marker data to predict the genetic value of selection candidates in breeding programs. In plant breeding, the ability to produce large numbers of progeny per cross allows GS to be conducted within each family. However, this approach requires phenotyp...

  9. USDA animal genomics program: the view from the chicken coop

    PubMed Central

    2009-01-01

    In 2007, the USDA Animal Genomics Strategic Planning Task Force prepared a Blueprint to direct national needs for future research, education, and extension efforts in agricultural animal genomics. This plan is entitled "Blueprint for USDA Efforts in Agricultural Animal Genomics 2008–2017". The Blueprint is reviewed from the perspective of a molecular biologist working within the poultry breeding industry. The diverse species used in animal agriculture require different tools, resources and technologies for their improvement. The specific requirements for chickens are described in this report. PMID:19607651

  10. RNA-Mediated Epigenetic Programming of Genome Rearrangements

    PubMed Central

    Nowacki, Mariusz; Shetty, Keerthi; Landweber, Laura F.

    2012-01-01

    RNA, normally thought of as a conduit in gene expression, has a novel mode of action in ciliated protozoa. Maternal RNA templates provide both an organizing guide for DNA rearrangements and a template that can transport somatic mutations to the next generation. This opportunity for RNA-mediated genome rearrangement and DNA repair is profound in the ciliate Oxytricha, which deletes 95% of its germline genome during development in a process that severely fragments its chromosomes and then sorts and reorders the hundreds of thousands of pieces remaining. Oxytricha’s somatic nuclear genome is therefore an epigenome formed through RNA templates and signals arising from the previous generation. Furthermore, this mechanism of RNA-mediated epigenetic inheritance can function across multiple generations, and the discovery of maternal template RNA molecules has revealed new biological roles for RNA and has hinted at the power of RNA molecules to sculpt genomic information in cells. PMID:21801022

  11. Coordination of Programs on Domestic Animal Genomics: The Federal Framework

    DTIC Science & Technology

    2004-06-01

    including: • Large-scale sequencing to produce draft genome sequences (8-fold sequence coverage) of honeybee, chicken, dog , cattle, swine, and cat ...develop draft-quality genome sequences for the chicken, cow, honeybee, pig, dog , and cat . All were designated to be “high priority” for sequencing by...NHGRI supported sequencing centers (with the understanding that close evolutionary relationships between dog / cat and cow/pig would only allow

  12. Genomic selection needs to be carefully assessed to meet specific requirements in livestock breeding programs.

    PubMed

    Jonas, Elisabeth; de Koning, Dirk-Jan

    2015-01-01

    Genomic selection is a promising development in agriculture, aiming improved production by exploiting molecular genetic markers to design novel breeding programs and to develop new markers-based models for genetic evaluation. It opens opportunities for research, as novel algorithms and lab methodologies are developed. Genomic selection can be applied in many breeds and species. Further research on the implementation of genomic selection (GS) in breeding programs is highly desirable not only for the common good, but also the private sector (breeding companies). It has been projected that this approach will improve selection routines, especially in species with long reproduction cycles, late or sex-limited or expensive trait recording and for complex traits. The task of integrating GS into existing breeding programs is, however, not straightforward. Despite successful integration into breeding programs for dairy cattle, it has yet to be shown how much emphasis can be given to the genomic information and how much additional phenotypic information is needed from new selection candidates. Genomic selection is already part of future planning in many breeding companies of pigs and beef cattle among others, but further research is needed to fully estimate how effective the use of genomic information will be for the prediction of the performance of future breeding stock. Genomic prediction of production in crossbreeding and across-breed schemes, costs and choice of individuals for genotyping are reasons for a reluctance to fully rely on genomic information for selection decisions. Breeding objectives are highly dependent on the industry and the additional gain when using genomic information has to be considered carefully. This review synthesizes some of the suggested approaches in selected livestock species including cattle, pig, chicken, and fish. It outlines tasks to help understanding possible consequences when applying genomic information in breeding scenarios.

  13. Genomic Tools in Pea Breeding Programs: Status and Perspectives

    PubMed Central

    Tayeh, Nadim; Aubert, Grégoire; Pilet-Nayel, Marie-Laure; Lejeune-Hénaut, Isabelle; Warkentin, Thomas D.; Burstin, Judith

    2015-01-01

    Pea (Pisum sativum L.) is an annual cool-season legume and one of the oldest domesticated crops. Dry pea seeds contain 22–25% protein, complex starch and fiber constituents, and a rich array of vitamins, minerals, and phytochemicals which make them a valuable source for human consumption and livestock feed. Dry pea ranks third to common bean and chickpea as the most widely grown pulse in the world with more than 11 million tons produced in 2013. Pea breeding has achieved great success since the time of Mendel's experiments in the mid-1800s. However, several traits still require significant improvement for better yield stability in a larger growing area. Key breeding objectives in pea include improving biotic and abiotic stress resistance and enhancing yield components and seed quality. Taking advantage of the diversity present in the pea genepool, many mapping populations have been constructed in the last decades and efforts have been deployed to identify loci involved in the control of target traits and further introgress them into elite breeding materials. Pea now benefits from next-generation sequencing and high-throughput genotyping technologies that are paving the way for genome-wide association studies and genomic selection approaches. This review covers the significant development and deployment of genomic tools for pea breeding in recent years. Future prospects are discussed especially in light of current progress toward deciphering the pea genome. PMID:26640470

  14. Coordination of Programs on Domestic Animal Genomics: A Federal Framework

    DTIC Science & Technology

    2003-09-01

    Executive Office of the President Food Animal Production Agriculture Research Service U.S. Department of Agriculture Associates: Rodney J. Brown...diseases will benefit human health and the development of new pharmaceutical products . With significant input from Federal agencies currently investing...their functional products on a molecular level and their interactions with other genes. Genome sequencing projects are underway for several animals

  15. Flexible approaches for teaching computational genomics in a health information management program.

    PubMed

    Zhou, Leming; Watzlaf, Valerie; Abdelhak, Mervat

    2013-01-01

    The astonishing improvement of high-throughput biotechnologies in recent years makes it possible to access a huge amount of genomic data. The association between genomic data and genetic disease has already been and will continue to be applied to personalized healthcare. Health information management (HIM) professionals are the ones who will handle personal genetic information and provide solid evidence to support physicians' diagnoses and personalized treatment strategies, and therefore they will need to have the knowledge and skills to process genomic data. In this paper, we describe flexible approaches for teaching a computational genomics course in the HIM program at the University of Pittsburgh. HIM programs at other universities may choose an appropriate approach to fit into their own curriculum.

  16. CHALLENGES FOR IMPLEMENTING A PTSD PREVENTIVE GENOMIC SEQUENCING PROGRAM IN THE U.S. MILITARY

    PubMed Central

    Lázaro-Muñoz, Gabriel; Juengst, Eric T.

    2015-01-01

    There is growing interest in using the quickly developing field of genomics to contribute to military readiness and effectiveness. Specifically, influential military advisory panels have recommended that the U.S. military apply genomics to help treat, prevent, or minimize the risk for post-traumatic stress disorder (PTSD) among service members. This article highlights some important scientific, legal, and ethical challenges regarding the development and deployment of a preventive genomic sequencing (PGS) program to predict the risk of PTSD among military service members. PMID:26401056

  17. Cancer Therapy Evaluation Program | Office of Cancer Genomics

    Cancer.gov

    The Cancer Therapy Evaluation Program (CTEP) seeks to improve the lives of cancer patients by finding better treatments, control mechanisms, and cures for cancer. CTEP funds a national program of cancer research, sponsoring clinical trials to evaluate new anti-cancer agents.

  18. Genome-wide alterations of the DNA replication program during tumor progression

    NASA Astrophysics Data System (ADS)

    Arneodo, A.; Goldar, A.; Argoul, F.; Hyrien, O.; Audit, B.

    2016-08-01

    Oncogenic stress is a major driving force in the early stages of cancer development. Recent experimental findings reveal that, in precancerous lesions and cancers, activated oncogenes may induce stalling and dissociation of DNA replication forks resulting in DNA damage. Replication timing is emerging as an important epigenetic feature that recapitulates several genomic, epigenetic and functional specificities of even closely related cell types. There is increasing evidence that chromosome rearrangements, the hallmark of many cancer genomes, are intimately associated with the DNA replication program and that epigenetic replication timing changes often precede chromosomic rearrangements. The recent development of a novel methodology to map replication fork polarity using deep sequencing of Okazaki fragments has provided new and complementary genome-wide replication profiling data. We review the results of a wavelet-based multi-scale analysis of genomic and epigenetic data including replication profiles along human chromosomes. These results provide new insight into the spatio-temporal replication program and its dynamics during differentiation. Here our goal is to bring to cancer research, the experimental protocols and computational methodologies for replication program profiling, and also the modeling of the spatio-temporal replication program. To illustrate our purpose, we report very preliminary results obtained for the chronic myelogeneous leukemia, the archetype model of cancer. Finally, we discuss promising perspectives on using genome-wide DNA replication profiling as a novel efficient tool for cancer diagnosis, prognosis and personalized treatment.

  19. Programming the genome in embryonic and somatic stem cells.

    PubMed

    Collas, Philippe; Noer, Agate; Timoskainen, Sanna

    2007-01-01

    In opposition to terminally differentiated cells, stem cells can self-renew and give rise to multiple cell types. Embryonic stem cells retain the ability of the inner cell mass of blastocysts to differentiate into all cell types of the body and have acquired in culture unlimited self-renewal capacity. Somatic stem cells are found in many adult tissues, have an extensive but finite lifespan and can differentiate into a more restricted array of cell types. A growing body of evidence indicates that multi-lineage differentiation ability of stem cells can be defined by the potential for expression of lineage-specification genes. Gene expression, or as emphasized here, potential for gene expression, is largely controlled by epigenetic modifications of DNA and chromatin on genomic regulatory and coding regions. These modifications modulate chromatin organization not only on specific genes but also at the level of the whole nucleus; they can also affect timing of DNA replication. This review highlights how mechanisms by which genes are poised for transcription in undifferentiated stem cells are being uncovered through primarily the mapping of DNA methylation, histone modifications and transcription factor binding throughout the genome. The combinatorial association of epigenetic marks on developmentally regulated and lineage-specifying genes in undifferentiated cells seems to define a pluripotent state.

  20. Introduction to Metagenomics at DOE JGI: Program Overview and Program Informatics (Metagenomics Informatics Challenges Workshop: 10K Genomes at a Time)

    ScienceCinema

    Tringe, Susannah [DOE JGI

    2016-07-12

    Susannah Tringe of the DOE Joint Genome Institute talks about the Program Overview and Program Informatics at the Metagenomics Informatics Challenges Workshop held at the DOE JGI on October 12-13, 2011

  1. Bromodomain-dependent stage-specific male genome programming by Brdt.

    PubMed

    Gaucher, Jonathan; Boussouar, Fayçal; Montellier, Emilie; Curtet, Sandrine; Buchou, Thierry; Bertrand, Sarah; Hery, Patrick; Jounier, Sylvie; Depaux, Arnaud; Vitte, Anne-Laure; Guardiola, Philippe; Pernet, Karin; Debernardi, Alexandra; Lopez, Fabrice; Holota, Hélène; Imbert, Jean; Wolgemuth, Debra J; Gérard, Matthieu; Rousseaux, Sophie; Khochbin, Saadi

    2012-10-03

    Male germ cell differentiation is a highly regulated multistep process initiated by the commitment of progenitor cells into meiosis and characterized by major chromatin reorganizations in haploid spermatids. We report here that a single member of the double bromodomain BET factors, Brdt, is a master regulator of both meiotic divisions and post-meiotic genome repackaging. Upon its activation at the onset of meiosis, Brdt drives and determines the developmental timing of a testis-specific gene expression program. In meiotic and post-meiotic cells, Brdt initiates a genuine histone acetylation-guided programming of the genome by activating essential genes and repressing a 'progenitor cells' gene expression program. At post-meiotic stages, a global chromatin hyperacetylation gives the signal for Brdt's first bromodomain to direct the genome-wide replacement of histones by transition proteins. Brdt is therefore a unique and essential regulator of male germ cell differentiation, which, by using various domains in a developmentally controlled manner, first drives a specific spermatogenic gene expression program, and later controls the tight packaging of the male genome.

  2. Strategies, actions, and outcomes of pilot state programs in public health genomics, 2003-2008.

    PubMed

    St Pierre, Jeanette; Bach, Janice; Duquette, Debra; Oehlke, Kristen; Nystrom, Robert; Silvey, Kerry; Zlot, Amy; Giles, Rebecca; Johnson, Jenny; Anders, H Mack; Gwinn, Marta; Bowen, Scott; Khoury, Muin J

    2014-06-12

    State health departments in Michigan, Minnesota, Oregon, and Utah explored the use of genomic information, including family health history, in chronic disease prevention programs. To support these explorations, the Office of Public Health Genomics at the Centers for Disease Control and Prevention provided cooperative agreement funds from 2003 through 2008. The 4 states' chronic disease programs identified advocates, formed partnerships, and assessed public data; they integrated genomics into existing state plans for genetics and chronic disease prevention; they developed projects focused on prevention of asthma, cancer, cardiovascular disease, diabetes, and other chronic conditions; and they created educational curricula and materials for health workers, policymakers, and the public. Each state's program was different because of the need to adapt to existing culture, infrastructure, and resources, yet all were able to enhance their chronic disease prevention programs with the use of family health history, a low-tech "genomic tool." Additional states are drawing on the experience of these 4 states to develop their own approaches.

  3. Potential benefits of genomic selection on genetic gain of small ruminant breeding programs.

    PubMed

    Shumbusho, F; Raoul, J; Astruc, J M; Palhiere, I; Elsen, J M

    2013-08-01

    In conventional small ruminant breeding programs, only pedigree and phenotype records are used to make selection decisions but prospects of including genomic information are now under consideration. The objective of this study was to assess the potential benefits of genomic selection on the genetic gain in French sheep and goat breeding designs of today. Traditional and genomic scenarios were modeled with deterministic methods for 3 breeding programs. The models included decisional variables related to male selection candidates, progeny testing capacity, and economic weights that were optimized to maximize annual genetic gain (AGG) of i) a meat sheep breeding program that improved a meat trait of heritability (h(2)) = 0.30 and a maternal trait of h(2) = 0.09 and ii) dairy sheep and goat breeding programs that improved a milk trait of h(2) = 0.30. Values of ±0.20 of genetic correlation between meat and maternal traits were considered to study their effects on AGG. The Bulmer effect was accounted for and the results presented here are the averages of AGG after 10 generations of selection. Results showed that current traditional breeding programs provide an AGG of 0.095 genetic standard deviation (σa) for meat and 0.061 σa for maternal trait in meat breed and 0.147 σa and 0.120 σa in sheep and goat dairy breeds, respectively. By optimizing decisional variables, the AGG with traditional selection methods increased to 0.139 σa for meat and 0.096 σa for maternal traits in meat breeding programs and to 0.174 σa and 0.183 σa in dairy sheep and goat breeding programs, respectively. With a medium-sized reference population (nref) of 2,000 individuals, the best genomic scenarios gave an AGG that was 17.9% greater than with traditional selection methods with optimized values of decisional variables for combined meat and maternal traits in meat sheep, 51.7% in dairy sheep, and 26.2% in dairy goats. The superiority of genomic schemes increased with the size of the

  4. The Ethical, Legal, and Social Implications Program of the National Human Genome Research Institute: reflections on an ongoing experiment.

    PubMed

    McEwen, Jean E; Boyer, Joy T; Sun, Kathie Y; Rothenberg, Karen H; Lockhart, Nicole C; Guyer, Mark S

    2014-01-01

    For more than 20 years, the Ethical, Legal, and Social Implications (ELSI) Program of the National Human Genome Research Institute has supported empirical and conceptual research to anticipate and address the ethical, legal, and social implications of genomics. As a component of the agency that funds much of the underlying science, the program has always been an experiment. The ever-expanding number of issues the program addresses and the relatively low level of commitment on the part of other funding agencies to support such research make setting priorities especially challenging. Program-supported studies have had a significant impact on the conduct of genomics research, the implementation of genomic medicine, and broader public policies. The program's influence is likely to grow as ELSI research, genomics research, and policy development activities become increasingly integrated. Achieving the benefits of increased integration while preserving the autonomy, objectivity, and intellectual independence of ELSI investigators presents ongoing challenges and new opportunities.

  5. Development of Structural Neurobiology and Genomics Programs in the Neurogenetic Institute

    SciTech Connect

    Henderson, Brian E., M.D.

    2006-11-10

    The purpose of the DOE equipment-only grant was to purchase instrumentation in support of structural biology and genomics core facilities in the Zilkha Neurogenetic Institute (ZNI). The ZNI, a new laboratory facility (125,000 GSF) and a center of excellence at the Keck School of Medicine of USC, was opened in 2003. The goal of the ZNI is to recruit upwards of 30 new faculty investigators engaged in interdisciplinary research programs that will add breadth and depth to existing school strengths in neuroscience, epidemiology and genetics. Many of these faculty, and other faculty researchers at the Keck School will access structural biology and genomics facilities developed in the ZNI.

  6. Data Standards for the Genomes to Life Program

    SciTech Connect

    Arkin, Adam; Ambrosiano, John; Babnigg, Gyorgy; Frank, Ed; Geist,Al; Giometti, Carol; Jacobsen, Janet; Samatova, Nagiza; Slater, Nancy; Taylor, Ron

    2004-01-31

    Existing GTL Projects already have produced volumes of dataand, over the course of the next five years, will produce an estimatedhundreds, or possibly thousands, of terabytes of data from hundreds ofexperiments conducted at dozens of laboratories in National Labs anduniversities across the nation. These data will be the basis forpublications by individual researchers, research groups, andmulti-institutional collaborations, and the basis for future DOEdecisions on funding further research in bioremediation. The short-termand long-term value of the data to project participants, to the DOE, andto the nation depends, however, on being able to access the data and onhow, or whether, the data are archived. The ability to access data is thestarting point for data analysis and interpretation, data integration,data mining, and development of data-driven models. Limited orinefficient data access means that less data are analyzed in acost-effective and timely manner. Data production in the GTL Program willlikely outstrip, or may have already outstripped, the ability to analyzethe data. Being able to access data depends on two key factors: datastandards and implementation of the data standards. For the purpose ofthis proposal, a data standard is defined as a standard, documented wayin which data and information about the data are describe. The attributesof the experiment in which the data were collected need to be known andthe measurements corresponding to the data collected need to bedescribed. In general terms, a data standard could be a form (electronicor paper) that is completed by a researcher or a document that prescribeshow a protocol or experiment should be described in writing.Datastandards are critical to data access because they provide a frameworkfor organizing and managing data. Researchers spend significant amountsof time managing data and information about experiments using labnotebooks, computer files, Excel spreadsheets, etc. In addition, dataoutput format

  7. Efficient and exact maximum likelihood quantisation of genomic features using dynamic programming.

    PubMed

    Song, Mingzhou; Haralick, Robert M; Boissinot, Stéphane

    2010-01-01

    An efficient and exact dynamic programming algorithm is introduced to quantise a continuous random variable into a discrete random variable that maximises the likelihood of the quantised probability distribution for the original continuous random variable. Quantisation is often useful before statistical analysis and modelling of large discrete network models from observations of multiple continuous random variables. The quantisation algorithm is applied to genomic features including the recombination rate distribution across the chromosomes and the non-coding transposable element LINE-1 in the human genome. The association pattern is studied between the recombination rate, obtained by quantisation at genomic locations around LINE-1 elements, and the length groups of LINE-1 elements, also obtained by quantisation on LINE-1 length. The exact and density-preserving quantisation approach provides an alternative superior to the inexact and distance-based univariate iterative k-means clustering algorithm for discretisation.

  8. Programmed Minichromosome Elimination as a Mechanism for Somatic Genome Reduction in Tetrahymena thermophila

    PubMed Central

    Yao, Meng-Chao

    2016-01-01

    The maintenance of chromosome integrity is crucial for genetic stability. However, programmed chromosome fragmentations are known to occur in many organisms, and in the ciliate Tetrahymena the five germline chromosomes are fragmented into hundreds of minichromosomes during somatic nuclear differentiation. Here, we showed that there are different fates of these minichromosomes after chromosome breakage. Among the 326 somatic minichromosomes identified using genomic data, 50 are selectively eliminated from the mature somatic genome. Interestingly, many and probably most of these minichromosomes are eliminated during the growth period between 6 and 20 doublings right after conjugation. Genes with potential conjugation-specific functions are found in these minichromosomes. This study revealed a new mode of programmed DNA elimination in ciliates similar to those observed in parasitic nematodes, which could play a role in developmental gene regulation. PMID:27806059

  9. Economic evaluation of genomic selection in small ruminants: a sheep meat breeding program.

    PubMed

    Shumbusho, F; Raoul, J; Astruc, J M; Palhiere, I; Lemarié, S; Fugeray-Scarbel, A; Elsen, J M

    2016-06-01

    Recent genomic evaluation studies using real data and predicting genetic gain by modeling breeding programs have reported moderate expected benefits from the replacement of classic selection schemes by genomic selection (GS) in small ruminants. The objectives of this study were to compare the cost, monetary genetic gain and economic efficiency of classic selection and GS schemes in the meat sheep industry. Deterministic methods were used to model selection based on multi-trait indices from a sheep meat breeding program. Decisional variables related to male selection candidates and progeny testing were optimized to maximize the annual monetary genetic gain (AMGG), that is, a weighted sum of meat and maternal traits annual genetic gains. For GS, a reference population of 2000 individuals was assumed and genomic information was available for evaluation of male candidates only. In the classic selection scheme, males breeding values were estimated from own and offspring phenotypes. In GS, different scenarios were considered, differing by the information used to select males (genomic only, genomic+own performance, genomic+offspring phenotypes). The results showed that all GS scenarios were associated with higher total variable costs than classic selection (if the cost of genotyping was 123 euros/animal). In terms of AMGG and economic returns, GS scenarios were found to be superior to classic selection only if genomic information was combined with their own meat phenotypes (GS-Pheno) or with their progeny test information. The predicted economic efficiency, defined as returns (proportional to number of expressions of AMGG in the nucleus and commercial flocks) minus total variable costs, showed that the best GS scenario (GS-Pheno) was up to 15% more efficient than classic selection. For all selection scenarios, optimization increased the overall AMGG, returns and economic efficiency. As a conclusion, our study shows that some forms of GS strategies are more advantageous

  10. A survey of application: genomics and genetic programming, a new frontier.

    PubMed

    Khan, Mohammad Wahab; Alam, Mansaf

    2012-08-01

    The aim of this paper is to provide an introduction to the rapidly developing field of genetic programming (GP). Particular emphasis is placed on the application of GP to genomics. First, the basic methodology of GP is introduced. This is followed by a review of applications in the areas of gene network inference, gene expression data analysis, SNP analysis, epistasis analysis and gene annotation. Finally this paper concluded by suggesting potential avenues of possible future research on genetic programming, opportunities to extend the technique, and areas for possible practical applications.

  11. Design and Implementation of a Genomics Field Trip Program Aimed at Secondary School Students

    PubMed Central

    Fox, Joanne A.

    2012-01-01

    With the rapid pace of advancements in biological research brought about by the application of computer science and information technology, we believe the time is right for introducing genomics and bioinformatics tools and concepts to secondary school students. Our approach has been to offer a full-day field trip in our research facility where secondary school students carry out experiments at the laboratory bench and on a laptop computer. This experience offers benefits for students, teachers, and field trip instructors. In delivering a wide variety of science outreach and education programs, we have learned that a number of factors contribute to designing a successful experience for secondary school students. First, it is important to engage students with authentic and fun activities that are linked to real-world applications and/or research questions. Second, connecting with a local high school teacher to pilot programs and linking to curricula taught in secondary schools will enrich the field trip experience. Whether or not programs are linked directly to local teachers, it is important to be flexible and build in mechanisms for collecting feedback in field trip programs. Finally, graduate students can be very powerful mentors for students and should be encouraged to share their enthusiasm for science and to talk about career paths. Our experiences suggest a real need for effective science outreach programs at the secondary school level and that genomics and bioinformatics are ideal areas to explore. PMID:22956895

  12. Prokaryotic Super Program Advisory Committee DOE Joint Genome Institute, Walnut Creek, CA, March 27, 2013

    PubMed Central

    Garrity, George M.; Banfield, Jill; Eisen, Jonathan; van der Lelie, Niels; McMahon, Trina; Rusch, Doug; DeLong, Edward; Moran, Mary Ann; Currie, Cameron; Furhman, Jed; Hallam, Steve; Hugenholtz, Phil; Moran, Nancy; Nelson, Karen; Roberts, Richard; Stepanauskas, Ramunas

    2013-01-01

    The Prokaryotic Super Program Advisory Committee met on March 27, 2013 for their annual review the Prokaryotic Super Program at the DOE Joint Genome Institute. As is the case with any site visit or program review, the objective is to evaluate progress in meeting organizational objectives, provide feedback to from the user-community and to assist the JGI in formulating plans for the coming year. The advisors want to commend the JGI for its central role in developing new technologies and capabilities, and for catalyzing the formation of new collaborative user communities. Highlights of the post-meeting exchanges among the advisors focused on the importance of programmatic initiatives including: • GEBA, which serves as a phylogenetic “base-map” on which our knowledge of functional diversity can be layered. • FEBA, which promises to provide new insights into the physiological capabilities of prokaryotes under highly standardized conditions. • Single-cell genomics technology, which is seen to significantly enhance our ability to interpret genomic and metagenomic data and broaden the scope of the GEBA program to encompass at least a part of the microbial “dark-matter”. • IMG, which is seen to play a central role in JGI programs and is viewed as a strategically important asset in the JGI portfolio. On this latter point, the committee encourages the formation of a strategic relationship between IMG and the Kbase to ensure that the intelligence, deep knowledge and experience captured in the former is not lost. The committee strongly urges the DOE to continue its support for maintaining this critical resource. PMID:24501639

  13. Report on the Imaging Workshop for the Genomes to Life Program, April 16-18, 2002

    SciTech Connect

    Colson, STEVEN

    2003-08-04

    This report is a result of the Imaging Workshop for the Genomes to Life (GTL) program held April 16-19, 2002, in Charlotte, North Carolina. The meeting was sponsored by the Office of Biological and Environmental Research and the Office of Advanced Scientific Computing Research of the U.S. Department of Energy's (DOE) Office of Science. The purpose of the workshop was to project a broad vision for future needs and determine the value of imaging to GTL program research. The workshop included four technical sessions with plenary lectures on biology and technology perspectives and technical presentations on needs and approaches as they related to the following areas of the GTL program: (1) Molecular machines (protein complexes); (2) Intracellular and cellular structure, function, and processes; (3) Multicellular: Monoclonal and heterogeneous multicellular systems, cell-cell signaling, and model systems; and (4) Cells in situ and in vivo: Bacteria in the natural environment, microenvironment, and in vivo systems.

  14. Whole-Genome Screening of Newborns? The Constitutional Boundaries of State Newborn Screening Programs.

    PubMed

    King, Jaime S; Smith, Monica E

    2016-01-01

    State newborn screening (NBS) programs routinely screen nearly all of the 4 million newborns in the United States each year for ∼30 primary conditions and a number of secondary conditions. NBS could be on the cusp of an unprecedented expansion as a result of advances in whole-genome sequencing (WGS). As WGS becomes cheaper and easier and as our knowledge and understanding of human genetics expand, the question of whether WGS has a role to play in state NBS programs becomes increasingly relevant and complex. As geneticists and state public health officials begin to contemplate the technical and procedural details of whether WGS could benefit existing NBS programs, this is an opportune time to revisit the legal framework of state NBS programs. In this article, we examine the constitutional underpinnings of state-mandated NBS and explore the range of current state statutes and regulations that govern the programs. We consider the legal refinements that will be needed to keep state NBS programs within constitutional bounds, focusing on 2 areas of concern: consent procedures and the criteria used to select new conditions for NBS panels. We conclude by providing options for states to consider when contemplating the use of WGS for NBS.

  15. Whole-Genome Screening of Newborns? The Constitutional Boundaries of State Newborn Screening Programs

    PubMed Central

    King, Jaime S.; Smith, Monica E.

    2016-01-01

    State newborn screening (NBS) programs routinely screen nearly all of the 4 million newborns in the United States each year for ~30 primary conditions and a number of secondary conditions. NBS could be on the cusp of an unprecedented expansion as a result of advances in whole-genome sequencing (WGS). As WGS becomes cheaper and easier and as our knowledge and understanding of human genetics expand, the question of whether WGS has a role to play in state NBS programs becomes increasingly relevant and complex. As geneticists and state public health officials begin to contemplate the technical and procedural details of whether WGS could benefit existing NBS programs, this is an opportune time to revisit the legal framework of state NBS programs. In this article, we examine the constitutional underpinnings of state-mandated NBS and explore the range of current state statutes and regulations that govern the programs. We consider the legal refinements that will be needed to keep state NBS programs within constitutional bounds, focusing on 2 areas of concern: consent procedures and the criteria used to select new conditions for NBS panels. We conclude by providing options for states to consider when contemplating the use of WGS for NBS. PMID:26729704

  16. The Human Genome Project and Mental Retardation: An Educational Program. Final Progress Report

    SciTech Connect

    Davis, Sharon

    1999-05-03

    The Arc, a national organization on mental retardation, conducted an educational program for members, many of whom have a family member with a genetic condition causing mental retardation. The project informed members about the Human Genome scientific efforts, conducted training regarding ethical, legal and social implications and involved members in issue discussions. Short reports and fact sheets on genetic and ELSI topics were disseminated to 2,200 of the Arc's leaders across the country and to other interested individuals. Materials produced by the project can e found on the Arc's web site, TheArc.org.

  17. READSCAN: a fast and scalable pathogen discovery program with accurate genome relative abundance estimation

    PubMed Central

    Rashid, Mamoon; Pain, Arnab

    2013-01-01

    Summary: READSCAN is a highly scalable parallel program to identify non-host sequences (of potential pathogen origin) and estimate their genome relative abundance in high-throughput sequence datasets. READSCAN accurately classified human and viral sequences on a 20.1 million reads simulated dataset in <27 min using a small Beowulf compute cluster with 16 nodes (Supplementary Material). Availability: http://cbrc.kaust.edu.sa/readscan Contact: arnab.pain@kaust.edu.sa or raeece.naeem@gmail.com Supplementary information: Supplementary data are available at Bioinformatics online. PMID:23193222

  18. New accuracy estimators for genomic selection with application in a cassava (Manihot esculenta) breeding program.

    PubMed

    Azevedo, C F; Resende, M D V; Silva, F F; Viana, J M S; Valente, M S F; Resende, M F R; Oliveira, E J

    2016-10-05

    Genomic selection is the main force driving applied breeding programs and accuracy is the main measure for evaluating its efficiency. The traditional estimator (TE) of experimental accuracy is not fully adequate. This study proposes and evaluates the performance and efficiency of two new accuracy estimators, called regularized estimator (RE) and hybrid estimator (HE), which were applied to a practical cassava breeding program and also to simulated data. The simulation study considered two individual narrow sense heritability levels and two genetic architectures for traits. TE, RE, and HE were compared under four validation procedures: without validation (WV), independent validation, ten-fold validation through jacknife allowing different markers, and with the same markers selected in each cycle. RE presented accuracies closer to the parametric ones and less biased and more precise ones than TE. HE proved to be very effective in the WV procedure. The estimators were applied to five traits evaluated in a cassava experiment, including 358 clones genotyped for 390 SNPs. Accuracies ranged from 0.67 to 1.12 with TE and from 0.22 to 0.51 with RE. These results indicated that TE overestimated the accuracy and led to one accuracy estimate (1.12) higher than one, which is outside of the parameter space. Use of RE turned the accuracy into the parameter space. Cassava breeding programs can be more realistically implemented using the new estimators proposed in this study, providing less risky practical inferences.

  19. Control of VEGF-A transcriptional programs by pausing and genomic compartmentalization

    PubMed Central

    Kaikkonen, Minna U.; Niskanen, Henri; Romanoski, Casey E.; Kansanen, Emilia; Kivelä, Annukka M.; Laitalainen, Jarkko; Heinz, Sven; Benner, Christopher; Glass, Christopher K.; Ylä-Herttuala, Seppo

    2014-01-01

    Vascular endothelial growth factor A (VEGF-A) is a master regulator of angiogenesis, vascular development and function. In this study we investigated the transcriptional regulation of VEGF-A-responsive genes in primary human aortic endothelial cells (HAECs) and human umbilical vein endothelial cells (HUVECs) using genome-wide global run-on sequencing (GRO-Seq). We demonstrate that half of VEGF-A-regulated gene promoters are characterized by a transcriptionally competent paused RNA polymerase II (Pol II). We show that transition into productive elongation is a major mechanism of gene activation of virtually all VEGF-regulated genes, whereas only ∼40% of the genes are induced at the level of initiation. In addition, we report a comprehensive chromatin interaction map generated in HUVECs using tethered conformation capture (TCC) and characterize chromatin interactions in relation to transcriptional activity. We demonstrate that sites of active transcription are more likely to engage in chromatin looping and cell type-specific transcriptional activity reflects the boundaries of chromatin interactions. Furthermore, we identify large chromatin compartments with a tendency to be coordinately transcribed upon VEGF-A stimulation. We provide evidence that these compartments are enriched for clusters of regulatory regions such as super-enhancers and for disease-associated single nucleotide polymorphisms (SNPs). Collectively, these findings provide new insights into mechanisms behind VEGF-A-regulated transcriptional programs in endothelial cells. PMID:25352550

  20. Genetic testing and Alzheimer disease: recommendations of the Stanford Program in Genomics, Ethics, and Society.

    PubMed

    McConnell, L M; Koenig, B A; Greely, H T; Raffin, T A

    1999-01-01

    Several genes associated with Alzheimer disease (AD) have been localized and cloned; two genetic tests are already commercially available, and new tests are being developed. Genetic testing for AD--either for disease prediction or for diagnosis--raises critical ethical concerns. The multidisciplinary Alzheimer Disease Working Group of the Stanford Program in Genomics, Ethics, and Society (PGES) presents comprehensive recommendations on genetic testing for AD. The Group concludes that under current conditions, genetic testing for AD prediction or diagnosis is only rarely appropriate. Criteria for judging the readiness of a test for introduction into routine clinical practice typically rely heavily on evaluation of technical efficacy. PGES recommends a broader and more comprehensive approach, considering: 1) the unique social and historical meanings of AD; 2) the availability of procedures to promote good surrogate decision making for incompetent patients and to safeguard confidentiality; 3) access to sophisticated genetic counselors able to communicate complex risk information and effectively convey the social costs and psychological burdens of testing, such as unintentional disclosure of predictive genetic information to family members; 4) protection from inappropriate advertising and marketing of genetic tests; and 5) recognition of the need for public education about the meaning and usefulness of predictive and diagnostic tests for AD. In this special issue of Genetic Testing, the PGES recommendations are published along with comprehensive background papers authored by Working Group members.

  1. FrameD: A flexible program for quality check and gene prediction in prokaryotic genomes and noisy matured eukaryotic sequences.

    PubMed

    Schiex, Thomas; Gouzy, Jérôme; Moisan, Annick; de Oliveira, Yannick

    2003-07-01

    We describe FrameD, a program that predicts coding regions in prokaryotic and matured eukaryotic sequences. Initially targeted at gene prediction in bacterial GC rich genomes, the gene model used in FrameD also allows to predict genes in the presence of frameshifts and partially undetermined sequences which makes it also very suitable for gene prediction and frameshift correction in unfinished sequences such as EST and EST cluster sequences. Like recent eukaryotic gene prediction programs, FrameD also includes the ability to take into account protein similarity information both in its prediction and its graphical output. Its performances are evaluated on different bacterial genomes. The web site (http://genopole.toulouse.inra.fr/bioinfo/FrameD/FD) allows direct prediction, sequence correction and translation and the ability to learn new models for new organisms.

  2. FrameD: a flexible program for quality check and gene prediction in prokaryotic genomes and noisy matured eukaryotic sequences

    PubMed Central

    Schiex, Thomas; Gouzy, Jérôme; Moisan, Annick; de Oliveira, Yannick

    2003-01-01

    We describe FrameD, a program that predicts coding regions in prokaryotic and matured eukaryotic sequences. Initially targeted at gene prediction in bacterial GC rich genomes, the gene model used in FrameD also allows to predict genes in the presence of frameshifts and partially undetermined sequences which makes it also very suitable for gene prediction and frameshift correction in unfinished sequences such as EST and EST cluster sequences. Like recent eukaryotic gene prediction programs, FrameD also includes the ability to take into account protein similarity information both in its prediction and its graphical output. Its performances are evaluated on different bacterial genomes. The web site (http://genopole.toulouse.inra.fr/bioinfo/FrameD/FD) allows direct prediction, sequence correction and translation and the ability to learn new models for new organisms. PMID:12824407

  3. Assessment of the scientific-technological production in molecular biology in Brazil (1996-2007): the contribution of genomics programs.

    PubMed

    Meneghini, Rogério; Gamba, Estêvão C

    2011-06-01

    Several genome sequencing programs were launched in Brazil by the end of the nineties and the early 2000s.The most important initiatives were supported by the ONSA program (http://watson.fapesp.br/onsa/Genoma3.htm) and aimed at gaining domain in genomic technology and bringing molecular biology to the state of art. Two main sets of data were collected in the 1996-2007 period to evaluate the results of these genome programs: the scientific production (Scopus and Web of Science databases) and the register of patents (US Patent and Trademark Office), both related to the progress of molecular biology along this period. In regard to the former, Brazil took a great leap in comparison to 17 other developed and developing countries, being only surpassed by China. As to the register of patents in the area of molecular biology, Brazil's performance lags far behind most of the countries focused in the present study, confirming the Brazilian long-standing tendency of poor achievements in technological innovations when compared with scientific production. Possible solutions to surpass this inequality are discussed.

  4. An Innovative Plant Genomics and Gene Annotation Program for High School, Community College, and University Faculty

    ERIC Educational Resources Information Center

    Hacisalihoglu, Gokhan; Hilgert, Uwe; Nash, E. Bruce; Micklos, David A.

    2008-01-01

    Today's biology educators face the challenge of training their students in modern molecular biology techniques including genomics and bioinformatics. The Dolan DNA Learning Center (DNALC) of Cold Spring Harbor Laboratory has developed and disseminated a bench- and computer-based plant genomics curriculum for biology faculty. In 2007, a five-day…

  5. 76 FR 65563 - Genomic Medicine Program Advisory Committee; Notice of Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-10-21

    ... (VA) gives notice under Public Law 92-463 (Federal Advisory Committee Act) that the Genomic Medicine... protecting the privacy of Veterans. The meeting focus will be on current and upcoming biological...

  6. EXONSAMPLER: a computer program for genome-wide and candidate gene exon sampling for targeted next-generation sequencing.

    PubMed

    Cosart, Ted; Beja-Pereira, Albano; Luikart, Gordon

    2014-11-01

    The computer program EXONSAMPLER automates the sampling of thousands of exon sequences from publicly available reference genome sequences and gene annotation databases. It was designed to provide exon sequences for the efficient, next-generation gene sequencing method called exon capture. The exon sequences can be sampled by a list of gene name abbreviations (e.g. IFNG, TLR1), or by sampling exons from genes spaced evenly across chromosomes. It provides a list of genomic coordinates (a bed file), as well as a set of sequences in fasta format. User-adjustable parameters for collecting exon sequences include a minimum and maximum acceptable exon length, maximum number of exonic base pairs (bp) to sample per gene, and maximum total bp for the entire collection. It allows for partial sampling of very large exons. It can preferentially sample upstream (5 prime) exons, downstream (3 prime) exons, both external exons, or all internal exons. It is written in the Python programming language using its free libraries. We describe the use of EXONSAMPLER to collect exon sequences from the domestic cow (Bos taurus) genome for the design of an exon-capture microarray to sequence exons from related species, including the zebu cow and wild bison. We collected ~10% of the exome (~3 million bp), including 155 candidate genes, and ~16,000 exons evenly spaced genomewide. We prioritized the collection of 5 prime exons to facilitate discovery and genotyping of SNPs near upstream gene regulatory DNA sequences, which control gene expression and are often under natural selection.

  7. Whole Genome Sequence Analysis of Salmonella Typhi Isolated in Thailand before and after the Introduction of a National Immunization Program.

    PubMed

    Dyson, Zoe A; Thanh, Duy Pham; Bodhidatta, Ladaporn; Mason, Carl Jeffries; Srijan, Apichai; Rabaa, Maia A; Vinh, Phat Voong; Thanh, Tuyen Ha; Thwaites, Guy E; Baker, Stephen; Holt, Kathryn E

    2017-01-01

    Vaccines against Salmonella Typhi, the causative agent of typhoid fever, are commonly used by travellers, however, there are few examples of national immunization programs in endemic areas. There is therefore a paucity of data on the impact of typhoid immunization programs on localised populations of S. Typhi. Here we have used whole genome sequencing (WGS) to characterise 44 historical bacterial isolates collected before and after a national typhoid immunization program that was implemented in Thailand in 1977 in response to a large outbreak; the program was highly effective in reducing typhoid case numbers. Thai isolates were highly diverse, including 10 distinct phylogenetic lineages or genotypes. Novel prophage and plasmids were also detected, including examples that were previously only reported in Shigella sonnei and Escherichia coli. The majority of S. Typhi genotypes observed prior to the immunization program were not observed following it. Post-vaccine era isolates were more closely related to S. Typhi isolated from neighbouring countries than to earlier Thai isolates, providing no evidence for the local persistence of endemic S. Typhi following the national immunization program. Rather, later cases of typhoid appeared to be caused by the occasional importation of common genotypes from neighbouring Vietnam, Laos, and Cambodia. These data show the value of WGS in understanding the impacts of vaccination on pathogen populations and provide support for the proposal that large-scale typhoid immunization programs in endemic areas could result in lasting local disease elimination, although larger prospective studies are needed to test this directly.

  8. Whole Genome Sequence Analysis of Salmonella Typhi Isolated in Thailand before and after the Introduction of a National Immunization Program

    PubMed Central

    Thanh, Duy Pham; Bodhidatta, Ladaporn; Mason, Carl Jeffries; Srijan, Apichai; Rabaa, Maia A.; Vinh, Phat Voong; Thanh, Tuyen Ha; Thwaites, Guy E.; Baker, Stephen; Holt, Kathryn E.

    2017-01-01

    Vaccines against Salmonella Typhi, the causative agent of typhoid fever, are commonly used by travellers, however, there are few examples of national immunization programs in endemic areas. There is therefore a paucity of data on the impact of typhoid immunization programs on localised populations of S. Typhi. Here we have used whole genome sequencing (WGS) to characterise 44 historical bacterial isolates collected before and after a national typhoid immunization program that was implemented in Thailand in 1977 in response to a large outbreak; the program was highly effective in reducing typhoid case numbers. Thai isolates were highly diverse, including 10 distinct phylogenetic lineages or genotypes. Novel prophage and plasmids were also detected, including examples that were previously only reported in Shigella sonnei and Escherichia coli. The majority of S. Typhi genotypes observed prior to the immunization program were not observed following it. Post-vaccine era isolates were more closely related to S. Typhi isolated from neighbouring countries than to earlier Thai isolates, providing no evidence for the local persistence of endemic S. Typhi following the national immunization program. Rather, later cases of typhoid appeared to be caused by the occasional importation of common genotypes from neighbouring Vietnam, Laos, and Cambodia. These data show the value of WGS in understanding the impacts of vaccination on pathogen populations and provide support for the proposal that large-scale typhoid immunization programs in endemic areas could result in lasting local disease elimination, although larger prospective studies are needed to test this directly. PMID:28060810

  9. Controlling inbreeding and maximizing genetic gain using semi-definite programming with pedigree-based and genomic relationships.

    PubMed

    Schierenbeck, S; Pimentel, E C G; Tietze, M; Körte, J; Reents, R; Reinhardt, F; Simianer, H; König, S

    2011-12-01

    Because of the relatively high levels of genetic relationships among potential bull sires and bull dams, innovative selection tools should consider both genetic gain and genetic relationships in a long-term perspective. Optimum genetic contribution theory using official estimated breeding values for a moderately heritable trait (production index, Index-PROD), and a lowly heritable functional trait (index for somatic cell score, Index-SCS) was applied to find optimal allocations of bull dams and bull sires. In contrast to previous practical applications using optimizations based on Lagrange multipliers, we focused on semi-definite programming (SDP). The SDP methodology was combined with either pedigree (a(ij)) or genomic relationships (f(ij)) among selection candidates. Selection candidates were 484 genotyped bulls, and 499 preselected genotyped bull dams completing a central test on station. In different scenarios separately for PROD and SCS, constraints on the average pedigree relationships among future progeny were varied from a(ij)=0.08 to a(ij)=0.20 in increments of 0.01. Corresponding constraints for single nucleotide polymorphism-based kinship coefficients were derived from regression analysis. Applying the coefficient of 0.52 with an intercept of 0.14 estimated for the regression pedigree relationship on genomic relationship, the corresponding range to alter genomic relationships varied from f(ij) = 0.18 to f(ij) = 0.24. Despite differences for some bulls in genomic and pedigree relationships, the same trends were observed for constraints on pedigree and corresponding genomic relationships regarding results in genetic gain and achieved coefficients of relationships. Generally, allowing higher values for relationships resulted in an increase of genetic gain for Index-PROD and Index-SCS and in a reduction in the number of selected sires. Interestingly, more sires were selected for all scenarios when restricting genomic relationships compared with restricting

  10. Democratizing Human Genome Project Information: A Model Program for Education, Information and Debate in Public Libraries.

    ERIC Educational Resources Information Center

    Pollack, Miriam

    The "Mapping the Human Genome" project demonstrated that librarians can help whomever they serve in accessing information resources in the areas of biological and health information, whether it is the scientists who are developing the information or a member of the public who is using the information. Public libraries can guide library…

  11. The Human Genome Initiative: Implications for the Comprehensive School Health Program.

    ERIC Educational Resources Information Center

    James, Delores C. S.

    1994-01-01

    The Human Genome Initiative (HGI) constructs common resources for studying human genetics. Early identification of people at risk for genetic disorders allows for early education and counseling. HGI research will create inexpensive, reliable genetic tests and diagnoses to help teachers and school staff assess, compare, and channel students. (SM)

  12. Ku-mediated coupling of DNA cleavage and repair during programmed genome rearrangements in the ciliate Paramecium tetraurelia.

    PubMed

    Marmignon, Antoine; Bischerour, Julien; Silve, Aude; Fojcik, Clémentine; Dubois, Emeline; Arnaiz, Olivier; Kapusta, Aurélie; Malinsky, Sophie; Bétermier, Mireille

    2014-08-01

    During somatic differentiation, physiological DNA double-strand breaks (DSB) can drive programmed genome rearrangements (PGR), during which DSB repair pathways are mobilized to safeguard genome integrity. Because of their unique nuclear dimorphism, ciliates are powerful unicellular eukaryotic models to study the mechanisms involved in PGR. At each sexual cycle, the germline nucleus is transmitted to the progeny, but the somatic nucleus, essential for gene expression, is destroyed and a new somatic nucleus differentiates from a copy of the germline nucleus. In Paramecium tetraurelia, the development of the somatic nucleus involves massive PGR, including the precise elimination of at least 45,000 germline sequences (Internal Eliminated Sequences, IES). IES excision proceeds through a cut-and-close mechanism: a domesticated transposase, PiggyMac, is essential for DNA cleavage, and DSB repair at excision sites involves the Ligase IV, a specific component of the non-homologous end-joining (NHEJ) pathway. At the genome-wide level, a huge number of programmed DSBs must be repaired during this process to allow the assembly of functional somatic chromosomes. To understand how DNA cleavage and DSB repair are coordinated during PGR, we have focused on Ku, the earliest actor of NHEJ-mediated repair. Two Ku70 and three Ku80 paralogs are encoded in the genome of P. tetraurelia: Ku70a and Ku80c are produced during sexual processes and localize specifically in the developing new somatic nucleus. Using RNA interference, we show that the development-specific Ku70/Ku80c heterodimer is essential for the recovery of a functional somatic nucleus. Strikingly, at the molecular level, PiggyMac-dependent DNA cleavage is abolished at IES boundaries in cells depleted for Ku80c, resulting in IES retention in the somatic genome. PiggyMac and Ku70a/Ku80c co-purify as a complex when overproduced in a heterologous system. We conclude that Ku has been integrated in the Paramecium DNA cleavage

  13. Ku-Mediated Coupling of DNA Cleavage and Repair during Programmed Genome Rearrangements in the Ciliate Paramecium tetraurelia

    PubMed Central

    Marmignon, Antoine; Bischerour, Julien; Silve, Aude; Fojcik, Clémentine; Dubois, Emeline; Arnaiz, Olivier; Kapusta, Aurélie; Malinsky, Sophie; Bétermier, Mireille

    2014-01-01

    During somatic differentiation, physiological DNA double-strand breaks (DSB) can drive programmed genome rearrangements (PGR), during which DSB repair pathways are mobilized to safeguard genome integrity. Because of their unique nuclear dimorphism, ciliates are powerful unicellular eukaryotic models to study the mechanisms involved in PGR. At each sexual cycle, the germline nucleus is transmitted to the progeny, but the somatic nucleus, essential for gene expression, is destroyed and a new somatic nucleus differentiates from a copy of the germline nucleus. In Paramecium tetraurelia, the development of the somatic nucleus involves massive PGR, including the precise elimination of at least 45,000 germline sequences (Internal Eliminated Sequences, IES). IES excision proceeds through a cut-and-close mechanism: a domesticated transposase, PiggyMac, is essential for DNA cleavage, and DSB repair at excision sites involves the Ligase IV, a specific component of the non-homologous end-joining (NHEJ) pathway. At the genome-wide level, a huge number of programmed DSBs must be repaired during this process to allow the assembly of functional somatic chromosomes. To understand how DNA cleavage and DSB repair are coordinated during PGR, we have focused on Ku, the earliest actor of NHEJ-mediated repair. Two Ku70 and three Ku80 paralogs are encoded in the genome of P. tetraurelia: Ku70a and Ku80c are produced during sexual processes and localize specifically in the developing new somatic nucleus. Using RNA interference, we show that the development-specific Ku70/Ku80c heterodimer is essential for the recovery of a functional somatic nucleus. Strikingly, at the molecular level, PiggyMac-dependent DNA cleavage is abolished at IES boundaries in cells depleted for Ku80c, resulting in IES retention in the somatic genome. PiggyMac and Ku70a/Ku80c co-purify as a complex when overproduced in a heterologous system. We conclude that Ku has been integrated in the Paramecium DNA cleavage

  14. Recurrent parent genome recovery analysis in a marker-assisted backcrossing program of rice (Oryza sativa L.).

    PubMed

    Miah, Gous; Rafii, Mohd Y; Ismail, Mohd R; Puteh, Adam B; Rahim, Harun A; Latif, Mohammad A

    2015-02-01

    Backcross breeding is the most commonly used method for incorporating a blast resistance gene into a rice cultivar. Linkage between the resistance gene and undesirable units can persist for many generations of backcrossing. Marker-assisted backcrossing (MABC) along with marker-assisted selection (MAS) contributes immensely to overcome the main limitation of the conventional breeding and accelerates recurrent parent genome (RPG) recovery. The MABC approach was employed to incorporate (a) blast resistance gene(s) from the donor parent Pongsu Seribu 1, the blast-resistant local variety in Malaysia, into the genetic background of MR219, a popular high-yielding rice variety that is blast susceptible, to develop a blast-resistant MR219 improved variety. In this perspective, the recurrent parent genome recovery was analyzed in early generations of backcrossing using simple sequence repeat (SSR) markers. Out of 375 SSR markers, 70 markers were found polymorphic between the parents, and these markers were used to evaluate the plants in subsequent generations. Background analysis revealed that the extent of RPG recovery ranged from 75.40% to 91.3% and from 80.40% to 96.70% in BC1F1 and BC2F1 generations, respectively. In this study, the recurrent parent genome content in the selected BC2F2 lines ranged from 92.7% to 97.7%. The average proportion of the recurrent parent in the selected improved line was 95.98%. MAS allowed identification of the plants that are more similar to the recurrent parent for the loci evaluated in backcross generations. The application of MAS with the MABC breeding program accelerated the recovery of the RP genome, reducing the number of generations and the time for incorporating resistance against rice blast.

  15. Proceedings of the relevance of mass spectrometry to DNA sequence determination: Research needs for the Human Genome Program

    SciTech Connect

    Edmonds, C.G.; Smith, R.D. ); Smith, L.M. )

    1990-11-01

    A workshop was sponsored for the US Department of Energy (DOE), Office of Health and Environmental Research by Pacific Northwest Laboratory, April 4--5, 1990, in Seattle, Washington, to examine the potential role of mass spectrometry in the joint DOE/National Institutes of Health (NIH) Human Genome Program. The workshop was occasioned by recent developments in mass spectrometry that are providing new levels for selectivity, sensitivity, and, in particular, new methods of ionization appropriate for large biopolymers such as DNA. During discussions, three general mass spectrometric approaches to the determination of DNA sequence were considered: (1) the mass spectrometric detection of isotopic labels from DNA sequencing mixtures separated using gel electrophoresis, (2) the direct mass spectrometric analysis from direct ionization of unfractionated sequencing mixtures where the measured mass of the constituents functions to identify and order the base sequence (replacing separation by gel electrophoresis), and (3) an approach in which a single highly charged molecular ion of a large DNA segment produced is rapidly sequenced in an ion cyclotron resonance ion trap. The consensus of the workshop was that, on the basis of the new developments, mass spectrometry has the potential to provide the substantial increases in sequencing speed required for the Human Genome Program. 66 refs., 3 tabs.

  16. An Innovative Plant Genomics and Gene Annotation Program for High School, Community College, and University Faculty

    PubMed Central

    Hilgert, Uwe; Nash, E. Bruce; Micklos, David A.

    2008-01-01

    Today's biology educators face the challenge of training their students in modern molecular biology techniques including genomics and bioinformatics. The Dolan DNA Learning Center (DNALC) of Cold Spring Harbor Laboratory has developed and disseminated a bench- and computer-based plant genomics curriculum for biology faculty. In 2007, a five-day “Plant Genomics and Gene Annotation” workshop was held at Florida A&M University in Tallahassee, FL, to enhance participants' knowledge and understanding of plant molecular genetics and assist them in developing and honing their laboratory and computer skills. Florida A&M University is a historically black university with over 95% African-American student enrollment. Sixteen participants, including high school (56%) and community college faculty (25%), attended the workshop. Participants carried out in vitro and in silico experiments with maize, Arabidopsis, soybean, and food products to determine the genotype of the samples. Benefits of the workshop included increased awareness of plant biology research for high school and college level students. Participants completed pre- and postworkshop evaluations for the measurement of effectiveness. Participants demonstrated an overall improvement in their postworkshop evaluation scores. This article provides a detailed description of workshop activities, as well as assessment and long-term support for broad classroom implementation. PMID:18765753

  17. Genome-wide analysis of genetic and epigenetic control of programmed DNA deletion

    PubMed Central

    Swart, Estienne C.; Wilkes, Cyril Denby; Sandoval, Pamela Y.; Arambasic, Miroslav; Sperling, Linda; Nowacki, Mariusz

    2014-01-01

    During the development of the somatic genome from the Paramecium germline genome the bulk of the copies of ∼45 000 unique, internal eliminated sequences (IESs) are deleted. IES targeting is facilitated by two small RNA (sRNA) classes: scnRNAs, which relay epigenetic information from the parental nucleus to the developing nucleus, and iesRNAs, which are produced and used in the developing nucleus. Why only certain IESs require sRNAs for their removal has been enigmatic. By analyzing the silencing effects of three genes: PGM (responsible for DNA excision), DCL2/3 (scnRNA production) and DCL5 (iesRNA production), we identify key properties required for IES elimination. Based on these results, we propose that, depending on the exact combination of their lengths and end bases, some IESs are less efficiently recognized or excised and have a greater requirement for targeting by scnRNAs and iesRNAs. We suggest that the variation in IES retention following silencing of DCL2/3 is not primarily due to scnRNA density, which is comparatively uniform relative to IES retention, but rather the genetic properties of IESs. Taken together, our analyses demonstrate that in Paramecium the underlying genetic properties of developmentally deleted DNA sequences are essential in determining the sensitivity of these sequences to epigenetic control. PMID:25016527

  18. Enhanced genome annotation using structural profiles in the program 3D-PSSM.

    PubMed

    Kelley, L A; MacCallum, R M; Sternberg, M J

    2000-06-02

    A method (three-dimensional position-specific scoring matrix, 3D-PSSM) to recognise remote protein sequence homologues is described. The method combines the power of multiple sequence profiles with knowledge of protein structure to provide enhanced recognition and thus functional assignment of newly sequenced genomes. The method uses structural alignments of homologous proteins of similar three-dimensional structure in the structural classification of proteins (SCOP) database to obtain a structural equivalence of residues. These equivalences are used to extend multiply aligned sequences obtained by standard sequence searches. The resulting large superfamily-based multiple alignment is converted into a PSSM. Combined with secondary structure matching and solvation potentials, 3D-PSSM can recognise structural and functional relationships beyond state-of-the-art sequence methods. In a cross-validated benchmark on 136 homologous relationships unambiguously undetectable by position-specific iterated basic local alignment search tool (PSI-Blast), 3D-PSSM can confidently assign 18 %. The method was applied to the remaining unassigned regions of the Mycoplasma genitalium genome and an additional 13 regions were assigned with 95 % confidence. 3D-PSSM is available to the community as a web server: http://www.bmm.icnet.uk/servers/3dpssm

  19. Identification of programmed translational -1 frameshifting sites in the genome of Saccharomyces cerevisiae.

    PubMed

    Bekaert, Michaël; Richard, Hugues; Prum, Bernard; Rousset, Jean-Pierre

    2005-10-01

    Frameshifting is a recoding event that allows the expression of two polypeptides from the same mRNA molecule. Most recoding events described so far are used by viruses and transposons to express their replicase protein. The very few number of cellular proteins known to be expressed by a -1 ribosomal frameshifting has been identified by chance. The goal of the present work was to set up a systematic strategy, based on complementary bioinformatics, molecular biology, and functional approaches, without a priori knowledge of the mechanism involved. Two independent methods were devised. The first looks for genomic regions in which two ORFs, each carrying a protein pattern, are in a frameshifted arrangement. The second uses Hidden Markov Models and likelihood in a two-step approach. When this strategy was applied to the Saccharomyces cerevisiae genome, 189 candidate regions were found, of which 58 were further functionally investigated. Twenty-eight of them expressed a full-length mRNA covering the two ORFs, and 11 showed a -1 frameshift efficiency varying from 5% to 13% (50-fold higher than background), some of which corresponds to genes with known functions. From other ascomycetes, four frameshifted ORFs are found fully conserved. Strikingly, most of the candidates do not display a classical viral-like frameshift signal and would have escaped a search based on current models of frameshifting. These results strongly suggest that -1 frameshifting might be more widely distributed than previously thought.

  20. Identification of programmed translational -1 frameshifting sites in the genome of Saccharomyces cerevisiae

    PubMed Central

    Bekaert, Michaël; Richard, Hugues; Prum, Bernard; Rousset, Jean-Pierre

    2005-01-01

    Frameshifting is a recoding event that allows the expression of two polypeptides from the same mRNA molecule. Most recoding events described so far are used by viruses and transposons to express their replicase protein. The very few number of cellular proteins known to be expressed by a -1 ribosomal frameshifting has been identified by chance. The goal of the present work was to set up a systematic strategy, based on complementary bioinformatics, molecular biology, and functional approaches, without a priori knowledge of the mechanism involved. Two independent methods were devised. The first looks for genomic regions in which two ORFs, each carrying a protein pattern, are in a frameshifted arrangement. The second uses Hidden Markov Models and likelihood in a two-step approach. When this strategy was applied to the Saccharomyces cerevisiae genome, 189 candidate regions were found, of which 58 were further functionally investigated. Twenty-eight of them expressed a full-length mRNA covering the two ORFs, and 11 showed a -1 frameshift efficiency varying from 5% to 13% (50-fold higher than background), some of which corresponds to genes with known functions. From other ascomycetes, four frameshifted ORFs are found fully conserved. Strikingly, most of the candidates do not display a classical viral-like frameshift signal and would have escaped a search based on current models of frameshifting. These results strongly suggest that -1 frameshifting might be more widely distributed than previously thought. PMID:16204194

  1. Multimedia Presentations on the Human Genome: Implementation and Assessment of a Teaching Program for the Introduction to Genome Science Using a Poster and Animations

    ERIC Educational Resources Information Center

    Kano, Kei; Yahata, Saiko; Muroi, Kaori; Kawakami, Masahiro; Tomoda, Mari; Miyaki, Koichi; Nakayama, Takeo; Kosugi, Shinji; Kato, Kazuto

    2008-01-01

    Genome science, including topics such as gene recombination, cloning, genetic tests, and gene therapy, is now an established part of our daily lives; thus we need to learn genome science to better equip ourselves for the present day. Learning from topics directly related to the human has been suggested to be more effective than learning from…

  2. Integrating Public Health and Deliberative Public Bioethics: Lessons from the Human Genome Project Ethical, Legal, and Social Implications Program.

    PubMed

    Meagher, Karen M; Lee, Lisa M

    2016-01-01

    Public health policy works best when grounded in firm public health standards of evidence and widely shared social values. In this article, we argue for incorporating a specific method of ethical deliberation--deliberative public bioethics--into public health. We describe how deliberative public bioethics is a method of engagement that can be helpful in public health. Although medical, research, and public health ethics can be considered some of what bioethics addresses, deliberative public bioethics offers both a how and where. Using the Human Genome Project Ethical, Legal, and Social Implications program as an example of effective incorporation of deliberative processes to integrate ethics into public health policy, we examine how deliberative public bioethics can integrate both public health and bioethics perspectives into three areas of public health practice: research, education, and health policy. We then offer recommendations for future collaborations that integrate deliberative methods into public health policy and practice.

  3. Integrating Public Health and Deliberative Public Bioethics: Lessons from the Human Genome Project Ethical, Legal, and Social Implications Program

    PubMed Central

    Meagher, Karen M.

    2016-01-01

    Public health policy works best when grounded in firm public health standards of evidence and widely shared social values. In this article, we argue for incorporating a specific method of ethical deliberation—deliberative public bioethics—into public health. We describe how deliberative public bioethics is a method of engagement that can be helpful in public health. Although medical, research, and public health ethics can be considered some of what bioethics addresses, deliberative public bioethics offers both a how and where. Using the Human Genome Project Ethical, Legal, and Social Implications program as an example of effective incorporation of deliberative processes to integrate ethics into public health policy, we examine how deliberative public bioethics can integrate both public health and bioethics perspectives into three areas of public health practice: research, education, and health policy. We then offer recommendations for future collaborations that integrate deliberative methods into public health policy and practice. PMID:26843669

  4. Cultural differences define diagnosis and genomic medicine practice: implications for undiagnosed diseases program in China

    PubMed Central

    Duan, Xiaohong; Markello, Thomas; Adams, David; Toro, Camilo; Tifft, Cynthia; Gahl, William A.; Boerkoel, Cornelius F.

    2013-01-01

    Despite the current acceleration and increasing leadership of Chinese genetics research, genetics and its clinical application have largely been imported to China from the Occident. Neither genetics nor the scientific reductionism underpinning its clinical application is integral to the traditional Chinese worldview. Given that disease concepts and their incumbent diagnoses are historically derived and culturally meaningful, we hypothesize that the cultural expectations of genetic diagnoses and medical genetics practice differs between the Occident and China. Specifically, we suggest that an undiagnosed diseases program in China will differ from the recently established Undiagnosed Diseases Program at the United States National Institutes of Health; a culturally sensitive concept will integrate traditional Chinese understanding of disease with the scientific reductionism of Occidental medicine. PMID:23856975

  5. Cultural differences define diagnosis and genomic medicine practice: implications for undiagnosed diseases program in China.

    PubMed

    Duan, Xiaohong; Markello, Thomas; Adams, David; Toro, Camilo; Tifft, Cynthia; Gahl, William A; Boerkoel, Cornelius F

    2013-09-01

    Despite the current acceleration and increasing leadership of Chinese genetics research, genetics and its clinical application have largely been imported to China from the Occident. Neither genetics nor the scientific reductionism underpinning its clinical application is integral to the traditional Chinese worldview. Given that disease concepts and their incumbent diagnoses are historically derived and culturally meaningful, we hypothesize that the cultural expectations of genetic diagnoses and medical genetics practice differ between the Occident and China. Specifically, we suggest that an undiagnosed diseases program in China will differ from the recently established Undiagnosed Diseases Program at the United States National Institutes of Health; a culturally sensitive concept will integrate traditional Chinese understanding of disease with the scientific reductionism of Occidental medicine.

  6. Adenovirus type 5 exerts genome-wide control over cellular programs governing proliferation, quiescence, and survival

    PubMed Central

    Miller, Daniel L; Myers, Chad L; Rickards, Brenden; Coller, Hilary A; Flint, S Jane

    2007-01-01

    Background Human adenoviruses, such as serotype 5 (Ad5), encode several proteins that can perturb cellular mechanisms that regulate cell cycle progression and apoptosis, as well as those that mediate mRNA production and translation. However, a global view of the effects of Ad5 infection on such programs in normal human cells is not available, despite widespread efforts to develop adenoviruses for therapeutic applications. Results We used two-color hybridization and oligonucleotide microarrays to monitor changes in cellular RNA concentrations as a function of time after Ad5 infection of quiescent, normal human fibroblasts. We observed that the expression of some 2,000 genes, about 10% of those examined, increased or decreased by a factor of two or greater following Ad5 infection, but were not altered in mock-infected cells. Consensus k-means clustering established that the temporal patterns of these changes were unexpectedly complex. Gene Ontology terms associated with cell proliferation were significantly over-represented in several clusters. The results of comparative analyses demonstrate that Ad5 infection induces reversal of the quiescence program and recapitulation of the core serum response, and that only a small subset of the observed changes in cellular gene expression can be ascribed to well characterized functions of the viral E1A and E1B proteins. Conclusion These findings establish that the impact of adenovirus infection on host cell programs is far greater than appreciated hitherto. Furthermore, they provide a new framework for investigating the molecular functions of viral early proteins and information relevant to the design of conditionally replicating adenoviral vectors. PMID:17430596

  7. The pea aphid (Acyrthosiphon pisum) genome encodes two divergent early developmental programs.

    PubMed

    Duncan, Elizabeth J; Leask, Megan P; Dearden, Peter K

    2013-05-01

    The pea aphid (Acyrthosiphon pisum) can reproduce either sexually or asexually (parthenogenetically), giving rise, in each case, to almost identical adults. These two modes of reproduction are accompanied by differences in ovarian morphology and the developmental environment of the offspring, with sexual forms producing eggs that are laid, whereas asexual development occurs within the mother. Here we examine the effect each mode of reproduction has on the expression of key maternal and axis patterning genes; orthodenticle (otd), hunchback (hb), caudal (cad) and nanos (nos). We show that three of these genes (Ap-hb, Ap-otd and Ap-cad) are expressed differently between the sexually and asexually produced oocytes and embryos of the pea aphid. We also show, using immunohistochemistry and cytoskeletal inhibitors, that Ap-hb RNA is localized differently between sexually and asexually produced oocytes, and that this is likely due to differences in the 3' untranslated regions of the RNA. Furthermore, Ap-hb and Ap-otd have extensive expression domains in early sexually produced embryos, but are not expressed at equivalent stages in asexually produced embryos. These differences in expression likely correspond with substantial changes in the gene regulatory networks controlling early development in the pea aphid. These data imply that in the evolution of parthenogenesis a new program has evolved to control the development of asexually produced embryos, whilst retaining the existing, sexual, developmental program. The patterns of modification of these developmental processes mirror the changes that we see in developmental processes between species, in that early acting pathways in development are less constrained, and evolve faster, than later ones. We suggest that the evolution of the novel asexual development pathway in aphids is not a simple modification of an ancestral system, but the evolution of two very different developmental mechanisms occurring within a single

  8. Integrating Genomics and Proteomics Data to Predict Drug Effects Using Binary Linear Programming

    PubMed Central

    Ji, Zhiwei; Su, Jing; Liu, Chenglin; Wang, Hongyan; Huang, Deshuang; Zhou, Xiaobo

    2014-01-01

    The Library of Integrated Network-Based Cellular Signatures (LINCS) project aims to create a network-based understanding of biology by cataloging changes in gene expression and signal transduction that occur when cells are exposed to a variety of perturbations. It is helpful for understanding cell pathways and facilitating drug discovery. Here, we developed a novel approach to infer cell-specific pathways and identify a compound's effects using gene expression and phosphoproteomics data under treatments with different compounds. Gene expression data were employed to infer potential targets of compounds and create a generic pathway map. Binary linear programming (BLP) was then developed to optimize the generic pathway topology based on the mid-stage signaling response of phosphorylation. To demonstrate effectiveness of this approach, we built a generic pathway map for the MCF7 breast cancer cell line and inferred the cell-specific pathways by BLP. The first group of 11 compounds was utilized to optimize the generic pathways, and then 4 compounds were used to identify effects based on the inferred cell-specific pathways. Cross-validation indicated that the cell-specific pathways reliably predicted a compound's effects. Finally, we applied BLP to re-optimize the cell-specific pathways to predict the effects of 4 compounds (trichostatin A, MS-275, staurosporine, and digoxigenin) according to compound-induced topological alterations. Trichostatin A and MS-275 (both HDAC inhibitors) inhibited the downstream pathway of HDAC1 and caused cell growth arrest via activation of p53 and p21; the effects of digoxigenin were totally opposite. Staurosporine blocked the cell cycle via p53 and p21, but also promoted cell growth via activated HDAC1 and its downstream pathway. Our approach was also applied to the PC3 prostate cancer cell line, and the cross-validation analysis showed very good accuracy in predicting effects of 4 compounds. In summary, our computational model can be

  9. Integrating genomics and proteomics data to predict drug effects using binary linear programming.

    PubMed

    Ji, Zhiwei; Su, Jing; Liu, Chenglin; Wang, Hongyan; Huang, Deshuang; Zhou, Xiaobo

    2014-01-01

    The Library of Integrated Network-Based Cellular Signatures (LINCS) project aims to create a network-based understanding of biology by cataloging changes in gene expression and signal transduction that occur when cells are exposed to a variety of perturbations. It is helpful for understanding cell pathways and facilitating drug discovery. Here, we developed a novel approach to infer cell-specific pathways and identify a compound's effects using gene expression and phosphoproteomics data under treatments with different compounds. Gene expression data were employed to infer potential targets of compounds and create a generic pathway map. Binary linear programming (BLP) was then developed to optimize the generic pathway topology based on the mid-stage signaling response of phosphorylation. To demonstrate effectiveness of this approach, we built a generic pathway map for the MCF7 breast cancer cell line and inferred the cell-specific pathways by BLP. The first group of 11 compounds was utilized to optimize the generic pathways, and then 4 compounds were used to identify effects based on the inferred cell-specific pathways. Cross-validation indicated that the cell-specific pathways reliably predicted a compound's effects. Finally, we applied BLP to re-optimize the cell-specific pathways to predict the effects of 4 compounds (trichostatin A, MS-275, staurosporine, and digoxigenin) according to compound-induced topological alterations. Trichostatin A and MS-275 (both HDAC inhibitors) inhibited the downstream pathway of HDAC1 and caused cell growth arrest via activation of p53 and p21; the effects of digoxigenin were totally opposite. Staurosporine blocked the cell cycle via p53 and p21, but also promoted cell growth via activated HDAC1 and its downstream pathway. Our approach was also applied to the PC3 prostate cancer cell line, and the cross-validation analysis showed very good accuracy in predicting effects of 4 compounds. In summary, our computational model can be

  10. Genomic Encyclopedia of Fungi

    SciTech Connect

    Grigoriev, Igor

    2012-08-10

    Genomes of fungi relevant to energy and environment are in focus of the Fungal Genomic Program at the US Department of Energy Joint Genome Institute (JGI). Its key project, the Genomics Encyclopedia of Fungi, targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts), and explores fungal diversity by means of genome sequencing and analysis. Over 150 fungal genomes have been sequenced by JGI to date and released through MycoCosm (www.jgi.doe.gov/fungi), a fungal web-portal, which integrates sequence and functional data with genome analysis tools for user community. Sequence analysis supported by functional genomics leads to developing parts list for complex systems ranging from ecosystems of biofuel crops to biorefineries. Recent examples of such parts suggested by comparative genomics and functional analysis in these areas are presented here.

  11. Generating information-rich high-throughput experimental materials genomes using functional clustering via multitree genetic programming and information theory.

    PubMed

    Suram, Santosh K; Haber, Joel A; Jin, Jian; Gregoire, John M

    2015-04-13

    High-throughput experimental methodologies are capable of synthesizing, screening and characterizing vast arrays of combinatorial material libraries at a very rapid rate. These methodologies strategically employ tiered screening wherein the number of compositions screened decreases as the complexity, and very often the scientific information obtained from a screening experiment, increases. The algorithm used for down-selection of samples from higher throughput screening experiment to a lower throughput screening experiment is vital in achieving information-rich experimental materials genomes. The fundamental science of material discovery lies in the establishment of composition-structure-property relationships, motivating the development of advanced down-selection algorithms which consider the information value of the selected compositions, as opposed to simply selecting the best performing compositions from a high throughput experiment. Identification of property fields (composition regions with distinct composition-property relationships) in high throughput data enables down-selection algorithms to employ advanced selection strategies, such as the selection of representative compositions from each field or selection of compositions that span the composition space of the highest performing field. Such strategies would greatly enhance the generation of data-driven discoveries. We introduce an informatics-based clustering of composition-property functional relationships using a combination of information theory and multitree genetic programming concepts for identification of property fields in a composition library. We demonstrate our approach using a complex synthetic composition-property map for a 5 at. % step ternary library consisting of four distinct property fields and finally explore the application of this methodology for capturing relationships between composition and catalytic activity for the oxygen evolution reaction for 5429 catalyst compositions in a

  12. Ontology for Genome Comparison and Genomic Rearrangements

    PubMed Central

    Flanagan, Keith; Stevens, Robert; Pocock, Matthew; Lee, Pete

    2004-01-01

    We present an ontology for describing genomes, genome comparisons, their evolution and biological function. This ontology will support the development of novel genome comparison algorithms and aid the community in discussing genomic evolution. It provides a framework for communication about comparative genomics, and a basis upon which further automated analysis can be built. The nomenclature defined by the ontology will foster clearer communication between biologists, and also standardize terms used by data publishers in the results of analysis programs. The overriding aim of this ontology is the facilitation of consistent annotation of genomes through computational methods, rather than human annotators. To this end, the ontology includes definitions that support computer analysis and automated transfer of annotations between genomes, rather than relying upon human mediation. PMID:18629137

  13. Phytozome Comparative Plant Genomics Portal

    SciTech Connect

    Goodstein, David; Batra, Sajeev; Carlson, Joseph; Hayes, Richard; Phillips, Jeremy; Shu, Shengqiang; Schmutz, Jeremy; Rokhsar, Daniel

    2014-09-09

    The Dept. of Energy Joint Genome Institute is a genomics user facility supporting DOE mission science in the areas of Bioenergy, Carbon Cycling, and Biogeochemistry. The Plant Program at the JGI applies genomic, analytical, computational and informatics platforms and methods to: 1. Understand and accelerate the improvement (domestication) of bioenergy crops 2. Characterize and moderate plant response to climate change 3. Use comparative genomics to identify constrained elements and infer gene function 4. Build high quality genomic resource platforms of JGI Plant Flagship genomes for functional and experimental work 5. Expand functional genomic resources for Plant Flagship genomes

  14. Fisher: a program for the detection of H/ACA snoRNAs using MFE secondary structure prediction and comparative genomics – assessment and update

    PubMed Central

    Freyhult, Eva; Edvardsson, Sverker; Tamas, Ivica; Moulton, Vincent; Poole, Anthony M

    2008-01-01

    Background The H/ACA family of small nucleolar RNAs (snoRNAs) plays a central role in guiding the pseudouridylation of ribosomal RNA (rRNA). In an effort to systematically identify the complete set of rRNA-modifying H/ACA snoRNAs from the genome sequence of the budding yeast, Saccharomyces cerevisiae, we developed a program – Fisher – and previously presented several candidate snoRNAs based on our analysis [1]. Findings In this report, we provide a brief update of this work, which was aborted after the publication of experimentally-identified snoRNAs [2] identical to candidates we had identified bioinformatically using Fisher. Our motivation for revisiting this work is to report on the status of the candidate snoRNAs described in [1], and secondly, to report that a modified version of Fisher together with the available multiple yeast genome sequences was able to correctly identify several H/ACA snoRNAs for modification sites not identified by the snoGPS program [3]. While we are no longer developing Fisher, we briefly consider the merits of the Fisher algorithm relative to snoGPS, which may be of use for workers considering pursuing a similar search strategy for the identification of small RNAs. The modified source code for Fisher is made available as supplementary material. Conclusion Our results confirm the validity of using minimum free energy (MFE) secondary structure prediction to guide comparative genomic screening for RNA families with few sequence constraints. PMID:18710502

  15. Optimizing the creation of base populations for aquaculture breeding programs using phenotypic and genomic data and its consequences on genetic progress.

    PubMed

    Fernández, Jesús; Toro, Miguel Á; Sonesson, Anna K; Villanueva, Beatriz

    2014-01-01

    The success of an aquaculture breeding program critically depends on the way in which the base population of breeders is constructed since all the genetic variability for the traits included originally in the breeding goal as well as those to be included in the future is contained in the initial founders. Traditionally, base populations were created from a number of wild strains by sampling equal numbers from each strain. However, for some aquaculture species improved strains are already available and, therefore, mean phenotypic values for economically important traits can be used as a criterion to optimize the sampling when creating base populations. Also, the increasing availability of genome-wide genotype information in aquaculture species could help to refine the estimation of relationships within and between candidate strains and, thus, to optimize the percentage of individuals to be sampled from each strain. This study explores the advantages of using phenotypic and genome-wide information when constructing base populations for aquaculture breeding programs in terms of initial and subsequent trait performance and genetic diversity level. Results show that a compromise solution between diversity and performance can be found when creating base populations. Up to 6% higher levels of phenotypic performance can be achieved at the same level of global diversity in the base population by optimizing the selection of breeders instead of sampling equal numbers from each strain. The higher performance observed in the base population persisted during 10 generations of phenotypic selection applied in the subsequent breeding program.

  16. Optimizing the creation of base populations for aquaculture breeding programs using phenotypic and genomic data and its consequences on genetic progress

    PubMed Central

    Fernández, Jesús; Toro, Miguel Á.; Sonesson, Anna K.; Villanueva, Beatriz

    2014-01-01

    The success of an aquaculture breeding program critically depends on the way in which the base population of breeders is constructed since all the genetic variability for the traits included originally in the breeding goal as well as those to be included in the future is contained in the initial founders. Traditionally, base populations were created from a number of wild strains by sampling equal numbers from each strain. However, for some aquaculture species improved strains are already available and, therefore, mean phenotypic values for economically important traits can be used as a criterion to optimize the sampling when creating base populations. Also, the increasing availability of genome-wide genotype information in aquaculture species could help to refine the estimation of relationships within and between candidate strains and, thus, to optimize the percentage of individuals to be sampled from each strain. This study explores the advantages of using phenotypic and genome-wide information when constructing base populations for aquaculture breeding programs in terms of initial and subsequent trait performance and genetic diversity level. Results show that a compromise solution between diversity and performance can be found when creating base populations. Up to 6% higher levels of phenotypic performance can be achieved at the same level of global diversity in the base population by optimizing the selection of breeders instead of sampling equal numbers from each strain. The higher performance observed in the base population persisted during 10 generations of phenotypic selection applied in the subsequent breeding program. PMID:25505485

  17. Assessing the impact of natural service bulls and genotype by environment interactions on genetic gain and inbreeding in organic dairy cattle genomic breeding programs.

    PubMed

    Yin, T; Wensch-Dorendorf, M; Simianer, H; Swalve, H H; König, S

    2014-06-01

    The objective of the present study was to compare genetic gain and inbreeding coefficients of dairy cattle in organic breeding program designs by applying stochastic simulations. Evaluated breeding strategies were: (i) selecting bulls from conventional breeding programs, and taking into account genotype by environment (G×E) interactions, (ii) selecting genotyped bulls within the organic environment for artificial insemination (AI) programs and (iii) selecting genotyped natural service bulls within organic herds. The simulated conventional population comprised 148 800 cows from 2976 herds with an average herd size of 50 cows per herd, and 1200 cows were assigned to 60 organic herds. In a young bull program, selection criteria of young bulls in both production systems (conventional and organic) were either 'conventional' estimated breeding values (EBV) or genomic estimated breeding values (GEBV) for two traits with low (h 2=0.05) and moderate heritability (h 2=0.30). GEBV were calculated for different accuracies (r mg), and G×E interactions were considered by modifying originally simulated true breeding values in the range from r g=0.5 to 1.0. For both traits (h 2=0.05 and 0.30) and r mg⩾0.8, genomic selection of bulls directly in the organic population and using selected bulls via AI revealed higher genetic gain than selecting young bulls in the larger conventional population based on EBV; also without the existence of G×E interactions. Only for pronounced G×E interactions (r g=0.5), and for highly accurate GEBV for natural service bulls (r mg>0.9), results suggests the use of genotyped organic natural service bulls instead of implementing an AI program. Inbreeding coefficients of selected bulls and their offspring were generally lower when basing selection decisions for young bulls on GEBV compared with selection strategies based on pedigree indices.

  18. A genome-wide association study of malting quality across eight U.S. barley breeding programs

    Technology Transfer Automated Retrieval System (TEKTRAN)

    This study leverages the breeding data of 1,862 breeding lines evaluated in 97 field trials for genome-wide association study of malting quality traits in barley. The breeding lines were six-row and two-row barley advanced breeding lines from eight barley breeding populations established at six pub...

  19. Whole-Genome Sequences of Two Campylobacter coli Isolates from the Antimicrobial Resistance Monitoring Program in Colombia

    PubMed Central

    Bernal, Johan F.; Donado-Godoy, Pilar; Valencia, María Fernanda; León, Maribel; Gómez, Yolanda; Rodríguez, Fernando; Agarwala, Richa; Landsman, David

    2016-01-01

    Campylobacter coli, along with Campylobacter jejuni, is a major agent of gastroenteritis and acute enterocolitis in humans. We report the whole-genome sequences of two multidrug-resistance C. coli strains, isolated from the Colombian poultry chain. The isolates contain a variety of antimicrobial resistance genes for aminoglycosides, lincosamides, fluoroquinolones, and tetracycline. PMID:26988048

  20. Whole-Genome Sequences of Two Campylobacter coli Isolates from the Antimicrobial Resistance Monitoring Program in Colombia.

    PubMed

    Bernal, Johan F; Donado-Godoy, Pilar; Valencia, María Fernanda; León, Maribel; Gómez, Yolanda; Rodríguez, Fernando; Agarwala, Richa; Landsman, David; Mariño-Ramírez, Leonardo

    2016-03-17

    Campylobacter coli, along with Campylobacter jejuni, is a major agent of gastroenteritis and acute enterocolitis in humans. We report the whole-genome sequences of two multidrug-resistance C. coli strains, isolated from the Colombian poultry chain. The isolates contain a variety of antimicrobial resistance genes for aminoglycosides, lincosamides, fluoroquinolones, and tetracycline.

  1. Microbial genome program report: Optical approaches for physical mapping and sequence assembly of the Deinococcus radiodurans chromosome

    SciTech Connect

    Schwartz, David C.

    1999-11-23

    Maps of genomic or cloned DNA are frequently constructed by analyzing the cleavage patterns produced by restriction enzymes. Restriction enzymes are remarkable reagents that faithfully cleave only at specific sequences of between 4 and 8 nucleotides, which vary according to the specific enzymes. Restriction enzymes are reliable, numerous, and easily obtainable and presently, there are approximately 250 different sequences represented among thousands of enzymes. Restriction maps characterize gene structure and even entire genomes. Furthermore, such maps provide a useful scaffold for the alignment and verification of sequence data. Restriction maps generated by computer and predicted from the sequence are aligned with the actual restriction map. Restriction enzyme action has traditionally been assayed by gel electrophoresis. This technique separates cleaved molecules on the basis of their nobilities under the influence of an applied electrical field, within a gel separation matrix (small fragments have a greater mobility than large ones). Although gel electrophoresis distinguishes different sized DNA fragments (known as a fingerprint), the original order of these fragments remains unknown. The subsequent task of determining the order of such fragments is a labor intensive task, especially when making restriction maps of whole genomes, and therefore despite its obvious utility to genome analysis, it is not widely used.

  2. Genotype by environment interaction and the use of unbalanced historical data for genomic selection in an international wheat breeding program

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genomic selection (GS) offers breeders the possibility of using historic data and unbalanced breeding trials to form training populations for predicting the performance of new lines. However, in using datasets that are unbalanced over time and space, there is increasing exposure to particular genoty...

  3. Querying genomic databases

    SciTech Connect

    Baehr, A.; Hagstrom, R.; Joerg, D.; Overbeek, R.

    1991-09-01

    A natural-language interface has been developed that retrieves genomic information by using a simple subset of English. The interface spares the biologist from the task of learning database-specific query languages and computer programming. Currently, the interface deals with the E. coli genome. It can, however, be readily extended and shows promise as a means of easy access to other sequenced genomic databases as well.

  4. Genome Maps, a new generation genome browser

    PubMed Central

    Medina, Ignacio; Salavert, Francisco; Sanchez, Rubén; de Maria, Alejandro; Alonso, Roberto; Escobar, Pablo; Bleda, Marta; Dopazo, Joaquín

    2013-01-01

    Genome browsers have gained importance as more genomes and related genomic information become available. However, the increase of information brought about by new generation sequencing technologies is, at the same time, causing a subtle but continuous decrease in the efficiency of conventional genome browsers. Here, we present Genome Maps, a genome browser that implements an innovative model of data transfer and management. The program uses highly efficient technologies from the new HTML5 standard, such as scalable vector graphics, that optimize workloads at both server and client sides and ensure future scalability. Thus, data management and representation are entirely carried out by the browser, without the need of any Java Applet, Flash or other plug-in technology installation. Relevant biological data on genes, transcripts, exons, regulatory features, single-nucleotide polymorphisms, karyotype and so forth, are imported from web services and are available as tracks. In addition, several DAS servers are already included in Genome Maps. As a novelty, this web-based genome browser allows the local upload of huge genomic data files (e.g. VCF or BAM) that can be dynamically visualized in real time at the client side, thus facilitating the management of medical data affected by privacy restrictions. Finally, Genome Maps can easily be integrated in any web application by including only a few lines of code. Genome Maps is an open source collaborative initiative available in the GitHub repository (https://github.com/compbio-bigdata-viz/genome-maps). Genome Maps is available at: http://www.genomemaps.org. PMID:23748955

  5. Genome Maps, a new generation genome browser.

    PubMed

    Medina, Ignacio; Salavert, Francisco; Sanchez, Rubén; de Maria, Alejandro; Alonso, Roberto; Escobar, Pablo; Bleda, Marta; Dopazo, Joaquín

    2013-07-01

    Genome browsers have gained importance as more genomes and related genomic information become available. However, the increase of information brought about by new generation sequencing technologies is, at the same time, causing a subtle but continuous decrease in the efficiency of conventional genome browsers. Here, we present Genome Maps, a genome browser that implements an innovative model of data transfer and management. The program uses highly efficient technologies from the new HTML5 standard, such as scalable vector graphics, that optimize workloads at both server and client sides and ensure future scalability. Thus, data management and representation are entirely carried out by the browser, without the need of any Java Applet, Flash or other plug-in technology installation. Relevant biological data on genes, transcripts, exons, regulatory features, single-nucleotide polymorphisms, karyotype and so forth, are imported from web services and are available as tracks. In addition, several DAS servers are already included in Genome Maps. As a novelty, this web-based genome browser allows the local upload of huge genomic data files (e.g. VCF or BAM) that can be dynamically visualized in real time at the client side, thus facilitating the management of medical data affected by privacy restrictions. Finally, Genome Maps can easily be integrated in any web application by including only a few lines of code. Genome Maps is an open source collaborative initiative available in the GitHub repository (https://github.com/compbio-bigdata-viz/genome-maps). Genome Maps is available at: http://www.genomemaps.org.

  6. Kaposi’s Sarcoma-Associated Herpesvirus Genome Programming during the Early Stages of Primary Infection of Peripheral Blood Mononuclear Cells

    PubMed Central

    Jha, Hem C.; Lu, Jie; Verma, Subhash C.; Banerjee, Shuvomoy; Mehta, Devan

    2014-01-01

    ABSTRACT The early period of Kaposi’s sarcoma-associated herpesvirus (KSHV) infection involves the dynamic expression of viral genes, which are temporally and epigenetically regulated. KSHV can effectively infect and persist in endothelial as well as human B cells with different gene expression patterns. To understand the temporal epigenetic changes which occur when KSHV infects the lymphocytic compartment, we infected human peripheral blood mononuclear cells (PBMCs) and comprehensively analyzed the changes which occurred at the binding sites of virally encoded lytic as well as latent proteins along with epigenetic modifications across the KSHV genome during early primary infection. Using chromatin immunoprecipitation (ChIP) assays, we showed that the KSHV genome acquires a uniquely distinct histone modification pattern of methylation (H3K4me3, H3K9me3, and H3K27me3) and acetylation (H3Ac) during de novo infection of human PBMCs. This pattern showed that the epigenetic changes were temporally controlled. The binding profiles of KSHV latent protein LANA and the immediate early proteins RTA and K8 showed specific patterns at different times postinfection, which reflects the gene expression program. Further analysis demonstrated that KSHV can concurrently express lytic and latent genes which were associated with histone modifications at these specific regions on the viral genome. We identified three KSHV genes, K3, ORF49, and ORF64, which exhibited different profiles of histone modifications during the early stages of PBMC infection. These studies established a distinct pattern of epigenetic modification which correlates with viral gene expression temporally regulated during the first 7 days of PBMC infection and provides clues to the regulatory program required for successful infection by KSHV of human PBMCs. PMID:25516617

  7. Program in Functional Genomics of Autoimmunity and Immunology of yhe University of Kentucky and the University of Alabama

    SciTech Connect

    Alan M Kaplan

    2012-10-12

    This grant will be used to augment the equipment infrastructure and core support at the University of Kentucky and the University of Alabama particularly in the areas of genomics/informatics, molecular analysis and cell separation. In addition, we will promote collaborative research interactions through scientific workshops and exchange of scientists, as well as joint exploration of the role of immune receptors as targets in autoimmunity and host defense, innate and adaptive immune responses, and mucosal immunity in host defense.

  8. Genome-wide analysis of MEF2 transcriptional program reveals synaptic target genes and neuronal activity-dependent polyadenylation site selection

    PubMed Central

    Flavell, Steven W.; Kim, Tae-Kyung; Gray, Jesse M.; Harmin, David A.; Hemberg, Martin; Hong, Elizabeth J.; Markenscoff-Papadimitriou, Eirene; Bear, Daniel M.; Greenberg, Michael E.

    2009-01-01

    SUMMARY Although many transcription factors are known to control important aspects of neural development, the genome-wide programs that are directly regulated by these factors are not known. We have characterized the genetic program that is activated by MEF2, a key regulator of activity-dependent synapse development. These MEF2 target genes have diverse functions at synapses, revealing a broad role for MEF2 in synapse development. Several of the MEF2 targets are mutated in human neurological disorders including epilepsy and autism-spectrum disorders, suggesting that these disorders may be caused by disruption of an activity-dependent gene program that controls synapse development. Our analyses also reveal that neuronal activity promotes alternative polyadenylation site usage at many of the MEF2 target genes, leading to the production of truncated mRNAs that may have different functions than their full-length counterparts. Taken together, these analyses suggest that the ubiquitously expressed transcription factor MEF2 regulates an intricate transcriptional program in neurons that controls synapse development. PMID:19109909

  9. Human Genome Project

    SciTech Connect

    Block, S.; Cornwall, J.; Dally, W.; Dyson, F.; Fortson, N.; Joyce, G.; Kimble, H. J.; Lewis, N.; Max, C.; Prince, T.; Schwitters, R.; Weinberger, P.; Woodin, W. H.

    1998-01-04

    The study reviews Department of Energy supported aspects of the United States Human Genome Project, the joint National Institutes of Health/Department of Energy program to characterize all human genetic material, to discover the set of human genes, and to render them accessible for further biological study. The study concentrates on issues of technology, quality assurance/control, and informatics relevant to current effort on the genome project and needs beyond it. Recommendations are presented on areas of the genome program that are of particular interest to and supported by the Department of Energy.

  10. Satellite-tagged transcribing sequences in Bubalus bubalis genome undergo programmed modulation in meiocytes: possible implications for transcriptional inactivation.

    PubMed

    Chattopadhyay, M; Gangadharan, S; Kapur, V; Azfer, M A; Prakash, B; Ali, S

    2001-09-01

    We cloned and sequenced a 1378 bp BamHI satellite DNA fraction from the water buffalo Bubalus bubalis and have studied its expression in different tissues. The GC-rich sequences of the resultant contig pDS5 crosshybridize only with bovid DNA and are not conserved evolutionarily. Typing of buffalo genomic DNA using pDS5 with several restriction enzymes revealed multilocus monomorphic bands. Similar typing of cattle, buffalo, goat, sheep, and gaur genomic DNA revealed variations in copy number and allele length giving rise to species-specific band patterns. Expression study of pDS5 in bubaline samples by RNA slot-blot, Northern blot, and RT-PCR showed various levels of signal in all the somatic tissues and germline cells except heart. A GenBank database search revealed homology of pDS5 sequences in the 5' region from nt 1-1261 with collagen gene. An AluI typing analysis of DNA from bubaline semen samples showed consistent loss of two bands. The presence of corresponding bands in somatic tissues suggests a sequence modulation within the pDS5 array in meiocytes during spermatogenesis, which is restored in the somatic cells after fertilization. Modulation of the satellite-tagged transcribing sequence in the meiocytes may be a mechanism of its inactivation.

  11. Genomic-Based Optimum Contribution in Conservation and Genetic Improvement Programs with Antagonistic Fitness and Productivity Traits.

    PubMed

    Sánchez-Molano, Enrique; Pong-Wong, Ricardo; Banos, Georgios

    2016-01-01

    Animal selection for genetic improvement of productivity may lead to an increase in inbreeding through the use of techniques that enhance the reproductive capability of selected animals. Therefore, breeding strategies aim to balance maintaining genetic variability and acceptable fitness levels with increasing productivity. The present study demonstrates the effectiveness of genomic-based optimum contribution strategies at addressing this objective when fitness and productivity are genetically antagonistic traits. Strategies are evaluated in directional selection (increasing productivity) or conservation (maintaining fitness) scenarios. In the former case, substantial rates of genetic gain can be achieved while greatly constraining the rate of increase in inbreeding. Under a conservation approach, inbreeding depression can be effectively halted while also achieving a modest rate of genetic gain for productivity. Furthermore, the use of optimum contribution strategies when combined with a simple non-random mating scheme (minimum kinship method) showed an additional delay in the increase of inbreeding in the short term. In conclusion, genomic-based optimum contribution methods can be effectively used to control inbreeding and inbreeding depression, and still allow genetic gain for productivity traits even when fitness and productivity are antagonistically correlated.

  12. Genome-wide functional analysis of CREB/long-term memory-dependent transcription reveals distinct basal and memory gene expression programs.

    PubMed

    Lakhina, Vanisha; Arey, Rachel N; Kaletsky, Rachel; Kauffman, Amanda; Stein, Geneva; Keyes, William; Xu, Daniel; Murphy, Coleen T

    2015-01-21

    Induced CREB activity is a hallmark of long-term memory, but the full repertoire of CREB transcriptional targets required specifically for memory is not known in any system. To obtain a more complete picture of the mechanisms involved in memory, we combined memory training with genome-wide transcriptional analysis of C. elegans CREB mutants. This approach identified 757 significant CREB/memory-induced targets and confirmed the involvement of known memory genes from other organisms, but also suggested new mechanisms and novel components that may be conserved through mammals. CREB mediates distinct basal and memory transcriptional programs at least partially through spatial restriction of CREB activity: basal targets are regulated primarily in nonneuronal tissues, while memory targets are enriched for neuronal expression, emanating from CREB activity in AIM neurons. This suite of novel memory-associated genes will provide a platform for the discovery of orthologous mammalian long-term memory components.

  13. HIV-1 and M-PMV RNA Nuclear Export Elements Program Viral Genomes for Distinct Cytoplasmic Trafficking Behaviors

    PubMed Central

    Pocock, Ginger M.; Becker, Jordan T.; Swanson, Chad M.; Ahlquist, Paul; Sherer, Nathan M.

    2016-01-01

    Retroviruses encode cis-acting RNA nuclear export elements that override nuclear retention of intron-containing viral mRNAs including the full-length, unspliced genomic RNAs (gRNAs) packaged into assembling virions. The HIV-1 Rev-response element (RRE) recruits the cellular nuclear export receptor CRM1 (also known as exportin-1/XPO1) using the viral protein Rev, while simple retroviruses encode constitutive transport elements (CTEs) that directly recruit components of the NXF1(Tap)/NXT1(p15) mRNA nuclear export machinery. How gRNA nuclear export is linked to trafficking machineries in the cytoplasm upstream of virus particle assembly is unknown. Here we used long-term (>24 h), multicolor live cell imaging to directly visualize HIV-1 gRNA nuclear export, translation, cytoplasmic trafficking, and virus particle production in single cells. We show that the HIV-1 RRE regulates unique, en masse, Rev- and CRM1-dependent “burst-like” transitions of mRNAs from the nucleus to flood the cytoplasm in a non-localized fashion. By contrast, the CTE derived from Mason-Pfizer monkey virus (M-PMV) links gRNAs to microtubules in the cytoplasm, driving them to cluster markedly to the centrosome that forms the pericentriolar core of the microtubule-organizing center (MTOC). Adding each export element to selected heterologous mRNAs was sufficient to confer each distinct export behavior, as was directing Rev/CRM1 or NXF1/NXT1 transport modules to mRNAs using a site-specific RNA tethering strategy. Moreover, multiple CTEs per transcript enhanced MTOC targeting, suggesting that a cooperative mechanism links NXF1/NXT1 to microtubules. Combined, these results reveal striking, unexpected features of retroviral gRNA nucleocytoplasmic transport and demonstrate roles for mRNA export elements that extend beyond nuclear pores to impact gRNA distribution in the cytoplasm. PMID:27070420

  14. HIV-1 and M-PMV RNA Nuclear Export Elements Program Viral Genomes for Distinct Cytoplasmic Trafficking Behaviors.

    PubMed

    Pocock, Ginger M; Becker, Jordan T; Swanson, Chad M; Ahlquist, Paul; Sherer, Nathan M

    2016-04-01

    Retroviruses encode cis-acting RNA nuclear export elements that override nuclear retention of intron-containing viral mRNAs including the full-length, unspliced genomic RNAs (gRNAs) packaged into assembling virions. The HIV-1 Rev-response element (RRE) recruits the cellular nuclear export receptor CRM1 (also known as exportin-1/XPO1) using the viral protein Rev, while simple retroviruses encode constitutive transport elements (CTEs) that directly recruit components of the NXF1(Tap)/NXT1(p15) mRNA nuclear export machinery. How gRNA nuclear export is linked to trafficking machineries in the cytoplasm upstream of virus particle assembly is unknown. Here we used long-term (>24 h), multicolor live cell imaging to directly visualize HIV-1 gRNA nuclear export, translation, cytoplasmic trafficking, and virus particle production in single cells. We show that the HIV-1 RRE regulates unique, en masse, Rev- and CRM1-dependent "burst-like" transitions of mRNAs from the nucleus to flood the cytoplasm in a non-localized fashion. By contrast, the CTE derived from Mason-Pfizer monkey virus (M-PMV) links gRNAs to microtubules in the cytoplasm, driving them to cluster markedly to the centrosome that forms the pericentriolar core of the microtubule-organizing center (MTOC). Adding each export element to selected heterologous mRNAs was sufficient to confer each distinct export behavior, as was directing Rev/CRM1 or NXF1/NXT1 transport modules to mRNAs using a site-specific RNA tethering strategy. Moreover, multiple CTEs per transcript enhanced MTOC targeting, suggesting that a cooperative mechanism links NXF1/NXT1 to microtubules. Combined, these results reveal striking, unexpected features of retroviral gRNA nucleocytoplasmic transport and demonstrate roles for mRNA export elements that extend beyond nuclear pores to impact gRNA distribution in the cytoplasm.

  15. Prenatal stress-induced programming of genome-wide promoter DNA methylation in 5-HTT-deficient mice.

    PubMed

    Schraut, K G; Jakob, S B; Weidner, M T; Schmitt, A G; Scholz, C J; Strekalova, T; El Hajj, N; Eijssen, L M T; Domschke, K; Reif, A; Haaf, T; Ortega, G; Steinbusch, H W M; Lesch, K P; Van den Hove, D L

    2014-10-21

    The serotonin transporter gene (5-HTT/SLC6A4)-linked polymorphic region has been suggested to have a modulatory role in mediating effects of early-life stress exposure on psychopathology rendering carriers of the low-expression short (s)-variant more vulnerable to environmental adversity in later life. The underlying molecular mechanisms of this gene-by-environment interaction are not well understood, but epigenetic regulation including differential DNA methylation has been postulated to have a critical role. Recently, we used a maternal restraint stress paradigm of prenatal stress (PS) in 5-HTT-deficient mice and showed that the effects on behavior and gene expression were particularly marked in the hippocampus of female 5-Htt+/- offspring. Here, we examined to which extent these effects are mediated by differential methylation of DNA. For this purpose, we performed a genome-wide hippocampal DNA methylation screening using methylated-DNA immunoprecipitation (MeDIP) on Affymetrix GeneChip Mouse Promoter 1.0 R arrays. Using hippocampal DNA from the same mice as assessed before enabled us to correlate gene-specific DNA methylation, mRNA expression and behavior. We found that 5-Htt genotype, PS and their interaction differentially affected the DNA methylation signature of numerous genes, a subset of which showed overlap with the expression profiles of the corresponding transcripts. For example, a differentially methylated region in the gene encoding myelin basic protein (Mbp) was associated with its expression in a 5-Htt-, PS- and 5-Htt × PS-dependent manner. Subsequent fine-mapping of this Mbp locus linked the methylation status of two specific CpG sites to Mbp expression and anxiety-related behavior. In conclusion, hippocampal DNA methylation patterns and expression profiles of female prenatally stressed 5-Htt+/- mice suggest that distinct molecular mechanisms, some of which are promoter methylation-dependent, contribute to the behavioral effects of the 5-Htt

  16. Breeding-assisted genomics.

    PubMed

    Poland, Jesse

    2015-04-01

    The revolution of inexpensive sequencing has ushered in an unprecedented age of genomics. The promise of using this technology to accelerate plant breeding is being realized with a vision of genomics-assisted breeding that will lead to rapid genetic gain for expensive and difficult traits. The reality is now that robust phenotypic data is an increasing limiting resource to complement the current wealth of genomic information. While genomics has been hailed as the discipline to fundamentally change the scope of plant breeding, a more symbiotic relationship is likely to emerge. In the context of developing and evaluating large populations needed for functional genomics, none excel in this area more than plant breeders. While genetic studies have long relied on dedicated, well-structured populations, the resources dedicated to these populations in the context of readily available, inexpensive genotyping is making this philosophy less tractable relative to directly focusing functional genomics on material in breeding programs. Through shifting effort for basic genomic studies from dedicated structured populations, to capturing the entire scope of genetic determinants in breeding lines, we can move towards not only furthering our understanding of functional genomics in plants, but also rapidly improving crops for increased food security, availability and nutrition.

  17. Bisprimer--a program for the design of primers for bisulfite-based genomic sequencing of both plant and Mammalian DNA samples.

    PubMed

    Kovacova, Viera; Janousek, Bohuslav

    2012-01-01

    Plants and animals differ in the sequence context of the methylated sites in DNA. Plants exhibit cytosine methylation in CG, CHG, and CHH sites, whereas CG methylation is the only form present in mammals (with an exception of the early embryonic development). This fact must be taken into account in the design of primers for bisulfite-based genomic sequencing because CHG and CHH sites can remain unmodified. Surprisingly, no user-friendly primer design program is publicly available that could be used to design primers in plants and to simultaneously check the properties of primers such as the potential for primer-dimer formation. For studies concentrating on particular DNA loci, the correct design of primers is crucial. The program, called BisPrimer, includes 2 different subprograms for the primer design, the first one for mammals and the second one for angiosperm plants. Each subprogram is divided into 2 variants. The first variant serves to design primers that preferentially bind to the bisulfite-modified primer-binding sites (C to U conversion). This type of primer preferentially amplifies the bisulfite-converted DNA strands. This feature can help to avoid problems connected with an incomplete bisulfite modification that can sometimes occur for technical reasons. The second variant is intended for the analysis of samples that are supposed to consist of a mixture of DNA molecules that have different levels of cytosine methylation (e.g., pollen DNA). In this case, the aim is to minimize the selection in favor of either less methylated or more methylated molecules.

  18. Fungal Genome Sequencing and Bioenergy

    SciTech Connect

    Baker, Scott E.; Thykaer, Jette; Adney, William S.; Brettin, T.; Brockman, Fred J.; D'haeseleer, Patrik; Martinez, Antonio D.; Miller, R. M.; Rokhsar, Daniel S.; Schadt, Christopher W.; Torok, Tamas; Tuskan, Gerald; Bennett, Joan W.; Berka, Randy; Briggs, Steve; Heitman, Joseph; Taylor, John; Turgeon, Barbara G.; Werner-Washburne, Maggie; Himmel, Michael E.

    2008-09-30

    To date, the number of ongoing filamentous fungal genome sequencing projects is almost tenfold fewer than those of bacterial and archaeal genome projects. The fungi chosen for sequencing represent narrow kingdom diversity; most are pathogens or models. We advocate an ambitious, forward-looking phylogenetic-based genome sequencing program, designed to capture metabolic diversity within the fungal kingdom, thereby enhancing research into alternative bioenergy sources, bioremediation, and fungal-environment interactions.

  19. Collaborators | Office of Cancer Genomics

    Cancer.gov

    The TARGET initiative is jointly managed within the National Cancer Institute (NCI) by the Office of Cancer Genomics (OCG)Opens in a New Tab and the Cancer Therapy Evaluation Program (CTEP)Opens in a New Tab.

  20. Genomic Resources for Cancer Epidemiology

    Cancer.gov

    This page provides links to research resources, complied by the Epidemiology and Genomics Research Program, that may be of interest to genetic epidemiologists conducting cancer research, but is not exhaustive.

  1. Molluscan Evolutionary Genomics

    SciTech Connect

    Simison, W. Brian; Boore, Jeffrey L.

    2005-12-01

    In the last 20 years there have been dramatic advances in techniques of high-throughput DNA sequencing, most recently accelerated by the Human Genome Project, a program that has determined the three billion base pair code on which we are based. Now this tremendous capability is being directed at other genome targets that are being sampled across the broad range of life. This opens up opportunities as never before for evolutionary and organismal biologists to address questions of both processes and patterns of organismal change. We stand at the dawn of a new 'modern synthesis' period, paralleling that of the early 20th century when the fledgling field of genetics first identified the underlying basis for Darwin's theory. We must now unite the efforts of systematists, paleontologists, mathematicians, computer programmers, molecular biologists, developmental biologists, and others in the pursuit of discovering what genomics can teach us about the diversity of life. Genome-level sampling for mollusks to date has mostly been limited to mitochondrial genomes and it is likely that these will continue to provide the best targets for broad phylogenetic sampling in the near future. However, we are just beginning to see an inroad into complete nuclear genome sequencing, with several mollusks and other eutrochozoans having been selected for work about to begin. Here, we provide an overview of the state of molluscan mitochondrial genomics, highlight a few of the discoveries from this research, outline the promise of broadening this dataset, describe upcoming projects to sequence whole mollusk nuclear genomes, and challenge the community to prepare for making the best use of these data.

  2. Efficient Breeding by Genomic Mating.

    PubMed

    Akdemir, Deniz; Sánchez, Julio I

    2016-01-01

    Selection in breeding programs can be done by using phenotypes (phenotypic selection), pedigree relationship (breeding value selection) or molecular markers (marker assisted selection or genomic selection). All these methods are based on truncation selection, focusing on the best performance of parents before mating. In this article we proposed an approach to breeding, named genomic mating, which focuses on mating instead of truncation selection. Genomic mating uses information in a similar fashion to genomic selection but includes information on complementation of parents to be mated. Following the efficiency frontier surface, genomic mating uses concepts of estimated breeding values, risk (usefulness) and coefficient of ancestry to optimize mating between parents. We used a genetic algorithm to find solutions to this optimization problem and the results from our simulations comparing genomic selection, phenotypic selection and the mating approach indicate that current approach for breeding complex traits is more favorable than phenotypic and genomic selection. Genomic mating is similar to genomic selection in terms of estimating marker effects, but in genomic mating the genetic information and the estimated marker effects are used to decide which genotypes should be crossed to obtain the next breeding population.

  3. Efficient Breeding by Genomic Mating

    PubMed Central

    Akdemir, Deniz; Sánchez, Julio I.

    2016-01-01

    Selection in breeding programs can be done by using phenotypes (phenotypic selection), pedigree relationship (breeding value selection) or molecular markers (marker assisted selection or genomic selection). All these methods are based on truncation selection, focusing on the best performance of parents before mating. In this article we proposed an approach to breeding, named genomic mating, which focuses on mating instead of truncation selection. Genomic mating uses information in a similar fashion to genomic selection but includes information on complementation of parents to be mated. Following the efficiency frontier surface, genomic mating uses concepts of estimated breeding values, risk (usefulness) and coefficient of ancestry to optimize mating between parents. We used a genetic algorithm to find solutions to this optimization problem and the results from our simulations comparing genomic selection, phenotypic selection and the mating approach indicate that current approach for breeding complex traits is more favorable than phenotypic and genomic selection. Genomic mating is similar to genomic selection in terms of estimating marker effects, but in genomic mating the genetic information and the estimated marker effects are used to decide which genotypes should be crossed to obtain the next breeding population. PMID:27965707

  4. Fungal genome sequencing: basic biology to biotechnology.

    PubMed

    Sharma, Krishna Kant

    2016-08-01

    The genome sequences provide a first glimpse into the genomic basis of the biological diversity of filamentous fungi and yeast. The genome sequence of the budding yeast, Saccharomyces cerevisiae, with a small genome size, unicellular growth, and rich history of genetic and molecular analyses was a milestone of early genomics in the 1990s. The subsequent completion of fission yeast, Schizosaccharomyces pombe and genetic model, Neurospora crassa initiated a revolution in the genomics of the fungal kingdom. In due course of time, a substantial number of fungal genomes have been sequenced and publicly released, representing the widest sampling of genomes from any eukaryotic kingdom. An ambitious genome-sequencing program provides a wealth of data on metabolic diversity within the fungal kingdom, thereby enhancing research into medical science, agriculture science, ecology, bioremediation, bioenergy, and the biotechnology industry. Fungal genomics have higher potential to positively affect human health, environmental health, and the planet's stored energy. With a significant increase in sequenced fungal genomes, the known diversity of genes encoding organic acids, antibiotics, enzymes, and their pathways has increased exponentially. Currently, over a hundred fungal genome sequences are publicly available; however, no inclusive review has been published. This review is an initiative to address the significance of the fungal genome-sequencing program and provides the road map for basic and applied research.

  5. Genome-wide association and genomic selection in animal breeding.

    PubMed

    Hayes, Ben; Goddard, Mike

    2010-11-01

    Results from genome-wide association studies in livestock, and humans, has lead to the conclusion that the effect of individual quantitative trait loci (QTL) on complex traits, such as yield, are likely to be small; therefore, a large number of QTL are necessary to explain genetic variation in these traits. Given this genetic architecture, gains from marker-assisted selection (MAS) programs using only a small number of DNA markers to trace a limited number of QTL is likely to be small. This has lead to the development of alternative technology for using the available dense single nucleotide polymorphism (SNP) information, called genomic selection. Genomic selection uses a genome-wide panel of dense markers so that all QTL are likely to be in linkage disequilibrium with at least one SNP. The genomic breeding values are predicted to be the sum of the effect of these SNPs across the entire genome. In dairy cattle breeding, the accuracy of genomic estimated breeding values (GEBV) that can be achieved and the fact that these are available early in life have lead to rapid adoption of the technology. Here, we discuss the design of experiments necessary to achieve accurate prediction of GEBV in future generations in terms of the number of markers necessary and the size of the reference population where marker effects are estimated. We also present a simple method for implementing genomic selection using a genomic relationship matrix. Future challenges discussed include using whole genome sequence data to improve the accuracy of genomic selection and management of inbreeding through genomic relationships.

  6. Genome Science: A Video Tour of the Washington University Genome Sequencing Center for High School and Undergraduate Students

    ERIC Educational Resources Information Center

    Flowers, Susan K.; Easter, Carla; Holmes, Andrea; Cohen, Brian; Bednarski, April E.; Mardis, Elaine R.; Wilson, Richard K.; Elgin, Sarah C. R.

    2005-01-01

    Sequencing of the human genome has ushered in a new era of biology. The technologies developed to facilitate the sequencing of the human genome are now being applied to the sequencing of other genomes. In 2004, a partnership was formed between Washington University School of Medicine Genome Sequencing Center's Outreach Program and Washington…

  7. Antarctic Genomics

    PubMed Central

    Clarke, Andrew; Cockell, Charles S.; Convey, Peter; Detrich III, H. William; Fraser, Keiron P. P.; Johnston, Ian A.; Methe, Barbara A.; Murray, Alison E.; Peck, Lloyd S.; Römisch, Karin; Rogers, Alex D.

    2004-01-01

    With the development of genomic science and its battery of technologies, polar biology stands on the threshold of a revolution, one that will enable the investigation of important questions of unprecedented scope and with extraordinary depth and precision. The exotic organisms of polar ecosystems are ideal candidates for genomic analysis. Through such analyses, it will be possible to learn not only the novel features that enable polar organisms to survive, and indeed thrive, in their extreme environments, but also fundamental biological principles that are common to most, if not all, organisms. This article aims to review recent developments in Antarctic genomics and to demonstrate the global context of such studies. PMID:18629155

  8. Genomic Testing

    MedlinePlus

    ... Services released a report identifying gaps in the regulation, oversight, and usefulness of genetic testing. They expressed ... December 20, 2016 Content source: Center for Surveillance, Epidemiology and Laboratory Services (CSELS) , Public Health Genomics Email ...

  9. The life cycle of a genome project: perspectives and guidelines inspired by insect genome projects

    PubMed Central

    Papanicolaou, Alexie

    2016-01-01

    Many research programs on non-model species biology have been empowered by genomics. In turn, genomics is underpinned by a reference sequence and ancillary information created by so-called “genome projects”. The most reliable genome projects are the ones created as part of an active research program and designed to address specific questions but their life extends past publication. In this opinion paper I outline four key insights that have facilitated maintaining genomic communities: the key role of computational capability, the iterative process of building genomic resources, the value of community participation and the importance of manual curation. Taken together, these ideas can and do ensure the longevity of genome projects and the growing non-model species community can use them to focus a discussion with regards to its future genomic infrastructure. PMID:27006757

  10. 78 FR 68856 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2013-11-15

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... Review Officer, Scientific Review Branch, National Human Genome Research Institute, National Institutes... of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes...

  11. 78 FR 20933 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2013-04-08

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Special Emphasis Panel Loan Repayment Program... applications. Place: National Human Genome Research Institute, Room 3055, 5635 Fishers Lane, Rockville,...

  12. Endometrial and acute myeloid leukemia cancer genomes characterized

    Cancer.gov

    Two studies from The Cancer Genome Atlas (TCGA) program reveal details about the genomic landscapes of acute myeloid leukemia (AML) and endometrial cancer. Both provide new insights into the molecular underpinnings of these cancers.

  13. Genotypes are useful for more than genomic evaluation

    Technology Transfer Automated Retrieval System (TEKTRAN)

    New services that provide pedigree discovery, breed composition, mating programs, genomic inbreeding, fertility defects, and inheritance tracking all are possible from low cost genotyping, in addition to genomic evaluation. Genetic markers let breeders select among sibs before their phenotypes becam...

  14. Breeding nursery tissue collection for possible genomic analysis

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Phenotyping is considered a major bottleneck in breeding programs. With new genomic technologies, high throughput genotype schemes are constantly being developed. However, every genomic technology requires phenotypic data to inform prediction models generated from the technology. Forage breeders con...

  15. The Cancer Genome Atlas (TCGA): The next stage - TCGA

    Cancer.gov

    The Cancer Genome Atlas (TCGA), the NIH research program that has helped set the standards for characterizing the genomic underpinnings of dozens of cancers on a large scale, is moving to its next phase.

  16. Genome Radio Project: Quarterly report

    SciTech Connect

    1997-08-01

    The process of conducting background research for the programs of the Genome Radio Project is continuing. The most developed of the program ``backgrounders`` have been reviewed by series and program advisors from various fields. Preliminary and background interviews have been conducted with dozens of potential program participants and advisors. Structurally, efforts are being directed toward developing and formalizing the project and series advisor relationships so that the best use can be made of those experts who have offered to assist the project in its presentation of program content. The library of research materials has been expanded considerably, creating a useful resource library for the producers.

  17. The Atlas Genome Assembly System

    PubMed Central

    Havlak, Paul; Chen, Rui; Durbin, K. James; Egan, Amy; Ren, Yanru; Song, Xing-Zhi; Weinstock, George M.; Gibbs, Richard A.

    2004-01-01

    Atlas is a suite of programs developed for assembly of genomes by a “combined approach” that uses DNA sequence reads from both BACs and whole-genome shotgun (WGS) libraries. The BAC clones afford advantages of localized assembly with reduced computational load, and provide a robust method for dealing with repeated sequences. Inclusion of WGS sequences facilitates use of different clone insert sizes and reduces data production costs. A core function of Atlas software is recruitment of WGS sequences into appropriate BACs based on sequence overlaps. Because construction of consensus sequences is from local assembly of these reads, only small (<0.1%) units of the genome are assembled at a time. Once assembled, each BAC is used to derive a genomic layout. This “sequence-based” growth of the genome map has greater precision than with non-sequence-based methods. Use of BACs allows correction of artifacts due to repeats at each stage of the process. This is aided by ancillary data such as BAC fingerprint, other genomic maps, and syntenic relations with other genomes. Atlas was used to assemble a draft DNA sequence of the rat genome; its major components including overlapper and split-scaffold are also being used in pure WGS projects. PMID:15060016

  18. Genome Sequencing.

    PubMed

    Verma, Mansi; Kulshrestha, Samarth; Puri, Ayush

    2017-01-01

    Genome sequencing is an important step toward correlating genotypes with phenotypic characters. Sequencing technologies are important in many fields in the life sciences, including functional genomics, transcriptomics, oncology, evolutionary biology, forensic sciences, and many more. The era of sequencing has been divided into three generations. First generation sequencing involved sequencing by synthesis (Sanger sequencing) and sequencing by cleavage (Maxam-Gilbert sequencing). Sanger sequencing led to the completion of various genome sequences (including human) and provided the foundation for development of other sequencing technologies. Since then, various techniques have been developed which can overcome some of the limitations of Sanger sequencing. These techniques are collectively known as "Next-generation sequencing" (NGS), and are further classified into second and third generation technologies. Although NGS methods have many advantages in terms of speed, cost, and parallelism, the accuracy and read length of Sanger sequencing is still superior and has confined the use of NGS mainly to resequencing genomes. Consequently, there is a continuing need to develop improved real time sequencing techniques. This chapter reviews some of the options currently available and provides a generic workflow for sequencing a genome.

  19. Genome databases

    SciTech Connect

    Courteau, J.

    1991-10-11

    Since the Genome Project began several years ago, a plethora of databases have been developed or are in the works. They range from the massive Genome Data Base at Johns Hopkins University, the central repository of all gene mapping information, to small databases focusing on single chromosomes or organisms. Some are publicly available, others are essentially private electronic lab notebooks. Still others limit access to a consortium of researchers working on, say, a single human chromosome. An increasing number incorporate sophisticated search and analytical software, while others operate as little more than data lists. In consultation with numerous experts in the field, a list has been compiled of some key genome-related databases. The list was not limited to map and sequence databases but also included the tools investigators use to interpret and elucidate genetic data, such as protein sequence and protein structure databases. Because a major goal of the Genome Project is to map and sequence the genomes of several experimental animals, including E. coli, yeast, fruit fly, nematode, and mouse, the available databases for those organisms are listed as well. The author also includes several databases that are still under development - including some ambitious efforts that go beyond data compilation to create what are being called electronic research communities, enabling many users, rather than just one or a few curators, to add or edit the data and tag it as raw or confirmed.

  20. Single genome amplification of proviral HIV-1 DNA from dried blood spot specimens collected during early infant screening programs in Lusaka, Zambia.

    PubMed

    Seu, Lillian; Mwape, Innocent; Guffey, M Bradford

    2014-07-01

    The ability to evaluate individual HIV-1 virions from the quasispecies of vertically infected infants was evaluated in a field setting at the Centre for Infectious Disease Research in Zambia. Infant heel-prick blood specimens were spotted onto dried blood spot (DBS) filter paper cards at government health clinics. Nucleic acid was extracted and used as a template for HIV-1 proviral DNA detection by a commercial Amplicor HIV-1 PCR test (Roche, version 1.5). On samples that tested positive by commercial diagnostic assay, amplification of DNA was performed using an in-house assay of the 5' and 3' region of the HIV-1 genome. Additionally, fragments covering 1200 nucleotides within pol (full length protease and partial reverse transcriptase) and 1400 nucleotides within env (variable 1-variable 5 region) were further analyzed by single genome amplification (SGA). In summary, we have demonstrated an in-house assay for amplifying the 5' and 3' proviral HIV-1 DNA as well as pol and env proviral DNA fragments from DBS cards collected and analyzed entirely in Zambia. In conclusion, this study shows the feasibility of utilizing DBS cards to amplify the whole proviral HIV-1 genome as well as perform SGA on key HIV-1 genes.

  1. Listeria Genomics

    NASA Astrophysics Data System (ADS)

    Cabanes, Didier; Sousa, Sandra; Cossart, Pascale

    The opportunistic intracellular foodborne pathogen Listeria monocytogenes has become a paradigm for the study of host-pathogen interactions and bacterial adaptation to mammalian hosts. Analysis of L. monocytogenes infection has provided considerable insight into how bacteria invade cells, move intracellularly, and disseminate in tissues, as well as tools to address fundamental processes in cell biology. Moreover, the vast amount of knowledge that has been gathered through in-depth comparative genomic analyses and in vivo studies makes L. monocytogenes one of the most well-studied bacterial pathogens. This chapter provides an overview of progress in the exploration of genomic, transcriptomic, and proteomic data in Listeria spp. to understand genome evolution and diversity, as well as physiological aspects of metabolism used by bacteria when growing in diverse environments, in particular in infected hosts.

  2. Genome Improvement at JGI-HAGSC

    SciTech Connect

    Grimwood, Jane; Schmutz, Jeremy J.; Myers, Richard M.

    2012-03-03

    Since the completion of the sequencing of the human genome, the Joint Genome Institute (JGI) has rapidly expanded its scientific goals in several DOE mission-relevant areas. At the JGI-HAGSC, we have kept pace with this rapid expansion of projects with our focus on assessing, assembling, improving and finishing eukaryotic whole genome shotgun (WGS) projects for which the shotgun sequence is generated at the Production Genomic Facility (JGI-PGF). We follow this by combining the draft WGS with genomic resources generated at JGI-HAGSC or in collaborator laboratories (including BAC end sequences, genetic maps and FLcDNA sequences) to produce an improved draft sequence. For eukaryotic genomes important to the DOE mission, we then add further information from directed experiments to produce reference genomic sequences that are publicly available for any scientific researcher. Also, we have continued our program for producing BAC-based finished sequence, both for adding information to JGI genome projects and for small BAC-based sequencing projects proposed through any of the JGI sequencing programs. We have now built our computational expertise in WGS assembly and analysis and have moved eukaryotic genome assembly from the JGI-PGF to JGI-HAGSC. We have concentrated our assembly development work on large plant genomes and complex fungal and algal genomes.

  3. Fungal Genomics for Energy and Environment

    SciTech Connect

    Grigoriev, Igor V.

    2013-03-11

    Genomes of fungi relevant to energy and environment are in focus of the Fungal Genomic Program at the US Department of Energy Joint Genome Institute (JGI). One of its projects, the Genomics Encyclopedia of Fungi, targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts) by means of genome sequencing and analysis. New chapters of the Encyclopedia can be opened with user proposals to the JGI Community Sequencing Program (CSP). Another JGI project, the 1000 fungal genomes, explores fungal diversity on genome level at scale and is open for users to nominate new species for sequencing. Over 200 fungal genomes have been sequenced by JGI to date and released through MycoCosm (www.jgi.doe.gov/fungi), a fungal web-portal, which integrates sequence and functional data with genome analysis tools for user community. Sequence analysis supported by functional genomics leads to developing parts list for complex systems ranging from ecosystems of biofuel crops to biorefineries. Recent examples of such parts suggested by comparative genomics and functional analysis in these areas are presented here.

  4. Teaching residents genomic pathology: a novel approach for new technology.

    PubMed

    Haspel, Richard L

    2013-03-01

    Genomics-based diagnostics have become part of patient care. As pathologists have the expertise in clinical laboratory testing as well as access to patient samples, all genomic medicine is genomic pathology. This article will review the evidence that there is a critical need for pathology resident training in genomics. Several individual program curricula are described as well as the progress of the Training Residents in Genomics Working Group. This group has made significant advances toward developing, implementing, and evaluating a national curriculum in genomics for pathology residents. The novel approach of the Training Residents in Genomics Working Group can be used as a model for training pathology professionals in any new technology.

  5. The Human Genome Project: Past, Present, and Future

    NASA Astrophysics Data System (ADS)

    Watson, James D.

    1990-04-01

    This article presents a short discussion of the development of the human genome program in the United States, a summary of the current status of the organization and administration of the National Institutes of Health component of the program, and some prospects for the future directions of the program and the applications of genome information.

  6. Genome mapping

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genome maps can be thought of much like road maps except that, instead of traversing across land, they traverse across the chromosomes of an organism. Genetic markers serve as landmarks along the chromosome and provide researchers information as to how close they may be to a gene or region of inter...

  7. The Human Genome Initiative of the Department of Energy

    SciTech Connect

    1988-01-01

    The structural characterization of genes and elucidation of their encoded functions have become a cornerstone of modern health research, biology and biotechnology. A genome program is an organized effort to locate and identify the functions of all the genes of an organism. Beginning with the DOE-sponsored, 1986 human genome workshop at Santa Fe, the value of broadly organized efforts supporting total genome characterization became a subject of intensive study. There is now national recognition that benefits will rapidly accrue from an effective scientific infrastructure for total genome research. In the US genome research is now receiving dedicated funds. Several other nations are implementing genome programs. Supportive infrastructure is being improved through both national and international cooperation. The Human Genome Initiative of the Department of Energy (DOE) is a focused program of Resource and Technology Development, with objectives of speeding and bringing economies to the national human genome effort. This report relates the origins and progress of the Initiative. 34 refs.

  8. The Human Genome Initiative of the Department of Energy

    DOE R&D Accomplishments Database

    1988-01-01

    The structural characterization of genes and elucidation of their encoded functions have become a cornerstone of modern health research, biology and biotechnology. A genome program is an organized effort to locate and identify the functions of all the genes of an organism. Beginning with the DOE-sponsored, 1986 human genome workshop at Santa Fe, the value of broadly organized efforts supporting total genome characterization became a subject of intensive study. There is now national recognition that benefits will rapidly accrue from an effective scientific infrastructure for total genome research. In the US genome research is now receiving dedicated funds. Several other nations are implementing genome programs. Supportive infrastructure is being improved through both national and international cooperation. The Human Genome Initiative of the Department of Energy (DOE) is a focused program of Resource and Technology Development, with objectives of speeding and bringing economies to the national human genome effort. This report relates the origins and progress of the Initiative.

  9. Genome cartography: charting the apicomplexan genome.

    PubMed

    Kissinger, Jessica C; DeBarry, Jeremy

    2011-08-01

    Genes reside in particular genomic contexts that can be mapped at many levels. Historically, 'genetic maps' were used primarily to locate genes. Recent technological advances in the determination of genome sequences have made the analysis and comparison of whole genomes possible and increasingly tractable. What do we see if we shift our focus from gene content (the 'inventory' of genes contained within a genome) to the composition and organization of a genome? This review examines what has been learned about the evolution of the apicomplexan genome as well as the significance and impact of genomic location on our understanding of the eukaryotic genome and parasite biology.

  10. Fueling the Future with Fungal Genomes

    SciTech Connect

    Grigoriev, Igor V.

    2014-10-27

    Genomes of fungi relevant to energy and environment are in focus of the JGI Fungal Genomic Program. One of its projects, the Genomics Encyclopedia of Fungi, targets fungi related to plant health (symbionts and pathogens) and biorefinery processes (cellulose degradation and sugar fermentation) by means of genome sequencing and analysis. New chapters of the Encyclopedia can be opened with user proposals to the JGI Community Science Program (CSP). Another JGI project, the 1000 fungal genomes, explores fungal diversity on genome level at scale and is open for users to nominate new species for sequencing. Over 400 fungal genomes have been sequenced by JGI to date and released through MycoCosm (www.jgi.doe.gov/fungi), a fungal web-portal, which integrates sequence and functional data with genome analysis tools for user community. Sequence analysis supported by functional genomics will lead to developing parts list for complex systems ranging from ecosystems of biofuel crops to biorefineries. Recent examples of such ‘parts’ suggested by comparative genomics and functional analysis in these areas are presented here.

  11. Personal genomics services: whose genomes?

    PubMed Central

    Gurwitz, David; Bregman-Eschet, Yael

    2009-01-01

    New companies offering personal whole-genome information services over the internet are dynamic and highly visible players in the personal genomics field. For fees currently ranging from US$399 to US$2500 and a vial of saliva, individuals can now purchase online access to their individual genetic information regarding susceptibility to a range of chronic diseases and phenotypic traits based on a genome-wide SNP scan. Most of the companies offering such services are based in the United States, but their clients may come from nearly anywhere in the world. Although the scientific validity, clinical utility and potential future implications of such services are being hotly debated, several ethical and regulatory questions related to direct-to-consumer (DTC) marketing strategies of genetic tests have not yet received sufficient attention. For example, how can we minimize the risk of unauthorized third parties from submitting other people's DNA for testing? Another pressing question concerns the ownership of (genotypic and phenotypic) information, as well as the unclear legal status of customers regarding their own personal information. Current legislation in the US and Europe falls short of providing clear answers to these questions. Until the regulation of personal genomics services catches up with the technology, we call upon commercial providers to self-regulate and coordinate their activities to minimize potential risks to individual privacy. We also point out some specific steps, along the trustee model, that providers of DTC personal genomics services as well as regulators and policy makers could consider for addressing some of the concerns raised below. PMID:19259127

  12. Citrus Genomics

    PubMed Central

    Talon, Manuel; Gmitter Jr., Fred G.

    2008-01-01

    Citrus is one of the most widespread fruit crops globally, with great economic and health value. It is among the most difficult plants to improve through traditional breeding approaches. Currently, there is risk of devastation by diseases threatening to limit production and future availability to the human population. As technologies rapidly advance in genomic science, they are quickly adapted to address the biological challenges of the citrus plant system and the world's industries. The historical developments of linkage mapping, markers and breeding, EST projects, physical mapping, an international citrus genome sequencing project, and critical functional analysis are described. Despite the challenges of working with citrus, there has been substantial progress. Citrus researchers engaged in international collaborations provide optimism about future productivity and contributions to the benefit of citrus industries worldwide and to the human population who can rely on future widespread availability of this health-promoting and aesthetically pleasing fruit crop. PMID:18509486

  13. The human genome project and international health

    SciTech Connect

    Watson, J.D.; Cook-Deegan, R.M. )

    1990-06-27

    The human genome project is designed to provide common resources for the study of human genetics, and to assist biomedical researchers in their assault on disease. The main benefit will be to provide several kinds of maps of the human genome, and those of other organisms, to permit rapid isolation of genes for further study about DNA structure and function. This article describes genome research programs in developed and developing countries, and the international efforts that have contributed to genome research programs. For example, the large-scale collaborations to study Duchenne's muscular dystrophy, Huntington's disease, Alzheimer's disease, cystic fibrosis involve collaborators from many nations and families spread throughout the world. In the USA, the US Department of Energy was first to start a dedicated genome research program in 1987. Since then, another major government program has begun at the National Center for Human Genome Research of the National Institutes of Health. Italy, China, Australia, France, Canada, and Japan have genome research programs also.

  14. Ancient genomics

    PubMed Central

    Der Sarkissian, Clio; Allentoft, Morten E.; Ávila-Arcos, María C.; Barnett, Ross; Campos, Paula F.; Cappellini, Enrico; Ermini, Luca; Fernández, Ruth; da Fonseca, Rute; Ginolhac, Aurélien; Hansen, Anders J.; Jónsson, Hákon; Korneliussen, Thorfinn; Margaryan, Ashot; Martin, Michael D.; Moreno-Mayar, J. Víctor; Raghavan, Maanasa; Rasmussen, Morten; Velasco, Marcela Sandoval; Schroeder, Hannes; Schubert, Mikkel; Seguin-Orlando, Andaine; Wales, Nathan; Gilbert, M. Thomas P.; Willerslev, Eske; Orlando, Ludovic

    2015-01-01

    The past decade has witnessed a revolution in ancient DNA (aDNA) research. Although the field's focus was previously limited to mitochondrial DNA and a few nuclear markers, whole genome sequences from the deep past can now be retrieved. This breakthrough is tightly connected to the massive sequence throughput of next generation sequencing platforms and the ability to target short and degraded DNA molecules. Many ancient specimens previously unsuitable for DNA analyses because of extensive degradation can now successfully be used as source materials. Additionally, the analytical power obtained by increasing the number of sequence reads to billions effectively means that contamination issues that have haunted aDNA research for decades, particularly in human studies, can now be efficiently and confidently quantified. At present, whole genomes have been sequenced from ancient anatomically modern humans, archaic hominins, ancient pathogens and megafaunal species. Those have revealed important functional and phenotypic information, as well as unexpected adaptation, migration and admixture patterns. As such, the field of aDNA has entered the new era of genomics and has provided valuable information when testing specific hypotheses related to the past. PMID:25487338

  15. Ancient genomics.

    PubMed

    Der Sarkissian, Clio; Allentoft, Morten E; Ávila-Arcos, María C; Barnett, Ross; Campos, Paula F; Cappellini, Enrico; Ermini, Luca; Fernández, Ruth; da Fonseca, Rute; Ginolhac, Aurélien; Hansen, Anders J; Jónsson, Hákon; Korneliussen, Thorfinn; Margaryan, Ashot; Martin, Michael D; Moreno-Mayar, J Víctor; Raghavan, Maanasa; Rasmussen, Morten; Velasco, Marcela Sandoval; Schroeder, Hannes; Schubert, Mikkel; Seguin-Orlando, Andaine; Wales, Nathan; Gilbert, M Thomas P; Willerslev, Eske; Orlando, Ludovic

    2015-01-19

    The past decade has witnessed a revolution in ancient DNA (aDNA) research. Although the field's focus was previously limited to mitochondrial DNA and a few nuclear markers, whole genome sequences from the deep past can now be retrieved. This breakthrough is tightly connected to the massive sequence throughput of next generation sequencing platforms and the ability to target short and degraded DNA molecules. Many ancient specimens previously unsuitable for DNA analyses because of extensive degradation can now successfully be used as source materials. Additionally, the analytical power obtained by increasing the number of sequence reads to billions effectively means that contamination issues that have haunted aDNA research for decades, particularly in human studies, can now be efficiently and confidently quantified. At present, whole genomes have been sequenced from ancient anatomically modern humans, archaic hominins, ancient pathogens and megafaunal species. Those have revealed important functional and phenotypic information, as well as unexpected adaptation, migration and admixture patterns. As such, the field of aDNA has entered the new era of genomics and has provided valuable information when testing specific hypotheses related to the past.

  16. Jean C. Zenklusen, M.S., Ph.D., Discusses the NCI Genomics Data Commons at AACR 2014 - TCGA

    Cancer.gov

    At the AACR 2014 meeting, Dr. Jean C. Zenklusen, Director of The Cancer Genome Atlas Program Office, highlights the Genomics Data Commons, a harmonized data repository that will allow simultaneous access and analysis of NCI genomics data, including The Ca

  17. Genomic selection in wheat using genotyping-by-sequencing

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genomic selection (GS) is a promising approach to accelerate gain in plant breeding programs. In GS, genome-wide molecular markers are used to predict total breeding values and make selections of individuals or breeding lines prior to phenotyping. One premise of applying GS is that low-cost genome...

  18. gmos: Rapid Detection of Genome Mosaicism over Short Evolutionary Distances

    PubMed Central

    Domazet-Lošo, Mirjana; Domazet-Lošo, Tomislav

    2016-01-01

    Prokaryotic and viral genomes are often altered by recombination and horizontal gene transfer. The existing methods for detecting recombination are primarily aimed at viral genomes or sets of loci, since the expensive computation of underlying statistical models often hinders the comparison of complete prokaryotic genomes. As an alternative, alignment-free solutions are more efficient, but cannot map (align) a query to subject genomes. To address this problem, we have developed gmos (Genome MOsaic Structure), a new program that determines the mosaic structure of query genomes when compared to a set of closely related subject genomes. The program first computes local alignments between query and subject genomes and then reconstructs the query mosaic structure by choosing the best local alignment for each query region. To accomplish the analysis quickly, the program mostly relies on pairwise alignments and constructs multiple sequence alignments over short overlapping subject regions only when necessary. This fine-tuned implementation achieves an efficiency comparable to an alignment-free tool. The program performs well for simulated and real data sets of closely related genomes and can be used for fast recombination detection; for instance, when a new prokaryotic pathogen is discovered. As an example, gmos was used to detect genome mosaicism in a pathogenic Enterococcus faecium strain compared to seven closely related genomes. The analysis took less than two minutes on a single 2.1 GHz processor. The output is available in fasta format and can be visualized using an accessory program, gmosDraw (freely available with gmos). PMID:27846272

  19. gmos: Rapid Detection of Genome Mosaicism over Short Evolutionary Distances.

    PubMed

    Domazet-Lošo, Mirjana; Domazet-Lošo, Tomislav

    2016-01-01

    Prokaryotic and viral genomes are often altered by recombination and horizontal gene transfer. The existing methods for detecting recombination are primarily aimed at viral genomes or sets of loci, since the expensive computation of underlying statistical models often hinders the comparison of complete prokaryotic genomes. As an alternative, alignment-free solutions are more efficient, but cannot map (align) a query to subject genomes. To address this problem, we have developed gmos (Genome MOsaic Structure), a new program that determines the mosaic structure of query genomes when compared to a set of closely related subject genomes. The program first computes local alignments between query and subject genomes and then reconstructs the query mosaic structure by choosing the best local alignment for each query region. To accomplish the analysis quickly, the program mostly relies on pairwise alignments and constructs multiple sequence alignments over short overlapping subject regions only when necessary. This fine-tuned implementation achieves an efficiency comparable to an alignment-free tool. The program performs well for simulated and real data sets of closely related genomes and can be used for fast recombination detection; for instance, when a new prokaryotic pathogen is discovered. As an example, gmos was used to detect genome mosaicism in a pathogenic Enterococcus faecium strain compared to seven closely related genomes. The analysis took less than two minutes on a single 2.1 GHz processor. The output is available in fasta format and can be visualized using an accessory program, gmosDraw (freely available with gmos).

  20. inGeno – an integrated genome and ortholog viewer for improved genome to genome comparisons

    PubMed Central

    Liang, Chunguang; Dandekar, Thomas

    2006-01-01

    Background Systematic genome comparisons are an important tool to reveal gene functions, pathogenic features, metabolic pathways and genome evolution in the era of post-genomics. Furthermore, such comparisons provide important clues for vaccines and drug development. Existing genome comparison software often lacks accurate information on orthologs, the function of similar genes identified and genome-wide reports and lists on specific functions. All these features and further analyses are provided here in the context of a modular software tool "inGeno" written in Java with Biojava subroutines. Results InGeno provides a user-friendly interactive visualization platform for sequence comparisons (comprehensive reciprocal protein – protein comparisons) between complete genome sequences and all associated annotations and features. The comparison data can be acquired from several different sequence analysis programs in flexible formats. Automatic dot-plot analysis includes output reduction, filtering, ortholog testing and linear regression, followed by smart clustering (local collinear blocks; LCBs) to reveal similar genome regions. Further, the system provides genome alignment and visualization editor, collinear relationships and strain-specific islands. Specific annotations and functions are parsed, recognized, clustered, logically concatenated and visualized and summarized in reports. Conclusion As shown in this study, inGeno can be applied to study and compare in particular prokaryotic genomes against each other (gram positive and negative as well as close and more distantly related species) and has been proven to be sensitive and accurate. This modular software is user-friendly and easily accommodates new routines to meet specific user-defined requirements. PMID:17054788

  1. A pilot examination of the genome-wide DNA methylation signatures of subjects entering and exiting short-term alcohol dependence treatment programs

    PubMed Central

    Philibert, Robert A; Penaluna, Brandan; White, Teresa; Shires, Sarah; Gunter, Tracy; Liesveld, Jill; Erwin, Cheryl; Hollenbeck, Nancy; Osborn, Terry

    2014-01-01

    Alcoholism has a profound impact on millions of people throughout the world. However, the ability to determine if a patient needs treatment is hindered by reliance on self-reporting and the clinician’s capability to monitor the patient’s response to treatment is challenged by the lack of reliable biomarkers. Using a genome-wide approach, we have previously shown that chronic alcohol use is associated with methylation changes in DNA from human cell lines. In this pilot study, we now examine DNA methylation in peripheral mononuclear cell DNA gathered from subjects as they enter and leave short-term alcohol treatment. When compared with abstinent controls, subjects with heavy alcohol use show widespread changes in DNA methylation that have a tendency to reverse with abstinence. Pathway analysis demonstrates that these changes map to gene networks involved in apoptosis. There is no significant overlap of the alcohol signature with the methylation signature previously derived for smoking. We conclude that DNA methylation may have future clinical utility in assessing acute alcohol use status and monitoring treatment response. PMID:25147915

  2. Teaching strategies to incorporate genomics education into academic nursing curricula.

    PubMed

    Quevedo Garcia, Sylvia P; Greco, Karen E; Loescher, Lois J

    2011-11-01

    The translation of genomic science into health care has expanded our ability to understand the effects of genomics on human health and disease. As genomic advances continue, nurses are expected to have the knowledge and skills to translate genomic information into improved patient care. This integrative review describes strategies used to teach genomics in academic nursing programs and their facilitators and barriers to inclusion in nursing curricula. The Learning Engagement Model and the Diffusion of Innovations Theory guided the interpretation of findings. CINAHL, Medline, and Web of Science were resources for articles published during the past decade that included strategies for teaching genomics in academic nursing programs. Of 135 articles, 13 met criteria for review. Examples of effective genomics teaching strategies included clinical application through case studies, storytelling, online genomics resources, student self-assessment, guest lecturers, and a genetics focus group. Most strategies were not evaluated for effectiveness.

  3. Dissection of genomic correlation matrices using multivariate factor analysis in dairy and dual-purpose cattle breeds

    Technology Transfer Automated Retrieval System (TEKTRAN)

    SNP effects estimated in genomic selection programs allow for the prediction of direct genomic values (DGV) both at genome-wide and chromosomal level. As a consequence, genome-wide (G_GW) or chromosomal (G_CHR) correlation matrices between genomic predictions for different traits can be calculated. ...

  4. Building International Genomics Collaboration for Global Health Security

    PubMed Central

    Cui, Helen H.; Erkkila, Tracy; Chain, Patrick S. G.; Vuyisich, Momchilo

    2015-01-01

    Genome science and technologies are transforming life sciences globally in many ways and becoming a highly desirable area for international collaboration to strengthen global health. The Genome Science Program at the Los Alamos National Laboratory is leveraging a long history of expertise in genomics research to assist multiple partner nations in advancing their genomics and bioinformatics capabilities. The capability development objectives focus on providing a molecular genomics-based scientific approach for pathogen detection, characterization, and biosurveillance applications. The general approaches include introduction of basic principles in genomics technologies, training on laboratory methodologies and bioinformatic analysis of resulting data, procurement, and installation of next-generation sequencing instruments, establishing bioinformatics software capabilities, and exploring collaborative applications of the genomics capabilities in public health. Genome centers have been established with public health and research institutions in the Republic of Georgia, Kingdom of Jordan, Uganda, and Gabon; broader collaborations in genomics applications have also been developed with research institutions in many other countries. PMID:26697418

  5. Building international genomics collaboration for global health security

    SciTech Connect

    Cui, Helen H.; Erkkila, Tracy; Chain, Patrick S. G.; Vuyisich, Momchilo

    2015-12-07

    Genome science and technologies are transforming life sciences globally in many ways and becoming a highly desirable area for international collaboration to strengthen global health. The Genome Science Program at the Los Alamos National Laboratory is leveraging a long history of expertise in genomics research to assist multiple partner nations in advancing their genomics and bioinformatics capabilities. The capability development objectives focus on providing a molecular genomics-based scientific approach for pathogen detection, characterization, and biosurveillance applications. The general approaches include introduction of basic principles in genomics technologies, training on laboratory methodologies and bioinformatic analysis of resulting data, procurement, and installation of next-generation sequencing instruments, establishing bioinformatics software capabilities, and exploring collaborative applications of the genomics capabilities in public health. Genome centers have been established with public health and research institutions in the Republic of Georgia, Kingdom of Jordan, Uganda, and Gabon; broader collaborations in genomics applications have also been developed with research institutions in many other countries.

  6. The platypus genome unraveled.

    PubMed

    O'Brien, Stephen J

    2008-06-13

    The genome of the platypus has been sequenced, assembled, and annotated by an international genomics team. Like the animal itself the platypus genome contains an amalgam of mammal, reptile, and bird-like features.

  7. Genome evolution: the dynamics of static genomes.

    PubMed

    Stechmann, Alexandra

    2004-06-22

    A random survey of a microsporidian genome has revealed some striking features. Although the genomes of microsporidians are among the smallest known for eukaryotes, their organisation appears to be well conserved.

  8. Plant Genome Duplication Database.

    PubMed

    Lee, Tae-Ho; Kim, Junah; Robertson, Jon S; Paterson, Andrew H

    2017-01-01

    Genome duplication, widespread in flowering plants, is a driving force in evolution. Genome alignments between/within genomes facilitate identification of homologous regions and individual genes to investigate evolutionary consequences of genome duplication. PGDD (the Plant Genome Duplication Database), a public web service database, provides intra- or interplant genome alignment information. At present, PGDD contains information for 47 plants whose genome sequences have been released. Here, we describe methods for identification and estimation of dates of genome duplication and speciation by functions of PGDD.The database is freely available at http://chibba.agtec.uga.edu/duplication/.

  9. Transposable element junctions in marker development and genomic characterization of barley

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Barley is a model plant in genomic studies of Triticeae species. A complete barley genome sequence will facilitate not only barley breeding programs, but also those for related species. However, the large genome size and high repetitive sequence content complicate the barley genome assembly. The ma...

  10. 76 FR 3643 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-01-20

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Initial Review Group; Genome Research Review... Assistance Program Nos. 93.172, Human Genome Research, National Institutes of Health, HHS) Dated: January...

  11. 75 FR 2148 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-01-14

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Initial Review Group, Genome Research Review... Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes of Health, HHS)...

  12. 75 FR 52537 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-08-26

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Initial Review Group; Genome Research Review... Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes of Health, HHS)...

  13. Ensembl genomes 2016: more genomes, more complexity

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent...

  14. Ensembl Genomes 2016: more genomes, more complexity

    PubMed Central

    Kersey, Paul Julian; Allen, James E.; Armean, Irina; Boddu, Sanjay; Bolt, Bruce J.; Carvalho-Silva, Denise; Christensen, Mikkel; Davis, Paul; Falin, Lee J.; Grabmueller, Christoph; Humphrey, Jay; Kerhornou, Arnaud; Khobova, Julia; Aranganathan, Naveen K.; Langridge, Nicholas; Lowy, Ernesto; McDowall, Mark D.; Maheswari, Uma; Nuhn, Michael; Ong, Chuang Kee; Overduin, Bert; Paulini, Michael; Pedro, Helder; Perry, Emily; Spudich, Giulietta; Tapanari, Electra; Walts, Brandon; Williams, Gareth; Tello–Ruiz, Marcela; Stein, Joshua; Wei, Sharon; Ware, Doreen; Bolser, Daniel M.; Howe, Kevin L.; Kulesha, Eugene; Lawson, Daniel; Maslen, Gareth; Staines, Daniel M.

    2016-01-01

    Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of programmatic and interactive interfaces to a rich range of data including reference sequence, gene models, transcriptional data, genetic variation and comparative analysis. This paper provides an update to the previous publications about the resource, with a focus on recent developments. These include the development of new analyses and views to represent polyploid genomes (of which bread wheat is the primary exemplar); and the continued up-scaling of the resource, which now includes over 23 000 bacterial genomes, 400 fungal genomes and 100 protist genomes, in addition to 55 genomes from invertebrate metazoa and 39 genomes from plants. This dramatic increase in the number of included genomes is one part of a broader effort to automate the integration of archival data (genome sequence, but also associated RNA sequence data and variant calls) within the context of reference genomes and make it available through the Ensembl user interfaces. PMID:26578574

  15. Ensembl Genomes 2016: more genomes, more complexity.

    PubMed

    Kersey, Paul Julian; Allen, James E; Armean, Irina; Boddu, Sanjay; Bolt, Bruce J; Carvalho-Silva, Denise; Christensen, Mikkel; Davis, Paul; Falin, Lee J; Grabmueller, Christoph; Humphrey, Jay; Kerhornou, Arnaud; Khobova, Julia; Aranganathan, Naveen K; Langridge, Nicholas; Lowy, Ernesto; McDowall, Mark D; Maheswari, Uma; Nuhn, Michael; Ong, Chuang Kee; Overduin, Bert; Paulini, Michael; Pedro, Helder; Perry, Emily; Spudich, Giulietta; Tapanari, Electra; Walts, Brandon; Williams, Gareth; Tello-Ruiz, Marcela; Stein, Joshua; Wei, Sharon; Ware, Doreen; Bolser, Daniel M; Howe, Kevin L; Kulesha, Eugene; Lawson, Daniel; Maslen, Gareth; Staines, Daniel M

    2016-01-04

    Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of programmatic and interactive interfaces to a rich range of data including reference sequence, gene models, transcriptional data, genetic variation and comparative analysis. This paper provides an update to the previous publications about the resource, with a focus on recent developments. These include the development of new analyses and views to represent polyploid genomes (of which bread wheat is the primary exemplar); and the continued up-scaling of the resource, which now includes over 23 000 bacterial genomes, 400 fungal genomes and 100 protist genomes, in addition to 55 genomes from invertebrate metazoa and 39 genomes from plants. This dramatic increase in the number of included genomes is one part of a broader effort to automate the integration of archival data (genome sequence, but also associated RNA sequence data and variant calls) within the context of reference genomes and make it available through the Ensembl user interfaces.

  16. Comparative genomics of phylogenetically diverse unicellular eukaryotes provide new insights into the genetic basis for the evolution of the programmed cell death machinery.

    PubMed

    Nedelcu, Aurora M

    2009-03-01

    Programmed cell death (PCD) represents a significant component of normal growth and development in multicellular organisms. Recently, PCD-like processes have been reported in single-celled eukaryotes, implying that some components of the PCD machinery existed early in eukaryotic evolution. This study provides a comparative analysis of PCD-related sequences across more than 50 unicellular genera from four eukaryotic supergroups: Unikonts, Excavata, Chromalveolata, and Plantae. A complex set of PCD-related sequences that correspond to domains or proteins associated with all main functional classes--from ligands and receptors to executors of PCD--was found in many unicellular lineages. Several PCD domains and proteins previously thought to be restricted to animals or land plants are also present in unicellular species. Noteworthy, the yeast, Saccharomyces cerevisiae--used as an experimental model system for PCD research, has a rather reduced set of PCD-related sequences relative to other unicellular species. The phylogenetic distribution of the PCD-related sequences identified in unicellular lineages suggests that the genetic basis for the evolution of the complex PCD machinery present in extant multicellular lineages has been established early in the evolution of eukaryotes. The shaping of the PCD machinery in multicellular lineages involved the duplication, co-option, recruitment, and shuffling of domains already present in their unicellular ancestors.

  17. Funding Opportunity: Genomic Data Centers

    Cancer.gov

    Funding Opportunity CCG, Funding Opportunity Center for Cancer Genomics, CCG, Center for Cancer Genomics, CCG RFA, Center for cancer genomics rfa, genomic data analysis network, genomic data analysis network centers,

  18. Enabling functional genomics with genome engineering.

    PubMed

    Hilton, Isaac B; Gersbach, Charles A

    2015-10-01

    Advances in genome engineering technologies have made the precise control over genome sequence and regulation possible across a variety of disciplines. These tools can expand our understanding of fundamental biological processes and create new opportunities for therapeutic designs. The rapid evolution of these methods has also catalyzed a new era of genomics that includes multiple approaches to functionally characterize and manipulate the regulation of genomic information. Here, we review the recent advances of the most widely adopted genome engineering platforms and their application to functional genomics. This includes engineered zinc finger proteins, TALEs/TALENs, and the CRISPR/Cas9 system as nucleases for genome editing, transcription factors for epigenome editing, and other emerging applications. We also present current and potential future applications of these tools, as well as their current limitations and areas for future advances.

  19. Enabling functional genomics with genome engineering

    PubMed Central

    Hilton, Isaac B.; Gersbach, Charles A.

    2015-01-01

    Advances in genome engineering technologies have made the precise control over genome sequence and regulation possible across a variety of disciplines. These tools can expand our understanding of fundamental biological processes and create new opportunities for therapeutic designs. The rapid evolution of these methods has also catalyzed a new era of genomics that includes multiple approaches to functionally characterize and manipulate the regulation of genomic information. Here, we review the recent advances of the most widely adopted genome engineering platforms and their application to functional genomics. This includes engineered zinc finger proteins, TALEs/TALENs, and the CRISPR/Cas9 system as nucleases for genome editing, transcription factors for epigenome editing, and other emerging applications. We also present current and potential future applications of these tools, as well as their current limitations and areas for future advances. PMID:26430154

  20. Navigating yeast genome maintenance with functional genomics.

    PubMed

    Measday, Vivien; Stirling, Peter C

    2016-03-01

    Maintenance of genome integrity is a fundamental requirement of all organisms. To address this, organisms have evolved extremely faithful modes of replication, DNA repair and chromosome segregation to combat the deleterious effects of an unstable genome. Nonetheless, a small amount of genome instability is the driver of evolutionary change and adaptation, and thus a low level of instability is permitted in populations. While defects in genome maintenance almost invariably reduce fitness in the short term, they can create an environment where beneficial mutations are more likely to occur. The importance of this fact is clearest in the development of human cancer, where genome instability is a well-established enabling characteristic of carcinogenesis. This raises the crucial question: what are the cellular pathways that promote genome maintenance and what are their mechanisms? Work in model organisms, in particular the yeast Saccharomyces cerevisiae, has provided the global foundations of genome maintenance mechanisms in eukaryotes. The development of pioneering genomic tools inS. cerevisiae, such as the systematic creation of mutants in all nonessential and essential genes, has enabled whole-genome approaches to identifying genes with roles in genome maintenance. Here, we review the extensive whole-genome approaches taken in yeast, with an emphasis on functional genomic screens, to understand the genetic basis of genome instability, highlighting a range of genetic and cytological screening modalities. By revealing the biological pathways and processes regulating genome integrity, these analyses contribute to the systems-level map of the yeast cell and inform studies of human disease, especially cancer.

  1. Culex genome is not just another genome for comparative genomics.

    PubMed

    Reddy, B P Niranjan; Labbé, Pierrick; Corbel, Vincent

    2012-03-30

    Formal publication of the Culex genome sequence has closed the human disease vector triangle by meeting the Anopheles gambiae and Aedes aegypti genome sequences. Compared to these other mosquitoes, Culex quinquefasciatus possesses many specific hallmark characteristics, and may thus provide different angles for research which ultimately leads to a practical solution for controlling the ever increasing burden of insect-vector-borne diseases around the globe. We argue the special importance of the cosmopolitan species- Culex genome sequence by invoking many interesting questions and the possible of potential of the Culex genome to answer those.

  2. Exploring Other Genomes: Bacteria.

    ERIC Educational Resources Information Center

    Flannery, Maura C.

    2001-01-01

    Points out the importance of genomes other than the human genome project and provides information on the identified bacterial genomes Pseudomonas aeuroginosa, Leprosy, Cholera, Meningitis, Tuberculosis, Bubonic Plague, and plant pathogens. Considers the computer's use in genome studies. (Contains 14 references.) (YDS)

  3. Structure of the germline genome of Tetrahymena thermophila and relationship to the massively rearranged somatic genome.

    PubMed

    Hamilton, Eileen P; Kapusta, Aurélie; Huvos, Piroska E; Bidwell, Shelby L; Zafar, Nikhat; Tang, Haibao; Hadjithomas, Michalis; Krishnakumar, Vivek; Badger, Jonathan H; Caler, Elisabet V; Russ, Carsten; Zeng, Qiandong; Fan, Lin; Levin, Joshua Z; Shea, Terrance; Young, Sarah K; Hegarty, Ryan; Daza, Riza; Gujja, Sharvari; Wortman, Jennifer R; Birren, Bruce W; Nusbaum, Chad; Thomas, Jainy; Carey, Clayton M; Pritham, Ellen J; Feschotte, Cédric; Noto, Tomoko; Mochizuki, Kazufumi; Papazyan, Romeo; Taverna, Sean D; Dear, Paul H; Cassidy-Hanley, Donna M; Xiong, Jie; Miao, Wei; Orias, Eduardo; Coyne, Robert S

    2016-11-28

    The germline genome of the binucleated ciliate Tetrahymena thermophila undergoes programmed chromosome breakage and massive DNA elimination to generate the somatic genome. Here, we present a complete sequence assembly of the germline genome and analyze multiple features of its structure and its relationship to the somatic genome, shedding light on the mechanisms of genome rearrangement as well as the evolutionary history of this remarkable germline/soma differentiation. Our results strengthen the notion that a complex, dynamic, and ongoing interplay between mobile DNA elements and the host genome have shaped Tetrahymena chromosome structure, locally and globally. Non-standard outcomes of rearrangement events, including the generation of short-lived somatic chromosomes and excision of DNA interrupting protein-coding regions, may represent novel forms of developmental gene regulation. We also compare Tetrahymena's germline/soma differentiation to that of other characterized ciliates, illustrating the wide diversity of adaptations that have occurred within this phylum.

  4. Genomic definition of species. Revision 2

    SciTech Connect

    Crkvenjakov, R.; Drmanac, R.

    1993-03-01

    A genome is the sum total of the DNA sequences in the cells of an individual organism. The common usage that species possess genomes comes naturally to biochemists, who have shown that all protein and nucleic acid molecules are at the same time species- and individual-specific, with minor individual variations being superimposed on a consensus sequence that is constant for a species. By extension, this property is attributed to the common features of DNA in the chromosomes of members of a given species and is called species genome. Our proposal for the definition of a biological species is as follows: A species comprises a group of actual and potential biological organisms built according to a unique genome program that is recorded, and at least in part expressed, in the structures of their genomic nucleic acid molecule(s), having intragroup sequence differences which can be fully interconverted in the process of organismal reproduction.

  5. Genomics of apicomplexan parasites.

    PubMed

    Swapna, Lakshmipuram Seshadri; Parkinson, John

    2017-02-22

    The increasing prevalence of infections involving intracellular apicomplexan parasites such as Plasmodium, Toxoplasma, and Cryptosporidium (the causative agents of malaria, toxoplasmosis, and cryptosporidiosis, respectively) represent a significant global healthcare burden. Despite their significance, few treatments are available; a situation that is likely to deteriorate with the emergence of new resistant strains of parasites. To lay the foundation for programs of drug discovery and vaccine development, genome sequences for many of these organisms have been generated, together with large-scale expression and proteomic datasets. Comparative analyses of these datasets are beginning to identify the molecular innovations supporting both conserved processes mediating fundamental roles in parasite survival and persistence, as well as lineage-specific adaptations associated with divergent life-cycle strategies. The challenge is how best to exploit these data to derive insights into parasite virulence and identify those genes representing the most amenable targets. In this review, we outline genomic datasets currently available for apicomplexans and discuss biological insights that have emerged as a consequence of their analysis. Of particular interest are systems-based resources, focusing on areas of metabolism and host invasion that are opening up opportunities for discovering new therapeutic targets.

  6. MycoCosm, an Integrated Fungal Genomics Resource

    SciTech Connect

    Shabalov, Igor; Grigoriev, Igor

    2012-03-16

    MycoCosm is a web-based interactive fungal genomics resource, which was first released in March 2010, in response to an urgent call from the fungal community for integration of all fungal genomes and analytical tools in one place (Pan-fungal data resources meeting, Feb 21-22, 2010, Alexandria, VA). MycoCosm integrates genomics data and analysis tools to navigate through over 100 fungal genomes sequenced at JGI and elsewhere. This resource allows users to explore fungal genomes in the context of both genome-centric analysis and comparative genomics, and promotes user community participation in data submission, annotation and analysis. MycoCosm has over 4500 unique visitors/month or 35000+ visitors/year as well as hundreds of registered users contributing their data and expertise to this resource. Its scalable architecture allows significant expansion of the data expected from JGI Fungal Genomics Program, its users, and integration with external resources used by fungal community.

  7. Exploiting the Genome

    DTIC Science & Technology

    1998-09-11

    complete human genome sequence . 14. SUBJECT TERMS 15. NUMBER OF PAGES 16. PRICE CODE 17. SECURITY CLASSIFICATION OF REPORT Unclassified 18. SECURITY...goal of the project is to ob- tain the complete sequence of the human genome by the year 2005. The genome contains approximately 3.3 Gb (billion base...and second, to consider possible roles for the DOE in the "post- genomic " era, following acquisition of the complete human genome

  8. Accelerating Genome Sequencing 100X with FPGAs

    SciTech Connect

    Storaasli, Olaf O; Strenski, Dave

    2007-01-01

    The performance of two Cray XD1 systems with Virtex-II Pro 50 and Virtex-4 LX160 FPGAs was evaluated using the FASTA computational biology program for human genome (DNA and protein) sequence comparisons. FPGA speedups of 50X (Virtex-II Pro 50) and 100X (Virtex-4 LX160) over a 2.2 GHz Opteron were obtained. FPGA coding issues for human genome data are described.

  9. TCGA's Pan-Cancer Efforts and Expansion to Include Whole Genome Sequence - TCGA

    Cancer.gov

    Carolyn Hutter, Ph.D., Program Director of NHGRI's Division of Genomic Medicine, discusses the expansion of TCGA's Pan-Cancer efforts to include the Pan-Cancer Analysis of Whole Genomes (PAWG) project.

  10. A flexible approach to genome map assembly

    SciTech Connect

    Harley, E.; Bonner, A.J.

    1994-12-31

    A major goal of the Human Genome Project is to construct detailed physical maps of the human genome. A physical map is an assignment of DNA fragments to their locations on the genome. Complete maps of large genomes require the integration of many kinds of experimental data, each with its own forms of noise and experimental error. To facilitate this integration, we are developing a flexible approach to map assembly based on logic programming and data visualization. Logic programming provides a convenient, and mathematically rigorous way of reasoning about data, while data visualization provides layout algorithms for assembling and displaying genome maps. To demonstrate the approach, this paper describes numerous rules for map assembly implemented in a data-visualization system called Hy+. Using these rules, we have successfully assembled contigs (partial maps) from real and simulated mapping data-data that is noisy, imprecise and contradictory. The main advantage of the approach is that it allows a user to rapidly develop, implement and test new rules for genome map assembly, with a minimum of programming effort.

  11. Intrauterine programming

    PubMed Central

    Sedaghat, Katayoun; Zahediasl, Saleh; Ghasemi, Asghar

    2015-01-01

    In mammals, the intrauterine condition has an important role in the development of fetal physiological systems in later life. Suboptimal maternal environment can alter the regulatory pathways that determine the normal development of the fetus in utero, which in post-natal life may render the individual more susceptible to cardiovascular or metabolic adult-life diseases. Changes in the intrauterine availability of nutrients, oxygen and hormones can change the fetal tissue developmental regulatory planning, which occurs genomically and non-genomically and can cause permanent structural and functional changes in the systems, leading to diseases in early years of life and those that particularly become overt in adulthood. In this review we take a brief look at the main elements which program the fetal system development and consequently induce a crucial impact on the cardiovascular, nervous and hormonal systems in adulthood. PMID:25945232

  12. Plant genomics: an overview.

    PubMed

    Campos-de Quiroz, Hugo

    2002-01-01

    Recent technological advancements have substantially expanded our ability to analyze and understand plant genomes and to reduce the gap existing between genotype and phenotype. The fast evolving field of genomics allows scientists to analyze thousand of genes in parallel, to understand the genetic architecture of plant genomes and also to isolate the genes responsible for mutations. Furthermore, whole genomes can now be sequenced. This review addresses these issues and also discusses ways to extract biological meaning from DNA data. Although genomic issuesare addressed from a plant perspective, this review provides insights into the genomic analyses of other organisms.

  13. Genome instability, cancer and aging

    PubMed Central

    Maslov, Alexander Y.; Vijg, Jan

    2015-01-01

    DNA damage-driven genome instability underlies the diversity of life forms generated by the evolutionary process but is detrimental to the somatic cells of individual organisms. The cellular response to DNA damage can be roughly divided in two parts. First, when damage is severe, programmed cell death may occur or, alternatively, temporary or permanent cell cycle arrest. This protects against cancer but can have negative effects on the long term, e.g., by depleting stem cell reservoirs. Second, damage can be repaired through one or more of the many sophisticated genome maintenance pathways. However, erroneous DNA repair and incomplete restoration of chromatin after damage is resolved, produce mutations and epimutations, respectively, both of which have been shown to accumulate with age. An increased burden of mutations and/or epimutations in aged tissues increases cancer risk and adversely affects gene transcriptional regulation, leading to progressive decline in organ function. Cellular degeneration and uncontrolled cell proliferation are both major hallmarks of aging. Despite the fact that one seems to exclude the other, they both may be driven by a common mechanism. Here, we review age related changes in the mammalian genome and their possible functional consequences, with special emphasis on genome instability in stem/progenitor cells. PMID:19344750

  14. Integrating sequence, evolution and functional genomics in regulatory genomics

    PubMed Central

    Vingron, Martin; Brazma, Alvis; Coulson, Richard; van Helden, Jacques; Manke, Thomas; Palin, Kimmo; Sand, Olivier; Ukkonen, Esko

    2009-01-01

    With genome analysis expanding from the study of genes to the study of gene regulation, 'regulatory genomics' utilizes sequence information, evolution and functional genomics measurements to unravel how regulatory information is encoded in the genome. PMID:19226437

  15. GenomeVISTA—an integrated software package for whole-genome alignment and visualization

    PubMed Central

    Poliakov, Alexandre; Foong, Justin; Brudno, Michael; Dubchak, Inna

    2014-01-01

    Summary: With the ubiquitous generation of complete genome assemblies for a variety of species, efficient tools for whole-genome alignment along with user-friendly visualization are critically important. Our VISTA family of tools for comparative genomics, based on algorithms for pairwise and multiple alignments of genomic sequences and whole-genome assemblies, has become one of the standard techniques for comparative analysis. Most of the VISTA programs have been implemented as Web-accessible servers and are extensively used by the biomedical community. In this manuscript, we introduce GenomeVISTA: a novel implementation that incorporates most features of the VISTA family—fast and accurate alignment, visualization capabilities, GUI and analytical tools within a stand-alone software package. GenomeVISTA thus provides flexibility and security for users who need to conduct whole-genome comparisons on their own computers. Availability and implementation: Implemented in Perl, C/C++ and Java, the source code is freely available for download at the VISTA Web site: http://genome.lbl.gov/vista/ Contact: avpoliakov@lbl.gov or ildubchak@lbl.gov Supplementary information: Supplementary data are available at Bioinformatics online. PMID:24860159

  16. Genomic Data Commons | Office of Cancer Genomics

    Cancer.gov

    The NCI’s Center for Cancer Genomics launches the Genomic Data Commons (GDC), a unified data sharing platform for the cancer research community. The mission of the GDC is to enable data sharing across the entire cancer research community, to ultimately support precision medicine in oncology.

  17. Directed genome engineering for genome optimization.

    PubMed

    D'Halluin, Kathleen; Ruiter, Rene

    2013-01-01

    The ability to develop nucleases with tailor-made activities for targeted DNA double-strand break induction at will at any desired position in the genome has been a major breakthrough to make targeted genome optimization feasible in plants. The development of site specific nucleases for precise genome modification has expanded the repertoire of tools for the development and optimization of traits, already including mutation breeding, molecular breeding and transgenesis.Through directed genome engineering technology, the huge amount of information provided by genomics and systems biology can now more effectively be used for the creation of plants with improved or new traits, and for the dissection of gene functions. Although still in an early phase of deployment, its utility has been demonstrated for engineering disease resistance, herbicide tolerance, altered metabolite profiles, and for molecular trait stacking to allow linked transmission of transgenes. In this article, we will briefly review the different approaches for directed genome engineering with the emphasis on double strand break (DSB)-mediated engineering to-wards genome optimization for crop improvement and towards the acceleration of functional genomics.

  18. About the Epidemiology and Genomics Research Program

    Cancer.gov

    Epidemiology is the scientific study of the causes and distribution of disease in populations. NCI-funded epidemiology research is conducted through research at institutions in the United States and internationally.

  19. Genomics to feed a switchgrass breeding program

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Development of improved cultivars is one of three pillars, along with sustainable production and efficient conversion, required for dedicated cellulosic bioenergy crops to succeed. Breeding new cultivars is a long, slow process requiring patience, dedication, and motivation to realize gains and adva...

  20. GENOMICS AND ENVIRONMENTAL RESEARCH

    EPA Science Inventory

    The impact of recently developed and emerging genomics technologies on environmental sciences has significant implications for human and ecological risk assessment issues. The linkage of data generated from genomics, transcriptomics, proteomics, metabalomics, and ecology can be ...

  1. Genomic Data Commons launches

    Cancer.gov

    The Genomic Data Commons (GDC), a unified data system that promotes sharing of genomic and clinical data between researchers, launched today with a visit from Vice President Joe Biden to the operations center at the University of Chicago.

  2. Whole-genome patenting.

    PubMed

    O'Malley, Maureen A; Bostanci, Adam; Calvert, Jane

    2005-06-01

    Gene patenting is now a familiar commercial practice, but there is little awareness that several patents claim ownership of the complete genome sequence of a prokaryote or virus. When these patents are analysed and compared to those for other biological entities, it becomes clear that genome patents seek to exploit the genome as an information base and are part of a broader shift towards intangible intellectual property in genomics.

  3. Exploiting the genome

    SciTech Connect

    Block, S.; Cornwall, J.; Dyson, F.; Koonin, S.; Lewis, N.; Schwitters, R.

    1998-09-11

    In 1997, JASON conducted a DOE-sponsored study of the human genome project with special emphasis on the areas of technology, quality assurance and quality control, and informatics. The present study has two aims: first, to update the 1997 Report in light of recent developments in genome sequencing technology, and second, to consider possible roles for the DOE in the ''post-genomic" era, following acquisition of the complete human genome sequence.

  4. Assembly complexity of prokaryotic genomes using short reads

    PubMed Central

    2010-01-01

    Background De Bruijn graphs are a theoretical framework underlying several modern genome assembly programs, especially those that deal with very short reads. We describe an application of de Bruijn graphs to analyze the global repeat structure of prokaryotic genomes. Results We provide the first survey of the repeat structure of a large number of genomes. The analysis gives an upper-bound on the performance of genome assemblers for de novo reconstruction of genomes across a wide range of read lengths. Further, we demonstrate that the majority of genes in prokaryotic genomes can be reconstructed uniquely using very short reads even if the genomes themselves cannot. The non-reconstructible genes are overwhelmingly related to mobile elements (transposons, IS elements, and prophages). Conclusions Our results improve upon previous studies on the feasibility of assembly with short reads and provide a comprehensive benchmark against which to compare the performance of the short-read assemblers currently being developed. PMID:20064276

  5. Office of Cancer Genomics |

    Cancer.gov

    The mission of the NCI’s Office of Cancer Genomics (OCG) is to enhance the understanding of the molecular mechanisms of cancer, advance and accelerate genomics science and technology development, and efficiently translate the genomics data to improve cancer research, prevention, early detection, diagnosis and treatment.

  6. 10. international mouse genome conference

    SciTech Connect

    Meisler, M.H.

    1996-12-31

    Ten years after hosting the First International Mammalian Genome Conference in Paris in 1986, Dr. Jean-Louis Guenet presided over the Tenth Conference at the Pasteur Institute, October 7--10, 1996. The 1986 conference was a satellite to the Human Gene Mapping Workshop and had approximately 50 attendees. The 1996 meeting was attended by 300 scientists from around the world. In the interim, the number of mapped loci in the mouse increased from 1,000 to over 20,000. This report contains a listing of the program and its participants, and two articles that review the meeting and the role of the laboratory mouse in the Human Genome project. More than 200 papers were presented at the conference covering the following topics: International mouse chromosome committee meetings; Mutant generation and identification; Physical and genetic maps; New technology and resources; Chromatin structure and gene regulation; Rate and hamster genetic maps; Informatics and databases; and Quantitative trait analysis.

  7. Wheat Genomics: Present Status and Future Prospects

    PubMed Central

    Gupta, P. K.; Mir, R. R.; Mohan, A.; Kumar, J.

    2008-01-01

    Wheat (Triticum aestivum L.), with a large genome (16000 Mb) and high proportion (∼80%) of repetitive sequences, has been a difficult crop for genomics research. However, the availability of extensive cytogenetics stocks has been an asset, which facilitated significant progress in wheat genomic research in recent years. For instance, fairly dense molecular maps (both genetic and physical maps) and a large set of ESTs allowed genome-wide identification of gene-rich and gene-poor regions as well as QTL including eQTL. The availability of markers associated with major economic traits also allowed development of major programs on marker-assisted selection (MAS) in some countries, and facilitated map-based cloning of a number of genes/QTL. Resources for functional genomics including TILLING and RNA interference (RNAi) along with some new approaches like epigenetics and association mapping are also being successfully used for wheat genomics research. BAC/BIBAC libraries for the subgenome D and some individual chromosomes have also been prepared to facilitate sequencing of gene space. In this brief review, we discuss all these advances in some detail, and also describe briefly the available resources, which can be used for future genomics research in this important crop. PMID:18528518

  8. Collaborative Research to Advance Precision Medicine in the Post-Genomic World | Office of Cancer Genomics

    Cancer.gov

    My name is Subhashini Jagu, and I am the Scientific Program Manager for the Cancer Target Discovery and Development (CTD2) Network at the Office of Cancer Genomics (OCG). In my new role, I help CTD2 work toward its mission, which is to develop new scientific approaches to accelerate the translation of genomic discoveries into new treatments. Collaborative efforts that bring together a variety of expertise and infrastructure are needed to understand and successfully treat cancer, a highly complex disease.

  9. Almost finished: the complete genome sequence of Mycosphaerella graminicola

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Mycosphaerella graminicola causes septoria tritici blotch of wheat. An 8.9x shotgun sequence of bread wheat strain IPO323 was generated through the Community Sequencing Program of the U.S. Department of Energy’s Joint Genome Institute (JGI), and was finished at the Stanford Human Genome Center. The ...

  10. Genotypes are useful for more than genomic evaluation

    Technology Transfer Automated Retrieval System (TEKTRAN)

    New services that provide pedigree discovery, breed composition, mating programs, genomic inbreeding, fertility defects, and inheritance tracking all are possible from low-cost genotyping in addition to genomic evaluation. Genetic markers let breeders select among sibs before their phenotypes became...

  11. 2012 U.S. Department of Energy: Joint Genome Institute: Progress Report

    SciTech Connect

    Gilbert, David

    2013-01-01

    The mission of the U.S. Department of Energy Joint Genome Institute (DOE JGI) is to serve the diverse scientific community as a user facility, enabling the application of large-scale genomics and analysis of plants, microbes, and communities of microbes to address the DOE mission goals in bioenergy and the environment. The DOE JGI's sequencing efforts fall under the Eukaryote Super Program, which includes the Plant and Fungal Genomics Programs; and the Prokaryote Super Program, which includes the Microbial Genomics and Metagenomics Programs. In 2012, several projects made news for their contributions to energy and environment research.

  12. The Bluejay genome browser.

    PubMed

    Soh, Jung; Gordon, Paul M K; Sensen, Christoph W

    2012-03-01

    The Bluejay genome browser is a stand-alone visualization tool for the multi-scale viewing of annotated genomes and other genomic elements. Bluejay allows users to customize display features to suit their needs, and produces publication-quality graphics. Bluejay provides a multitude of ways to interrelate biological data at the genome scale. Users can load gene expression data into a genome display for expression visualization in context. Multiple genomes can be compared concurrently, including time series expression data, based on Gene Ontology labels. External, context-sensitive biological Web Services are linked to the displayed genomic elements ad hoc for in-depth genomic data analysis and interpretation. Users can mark multiple points of interest in a genome by creating waypoints, and exploit them for easy navigation of single or multiple genomes. Using this comprehensive visual environment, users can study a gene not just in relation to its genome, but also its transcriptome and evolutionary origins. Written in Java, Bluejay is platform-independent and is freely available from http://bluejay.ucalgary.ca.

  13. Bacterial Genome Instability

    PubMed Central

    Darmon, Elise

    2014-01-01

    SUMMARY Bacterial genomes are remarkably stable from one generation to the next but are plastic on an evolutionary time scale, substantially shaped by horizontal gene transfer, genome rearrangement, and the activities of mobile DNA elements. This implies the existence of a delicate balance between the maintenance of genome stability and the tolerance of genome instability. In this review, we describe the specialized genetic elements and the endogenous processes that contribute to genome instability. We then discuss the consequences of genome instability at the physiological level, where cells have harnessed instability to mediate phase and antigenic variation, and at the evolutionary level, where horizontal gene transfer has played an important role. Indeed, this ability to share DNA sequences has played a major part in the evolution of life on Earth. The evolutionary plasticity of bacterial genomes, coupled with the vast numbers of bacteria on the planet, substantially limits our ability to control disease. PMID:24600039

  14. UCSC genome browser tutorial.

    PubMed

    Zweig, Ann S; Karolchik, Donna; Kuhn, Robert M; Haussler, David; Kent, W James

    2008-08-01

    The University of California Santa Cruz (UCSC) Genome Bioinformatics website consists of a suite of free, open-source, on-line tools that can be used to browse, analyze, and query genomic data. These tools are available to anyone who has an Internet browser and an interest in genomics. The website provides a quick and easy-to-use visual display of genomic data. It places annotation tracks beneath genome coordinate positions, allowing rapid visual correlation of different types of information. Many of the annotation tracks are submitted by scientists worldwide; the others are computed by the UCSC Genome Bioinformatics group from publicly available sequence data. It also allows users to upload and display their own experimental results or annotation sets by creating a custom track. The suite of tools, downloadable data files, and links to documentation and other information can be found at http://genome.ucsc.edu/.

  15. Enabling responsible public genomics.

    PubMed

    Conley, John M; Doerr, Adam K; Vorhaus, Daniel B

    2010-01-01

    As scientific understandings of genetics advance, researchers require increasingly rich datasets that combine genomic data from large numbers of individuals with medical and other personal information. Linking individuals' genetic data and personal information precludes anonymity and produces medically significant information--a result not contemplated by the established legal and ethical conventions governing human genomic research. To pursue the next generation of human genomic research and commerce in a responsible fashion, scientists, lawyers, and regulators must address substantial new issues, including researchers' duties with respect to clinically significant data, the challenges to privacy presented by genomic data, the boundary between genomic research and commerce, and the practice of medicine. This Article presents a new model for understanding and addressing these new challenges--a "public genomics" premised on the idea that ethically, legally, and socially responsible genomics research requires openness, not privacy, as its organizing principle. Responsible public genomics combines the data contributed by informed and fully consenting information altruists and the research potential of rich datasets in a genomic commons that is freely and globally available. This Article examines the risks and benefits of this public genomics model in the context of an ambitious genetic research project currently under way--the Personal Genome Project. This Article also (i) demonstrates that large-scale genomic projects are desirable, (ii) evaluates the risks and challenges presented by public genomics research, and (iii) determines that the current legal and regulatory regimes restrict beneficial and responsible scientific inquiry while failing to adequately protect participants. The Article concludes by proposing a modified normative and legal framework that embraces and enables a future of responsible public genomics.

  16. Efficient Methods to Compute Genomic Predictions

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Efficient methods for processing genomic data were developed to increase reliability of estimated breeding values and simultaneously estimate thousands of marker effects. Algorithms were derived and computer programs tested on simulated data for 50,000 markers and 2,967 bulls. Accurate estimates of ...

  17. R for genome-wide association studies.

    PubMed

    Gondro, Cedric; Porto-Neto, Laercio R; Lee, Seung Hwan

    2013-01-01

    In recent years R has become de facto statistical programming language of choice for statisticians and it is also arguably the most widely used generic environment for analysis of high-throughput genomic data. In this chapter we discuss some approaches to improve performance of R when working with large SNP datasets.

  18. Whole-exome/genome sequencing and genomics.

    PubMed

    Grody, Wayne W; Thompson, Barry H; Hudgins, Louanne

    2013-12-01

    As medical genetics has progressed from a descriptive entity to one focused on the functional relationship between genes and clinical disorders, emphasis has been placed on genomics. Genomics, a subelement of genetics, is the study of the genome, the sum total of all the genes of an organism. The human genome, which is contained in the 23 pairs of nuclear chromosomes and in the mitochondrial DNA of each cell, comprises >6 billion nucleotides of genetic code. There are some 23,000 protein-coding genes, a surprisingly small fraction of the total genetic material, with the remainder composed of noncoding DNA, regulatory sequences, and introns. The Human Genome Project, launched in 1990, produced a draft of the genome in 2001 and then a finished sequence in 2003, on the 50th anniversary of the initial publication of Watson and Crick's paper on the double-helical structure of DNA. Since then, this mass of genetic information has been translated at an ever-increasing pace into useable knowledge applicable to clinical medicine. The recent advent of massively parallel DNA sequencing (also known as shotgun, high-throughput, and next-generation sequencing) has brought whole-genome analysis into the clinic for the first time, and most of the current applications are directed at children with congenital conditions that are undiagnosable by using standard genetic tests for single-gene disorders. Thus, pediatricians must become familiar with this technology, what it can and cannot offer, and its technical and ethical challenges. Here, we address the concepts of human genomic analysis and its clinical applicability for primary care providers.

  19. Complete genome sequence of Methanoculleus marisnigri type strain JR1

    SciTech Connect

    Anderson, Iain; Sieprawska-Lupa, Magdalena; Goltsman, Eugene; Lapidus, Alla L.; Copeland, A; Glavina Del Rio, Tijana; Tice, Hope; Dalin, Eileen; Barry, Kerrie; Saunders, Elizabeth H; Han, Cliff; Brettin, Tom; Detter, J. Chris; Bruce, David; Mikhailova, Natalia; Pitluck, Sam; Hauser, Loren John; Land, Miriam L; Lucas, Susan; Richardson, P M; Whitman, W. B.; Kyrpides, Nikos C

    2009-01-01

    Methanoculleus marisnigri Romesser et al. 1981 is a methanogen belonging to the order Methanomicrobiales within the archaeal phylum Euryarchaeota. The type strain, JR1, was isolated from anoxic sediments of the Black Sea. M. marisnigri is of phylogenetic interest because at the time the sequencing project began only one genome had previously been sequenced from the order Methanomicrobiales. We report here the complete genome sequence of M. marisnigri type strain JR1 and its annotation. This is part of a Joint Genome Institute 2006 Community Sequencing Program to sequence genomes of diverse Archaea.

  20. Complete genome sequence of Methanocorpusculum labreanum type strain Z

    SciTech Connect

    Anderson, Iain; Sieprawska-Lupa, Magdalena; Goltsman, Eugene; Lapidus, Alla L.; Copeland, A; Glavina Del Rio, Tijana; Tice, Hope; Dalin, Eileen; Barry, Kerrie; Pitluck, Sam; Hauser, Loren John; Land, Miriam L; Lucas, Susan; Richardson, P M; Whitman, W. B.; Kyrpides, Nikos C

    2009-01-01

    Methanocorpusculum labreanum is a methanogen belonging to the order Methanomicrobiales within the archaeal phylum Euryarchaeota. The type strain Z was isolated from surface sediments of Tar Pit Lake in the La Brea Tar Pits in Los Angeles, California. M. labreanum is of phylogenetic interest because at the time the sequencing project began only one genome had previously been sequenced from the order Methanomicrobiales. We report here the complete genome sequence of M. labreanum type strain Z and its annotation. This is part of a 2006 Joint Genome Institute Community Sequencing Program project to sequence genomes of diverse Archaea.

  1. Methods of Genomic Competency Integration in Practice

    PubMed Central

    Jenkins, Jean; Calzone, Kathleen A.; Caskey, Sarah; Culp, Stacey; Weiner, Marsha; Badzek, Laurie

    2015-01-01

    Purpose Genomics is increasingly relevant to health care, necessitating support for nurses to incorporate genomic competencies into practice. The primary aim of this project was to develop, implement, and evaluate a year-long genomic education intervention that trained, supported, and supervised institutional administrator and educator champion dyads to increase nursing capacity to integrate genomics through assessments of program satisfaction and institutional achieved outcomes. Design Longitudinal study of 23 Magnet Recognition Program® Hospitals (21 intervention, 2 controls) participating in a 1-year new competency integration effort aimed at increasing genomic nursing competency and overcoming barriers to genomics integration in practice. Methods Champion dyads underwent genomic training consisting of one in-person kick-off training meeting followed by monthly education webinars. Champion dyads designed institution-specific action plans detailing objectives, methods or strategies used to engage and educate nursing staff, timeline for implementation, and outcomes achieved. Action plans focused on a minimum of seven genomic priority areas: champion dyad personal development; practice assessment; policy content assessment; staff knowledge needs assessment; staff development; plans for integration; and anticipated obstacles and challenges. Action plans were updated quarterly, outlining progress made as well as inclusion of new methods or strategies. Progress was validated through virtual site visits with the champion dyads and chief nursing officers. Descriptive data were collected on all strategies or methods utilized, and timeline for achievement. Descriptive data were analyzed using content analysis. Findings The complexity of the competency content and the uniqueness of social systems and infrastructure resulted in a significant variation of champion dyad interventions. Conclusions Nursing champions can facilitate change in genomic nursing capacity through

  2. Universal genome in the origin of metazoa: thoughts about evolution.

    PubMed

    Sherman, Michael

    2007-08-01

    Recent advances in paleontology, genome analysis, genetics and embryology raise a number of questions about the origin of Animal Kingdom. These questions include:(1) seemingly simultaneous appearance of diverse Metazoan phyla in Cambrian period, (2) similarities of genomes among Metazoan phyla of diverse complexity, (3) seemingly excessive complexity of genomes of lower taxons and (4) similar genetic switches of functionally similar but non-homologous developmental programs. Here I propose an experimentally testable hypothesis of Universal Genome that addresses these questions. According to this model, (a) the Universal Genome that encodes all major developmental programs essential for various phyla of Metazoa emerged in a unicellular or a primitive multicellular organism shortly before the Cambrian period; (b) The Metazoan phyla, all having similar genomes, are nonetheless so distinct because they utilize specific combinations of developmental programs. This model has two major predictions, first that a significant fraction of genetic information in lower taxons must be functionally useless but becomes useful in higher taxons, and second that one should be able to turn on in lower taxons some of the complex latent developmental programs, e.g., a program of eye development or antibody synthesis in sea urchin. An example of natural turning on of a complex latent program in a lower taxon is discussed.

  3. 75 FR 13558 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-03-22

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed....), notice is hereby given of a meeting of the Board of Scientific Counselors, National Human Genome Research... individual intramural programs and projects conducted by the National Human Genome Research...

  4. 75 FR 48977 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-08-12

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed.... Day, PhD, Scientific Review Officer, CIDR, National Human Genome Research Institute, National... . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes...

  5. 75 FR 67380 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-11-02

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed...: Ken D. Nakamura, PhD, Scientific Review Officer, Scientific Review Branch, National Human Genome... Program Nos. 93.172, Human Genome Research, National Institutes of Health, HHS) Dated: October 26,...

  6. 75 FR 62548 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-10-12

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed..., PhD, Scientific Review Officer, CIDR, National Human Genome Research Institute, National Institutes... . Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes...

  7. 76 FR 19780 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-04-08

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... Officer, CIDR, National Human Genome Research Institute, National Institutes of Health, 5635 Fishers Lane... Assistance Program No. 93.172, Human Genome Research, National Institutes of Health, HHS) Dated: April...

  8. 77 FR 50140 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-08-20

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed.... Day, Ph.D., Scientific Review Officer, CIDR, National Human Genome Research Institute, National... . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes...

  9. 76 FR 22112 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-04-20

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Special Emphasis Panel, Special Emphasis Panel... Assistance Program Nos. 93.172, Human Genome Research, National Institutes of Health, HHS) Dated: April...

  10. 75 FR 8373 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-02-24

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Special Emphasis Panel, GWAS Comparing Design... of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes...

  11. 76 FR 66731 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-10-27

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Special Emphasis Panel, DAP for CEGS-SEP. Date...@mail.nih.gov . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome...

  12. 77 FR 35991 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-06-15

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed...: Camilla E. Day, Ph.D., Scientific Review Officer, CIDR, National Human Genome Research Institute, National... . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes...

  13. 75 FR 56115 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-09-15

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Special Emphasis Panel; CEGS DAP. Date... Assistance Program Nos. 93.172, Human Genome Research, National Institutes of Health, HHS) Dated: September...

  14. 75 FR 35821 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-06-23

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed..., Scientific Review Officer, CIDR, National Human Genome Research Institute, National Institutes of Health... Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes of Health,...

  15. 76 FR 35224 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-06-16

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed.... Day, PhD, Scientific Review Officer, CIR, National Human Genome Research Institute, National... . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes...

  16. 77 FR 64816 - National Human Genome Research Institute; Notice of Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-10-23

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Meeting... hereby given of a meeting of the Board of Scientific Counselors, National Human Genome Research Institute... intramural programs and projects conducted by the National Human Genome Research Institute,...

  17. 77 FR 20646 - National Human Genome Research Institute; Notice of Closed Meetings

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-04-05

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Special Emphasis Panel; Loan Repayment Program...: National Human Genome Research Institute, 5635 Fishers Lane, 3rd Floor Conference Room, Rockville, MD...

  18. 78 FR 11898 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2013-02-20

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed.... Day, Ph.D., Scientific Review Officer CIDR, National Human Genome Research Institute, National... . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes...

  19. 76 FR 36930 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-06-23

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Special Emphasis Panel, DAP R-25. Date: July...@mail.nih.gov . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome...

  20. 75 FR 60467 - National Human Genome Research Institute; Notice of Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-09-30

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Meeting... hereby given of a meeting of the Board of Scientific Counselors, National Human Genome Research Institute... intramural programs and projects conducted by the National Human Genome Research Institute,...

  1. 78 FR 77477 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2013-12-23

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed...: Camilla E. Day, Ph.D., Scientific Review Officer, CIDR, National Human Genome Research Institute, National... . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes...

  2. 76 FR 50486 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-08-15

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed....), notice is hereby given of a meeting of the Board of Scientific Counselors, National Human Genome Research... individual intramural programs and projects conducted by the National Human Genome Research...

  3. 76 FR 50486 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-08-15

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed.... Day, PhD, Scientific Review Officer, CIDR, National Human Genome Research Institute, National... . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes...

  4. 77 FR 31863 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-05-30

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Special Emphasis Panel DAP R25 Eppig.... (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes...

  5. 76 FR 65204 - National Human Genome Research Institute; Notice of Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-10-20

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Meeting... hereby given of a meeting of the Board of Scientific Counselors, National Human Genome Research Institute... intramural programs and projects conducted by the National Human Genome Research Institute,...

  6. 78 FR 47715 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2013-08-06

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed...., Scientific Review Officer, CIDR, National Human Genome Research Institute, National Institutes of Health... Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes of Health,...

  7. 76 FR 22407 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-04-21

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Special Emphasis Panel; Loan Repayment Program....172, Human Genome Research, National Institutes of Health, HHS) Dated: April 12, 2011. Jennifer...

  8. 76 FR 28056 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-05-13

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed....), notice is hereby given of a meeting of the Board of Scientific Counselors, National Human Genome Research... individual intramural programs and projects conducted by the National Human Genome Research...

  9. 76 FR 79199 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-12-21

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed...., Scientific Review Officer, CIDR, National Human Genome Research Institute, National Institutes of Health... Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes of Health,...

  10. 75 FR 8977 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-02-26

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed.... Nakamura, PhD, Scientific Review Officer, Scientific Review Branch, National Human Genome Research...-402-0838. (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome...

  11. 76 FR 9031 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-02-16

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed..., PhD, Scientific Review Officer, CIDR, National Human Genome Research Institute, National Institutes... . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes...

  12. 77 FR 64816 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-10-23

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed...: Camilla E. Day, Ph.D., Scientific Review Officer, CIDR, National Human Genome Research Institute, National... . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes...

  13. 76 FR 10909 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-02-28

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed.... Nakamura, PhD, Scientific Review Officer, Scientific Review Branch, National Human Genome Research...-402-0838. (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome...

  14. 78 FR 70063 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2013-11-22

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed....), notice is hereby given of a meeting of the Board of Scientific Counselors, National Human Genome Research... individual intramural programs and projects conducted by the NATIONAL HUMAN GENOME RESEARCH...

  15. 75 FR 32957 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-06-10

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Special Emphasis Panel, Protein Resource RFA... of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes...

  16. 75 FR 8977 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-02-26

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed...: Camilla E. Day, PhD, Scientific Review Officer, CIDR, National Human Genome Research Institute, National... . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes...

  17. 77 FR 74676 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-12-17

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... Person: Camilla E. Day, Ph.D., Scientific Review Officer, CIDR, National Human Genome Research Institute...@nih.gov . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome...

  18. Multiple models for Rosaceae genomics.

    PubMed

    Shulaev, Vladimir; Korban, Schuyler S; Sosinski, Bryon; Abbott, Albert G; Aldwinckle, Herb S; Folta, Kevin M; Iezzoni, Amy; Main, Dorrie; Arús, Pere; Dandekar, Abhaya M; Lewers, Kim; Brown, Susan K; Davis, Thomas M; Gardiner, Susan E; Potter, Daniel; Veilleux, Richard E

    2008-07-01

    The plant family Rosaceae consists of over 100 genera and 3,000 species that include many important fruit, nut, ornamental, and wood crops. Members of this family provide high-value nutritional foods and contribute desirable aesthetic and industrial products. Most rosaceous crops have been enhanced by human intervention through sexual hybridization, asexual propagation, and genetic improvement since ancient times, 4,000 to 5,000 B.C. Modern breeding programs have contributed to the selection and release of numerous cultivars having significant economic impact on the U.S. and world markets. In recent years, the Rosaceae community, both in the United States and internationally, has benefited from newfound organization and collaboration that have hastened progress in developing genetic and genomic resources for representative crops such as apple (Malus spp.), peach (Prunus spp.), and strawberry (Fragaria spp.). These resources, including expressed sequence tags, bacterial artificial chromosome libraries, physical and genetic maps, and molecular markers, combined with genetic transformation protocols and bioinformatics tools, have rendered various rosaceous crops highly amenable to comparative and functional genomics studies. This report serves as a synopsis of the resources and initiatives of the Rosaceae community, recent developments in Rosaceae genomics, and plans to apply newly accumulated knowledge and resources toward breeding and crop improvement.

  19. Marine genomics: News and views.

    PubMed

    Ribeiro, Ângela M; Foote, Andrew D; Kupczok, Anne; Frazão, Bárbara; Limborg, Morten T; Piñeiro, Rosalía; Abalde, Samuel; Rocha, Sara; da Fonseca, Rute R

    2017-02-01

    Marine ecosystems occupy 71% of the surface of our planet, yet we know little about their diversity. Although the inventory of species is continually increasing, as registered by the Census of Marine Life program, only about 10% of the estimated two million marine species are known. This lag between observed and estimated diversity is in part due to the elusiveness of most aquatic species and the technical difficulties of exploring extreme environments, as for instance the abyssal plains and polar waters. In the last decade, the rapid development of affordable and flexible high-throughput sequencing approaches have been helping to improve our knowledge of marine biodiversity, from the rich microbial biota that forms the base of the tree of life to a wealth of plant and animal species. In this review, we present an overview of the applications of genomics to the study of marine life, from evolutionary biology of non-model organisms to species of commercial relevance for fishing, aquaculture and biomedicine. Instead of providing an exhaustive list of available genomic data, we rather set to present contextualized examples that best represent the current status of the field of marine genomics.

  20. Brad Ozenberger, Ph.D., Presents the Achievements of The Cancer Genome Atlas During its Early Years - TCGA

    Cancer.gov

    Dr. Brad Ozenberger, former TCGA Program Director for the National Human Genome Research Institute, describes the goals and achievements of TCGA during its pilot phase, which involved the genomic characterization of brain, ovarian, and lung cancers.

  1. State of cat genomics.

    PubMed

    O'Brien, Stephen J; Johnson, Warren; Driscoll, Carlos; Pontius, Joan; Pecon-Slattery, Jill; Menotti-Raymond, Marilyn

    2008-06-01

    Our knowledge of cat family biology was recently expanded to include a genomics perspective with the completion of a draft whole genome sequence of an Abyssinian cat. The utility of the new genome information has been demonstrated by applications ranging from disease gene discovery and comparative genomics to species conservation. Patterns of genomic organization among cats and inbred domestic cat breeds have illuminated our view of domestication, revealing linkage disequilibrium tracks consequent of breed formation, defining chromosome exchanges that punctuated major lineages of mammals and suggesting ancestral continental migration events that led to 37 modern species of Felidae. We review these recent advances here. As the genome resources develop, the cat is poised to make a major contribution to many areas in genetics and biology.

  2. Next Generation Characterisation of Cereal Genomes for Marker Discovery

    PubMed Central

    Visendi, Paul; Batley, Jacqueline; Edwards, David

    2013-01-01

    Cereal crops form the bulk of the world’s food sources, and thus their importance cannot be understated. Crop breeding programs increasingly rely on high-resolution molecular genetic markers to accelerate the breeding process. The development of these markers is hampered by the complexity of some of the major cereal crop genomes, as well as the time and cost required. In this review, we address current and future methods available for the characterisation of cereal genomes, with an emphasis on faster and more cost effective approaches for genome sequencing and the development of markers for trait association and marker assisted selection (MAS) in crop breeding programs. PMID:24833229

  3. Genomic sequencing of Pleistocene cave bears

    SciTech Connect

    Noonan, James P.; Hofreiter, Michael; Smith, Doug; Priest, JamesR.; Rohland, Nadin; Rabeder, Gernot; Krause, Johannes; Detter, J. Chris; Paabo, Svante; Rubin, Edward M.

    2005-04-01

    Despite the information content of genomic DNA, ancient DNA studies to date have largely been limited to amplification of mitochondrial DNA due to technical hurdles such as contamination and degradation of ancient DNAs. In this study, we describe two metagenomic libraries constructed using unamplified DNA extracted from the bones of two 40,000-year-old extinct cave bears. Analysis of {approx}1 Mb of sequence from each library showed that, despite significant microbial contamination, 5.8 percent and 1.1 percent of clones in the libraries contain cave bear inserts, yielding 26,861 bp of cave bear genome sequence. Alignment of this sequence to the dog genome, the closest sequenced genome to cave bear in terms of evolutionary distance, revealed roughly the expected ratio of cave bear exons, repeats and conserved noncoding sequences. Only 0.04 percent of all clones sequenced were derived from contamination with modern human DNA. Comparison of cave bear with orthologous sequences from several modern bear species revealed the evolutionary relationship of these lineages. Using the metagenomic approach described here, we have recovered substantial quantities of mammalian genomic sequence more than twice as old as any previously reported, establishing the feasibility of ancient DNA genomic sequencing programs.

  4. The UCSC Genome Browser database: 2017 update.

    PubMed

    Tyner, Cath; Barber, Galt P; Casper, Jonathan; Clawson, Hiram; Diekhans, Mark; Eisenhart, Christopher; Fischer, Clayton M; Gibson, David; Gonzalez, Jairo Navarro; Guruvadoo, Luvina; Haeussler, Maximilian; Heitner, Steve; Hinrichs, Angie S; Karolchik, Donna; Lee, Brian T; Lee, Christopher M; Nejad, Parisa; Raney, Brian J; Rosenbloom, Kate R; Speir, Matthew L; Villarreal, Chris; Vivian, John; Zweig, Ann S; Haussler, David; Kuhn, Robert M; Kent, W James

    2017-01-04

    Since its 2001 debut, the University of California, Santa Cruz (UCSC) Genome Browser (http://genome.ucsc.edu/) team has provided continuous support to the international genomics and biomedical communities through a web-based, open source platform designed for the fast, scalable display of sequence alignments and annotations landscaped against a vast collection of quality reference genome assemblies. The browser's publicly accessible databases are the backbone of a rich, integrated bioinformatics tool suite that includes a graphical interface for data queries and downloads, alignment programs, command-line utilities and more. This year's highlights include newly designed home and gateway pages; a new 'multi-region' track display configuration for exon-only, gene-only and custom regions visualization; new genome browsers for three species (brown kiwi, crab-eating macaque and Malayan flying lemur); eight updated genome assemblies; extended support for new data types such as CRAM, RNA-seq expression data and long-range chromatin interaction pairs; and the unveiling of a new supported mirror site in Japan.

  5. The UCSC Genome Browser database: 2017 update

    PubMed Central

    Tyner, Cath; Barber, Galt P.; Casper, Jonathan; Clawson, Hiram; Diekhans, Mark; Eisenhart, Christopher; Fischer, Clayton M.; Gibson, David; Gonzalez, Jairo Navarro; Guruvadoo, Luvina; Haeussler, Maximilian; Heitner, Steve; Hinrichs, Angie S.; Karolchik, Donna; Lee, Brian T.; Lee, Christopher M.; Nejad, Parisa; Raney, Brian J.; Rosenbloom, Kate R.; Speir, Matthew L.; Villarreal, Chris; Vivian, John; Zweig, Ann S.; Haussler, David; Kuhn, Robert M.; Kent, W. James

    2017-01-01

    Since its 2001 debut, the University of California, Santa Cruz (UCSC) Genome Browser (http://genome.ucsc.edu/) team has provided continuous support to the international genomics and biomedical communities through a web-based, open source platform designed for the fast, scalable display of sequence alignments and annotations landscaped against a vast collection of quality reference genome assemblies. The browser's publicly accessible databases are the backbone of a rich, integrated bioinformatics tool suite that includes a graphical interface for data queries and downloads, alignment programs, command-line utilities and more. This year's highlights include newly designed home and gateway pages; a new ‘multi-region’ track display configuration for exon-only, gene-only and custom regions visualization; new genome browsers for three species (brown kiwi, crab-eating macaque and Malayan flying lemur); eight updated genome assemblies; extended support for new data types such as CRAM, RNA-seq expression data and long-range chromatin interaction pairs; and the unveiling of a new supported mirror site in Japan. PMID:27899642

  6. [Landscape and ecological genomics].

    PubMed

    Tetushkin, E Ia

    2013-10-01

    Landscape genomics is the modern version of landscape genetics, a discipline that arose approximately 10 years ago as a combination of population genetics, landscape ecology, and spatial statistics. It studies the effects of environmental variables on gene flow and other microevolutionary processes that determine genetic connectivity and variations in populations. In contrast to population genetics, it operates at the level of individual specimens rather than at the level of population samples. Another important difference between landscape genetics and genomics and population genetics is that, in the former, the analysis of gene flow and local adaptations takes quantitative account of landforms and features of the matrix, i.e., hostile spaces that separate species habitats. Landscape genomics is a part of population ecogenomics, which, along with community genomics, is a major part of ecological genomics. One of the principal purposes of landscape genomics is the identification and differentiation of various genome-wide and locus-specific effects. The approaches and computation tools developed for combined analysis of genomic and landscape variables make it possible to detect adaptation-related genome fragments, which facilitates the planning of conservation efforts and the prediction of species' fate in response to expected changes in the environment.

  7. Genomics of Clostridium tetani.

    PubMed

    Brüggemann, Holger; Brzuszkiewicz, Elzbieta; Chapeton-Montes, Diana; Plourde, Lucile; Speck, Denis; Popoff, Michel R

    2015-05-01

    Genomic information about Clostridium tetani, the causative agent of the tetanus disease, is scarce. The genome of strain E88, a strain used in vaccine production, was sequenced about 10 years ago. One additional genome (strain 12124569) has recently been released. Here we report three new genomes of C. tetani and describe major differences among all five C. tetani genomes. They all harbor tetanus-toxin-encoding plasmids that contain highly conserved genes for TeNT (tetanus toxin), TetR (transcriptional regulator of TeNT) and ColT (collagenase), but substantially differ in other plasmid regions. The chromosomes share a large core genome that contains about 85% of all genes of a given chromosome. The non-core chromosome comprises mainly prophage-like genomic regions and genes encoding environmental interaction and defense functions (e.g. surface proteins, restriction-modification systems, toxin-antitoxin systems, CRISPR/Cas systems) and other fitness functions (e.g. transport systems, metabolic activities). This new genome information will help to assess the level of genome plasticity of the species C. tetani and provide the basis for detailed comparative studies.

  8. Between Two Fern Genomes

    PubMed Central

    2014-01-01

    Ferns are the only major lineage of vascular plants not represented by a sequenced nuclear genome. This lack of genome sequence information significantly impedes our ability to understand and reconstruct genome evolution not only in ferns, but across all land plants. Azolla and Ceratopteris are ideal and complementary candidates to be the first ferns to have their nuclear genomes sequenced. They differ dramatically in genome size, life history, and habit, and thus represent the immense diversity of extant ferns. Together, this pair of genomes will facilitate myriad large-scale comparative analyses across ferns and all land plants. Here we review the unique biological characteristics of ferns and describe a number of outstanding questions in plant biology that will benefit from the addition of ferns to the set of taxa with sequenced nuclear genomes. We explain why the fern clade is pivotal for understanding genome evolution across land plants, and we provide a rationale for how knowledge of fern genomes will enable progress in research beyond the ferns themselves. PMID:25324969

  9. Between two fern genomes.

    PubMed

    Sessa, Emily B; Banks, Jo Ann; Barker, Michael S; Der, Joshua P; Duffy, Aaron M; Graham, Sean W; Hasebe, Mitsuyasu; Langdale, Jane; Li, Fay-Wei; Marchant, D Blaine; Pryer, Kathleen M; Rothfels, Carl J; Roux, Stanley J; Salmi, Mari L; Sigel, Erin M; Soltis, Douglas E; Soltis, Pamela S; Stevenson, Dennis W; Wolf, Paul G

    2014-01-01

    Ferns are the only major lineage of vascular plants not represented by a sequenced nuclear genome. This lack of genome sequence information significantly impedes our ability to understand and reconstruct genome evolution not only in ferns, but across all land plants. Azolla and Ceratopteris are ideal and complementary candidates to be the first ferns to have their nuclear genomes sequenced. They differ dramatically in genome size, life history, and habit, and thus represent the immense diversity of extant ferns. Together, this pair of genomes will facilitate myriad large-scale comparative analyses across ferns and all land plants. Here we review the unique biological characteristics of ferns and describe a number of outstanding questions in plant biology that will benefit from the addition of ferns to the set of taxa with sequenced nuclear genomes. We explain why the fern clade is pivotal for understanding genome evolution across land plants, and we provide a rationale for how knowledge of fern genomes will enable progress in research beyond the ferns themselves.

  10. Genomics of pear and other Rosaceae fruit trees

    PubMed Central

    Yamamoto, Toshiya; Terakami, Shingo

    2016-01-01

    The family Rosaceae includes many economically important fruit trees, such as pear, apple, peach, cherry, quince, apricot, plum, raspberry, and loquat. Over the past few years, whole-genome sequences have been released for Chinese pear, European pear, apple, peach, Japanese apricot, and strawberry. These sequences help us to conduct functional and comparative genomics studies and to develop new cultivars with desirable traits by marker-assisted selection in breeding programs. These genomics resources also allow identification of evolutionary relationships in Rosaceae, development of genome-wide SNP and SSR markers, and construction of reference genetic linkage maps, which are available through the Genome Database for the Rosaceae website. Here, we review the recent advances in genomics studies and their practical applications for Rosaceae fruit trees, particularly pear, apple, peach, and cherry. PMID:27069399

  11. Genomics of pear and other Rosaceae fruit trees.

    PubMed

    Yamamoto, Toshiya; Terakami, Shingo

    2016-01-01

    The family Rosaceae includes many economically important fruit trees, such as pear, apple, peach, cherry, quince, apricot, plum, raspberry, and loquat. Over the past few years, whole-genome sequences have been released for Chinese pear, European pear, apple, peach, Japanese apricot, and strawberry. These sequences help us to conduct functional and comparative genomics studies and to develop new cultivars with desirable traits by marker-assisted selection in breeding programs. These genomics resources also allow identification of evolutionary relationships in Rosaceae, development of genome-wide SNP and SSR markers, and construction of reference genetic linkage maps, which are available through the Genome Database for the Rosaceae website. Here, we review the recent advances in genomics studies and their practical applications for Rosaceae fruit trees, particularly pear, apple, peach, and cherry.

  12. Exploring Horizons for Domestic Animal Genomics: Workshop Summary

    SciTech Connect

    Board on Agriculture and Natural Resources, Board on Life Sciences, Division on Earth and Life Studies, National Research Council by Robert Pool and Kim Waddell

    2002-09-03

    Recognizing the important contributions that genomics analysis can make to agriculture,production and companion animal science, evolutionary biology and human health with respect to the creation of models for genetic disorders, the National Academies convened a group of individuals to plan a public workshop that would (1) assess these contributions; (2) identify potential research directions for existing genomics programs; and (3) highlight the opportunities of a coordinated, multi-species genomics effort for the science and policymaking communities. Their efforts culminated in a workshop the goal of which was to focus on domestic animal genomics and its integration with other genomics and functional genomics projects. A summary and synthesis of the discussion was produced and is a factual account of what occurred at the workshop.

  13. MIPS plant genome information resources.

    PubMed

    Spannagl, Manuel; Haberer, Georg; Ernst, Rebecca; Schoof, Heiko; Mayer, Klaus F X

    2007-01-01

    The Munich Institute for Protein Sequences (MIPS) has been involved in maintaining plant genome databases since the Arabidopsis thaliana genome project. Genome databases and analysis resources have focused on individual genomes and aim to provide flexible and maintainable data sets for model plant genomes as a backbone against which experimental data, for example from high-throughput functional genomics, can be organized and evaluated. In addition, model genomes also form a scaffold for comparative genomics, and much can be learned from genome-wide evolutionary studies.

  14. Home - The Cancer Genome Atlas - Cancer Genome - TCGA

    Cancer.gov

    The Cancer Genome Atlas (TCGA) is a comprehensive and coordinated effort to accelerate our understanding of the molecular basis of cancer through the application of genome analysis technologies, including large-scale genome sequencing.

  15. The Drosophila genome nexus: a population genomic resource of 623 Drosophila melanogaster genomes, including 197 from a single ancestral range population.

    PubMed

    Lack, Justin B; Cardeno, Charis M; Crepeau, Marc W; Taylor, William; Corbett-Detig, Russell B; Stevens, Kristian A; Langley, Charles H; Pool, John E

    2015-04-01

    Hundreds of wild-derived Drosophila melanogaster genomes have been published, but rigorous comparisons across data sets are precluded by differences in alignment methodology. The most common approach to reference-based genome assembly is a single round of alignment followed by quality filtering and variant detection. We evaluated variations and extensions of this approach and settled on an assembly strategy that utilizes two alignment programs and incorporates both substitutions and short indels to construct an updated reference for a second round of mapping prior to final variant detection. Utilizing this approach, we reassembled published D. melanogaster population genomic data sets and added unpublished genomes from several sub-Saharan populations. Most notably, we present aligned data from phase 3 of the Drosophila Population Genomics Project (DPGP3), which provides 197 genomes from a single ancestral range population of D. melanogaster (from Zambia). The large sample size, high genetic diversity, and potentially simpler demographic history of the DPGP3 sample will make this a highly valuable resource for fundamental population genetic research. The complete set of assemblies described here, termed the Drosophila Genome Nexus, presently comprises 623 consistently aligned genomes and is publicly available in multiple formats with supporting documentation and bioinformatic tools. This resource will greatly facilitate population genomic analysis in this model species by reducing the methodological differences between data sets.

  16. The emerging genomics and systems biology research lead to systems genomics studies.

    PubMed

    Yang, Mary Qu; Yoshigoe, Kenji; Yang, William; Tong, Weida; Qin, Xiang; Dunker, A; Chen, Zhongxue; Arbania, Hamid R; Liu, Jun S; Niemierko, Andrzej; Yang, Jack Y

    2014-01-01

    Synergistically integrating multi-layer genomic data at systems level not only can lead to deeper insights into the molecular mechanisms related to disease initiation and progression, but also can guide pathway-based biomarker and drug target identification. With the advent of high-throughput next-generation sequencing technologies, sequencing both DNA and RNA has generated multi-layer genomic data that can provide DNA polymorphism, non-coding RNA, messenger RNA, gene expression, isoform and alternative splicing information. Systems biology on the other hand studies complex biological systems, particularly systematic study of complex molecular interactions within specific cells or organisms. Genomics and molecular systems biology can be merged into the study of genomic profiles and implicated biological functions at cellular or organism level. The prospectively emerging field can be referred to as systems genomics or genomic systems biology. The Mid-South Bioinformatics Centre (MBC) and Joint Bioinformatics Ph.D. Program of University of Arkansas at Little Rock and University of Arkansas for Medical Sciences are particularly interested in promoting education and research advancement in this prospectively emerging field. Based on past investigations and research outcomes, MBC is further utilizing differential gene and isoform/exon expression from RNA-seq and co-regulation from the ChiP-seq specific for different phenotypes in combination with protein-protein interactions, and protein-DNA interactions to construct high-level gene networks for an integrative genome-phoneme investigation at systems biology level.

  17. TNF-α modulates genome-wide redistribution of ΔNp63α/TAp73 and NF-κB c-REL interactive binding on TP53 and AP-1 motifs to promote an oncogenic gene program in squamous cancer

    PubMed Central

    Si, Han; Lu, Hai; Yang, Xinping; Mattox, Austin; Jang, Minyoung; Bian, Yansong; Sano, Eleanor; Viadiu, Hector; Yan, Bin; Yau, Christina; Ng, Sam; Lee, Steven K.; Romano, Rose-Anne; Davis, Sean; Walker, Robert L.; Xiao, Wenming; Sun, Hongwei; Wei, Lai; Sinha, Satrajit; Benz, Christopher C; Stuart, Joshua M.; Meltzer, Paul S.; Van Waes, Carter; Chen, Zhong

    2016-01-01

    The Cancer Genome Atlas (TCGA) network study of 12 cancer types (PanCancer 12) revealed frequent mutation of TP53, and amplification and expression of related TP63 isoform ΔNp63 in squamous cancers. Further, aberrant expression of inflammatory genes and TP53/p63/p73 targets were detected in the PanCancer 12 project, reminiscent of gene programs co-modulated by cREL/ΔNp63/TAp73 transcription factors we uncovered in head and neck squamous cell carcinomas (HNSCC). However, how inflammatory gene signatures and cREL/p63/p73 targets are co-modulated genome-wide is unclear. Here, we examined how inflammatory factor TNF-α broadly modulates redistribution of cREL with ΔNp63α/TAp73 complexes and signatures genome-wide in the HNSCC model UM-SCC46 using chromatin immunoprecipitation sequencing (ChIP-seq). TNF-α enhanced genome-wide co-occupancy of cREL with ΔNp63α on TP53/p63 sites, while unexpectedly promoting redistribution of TAp73 from TP53 to Activator Protein-1 (AP-1) sites. cREL, ΔNp63α, and TAp73 binding and oligomerization on NF-κB, TP53 or AP-1 specific sequences were independently validated by ChIP-qPCR, oligonucleotide-binding assays, and analytical ultracentrifugation. Function of the binding activity was confirmed using TP53, AP-1, and NF-κB specific response elements, or p21, SERPINE1, and IL-6 promoter luciferase reporter activities. Concurrently, TNF-α regulated a broad gene network with co-binding activities for cREL, ΔNp63α, and TAp73 observed upon array profiling and RT-PCR. Overlapping target gene signatures were observed in squamous cancer subsets and in inflamed skin of transgenic mice overexpressing ΔNp63α. Furthermore, multiple target genes identified in this study were linked to TP63 and TP73 activity and increased gene expression in large squamous cancer samples from PanCancer 12 TCGA by CircleMap. PARADIGM inferred pathway analysis revealed the network connection of TP63 and NF-κB complexes through an AP-1 hub, further supporting

  18. Integrative genomic analysis by interoperation of bioinformatics tools in GenomeSpace

    PubMed Central

    Thorvaldsdottir, Helga; Liefeld, Ted; Ocana, Marco; Borges-Rivera, Diego; Pochet, Nathalie; Robinson, James T.; Demchak, Barry; Hull, Tim; Ben-Artzi, Gil; Blankenberg, Daniel; Barber, Galt P.; Lee, Brian T.; Kuhn, Robert M.; Nekrutenko, Anton; Segal, Eran; Ideker, Trey; Reich, Michael; Regev, Aviv; Chang, Howard Y.; Mesirov, Jill P.

    2015-01-01

    Integrative analysis of multiple data types to address complex biomedical questions requires the use of multiple software tools in concert and remains an enormous challenge for most of the biomedical research community. Here we introduce GenomeSpace (http://www.genomespace.org), a cloud-based, cooperative community resource. Seeded as a collaboration of six of the most popular genomics analysis tools, GenomeSpace now supports the streamlined interaction of 20 bioinformatics tools and data resources. To facilitate the ability of non-programming users’ to leverage GenomeSpace in integrative analysis, it offers a growing set of ‘recipes’, short workflows involving a few tools and steps to guide investigators through high utility analysis tasks. PMID:26780094

  19. Genomics of Disease

    Technology Transfer Automated Retrieval System (TEKTRAN)

    This edited book represents the 23rd symposium in the Stadler Genetics Symposia series, and the general theme of this conference was "The Genomics of Disease." The 24 national and international speakers were invited to discuss their world-class research into the advances that genomics has made on c...

  20. Genetics and Genomics

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Good progress is being made on genetics and genomics of sugar beet, however it is in process and the tools are now being generated and some results are being analyzed. The GABI BeetSeq project released a first draft of the sugar beet genome of KWS2320, a dihaploid (see http://bvseq.molgen.mpg.de/Gen...

  1. Automated Microbial Genome Annotation

    SciTech Connect

    Land, Miriam

    2009-05-29

    Miriam Land of the DOE Joint Genome Institute at Oak Ridge National Laboratory gives a talk on the current state and future challenges of moving toward automated microbial genome annotation at the "Sequencing, Finishing, Analysis in the Future" meeting in Santa Fe, NM

  2. Genomics for Weed Science

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Numerous genomic-based studies have provided insight to the physiological and evolutionary processes involved in developmental and environmental processes of model plants such as arabidopsis and rice. However, far fewer efforts have been attempted to use genomic resources to study physiological and ...

  3. Unlocking the bovine genome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The draft genome sequence of cattle (Bos taurus) has now been analyzed by the Bovine Genome Sequencing and Analysis Consortium and the Bovine HapMap Consortium, which together represent an extensive collaboration involving more than 300 scientists from 25 different countries. ...

  4. The Future of Microbial Genomics

    SciTech Connect

    Kyrpides, Nikos

    2010-06-02

    Nikos Kyrpides, head of the Genome Biology group at the DOE Joint Genome Institute discusses current challenges in the field of microbial genomics on June 2, 2010 at the "Sequencing, Finishing, Analysis in the Future" meeting in Santa Fe, NM

  5. The UCSC Genome Browser

    PubMed Central

    Karolchik, Donna; Hinrichs, Angie S.; Kent, W. James

    2011-01-01

    The University of California Santa Cruz (UCSC) Genome Browser is a popular Web-based tool for quickly displaying a requested portion of a genome at any scale, accompanied by a series of aligned annotation “tracks.” The annotations generated by the UCSC Genome Bioinformatics Group and external collaborators include gene predictions, mRNA and expressed sequence tag alignments, simple nucleotide polymorphisms, expression and regulatory data, phenotype and variation data, and pairwise and multiple-species comparative genomics data. All information relevant to a region is presented in one window, facilitating biological analysis and interpretation. The database tables underlying the Genome Browser tracks can be viewed, downloaded, and manipulated using another Web-based application, the UCSC Table Browser. Users can upload personal datasets in a wide variety of formats as custom annotation tracks in both browsers for research or educational purposes. PMID:21975940

  6. AutoGenomics, Inc.

    PubMed

    Vairavan, Ram

    2004-07-01

    AutoGenomics has created an automated multiplexing microarray platform to make genomic and proteomic analyses routine and efficient for clinical and research laboratories. While the emergence of microarrays has advanced genomic analyses, a number of underlying issues, such as cross-hybridization, poor spot morphology and intrinsic fluorescence of the solid substrate, have yet to be fully resolved. Current methods use discrete instrumentation, are manual and require highly skilled labor, which leads to inconsistent results. AutoGenomics' automated platform uses a three-dimensional BioFilmChip microarray to circumvent these issues, providing optimal spot morphology and utilizing solution-based hybridization with allele-specific primer extension to improve single-base discrimination. AutoGenomics is developing applications for the early detection and management of complex disease states in oncology, cardiology, and mental disorders. Customers include clinical reference laboratories, hospitals, academic institutions, and pharmaceutical and biotech companies. Founded in 1999, the company is headquartered in Carlsbad, California, USA.

  7. Microbial Genomes Multiply

    NASA Technical Reports Server (NTRS)

    Doolittle, Russell F.

    2002-01-01

    The publication of the first complete sequence of a bacterial genome in 1995 was a signal event, underscored by the fact that the article has been cited more than 2,100 times during the intervening seven years. It was a marvelous technical achievement, made possible by automatic DNA-sequencing machines. The feat is the more impressive in that complete genome sequencing has now been adopted in many different laboratories around the world. Four years ago in these columns I examined the situation after a dozen microbial genomes had been completed. Now, with upwards of 60 microbial genome sequences determined and twice that many in progress, it seems reasonable to assess just what is being learned. Are new concepts emerging about how cells work? Have there been practical benefits in the fields of medicine and agriculture? Is it feasible to determine the genomic sequence of every bacterial species on Earth? The answers to these questions maybe Yes, Perhaps, and No, respectively.

  8. Comparative genomics of nematodes.

    PubMed

    Mitreva, Makedonka; Blaxter, Mark L; Bird, David M; McCarter, James P

    2005-10-01

    Recent transcriptome and genome projects have dramatically expanded the biological data available across the phylum Nematoda. Here we summarize analyses of these sequences, which have revealed multiple unexpected results. Despite a uniform body plan, nematodes are more diverse at the molecular level than was previously recognized, with many species- and group-specific novel genes. In the genus Caenorhabditis, changes in chromosome arrangement, particularly local inversions, are also rapid, with breakpoints occurring at 50-fold the rate in vertebrates. Tylenchid plant parasitic nematode genomes contain several genes closely related to genes in bacteria, implicating horizontal gene transfer events in the origins of plant parasitism. Functional genomics techniques are also moving from Caenorhabditis elegans to application throughout the phylum. Soon, eight more draft nematode genome sequences will be available. This unique resource will underpin both molecular understanding of these most abundant metazoan organisms and aid in the examination of the dynamics of genome evolution in animals.

  9. Entering the Public Health Genomics Era: Why Must Health Educators Develop Genomic Competencies?

    ERIC Educational Resources Information Center

    Chen, Lei-Shih; Goodson, Patricia

    2007-01-01

    Although the completion of the Human Genome Project will offer new insight into diseases and help develop efficient, personalized treatment or prevention programs, it will also raise new and non-trivial public health issues. Many of these issues fall under the professional purview of public health workers. As members of the public health…

  10. Genomic sequencing of Pleistocene cave bears.

    PubMed

    Noonan, James P; Hofreiter, Michael; Smith, Doug; Priest, James R; Rohland, Nadin; Rabeder, Gernot; Krause, Johannes; Detter, J Chris; Pääbo, Svante; Rubin, Edward M

    2005-07-22

    Despite the greater information content of genomic DNA, ancient DNA studies have largely been limited to the amplification of mitochondrial sequences. Here we describe metagenomic libraries constructed with unamplified DNA extracted from skeletal remains of two 40,000-year-old extinct cave bears. Analysis of approximately 1 megabase of sequence from each library showed that despite significant microbial contamination, 5.8 and 1.1% of clones contained cave bear inserts, yielding 26,861 base pairs of cave bear genome sequence. Comparison of cave bear and modern bear sequences revealed the evolutionary relationship of these lineages. The metagenomic approach used here establishes the feasibility of ancient DNA genome sequencing programs.

  11. Implementing genomic medicine in the clinic: the future is here

    PubMed Central

    Manolio, Teri A.; Chisholm, Rex L.; Ozenberger, Brad; Roden, Dan M.; Williams, Marc S.; Wilson, Richard; Bick, David; Bottinger, Erwin P.; Brilliant, Murray H.; Eng, Charis; Frazer, Kelly A.; Korf, Bruce; Ledbetter, David H.; Lupski, James R.; Marsh, Clay; Mrazek, David; Murray, Michael F.; O'Donnell, Peter H.; Rader, Daniel J.; Relling, Mary V.; Shuldiner, Alan R.; Valle, David; Weinshilboum, Richard; Green, Eric D.; Ginsburg, Geoffrey S.

    2013-01-01

    Although the potential for genomics to contribute to clinical care has long been anticipated, the pace of defining the risks and benefits of incorporating genomic findings into medical practice has been relatively slow. Several institutions have recently begun genomic medicine programs, encountering many of the same obstacles and developing the same solutions, often independently. Recognizing that successful early experiences can inform subsequent efforts, the National Human Genome Research Institute brought together a number of these groups to describe their ongoing projects and challenges, identify common infrastructure and research needs, and outline an implementation framework for investigating and introducing similar programs elsewhere. Chief among the challenges were limited evidence and consensus on which genomic variants were medically relevant; lack of reimbursement for genomically driven interventions; and burden to patients and clinicians of assaying, reporting, intervening, and following up genomic findings. Key infrastructure needs included an openly accessible knowledge base capturing sequence variants and their phenotypic associations and a framework for defining and cataloging clinically actionable variants. Multiple institutions are actively engaged in using genomic information in clinical care. Much of this work is being done in isolation and would benefit from more structured collaboration and sharing of best practices. Genet Med 2013:15(4):258–267 PMID:23306799

  12. Training in Psychiatric Genomics during Residency: A New Challenge

    ERIC Educational Resources Information Center

    Winner, Joel G.; Goebert, Deborah; Matsu, Courtenay; Mrazek, David A.

    2010-01-01

    Objective: The authors ascertained the amount of training in psychiatric genomics that is provided in North American psychiatric residency programs. Methods: A sample of 217 chief residents in psychiatric residency programs in the United States and Canada were identified by e-mail and surveyed to assess their training in psychiatric genetics and…

  13. Genomic Dark Matter Sheds Light on EVI1-driven Leukemia

    PubMed Central

    Koche, Richard; Armstrong, Scott A.

    2014-01-01

    The orchestration of transcriptional programs depends on proper gene-enhancer pairing. While much remains to be learned about this process in normal development, two recent studies in Cell and Cancer Cell highlight how the genomic rearrangement of an enhancer plays a causal role in the onset of a leukemogenic program. PMID:24735919

  14. NCBI viral genomes resource.

    PubMed

    Brister, J Rodney; Ako-Adjei, Danso; Bao, Yiming; Blinkova, Olga

    2015-01-01

    Recent technological innovations have ignited an explosion in virus genome sequencing that promises to fundamentally alter our understanding of viral biology and profoundly impact public health policy. Yet, any potential benefits from the billowing cloud of next generation sequence data hinge upon well implemented reference resources that facilitate the identification of sequences, aid in the assembly of sequence reads and provide reference annotation sources. The NCBI Viral Genomes Resource is a reference resource designed to bring order to this sequence shockwave and improve usability of viral sequence data. The resource can be accessed at http://www.ncbi.nlm.nih.gov/genome/viruses/ and catalogs all publicly available virus genome sequences and curates reference genome sequences. As the number of genome sequences has grown, so too have the difficulties in annotating and maintaining reference sequences. The rapid expansion of the viral sequence universe has forced a recalibration of the data model to better provide extant sequence representation and enhanced reference sequence products to serve the needs of the various viral communities. This, in turn, has placed increased emphasis on leveraging the knowledge of individual scientific communities to identify important viral sequences and develop well annotated reference virus genome sets.

  15. The banana genome hub.

    PubMed

    Droc, Gaëtan; Larivière, Delphine; Guignon, Valentin; Yahiaoui, Nabila; This, Dominique; Garsmeur, Olivier; Dereeper, Alexis; Hamelin, Chantal; Argout, Xavier; Dufayard, Jean-François; Lengelle, Juliette; Baurens, Franc-Christophe; Cenci, Alberto; Pitollat, Bertrand; D'Hont, Angélique; Ruiz, Manuel; Rouard, Mathieu; Bocs, Stéphanie

    2013-01-01

    Banana is one of the world's favorite fruits and one of the most important crops for developing countries. The banana reference genome sequence (Musa acuminata) was recently released. Given the taxonomic position of Musa, the completed genomic sequence has particular comparative value to provide fresh insights about the evolution of the monocotyledons. The study of the banana genome has been enhanced by a number of tools and resources that allows harnessing its sequence. First, we set up essential tools such as a Community Annotation System, phylogenomics resources and metabolic pathways. Then, to support post-genomic efforts, we improved banana existing systems (e.g. web front end, query builder), we integrated available Musa data into generic systems (e.g. markers and genetic maps, synteny blocks), we have made interoperable with the banana hub, other existing systems containing Musa data (e.g. transcriptomics, rice reference genome, workflow manager) and finally, we generated new results from sequence analyses (e.g. SNP and polymorphism analysis). Several uses cases illustrate how the Banana Genome Hub can be used to study gene families. Overall, with this collaborative effort, we discuss the importance of the interoperability toward data integration between existing information systems. Database URL: http://banana-genome.cirad.fr/

  16. Genomic Insights into Bifidobacteria

    PubMed Central

    Lee, Ju-Hoon; O'Sullivan, Daniel J.

    2010-01-01

    Summary: Since the discovery in 1899 of bifidobacteria as numerically dominant microbes in the feces of breast-fed infants, there have been numerous studies addressing their role in modulating gut microflora as well as their other potential health benefits. Because of this, they are frequently incorporated into foods as probiotic cultures. An understanding of their full interactions with intestinal microbes and the host is needed to scientifically validate any health benefits they may afford. Recently, the genome sequences of nine strains representing four species of Bifidobacterium became available. A comparative genome analysis of these genomes reveals a likely efficient capacity to adapt to their habitats, with B. longum subsp. infantis exhibiting more genomic potential to utilize human milk oligosaccharides, consistent with its habitat in the infant gut. Conversely, B. longum subsp. longum exhibits a higher genomic potential for utilization of plant-derived complex carbohydrates and polyols, consistent with its habitat in an adult gut. An intriguing observation is the loss of much of this genome potential when strains are adapted to pure culture environments, as highlighted by the genomes of B. animalis subsp. lactis strains, which exhibit the least potential for a gut habitat and are believed to have evolved from the B. animalis species during adaptation to dairy fermentation environments. PMID:20805404

  17. Ensembl comparative genomics resources.

    PubMed

    Herrero, Javier; Muffato, Matthieu; Beal, Kathryn; Fitzgerald, Stephen; Gordon, Leo; Pignatelli, Miguel; Vilella, Albert J; Searle, Stephen M J; Amode, Ridwan; Brent, Simon; Spooner, William; Kulesha, Eugene; Yates, Andrew; Flicek, Paul

    2016-01-01

    Evolution provides the unifying framework with which to understand biology. The coherent investigation of genic and genomic data often requires comparative genomics analyses based on whole-genome alignments, sets of homologous genes and other relevant datasets in order to evaluate and answer evolutionary-related questions. However, the complexity and computational requirements of producing such data are substantial: this has led to only a small number of reference resources that are used for most comparative analyses. The Ensembl comparative genomics resources are one such reference set that facilitates comprehensive and reproducible analysis of chordate genome data. Ensembl computes pairwise and multiple whole-genome alignments from which large-scale synteny, per-base conservation scores and constrained elements are obtained. Gene alignments are used to define Ensembl Protein Families, GeneTrees and homologies for both protein-coding and non-coding RNA genes. These resources are updated frequently and have a consistent informatics infrastructure and data presentation across all supported species. Specialized web-based visualizations are also available including synteny displays, collapsible gene tree plots, a gene family locator and different alignment views. The Ensembl comparative genomics infrastructure is extensively reused for the analysis of non-vertebrate species by other projects including Ensembl Genomes and Gramene and much of the information here is relevant to these projects. The consistency of the annotation across species and the focus on vertebrates makes Ensembl an ideal system to perform and support vertebrate comparative genomic analyses. We use robust software and pipelines to produce reference comparative data and make it freely available. Database URL: http://www.ensembl.org.

  18. What Is a Genome?

    PubMed Central

    Goldman, Aaron David; Landweber, Laura F.

    2016-01-01

    The genome is often described as the information repository of an organism. Whether millions or billions of letters of DNA, its transmission across generations confers the principal medium for inheritance of organismal traits. Several emerging areas of research demonstrate that this definition is an oversimplification. Here, we explore ways in which a deeper understanding of genomic diversity and cell physiology is challenging the concepts of physical permanence attached to the genome as well as its role as the sole information source for an organism. PMID:27442251

  19. Genetics and genomic medicine.

    PubMed

    Bogaard, Kali; Johnson, Marlene

    2009-01-01

    Genetics is playing an increasingly important role in the diagnosis, monitoring and treatment of diseases, and the expansion of genetics into health care has generated the field of genomic medicine. Health care delivery is shifting away from general diagnostic evaluation toward a generation of therapeutics based on a patient's genetic makeup. Meanwhile, the scientific community debates how best to incorporate genetics and genomic medicine into practice. While obstacles remain, the ultimate goal is to use information generated from the study of human genetics to improve disease treatment, cure and prevention. As the use of genetics in medical diagnosis and treatment increases, health care workers will require an understanding of genetics and genomic medicine.

  20. Genomic variation in maize

    SciTech Connect

    Rivin, C.J.

    1990-01-01

    We have endeavored to learn to learn how different DNA sequences and sequence arrangements contribute to genome plasticity in maize. We describe quantitative variation among maize inbred lines for tandemly arrayed and dispersed repeated DNA sequences and gene families, and qualitative variation for sequences homologous to the Mutator family of transposons. The potential of these sequences to undergo unequal crossing over, non-allelic (ectopic) recombination and transposition makes them a source of genome instability. We have found examples of rapid genomic change involving these sequences in F1 hybrids, tissue culture cells and regenerated plants.

  1. Center for Cancer Genomics | Office of Cancer Genomics

    Cancer.gov

    The Center for Cancer Genomics (CCG) was established to unify the National Cancer Institute's activities in cancer genomics, with the goal of advancing genomics research and translating findings into the clinic to improve the precise diagnosis and treatment of cancers. In addition to promoting genomic sequencing app

  2. Building international genomics collaboration for global health security

    DOE PAGES

    Cui, Helen H.; Erkkila, Tracy; Chain, Patrick S. G.; ...

    2015-12-07

    Genome science and technologies are transforming life sciences globally in many ways and becoming a highly desirable area for international collaboration to strengthen global health. The Genome Science Program at the Los Alamos National Laboratory is leveraging a long history of expertise in genomics research to assist multiple partner nations in advancing their genomics and bioinformatics capabilities. The capability development objectives focus on providing a molecular genomics-based scientific approach for pathogen detection, characterization, and biosurveillance applications. The general approaches include introduction of basic principles in genomics technologies, training on laboratory methodologies and bioinformatic analysis of resulting data, procurement, and installationmore » of next-generation sequencing instruments, establishing bioinformatics software capabilities, and exploring collaborative applications of the genomics capabilities in public health. Genome centers have been established with public health and research institutions in the Republic of Georgia, Kingdom of Jordan, Uganda, and Gabon; broader collaborations in genomics applications have also been developed with research institutions in many other countries.« less

  3. Predicting discovery rates of genomic features.

    PubMed

    Gravel, Simon

    2014-06-01

    Successful sequencing experiments require judicious sample selection. However, this selection must often be performed on the basis of limited preliminary data. Predicting the statistical properties of the final sample based on preliminary data can be challenging, because numerous uncertain model assumptions may be involved. Here, we ask whether we can predict "omics" variation across many samples by sequencing only a fraction of them. In the infinite-genome limit, we find that a pilot study sequencing 5% of a population is sufficient to predict the number of genetic variants in the entire population within 6% of the correct value, using an estimator agnostic to demography, selection, or population structure. To reach similar accuracy in a finite genome with millions of polymorphisms, the pilot study would require ∼15% of the population. We present computationally efficient jackknife and linear programming methods that exhibit substantially less bias than the state of the art when applied to simulated data and subsampled 1000 Genomes Project data. Extrapolating based on the National Heart, Lung, and Blood Institute Exome Sequencing Project data, we predict that 7.2% of sites in the capture region would be variable in a sample of 50,000 African Americans and 8.8% in a European sample of equal size. Finally, we show how the linear programming method can also predict discovery rates of various genomic features, such as the number of transcription factor binding sites across different cell types.

  4. Genomic libraries: I. Construction and screening of fosmid genomic libraries.

    PubMed

    Quail, Mike A; Matthews, Lucy; Sims, Sarah; Lloyd, Christine; Beasley, Helen; Baxter, Simon W

    2011-01-01

    Large insert genome libraries have been a core resource required to sequence genomes, analyze haplotypes, and aid gene discovery. While next generation sequencing technologies are revolutionizing the field of genomics, traditional genome libraries will still be required for accurate genome assembly. Their utility is also being extended to functional studies for understanding DNA regulatory elements. Here, we present a detailed method for constructing genomic fosmid libraries, testing for common contaminants, gridding the library to nylon membranes, then hybridizing the library membranes with a radiolabeled probe to identify corresponding genomic clones. While this chapter focuses on fosmid libraries, many of these steps can also be applied to bacterial artificial chromosome libraries.

  5. Comparative primate genomics: emerging patterns of genome content and dynamics

    PubMed Central

    Rogers, Jeffrey; Gibbs, Richard A.

    2014-01-01

    Preface Advances in genome sequencing technologies have created new opportunities for comparative primate genomics. Genome assemblies have been published for several primates, with analyses of several others underway. Whole genome assemblies for the great apes provide remarkable new information about the evolutionary origins of the human genome and the processes involved. Genomic data for macaques and other nonhuman primates provide valuable insight into genetic similarities and differences among species used as models for disease-related research. This review summarizes current knowledge regarding primate genome content and dynamics and offers a series of goals for the near future. PMID:24709753

  6. Hymenoptera Genome Database: integrating genome annotations in HymenopteraMine

    PubMed Central

    Elsik, Christine G.; Tayal, Aditi; Diesh, Colin M.; Unni, Deepak R.; Emery, Marianne L.; Nguyen, Hung N.; Hagen, Darren E.

    2016-01-01

    We report an update of the Hymenoptera Genome Database (HGD) (http://HymenopteraGenome.org), a model organism database for insect species of the order Hymenoptera (ants, bees and wasps). HGD maintains genomic data for 9 bee species, 10 ant species and 1 wasp, including the versions of genome and annotation data sets published by the genome sequencing consortiums and those provided by NCBI. A new data-mining warehouse, HymenopteraMine, based on the InterMine data warehousing system, integrates the genome data with data from external sources and facilitates cross-species analyses based on orthology. New genome browsers and annotation tools based on JBrowse/WebApollo provide easy genome navigation, and viewing of high throughput sequence data sets and can be used for collaborative genome annotation. All of the genomes and annotation data sets are combined into a single BLAST server that allows users to select and combine sequence data sets to search. PMID:26578564

  7. Hymenoptera Genome Database: integrating genome annotations in HymenopteraMine.

    PubMed

    Elsik, Christine G; Tayal, Aditi; Diesh, Colin M; Unni, Deepak R; Emery, Marianne L; Nguyen, Hung N; Hagen, Darren E

    2016-01-04

    We report an update of the Hymenoptera Genome Database (HGD) (http://HymenopteraGenome.org), a model organism database for insect species of the order Hymenoptera (ants, bees and wasps). HGD maintains genomic data for 9 bee species, 10 ant species and 1 wasp, including the versions of genome and annotation data sets published by the genome sequencing consortiums and those provided by NCBI. A new data-mining warehouse, HymenopteraMine, based on the InterMine data warehousing system, integrates the genome data with data from external sources and facilitates cross-species analyses based on orthology. New genome browsers and annotation tools based on JBrowse/WebApollo provide easy genome navigation, and viewing of high throughput sequence data sets and can be used for collaborative genome annotation. All of the genomes and annotation data sets are combined into a single BLAST server that allows users to select and combine sequence data sets to search.

  8. Genomic imprinting and reproduction.

    PubMed

    Swales, A K E; Spears, N

    2005-10-01

    Genomic imprinting is the parent-of-origin specific gene expression which is a vital mechanism through both development and adult life. One of the key elements of the imprinting mechanism is DNA methylation, controlled by DNA methyltransferase enzymes. Germ cells undergo reprogramming to ensure that sex-specific genomic imprinting is initiated, thus allowing normal embryo development to progress after fertilisation. In some cases, errors in genomic imprinting are embryo lethal while in others they lead to developmental disorders and disease. Recent studies have suggested a link between the use of assisted reproductive techniques and an increase in normally rare imprinting disorders. A greater understanding of the mechanisms of genomic imprinting and the factors that influence them are important in assessing the safety of these techniques.

  9. Rubicon Genomics, Inc.

    PubMed

    Langmore, John P

    2002-07-01

    Rubicon Genomics, Inc. is a leader in development and application of effective methods to analyze human DNA for genome-wide genotyping and haplotyping. The company is developing its proprietary OmniPlex technology as an integrated platform for archiving, amplifying and analyzing patient DNA for drug target discovery, pharmacogenomics and diagnostics. Single-site, multiple-site or whole genome amplification can be done using small samples of DNA that have been archived as OmniPlex DNA. Rubicon technology will make genome-wide SNP scoring faster, more accurate, more robust and less expensive. Rubicon will partner with pharmaceutical and diagnostic companies, as well as the makers of instruments and reagents to bring OmniPlex technology to the widest market - increasing the pipeline of more effective and safer drugs and ushering in the practice of gene-based medicine.

  10. Mouse genome database 2016

    PubMed Central

    Bult, Carol J.; Eppig, Janan T.; Blake, Judith A.; Kadin, James A.; Richardson, Joel E.

    2016-01-01

    The Mouse Genome Database (MGD; http://www.informatics.jax.org) is the primary community model organism database for the laboratory mouse and serves as the source for key biological reference data related to mouse genes, gene functions, phenotypes and disease models with a strong emphasis on the relationship of these data to human biology and disease. As the cost of genome-scale sequencing continues to decrease and new technologies for genome editing become widely adopted, the laboratory mouse is more important than ever as a model system for understanding the biological significance of human genetic variation and for advancing the basic research needed to support the emergence of genome-guided precision medicine. Recent enhancements to MGD include new graphical summaries of biological annotations for mouse genes, support for mobile access to the database, tools to support the annotation and analysis of sets of genes, and expanded support for comparative biology through the expansion of homology data. PMID:26578600

  11. The rise of genomics.

    PubMed

    Weissenbach, Jean

    2016-01-01

    A brief history of the development of genomics is provided. Complete sequencing of genomes of uni- and multicellular organisms is based on important progress in sequencing and bioinformatics. Evolution of these methods is ongoing and has triggered an explosion in data production and analysis. Initial analyses focused on the inventory of genes encoding proteins. Completeness and quality of gene prediction remains crucial. Genome analyses profoundly modified our views on evolution, biodiversity and contributed to the detection of new functions, yet to be fully elucidated, such as those fulfilled by non-coding RNAs. Genomics has become the basis for the study of biology and provides the molecular support for a bunch of large-scale studies, the omics.

  12. Mouse genome database 2016.

    PubMed

    Bult, Carol J; Eppig, Janan T; Blake, Judith A; Kadin, James A; Richardson, Joel E

    2016-01-04

    The Mouse Genome Database (MGD; http://www.informatics.jax.org) is the primary community model organism database for the laboratory mouse and serves as the source for key biological reference data related to mouse genes, gene functions, phenotypes and disease models with a strong emphasis on the relationship of these data to human biology and disease. As the cost of genome-scale sequencing continues to decrease and new technologies for genome editing become widely adopted, the laboratory mouse is more important than ever as a model system for understanding the biological significance of human genetic variation and for advancing the basic research needed to support the emergence of genome-guided precision medicine. Recent enhancements to MGD include new graphical summaries of biological annotations for mouse genes, support for mobile access to the database, tools to support the annotation and analysis of sets of genes, and expanded support for comparative biology through the expansion of homology data.

  13. Human genomic variation

    PubMed Central

    Disotell, Todd R

    2000-01-01

    The recent completion and assembly of the first draft of the human genome, which combines samples from several ethnically diverse males and females, provides preliminary data on the extent of human genetic variation. PMID:11178257

  14. Genomic definition of species

    SciTech Connect

    Crkvenjakov, R.; Drmanac, R.

    1991-07-01

    The subject of this paper is the definition of species based on the assumption that genome is the fundamental level for the origin and maintenance of biological diversity. For this view to be logically consistent it is necessary to assume the existence and operation of the new law which we call genome law. For this reason the genome law is included in the explanation of species phenomenon presented here even if its precise formulation and elaboration are left for the future. The intellectual underpinnings of this definition can be traced to Goldschmidt. We wish to explore some philosophical aspects of the definition of species in terms of the genome. The point of proposing the definition on these grounds is that any real advance in evolutionary theory has to be correct in both its philosophy and its science.

  15. Lophotrochozoan mitochondrial genomes

    SciTech Connect

    Valles, Yvonne; Boore, Jeffrey L.

    2005-10-01

    Progress in both molecular techniques and phylogeneticmethods has challenged many of the interpretations of traditionaltaxonomy. One example is in the recognition of the animal superphylumLophotrochozoa (annelids, mollusks, echiurans, platyhelminthes,brachiopods, and other phyla), although the relationships within thisgroup and the inclusion of some phyla remain uncertain. While much ofthis progress in phylogenetic reconstruction has been based on comparingsingle gene sequences, we are beginning to see the potential of comparinglarge-scale features of genomes, such as the relative order of genes.Even though tremendous progress is being made on the sequencedetermination of whole nuclear genomes, the dataset of choice forgenome-level characters for many animals across a broad taxonomic rangeremains mitochondrial genomes. We review here what is known aboutmitochondrial genomes of the lophotrochozoans and discuss the promisethat this dataset will enable insight into theirrelationships.

  16. Platyzoan mitochondrial genomes.

    PubMed

    Wey-Fabrizius, Alexandra R; Podsiadlowski, Lars; Herlyn, Holger; Hankeln, Thomas

    2013-11-01

    Platyzoa is a putative lophotrochozoan (spiralian) subtaxon within the protostome clade of Metazoa, comprising a range of biologically diverse, mostly small worm-shaped animals. The monophyly of Platyzoa, the relationships between the putative subgroups Platyhelminthes, Gastrotricha and Gnathifera (the latter comprising at least Gnathostomulida, "Rotifera" and Acanthocephala) as well as some aspects of the internal phylogenies of these subgroups are highly debated. Here we review how complete mitochondrial (mt) genome data contribute to these debates. We highlight special features of the mt genomes and discuss problems in mtDNA phylogenies of the clade. Mitochondrial genome data seem to be insufficient to resolve the position of the platyzoan clade within the Spiralia but can help to address internal phylogenetic questions. The present review includes a tabular survey of all published platyzoan mt genomes.

  17. Biobanks for Genomics and Genomics for Biobanks

    PubMed Central

    Ducournau, Pascal; Gourraud, Pierre-Antoine; Pontille, David

    2003-01-01

    Biobanks include biological samples and attached databases. Human biobanks occur in research, technological development and medical activities. Population genomics is highly dependent on the availability of large biobanks. Ethical issues must be considered: protecting the rights of those people whose samples or data are in biobanks (information, autonomy, confidentiality, protection of private life), assuring the non-commercial use of human body elements and the optimal use of samples and data. They balance other issues, such as protecting the rights of researchers and companies, allowing long-term use of biobanks while detailed information on future uses is not available. At the level of populations, the traditional form of informed consent is challenged. Other dimensions relate to the rights of a group as such, in addition to individual rights. Conditions of return of results and/or benefit to a population need to be defined. With ‘large-scale biobanking’ a marked trend in genomics, new societal dimensions appear, regarding communication, debate, regulation, societal control and valorization of such large biobanks. Exploring how genomics can help health sector biobanks to become more rationally constituted and exploited is an interesting perspective. For example, evaluating how genomic approaches can help in optimizing haematopoietic stem cell donor registries using new markers and high-throughput techniques to increase immunogenetic variability in such registries is a challenge currently being addressed. Ethical issues in such contexts are important, as not only individual decisions or projects are concerned, but also national policies in the international arena and organization of democratic debate about science, medicine and society. PMID:18629026

  18. Using comparative genome analysis to identify problems in annotated microbial genomes.

    PubMed

    Poptsova, Maria S; Gogarten, J Peter

    2010-07-01

    Genome annotation is a tedious task that is mostly done by automated methods; however, the accuracy of these approaches has been questioned since the beginning of the sequencing era. Genome annotation is a multilevel process, and errors can emerge at different stages: during sequencing, as a result of gene-calling procedures, and in the process of assigning gene functions. Missed or wrongly annotated genes differentially impact different types of analyses. Here we discuss and demonstrate how the methods of comparative genome analysis can refine annotations by locating missing orthologues. We also discuss possible reasons for errors and show that the second-generation annotation systems, which combine multiple gene-calling programs with similarity-based methods, perform much better than the first annotation tools. Since old errors may propagate to the newly sequenced genomes, we emphasize that the problem of continuously updating popular public databases is an urgent and unresolved one. Due to the progress in genome-sequencing technologies, automated annotation techniques will remain the main approach in the future. Researchers need to be aware of the existing errors in the annotation of even well-studied genomes, such as Escherichia coli, and consider additional quality control for their results.

  19. Structure of the germline genome of Tetrahymena thermophila and relationship to the massively rearranged somatic genome

    PubMed Central

    Hamilton, Eileen P; Kapusta, Aurélie; Huvos, Piroska E; Bidwell, Shelby L; Zafar, Nikhat; Tang, Haibao; Hadjithomas, Michalis; Krishnakumar, Vivek; Badger, Jonathan H; Caler, Elisabet V; Russ, Carsten; Zeng, Qiandong; Fan, Lin; Levin, Joshua Z; Shea, Terrance; Young, Sarah K; Hegarty, Ryan; Daza, Riza; Gujja, Sharvari; Wortman, Jennifer R; Birren, Bruce W; Nusbaum, Chad; Thomas, Jainy; Carey, Clayton M; Pritham, Ellen J; Feschotte, Cédric; Noto, Tomoko; Mochizuki, Kazufumi; Papazyan, Romeo; Taverna, Sean D; Dear, Paul H; Cassidy-Hanley, Donna M; Xiong, Jie; Miao, Wei; Orias, Eduardo; Coyne, Robert S

    2016-01-01

    The germline genome of the binucleated ciliate Tetrahymena thermophila undergoes programmed chromosome breakage and massive DNA elimination to generate the somatic genome. Here, we present a complete sequence assembly of the germline genome and analyze multiple features of its structure and its relationship to the somatic genome, shedding light on the mechanisms of genome rearrangement as well as the evolutionary history of this remarkable germline/soma differentiation. Our results strengthen the notion that a complex, dynamic, and ongoing interplay between mobile DNA elements and the host genome have shaped Tetrahymena chromosome structure, locally and globally. Non-standard outcomes of rearrangement events, including the generation of short-lived somatic chromosomes and excision of DNA interrupting protein-coding regions, may represent novel forms of developmental gene regulation. We also compare Tetrahymena’s germline/soma differentiation to that of other characterized ciliates, illustrating the wide diversity of adaptations that have occurred within this phylum. DOI: http://dx.doi.org/10.7554/eLife.19090.001 PMID:27892853

  20. An Introduction to Genome Annotation.

    PubMed

    Campbell, Michael S; Yandell, Mark

    2015-12-17

    Genome projects have evolved from large international undertakings to tractable endeavors for a single lab. Accurate genome annotation is critical for successful genomic, genetic, and molecular biology experiments. These annotations can be generated using a number of approaches and available software tools. This unit describes methods for genome annotation and a number of software tools commonly used in gene annotation.

  1. Automated Microfluidics for Genomics

    DTIC Science & Technology

    2007-11-02

    the automation of it, see [4]. In the Genomation Laboratory at the Univ. of Washington (http://rcs.ee.washington.edu/GNL/genomation.html) and with Orca ...reproducible biology without contamination . The high throughput capability is competitive with large scale robotic batch processing. III. INSTRUMENTATION...essentially arbitrary low volume, and without any contact that might cause contamination . A. ACAPELLA-5K Core Processor The ACAPELLA-5K was designed with

  2. Bacteriophage T4 genome.

    PubMed

    Miller, Eric S; Kutter, Elizabeth; Mosig, Gisela; Arisaka, Fumio; Kunisawa, Takashi; Rüger, Wolfgang

    2003-03-01

    Phage T4 has provided countless contributions to the paradigms of genetics and biochemistry. Its complete genome sequence of 168,903 bp encodes about 300 gene products. T4 biology and its genomic sequence provide the best-understood model for modern functional genomics and proteomics. Variations on gene expression, including overlapping genes, internal translation initiation, spliced genes, translational bypassing, and RNA processing, alert us to the caveats of purely computational methods. The T4 transcriptional pattern reflects its dependence on the host RNA polymerase and the use of phage-encoded proteins that sequentially modify RNA polymerase; transcriptional activator proteins, a phage sigma factor, anti-sigma, and sigma decoy proteins also act to specify early, middle, and late promoter recognition. Posttranscriptional controls by T4 provide excellent systems for the study of RNA-dependent processes, particularly at the structural level. The redundancy of DNA replication and recombination systems of T4 reveals how phage and other genomes are stably replicated and repaired in different environments, providing insight into genome evolution and adaptations to new hosts and growth environments. Moreover, genomic sequence analysis has provided new insights into tail fiber variation, lysis, gene duplications, and membrane localization of proteins, while high-resolution structural determination of the "cell-puncturing device," combined with the three-dimensional image reconstruction of the baseplate, has revealed the mechanism of penetration during infection. Despite these advances, nearly 130 potential T4 genes remain uncharacterized. Current phage-sequencing initiatives are now revealing the similarities and differences among members of the T4 family, including those that infect bacteria other than Escherichia coli. T4 functional genomics will aid in the interpretation of these newly sequenced T4-related genomes and in broadening our understanding of the complex

  3. National Plant Genome Initiative

    DTIC Science & Technology

    2005-01-01

    Genomics” was held to bring together researchers working on legumes such as Medicago, alfalfa, soybean, bean, lotus, cowpea , and chickpea to discuss... Cowpea and Pigeonpea for India and Africa Chickpea, cowpea , and pigeonpea are staple crops in India and Africa yet lack a critical mass of genomic tools...Team in the fi eld; The NSF Potato Genome Project Page 14 - Cowpea and Chickpea images; Dr. Jane Silverthorne, NSF Page 15 - CCGI Logo; Jennifer Foltz

  4. Accuracy of genomic selection in European maize elite breeding populations.

    PubMed

    Zhao, Yusheng; Gowda, Manje; Liu, Wenxin; Würschum, Tobias; Maurer, Hans P; Longin, Friedrich H; Ranc, Nicolas; Reif, Jochen C

    2012-03-01

    Genomic selection is a promising breeding strategy for rapid improvement of complex traits. The objective of our study was to investigate the prediction accuracy of genomic breeding values through cross validation. The study was based on experimental data of six segregating populations from a half-diallel mating design with 788 testcross progenies from an elite maize breeding program. The plants were intensively phenotyped in multi-location field trials and fingerprinted with 960 SNP markers. We used random regression best linear unbiased prediction in combination with fivefold cross validation. The prediction accuracy across populations was higher for grain moisture (0.90) than for grain yield (0.58). The accuracy of genomic selection realized for grain yield corresponds to the precision of phenotyping at unreplicated field trials in 3-4 locations. As for maize up to three generations are feasible per year, selection gain per unit time is high and, consequently, genomic selection holds great promise for maize breeding programs.

  5. Multilevel Research and the Challenges of Implementing Genomic Medicine

    PubMed Central

    Coates, Ralph J.; Fennell, Mary L.; Glasgow, Russell E.; Scheuner, Maren T.; Schully, Sheri D.; Williams, Marc S.; Clauser, Steven B.

    2012-01-01

    Advances in genomics and related fields promise a new era of personalized medicine in the cancer care continuum. Nevertheless, there are fundamental challenges in integrating genomic medicine into cancer practice. We explore how multilevel research can contribute to implementation of genomic medicine. We first review the rapidly developing scientific discoveries in this field and the paucity of current applications that are ready for implementation in clinical and public health programs. We then define a multidisciplinary translational research agenda for successful integration of genomic medicine into policy and practice and consider challenges for successful implementation. We illustrate the agenda using the example of Lynch syndrome testing in newly diagnosed cases of colorectal cancer and cascade testing in relatives. We synthesize existing information in a framework for future multilevel research for integrating genomic medicine into the cancer care continuum. PMID:22623603

  6. DNABIT Compress - Genome compression algorithm.

    PubMed

    Rajarajeswari, Pothuraju; Apparao, Allam

    2011-01-22

    Data compression is concerned with how information is organized in data. Efficient storage means removal of redundancy from the data being stored in the DNA molecule. Data compression algorithms remove redundancy and are used to understand biologically important molecules. We present a compression algorithm, "DNABIT Compress" for DNA sequences based on a novel algorithm of assigning binary bits for smaller segments of DNA bases to compress both repetitive and non repetitive DNA sequence. Our proposed algorithm achieves the best compression ratio for DNA sequences for larger genome. Significantly better compression results show that "DNABIT Compress" algorithm is the best among the remaining compression algorithms. While achieving the best compression ratios for DNA sequences (Genomes),our new DNABIT Compress algorithm significantly improves the running time of all previous DNA compression programs. Assigning binary bits (Unique BIT CODE) for (Exact Repeats, Reverse Repeats) fragments of DNA sequence is also a unique concept introduced in this algorithm for the first time in DNA compression. This proposed new algorithm could achieve the best compression ratio as much as 1.58 bits/bases where the existing best methods could not achieve a ratio less than 1.72 bits/bases.

  7. Ebolavirus comparative genomics

    DOE PAGES

    Jun, Se-Ran; Leuze, Michael R.; Nookaew, Intawat; ...

    2015-07-14

    The 2014 Ebola outbreak in West Africa is the largest documented for this virus. We examine the dynamics of this genome, comparing more than one hundred currently available ebolavirus genomes to each other and to other viral genomes. Based on oligomer frequency analysis, the family Filoviridae forms a distinct group from all other sequenced viral genomes. All filovirus genomes sequenced to date encode proteins with similar functions and gene order, although there is considerable divergence in sequences between the three genera Ebolavirus, Cuevavirus, and Marburgvirus within the family Filoviridae. Whereas all ebolavirus genomes are quite similar (multiple sequences of themore » same strain are often identical), variation is most common in the intergenic regions and within specific areas of the genes encoding the glycoprotein (GP), nucleoprotein (NP), and polymerase (L). We predict regions that could contain epitope-binding sites, which might be good vaccine targets. In conclusion, this information, combined with glycosylation sites and experimentally determined epitopes, can identify the most promising regions for the development of therapeutic strategies.« less

  8. Genomic Instability in Cancer

    PubMed Central

    Abbas, Tarek; Keaton, Mignon A.; Dutta, Anindya

    2013-01-01

    One of the fundamental challenges facing the cell is to accurately copy its genetic material to daughter cells. When this process goes awry, genomic instability ensues in which genetic alterations ranging from nucleotide changes to chromosomal translocations and aneuploidy occur. Organisms have developed multiple mechanisms that can be classified into two major classes to ensure the fidelity of DNA replication. The first class includes mechanisms that prevent premature initiation of DNA replication and ensure that the genome is fully replicated once and only once during each division cycle. These include cyclin-dependent kinase (CDK)-dependent mechanisms and CDK-independent mechanisms. Although CDK-dependent mechanisms are largely conserved in eukaryotes, higher eukaryotes have evolved additional mechanisms that seem to play a larger role in preventing aberrant DNA replication and genome instability. The second class ensures that cells are able to respond to various cues that continuously threaten the integrity of the genome by initiating DNA-damage-dependent “checkpoints” and coordinating DNA damage repair mechanisms. Defects in the ability to safeguard against aberrant DNA replication and to respond to DNA damage contribute to genomic instability and the development of human malignancy. In this article, we summarize our current knowledge of how genomic instability arises, with a particular emphasis on how the DNA replication process can give rise to such instability. PMID:23335075

  9. Human Genome Annotation

    NASA Astrophysics Data System (ADS)

    Gerstein, Mark

    A central problem for 21st century science is annotating the human genome and making this annotation useful for the interpretation of personal genomes. My talk will focus on annotating the 99% of the genome that does not code for canonical genes, concentrating on intergenic features such as structural variants (SVs), pseudogenes (protein fossils), binding sites, and novel transcribed RNAs (ncRNAs). In particular, I will describe how we identify regulatory sites and variable blocks (SVs) based on processing next-generation sequencing experiments. I will further explain how we cluster together groups of sites to create larger annotations. Next, I will discuss a comprehensive pseudogene identification pipeline, which has enabled us to identify >10K pseudogenes in the genome and analyze their distribution with respect to age, protein family, and chromosomal location. Throughout, I will try to introduce some of the computational algorithms and approaches that are required for genome annotation. Much of this work has been carried out in the framework of the ENCODE, modENCODE, and 1000 genomes projects.

  10. An archaeal genomic signature

    NASA Technical Reports Server (NTRS)

    Graham, D. E.; Overbeek, R.; Olsen, G. J.; Woese, C. R.

    2000-01-01

    Comparisons of complete genome sequences allow the most objective and comprehensive descriptions possible of a lineage's evolution. This communication uses the completed genomes from four major euryarchaeal taxa to define a genomic signature for the Euryarchaeota and, by extension, the Archaea as a whole. The signature is defined in terms of the set of protein-encoding genes found in at least two diverse members of the euryarchaeal taxa that function uniquely within the Archaea; most signature proteins have no recognizable bacterial or eukaryal homologs. By this definition, 351 clusters of signature proteins have been identified. Functions of most proteins in this signature set are currently unknown. At least 70% of the clusters that contain proteins from all the euryarchaeal genomes also have crenarchaeal homologs. This conservative set, which appears refractory to horizontal gene transfer to the Bacteria or the Eukarya, would seem to reflect the significant innovations that were unique and fundamental to the archaeal "design fabric." Genomic protein signature analysis methods may be extended to characterize the evolution of any phylogenetically defined lineage. The complete set of protein clusters for the archaeal genomic signature is presented as supplementary material (see the PNAS web site, www.pnas.org).

  11. Human Social Genomics

    PubMed Central

    Cole, Steven W.

    2014-01-01

    A growing literature in human social genomics has begun to analyze how everyday life circumstances influence human gene expression. Social-environmental conditions such as urbanity, low socioeconomic status, social isolation, social threat, and low or unstable social status have been found to associate with differential expression of hundreds of gene transcripts in leukocytes and diseased tissues such as metastatic cancers. In leukocytes, diverse types of social adversity evoke a common conserved transcriptional response to adversity (CTRA) characterized by increased expression of proinflammatory genes and decreased expression of genes involved in innate antiviral responses and antibody synthesis. Mechanistic analyses have mapped the neural “social signal transduction” pathways that stimulate CTRA gene expression in response to social threat and may contribute to social gradients in health. Research has also begun to analyze the functional genomics of optimal health and thriving. Two emerging opportunities now stand to revolutionize our understanding of the everyday life of the human genome: network genomics analyses examining how systems-level capabilities emerge from groups of individual socially sensitive genomes and near-real-time transcriptional biofeedback to empirically optimize individual well-being in the context of the unique genetic, geographic, historical, developmental, and social contexts that jointly shape the transcriptional realization of our innate human genomic potential for thriving. PMID:25166010

  12. How the genome folds

    NASA Astrophysics Data System (ADS)

    Lieberman Aiden, Erez

    2012-02-01

    I describe Hi-C, a novel technology for probing the three-dimensional architecture of whole genomes by coupling proximity-based ligation with massively parallel sequencing. Working with collaborators at the Broad Institute and UMass Medical School, we used Hi-C to construct spatial proximity maps of the human genome at a resolution of 1Mb. These maps confirm the presence of chromosome territories and the spatial proximity of small, gene-rich chromosomes. We identified an additional level of genome organization that is characterized by the spatial segregation of open and closed chromatin to form two genome-wide compartments. At the megabase scale, the chromatin conformation is consistent with a fractal globule, a knot-free conformation that enables maximally dense packing while preserving the ability to easily fold and unfold any genomic locus. The fractal globule is distinct from the more commonly used globular equilibrium model. Our results demonstrate the power of Hi-C to map the dynamic conformations of whole genomes.

  13. A comparison of virus genome sequences with their host silkworm, Bombyx mori.

    PubMed

    Tang, Xu-Dong; Yue, Ya-Jie; Wang, Wei; Li, Nan; Shen, Zhong-Yuan

    2016-01-15

    With the recent availability of the genomes of many viruses and the silkworm, Bombyx mori, as well as a variety of Basic Local Alignment Search Tool (BLAST) programs, a new opportunity to gain insight into the interaction of viruses with the silkworm is possible. This study aims to determine the possible existence of sequence identities between the genomes of viruses and the silkworm and attempts to explain this phenomenon. BLAST searches of the genomes of viruses against the silkworm genome were performed using the resources of the National Center for Biotechnology Information. All studied viruses contained variable numbers of short regions with sequence identity to the genome of the silkworm. The short regions of sequence identity in the genome of the silkworm may be derived from the genomes of viruses in the long history of silkworm-virus interaction. This study is the first to compare these genomes, and may contribute to research on the interaction between viruses and the silkworm.

  14. Genomics Education for the Public: Perspectives of Genomic Researchers and ELSI Advisors

    PubMed Central

    Jones, Sondra Smolek; Markey, Janell M.; Byerly, Katherine W.; Roberts, Megan C.

    2014-01-01

    Aims: For more than two decades genomic education of the public has been a significant challenge. As genomic information becomes integrated into daily life and routine clinical care, the need for public education is even more critical. We conducted a pilot study to learn how genomic researchers and ethical, legal, and social implications advisors who were affiliated with large-scale genomic variation studies have approached the issue of educating the public about genomics. Methods/Results: Semi-structured telephone interviews were conducted with researchers and advisors associated with the SNP/HAPMAP studies and the Cancer Genome Atlas Study. Respondents described approach(es) associated with educating the public about their study. Interviews were audio-recorded, transcribed, coded, and analyzed by team review. Although few respondents described formal educational efforts, most provided recommendations for what should/could be done, emphasizing the need for an overarching entity(s) to take responsibility to lead the effort to educate the public. Opposing views were described related to: who this should be; the overall goal of the educational effort; and the educational approach. Four thematic areas emerged: What is the rationale for educating the public about genomics?; Who is the audience?; Who should be responsible for this effort?; and What should the content be? Policy issues associated with these themes included the need to agree on philosophical framework(s) to guide the rationale, content, and target audiences for education programs; coordinate previous/ongoing educational efforts; and develop a centralized knowledge base. Suggestions for next steps are presented. Conclusion: A complex interplay of philosophical, professional, and cultural issues can create impediments to genomic education of the public. Many challenges, however, can be addressed by agreement on a guiding philosophical framework(s) and identification of a responsible entity(s) to provide

  15. Database of Periodic DNA Regions in Major Genomes.

    PubMed

    Frenkel, Felix E; Korotkova, Maria A; Korotkov, Eugene V

    2017-01-01

    Summary. We analyzed several prokaryotic and eukaryotic genomes looking for the periodicity sequences availability and employing a new mathematical method. The method envisaged using the random position weight matrices and dynamic programming. Insertions and deletions were allowed inside periodicities, thus adding a novelty to the results we obtained. A periodicity length, one of the key periodicity features, varied from 2 to 50 nt. Totally over 60,000 periodicity sequences were found in 15 genomes including some chromosomes of the H. sapiens (partial), C. elegans, D. melanogaster, and A. thaliana genomes.

  16. Database of Periodic DNA Regions in Major Genomes

    PubMed Central

    2017-01-01

    Summary. We analyzed several prokaryotic and eukaryotic genomes looking for the periodicity sequences availability and employing a new mathematical method. The method envisaged using the random position weight matrices and dynamic programming. Insertions and deletions were allowed inside periodicities, thus adding a novelty to the results we obtained. A periodicity length, one of the key periodicity features, varied from 2 to 50 nt. Totally over 60,000 periodicity sequences were found in 15 genomes including some chromosomes of the H. sapiens (partial), C. elegans, D. melanogaster, and A. thaliana genomes. PMID:28182099

  17. Complete genome sequence of Staphylothermus hellenicus P8T

    SciTech Connect

    Anderson, Iain; Wirth, Reinhard; Lucas, Susan; Copeland, A; Lapidus, Alla L.; Cheng, Jan-Fang; Goodwin, Lynne A.; Pitluck, Sam; Davenport, Karen W.; Detter, J. Chris; Han, Cliff; Tapia, Roxanne; Land, Miriam L; Hauser, Loren John; Pati, Amrita; Mikhailova, Natalia; Woyke, Tanja; Klenk, Hans-Peter; Kyrpides, Nikos C; Ivanova, N

    2011-01-01

    Staphylothermus hellenicus belongs to the order Desulfurococcales within the archaeal phy- lum Crenarchaeota. Strain P8T is the type strain of the species and was isolated from a shal- low hydrothermal vent system at Palaeochori Bay, Milos, Greece. It is a hyperthermophilic, anaerobic heterotroph. Here we describe the features of this organism together with the com- plete genome sequence and annotation. The 1,580,347 bp genome with its 1,668 protein- coding and 48 RNA genes was sequenced as part of a DOE Joint Genome Institute (JGI) La- boratory Sequencing Program (LSP) project.

  18. Complete genome sequence of Serratia plymuthica strain AS12

    PubMed Central

    Finlay, Roger D.; Alström, Sadhna; Goodwin, Lynne; Kyrpides, Nikos C.; Lucas, Susan; Lapidus, Alla; Bruce, David; Pitluck, Sam; Peters, Lin; Ovchinnikova, Galina; Chertkov, Olga; Han, James; Han, Cliff; Tapia, Roxanne; Detter, John C.; Land, Miriam; Hauser, Loren; Cheng, Jan-Fang; Ivanova, Natalia; Pagani, Ioanna; Klenk, Hans-Peter; Woyke, Tanja; Högberg, Nils

    2012-01-01

    A plant-associated member of the family Enterobacteriaceae, Serratia plymuthica strain AS12 was isolated from rapeseed roots. It is of scientific interest because it promotes plant growth and inhibits plant pathogens. The genome of S. plymuthica AS12 comprises a 5,443,009 bp long circular chromosome, which consists of 4,952 protein-coding genes, 87 tRNA genes and 7 rRNA operons. This genome was sequenced within the 2010 DOE-JGI Community Sequencing Program (CSP2010) as part of the project entitled “Genomics of four rapeseed plant growth promoting bacteria with antagonistic effect on plant pathogens”. PMID:22768360

  19. WheatGenome.info: A Resource for Wheat Genomics Resource.

    PubMed

    Lai, Kaitao

    2016-01-01

    An integrated database with a variety of Web-based systems named WheatGenome.info hosting wheat genome and genomic data has been developed to support wheat research and crop improvement. The resource includes multiple Web-based applications, which are implemented as a variety of Web-based systems. These include a GBrowse2-based wheat genome viewer with BLAST search portal, TAGdb for searching wheat second generation genome sequence data, wheat autoSNPdb, links to wheat genetic maps using CMap and CMap3D, and a wheat genome Wiki to allow interaction between diverse wheat genome sequencing activities. This portal provides links to a variety of wheat genome resources hosted at other research organizations. This integrated database aims to accelerate wheat genome research and is freely accessible via the web interface at http://www.wheatgenome.info/ .

  20. Genome Project Standards in a New Era of Sequencing

    SciTech Connect

    GSC Consortia; HMP Jumpstart Consortia; Chain, P. S. G.; Grafham, D. V.; Fulton, R. S.; FitzGerald, M. G.; Hostetler, J.; Muzny, D.; Detter, J. C.; Ali, J.; Birren, B.; Bruce, D. C.; Buhay, C.; Cole, J. R.; Ding, Y.; Dugan, S.; Field, D.; Garrity, G. M.; Gibbs, R.; Graves, T.; Han, C. S.; Harrison, S. H.; Highlander, S.; Hugenholtz, P.; Khouri, H. M.; Kodira, C. D.; Kolker, E.; Kyrpides, N. C.; Lang, D.; Lapidus, A.; Malfatti, S. A.; Markowitz, V.; Metha, T.; Nelson, K. E.; Parkhill, J.; Pitluck, S.; Qin, X.; Read, T. D.; Schmutz, J.; Sozhamannan, S.; Strausberg, R.; Sutton, G.; Thomson, N. R.; Tiedje, J. M.; Weinstock, G.; Wollam, A.

    2009-06-01

    For over a decade, genome 43 sequences have adhered to only two standards that are relied on for purposes of sequence analysis by interested third parties (1, 2). However, ongoing developments in revolutionary sequencing technologies have resulted in a redefinition of traditional whole genome sequencing that requires a careful reevaluation of such standards. With commercially available 454 pyrosequencing (followed by Illumina, SOLiD, and now Helicos), there has been an explosion of genomes sequenced under the moniker 'draft', however these can be very poor quality genomes (due to inherent errors in the sequencing technologies, and the inability of assembly programs to fully address these errors). Further, one can only infer that such draft genomes may be of poor quality by navigating through the databases to find the number and type of reads deposited in sequence trace repositories (and not all genomes have this available), or to identify the number of contigs or genome fragments deposited to the database. The difficulty in assessing the quality of such deposited genomes has created some havoc for genome analysis pipelines and contributed to many wasted hours of (mis)interpretation. These same novel sequencing technologies have also brought an exponential leap in raw sequencing capability, and at greatly reduced prices that have further skewed the time- and cost-ratios of draft data generation versus the painstaking process of improving and finishing a genome. The resulting effect is an ever-widening gap between drafted and finished genomes that only promises to continue (Figure 1), hence there is an urgent need to distinguish good and poor datasets. The sequencing institutes in the authorship, along with the NIH's Human Microbiome Project Jumpstart Consortium (3), strongly believe that a new set of standards is required for genome sequences. The following represents a set of six community-defined categories of genome sequence standards that better reflect the

  1. Translational genomics for plant breeding with the genome sequence explosion.

    PubMed

    Kang, Yang Jae; Lee, Taeyoung; Lee, Jayern; Shim, Sangrea; Jeong, Haneul; Satyawan, Dani; Kim, Moon Young; Lee, Suk-Ha

    2016-04-01

    The use of next-generation sequencers and advanced genotyping technologies has propelled the field of plant genomics in model crops and plants and enhanced the discovery of hidden bridges between genotypes and phenotypes. The newly generated reference sequences of unstudied minor plants can be annotated by the knowledge of model plants via translational genomics approaches. Here, we reviewed the strategies of translational genomics and suggested perspectives on the current databases of genomic resources and the database structures of translated information on the new genome. As a draft picture of phenotypic annotation, translational genomics on newly sequenced plants will provide valuable assistance for breeders and researchers who are interested in genetic studies.

  2. Genomes to Proteomes

    SciTech Connect

    Panisko, Ellen A.; Grigoriev, Igor; Daly, Don S.; Webb-Robertson, Bobbie-Jo; Baker, Scott E.

    2009-03-01

    Biologists are awash with genomic sequence data. In large part, this is due to the rapid acceleration in the generation of DNA sequence that occurred as public and private research institutes raced to sequence the human genome. In parallel with the large human genome effort, mostly smaller genomes of other important model organisms were sequenced. Projects following on these initial efforts have made use of technological advances and the DNA sequencing infrastructure that was built for the human and other organism genome projects. As a result, the genome sequences of many organisms are available in high quality draft form. While in many ways this is good news, there are limitations to the biological insights that can be gleaned from DNA sequences alone; genome sequences offer only a bird's eye view of the biological processes endemic to an organism or community. Fortunately, the genome sequences now being produced at such a high rate can serve as the foundation for other global experimental platforms such as proteomics. Proteomic methods offer a snapshot of the proteins present at a point in time for a given biological sample. Current global proteomics methods combine enzymatic digestion, separations, mass spectrometry and database searching for peptide identification. One key aspect of proteomics is the prediction of peptide sequences from mass spectrometry data. Global proteomic analysis uses computational matching of experimental mass spectra with predicted spectra based on databases of gene models that are often generated computationally. Thus, the quality of gene models predicted from a genome sequence is crucial in the generation of high quality peptide identifications. Once peptides are identified they can be assigned to their parent protein. Proteins identified as expressed in a given experiment are most useful when compared to other expressed proteins in a larger biological context or biochemical pathway. In this chapter we will discuss the automatic

  3. Recent updates and developments to plant genome size databases

    PubMed Central

    Garcia, Sònia; Leitch, Ilia J.; Anadon-Rosell, Alba; Canela, Miguel Á.; Gálvez, Francisco; Garnatje, Teresa; Gras, Airy; Hidalgo, Oriane; Johnston, Emmeline; Mas de Xaxars, Gemma; Pellicer, Jaume; Siljak-Yakovlev, Sonja; Vallès, Joan; Vitales, Daniel; Bennett, Michael D.

    2014-01-01

    Two plant genome size databases have been recently updated and/or extended: the Plant DNA C-values database (http://data.kew.org/cvalues), and GSAD, the Genome Size in Asteraceae database (http://www.asteraceaegenomesize.com). While the first provides information on nuclear DNA contents across land plants and some algal groups, the second is focused on one of the largest and most economically important angiosperm families, Asteraceae. Genome size data have numerous applications: they can be used in comparative studies on genome evolution, or as a tool to appraise the cost of whole-genome sequencing programs. The growing interest in genome size and increasing rate of data accumulation has necessitated the continued update of these databases. Currently, the Plant DNA C-values database (Release 6.0, Dec. 2012) contains data for 8510 species, while GSAD has 1219 species (Release 2.0, June 2013), representing increases of 17 and 51%, respectively, in the number of species with genome size data, compared with previous releases. Here we provide overviews of the most recent releases of each database, and outline new features of GSAD. The latter include (i) a tool to visually compare genome size data between species, (ii) the option to export data and (iii) a webpage containing information about flow cytometry protocols. PMID:24288377

  4. Exploration of plant genomes in the FLAGdb++ environment

    PubMed Central

    2011-01-01

    Background In the contexts of genomics, post-genomics and systems biology approaches, data integration presents a major concern. Databases provide crucial solutions: they store, organize and allow information to be queried, they enhance the visibility of newly produced data by comparing them with previously published results, and facilitate the exploration and development of both existing hypotheses and new ideas. Results The FLAGdb++ information system was developed with the aim of using whole plant genomes as physical references in order to gather and merge available genomic data from in silico or experimental approaches. Available through a JAVA application, original interfaces and tools assist the functional study of plant genes by considering them in their specific context: chromosome, gene family, orthology group, co-expression cluster and functional network. FLAGdb++ is mainly dedicated to the exploration of large gene groups in order to decipher functional connections, to highlight shared or specific structural or functional features, and to facilitate translational tasks between plant species (Arabidopsis thaliana, Oryza sativa, Populus trichocarpa and Vitis vinifera). Conclusion Combining original data with the output of experts and graphical displays that differ from classical plant genome browsers, FLAGdb++ presents a powerful complementary tool for exploring plant genomes and exploiting structural and functional resources, without the need for computer programming knowledge. First launched in 2002, a 15th version of FLAGdb++ is now available and comprises four model plant genomes and over eight million genomic features. PMID:21447150

  5. Recent updates and developments to plant genome size databases.

    PubMed

    Garcia, Sònia; Leitch, Ilia J; Anadon-Rosell, Alba; Canela, Miguel Á; Gálvez, Francisco; Garnatje, Teresa; Gras, Airy; Hidalgo, Oriane; Johnston, Emmeline; Mas de Xaxars, Gemma; Pellicer, Jaume; Siljak-Yakovlev, Sonja; Vallès, Joan; Vitales, Daniel; Bennett, Michael D

    2014-01-01

    Two plant genome size databases have been recently updated and/or extended: the Plant DNA C-values database (http://data.kew.org/cvalues), and GSAD, the Genome Size in Asteraceae database (http://www.asteraceaegenomesize.com). While the first provides information on nuclear DNA contents across land plants and some algal groups, the second is focused on one of the largest and most economically important angiosperm families, Asteraceae. Genome size data have numerous applications: they can be used in comparative studies on genome evolution, or as a tool to appraise the cost of whole-genome sequencing programs. The growing interest in genome size and increasing rate of data accumulation has necessitated the continued update of these databases. Currently, the Plant DNA C-values database (Release 6.0, Dec. 2012) contains data for 8510 species, while GSAD has 1219 species (Release 2.0, June 2013), representing increases of 17 and 51%, respectively, in the number of species with genome size data, compared with previous releases. Here we provide overviews of the most recent releases of each database, and outline new features of GSAD. The latter include (i) a tool to visually compare genome size data between species, (ii) the option to export data and (iii) a webpage containing information about flow cytometry protocols.

  6. Evaluation of genome-enabled selection for bacterial cold water disease resistance using progeny performance data in Rainbow Trout: Insights on genotyping methods and genomic prediction models

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Bacterial cold water disease (BCWD) causes significant economic losses in salmonid aquaculture, and traditional family-based breeding programs aimed at improving BCWD resistance have been limited to exploiting only between-family variation. We used genomic selection (GS) models to predict genomic br...

  7. Translational Genomics in Low- and Middle-Income Countries: Opportunities and Challenges.

    PubMed

    Tekola-Ayele, Fasil; Rotimi, Charles N

    2015-01-01

    Translation of genomic discoveries into patient care is slowly becoming a reality in developed economies around the world. In contrast, low- and middle-income countries (LMIC) have participated minimally in genomic research for several reasons including the lack of coherent national policies, the limited number of well-trained genomic scientists, poor research infrastructure, and local economic and cultural challenges. Recent initiatives such as the Human Heredity and Health in Africa (H3Africa), the Qatar Genome Project, and the Mexico National Institute of Genomic Medicine (INMEGEN) that aim to address these problems through capacity building and empowerment of local researchers have sparked a paradigm shift. In this short communication, we describe experiences of small-scale medical genetics and translational genomic research programs in LMIC. The lessons drawn from these programs drive home the importance of addressing resource, policy, and sociocultural dynamics to realize the promise of precision medicine driven by genomic science globally. By echoing lessons from a bench-to-community translational genomic research, we advocate that large-scale genomic research projects can be successfully linked with health care programs. To harness the benefits of genomics-led health care, LMIC governments should begin to develop national genomics policies that will address human and technology capacity development within the context of their national economic and sociocultural uniqueness. These policies should encourage international collaboration and promote the link between the public health program and genomics researchers. Finally, we highlight the potential catalytic roles of the global community to foster translational genomics in LMIC.

  8. Mapping DNA-protein interactions in large genomes by sequence tag analysis of genomic enrichment.

    PubMed

    Kim, Jonghwan; Bhinge, Akshay A; Morgan, Xochitl C; Iyer, Vishwanath R

    2005-01-01

    Identifying the chromosomal targets of transcription factors is important for reconstructing the transcriptional regulatory networks underlying global gene expression programs. We have developed an unbiased genomic method called sequence tag analysis of genomic enrichment (STAGE) to identify the direct binding targets of transcription factors in vivo. STAGE is based on high-throughput sequencing of concatemerized tags derived from target DNA enriched by chromatin immunoprecipitation. We first used STAGE in yeast to confirm that RNA polymerase III genes are the most prominent targets of the TATA-box binding protein. We optimized the STAGE protocol and developed analysis methods to allow the identification of transcription factor targets in human cells. We used STAGE to identify several previously unknown binding targets of human transcription factor E2F4 that we independently validated by promoter-specific PCR and microarray hybridization. STAGE provides a means of identifying the chromosomal targets of DNA-associated proteins in any sequenced genome.

  9. Genomics for Weed Science

    PubMed Central

    Horvath, David

    2010-01-01

    Numerous genomic-based studies have provided insight to the physiological and evolutionary processes involved in developmental and environmental processes of model plants such as arabidopsis and rice. However, far fewer efforts have been attempted to use genomic resources to study physiological and evolutionary processes of weedy plants. Genomics-based tools such as extensive EST databases and microarrays have been developed for a limited number of weedy species, although application of information and resources developed for model plants and crops are possible and have been exploited. These tools have just begun to provide insights into the response of these weeds to herbivore and pathogen attack, survival of extreme environmental conditions, and interaction with crops. The potential of these tools to illuminate mechanisms controlling the traits that allow weeds to invade novel habitats, survive extreme environments, and that make weeds difficult to eradicate have potential for both improving crops and developing novel methods to control weeds. PMID:20808523

  10. Genes, genome and Gestalt.

    PubMed

    Grisolia, Cesar Koppe

    2005-03-31

    According to Gestalt thinking, biological systems cannot be viewed as the sum of their elements, but as processes of the whole. To understand organisms we must start from the whole, observing how the various parts are related. In genetics, we must observe the genome over and above the sum of its genes. Either loss or addition of one gene in a genome can change the function of the organism. Genomes are organized in networks of genes, which need to be well integrated. In the case of genetically modified organisms (GMOs), for example, soybeans, rats, Anopheles mosquitoes, and pigs, the insertion of an exogenous gene into a receptive organism generally causes disturbance in the networks, resulting in the breakdown of gene interactions. In these cases, genetic modification increased the genetic load of the GMO and consequently decreased its adaptability (fitness). Therefore, it is hard to claim that the production of such organisms with an increased genetic load does not have ethical implications.

  11. Genomics of Preterm Birth

    PubMed Central

    Swaggart, Kayleigh A.; Pavlicev, Mihaela; Muglia, Louis J.

    2015-01-01

    The molecular mechanisms controlling human birth timing at term, or resulting in preterm birth, have been the focus of considerable investigation, but limited insights have been gained over the past 50 years. In part, these processes have remained elusive because of divergence in reproductive strategies and physiology shown by model organisms, making extrapolation to humans uncertain. Here, we summarize the evolution of progesterone signaling and variation in pregnancy maintenance and termination. We use this comparative physiology to support the hypothesis that selective pressure on genomic loci involved in the timing of parturition have shaped human birth timing, and that these loci can be identified with comparative genomic strategies. Previous limitations imposed by divergence of mechanisms provide an important new opportunity to elucidate fundamental pathways of parturition control through increasing availability of sequenced genomes and associated reproductive physiology characteristics across diverse organisms. PMID:25646385

  12. Genomics of preterm birth.

    PubMed

    Swaggart, Kayleigh A; Pavlicev, Mihaela; Muglia, Louis J

    2015-02-02

    The molecular mechanisms controlling human birth timing at term, or resulting in preterm birth, have been the focus of considerable investigation, but limited insights have been gained over the past 50 years. In part, these processes have remained elusive because of divergence in reproductive strategies and physiology shown by model organisms, making extrapolation to humans uncertain. Here, we summarize the evolution of progesterone signaling and variation in pregnancy maintenance and termination. We use this comparative physiology to support the hypothesis that selective pressure on genomic loci involved in the timing of parturition have shaped human birth timing, and that these loci can be identified with comparative genomic strategies. Previous limitations imposed by divergence of mechanisms provide an important new opportunity to elucidate fundamental pathways of parturition control through increasing availability of sequenced genomes and associated reproductive physiology characteristics across diverse organisms.

  13. Genomics for weed science.

    PubMed

    Horvath, David

    2010-03-01

    Numerous genomic-based studies have provided insight to the physiological and evolutionary processes involved in developmental and environmental processes of model plants such as arabidopsis and rice. However, far fewer efforts have been attempted to use genomic resources to study physiological and evolutionary processes of weedy plants. Genomics-based tools such as extensive EST databases and microarrays have been developed for a limited number of weedy species, although application of information and resources developed for model plants and crops are possible and have been exploited. These tools have just begun to provide insights into the response of these weeds to herbivore and pathogen attack, survival of extreme environmental conditions, and interaction with crops. The potential of these tools to illuminate mechanisms controlling the traits that allow weeds to invade novel habitats, survive extreme environments, and that make weeds difficult to eradicate have potential for both improving crops and developing novel methods to control weeds.

  14. Genomics of Salmonella Species

    NASA Astrophysics Data System (ADS)

    Canals, Rocio; McClelland, Michael; Santiviago, Carlos A.; Andrews-Polymenis, Helene

    Progress in the study of Salmonella survival, colonization, and virulence has increased rapidly with the advent of complete genome sequencing and higher capacity assays for transcriptomic and proteomic analysis. Although many of these techniques have yet to be used to directly assay Salmonella growth on foods, these assays are currently in use to determine Salmonella factors necessary for growth in animal models including livestock animals and in in vitro conditions that mimic many different environments. As sequencing of the Salmonella genome and microarray analysis have revolutionized genomics and transcriptomics of salmonellae over the last decade, so are new high-throughput sequencing technologies currently accelerating the pace of our studies and allowing us to approach complex problems that were not previously experimentally tractable.

  15. Genomics and drug discovery.

    PubMed

    Haseltine, W A

    2001-09-01

    Genomics, the systematic study of all the genes of an organism, offers a new and much-needed source of systematic productivity for the pharmaceutical industry. The isolation of the majority of human genes in their most useful form is leading to the creation of new drugs based on human proteins, antibodies, peptides, and genes. Human Genome Sciences, Inc, was the first company to use the systematic, genomics approach to discovering drugs, and we have placed 4 of these in clinical trials. Two are described: repifermin (keratinocyte growth factor-2, KGF-2) for wound healing and treatment of mucositis caused by cancer therapy, and B lymphocyte stimulator (BLyS) for stimulation of the immune system. An anti-BLyS antibody drug is in advanced preclinical development for treatment of autoimmune diseases.

  16. Genomics of Volvocine Algae

    PubMed Central

    Umen, James G.; Olson, Bradley J.S.C.

    2015-01-01

    Volvocine algae are a group of chlorophytes that together comprise a unique model for evolutionary and developmental biology. The species Chlamydomonas reinhardtii and Volvox carteri represent extremes in morphological diversity within the Volvocine clade. Chlamydomonas is unicellular and reflects the ancestral state of the group, while Volvox is multicellular and has evolved numerous innovations including germ-soma differentiation, sexual dimorphism, and complex morphogenetic patterning. The Chlamydomonas genome sequence has shed light on several areas of eukaryotic cell biology, metabolism and evolution, while the Volvox genome sequence has enabled a comparison with Chlamydomonas that reveals some of the underlying changes that enabled its transition to multicellularity, but also underscores the subtlety of this transition. Many of the tools and resources are in place to further develop Volvocine algae as a model for evolutionary genomics. PMID:25883411

  17. Identification of Genomic Signatures for the Design of Assays for the Detection and Monitoring of Anthrax Threats

    DTIC Science & Technology

    2004-09-01

    based genome alignment programs first create a suffix tree from the two input genomes. A suffix tree is a compact representation of all suffixes in the...suffix tree is searched for sequences that appear in both input genomes. These exact matching subsequences are known as maxi- mal exact matches (MEMs). The...suffix trees for the non-anchored parts of the input genomes and hence, gradu- ally reducing the gap sizes. Once the gaps are smaller than a threshold

  18. Ebolavirus comparative genomics.

    PubMed

    Jun, Se-Ran; Leuze, Michael R; Nookaew, Intawat; Uberbacher, Edward C; Land, Miriam; Zhang, Qian; Wanchai, Visanu; Chai, Juanjuan; Nielsen, Morten; Trolle, Thomas; Lund, Ole; Buzard, Gregory S; Pedersen, Thomas D; Wassenaar, Trudy M; Ussery, David W

    2015-09-01

    The 2014 Ebola outbreak in West Africa is the largest documented for this virus. To examine the dynamics of this genome, we compare more than 100 currently available ebolavirus genomes to each other and to other viral genomes. Based on oligomer frequency analysis, the family Filoviridae forms a distinct group from all other sequenced viral genomes. All filovirus genomes sequenced to date encode proteins with similar functions and gene order, although there is considerable divergence in sequences between the three genera Ebolavirus, Cuevavirus and Marburgvirus within the family Filoviridae. Whereas all ebolavirus genomes are quite similar (multiple sequences of the same strain are often identical), variation is most common in the intergenic regions and within specific areas of the genes encoding the glycoprotein (GP), nucleoprotein (NP) and polymerase (L). We predict regions that could contain epitope-binding sites, which might be good vaccine targets. This information, combined with glycosylation sites and experimentally determined epitopes, can identify the most promising regions for the development of therapeutic strategies.This manuscript has been authored by UT-Battelle, LLC under Contract No. DE-AC05-00OR22725 with the U.S. Department of Energy. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes. The Department of Energy will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan).

  19. Ebolavirus comparative genomics

    PubMed Central

    Jun, Se-Ran; Leuze, Michael R.; Nookaew, Intawat; Uberbacher, Edward C.; Land, Miriam; Zhang, Qian; Wanchai, Visanu; Chai, Juanjuan; Nielsen, Morten; Trolle, Thomas; Lund, Ole; Buzard, Gregory S.; Pedersen, Thomas D.; Wassenaar, Trudy M.; Ussery, David W.

    2015-01-01

    The 2014 Ebola outbreak in West Africa is the largest documented for this virus. To examine the dynamics of this genome, we compare more than 100 currently available ebolavirus genomes to each other and to other viral genomes. Based on oligomer frequency analysis, the family Filoviridae forms a distinct group from all other sequenced viral genomes. All filovirus genomes sequenced to date encode proteins with similar functions and gene order, although there is considerable divergence in sequences between the three genera Ebolavirus, Cuevavirus and Marburgvirus within the family Filoviridae. Whereas all ebolavirus genomes are quite similar (multiple sequences of the same strain are often identical), variation is most common in the intergenic regions and within specific areas of the genes encoding the glycoprotein (GP), nucleoprotein (NP) and polymerase (L). We predict regions that could contain epitope-binding sites, which might be good vaccine targets. This information, combined with glycosylation sites and experimentally determined epitopes, can identify the most promising regions for the development of therapeutic strategies. This manuscript has been authored by UT-Battelle, LLC under Contract No. DE-AC05-00OR22725 with the U.S. Department of Energy. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes. The Department of Energy will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan). PMID:26175035

  20. Landscape evolutionary genomics.

    PubMed

    Lowry, David B

    2010-08-23

    Tremendous advances in genetic and genomic techniques have resulted in the capacity to identify genes involved in adaptive evolution across numerous biological systems. One of the next major steps in evolutionary biology will be to determine how landscape-level geographical and environmental features are involved in the distribution of this functional adaptive genetic variation. Here, I outline how an emerging synthesis of multiple disciplines has and will continue to facilitate a deeper understanding of the ways in which heterogeneity of the natural landscapes mould the genomes of organisms.

  1. The cancer genome

    PubMed Central

    Stratton, Michael R.; Campbell, Peter J.; Futreal, P. Andrew

    2010-01-01

    All cancers arise as a result of changes that have occurred in the DNA sequence of the genomes of cancer cells. Over the past quarter of a century much has been learnt about these mutations and the abnormal genes that operate in human cancers. We are now, however, moving into an era in which it will be possible to obtain the complete DNA sequence of large numbers of cancer genomes. These studies will provide us with a detailed and comprehensive perspective on how individual cancers have developed. PMID:19360079

  2. The genomics of mycobacteria.

    PubMed

    Viale, M N; Zumárraga, M J; Araújo, F R; Zarraga, A M; Cataldi, A A; Romano, M I; Bigi, F

    2016-04-01

    The species Mycobacterium bovis and Mycobacterium avium subspecies paratuberculosis are the causal agents, respectively, of tuberculosis and paratuberculosis in animals. Both mycobacteria, especially M. bovis, are also important to public health because they can infect humans. In recent years, this and the impact of tuberculosis and paratuberculosis on animal production have led to significant advances in knowledge about both pathogens and their host interactions. This article describes the contribution of genomics and functional genomics to studies of the evolution, virulence, epidemiology and diagnosis of both these pathogenic mycobacteria.

  3. Methanococcus jannaschii genome: revisited

    NASA Technical Reports Server (NTRS)

    Kyrpides, N. C.; Olsen, G. J.; Klenk, H. P.; White, O.; Woese, C. R.

    1996-01-01

    Analysis of genomic sequences is necessarily an ongoing process. Initial gene assignments tend (wisely) to be on the conservative side (Venter, 1996). The analysis of the genome then grows in an iterative fashion as additional data and more sophisticated algorithms are brought to bear on the data. The present report is an emendation of the original gene list of Methanococcus jannaschii (Bult et al., 1996). By using a somewhat more updated database and more relaxed (and operator-intensive) pattern matching methods, we were able to add significantly to, and in a few cases amend, the gene identification table originally published by Bult et al. (1996).

  4. Brief Guide to Genomics: DNA, Genes and Genomes

    MedlinePlus

    ... guía de genómica A Brief Guide to Genomics DNA, Genes and Genomes Deoxyribonucleic acid (DNA) is the ... and lead to a disease such as cancer. DNA Sequencing Sequencing simply means determining the exact order ...

  5. Visualizing Genomic Annotations with the UCSC Genome Browser.

    PubMed

    Hung, Jui-Hung; Weng, Zhiping

    2016-11-01

    Genomic data and annotations are rapidly accumulating in databases such as the UCSC Genome Browser, NCBI, and Ensembl. Given the massive scale of these genomic databases, it is important to be able to easily retrieve known data and annotations of a specified genomic locus. For example, for a newly identified cis-regulatory element bound by a transcription factor, questions that immediately come to mind include whether the element is near a transcriptional start site and, if so, the name of the corresponding gene, and whether the histones or DNA at the locus are modified. The UCSC Genome Browser organizes data and annotations (called tracks) around the reference sequences or draft assemblies of many eukaryotic genomes and presents them using a powerful web-based graphical interface. This protocol describes how to use the UCSC Genome Browser to visualize selected tracks at specified genomic regions, download the data and annotations for further analysis, and retrieve multiple sequence alignments and their conservation scores.

  6. Playing with heart and soul…and genomes: sports implications and applications of personal genomics

    PubMed Central

    2013-01-01

    Whether the integration of genetic/omic technologies in sports contexts will facilitate player success, promote player safety, or spur genetic discrimination depends largely upon the game rules established by those currently designing genomic sports medicine programs. The integration has already begun, but there is not yet a playbook for best practices. Thus far discussions have focused largely on whether the integration would occur and how to prevent the integration from occurring, rather than how it could occur in such a way that maximizes benefits, minimizes risks, and avoids the exacerbation of racial disparities. Previous empirical research has identified members of the personal genomics industry offering sports-related DNA tests, and previous legal research has explored the impact of collective bargaining in professional sports as it relates to the employment protections of the Genetic Information Nondiscrimination Act (GINA). Building upon that research and upon participant observations with specific sports-related DNA tests purchased from four direct-to-consumer companies in 2011 and broader personal genomics (PGx) services, this anthropological, legal, and ethical (ALE) discussion highlights fundamental issues that must be addressed by those developing personal genomic sports medicine programs, either independently or through collaborations with commercial providers. For example, the vulnerability of student-athletes creates a number of issues that require careful, deliberate consideration. More broadly, however, this ALE discussion highlights potential sports-related implications (that ultimately might mitigate or, conversely, exacerbate racial disparities among athletes) of whole exome/genome sequencing conducted by biomedical researchers and clinicians for non-sports purposes. For example, the possibility that exome/genome sequencing of individuals who are considered to be non-patients, asymptomatic, normal, etc. will reveal the presence of variants of

  7. The Cassava Genome: Current Progress, Future Directions.

    PubMed

    Prochnik, Simon; Marri, Pradeep Reddy; Desany, Brian; Rabinowicz, Pablo D; Kodira, Chinnappa; Mohiuddin, Mohammed; Rodriguez, Fausto; Fauquet, Claude; Tohme, Joseph; Harkins, Timothy; Rokhsar, Daniel S; Rounsley, Steve

    2012-03-01

    The starchy swollen roots of cassava provide an essential food source for nearly a billion people, as well as possibilities for bioenergy, yet improvements to nutritional content and resistance to threatening diseases are currently impeded. A 454-based whole genome shotgun sequence has been assembled, which covers 69% of the predicted genome size and 96% of protein-coding gene space, with genome finishing underway. The predicted 30,666 genes and 3,485 alternate splice forms are supported by 1.4 M expressed sequence tags (ESTs). Maps based on simple sequence repeat (SSR)-, and EST-derived single nucleotide polymorphisms (SNPs) already exist. Thanks to the genome sequence, a high-density linkage map is currently being developed from a cross between two diverse cassava cultivars: one susceptible to cassava brown streak disease; the other resistant. An efficient genotyping-by-sequencing (GBS) approach is being developed to catalog SNPs both within the mapping population and among diverse African farmer-preferred varieties of cassava. These resources will accelerate marker-assisted breeding programs, allowing improvements in disease-resistance and nutrition, and will help us understand the genetic basis for disease resistance.

  8. Comparative Analysis of Genome Sequences with VISTA

    DOE Data Explorer

    Dubchak, Inna

    VISTA is a comprehensive suite of programs and databases developed by and hosted at the Genomics Division of Lawrence Berkeley National Laboratory. They provide information and tools designed to facilitate comparative analysis of genomic sequences. Users have two ways to interact with the suite of applications at the VISTA portal. They can submit their own sequences and alignments for analysis (VISTA servers) or examine pre-computed whole-genome alignments of different species. A key menu option is the Enhancer Browser and Database at http://enhancer.lbl.gov/. The VISTA Enhancer Browser is a central resource for experimentally validated human noncoding fragments with gene enhancer activity as assessed in transgenic mice. Most of these noncoding elements were selected for testing based on their extreme conservation with other vertebrates. The results of this enhancer screen are provided through this publicly available website. The browser also features relevant results by external contributors and a large collection of additional genome-wide conserved noncoding elements which are candidate enhancer sequences. The LBL developers invite external groups to submit computational predictions of developmental enhancers. As of 10/19/2009 the database contains information on 1109 in vivo tested elements - 508 elements with enhancer activity.

  9. Center for Cancer Genomics | Office of Cancer Genomics

    Cancer.gov

    The Center for Cancer Genomics (CCG) was established to unify the National Cancer Institute's activities in cancer genomics, with the goal of advancing genomics research and translating findings into the clinic to improve the precise diagnosis and treatment of cancers. In addition to promoting genomic sequencing approaches, CCG aims to accelerate structural, functional and computational research to explore cancer mechanisms, discover new cancer targets, and develop new therapeutics.

  10. GRAbB: Selective Assembly of Genomic Regions, a New Niche for Genomic Research.

    PubMed

    Brankovics, Balázs; Zhang, Hao; van Diepeningen, Anne D; van der Lee, Theo A J; Waalwijk, Cees; de Hoog, G Sybren

    2016-06-01

    GRAbB (Genomic Region Assembly by Baiting) is a new program that is dedicated to assemble specific genomic regions from NGS data. This approach is especially useful when dealing with multi copy regions, such as mitochondrial genome and the rDNA repeat region, parts of the genome that are often neglected or poorly assembled, although they contain interesting information from phylogenetic or epidemiologic perspectives, but also single copy regions can be assembled. The program is capable of targeting multiple regions within a single run. Furthermore, GRAbB can be used to extract specific loci from NGS data, based on homology, like sequences that are used for barcoding. To make the assembly specific, a known part of the region, such as the sequence of a PCR amplicon or a homologous sequence from a related species must be specified. By assembling only the region of interest, the assembly process is computationally much less demanding and may lead to assemblies of better quality. In this study the different applications and functionalities of the program are demonstrated such as: exhaustive assembly (rDNA region and mitochondrial genome), extracting homologous regions or genes (IGS, RPB1, RPB2 and TEF1a), as well as extracting multiple regions within a single run. The program is also compared with MITObim, which is meant for the exhaustive assembly of a single target based on a similar query sequence. GRAbB is shown to be more efficient than MITObim in terms of speed, memory and disk usage. The other functionalities (handling multiple targets simultaneously and extracting homologous regions) of the new program are not matched by other programs. The program is available with explanatory documentation at https://github.com/b-brankovics/grabb. GRAbB has been tested on Ubuntu (12.04 and 14.04), Fedora (23), CentOS (7.1.1503) and Mac OS X (10.7). Furthermore, GRAbB is available as a docker repository: brankovics/grabb (https://hub.docker.com/r/brankovics/grabb/).

  11. GRAbB: Selective Assembly of Genomic Regions, a New Niche for Genomic Research

    PubMed Central

    Zhang, Hao; van Diepeningen, Anne D.; van der Lee, Theo A. J.; Waalwijk, Cees; de Hoog, G. Sybren

    2016-01-01

    GRAbB (Genomic Region Assembly by Baiting) is a new program that is dedicated to assemble specific genomic regions from NGS data. This approach is especially useful when dealing with multi copy regions, such as mitochondrial genome and the rDNA repeat region, parts of the genome that are often neglected or poorly assembled, although they contain interesting information from phylogenetic or epidemiologic perspectives, but also single copy regions can be assembled. The program is capable of targeting multiple regions within a single run. Furthermore, GRAbB can be used to extract specific loci from NGS data, based on homology, like sequences that are used for barcoding. To make the assembly specific, a known part of the region, such as the sequence of a PCR amplicon or a homologous sequence from a related species must be specified. By assembling only the region of interest, the assembly process is computationally much less demanding and may lead to assemblies of better quality. In this study the different applications and functionalities of the program are demonstrated such as: exhaustive assembly (rDNA region and mitochondrial genome), extracting homologous regions or genes (IGS, RPB1, RPB2 and TEF1a), as well as extracting multiple regions within a single run. The program is also compared with MITObim, which is meant for the exhaustive assembly of a single target based on a similar query sequence. GRAbB is shown to be more efficient than MITObim in terms of speed, memory and disk usage. The other functionalities (handling multiple targets simultaneously and extracting homologous regions) of the new program are not matched by other programs. The program is available with explanatory documentation at https://github.com/b-brankovics/grabb. GRAbB has been tested on Ubuntu (12.04 and 14.04), Fedora (23), CentOS (7.1.1503) and Mac OS X (10.7). Furthermore, GRAbB is available as a docker repository: brankovics/grabb (https://hub.docker.com/r/brankovics/grabb/). PMID

  12. Potential assessment of genome-wide association study and genomic selection in Japanese pear Pyrus pyrifolia.

    PubMed

    Iwata, Hiroyoshi; Hayashi, Takeshi; Terakami, Shingo; Takada, Norio; Sawamura, Yutaka; Yamamoto, Toshiya

    2013-03-01

    Although the potential of marker-assisted selection (MAS) in fruit tree breeding has been reported, bi-parental QTL mapping before MAS has hindered the introduction of MAS to fruit tree breeding programs. Genome-wide association studies (GWAS) are an alternative to bi-parental QTL mapping in long-lived perennials. Selection based on genomic predictions of breeding values (genomic selection: GS) is another alternative for MAS. This study examined the potential of GWAS and GS in pear breeding with 76 Japanese pear cultivars to detect significant associations of 162 markers with nine agronomic traits. We applied multilocus Bayesian models accounting for ordinal categorical phenotypes for GWAS and GS model training. Significant associations were detected at harvest time, black spot resistance and the number of spurs and two of the associations were closely linked to known loci. Genome-wide predictions for GS were accurate at the highest level (0.75) in harvest time, at medium levels (0.38-0.61) in resistance to black spot, firmness of flesh, fruit shape in longitudinal section, fruit size, acid content and number of spurs and at low levels (<0.2) in all soluble solid content and vigor of tree. Results suggest the potential of GWAS and GS for use in future breeding programs in Japanese pear.

  13. Comparative proteogenomics: combining mass spectrometry and comparative genomics to analyze multiple genomes

    SciTech Connect

    Gupta, Nitin; Benhamida, Jamal; Bhargava, Vipul; Goodman, Daniel; Kain , Elisabeth; Kerman, Ian; Nguyen , Ngan; Ollikainen, Noah; Rodriguez, Jesse; Wang, J.; Lipton, Mary S.; Romine, Margaret F.; Bafna, Vineet; Smith, Richard D.; Pevzner, Pavel A.

    2008-07-30

    While bacterial genome annotations have significantly improved in recent years, techniques for bacterial proteome annotation (including post-translational chemical modifications, signal peptides, proteolytic events, etc.) are still in their infancy. At the same time, the number of sequenced bacterial genomes is rising sharply, far outpacing our ability to validate the predicted genes, let alone annotate bacterial proteomes. In this study, we use tandem mass spectrometry (MS/MS) to annotate the proteome of Shewanella oneidensis MR-1, an important microbe for bioremediation. In particular, we provide the first comprehensive map of post-translational modifications in a bacterial genome, including a large number of chemical modifications, signal peptide cleavages and cleavage of N-terminal methionine residues. We also detect multiple genes that were missed or assigned incorrect start positions by gene prediction programs and suggest corrections to improve the gene annotation. This study demonstrates that complementing every genome sequencing project by an MS/MS project would significantly improve both genome and proteome annotations for a reasonable cost.

  14. Genetic Transformation and Genomic Resources for Next-Generation Precise Genome Engineering in Vegetable Crops

    PubMed Central

    Cardi, Teodoro; D’Agostino, Nunzio; Tripodi, Pasquale

    2017-01-01

    In the frame of modern agriculture facing the predicted increase of population and general environmental changes, the securement of high quality food remains a major challenge to deal with. Vegetable crops include a large number of species, characterized by multiple geographical origins, large genetic variability and diverse reproductive features. Due to their nutritional value, they have an important place in human diet. In recent years, many crop genomes have been sequenced permitting the identification of genes and superior alleles associated with desirable traits. Furthermore, innovative biotechnological approaches allow to take a step forward towards the development of new improved cultivars harboring precise genome modifications. Sequence-based knowledge coupled with advanced biotechnologies is supporting the widespread application of new plant breeding techniques to enhance the success in modification and transfer of useful alleles into target varieties. Clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated protein 9 system, zinc-finger nucleases, and transcription activator-like effector nucleases represent the main methods available for plant genome engineering through targeted modifications. Such technologies, however, require efficient transformation protocols as well as extensive genomic resources and accurate knowledge before they can be efficiently exploited in practical breeding programs. In this review, we revise the state of the art in relation to availability of such scientific and technological resources in various groups of vegetables, describe genome editing results obtained so far and discuss the implications for future applications. PMID:28275380

  15. The tomato genome: implications for plant breeding, genomics and evolution

    PubMed Central

    2012-01-01

    The genome sequence of tomato (Solanum lycopersicum), one of the most important vegetable crops, has recently been decoded. We address implications of the tomato genome for plant breeding, genomics and evolutionary studies, and its potential to fuel future crop biology research. PMID:22943138

  16. Dynamic evolution of genomes and the concept of genome space.

    PubMed

    Bellgard, M I; Itoh, T; Watanabe, H; Imanishi, T; Gojobori, T

    1999-05-18

    A new era in the elucidation of genome evolution has been heralded with the availability of numerous genome sequences. With these data, it has been possible to study evolutionary processes at a greater level of detail in order to characterize features such as gene shuffling, genome rearrangements, base bias composition, and horizontal gene transfer. In this paper, we discuss the evolutionary implications of significant rearrangements within genomes as well as characteristic genomic regions that have been conserved across genomes. This is based on our analysis of orthologous and paralogous genes. We argue that genome plasticity has most likely contributed substantially to the dynamic evolution of genomes. We also describe the characteristic mosaic features of an archaea genome that is comprised of both bacterial and eukaryal elements. Here we investigate base compositional differences as well as the similarity of this species' genes to either bacteria or eukarya. We conclude that these features can be largely explained by the mechanism of horizontal gene transfer. Finally, we introduce the concept of genome space which is defined as the entire set of genomes of all living organisms. We explain its usefulness to describe as well as to gain deeper insight into the general features of the dynamic genomic evolutionary process.

  17. Genomic Data Commons launches - TCGA

    Cancer.gov

    The Genomic Data Commons (GDC), a unified data system that promotes sharing of genomic and clinical data between researchers, launched today with a visit from Vice President Joe Biden to the operations center at the University of Chicago.

  18. RIKEN mouse genome encyclopedia.

    PubMed

    Hayashizaki, Yoshihide

    2003-01-01

    We have been working to establish the comprehensive mouse full-length cDNA collection and sequence database to cover as many genes as we can, named Riken mouse genome encyclopedia. Recently we are constructing higher-level annotation (Functional ANnoTation Of Mouse cDNA; FANTOM) not only with homology search based annotation but also with expression data profile, mapping information and protein-protein database. More than 1,000,000 clones prepared from 163 tissues were end-sequenced to classify into 159,789 clusters and 60,770 representative clones were fully sequenced. As a conclusion, the 60,770 sequences contained 33,409 unique. The next generation of life science is clearly based on all of the genome information and resources. Based on our cDNA clones we developed the additional system to explore gene function. We developed cDNA microarray system to print all of these cDNA clones, protein-protein interaction screening system, protein-DNA interaction screening system and so on. The integrated database of all the information is very useful not only for analysis of gene transcriptional network and for the connection of gene to phenotype to facilitate positional candidate approach. In this talk, the prospect of the application of these genome resourced should be discussed. More information is available at the web page: http://genome.gsc.riken.go.jp/.

  19. Better chocolate through genomics

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Theobroma cacao, the cacao or chocolate tree, is a tropical understory tree whose seeds are used to make chocolate. And like any important crop, cacao is the subject of much research. On September 15, 2010, scientists publicly released a preliminary sequence of the cacao genome--which contains all o...

  20. Prenatal Whole Genome Sequencing

    PubMed Central

    Donley, Greer; Hull, Sara Chandros; Berkman, Benjamin E.

    2014-01-01

    With whole genome sequencing set to become the preferred method of prenatal screening, we need to pay more attention to the massive amount of information it will deliver to parents—and the fact that we don't yet understand what most of it means. PMID:22777977

  1. The tomato genome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The tomato genome sequence was undertaken at a time when state-of-the-art sequencing methodologies were undergoing a transition to co-called next generation methodologies. The result was an international consortium undertaking a strategy merging both old and new approaches. Because biologists were...

  2. [Genomic instability in atherosclerosis].

    PubMed

    Dzhokhadze, T A; Buadze, T Zh; Gaiozishvili, M N; Kakauridze, N G; Lezhava, T A

    2014-11-01

    A comparative study of the level of genomic instability, parameters of quantitative and structural mutations of chromosomes (aberration, aneuploidy, polyploidy) in lymphocyte cultures from patients with atherosclerosis of age 80 years and older (control group - 30-35 years old) was conducted. The possibility of correction of disturbed genomic indicators by peptide bioregulators - Livagen (Lys-Glu-Asp-Ala) and cobalt ions with separate application or in combination was also studied. Control was lymphocyte culture of two healthy respective age groups. It was also shown that patients with atherosclerosis exhibit high level of genomic instability in all studied parameters, regardless of age, which may suggest that there is marked increase in chromatin condensation in atherosclerosis. It was also shown that Livagen (characterized by modifying influence on chromatin) separately and in combination with cobalt ions, promotes normalization of altered genomic indicators of atherosclerosis in both age groups. The results show that Livagen separately and in combination with cobalt ions has impact on chromatin of patients with atherosclerosis. The identified protective action of Livagen proves its efficacy in prevention of atherosclerosis.

  3. Poster: the macaque genome.

    PubMed

    2007-04-13

    The rhesus macaque (Macaca mulatta) facilitates an extraordinary range of biomedical and basic research, and the publication of the genome only makes it a more powerful model for studies of human disease; moreover, the macaque's position relative to humans and chimpanzees affords the opportunity to learn about the processes that have shaped the last 25 million years of primate evolution. To allow users to explore these themes of the macaque genome, Science has created a special interactive version of the poster published in the print edition of the 13 April 2007 issue. The interactive version includes additional text and exploration, as well as embedded video featuring seven scientists discussing the importance of the macaque and its genome sequence in studies of biomedicine and evolution. We have also created an accompanying teaching resource, including a lesson plan aimed at teachers of advanced high school life science students, for exploring what a comparison of the macaque and human genomes can tell us about human biology and evolution. These items are free to all site visitors.

  4. The Nostoc punctiforme Genome

    SciTech Connect

    John C. Meeks

    2001-12-31

    Nostoc punctiforme is a filamentous cyanobacterium with extensive phenotypic characteristics and a relatively large genome, approaching 10 Mb. The phenotypic characteristics include a photoautotrophic, diazotrophic mode of growth, but N. punctiforme is also facultatively heterotrophic; its vegetative cells have multiple development alternatives, including terminal differentiation into nitrogen-fixing heterocysts and transient differentiation into spore-like akinetes or motile filaments called hormogonia; and N. punctiforme has broad symbiotic competence with fungi and terrestrial plants, including bryophytes, gymnosperms and an angiosperm. The shotgun-sequencing phase of the N. punctiforme strain ATCC 29133 genome has been completed by the Joint Genome Institute. Annotation of an 8.9 Mb database yielded 7432 open reading frames, 45% of which encode proteins with known or probable known function and 29% of which are unique to N. punctiforme. Comparative analysis of the sequence indicates a genome that is highly plastic and in a state of flux, with numerous insertion sequences and multilocus repeats, as well as genes encoding transposases and DNA modification enzymes. The sequence also reveals the presence of genes encoding putative proteins that collectively define almost all characteristics of cyanobacteria as a group. N. punctiforme has an extensive potential to sense and respond to environmental signals as reflected by the presence of more than 400 genes encoding sensor protein kinases, response regulators and other transcriptional factors. The signal transduction systems and any of the large number of unique genes may play essential roles in the cell differentiation and symbiotic interaction properties of N. punctiforme.

  5. Ascaris suum draft genome.

    PubMed

    Jex, Aaron R; Liu, Shiping; Li, Bo; Young, Neil D; Hall, Ross S; Li, Yingrui; Yang, Linfeng; Zeng, Na; Xu, Xun; Xiong, Zijun; Chen, Fangyuan; Wu, Xuan; Zhang, Guojie; Fang, Xiaodong; Kang, Yi; Anderson, Garry A; Harris, Todd W; Campbell, Bronwyn E; Vlaminck, Johnny; Wang, Tao; Cantacessi, Cinzia; Schwarz, Erich M; Ranganathan, Shoba; Geldhof, Peter; Nejsum, Peter; Sternberg, Paul W; Yang, Huanming; Wang, Jun; Wang, Jian; Gasser, Robin B

    2011-10-26

    Parasitic diseases have a devastating, long-term impact on human health, welfare and food production worldwide. More than two billion people are infected with geohelminths, including the roundworms Ascaris (common roundworm), Necator and Ancylostoma (hookworms), and Trichuris (whipworm), mainly in developing or impoverished nations of Asia, Africa and Latin America. In humans, the diseases caused by these parasites result in about 135,000 deaths annually, with a global burden comparable with that of malaria or tuberculosis in disability-adjusted life years. Ascaris alone infects around 1.2 billion people and, in children, causes nutritional deficiency, impaired physical and cognitive development and, in severe cases, death. Ascaris also causes major production losses in pigs owing to reduced growth, failure to thrive and mortality. The Ascaris-swine model makes it possible to study the parasite, its relationship with the host, and ascariasis at the molecular level. To enable such molecular studies, we report the 273 megabase draft genome of Ascaris suum and compare it with other nematode genomes. This genome has low repeat content (4.4%) and encodes about 18,500 protein-coding genes. Notably, the A. suum secretome (about 750 molecules) is rich in peptidases linked to the penetration and degradation of host tissues, and an assemblage of molecules likely to modulate or evade host immune responses. This genome provides a comprehensive resource to the scientific community and underpins the development of new and urgently needed interventions (drugs, vaccines and diagnostic tests) against ascariasis and other nematodiases.

  6. (Genomic variation in maize)

    SciTech Connect

    Rivin, C.J.

    1991-01-01

    These studies have sought to learn how different DNA sequences and sequence arrangements contribute to genome plasticity in maize. We describe quantitative variation among maize inbred lines for tandemly arrayed and dispersed repeated DNA sequences and gene families, and qualitative variation for sequences homologous to the Mutator family of transposons. The potential of these sequences to undergo unequal crossing over, non-allelic (ectopic) recombination and transposition makes them a source of genome instability. We have found examples of rapid genomic change involving these sequences in Fl hybrids, tissue culture cells and regenerated plants. We describe the repetitive portion of the maize genome as composed primarily of sequences that vary markedly in copy number among different genetic stocks. The most highly variable is the 185 bp repeat associated with the heterochromatic chromosome knobs. Even in lines without visible knobs, there is a considerable quantity of tandemly arrayed repeats. We also found a high degree of variability for the tandemly arrayed 5S and ribosomal DNA repeats. While such variation might be expected as the result of unequal cross-over, we were surprised to find considerable variation among lower copy number, dispersed repeats as well. One highly repeated sequence that showed a complex tandem and dispersed arrangement stood out as showing no detectable variability among the maize lines. In striking contrast to the variability seen between the inbred stocks, individuals within a stock were indistinguishable with regard to their repeated sequence multiplicities.

  7. Genetics, genomics and fertility

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In order to enhance the sustainability of dairy businesses, new management tools are needed to increase the fertility of dairy cattle. Genomic selection has been successfully used by AI studs to screen potential sires and significantly decrease the generation interval of bulls. Buoyed by the success...

  8. The G4 Genome

    PubMed Central

    Maizels, Nancy; Gray, Lucas T.

    2013-01-01

    Recent experiments provide fascinating examples of how G4 DNA and G4 RNA structures—aka quadruplexes—may contribute to normal biology and to genomic pathologies. Quadruplexes are transient and therefore difficult to identify directly in living cells, which initially caused skepticism regarding not only their biological relevance but even their existence. There is now compelling evidence for functions of some G4 motifs and the corresponding quadruplexes in essential processes, including initiation of DNA replication, telomere maintenance, regulated recombination in immune evasion and the immune response, control of gene expression, and genetic and epigenetic instability. Recognition and resolution of quadruplex structures is therefore an essential component of genome biology. We propose that G4 motifs and structures that participate in key processes compose the G4 genome, analogous to the transcriptome, proteome, or metabolome. This is a new view of the genome, which sees DNA as not only a simple alphabet but also a more complex geography. The challenge for the future is to systematically identify the G4 motifs that form quadruplexes in living cells and the features that confer on specific G4 motifs the ability to function as structural elements. PMID:23637633

  9. The human genome project.

    PubMed Central

    Olson, M V

    1993-01-01

    The Human Genome Project in the United States is now well underway. Its programmatic direction was largely set by a National Research Council report issued in 1988. The broad framework supplied by this report has survived almost unchanged despite an upheaval in the technology of genome analysis. This upheaval has primarily affected physical and genetic mapping, the two dominant activities in the present phase of the project. Advances in mapping techniques have allowed good progress toward the specific goals of the project and are also providing strong corollary benefits throughout biomedical research. Actual DNA sequencing of the genomes of the human and model organisms is still at an early stage. There has been little progress in the intrinsic efficiency of DNA-sequence determination. However, refinements in experimental protocols, instrumentation, and project management have made it practical to acquire sequence data on an enlarged scale. It is also increasingly apparent that DNA-sequence data provide a potent means of relating knowledge gained from the study of model organisms to human biology. There is as yet little indication that the infusion of technology from outside biology into the Human Genome Project has been effectively stimulated. Opportunities in this area remain large, posing substantial technical and policy challenges. PMID:8506271

  10. Genomics in Cardiovascular Disease

    PubMed Central

    Roberts, Robert; Marian, A.J.; Dandona, Sonny; Stewart, Alexandre F.R.

    2013-01-01

    A paradigm shift towards biology occurred in the 1990’s subsequently catalyzed by the sequencing of the human genome in 2000. The cost of DNA sequencing has gone from millions to thousands of dollars with sequencing of one’s entire genome costing only $1,000. Rapid DNA sequencing is being embraced for single gene disorders, particularly for sporadic cases and those from small families. Transmission of lethal genes such as associated with Huntington’s disease can, through in-vitro fertilization, avoid passing it on to one’s offspring. DNA sequencing will meet the challenge of elucidating the genetic predisposition for common polygenic diseases, especially in determining the function of the novel common genetic risk variants and identifying the rare variants, which may also partially ascertain the source of the missing heritability. The challenge for DNA sequencing remains great, despite human genome sequences being 99.5% identical, the 3 million single nucleotide polymorphisms (SNPs) responsible for most of the unique features add up to 60 new mutations per person which, for 7 billion people, is 420 billion mutations. It is claimed that DNA sequencing has increased 10,000 fold while information storage and retrieval only 16 fold. The physician and health user will be challenged by the convergence of two major trends, whole genome sequencing and the storage/retrieval and integration of the data. PMID:23524054

  11. Genomic imprinting: parental influence on the genome.

    PubMed

    Reik, W; Walter, J

    2001-01-01

    Genomic imprinting affects several dozen mammalian genes and results in the expression of those genes from only one of the two parental chromosomes. This is brought about by epigenetic instructions--imprints--that are laid down in the parental germ cells. Imprinting is a particularly important genetic mechanism in mammals, and is thought to influence the transfer of nutrients to the fetus and the newborn from the mother. Consistent with this view is the fact that imprinted genes tend to affect growth in the womb and behaviour after birth. Aberrant imprinting disturbs development and is the cause of various disease syndromes. The study of imprinting also provides new insights into epigenetic gene modification during development.

  12. Targeted Large-Scale Deletion of Bacterial Genomes Using CRISPR-Nickases.

    PubMed

    Standage-Beier, Kylie; Zhang, Qi; Wang, Xiao

    2015-11-20

    Programmable CRISPR-Cas systems have augmented our ability to produce precise genome manipulations. Here we demonstrate and characterize the ability of CRISPR-Cas derived nickases to direct targeted recombination of both small and large genomic regions flanked by repetitive elements in Escherichia coli. While CRISPR directed double-stranded DNA breaks are highly lethal in many bacteria, we show that CRISPR-guided nickase systems can be programmed to make precise, nonlethal, single-stranded incisions in targeted genomic regions. This induces recombination events and leads to targeted deletion. We demonstrate that dual-targeted nicking enables deletion of 36 and 97 Kb of the genome. Furthermore, multiplex targeting enables deletion of 133 Kb, accounting for approximately 3% of the entire E. coli genome. This technology provides a framework for methods to manipulate bacterial genomes using CRISPR-nickase systems. We envision this system working synergistically with preexisting bacterial genome engineering methods.

  13. Plant functional genomics

    NASA Astrophysics Data System (ADS)

    Holtorf, Hauke; Guitton, Marie-Christine; Reski, Ralf

    2002-04-01

    Functional genome analysis of plants has entered the high-throughput stage. The complete genome information from key species such as Arabidopsis thaliana and rice is now available and will further boost the application of a range of new technologies to functional plant gene analysis. To broadly assign functions to unknown genes, different fast and multiparallel approaches are currently used and developed. These new technologies are based on known methods but are adapted and improved to accommodate for comprehensive, large-scale gene analysis, i.e. such techniques are novel in the sense that their design allows researchers to analyse many genes at the same time and at an unprecedented pace. Such methods allow analysis of the different constituents of the cell that help to deduce gene function, namely the transcripts, proteins and metabolites. Similarly the phenotypic variations of entire mutant collections can now be analysed in a much faster and more efficient way than before. The different methodologies have developed to form their own fields within the functional genomics technological platform and are termed transcriptomics, proteomics, metabolomics and phenomics. Gene function, however, cannot solely be inferred by using only one such approach. Rather, it is only by bringing together all the information collected by different functional genomic tools that one will be able to unequivocally assign functions to unknown plant genes. This review focuses on current technical developments and their impact on the field of plant functional genomics. The lower plant Physcomitrella is introduced as a new model system for gene function analysis, owing to its high rate of homologous recombination.

  14. TUTORIAL ON NETWORK GENOMICS.

    SciTech Connect

    Forst, C.

    2001-01-01

    With the ever-increasing genomic information pouring into the databases researchers start to look for pattern in genomes. Key questions are the identification of function. In the past function was mainly understood to be assigned to a single gene isolated from other cellular components or mechanisms. Sequence comparison fo single genes and their products (proteins) as well as of intergenic space are a consequence of a well established one-gene one-function interpretation. prediction of function solely by sequence similarity searches are powerful techniques that initiated the advent of bioinformatics and computational biology. Seminal work on sequence alignment by Temple Smith and Michael Waterman [33] and sequence searches with the BLAST algorithm by Altschul et al. [2] provide essential methods for sequence based determination of function. Similar outstanding contributions to determination of function have been archived in the area of structure prediction, molecular modeling and molecular dynamics. Techniques covering ab initio and homology modeling up to biophysical interpretation of long-run molecular dynamics simulations are mentioned ehre. With the ever-increasing number of information of different genetic/genomic origin, new aspect are looked for that deviate from the single gene at a time method. Especially with the identification of surprisingly few human genes the emerging perception in the scientific community that the concept of function has to be extended to include other sequence based as well as non-sequenced based information. A schema of determination of function by different concepts is shown in Figure 1. The tutorial is comprised of the following sections: The first two sections discuss the differences between genomic and non-genomic based context information, section three will cover combined methods. Finally, section four lsits web-resources and databases. All presented approaches extensively employ comparative methods.

  15. Towards Sequencing Cotton (Gossypium) Genomes

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Despite rapidly decreasing costs and innovative technologies, sequencing of angiosperm genomes is not yet undertaken lightly. Generating larger amounts of sequence data more quickly does not address the difficulties of sequencing and assembling complex genomes de novo. The cotton genomes represent a...

  16. From human genome to cancer genome: The first decade

    PubMed Central

    Wheeler, David A.; Wang, Linghua

    2013-01-01

    The realization that cancer progression required the participation of cellular genes provided one of several key rationales, in 1986, for embarking on the human genome project. Only with a reference genome sequence could the full spectrum of somatic changes leading to cancer be understood. Since its completion in 2003, the human reference genome sequence has fulfilled its promise as a foundational tool to illuminate the pathogenesis of cancer. Herein, we review the key historical milestones in cancer genomics since the completion of the genome, and some of the novel discoveries that are shaping our current understanding of cancer. PMID:23817046

  17. Comprehensive genome sequencing of the liver cancer genome.

    PubMed

    Nakagawa, Hidewaki; Shibata, Tatsuhiro

    2013-11-01

    Hepatocellular carcinoma (HCC) is the third leading cause of cancer-related death worldwide. Recently, comprehensive whole genome and exome sequencing analyses for HCC revealed new cancer-associated genes and a variety of genomic alterations. In particular, frequent genetic alterations of the chromatin remodeling genes were observed, suggesting a new potential therapeutic target for HCC. Sequencing analysis has further identified the molecular complexities of multicentric lesions and intratumoral heterogeneity. Detailed analyses of the somatic substitution pattern of the cancer genome and the HBV virus genome integration sites by using whole-genome sequencing will elucidate the molecular basis and diverse etiological factors involved in liver cancer development.

  18. Implications of the Human Genome Project

    SciTech Connect

    Kitcher, P.

    1998-11-01

    The Human Genome Project (HGP), launched in 1991, aims to map and sequence the human genome by 2006. During the fifteen-year life of the project, it is projected that $3 billion in federal funds will be allocated to it. The ultimate aims of spending this money are to analyze the structure of human DNA, to identify all human genes, to recognize the functions of those genes, and to prepare for the biology and medicine of the twenty-first century. The following summary examines some of the implications of the program, concentrating on its scientific import and on the ethical and social problems that it raises. Its aim is to expose principles that might be used in applying the information which the HGP will generate. There is no attempt here to translate the principles into detailed proposals for legislation. Arguments and discussion can be found in the full report, but, like this summary, that report does not contain any legislative proposals.

  19. Programming languages for synthetic biology.

    PubMed

    Umesh, P; Naveen, F; Rao, Chanchala Uma Maheswara; Nair, Achuthsankar S

    2010-12-01

    In the backdrop of accelerated efforts for creating synthetic organisms, the nature and scope of an ideal programming language for scripting synthetic organism in-silico has been receiving increasing attention. A few programming languages for synthetic biology capable of defining, constructing, networking, editing and delivering genome scale models of cellular processes have been recently attempted. All these represent important points in a spectrum of possibilities. This paper introduces Kera, a state of the art programming language for synthetic biology which is arguably ahead of similar languages or tools such as GEC, Antimony and GenoCAD. Kera is a full-fledged object oriented programming language which is tempered by biopart rule library named Samhita which captures the knowledge regarding the interaction of genome components and catalytic molecules. Prominent feature of the language are demonstrated through a toy example and the road map for the future development of Kera is also presented.

  20. Using Genomics for Natural Product Structure Elucidation.

    PubMed

    Tietz, Jonathan I; Mitchell, Douglas A

    2016-01-01

    Natural products (NPs) are the most historically bountiful source of chemical matter for drug development-especially for anti-infectives. With insights gleaned from genome mining, interest in natural product discovery has been reinvigorated. An essential stage in NP discovery is structural elucidation, which sheds light not only on the chemical composition of a molecule but also its novelty, properties, and derivatization potential. The history of structure elucidation is replete with techniquebased revolutions: combustion analysis, crystallography, UV, IR, MS, and NMR have each provided game-changing advances; the latest such advance is genomics. All natural products have a genetic basis, and the ability to obtain and interpret genomic information for structure elucidation is increasingly available at low cost to non-specialists. In this review, we describe the value of genomics as a structural elucidation technique, especially from the perspective of the natural product chemist approaching an unknown metabolite. Herein we first introduce the databases and programs of interest to the natural products chemist, with an emphasis on those currently most suited for general usability. We describe strategies for linking observed natural product-linked phenotypes to their corresponding gene clusters. We then discuss techniques for extracting structural information from genes, illustrated with numerous case examples. We also provide an analysis of the biases and limitations of the field with recommendations for future development. Our overview is not only aimed at biologically-oriented researchers already at ease with bioinformatic techniques, but also, in particular, at natural product, organic, and/or medicinal chemists not previously familiar with genomic techniques.

  1. Complete genome sequence of Allochromatium vinosum DSM 180T

    PubMed Central

    Weissgerber, Thomas; Zigann, Renate; Bruce, David; Chang, Yun-juan; Detter, John C.; Han, Cliff; Hauser, Loren; Jeffries, Cynthia D.; Land, Miriam; Munk, A. Christine; Tapia, Roxanne; Dahl, Christiane

    2011-01-01

    Allochromatium vinosum formerly Chromatium vinosum is a mesophilic purple sulfur bacterium belonging to the family Chromatiaceae in the bacterial class Gammaproteobacteria. The genus Allochromatium contains currently five species. All members were isolated from freshwater, brackish water or marine habitats and are predominately obligate phototrophs. Here we describe the features of the organism, together with the complete genome sequence and annotation. This is the first completed genome sequence of a member of the Chromatiaceae within the purple sulfur bacteria thriving in globally occurring habitats. The 3,669,074 bp genome with its 3,302 protein-coding and 64 RNA genes was sequenced within the Joint Genome Institute Community Sequencing Program. PMID:22675582

  2. Global implementation of genomic medicine: We are not alone.

    PubMed

    Manolio, Teri A; Abramowicz, Marc; Al-Mulla, Fahd; Anderson, Warwick; Balling, Rudi; Berger, Adam C; Bleyl, Steven; Chakravarti, Aravinda; Chantratita, Wasun; Chisholm, Rex L; Dissanayake, Vajira H W; Dunn, Michael; Dzau, Victor J; Han, Bok-Ghee; Hubbard, Tim; Kolbe, Anne; Korf, Bruce; Kubo, Michiaki; Lasko, Paul; Leego, Erkki; Mahasirimongkol, Surakameth; Majumdar, Partha P; Matthijs, Gert; McLeod, Howard L; Metspalu, Andres; Meulien, Pierre; Miyano, Satoru; Naparstek, Yaakov; O'Rourke, P Pearl; Patrinos, George P; Rehm, Heidi L; Relling, Mary V; Rennert, Gad; Rodriguez, Laura Lyman; Roden, Dan M; Shuldiner, Alan R; Sinha, Sukdeb; Tan, Patrick; Ulfendahl, Mats; Ward, Robyn; Williams, Marc S; Wong, John E L; Green, Eric D; Ginsburg, Geoffrey S

    2015-06-03

    Around the world, innovative genomic-medicine programs capitalize on singular capabilities arising from local health care systems, cultural or political milieus, and unusual selected risk alleles or disease burdens. Such individual efforts might benefit from the sharing of approaches and lessons learned in other locales. The U.S. National Human Genome Research Institute and the National Academy of Medicine recently brought together 25 of these groups to compare projects, to examine the current state of implementation and desired near-term capabilities, and to identify opportunities for collaboration that promote the responsible practice of genomic medicine. Efforts to coalesce these groups around concrete but compelling signature projects should accelerate the responsible implementation of genomic medicine in efforts to improve clinical care worldwide.

  3. Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies.

    PubMed

    Haas, Brian J; Delcher, Arthur L; Mount, Stephen M; Wortman, Jennifer R; Smith, Roger K; Hannick, Linda I; Maiti, Rama; Ronning, Catherine M; Rusch, Douglas B; Town, Christopher D; Salzberg, Steven L; White, Owen

    2003-10-01

    The spliced alignment of expressed sequence data to genomic sequence has proven a key tool in the comprehensive annotation of genes in eukaryotic genomes. A novel algorithm was developed to assemble clusters of overlapping transcript alignments (ESTs and full-length cDNAs) into maximal alignment assemblies, thereby comprehensively incorporating all available transcript data and capturing subtle splicing variations. Complete and partial gene structures identified by this method were used to improve The Institute for Genomic Research Arabidopsis genome annotation (TIGR release v.4.0). The alignment assemblies permitted the automated modeling of several novel genes and >1000 alternative splicing variations as well as updates (including UTR annotations) to nearly half of the approximately 27 000 annotated protein coding genes. The algorithm of the Program to Assemble Spliced Alignments (PASA) tool is described, as well as the results of automated updates to Arabidopsis gene annotations.

  4. Complete genome sequence of Arthrobacter sp. strain FB24

    PubMed Central

    Nakatsu, Cindy H.; Barabote, Ravi; Thompson, Sue; Bruce, David; Detter, Chris; Brettin, Thomas; Han, Cliff; Beasley, Federico; Chen, Weimin; Konopka, Allan; Xie, Gary

    2013-01-01

    Arthrobacter sp. strain FB24 is a species in the genus Arthrobacter Conn and Dimmick 1947, in the family Micrococcaceae and class Actinobacteria. A number of Arthrobacter genome sequences have been completed because of their important role in soil, especially bioremediation. This isolate is of special interest because it is tolerant to multiple metals and it is extremely resistant to elevated concentrations of chromate. The genome consists of a 4,698,945 bp circular chromosome and three plasmids (96,488, 115,507, and 159,536 bp, a total of 5,070,478 bp), coding 4,536 proteins of which 1,257 are without known function. This genome was sequenced as part of the DOE Joint Genome Institute Program. PMID:24501649

  5. Complete genome sequence of Arthrobacter sp. strain FB24

    SciTech Connect

    Nakatsu, C. H.; Barabote, Ravi; Thompson, Sue; Bruce, David; Detter, Chris; Brettin, T.; Han, Cliff F.; Beasley, Federico; Chen, Weimin; Konopka, Allan; Xie, Gary

    2013-09-30

    Arthrobacter sp. strain FB24 is a species in the genus Arthrobacter Conn and Dimmick 1947, in the family Micrococcaceae and class Actinobacteria. A number of Arthrobacter genome sequences have been completed because of their important role in soil, especially bioremediation. This isolate is of special interest because it is tolerant to multiple metals and it is extremely resistant to elevated concentrations of chromate. The genome consists of a 4,698,945 bp circular chromosome and three plasmids (96,488, 115,507, and 159,536 bp, a total of 5,070,478 bp), coding 4,536 proteins of which 1,257 are without known function. This genome was sequenced as part of the DOE Joint Genome Institute Program.

  6. Determining protein function and interaction from genome analysis

    DOEpatents

    Eisenberg, David; Marcotte, Edward M.; Thompson, Michael J.; Pellegrini, Matteo; Yeates, Todd O.

    2004-08-03

    A computational method system, and computer program are provided for inferring functional links from genome sequences. One method is based on the observation that some pairs of proteins A' and B' have homologs in another organism fused into a single protein chain AB. A trans-genome comparison of sequences can reveal these AB sequences, which are Rosetta Stone sequences because they decipher an interaction between A' and B. Another method compares the genomic sequence of two or more organisms to create a phylogenetic profile for each protein indicating its presence or absence across all the genomes. The profile provides information regarding functional links between different families of proteins. In yet another method a combination of the above two methods is used to predict functional links.

  7. Assigning protein functions by comparative genome analysis protein phylogenetic profiles

    DOEpatents

    Pellegrini, Matteo; Marcotte, Edward M.; Thompson, Michael J.; Eisenberg, David; Grothe, Robert; Yeates, Todd O.

    2003-05-13

    A computational method system, and computer program are provided for inferring functional links from genome sequences. One method is based on the observation that some pairs of proteins A' and B' have homologs in another organism fused into a single protein chain AB. A trans-genome comparison of sequences can reveal these AB sequences, which are Rosetta Stone sequences because they decipher an interaction between A' and B. Another method compares the genomic sequence of two or more organisms to create a phylogenetic profile for each protein indicating its presence or absence across all the genomes. The profile provides information regarding functional links between different families of proteins. In yet another method a combination of the above two methods is used to predict functional links.

  8. Genome-wide synteny through highly sensitive sequence alignment: Satsuma

    PubMed Central

    Grabherr, Manfred G.; Russell, Pamela; Meyer, Miriah; Mauceli, Evan; Alföldi, Jessica; Di Palma, Federica; Lindblad-Toh, Kerstin

    2010-01-01

    Motivation: Comparative genomics heavily relies on alignments of large and often complex DNA sequences. From an engineering perspective, the problem here is to provide maximum sensitivity (to find all there is to find), specificity (to only find real homology) and speed (to accommodate the billions of base pairs of vertebrate genomes). Results: Satsuma addresses all three issues through novel strategies: (i) cross-correlation, implemented via fast Fourier transform; (ii) a match scoring scheme that eliminates almost all false hits; and (iii) an asynchronous ‘battleship’-like search that allows for aligning two entire fish genomes (470 and 217 Mb) in 120 CPU hours using 15 processors on a single machine. Availability: Satsuma is part of the Spines software package, implemented in C++ on Linux. The latest version of Spines can be freely downloaded under the LGPL license from http://www.broadinstitute.org/science/programs/genome-biology/spines/ Contact: grabherr@broadinstitute.org PMID:20208069

  9. A report from the Sixth International Mouse Genome Conference

    SciTech Connect

    Brown, S.

    1992-12-31

    The Sixth Annual Mouse Genome Conference was held in October, 1992 at Buffalo, USA. The mouse is one of the primary model organisms in the Human Genome Project. Through the use of gene targeting studies the mouse has become a powerful biological model for the study of gene function and, in addition, the comparison of the many homologous mutations identified in human and mouse have widened our understanding of the biology of these two organisms. A primary goal in the mouse genome program has been to create a genetic map of STSs of high resolution (<1cM) that would form the basis for the physical mapping of the whole mouse genome. Buffalo saw substantial new progress towards the goal of a very high density genetic map and the beginnings of substantive efforts towards physical mapping in chromosome regions with a high density of genetic markers.

  10. Whole genome sequencing of a begomovirus-resistant tomato inbred reveals introgressions from wild Solanum species

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The low cost of next generation sequencing (NGS) technology and the availability of a large number of well annotated plant genomes has made sequencing technology useful to breeding programs. With the published high quality tomato reference genome of the processing cultivar Heinz 1706, we can now uti...

  11. 75 FR 53703 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-09-01

    ... National Human Genome Research Institute; Notice of Closed Meeting Pursuant to section 10(d) of the Federal... Review Officer, Scientific Review Branch, National Human Genome Research Institute, National Institutes... review and funding cycle. (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human...

  12. Impact of marker ascertainment bias on genomic selection accuracy and estimates of genetic diversity

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genome-wide molecular markers are readily being applied to evaluate genetic diversity in germplasm collections and for making genomic selections in breeding programs. To accurately predict phenotypes and assay genetic diversity, molecular markers should assay a representative sample of the polymorp...

  13. High-quality draft assemblies of mammalian genomes from massively parallel sequence data

    PubMed Central

    Gnerre, Sante; MacCallum, Iain; Przybylski, Dariusz; Ribeiro, Filipe J.; Burton, Joshua N.; Walker, Bruce J.; Sharpe, Ted; Hall, Giles; Shea, Terrance P.; Sykes, Sean; Berlin, Aaron M.; Aird, Daniel; Costello, Maura; Daza, Riza; Williams, Louise; Nicol, Robert; Gnirke, Andreas; Nusbaum, Chad; Lander, Eric S.; Jaffe, David B.

    2011-01-01

    Massively parallel DNA sequencing technologies are revolutionizing genomics by making it possible to generate billions of relatively short (~100-base) sequence reads at very low cost. Whereas such data can be readily used for a wide range of biomedical applications, it has proven difficult to use them to generate high-quality de novo genome assemblies of large, repeat-rich vertebrate genomes. To date, the genome assemblies generated from such data have fallen far short of those obtained with the older (but much more expensive) capillary-based sequencing approach. Here, we report the development of an algorithm for genome assembly, ALLPATHS-LG, and its application to massively parallel DNA sequence data from the human and mouse genomes, generated on the Illumina platform. The resulting draft genome assemblies have good accuracy, short-range contiguity, long-range connectivity, and coverage of the genome. In particular, the base accuracy is high (≥99.95%) and the scaffold sizes (N50 size = 11.5 Mb for human and 7.2 Mb for mouse) approach those obtained with capillary-based sequencing. The combination of improved sequencing technology and improved computational methods should now make it possible to increase dramatically the de novo sequencing of large genomes. The ALLPATHS-LG program is available at http://www.broadinstitute.org/science/programs/genome-biology/crd. PMID:21187386

  14. Genome of Crocodilepox Virus

    PubMed Central

    Afonso, C. L.; Tulman, E. R.; Delhon, G.; Lu, Z.; Viljoen, G. J.; Wallace, D. B.; Kutish, G. F.; Rock, D. L.

    2006-01-01

    Here, we present the genome sequence, with analysis, of a poxvirus infecting Nile crocodiles (Crocodylus niloticus) (crocodilepox virus; CRV). The genome is 190,054 bp (62% G+C) and predicted to contain 173 genes encoding proteins of 53 to 1,941 amino acids. The central genomic region contains genes conserved and generally colinear with those of other chordopoxviruses (ChPVs). CRV is distinct, as the terminal 33-kbp (left) and 13-kbp (right) genomic regions are largely CRV specific, containing 48 unique genes which lack similarity to other poxvirus genes. Notably, CRV also contains 14 unique genes which disrupt ChPV gene colinearity within the central genomic region, including 7 genes encoding GyrB-like ATPase domains similar to those in cellular type IIA DNA topoisomerases, suggestive of novel ATP-dependent functions. The presence of 10 CRV proteins with similarity to components of cellular multisubunit E3 ubiquitin-protein ligase complexes, including 9 proteins containing F-box motifs and F-box-associated regions and a homologue of cellular anaphase-promoting complex subunit 11 (Apc11), suggests that modification of host ubiquitination pathways may be significant for CRV-host cell interaction. CRV encodes a novel complement of proteins potentially involved in DNA replication, including a NAD+-dependent DNA ligase and a protein with similarity to both vaccinia virus F16L and prokaryotic serine site-specific resolvase-invertases. CRV lacks genes encoding proteins for nucleotide metabolism. CRV shares notable genomic similarities with molluscum contagiosum virus, including genes found only in these two viruses. Phylogenetic analysis indicates that CRV is quite distinct from other ChPVs, representing a new genus within the subfamily Chordopoxvirinae, and it lacks recognizable homologues of most ChPV genes involved in virulence and host range, including those involving interferon response, intracellular signaling, and host immune response modulation. These data reveal

  15. Implementing genomics and pharmacogenomics in the clinic: The National Human Genome Research Institute's genomic medicine portfolio.

    PubMed

    Manolio, Teri A

    2016-10-01

    Increasing knowledge about the influence of genetic variation on human health and growing availability of reliable, cost-effective genetic testing have spurred the implementation of genomic medicine in the clinic. As defined by the National Human Genome Research Institute (NHGRI), genomic medicine uses an individual's genetic information in his or her clinical care, and has begun to be applied effectively in areas such as cancer genomics, pharmacogenomics, and rare and undiagnosed diseases. In 2011 NHGRI published its strategic vision for the future of genomic research, including an ambitious research agenda to facilitate and promote the implementation of genomic medicine. To realize this agenda, NHGRI is consulting and facilitating collaborations with the external research community through a series of "Genomic Medicine Meetings," under the guidance and leadership of the National Advisory Council on Human Genome Research. These meetings have identified and begun to address significant obstacles to implementation, such as lack of evidence of efficacy, limited availability of genomics expertise and testing, lack of standards, and difficulties in integrating genomic results into electronic medical records. The six research and dissemination initiatives comprising NHGRI's genomic research portfolio are designed to speed the evaluation and incorporation, where appropriate, of genomic technologies and findings into routine clinical care. Actual adoption of successful approaches in clinical care will depend upon the willingness, interest, and energy of professional societies, practitioners, patients, and payers to promote their responsible use and share their experiences in doing so.

  16. The perennial ryegrass GenomeZipper: targeted use of genome resources for comparative grass genomics.

    PubMed

    Pfeifer, Matthias; Martis, Mihaela; Asp, Torben; Mayer, Klaus F X; Lübberstedt, Thomas; Byrne, Stephen; Frei, Ursula; Studer, Bruno

    2013-02-01

    Whole-genome sequences established for model and major crop species constitute a key resource for advanced genomic research. For outbreeding forage and turf grass species like ryegrasses (Lolium spp.), such resources have yet to be developed. Here, we present a model of the perennial ryegrass (Lolium perenne) genome on the basis of conserved synteny to barley (Hordeum vulgare) and the model grass genome Brachypodium (Brachypodium distachyon) as well as rice (Oryza sativa) and sorghum (Sorghum bicolor). A transcriptome-based genetic linkage map of perennial ryegrass served as a scaffold to establish the chromosomal arrangement of syntenic genes from model grass species. This scaffold revealed a high degree of synteny and macrocollinearity and was then utilized to anchor a collection of perennial ryegrass genes in silico to their predicted genome positions. This resulted in the unambiguous assignment of 3,315 out of 8,876 previously unmapped genes to the respective chromosomes. In total, the GenomeZipper incorporates 4,035 conserved grass gene loci, which were used for the first genome-wide sequence divergence analysis between perennial ryegrass, barley, Brachypodium, rice, and sorghum. The perennial ryegrass GenomeZipper is an ordered, information-rich genome scaffold, facilitating map-based cloning and genome assembly in perennial ryegrass and closely related Poaceae species. It also represents a milestone in describing synteny between perennial ryegrass and fully sequenced model grass genomes, thereby increasing our understanding of genome organization and evolution in the most important temperate forage and turf grass species.

  17. Nongenetic functions of the genome.

    PubMed

    Bustin, Michael; Misteli, Tom

    2016-05-06

    The primary function of the genome is to store, propagate, and express the genetic information that gives rise to a cell's architectural and functional machinery. However, the genome is also a major structural component of the cell. Besides its genetic roles, the genome affects cellular functions by nongenetic means through its physical and structural properties, particularly by exerting mechanical forces and by serving as a scaffold for binding of cellular components. Major cellular processes affected by nongenetic functions of the genome include establishment of nuclear structure, signal transduction, mechanoresponses, cell migration, and vision in nocturnal animals. We discuss the concept, mechanisms, and implications of nongenetic functions of the genome.

  18. Genomics and the immune system.

    PubMed

    Pipkin, Matthew E; Monticelli, Silvia

    2008-05-01

    While the hereditary information encoded in the Watson-Crick base pairing of genomes is largely static within a given individual, access to this information is controlled by dynamic mechanisms. The human genome is pervasively transcribed, but the roles played by the majority of the non-protein-coding genome sequences are still largely unknown. In this review we focus on insights to gene transcriptional regulation by placing special emphasis on genome-wide approaches, and on how non-coding RNAs, which derive from global transcription of the genome, in turn control gene expression. We review recent progress in the field with highlights on the immune system.

  19. Are we Genomic Mosaics? Variations of the Genome of Somatic Cells can Contribute to Diversify our Phenotypes.

    PubMed

    Astolfi, P A; Salamini, F; Sgaramella, V

    2010-09-01

    Theoretical and experimental evidences support the hypothesis that the genomes and the epigenomes may be different in the somatic cells of complex organisms. In the genome, the differences range from single base substitutions to chromosome number; in the epigenome, they entail multiple postsynthetic modifications of the chromatin. Somatic genome variations (SGV) may accumulate during development in response both to genetic programs, which may differ from tissue to tissue, and to environmental stimuli, which are often undetected and generally irreproducible. SGV may jeopardize physiological cellular functions, but also create novel coding and regulatory sequences, to be exposed to intraorganismal Darwinian selection. Genomes acknowledged as comparatively poor in genes, such as humans', could thus increase their pristine informational endowment. A better understanding of SGV will contribute to basic issues such as the "nature vs nurture" dualism and the inheritance of acquired characters. On the applied side, they may explain the low yield of cloning via somatic cell nuclear transfer, provide clues to some of the problems associated with transdifferentiation, and interfere with individual DNA analysis. SGV may be unique in the different cells types and in the different developmental stages, and thus explain the several hundred gaps persisting in the human genomes "completed" so far. They may compound the variations associated to our epigenomes and make of each of us an "(epi)genomic" mosaic. An ensuing paradigm is the possibility that a single genome (the ephemeral one assembled at fertilization) has the capacity to generate several different brains in response to different environments.

  20. Informational laws of genome structures

    PubMed Central

    Bonnici, Vincenzo; Manca, Vincenzo

    2016-01-01

    In recent years, the analysis of genomes by means of strings of length k occurring in the genomes, called k-mers, has provided important insights into the basic mechanisms and design principles of genome structures. In the present study, we focus on the proper choice of the value of k for applying information theoretic concepts that express intrinsic aspects of genomes. The value k = lg2(n), where n is the genome length, is determined to be the best choice in the definition of some genomic informational indexes that are studied and computed for seventy genomes. These indexes, which are based on information entropies and on suitable comparisons with random genomes, suggest five informational laws, to which all of the considered genomes obey. Moreover, an informational genome complexity measure is proposed, which is a generalized logistic map that balances entropic and anti-entropic components of genomes and is related to their evolutionary dynamics. Finally, applications to computational synthetic biology are briefly outlined. PMID:27354155

  1. Advances in plant chromosome genomics.

    PubMed

    Doležel, Jaroslav; Vrána, Jan; Cápal, Petr; Kubaláková, Marie; Burešová, Veronika; Simková, Hana

    2014-01-01

    Next generation sequencing (NGS) is revolutionizing genomics and is providing novel insights into genome organization, evolution and function. The number of plant genomes targeted for sequencing is rising. For the moment, however, the acquisition of full genome sequences in large genome species remains difficult, largely because the short reads produced by NGS platforms are inadequate to cope with repeat-rich DNA, which forms a large part of these genomes. The problem of sequence redundancy is compounded in polyploids, which dominate the plant kingdom. An approach to overcoming some of these difficulties is to reduce the full nuclear genome to its individual chromosomes using flow-sorting. The DNA acquired in this way has proven to be suitable for many applications, including PCR-based physical mapping, in situ hybridization, forming DNA arrays, the development of DNA markers, the construction of BAC libraries and positional cloning. Coupling chromosome sorting with NGS offers opportunities for the study of genome organization at the single chromosomal level, for comparative analyses between related species and for the validation of whole genome assemblies. Apart from the primary aim of reducing the complexity of the template, taking a chromosome-based approach enables independent teams to work in parallel, each tasked with the analysis of a different chromosome(s). Given that the number of plant species tractable for chromosome sorting is increasing, the likelihood is that chromosome genomics - the marriage of cytology and genomics - will make a significant contribution to the field of plant genetics.

  2. Evolution of small prokaryotic genomes.

    PubMed

    Martínez-Cano, David J; Reyes-Prieto, Mariana; Martínez-Romero, Esperanza; Partida-Martínez, Laila P; Latorre, Amparo; Moya, Andrés; Delaye, Luis

    2014-01-01

    As revealed by genome sequencing, the biology of prokaryotes with reduced genomes is strikingly diverse. These include free-living prokaryotes with ∼800 genes as well as endosymbiotic bacteria with as few as ∼140 genes. Comparative genomics is revealing the evolutionary mechanisms that led to these small genomes. In the case of free-living prokaryotes, natural selection directly favored genome reduction, while in the case of endosymbiotic prokaryotes neutral processes played a more prominent role. However, new experimental data suggest that selective processes may be at operation as well for endosymbiotic prokaryotes at least during the first stages of genome reduction. Endosymbiotic prokaryotes have evolved diverse strategies for living with reduced gene sets inside a host-defined medium. These include utilization of host-encoded functions (some of them coded by genes acquired by gene transfer from the endosymbiont and/or other bacteria); metabolic complementation between co-symbionts; and forming consortiums with other bacteria within the host. Recent genome sequencing projects of intracellular mutualistic bacteria showed that previously believed universal evolutionary trends like reduced G+C content and conservation of genome synteny are not always present in highly reduced genomes. Finally, the simplified molecular machinery of some of these organisms with small genomes may be used to aid in the design of artificial minimal cells. Here we review recent genomic discoveries of the biology of prokaryotes endowed with small gene sets and discuss the evolutionary mechanisms that have been proposed to explain their peculiar nature.

  3. Informational laws of genome structures

    NASA Astrophysics Data System (ADS)

    Bonnici, Vincenzo; Manca, Vincenzo

    2016-06-01

    In recent years, the analysis of genomes by means of strings of length k occurring in the genomes, called k-mers, has provided important insights into the basic mechanisms and design principles of genome structures. In the present study, we focus on the proper choice of the value of k for applying information theoretic concepts that express intrinsic aspects of genomes. The value k = lg2(n), where n is the genome length, is determined to be the best choice in the definition of some genomic informational indexes that are studied and computed for seventy genomes. These indexes, which are based on information entropies and on suitable comparisons with random genomes, suggest five informational laws, to which all of the considered genomes obey. Moreover, an informational genome complexity measure is proposed, which is a generalized logistic map that balances entropic and anti-entropic components of genomes and is related to their evolutionary dynamics. Finally, applications to computational synthetic biology are briefly outlined.

  4. Evolution of small prokaryotic genomes

    PubMed Central

    Martínez-Cano, David J.; Reyes-Prieto, Mariana; Martínez-Romero, Esperanza; Partida-Martínez, Laila P.; Latorre, Amparo; Moya, Andrés; Delaye, Luis

    2015-01-01

    As revealed by genome sequencing, the biology of prokaryotes with reduced genomes is strikingly diverse. These include free-living prokaryotes with ∼800 genes as well as endosymbiotic bacteria with as few as ∼140 genes. Comparative genomics is revealing the evolutionary mechanisms that led to these small genomes. In the case of free-living prokaryotes, natural selection directly favored genome reduction, while in the case of endosymbiotic prokaryotes neutral processes played a more prominent role. However, new experimental data suggest that selective processes may be at operation as well for endosymbiotic prokaryotes at least during the first stages of genome reduction. Endosymbiotic prokaryotes have evolved diverse strategies for living with reduced gene sets inside a host-defined medium. These include utilization of host-encoded functions (some of them coded by genes acquired by gene transfer from the endosymbiont and/or other bacteria); metabolic complementation between co-symbionts; and forming consortiums with other bacteria within the host. Recent genome sequencing projects of intracellular mutualistic bacteria showed that previously believed universal evolutionary trends like reduced G+C content and conservation of genome synteny are not always present in highly reduced genomes. Finally, the simplified molecular machinery of some of these organisms with small genomes may be used to aid in the design of artificial minimal cells. Here we review recent genomic discoveries of the biology of prokaryotes endowed with small gene sets and discuss the evolutionary mechanisms that have been proposed to explain their peculiar nature. PMID:25610432

  5. Sequencing technologies and genome sequencing.

    PubMed

    Pareek, Chandra Shekhar; Smoczynski, Rafal; Tretyn, Andrzej

    2011-11-01

    The high-throughput - next generation sequencing (HT-NGS) technologies are currently the hottest topic in the field of human and animals genomics researches, which can produce over 100 times more data compared to the most sophisticated capillary sequencers based on the Sanger method. With the ongoing developments of high throughput sequencing machines and advancement of modern bioinformatics tools at unprecedented pace, the target goal of sequencing individual genomes of living organism at a cost of $1,000 each is seemed to be realistically feasible in the near future. In the relatively short time frame since 2005, the HT-NGS technologies are revolutionizing the human and animal genome researches by analysis of chromatin immunoprecipitation coupled to DNA microarray (ChIP-chip) or sequencing (ChIP-seq), RNA sequencing (RNA-seq), whole genome genotyping, genome wide structural variation, de novo assembling and re-assembling of genome, mutation detection and carrier screening, detection of inherited disorders and complex human diseases, DNA library preparation, paired ends and genomic captures, sequencing of mitochondrial genome and personal genomics. In this review, we addressed the important features of HT-NGS like, first generation DNA sequencers, birth of HT-NGS, second generation HT-NGS platforms, third generation HT-NGS platforms: including single molecule Heliscope™, SMRT™ and RNAP sequencers, Nanopore, Archon Genomics X PRIZE foundation, comparison of second and third HT-NGS platforms, applications, advances and future perspectives of sequencing technologies on human and animal genome research.

  6. Nuclear envelope and genome interactions in cell fate

    PubMed Central

    Talamas, Jessica A.; Capelson, Maya

    2015-01-01

    The eukaryotic cell nucleus houses an organism’s genome and is the location within the cell where all signaling induced and development-driven gene expression programs are ultimately specified. The genome is enclosed and separated from the cytoplasm by the nuclear envelope (NE), a double-lipid membrane bilayer, which contains a large variety of trans-membrane and associated protein complexes. In recent years, research regarding multiple aspects of the cell nucleus points to a highly dynamic and coordinated concert of efforts between chromatin and the NE in regulation of gene expression. Details of how this concert is orchestrated and how it directs cell differentiation and disease are coming to light at a rapid pace. Here we review existing and emerging concepts of how interactions between the genome and the NE may contribute to tissue specific gene expression programs to determine cell fate. PMID:25852741

  7. Comparative genomics of Brassicaceae crops

    PubMed Central

    Sharma, Ashutosh; Li, Xiaonan; Lim, Yong Pyo

    2014-01-01

    The family Brassicaceae is one of the major groups of the plant kingdom and comprises diverse species of great economic, agronomic and scientific importance, including the model plant Arabidopsis. The sequencing of the Arabidopsis genome has revolutionized our knowledge in the field of plant biology and provides a foundation in genomics and comparative biology. Genomic resources have been utilized in Brassica for diversity analyses, construction of genetic maps and identification of agronomic traits. In Brassicaceae, comparative sequence analysis across the species has been utilized to understand genome structure, evolution and the detection of conserved genomic segments. In this review, we focus on the progress made in genetic resource development, genome sequencing and comparative mapping in Brassica and related species. The utilization of genomic resources and next-generation sequencing approaches in improvement of Brassica crops is also discussed. PMID:24987286

  8. Advances on Genome Duplication Distances

    NASA Astrophysics Data System (ADS)

    Gagnon, Yves; Savard, Olivier Tremblay; Bertrand, Denis; El-Mabrouk, Nadia

    Given a phylogenetic tree involving Whole Genome Duplication events, we contribute to the problem of computing the rearrangement distance on a branch of a tree linking a duplication node d to a speciation node or a leaf s. In the case of a genome G at s containing exactly two copies of each gene, the genome halving problem is to find a perfectly duplicated genome D at d minimizing the rearrangement distance with G. We generalize the existing exact linear-time algorithm for genome halving to the case of a genome G with missing gene copies. In the case of a known ancestral duplicated genome D, we develop a greedy approach for computing the distance between G and D that is shown time-efficient and very accurate for both the rearrangement and DCJ distances.

  9. Complete Genome Sequences of Bordetella pertussis Vaccine Reference Strains 134 and 10536

    PubMed Central

    Peng, Yanhui; Loparev, Vladimir; Batra, Dhwani; Burroughs, Mark; Johnson, Taccara; Juieng, Phalasy; Rowe, Lori; Tondella, M. Lucia; Williams, Margaret M.

    2016-01-01

    Vaccine formulations and vaccination programs against whooping cough (pertussis) vary worldwide. Here, we report the complete genome sequences of two divergent Bordetella pertussis reference strains used in the production of pertussis vaccines. PMID:27635001

  10. Big cat genomics.

    PubMed

    O'Brien, Stephen J; Johnson, Warren E

    2005-01-01

    Advances in population and quantitative genomics, aided by the computational algorithms that employ genetic theory and practice, are now being applied to biological questions that surround free-ranging species not traditionally suitable for genetic enquiry. Here we review how applications of molecular genetic tools have been used to describe the natural history, present status, and future disposition of wild cat species. Insight into phylogenetic hierarchy, demographic contractions, geographic population substructure, behavioral ecology, and infectious diseases have revealed strategies for survival and adaptation of these fascinating predators. Conservation, stabilization, and management of the big cats are important areas that derive benefit from the genome resources expanded and applied to highly successful species, imperiled by an expanding human population.

  11. Bacterial genome annotation.

    PubMed

    Beckloff, Nicholas; Starkenburg, Shawn; Freitas, Tracey; Chain, Patrick

    2012-01-01

    Annotation of prokaryotic sequences can be separated into structural and functional annotation. Structural annotation is dependent on algorithmic interrogation of experimental evidence to discover the physical characteristics of a gene. This is done in an effort to construct accurate gene models, so understanding function or evolution of genes among organisms is not impeded. Functional annotation is dependent on sequence similarity to other known genes or proteins in an effort to assess the function of the gene. Combining structural and functional annotation across genomes in a comparative manner promotes higher levels of accurate annotation as well as an advanced understanding of genome evolution. As the availability of bacterial sequences increases and annotation methods improve, the value of comparative annotation will increase.

  12. [Genomics in medicine].

    PubMed

    Ruiz Esparza-Garrido, Ruth; Velázquez-Flores, Miguel Angel; Arenas-Aranda, Diego Julio; Salamanca-Gómez, Fabio

    2014-01-01

    The development of new fields of study in genetics, as the -omic sciences (transcriptomics, proteomics, metabolomics), has allowed the study of the regulation and expression of genomes. Therefore, nowadays it is possible to study global alterations--in the whole genome--and their effect at the protein and metabolic levels. Importantly, this new way of studying genetics has opened new areas of knowledge, and new cellular mechanisms that regulate the functioning of biological systems have been elucidated. In the clinical field, in the last years new molecular tools have been implemented. These tools are favorable to a better classification, diagnosis and prognosis of several human diseases. Additionally, in some cases best treatments, which improve the quality of life of patients, have been established. Due to the previous assertion, it is important to review and divulge changes in the study of genetics as a result of the development of the -omic sciences, which is the aim of this review.

  13. Viruses within animal genomes.

    PubMed

    De Brognier, A; Willems, L

    2016-04-01

    Viruses and their hosts can co-evolve to reach a fragile equilibrium that allows the survival of both. An excess of pathogenicity in the absence of a reservoir would be detrimental to virus survival. A significant proportion of all animal genomes has been shaped by the insertion of viruses that subsequently became 'fossilised'. Most endogenous viruses have lost the capacity to replicate via an infectious cycle and now replicate passively. The insertion of endogenous viruses has contributed to the evolution of animal genomes, for example in the reproductive biology of mammals. However, spontaneous viral integration still occasionally occurs in a number of virus-host systems. This constitutes a potential risk to host survival but also provides an opportunity for diversification and evolution.

  14. Mapping the human genome

    SciTech Connect

    Annas, G.C.; Elias, S.

    1992-01-01

    This article is a review of the book Mapping the Human Genome: Using Law and Ethics as Guides, edited by George C. Annas and Sherman Elias. The book is a collection of essays on the subject of using ethics and laws as guides to justify human gene mapping. It addresses specific issues such problems related to eugenics, patents, insurance as well as broad issues such as the societal definitions of normality.

  15. Genomic landscape of liposarcoma

    PubMed Central

    Kanojia, Deepika; Nagata, Yasunobu; Garg, Manoj; Lee, Dhong Hyun; Sato, Aiko; Yoshida, Kenichi; Sato, Yusuke; Sanada, Masashi; Mayakonda, Anand; Bartenhagen, Christoph; Klein, Hans-Ulrich; Doan, Ngan B.; Said, Jonathan W.; Mohith, S.; Gunasekar, Swetha; Shiraishi, Yuichi; Chiba, Kenichi; Tanaka, Hiroko; Miyano, Satoru; Myklebost, Ola; Yang, Henry; Dugas, Martin; Meza-Zepeda, Leonardo A.; Silberman, Allan W.; Forscher, Charles; Tyner, Jeffrey W.; Ogawa, Seishi; Koeffler, H. Phillip

    2015-01-01

    Liposarcoma (LPS) is the most common type of soft tissue sarcoma accounting for 20% of all adult sarcomas. Due to absence of clinically effective treatment options in inoperable situations and resistance to chemotherapeutics, a critical need exists to identify novel therapeutic targets. We analyzed LPS genomic landscape using SNP arrays, whole exome sequencing and targeted exome sequencing to uncover the genomic information for development of specific anti-cancer targets. SNP array analysis indicated known amplified genes (MDM2, CDK4, HMGA2) and important novel genes (UAP1, MIR557, LAMA4, CPM, IGF2, ERBB3, IGF1R). Carboxypeptidase M (CPM), recurrently amplified gene in well-differentiated/de-differentiated LPS was noted as a putative oncogene involved in the EGFR pathway. Notable deletions were found at chromosome 1p (RUNX3, ARID1A), chromosome 11q (ATM, CHEK1) and chromosome 13q14.2 (MIR15A, MIR16-1). Significantly and recurrently mutated genes (false discovery rate < 0.05) included PLEC (27%), MXRA5 (21%), FAT3 (24%), NF1 (20%), MDC1 (10%), TP53 (7%) and CHEK2 (6%). Further, in vitro and in vivo functional studies provided evidence for the tumor suppressor role for Neurofibromin 1 (NF1) gene in different subtypes of LPS. Pathway analysis of recurrent mutations demonstrated signaling through MAPK, JAK-STAT, Wnt, ErbB, axon guidance, apoptosis, DNA damage repair and cell cycle pathways were involved in liposarcomagenesis. Interestingly, we also found mutational and copy number heterogeneity within a primary LPS tumor signifying the importance of multi-region sequencing for cancer-genome guided therapy. In summary, these findings provide insight into the genomic complexity of LPS and highlight potential druggable pathways for targeted therapeutic approach. PMID:26643872

  16. Genomics of cellulosic biofuels.

    PubMed

    Rubin, Edward M

    2008-08-14

    The development of alternatives to fossil fuels as an energy source is an urgent global priority. Cellulosic biomass has the potential to contribute to meeting the demand for liquid fuel, but land-use requirements and process inefficiencies represent hurdles for large-scale deployment of biomass-to-biofuel technologies. Genomic information gathered from across the biosphere, including potential energy crops and microorganisms able to break down biomass, will be vital for improving the prospects of significant cellulosic biofuel production.

  17. Genome Wide Association Studies

    NASA Astrophysics Data System (ADS)

    Sebastiani, Paola; Solovieff, Nadia

    The availability of high throughput technology for parallel genotyping has opened the field of genetics to genome-wide association studies (GWAS). These studies generate massive amount of genetic data that challenge investigators with issues related to data management, statistical analysis of large data sets, visualization, and annotation of results. We will review the common approach to analysis of GWAS data and then discuss options to learn more from these data.

  18. Personalized Genomic Medicine with a Patchwork, Partially Owned Genome

    PubMed Central

    Mason, Christopher E.; Seringhaus, Michael R.; Sattler de Sousa e Brito, Clara

    2008-01-01

    “His book was known as the Book of Sand, because neither the book nor the sand have any beginning or end.” — Jorge Luis Borges The human genome is a three billion-letter recipe for the genesis of a human being, directing development from a single-celled embryo to the trillions of adult cells. Since the sequencing of the human genome was announced in 2001, researchers have an increased ability to discern the genetic basis for diseases. This reference genome has opened the door to genomic medicine, aimed at detecting and understanding all genetic variations of the human genome that contribute to the manifestation and progression of disease. The overarching vision of genomic (or “personalized”) medicine is to custom-tailor each treatment for maximum effectiveness in an individual patient. Detecting the variation in a patient’s deoxyribonucleic acid (DNA), ribonucleic acid (RNA), and protein structures is no longer an insurmountable hurdle. Today, the challenge for genomic medicine lies in contextualizing those myriad genetic variations in terms of their functional consequences for a person’s health and development throughout life and in terms of that patient’s susceptibility to disease and differential clinical responses to medication. Additionally, several recent developments have complicated our understanding of the nominal human genome and, thereby, altered the progression of genomic medicine. In this brief review, we shall focus on these developments and examine how they are changing our understanding of our genome. PMID:18449389

  19. Mapping the human genome

    SciTech Connect

    Cantor, Charles R.

    1989-06-01

    The following pages aim to lay a foundation for understanding the excitement surrounding the ''human genome project,'' as well as to convey a flavor of the ongoing efforts and plans at the Human Genome Center at the Lawrence Berkeley Laboratory. Our own work, of course, is only part of a broad international effort that will dramatically enhance our understanding of human molecular genetics before the end of this century. In this country, the bulk of the effort will be carried out under the auspices of the Department of Energy and the National Institutes of Health, but significant contributions have already been made both by nonprofit private foundations and by private corporation. The respective roles of the DOE and the NIH are being coordinated by an inter-agency committee, the aims of which are to emphasize the strengths of each agency, to facilitate cooperation, and to avoid unnecessary duplication of effort. The NIH, for example, will continue its crucial work in medical genetics and in mapping the genomes of nonhuman species. The DOE, on the other hand, has unique experience in managing large projects, and its national laboratories are repositories of expertise in physics, engineering, and computer science, as well as the life sciences. The tools and techniques the project will ultimately rely on are thus likely to be developed in multidisciplinary efforts at laboratories like LBL. Accordingly, we at LBL take great pride in this enterprise -- an enterprise that will eventually transform our understanding of ourselves.

  20. The canine genome.

    PubMed

    Ostrander, Elaine A; Wayne, Robert K

    2005-12-01

    The dog has emerged as a premier species for the study of morphology, behavior, and disease. The recent availability of a high-quality draft sequence lifts the dog system to a new threshold. We provide a primer to use the dog genome by first focusing on its evolutionary history. We overview the relationship of dogs to wild canids and discuss their origin and domestication. Dogs clearly originated from a substantial number of gray wolves and dog breeds define distinct genetic units that can be divided into at least four hierarchical groupings. We review evidence showing that dogs have high levels of linkage disequilibrium. Consequently, given that dog breeds express specific phenotypic traits and vary in behavior and the incidence of genetic disease, genomic-wide scans for linkage disequilibrium may allow the discovery of genes influencing breed-specific characteristics. Finally, we review studies that have utilized the dog to understand the genetic underpinning of several traits, and we summarize genomic resources that can be used to advance such studies. We suggest that given these resources and the unique characteristics of breeds, that the dog is a uniquely valuable resource for studying the genetic basis of complex traits.

  1. Genome Sequence of Bacterial Interference Strain Staphylococcus aureus 502A.

    PubMed

    Parker, Dane; Narechania, Apurva; Sebra, Robert; Deikus, Gintaras; Larussa, Samuel; Ryan, Chanelle; Smith, Hannah; Prince, Alice; Mathema, Barun; Ratner, Adam J; Kreiswirth, Barry; Planet, Paul J

    2014-04-10

    Staphylococcus aureus 502A was a strain used in bacterial interference programs during the 1960s and early 1970s. Infants were deliberately colonized with 502A with the goal of preventing colonization with more invasive strains. We present the completed genome sequence of this organism.

  2. Sequencing the Genome of the Heirloom Watermelon Cultivar Charleston Gray

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The genome of the watermelon cultivar Charleston Gray, a major heirloom which has been used in breeding programs of many watermelon cultivars, was sequenced. Our strategy involved a hybrid approach using the Illumina and 454/Titanium next-generation sequencing technologies. For Illumina, shotgun g...

  3. The Functional Genomics Initiative at Oak Ridge National Laboratory

    SciTech Connect

    Johnson, Dabney; Justice, Monica; Beattle, Ken; Buchanan, Michelle; Ramsey, Michael; Ramsey, Rose; Paulus, Michael; Ericson, Nance; Allison, David; Kress, Reid; Mural, Richard; Uberbacher, Ed; Mann, Reinhold

    1997-12-31

    The Functional Genomics Initiative at the Oak Ridge National Laboratory integrates outstanding capabilities in mouse genetics, bioinformatics, and instrumentation. The 50 year investment by the DOE in mouse genetics/mutagenesis has created a one-of-a-kind resource for generating mutations and understanding their biological consequences. It is generally accepted that, through the mouse as a surrogate for human biology, we will come to understand the function of human genes. In addition to this world class program in mammalian genetics, ORNL has also been a world leader in developing bioinformatics tools for the analysis, management and visualization of genomic data. Combining this expertise with new instrumentation technologies will provide a unique capability to understand the consequences of mutations in the mouse at both the organism and molecular levels. The goal of the Functional Genomics Initiative is to develop the technology and methodology necessary to understand gene function on a genomic scale and apply these technologies to megabase regions of the human genome. The effort is scoped so as to create an effective and powerful resource for functional genomics. ORNL is partnering with the Joint Genome Institute and other large scale sequencing centers to sequence several multimegabase regions of both human and mouse genomic DNA, to identify all the genes in these regions, and to conduct fundamental surveys to examine gene function at the molecular and organism level. The Initiative is designed to be a pilot for larger scale deployment in the post-genome era. Technologies will be applied to the examination of gene expression and regulation, metabolism, gene networks, physiology and development.

  4. Whole-genome sequencing for comparative genomics and de novo genome assembly.

    PubMed

    Benjak, Andrej; Sala, Claudia; Hartkoorn, Ruben C

    2015-01-01

    Next-generation sequencing technologies for whole-genome sequencing of mycobacteria are rapidly becoming an attractive alternative to more traditional sequencing methods. In particular this technology is proving useful for genome-wide identification of mutations in mycobacteria (comparative genomics) as well as for de novo assembly of whole genomes. Next-generation sequencing however generates a vast quantity of data that can only be transformed into a usable and comprehensible form using bioinformatics. Here we describe the methodology one would use to prepare libraries for whole-genome sequencing, and the basic bioinformatics to identify mutations in a genome following Illumina HiSeq or MiSeq sequencing, as well as de novo genome assembly following sequencing using Pacific Biosciences (PacBio).

  5. An Analysis of Adenovirus Genomes Using Whole Genome Software Tools

    PubMed Central

    Mahadevan, Padmanabhan

    2016-01-01

    The evolution of sequencing technology has lead to an enormous increase in the number of genomes that have been sequenced. This is especially true in the field of virus genomics. In order to extract meaningful biological information from these genomes, whole genome data mining software tools must be utilized. Hundreds of tools have been developed to analyze biological sequence data. However, only some of these tools are user-friendly to biologists. Several of these tools that have been successfully used to analyze adenovirus genomes are described here. These include Artemis, EMBOSS, pDRAW, zPicture, CoreGenes, GeneOrder, and PipMaker. These tools provide functionalities such as visualization, restriction enzyme analysis, alignment, and proteome comparisons that are extremely useful in the bioinformatics analysis of adenovirus genomes. PMID:28293072

  6. The UCSC Ebola Genome Portal

    PubMed Central

    Haeussler, Maximilian; Karolchik, Donna; Clawson, Hiram; Raney, Brian J; Rosenbloom, Kate R.; Fujita, Pauline A.; Hinrichs, Angie S.; Speir, Matthew L; Eisenhart, Chris; Zweig, Ann S.; Haussler, David; Kent, W. James

    2014-01-01

    Background: With the Ebola epidemic raging out of control in West Africa, there has been a flurry of research into the Ebola virus, resulting in the generation of much genomic data. Methods: In response to the clear need for tools that integrate multiple strands of research around molecular sequences, we have created the University of California Santa Cruz (UCSC) Ebola Genome Browser, an adaptation of our popular UCSC Genome Browser web tool, which can be used to view the Ebola virus genome sequence from GenBank and nearly 30 annotation tracks generated by mapping external data to the reference sequence. Significant annotations include a multiple alignment comprising 102 Ebola genomes from the current outbreak, 56 from previous outbreaks, and 2 Marburg genomes as an outgroup; a gene track curated by NCBI; protein annotations curated by UniProt and antibody-binding epitopes curated by IEDB. We have extended the Genome Browser’s multiple alignment color-coding scheme to distinguish mutations resulting from non-synonymous coding changes, synonymous changes, or changes in untranslated regions. Discussion: Our Ebola Genome portal at http://genome.ucsc.edu/ebolaPortal/ links to the Ebola virus Genome Browser and an aggregate of useful information, including a collection of Ebola antibodies we are curating. PMID:25685613

  7. Connecting Genomic Alterations to Cancer Biology with Proteomics: The NCI Clinical Proteomic Tumor Analysis Consortium

    SciTech Connect

    Ellis, Matthew; Gillette, Michael; Carr, Steven A.; Paulovich, Amanda G.; Smith, Richard D.; Rodland, Karin D.; Townsend, Reid; Kinsinger, Christopher; Mesri, Mehdi; Rodriguez, Henry; Liebler, Daniel

    2013-10-03

    The National Cancer Institute (NCI) Clinical Proteomic Tumor Analysis Consortium is applying the latest generation of proteomic technologies to genomically annotated tumors from The Cancer Genome Atlas (TCGA) program, a joint initiative of the NCI and the National Human Genome Research Institute. By providing a fully integrated accounting of DNA, RNA, and protein abnormalities in individual tumors, these datasets will illuminate the complex relationship between genomic abnormalities and cancer phenotypes, thus producing biologic insights as well as a wave of novel candidate biomarkers and therapeutic targets amenable to verifi cation using targeted mass spectrometry methods.

  8. Complete genome sequence of Halopiger xanaduensis type strain (SH6T)

    SciTech Connect

    Anderson, Iain; Tindall, Brian; Rohde, Manfred; Lucas, Susan; Han, James; Lapidus, Alla L.; Cheng, Jan-Fang; Goodwin, Lynne A.; Pitluck, Sam; Peters, Lin; Pati, Amrita; Mikhailova, Natalia; Pagani, Ioanna; Teshima, Hazuki; Han, Cliff; Tapia, Roxanne; Land, Miriam L; Woyke, Tanja; Klenk, Hans-Peter; Kyrpides, Nikos C; Ivanova, N

    2012-01-01

    Halopiger xanaduensis is the type species of the genus Halopiger and belongs to the euryarchaeal family Halobacteriaceae. H. xanaduensis strain SH-6, which is designated as the type strain, was isolated from the sediment of a salt lake in Inner Mongolia, Lake Shangmatala. Like other members of the family Halobacteriaceae, it is an extreme halophile requiring at least 2.5 M salt for growth. We report here the sequencing and annotation of the 4,355,268 bp genome, which includes one chromosome and three plasmids. This genome is part of a Joint Genome Institute (JGI) Community Sequencing Program (CSP) project to sequence diverse haloarchaeal genomes.

  9. Complete genome sequence of the Antarctic Halorubrum lacusprofundi type strain ACAM 34

    DOE PAGES

    Anderson, Iain J.; DasSarma, Priya; Lucas, Susan; ...

    2016-09-10

    Halorubrum lacusprofundi is an extreme halophile within the archaeal phylum Euryarchaeota. The type strain ACAM 34 was isolated from Deep Lake, Antarctica. H. lacusprofundi is of phylogenetic interest because it is distantly related to the haloarchaea that have previously been sequenced. It is also of interest because of its psychrotolerance. We report here the complete genome sequence of H. lacusprofundi type strain ACAM 34 and its annotation. In conclusion, this genome is part of a 2006 Joint Genome Institute Community Sequencing Program project to sequence genomes of diverse Archaea.

  10. Complete genome sequence of the Antarctic Halorubrum lacusprofundi type strain ACAM 34

    SciTech Connect

    Anderson, Iain J.; DasSarma, Priya; Lucas, Susan; Copeland, Alex; Lapidus, Alla; Del Rio, Tijana Glavina; Tice, Hope; Dalin, Eileen; Bruce, David C.; Goodwin, Lynne; Pitluck, Sam; Sims, David; Brettin, Thomas S.; Detter, John C.; Han, Cliff S.; Larimer, Frank; Hauser, Loren; Land, Miriam; Ivanova, Natalia; Richardson, Paul; Cavicchioli, Ricardo; DasSarma, Shiladitya; Woese, Carl R.; Kyrpides, Nikos C.

    2016-09-10

    Halorubrum lacusprofundi is an extreme halophile within the archaeal phylum Euryarchaeota. The type strain ACAM 34 was isolated from Deep Lake, Antarctica. H. lacusprofundi is of phylogenetic interest because it is distantly related to the haloarchaea that have previously been sequenced. It is also of interest because of its psychrotolerance. We report here the complete genome sequence of H. lacusprofundi type strain ACAM 34 and its annotation. In conclusion, this genome is part of a 2006 Joint Genome Institute Community Sequencing Program project to sequence genomes of diverse Archaea.

  11. Human Genome Diversity workshop 1

    SciTech Connect

    1992-12-31

    The Human Genome Diversity Project (HGD) is an international interdisciplinary program whose goal is to reveal as much as possible about the current state of genetic diversity among humans and the processes that were responsible for that diversity. Classical premolecular techniques have already proved that a significant component of human genetic variability lies within populations rather than among them. New molecular techniques will permit a dramatic increase in the resolving power of genetic analysis at the population level. Recent social changes in many parts of the world threaten the identity of a number of populations that may be extremely important for understanding human evolutionary history. It is therefore urgent to conduct research on human variation in these areas, while there is still time. The plan is to identify the most representative descendants of ancestral human populations worldwide and then to preserve genetic records of these populations. This is a report of the Population Genetics Workshop (Workshop 1), the first of three to be held to plan HGD, which was focused on sampling strategies and analytic methods from population genetics. The topics discussed were sampling and population structure; analysis of populations; drift versus natural selection; modeling migration and population subdivision; and population structure and subdivision.

  12. Genomic Data Commons and Genomic Cloud Pilots - Google Hangout

    Cancer.gov

    Join us for a live, moderated discussion about two NCI efforts to expand access to cancer genomics data: the Genomic Data Commons and Genomic Cloud Pilots. NCI subject matters experts will include Louis M. Staudt, M.D., Ph.D., Director Center for Cancer Genomics, Warren Kibbe, Ph.D., Director, NCI Center for Biomedical Informatics and Information Technology, and moderated by Anthony Kerlavage, Ph.D., Chief, Cancer Informatics Branch, Center for Biomedical Informatics and Information Technology. We welcome your questions before and during the Hangout on Twitter using the hashtag #AskNCI.

  13. Shrinking genomes? Evidence from genome size variation in Crepis (Compositae).

    PubMed

    Enke, N; Fuchs, J; Gemeinholzer, B

    2011-01-01

    Large-scale surveys of genome size evolution in angiosperms show that the ancestral genome was most likely small, with a tendency towards an increase in DNA content during evolution. Due to polyploidisation and self-replicating DNA elements, angiosperm genomes were considered to have a 'one-way ticket to obesity' (Bennetzen & Kellogg 1997). New findings on how organisms can lose DNA challenged the hypotheses of unidirectional evolution of genome size. The present study is based on the classical work of Babcock (1947a) on karyotype evolution within Crepis and analyses karyotypic diversification within the genus in a phylogenetic context. Genome size of 21 Crepis species was estimated using flow cytometry. Additional data of 17 further species were taken from the literature. Within 30 diploid Crepis species there is a striking trend towards genome contraction. The direction of genome size evolution was analysed by reconstructing ancestral character states on a molecular phylogeny based on ITS sequence data. DNA content is correlated to distributional aspects as well as life form. Genome size is significantly higher in perennials than in annuals. Within sampled species, very small genomes are only present in Mediterranean or European species, whereas their Central and East Asian relatives have larger 1C values.

  14. Genome instability mechanisms and the structure of cancer genomes.

    PubMed

    Cassidy, Liam D; Venkitaraman, Ashok R

    2012-02-01

    Genomic instability is a hallmark of cancer cells, and arises from the aberrations that these cells exhibit in the normal biological mechanisms that repair and replicate the genome, or ensure its accurate segregation during cell division. Increasingly detailed descriptions of cancer genomes have begun to emerge from next-generation sequencing (NGS), providing snapshots of their nature and heterogeneity in different cancers at different stages in their evolution. Here, we attempt to extract from these sequencing studies insights into the role of genome instability mechanisms in carcinogenesis, and to identify challenges impeding further progress.

  15. The coffee genome hub: a resource for coffee genomes

    PubMed Central

    Dereeper, Alexis; Bocs, Stéphanie; Rouard, Mathieu; Guignon, Valentin; Ravel, Sébastien; Tranchant-Dubreuil, Christine; Poncet, Valérie; Garsmeur, Olivier; Lashermes, Philippe; Droc, Gaëtan

    2015-01-01

    The whole genome sequence of Coffea canephora, the perennial diploid species known as Robusta, has been recently released. In the context of the C. canephora genome sequencing project and to support post-genomics efforts, we developed the Coffee Genome Hub (http://coffee-genome.org/), an integrative genome information system that allows centralized access to genomics and genetics data and analysis tools to facilitate translational and applied research in coffee. We provide the complete genome sequence of C. canephora along with gene structure, gene product information, metabolism, gene families, transcriptomics, syntenic blocks, genetic markers and genetic maps. The hub relies on generic software (e.g. GMOD tools) for easy querying, visualizing and downloading research data. It includes a Genome Browser enhanced by a Community Annotation System, enabling the improvement of automatic gene annotation through an annotation editor. In addition, the hub aims at developing interoperability among other existing South Green tools managing coffee data (phylogenomics resources, SNPs) and/or supporting data analyses with the Galaxy workflow manager. PMID:25392413

  16. The Anolis Lizard Genome: An Amniote Genome without Isochores?

    PubMed Central

    Costantini, Maria; Greif, Gonzalo; Alvarez-Valin, Fernando; Bernardi, Giorgio

    2016-01-01

    Two articles published 5 years ago concluded that the genome of the lizard Anolis carolinensis is an amniote genome without isochores. This claim was apparently contradicting previous results on the general presence of an isochore organization in all vertebrate genomes tested (including Anolis). In this investigation, we demonstrate that the Anolis genome is indeed heterogeneous in base composition, since its macrochromosomes comprise isochores mainly from the L2 and H1 families (a moderately GC-poor and a moderately GC-rich family, respectively), and since the majority of the sequenced microchromosomes consists of H1 isochores. These families are associated with different features of genome structure, including gene density and compositional correlations (e.g., GC3 vs flanking sequence GC and intron GC), as in the case of mammalian and avian genomes. Moreover, the assembled Anolis chromosomes have an enormous number of gaps, which could be due to sequencing problems in GC-rich regions of the genome. In conclusion, the Anolis genome is no exception to the general rule of an isochore organization in the genomes of vertebrates (and other eukaryotes). PMID:26992416

  17. EMERGE: a flexible modelling framework to predict genomic regulatory elements from genomic signatures

    PubMed Central

    van Duijvenboden, Karel; de Boer, Bouke A.; Capon, Nicolas; Ruijter, Jan M.; Christoffels, Vincent M.

    2016-01-01

    Regulatory DNA elements, short genomic segments that regulate gene expression, have been implicated in developmental disorders and human disease. Despite this clinical urgency, only a small fraction of the regulatory DNA repertoire has been confirmed through reporter gene assays. The overall success rate of functional validation of candidate regulatory elements is low. Moreover, the number and diversity of datasets from which putative regulatory elements can be identified is large and rapidly increasing. We generated a flexible and user-friendly tool to integrate the information from different types of genomic datasets, e.g. ATAC-seq, ChIP-seq, conservation, aiming to increase the ease and success rate of functional prediction. To this end, we developed the EMERGE program that merges all datasets that the user considers informative and uses a logistic regression framework, based on validated functional elements, to set optimal weights to these datasets. ROC curve analysis shows that a combination of datasets leads to improved prediction of tissue-specific enhancers in human, mouse and Drosophila genomes. Functional assays based on this prediction can be expected to have substantially higher success rates. The resulting integrated signal for prediction of functional elements can be plotted in a build-in genome browser or exported for further analysis. PMID:26531828

  18. Genomic MRI - a Public Resource for Studying Sequence Patterns within Genomic DNA

    PubMed Central

    Prakash, Ashwin; Bechtel, Jason; Fedorov, Alexei

    2011-01-01

    Non-coding genomic regions in complex eukaryotes, including intergenic areas, introns, and untranslated segments of exons, are profoundly non-random in their nucleotide composition and consist of a complex mosaic of sequence patterns. These patterns include so-called Mid-Range Inhomogeneity (MRI) regions -- sequences 30-10000 nucleotides in length that are enriched by a particular base or combination of bases (e.g. (G+T)-rich, purine-rich, etc.). MRI regions are associated with unusual (non-B-form) DNA structures that are often involved in regulation of gene expression, recombination, and other genetic processes (Fedorova & Fedorov 2010). The existence of a strong fixation bias within MRI regions against mutations that tend to reduce their sequence inhomogeneity additionally supports the functionality and importance of these genomic sequences (Prakash et al. 2009). Here we demonstrate a freely available Internet resource -- the Genomic MRI program package -- designed for computational analysis of genomic sequences in order to find and characterize various MRI patterns within them (Bechtel et al. 2008). This package also allows generation of randomized sequences with various properties and level of correspondence to the natural input DNA sequences. The main goal of this resource is to facilitate examination of vast regions of non-coding DNA that are still scarcely investigated and await thorough exploration and recognition. PMID:21610667

  19. Antiviral Defenses in Plants through Genome Editing

    PubMed Central

    Romay, Gustavo; Bragard, Claude

    2017-01-01

    Plant–virus interactions based-studies have contributed to increase our understanding on plant resistance mechanisms, providing new tools for crop improvement. In the last two decades, RNA interference, a post-transcriptional gene silencing approach, has been used to induce antiviral defenses in plants with the help of genetic engineering technologies. More recently, the new genome editing systems (GES) are revolutionizing the scope of tools available to confer virus resistance in plants. The most explored GES are zinc finger nucleases, transcription activator-like effector nucleases, and clustered regularly interspaced short palindromic repeats/Cas9 endonuclease. GES are engineered to target and introduce mutations, which can be deleterious, via double-strand breaks at specific DNA sequences by the error-prone non-homologous recombination end-joining pathway. Although GES have been engineered to target DNA, recent discoveries of GES targeting ssRNA molecules, including virus genomes, pave the way for further studies programming plant defense against RNA viruses. Most of plant virus species have an RNA genome and at least 784 species have positive ssRNA. Here, we provide a summary of the latest progress in plant antiviral defenses mediated by GES. In addition, we also discuss briefly the GES perspectives in light of the rebooted debate on genetic modified organisms (GMOs) and the current regulatory frame for agricultural products involving the use of such engineering technologies. PMID:28167937

  20. Regulatory genes in the ancestral chordate genomes.

    PubMed

    Satou, Yutaka; Wada, Shuichi; Sasakura, Yasunori; Satoh, Nori

    2008-12-01

    Changes or innovations in gene regulatory networks for the developmental program in the ancestral chordate genome appear to be a major component in the evolutionary process in which tadpole-type larvae, a unique characteristic of chordates, arose. These alterations may include new genetic interactions as well as the acquisition of new regulatory genes. Previous analyses of the Ciona genome revealed that many genes may have emerged after the divergence of the tunicate and vertebrate lineages. In this paper, we examined this possibility by examining a second non-vertebrate chordate genome. We conclude from this analysis that the ancient chordate included almost the same repertory of regulatory genes, but less redundancy than extant vertebrates, and that approximately 10% of vertebrate regulatory genes were innovated after the emergence of vertebrates. Thus, refined regulatory networks arose during vertebrate evolution mainly as preexisting regulatory genes multiplied rather than by generating new regulatory genes. The inferred regulatory gene sets of the ancestral chordate would be an important foundation for understanding how tadpole-type larvae, a unique characteristic of chordates, evolved.

  1. The Giardia genome project database.

    PubMed

    McArthur, A G; Morrison, H G; Nixon, J E; Passamaneck, N Q; Kim, U; Hinkle, G; Crocker, M K; Holder, M E; Farr, R; Reich, C I; Olsen, G E; Aley, S B; Adam, R D; Gillin, F D; Sogin, M L

    2000-08-15

    The Giardia genome project database provides an online resource for Giardia lamblia (WB strain, clone C6) genome sequence information. The database includes edited single-pass reads, the results of BLASTX searches, and details of progress towards sequencing the entire 12 million-bp Giardia genome. Pre-sorted BLASTX results can be retrieved based on keyword searches and BLAST searches of the high throughput Giardia data can be initiated from the web site or through NCBI. Descriptions of the genomic DNA libraries, project protocols and summary statistics are also available. Although the Giardia genome project is ongoing, new sequences are made available on a bi-monthly basis to ensure that researchers have access to information that may assist them in the search for genes and their biological function. The current URL of the Giardia genome project database is www.mbl.edu/Giardia.

  2. The genome of Eucalyptus grandis.

    PubMed

    Myburg, Alexander A; Grattapaglia, Dario; Tuskan, Gerald A; Hellsten, Uffe; Hayes, Richard D; Grimwood, Jane; Jenkins, Jerry; Lindquist, Erika; Tice, Hope; Bauer, Diane; Goodstein, David M; Dubchak, Inna; Poliakov, Alexandre; Mizrachi, Eshchar; Kullan, Anand R K; Hussey, Steven G; Pinard, Desre; van der Merwe, Karen; Singh, Pooja; van Jaarsveld, Ida; Silva-Junior, Orzenil B; Togawa, Roberto C; Pappas, Marilia R; Faria, Danielle A; Sansaloni, Carolina P; Petroli, Cesar D; Yang, Xiaohan; Ranjan, Priya; Tschaplinski, Timothy J; Ye, Chu-Yu; Li, Ting; Sterck, Lieven; Vanneste, Kevin; Murat, Florent; Soler, Marçal; Clemente, Hélène San; Saidi, Naijib; Cassan-Wang, Hua; Dunand, Christophe; Hefer, Charles A; Bornberg-Bauer, Erich; Kersting, Anna R; Vining, Kelly; Amarasinghe, Vindhya; Ranik, Martin; Naithani, Sushma; Elser, Justin; Boyd, Alexander E; Liston, Aaron; Spatafora, Joseph W; Dharmwardhana, Palitha; Raja, Rajani; Sullivan, Christopher; Romanel, Elisson; Alves-Ferreira, Marcio; Külheim, Carsten; Foley, William; Carocha, Victor; Paiva, Jorge; Kudrna, David; Brommonschenkel, Sergio H; Pasquali, Giancarlo; Byrne, Margaret; Rigault, Philippe; Tibbits, Josquin; Spokevicius, Antanas; Jones, Rebecca C; Steane, Dorothy A; Vaillancourt, René E; Potts, Brad M; Joubert, Fourie; Barry, Kerrie; Pappas, Georgios J; Strauss, Steven H; Jaiswal, Pankaj; Grima-Pettenati, Jacqueline; Salse, Jérôme; Van de Peer, Yves; Rokhsar, Daniel S; Schmutz, Jeremy

    2014-06-19

    Eucalypts are the world's most widely planted hardwood trees. Their outstanding diversity, adaptability and growth have made them a global renewable resource of fibre and energy. We sequenced and assembled >94% of the 640-megabase genome of Eucalyptus grandis. Of 36,376 predicted protein-coding genes, 34% occur in tandem duplications, the largest proportion thus far in plant genomes. Eucalyptus also shows the highest diversity of genes for specialized metabolites such as terpenes that act as chemical defence and provide unique pharmaceutical oils. Genome sequencing of the E. grandis sister species E. globulus and a set of inbred E. grandis tree genomes reveals dynamic genome evolution and hotspots of inbreeding depression. The E. grandis genome is the first reference for the eudicot order Myrtales and is placed here sister to the eurosids. This resource expands our understanding of the unique biology of large woody perennials and provides a powerful tool to accelerate comparative biology, breeding and biotechnology.

  3. Genomic Rearrangements in Prostate Cancer

    PubMed Central

    Barbieri, Christopher E.; Rubin, Mark A.

    2014-01-01

    Purpose of review Genomic instability is a fundamental feature of human cancer, leading to the activation of oncogenes and inactivation of tumor suppressors. In prostate cancer, structural genomic rearrangements, resulting in gene fusions, amplifications and deletions, are a critical mechanism effecting these alterations. Here we review recent literature regarding the importance of genomic rearrangements in the pathogenesis of prostate cancer and the potential impact on patient care. Recent findings Next generation sequencing has revealed a striking abundance, complexity, and heterogeneity of genomic rearrangements in prostate cancer. These recent studies have nominated a number of processes in predisposing prostate cancer to genomic rearrangements, including androgen-induced transcription. Summary Structural rearrangements are the critical mechanism resulting in the characteristic genomic changes associated with prostate cancer pathogenesis and progression. Future studies will determine if the impact of these events on tumor phenotypes can be translated to clinical utility for patient prognosis and choices of management strategies. PMID:25393273

  4. Phage genomics: small is beautiful.

    PubMed

    Brüssow, Harald; Hendrix, Roger W

    2002-01-11

    The Age of Genomics dawned only gradually for bacteriophages. It was 1977 when the genome of phage phi X174 was published and 1983 when the "large" genome of phage lambda hit the streets. More recently, the pace has quickened, so that we now have over 100 complete phage genomes and can expect thousands in a very few years. These sequences have been marvelously informative for the biology of the individual phages, but with the advent of high volume sequencing technology, the real excitement for phage biology is that it is now possible to analyze the sequences together and thereby address--for the first time at whole genome resolution--a set of fundamental biological questions related to populations: What is the structure of the global phage population? What are its dynamics? How do phages evolve? This is Comparative Genomics with a capital "C".

  5. Big Data: Astronomical or Genomical?

    PubMed

    Stephens, Zachary D; Lee, Skylar Y; Faghri, Faraz; Campbell, Roy H; Zhai, Chengxiang; Efron, Miles J; Iyer, Ravishankar; Schatz, Michael C; Sinha, Saurabh; Robinson, Gene E

    2015-07-01

    Genomics is a Big Data science and is going to get much bigger, very soon, but it is not known whether the needs of genomics will exceed other Big Data domains. Projecting to the year 2025, we compared genomics with three other major generators of Big Data: astronomy, YouTube, and Twitter. Our estimates show that genomics is a "four-headed beast"--it is either on par with or the most demanding of the domains analyzed here in terms of data acquisition, storage, distribution, and analysis. We discuss aspects of new technologies that will need to be developed to rise up and meet the computational challenges that genomics poses for the near future. Now is the time for concerted, community-wide planning for the "genomical" challenges of the next decade.

  6. Complete genome sequence of Syntrophobacter fumaroxidans strain (MPOB(T)).

    PubMed

    Plugge, Caroline M; Henstra, Anne M; Worm, Petra; Swarts, Daan C; Paulitsch-Fuchs, Astrid H; Scholten, Johannes C M; Lykidis, Athanasios; Lapidus, Alla L; Goltsman, Eugene; Kim, Edwin; McDonald, Erin; Rohlin, Lars; Crable, Bryan R; Gunsalus, Robert P; Stams, Alfons J M; McInerney, Michael J

    2012-10-10

    Syntrophobacter fumaroxidans strain MPOB(T) is the best-studied species of the genus Syntrophobacter. The species is of interest because of its anaerobic syntrophic lifestyle, its involvement in the conversion of propionate to acetate, H2 and CO2 during the overall degradation of organic matter, and its release of products that serve as substrates for other microorganisms. The strain is able to ferment fumarate in pure culture to CO2 and succinate, and is also able to grow as a sulfate reducer with propionate as an electron donor. This is the first complete genome sequence of a member of the genus Syntrophobacter and a member genus in the family Syntrophobacteraceae. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 4,990,251 bp long genome with its 4,098 protein-coding and 81 RNA genes is a part of the Microbial Genome Program (MGP) and the Genomes to Life (GTL) Program project.

  7. Complete genome sequence of Syntrophobacter fumaroxidans strain (MPOBT)

    PubMed Central

    Plugge, Caroline M.; Henstra, Anne M.; Worm, Petra; Swarts, Daan C.; Paulitsch-Fuchs, Astrid H.; Scholten, Johannes C.M.; Lykidis, Athanasios; Lapidus, Alla L.; Goltsman, Eugene; Kim, Edwin; McDonald, Erin; Rohlin, Lars; Crable, Bryan R.; Gunsalus, Robert P.; Stams, Alfons J.M.; McInerney, Michael J.

    2012-01-01

    Syntrophobacter fumaroxidans strain MPOBT is the best-studied species of the genus Syntrophobacter. The species is of interest because of its anaerobic syntrophic lifestyle, its involvement in the conversion of propionate to acetate, H2 and CO2 during the overall degradation of organic matter, and its release of products that serve as substrates for other microorganisms. The strain is able to ferment fumarate in pure culture to CO2 and succinate, and is also able to grow as a sulfate reducer with propionate as an electron donor. This is the first complete genome sequence of a member of the genus Syntrophobacter and a member genus in the family Syntrophobacteraceae. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 4,990,251 bp long genome with its 4,098 protein-coding and 81 RNA genes is a part of the Microbial Genome Program (MGP) and the Genomes to Life (GTL) Program project. PMID:23450070

  8. Datasets for evolutionary comparative genomics

    PubMed Central

    Liberles, David A

    2005-01-01

    Many decisions about genome sequencing projects are directed by perceived gaps in the tree of life, or towards model organisms. With the goal of a better understanding of biology through the lens of evolution, however, there are additional genomes that are worth sequencing. One such rationale for whole-genome sequencing is discussed here, along with other important strategies for understanding the phenotypic divergence of species. PMID:16086856

  9. Genomics Nursing Faculty Champion Initiative

    PubMed Central

    Jenkins, Jean; Calzone, Kathleen A.

    2016-01-01

    Nurse faculty are challenged to keep up with the emerging and fast-paced field of genomics and the mandate to prepare the nursing workforce to be able to translate genomic research advances into routine clinical care. Using Faculty Champions and other options, the initiative stimulated curriculum development and promoted genomics curriculum integration. The authors summarize this yearlong initiative for undergraduate and graduate nursing faculty. PMID:24300251

  10. Toward nanoscale genome sequencing.

    PubMed

    Ryan, Declan; Rahimi, Maryam; Lund, John; Mehta, Ranjana; Parviz, Babak A

    2007-09-01

    This article reports on the state-of-the-art technologies that sequence DNA using miniaturized devices. The article considers the miniaturization of existing technologies for sequencing DNA and the opportunities for cost reduction that 'on-chip' devices can deliver. The ability to construct nano-scale structures and perform measurements using novel nano-scale effects has provided new opportunities to identify nucleotides directly using physical, and not chemical, methods. The challenges that these technologies need to overcome to provide a US$1000-genome sequencing technology are also presented.

  11. Genomics of Bacillus Species

    NASA Astrophysics Data System (ADS)

    Økstad, Ole Andreas; Kolstø, Anne-Brit

    Members of the genus Bacillus are rod-shaped spore-forming bacteria belonging to the Firmicutes, the low G+C gram-positive bacteria. The Bacillus genus was first described and classified by Ferdinand Cohn in Cohn (1872), and Bacillus subtilis was defined as the type species (Soule, 1932). Several Bacilli may be linked to opportunistic infections. However, pathogenicity among Bacillus spp. is mainly a feature of bacteria belonging to the Bacillus cereus group, including B. cereus, Bacillus anthracis, and Bacillus thuringiensis. Here we review the genomics of B. cereus group bacteria in relation to their roles as etiological agents of two food poisoning syndromes (emetic and diarrhoeal).

  12. GEMINI: integrative exploration of genetic variation and genome annotations.

    PubMed

    Paila, Umadevi; Chapman, Brad A; Kirchner, Rory; Quinlan, Aaron R

    2013-01-01

    Modern DNA sequencing technologies enable geneticists to rapidly identify genetic variation among many human genomes. However, isolating the minority of variants underlying disease remains an important, yet formidable challenge for medical genetics. We have developed GEMINI (GEnome MINIng), a flexible software package for exploring all forms of human genetic variation. Unlike existing tools, GEMINI integrates genetic variation with a diverse and adaptable set of genome annotations (e.g., dbSNP, ENCODE, UCSC, ClinVar, KEGG) into a unified database to facilitate interpretation and data exploration. Whereas other methods provide an inflexible set of variant filters or prioritization methods, GEMINI allows researchers to compose complex queries based on sample genotypes, inheritance patterns, and both pre-installed and custom genome annotations. GEMINI also provides methods for ad hoc queries and data exploration, a simple programming interface for custom analyses that leverage the underlying database, and both command line and graphical tools for common analyses. We demonstrate GEMINI's utility for exploring variation in personal genomes and family based genetic studies, and illustrate its ability to scale to studies involving thousands of human samples. GEMINI is designed for reproducibility and flexibility and our goal is to provide researchers with a standard framework for medical genomics.

  13. The genome sequence of the colonial chordate, Botryllus schlosseri

    PubMed Central

    Voskoboynik, Ayelet; Neff, Norma F; Sahoo, Debashis; Newman, Aaron M; Pushkarev, Dmitry; Koh, Winston; Passarelli, Benedetto; Fan, H Christina; Mantalas, Gary L; Palmeri, Karla J; Ishizuka, Katherine J; Gissi, Carmela; Griggio, Francesca; Ben-Shlomo, Rachel; Corey, Daniel M; Penland, Lolita; White, Richard A; Weissman, Irving L; Quake, Stephen R

    2013-01-01

    Botryllus schlosseri is a colonial urochordate that follows the chordate plan of development following sexual reproduction, but invokes a stem cell-mediated budding program during subsequent rounds of asexual reproduction. As urochordates are considered to be the closest living invertebrate relatives of vertebrates, they are ideal subjects for whole genome sequence analyses. Using a novel method for high-throughput sequencing of eukaryotic genomes, we sequenced and assembled 580 Mbp of the B. schlosseri genome. The genome assembly is comprised of nearly 14,000 intron-containing predicted genes, and 13,500 intron-less predicted genes, 40% of which could be confidently parceled into 13 (of 16 haploid) chromosomes. A comparison of homologous genes between B. schlosseri and other diverse taxonomic groups revealed genomic events underlying the evolution of vertebrates and lymphoid-mediated immunity. The B. schlosseri genome is a community resource for studying alternative modes of reproduction, natural transplantation reactions, and stem cell-mediated regeneration. DOI: http://dx.doi.org/10.7554/eLife.00569.001 PMID:23840927

  14. Low genome content diversity of marine planktonic Thaumarchaeota.

    PubMed

    Luo, Haiwei; Sun, Ying; Hollibaugh, James T; Moran, Mary Ann

    2016-08-01

    Members of Thaumarchaeota are responsible for much of the ammonia oxidation occurring in the ocean. Recent studies showed that marine Thaumarchaeota have versatile metabolic capabilities, but sequencing additional genomes has not significantly increased the gene content ascribed to this group. We used the assembly-free dN pipeline software in combination with phylogenetic analyses to interrogate shotgun metagenomic data sets to gain a better understanding of the genomic diversity of Thaumarchaeota populations. The program confidently assigned ∼3,000 paired-end reads to Thaumarchaeota, independent of homologies to any known Thaumarchaeota genome sequence. Only 2% of these reads potentially harbor new genes that were absent from the genome of 'Candidatus Nitrosopumilus maritimus' str. SCM1, even though this strain was isolated from a marine aquarium rather than directly from the ocean. One of these novel genes encode proteins associated with the CRISPR/Cas system, Cas1, suggesting that phage defense through CRISPR may be also present in planktonic Thaumarchaeota lineages. Our results suggest that marine Thaumarchaeota populations have very low diversity in genome content, which is corroborated using computer simulation analyses of two bacterial lineages with known genome content diversity.

  15. The genome sequence of the colonial chordate, Botryllus schlosseri.

    PubMed

    Voskoboynik, Ayelet; Neff, Norma F; Sahoo, Debashis; Newman, Aaron M; Pushkarev, Dmitry; Koh, Winston; Passarelli, Benedetto; Fan, H Christina; Mantalas, Gary L; Palmeri, Karla J; Ishizuka, Katherine J; Gissi, Carmela; Griggio, Francesca; Ben-Shlomo, Rachel; Corey, Daniel M; Penland, Lolita; White, Richard A; Weissman, Irving L; Quake, Stephen R

    2013-07-02

    Botryllus schlosseri is a colonial urochordate that follows the chordate plan of development following sexual reproduction, but invokes a stem cell-mediated budding program during subsequent rounds of asexual reproduction. As urochordates are considered to be the closest living invertebrate relatives of vertebrates, they are ideal subjects for whole genome sequence analyses. Using a novel method for high-throughput sequencing of eukaryotic genomes, we sequenced and assembled 580 Mbp of the B. schlosseri genome. The genome assembly is comprised of nearly 14,000 intron-containing predicted genes, and 13,500 intron-less predicted genes, 40% of which could be confidently parceled into 13 (of 16 haploid) chromosomes. A comparison of homologous genes between B. schlosseri and other diverse taxonomic groups revealed genomic events underlying the evolution of vertebrates and lymphoid-mediated immunity. The B. schlosseri genome is a community resource for studying alternative modes of reproduction, natural transplantation reactions, and stem cell-mediated regeneration. DOI:http://dx.doi.org/10.7554/eLife.00569.001.

  16. GEMBASSY: an EMBOSS associated software package for comprehensive genome analyses.

    PubMed

    Itaya, Hidetoshi; Oshita, Kazuki; Arakawa, Kazuharu; Tomita, Masaru

    2013-08-29

    The popular European Molecular Biology Open Software Suite (EMBOSS) currently contains over 400 tools used in various bioinformatics researches, equipped with sophisticated development frameworks for interoperability and tool discoverability as well as rich documentations and various user interfaces. In order to further strengthen EMBOSS in the fields of genomics, we here present a novel EMBOSS associated software (EMBASSY) package named GEMBASSY, which adds more than 50 analysis tools from the G-language Genome Analysis Environment and its Representational State Transfer (REST) and SOAP web services. GEMBASSY basically contains wrapper programs of G-language REST/SOAP web services to provide intuitive and easy access to various annotations within complete genome flatfiles, as well as tools for analyzing nucleic composition, calculating codon usage, and visualizing genomic information. For example, analysis methods such as for calculating distance between sequences by genomic signatures and for predicting gene expression levels from codon usage bias are effective in the interpretation of meta-genomic and meta-transcriptomic data. GEMBASSY tools can be used seamlessly with other EMBOSS tools and UNIX command line tools. The source code written in C is available from GitHub (https://github.com/celery-kotone/GEMBASSY/) and the distribution package is freely available from the GEMBASSY web site (http://www.g-language.org/gembassy/).

  17. GEMINI: Integrative Exploration of Genetic Variation and Genome Annotations

    PubMed Central

    Paila, Umadevi; Chapman, Brad A.; Kirchner, Rory; Quinlan, Aaron R.

    2013-01-01

    Modern DNA sequencing technologies enable geneticists to rapidly identify genetic variation among many human genomes. However, isolating the minority of variants underlying disease remains an important, yet formidable challenge for medical genetics. We have developed GEMINI (GEnome MINIng), a flexible software package for exploring all forms of human genetic variation. Unlike existing tools, GEMINI integrates genetic variation with a diverse and adaptable set of genome annotations (e.g., dbSNP, ENCODE, UCSC, ClinVar, KEGG) into a unified database to facilitate interpretation and data exploration. Whereas other methods provide an inflexible set of variant filters or prioritization methods, GEMINI allows researchers to compose complex queries based on sample genotypes, inheritance patterns, and both pre-installed and custom genome annotations. GEMINI also provides methods for ad hoc queries and data exploration, a simple programming interface for custom analyses that leverage the underlying database, and both command line and graphical tools for common analyses. We demonstrate GEMINI's utility for exploring variation in personal genomes and family based genetic studies, and illustrate its ability to scale to studies involving thousands of human samples. GEMINI is designed for reproducibility and flexibility and our goal is to provide researchers with a standard framework for medical genomics. PMID:23874191

  18. Genomic medicine and neurological disease.

    PubMed

    Boone, Philip M; Wiszniewski, Wojciech; Lupski, James R

    2011-07-01

    "Genomic medicine" refers to the diagnosis, optimized management, and treatment of disease--as well as screening, counseling, and disease gene identification--in the context of information provided by an individual patient's personal genome. Genomic medicine, to some extent synonymous with "personalized medicine," has been made possible by recent advances in genome technologies. Genomic medicine represents a new approach to health care and disease management that attempts to optimize the care of a patient based upon information gleaned from his or her personal genome sequence. In this review, we describe recent progress in genomic medicine as it relates to neurological disease. Many neurological disorders either segregate as Mendelian phenotypes or occur sporadically in association with a new mutation in a single gene. Heritability also contributes to other neurological conditions that appear to exhibit more complex genetics. In addition to discussing current knowledge in this field, we offer suggestions for maximizing the utility of genomic information in clinical practice as the field of genomic medicine unfolds.

  19. Advances in yeast genome engineering.

    PubMed

    David, Florian; Siewers, Verena

    2015-02-01

    Genome engineering based on homologous recombination has been applied to yeast for many years. However, the growing importance of yeast as a cell factory in metabolic engineering and chassis in synthetic biology demands methods for fast and efficient introduction of multiple targeted changes such as gene knockouts and introduction of multistep metabolic pathways. In this review, we summarize recent improvements of existing genome engineering methods, the development of novel techniques, for example for advanced genome redesign and evolution, and the importance of endonucleases as genome engineering tools.

  20. A simple and effective method for construction of Escherichia coli strains proficient for genome engineering.

    PubMed

    Ryu, Young Shin; Biswas, Rajesh Kumar; Shin, Kwangsu; Parisutham, Vinuselvi; Kim, Suk Min; Lee, Sung Kuk

    2014-01-01

    Multiplex genome engineering is a standalone recombineering tool for large-scale programming and accelerated evolution of cells. However, this advanced genome engineering technique has been limited to use in selected bacterial strains. We developed a simple and effective strain-independent method for effective genome engineering in Escherichia coli. The method involves introducing a suicide plasmid carrying the λ Red recombination system into the mutS gene. The suicide plasmid can be excised from the chromosome via selection in the absence of antibiotics, thus allowing transient inactivation of the mismatch repair system during genome engineering. In addition, we developed another suicide plasmid that enables integration of large DNA fragments into the lacZ genomic locus. These features enable this system to be applied in the exploitation of the benefits of genome engineering in synthetic biology, as well as the metabolic engineering of different strains of E. coli.