haploid human genome: Topics by Science.gov

Sample records for haploid human genome

Haploid plants produced by centromere-mediated genome elimination.

PubMed

Ravi, Maruthachalam; Chan, Simon W L

2010-03-25

Production of haploid plants that inherit chromosomes from only one parent can greatly accelerate plant breeding. Haploids generated from a heterozygous individual and converted to diploid create instant homozygous lines, bypassing generations of inbreeding. Two methods are generally used to produce haploids. First, cultured gametophyte cells may be regenerated into haploid plants, but many species and genotypes are recalcitrant to this process. Second, haploids can be induced from rare interspecific crosses, in which one parental genome is eliminated after fertilization. The molecular basis for genome elimination is not understood, but one theory posits that centromeres from the two parent species interact unequally with the mitotic spindle, causing selective chromosome loss. Here we show that haploid Arabidopsis thaliana plants can be easily generated through seeds by manipulating a single centromere protein, the centromere-specific histone CENH3 (called CENP-A in human). When cenh3 null mutants expressing altered CENH3 proteins are crossed to wild type, chromosomes from the mutant are eliminated, producing haploid progeny. Haploids are spontaneously converted into fertile diploids through meiotic non-reduction, allowing their genotype to be perpetuated. Maternal and paternal haploids can be generated through reciprocal crosses. We have also exploited centromere-mediated genome elimination to convert a natural tetraploid Arabidopsis into a diploid, reducing its ploidy to simplify breeding. As CENH3 is universal in eukaryotes, our method may be extended to produce haploids in any plant species.
Dramatic improvement in genome assembly achieved using doubled-haploid genomes.

PubMed

Zhang, Hong; Tan, Engkong; Suzuki, Yutaka; Hirose, Yusuke; Kinoshita, Shigeharu; Okano, Hideyuki; Kudoh, Jun; Shimizu, Atsushi; Saito, Kazuyoshi; Watabe, Shugo; Asakawa, Shuichi

2014-10-27

Improvement in de novo assembly of large genomes is still to be desired. Here, we improved draft genome sequence quality by employing doubled-haploid individuals. We sequenced wildtype and doubled-haploid Takifugu rubripes genomes, under the same conditions, using the Illumina platform and assembled contigs with SOAPdenovo2. We observed 5.4-fold and 2.6-fold improvement in the sizes of the N50 contig and scaffold of doubled-haploid individuals, respectively, compared to the wildtype, indicating that the use of a doubled-haploid genome aids in accurate genome analysis.
Effective de novo assembly of fish genome using haploid larvae.

PubMed

Iwasaki, Yuki; Nishiki, Issei; Nakamura, Yoji; Yasuike, Motoshige; Kai, Wataru; Nomura, Kazuharu; Yoshida, Kazunori; Nomura, Yousuke; Fujiwara, Atushi; Kobayashi, Takanori; Ototake, Mitsuru

2016-02-01

Recent improvements in next-generation sequencing technology have made it possible to do whole genome sequencing, on even non-model eukaryote species with no available reference genomes. However, de novo assembly of diploid genomes is still a big challenge because of allelic variation. The aim of this study was to determine the feasibility of utilizing the genome of haploid fish larvae for de novo assembly of whole-genome sequences. We compared the efficiency of assembly using the haploid genome of yellowtail (Seriola quinqueradiata) with that using the diploid genome obtained from the dam. De novo assembly from the haploid and the diploid sequence reads (100 million reads per each datasets) generated by the Ion Proton sequencer (200 bp) was done under two different assembly algorithms, namely overlap-layout-consensus (OLC) and de Bruijn graph (DBG). This revealed that the assembly of the haploid genome significantly reduced (approximately 22% for OLC, 9% for DBG) the total number of contigs (with longer average and N50 contig lengths) when compared to the diploid genome assembly. The haploid assembly also improved the quality of the scaffolds by reducing the number of regions with unassigned nucleotides (Ns) (total length of Ns; 45,331,916 bp for haploids and 67,724,360 bp for diploids) in OLC-based assemblies. It appears clear that the haploid genome assembly is better because the allelic variation in the diploid genome disrupts the extension of contigs during the assembly process. Our results indicate that utilizing the genome of haploid larvae leads to a significant improvement in the de novo assembly process, thus providing a novel strategy for the construction of reference genomes from non-model diploid organisms such as fish. Copyright © 2015 The Authors. Published by Elsevier B.V. All rights reserved.
A stable hybrid containing haploid genomes of two obligate diploid Candida species.

PubMed

Chakraborty, Uttara; Mohamed, Aiyaz; Kakade, Pallavi; Mugasimangalam, Raja C; Sadhale, Parag P; Sanyal, Kaustuv

2013-08-01

Candida albicans and Candida dubliniensis are diploid, predominantly asexual human-pathogenic yeasts. In this study, we constructed tetraploid (4n) strains of C. albicans of the same or different lineages by spheroplast fusion. Induction of chromosome loss in the tetraploid C. albicans generated diploid or near-diploid progeny strains but did not produce any haploid progeny. We also constructed stable heterotetraploid somatic hybrid strains (2n + 2n) of C. albicans and C. dubliniensis by spheroplast fusion. Heterodiploid (n + n) progeny hybrids were obtained after inducing chromosome loss in a stable heterotetraploid hybrid. To identify a subset of hybrid heterodiploid progeny strains carrying at least one copy of all chromosomes of both species, unique centromere sequences of various chromosomes of each species were used as markers in PCR analysis. The reduction of chromosome content was confirmed by a comparative genome hybridization (CGH) assay. The hybrid strains were found to be stably propagated. Chromatin immunoprecipitation (ChIP) assays with antibodies against centromere-specific histones (C. albicans Cse4/C. dubliniensis Cse4) revealed that the centromere identity of chromosomes of each species is maintained in the hybrid genomes of the heterotetraploid and heterodiploid strains. Thus, our results suggest that the diploid genome content is not obligatory for the survival of either C. albicans or C. dubliniensis. In keeping with the recent discovery of the existence of haploid C. albicans strains, the heterodiploid strains of our study can be excellent tools for further species-specific genome elimination, yielding true haploid progeny of C. albicans or C. dubliniensis in future.
Novel technologies in doubled haploid line development.

PubMed

Ren, Jiaojiao; Wu, Penghao; Trampe, Benjamin; Tian, Xiaolong; Lübberstedt, Thomas; Chen, Shaojiang

2017-11-01

haploid inducer line can be transferred (DH) technology can not only shorten the breeding process but also increase genetic gain. Haploid induction and subsequent genome doubling are the two main steps required for DH technology. Haploids have been generated through the culture of immature male and female gametophytes, and through inter- and intraspecific via chromosome elimination. Here, we focus on haploidization via chromosome elimination, especially the recent advances in centromere-mediated haploidization. Once haploids have been induced, genome doubling is needed to produce DH lines. This study has proposed a new strategy to improve haploid genome doubling by combing haploids and minichromosome technology. With the progress in haploid induction and genome doubling methods, DH technology can facilitate reverse breeding, cytoplasmic male sterile (CMS) line production, gene stacking and a variety of other genetic analysis. © 2017 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Recovery and characterization of a Citrus clementina Hort. ex Tan. 'Clemenules' haploid plant selected to establish the reference whole Citrus genome sequence.

PubMed

Aleza, Pablo; Juárez, José; Hernández, María; Pina, José A; Ollitrault, Patrick; Navarro, Luis

2009-08-22

In recent years, the development of structural genomics has generated a growing interest in obtaining haploid plants. The use of homozygous lines presents a significant advantage for the accomplishment of sequencing projects. Commercial citrus species are characterized by high heterozygosity, making it difficult to assemble large genome sequences. Thus, the International Citrus Genomic Consortium (ICGC) decided to establish a reference whole citrus genome sequence from a homozygous plant. Due to the existence of important molecular resources and previous success in obtaining haploid clementine plants, haploid clementine was selected as the target for the implementation of the reference whole genome citrus sequence. To obtain haploid clementine lines we used the technique of in situ gynogenesis induced by irradiated pollen. Flow cytometry, chromosome counts and SSR marker (Simple Sequence Repeats) analysis facilitated the identification of six different haploid lines (2n = x = 9), one aneuploid line (2n = 2x+4 = 22) and one doubled haploid plant (2n = 2x = 18) of 'Clemenules' clementine. One of the haploids, obtained directly from an original haploid embryo, grew vigorously and produced flowers after four years. This is the first haploid plant of clementine that has bloomed and we have, for the first time, characterized the histology of haploid and diploid flowers of clementine. Additionally a double haploid plant was obtained spontaneously from this haploid line. The first haploid plant of 'Clemenules' clementine produced directly by germination of a haploid embryo, which grew vigorously and produced flowers, has been obtained in this work. This haploid line has been selected and it is being used by the ICGC to establish the reference sequence of the nuclear genome of citrus.
A Trichosporonales genome tree based on 27 haploid and three evolutionarily conserved 'natural' hybrid genomes.

PubMed

Takashima, Masako; Sriswasdi, Sira; Manabe, Ri-Ichiroh; Ohkuma, Moriya; Sugita, Takashi; Iwasaki, Wataru

2018-01-01

To construct a backbone tree consisting of basidiomycetous yeasts, draft genome sequences from 25 species of Trichosporonales (Tremellomycetes, Basidiomycota) were generated. In addition to the hybrid genomes of Trichosporon coremiiforme and Trichosporon ovoides that we described previously, we identified an interspecies hybrid genome in Cutaneotrichosporon mucoides (formerly Trichosporon mucoides). This hybrid genome had a gene retention rate of ~55%, and its closest haploid relative was Cutaneotrichosporon dermatis. After constructing the C. mucoides subgenomes, we generated a phylogenetic tree using genome data from the 27 haploid species and the subgenome data from the three hybrid genome species. It was a high-quality tree with 100% bootstrap support for all of the branches. The genome-based tree provided superior resolution compared with previous multi-gene analyses. Although our backbone tree does not include all Trichosporonales genera (e.g. Cryptotrichosporon), it will be valuable for future analyses of genome data. Interest in interspecies hybrid fungal genomes has recently increased because they may provide a basis for new technologies. The three Trichosporonales hybrid genomes described in this study are different from well-characterized hybrid genomes (e.g. those of Saccharomyces pastorianus and Saccharomyces bayanus) because these hybridization events probably occurred in the distant evolutionary past. Hence, they will be useful for studying genome stability following hybridization and speciation events. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.
The arbuscular mycorrhizal fungus Glomus intraradices is haploid and has a small genome size in the lower limit of eukaryotes.

PubMed

Hijri, Mohamed; Sanders, Ian R

2004-02-01

The genome size, complexity, and ploidy of the arbuscular mycorrhizal fungus (AMF) Glomus intraradices was determined using flow cytometry, reassociation kinetics, and genomic reconstruction. Nuclei of G. intraradices from in vitro culture, were analyzed by flow cytometry. The estimated average length of DNA per nucleus was 14.07+/-3.52 Mb. Reassociation kinetics on G. intraradices DNA indicated a haploid genome size of approximately 16.54 Mb, comprising 88.36% single copy DNA, 1.59% repetitive DNA, and 10.05% fold-back DNA. To determine ploidy, the DNA content per nucleus measured by flow cytometry was compared with the genome estimate of reassociation kinetics. G. intraradices was found to have a DNA index (DNA per nucleus per haploid genome size) of approximately 0.9, indicating that it is haploid. Genomic DNA of G. intraradices was also analyzed by genomic reconstruction using four genes (Malate synthase, RecA, Rad32, and Hsp88). Because we used flow cytometry and reassociation kinetics to reveal the genome size of G. intraradices and show that it is haploid, then a similar value for genome size should be found when using genomic reconstruction as long as the genes studied are single copy. The average genome size estimate was 15.74+/-1.69 Mb indicating that these four genes are single copy per haploid genome and per nucleus of G. intraradices. Our results show that the genome size of G. intraradices is much smaller than estimates of other AMF and that the unusually high within-spore genetic variation that is seen in this fungus cannot be due to high ploidy.
The evolutionary dynamics of haplodiploidy: Genome architecture and haploid viability

PubMed Central

Blackmon, Heath; Hardy, Nate B.; Ross, Laura

2015-01-01

Haplodiploid reproduction, in which males are haploid and females are diploid, is widespread among animals, yet we understand little about the forces responsible for its evolution. The current theory is that haplodiploidy has evolved through genetic conflicts, as it provides a transmission advantage to mothers. Male viability is thought to be a major limiting factor; diploid individuals tend to harbor many recessive lethal mutations. This theory predicts that the evolution of haplodiploidy is more likely in male heterogametic lineages with few chromosomes, as genes on the X chromosome are often expressed in a haploid environment, and the fewer the chromosome number, the greater the proportion of the total genome that is X‐linked. We test this prediction with comparative phylogenetic analyses of mites, among which haplodiploidy has evolved repeatedly. We recover a negative correlation between chromosome number and haplodiploidy, find evidence that low chromosome number evolved prior to haplodiploidy, and that it is unlikely that diplodiploidy has reevolved from haplodiploid lineages of mites. These results are consistent with the predicted importance of haploid male viability. PMID:26462452
The evolutionary dynamics of haplodiploidy: Genome architecture and haploid viability.

PubMed

Blackmon, Heath; Hardy, Nate B; Ross, Laura

2015-11-01

Haplodiploid reproduction, in which males are haploid and females are diploid, is widespread among animals, yet we understand little about the forces responsible for its evolution. The current theory is that haplodiploidy has evolved through genetic conflicts, as it provides a transmission advantage to mothers. Male viability is thought to be a major limiting factor; diploid individuals tend to harbor many recessive lethal mutations. This theory predicts that the evolution of haplodiploidy is more likely in male heterogametic lineages with few chromosomes, as genes on the X chromosome are often expressed in a haploid environment, and the fewer the chromosome number, the greater the proportion of the total genome that is X-linked. We test this prediction with comparative phylogenetic analyses of mites, among which haplodiploidy has evolved repeatedly. We recover a negative correlation between chromosome number and haplodiploidy, find evidence that low chromosome number evolved prior to haplodiploidy, and that it is unlikely that diplodiploidy has reevolved from haplodiploid lineages of mites. These results are consistent with the predicted importance of haploid male viability. © 2015 The Author(s). Evolution published by Wiley Periodicals, Inc. on behalf of The Society for the Study of Evolution.
Haploids: Constraints and opportunities in plant breeding.

PubMed

Dwivedi, Sangam L; Britt, Anne B; Tripathi, Leena; Sharma, Shivali; Upadhyaya, Hari D; Ortiz, Rodomiro

2015-11-01

The discovery of haploids in higher plants led to the use of doubled haploid (DH) technology in plant breeding. This article provides the state of the art on DH technology including the induction and identification of haploids, what factors influence haploid induction, molecular basis of microspore embryogenesis, the genetics underpinnings of haploid induction and its use in plant breeding, particularly to fix traits and unlock genetic variation. Both in vitro and in vivo methods have been used to induce haploids that are thereafter chromosome doubled to produce DH. Various heritable factors contribute to the successful induction of haploids, whose genetics is that of a quantitative trait. Genomic regions associated with in vitro and in vivo DH production were noted in various crops with the aid of DNA markers. It seems that F2 plants are the most suitable for the induction of DH lines than F1 plants. Identifying putative haploids is a key issue in haploid breeding. DH technology in Brassicas and cereals, such as barley, maize, rice, rye and wheat, has been improved and used routinely in cultivar development, while in other food staples such as pulses and root crops the technology has not reached to the stage leading to its application in plant breeding. The centromere-mediated haploid induction system has been used in Arabidopsis, but not yet in crops. Most food staples are derived from genomic resources-rich crops, including those with sequenced reference genomes. The integration of genomic resources with DH technology provides new opportunities for the improving selection methods, maximizing selection gains and accelerate cultivar development. Marker-aided breeding and DH technology have been used to improve host plant resistance in barley, rice, and wheat. Multinational seed companies are using DH technology in large-scale production of inbred lines for further development of hybrid cultivars, particularly in maize. The public sector provides support to
Genome engineering in human cells.

PubMed

Song, Minjung; Kim, Young-Hoon; Kim, Jin-Soo; Kim, Hyongbum

2014-01-01

Genome editing in human cells is of great value in research, medicine, and biotechnology. Programmable nucleases including zinc-finger nucleases, transcription activator-like effector nucleases, and RNA-guided engineered nucleases recognize a specific target sequence and make a double-strand break at that site, which can result in gene disruption, gene insertion, gene correction, or chromosomal rearrangements. The target sequence complexities of these programmable nucleases are higher than 3.2 mega base pairs, the size of the haploid human genome. Here, we briefly introduce the structure of the human genome and the characteristics of each programmable nuclease, and review their applications in human cells including pluripotent stem cells. In addition, we discuss various delivery methods for nucleases, programmable nickases, and enrichment of gene-edited human cells, all of which facilitate efficient and precise genome editing in human cells.
Diploid, but not haploid, human embryonic stem cells can be derived from microsurgically repaired tripronuclear human zygotes

PubMed Central

Fan, Yong; Li, Rong; Huang, Jin; Yu, Yang; Qiao, Jie

2013-01-01

Human embryonic stem cells have shown tremendous potential in regenerative medicine, and the recent progress in haploid embryonic stem cells provides new insights for future applications of embryonic stem cells. Disruption of normal fertilized embryos remains controversial; thus, the development of a new source for human embryonic stem cells is important for their usefulness. Here, we investigated the feasibility of haploid and diploid embryo reconstruction and embryonic stem cell derivation using microsurgically repaired tripronuclear human zygotes. Diploid and haploid zygotes were successfully reconstructed, but a large proportion of them still had a tripolar spindle assembly. The reconstructed embryos developed to the blastocyst stage, although the loss of chromosomes was observed in these zygotes. Finally, triploid and diploid human embryonic stem cells were derived from tripronuclear and reconstructed zygotes (from which only one pronucleus was removed), but haploid human embryonic stem cells were not successfully derived from the reconstructed zygotes when two pronuclei were removed. Both triploid and diploid human embryonic stem cells showed the general characteristics of human embryonic stem cells. These results indicate that the lower embryo quality resulting from abnormal spindle assembly contributed to the failure of the haploid embryonic stem cell derivation. However, the successful derivation of diploid embryonic stem cells demonstrated that microsurgical tripronuclear zygotes are an alternative source of human embryonic stem cells. In the future, improving spindle assembly will facilitate the application of triploid zygotes to the field of haploid embryonic stem cells. PMID:23255130
A Perfect Match Genomic Landscape Provides a Unified Framework for the Precise Detection of Variation in Natural and Synthetic Haploid Genomes

PubMed Central

Palacios-Flores, Kim; García-Sotelo, Jair; Castillo, Alejandra; Uribe, Carina; Aguilar, Luis; Morales, Lucía; Gómez-Romero, Laura; Reyes, José; Garciarubio, Alejandro; Boege, Margareta; Dávila, Guillermo

2018-01-01

We present a conceptually simple, sensitive, precise, and essentially nonstatistical solution for the analysis of genome variation in haploid organisms. The generation of a Perfect Match Genomic Landscape (PMGL), which computes intergenome identity with single nucleotide resolution, reveals signatures of variation wherever a query genome differs from a reference genome. Such signatures encode the precise location of different types of variants, including single nucleotide variants, deletions, insertions, and amplifications, effectively introducing the concept of a general signature of variation. The precise nature of variants is then resolved through the generation of targeted alignments between specific sets of sequence reads and known regions of the reference genome. Thus, the perfect match logic decouples the identification of the location of variants from the characterization of their nature, providing a unified framework for the detection of genome variation. We assessed the performance of the PMGL strategy via simulation experiments. We determined the variation profiles of natural genomes and of a synthetic chromosome, both in the context of haploid yeast strains. Our approach uncovered variants that have previously escaped detection. Moreover, our strategy is ideally suited for further refining high-quality reference genomes. The source codes for the automated PMGL pipeline have been deposited in a public repository. PMID:29367403
A Perfect Match Genomic Landscape Provides a Unified Framework for the Precise Detection of Variation in Natural and Synthetic Haploid Genomes.

PubMed

Palacios-Flores, Kim; García-Sotelo, Jair; Castillo, Alejandra; Uribe, Carina; Aguilar, Luis; Morales, Lucía; Gómez-Romero, Laura; Reyes, José; Garciarubio, Alejandro; Boege, Margareta; Dávila, Guillermo

2018-04-01

We present a conceptually simple, sensitive, precise, and essentially nonstatistical solution for the analysis of genome variation in haploid organisms. The generation of a Perfect Match Genomic Landscape (PMGL), which computes intergenome identity with single nucleotide resolution, reveals signatures of variation wherever a query genome differs from a reference genome. Such signatures encode the precise location of different types of variants, including single nucleotide variants, deletions, insertions, and amplifications, effectively introducing the concept of a general signature of variation. The precise nature of variants is then resolved through the generation of targeted alignments between specific sets of sequence reads and known regions of the reference genome. Thus, the perfect match logic decouples the identification of the location of variants from the characterization of their nature, providing a unified framework for the detection of genome variation. We assessed the performance of the PMGL strategy via simulation experiments. We determined the variation profiles of natural genomes and of a synthetic chromosome, both in the context of haploid yeast strains. Our approach uncovered variants that have previously escaped detection. Moreover, our strategy is ideally suited for further refining high-quality reference genomes. The source codes for the automated PMGL pipeline have been deposited in a public repository. Copyright © 2018 by the Genetics Society of America.
Characterization of in vitro haploid and doubled haploid Chrysanthemum morifolium plants via unfertilized ovule culture for phenotypical traits and DNA methylation pattern

PubMed Central

Wang, Haibin; Dong, Bin; Jiang, Jiafu; Fang, Weimin; Guan, Zhiyong; Liao, Yuan; Chen, Sumei; Chen, Fadi

2014-01-01

Chrysanthemum is one of important ornamental species in the world. Its highly heterozygous state complicates molecular analysis, so it is of interest to derive haploid forms. A total of 2579 non-fertilized chrysanthemum ovules pollinated by Argyranthemum frutescens were cultured in vitro to isolate haploid progeny. One single regenerant emerged from each of three of the 105 calli produced. Chromosome counts and microsatellite fingerprinting showed that only one of the regenerants was a true haploid. Nine doubled haploid derivatives were subsequently generated by colchicine treatment of 80 in vitro cultured haploid nodal segments. Morphological screening showed that the haploid plant was shorter than the doubled haploids, and developed smaller leaves, flowers, and stomata. An in vitro pollen germination test showed that few of the haploid's pollen were able to germinate and those which did so were abnormal. Both the haploid and the doubled haploids produced yellow flowers, whereas those of the maternal parental cultivar were mauve. Methylation-sensitive amplification polymorphism (MSAP) profiling was further used to detect alterations in cytosine methylation caused by the haploidization and/or the chromosome doubling processes. While 52.2% of the resulting amplified fragments were cytosine methylated in the maternal parent's genome, the corresponding proportions for the haploid's and doubled haploids' genomes were, respectively, 47.0 and 51.7%, demonstrating a reduction in global cytosine methylation caused by haploidization and a partial recovery following chromosome doubling. PMID:25566305
Human Genome Sequencing in Health and Disease

PubMed Central

Gonzaga-Jauregui, Claudia; Lupski, James R.; Gibbs, Richard A.

2013-01-01

Following the “finished,” euchromatic, haploid human reference genome sequence, the rapid development of novel, faster, and cheaper sequencing technologies is making possible the era of personalized human genomics. Personal diploid human genome sequences have been generated, and each has contributed to our better understanding of variation in the human genome. We have consequently begun to appreciate the vastness of individual genetic variation from single nucleotide to structural variants. Translation of genome-scale variation into medically useful information is, however, in its infancy. This review summarizes the initial steps undertaken in clinical implementation of personal genome information, and describes the application of whole-genome and exome sequencing to identify the cause of genetic diseases and to suggest adjuvant therapies. Better analysis tools and a deeper understanding of the biology of our genome are necessary in order to decipher, interpret, and optimize clinical utility of what the variation in the human genome can teach us. Personal genome sequencing may eventually become an instrument of common medical practice, providing information that assists in the formulation of a differential diagnosis. We outline herein some of the remaining challenges. PMID:22248320
A genotyping system capable of simultaneously analyzing >1000 single nucleotide polymorphisms in a haploid genome.

PubMed

Wang, Hui-Yun; Luo, Minjie; Tereshchenko, Irina V; Frikker, Danielle M; Cui, Xiangfeng; Li, James Y; Hu, Guohong; Chu, Yi; Azaro, Marco A; Lin, Yong; Shen, Li; Yang, Qifeng; Kambouris, Manousos E; Gao, Richeng; Shih, Weichung; Li, Honghua

2005-02-01

A high-throughput genotyping system for scoring single nucleotide polymorphisms (SNPs) has been developed. With this system, >1000 SNPs can be analyzed in a single assay, with a sensitivity that allows the use of single haploid cells as starting material. In the multiplex polymorphic sequence amplification step, instead of attaching universal sequences to the amplicons, primers that are unlikely to have nonspecific and productive interactions are used. Genotypes of SNPs are then determined by using the widely accessible microarray technology and the simple single-base extension assay. Three SNP panels, each consisting of >1000 SNPs, were incorporated into this system. The system was used to analyze 24 human genomic DNA samples. With 5 ng of human genomic DNA, the average detection rate was 98.22% when single probes were used, and 96.71% could be detected by dual probes in different directions. When single sperm cells were used, 91.88% of the SNPs were detectable, which is comparable to the level that was reached when very few genetic markers were used. By using a dual-probe assay, the average genotyping accuracy was 99.96% for 5 ng of human genomic DNA and 99.95% for single sperm. This system may be used to significantly facilitate large-scale genetic analysis even if the amount of DNA template is very limited or even highly degraded as that obtained from paraffin-embedded cancer specimens, and to make many unpractical research projects highly realistic and affordable.
Evolution of haploid selection in predominantly diploid organisms

PubMed Central

Otto, Sarah P.; Scott, Michael F.; Immler, Simone

2015-01-01

Diploid organisms manipulate the extent to which their haploid gametes experience selection. Animals typically produce sperm with a diploid complement of most proteins and RNA, limiting selection on the haploid genotype. Plants, however, exhibit extensive expression in pollen, with actively transcribed haploid genomes. Here we analyze models that track the evolution of genes that modify the strength of haploid selection to predict when evolution intensifies and when it dampens the “selective arena” within which male gametes compete for fertilization. Considering deleterious mutations, evolution leads diploid mothers to strengthen selection among haploid sperm/pollen, because this reduces the mutation load inherited by their diploid offspring. If, however, selection acts in opposite directions in haploids and diploids (“ploidally antagonistic selection”), mothers evolve to reduce haploid selection to avoid selectively amplifying alleles harmful to their offspring. Consequently, with maternal control, selection in the haploid phase either is maximized or reaches an intermediate state, depending on the deleterious mutation rate relative to the extent of ploidally antagonistic selection. By contrast, evolution generally leads diploid fathers to mask mutations in their gametes to the maximum extent possible, whenever masking (e.g., through transcript sharing) increases the average fitness of a father’s gametes. We discuss the implications of this maternal–paternal conflict over the extent of haploid selection and describe empirical studies needed to refine our understanding of haploid selection among seemingly diploid organisms. PMID:26669442
A human haploid gene trap collection to study lncRNAs with unusual RNA biology.

PubMed

Kornienko, Aleksandra E; Vlatkovic, Irena; Neesen, Jürgen; Barlow, Denise P; Pauler, Florian M

2016-01-01

Many thousand long non-coding (lnc) RNAs are mapped in the human genome. Time consuming studies using reverse genetic approaches by post-transcriptional knock-down or genetic modification of the locus demonstrated diverse biological functions for a few of these transcripts. The Human Gene Trap Mutant Collection in haploid KBM7 cells is a ready-to-use tool for studying protein-coding gene function. As lncRNAs show remarkable differences in RNA biology compared to protein-coding genes, it is unclear if this gene trap collection is useful for functional analysis of lncRNAs. Here we use the uncharacterized LOC100288798 lncRNA as a model to answer this question. Using public RNA-seq data we show that LOC100288798 is ubiquitously expressed, but inefficiently spliced. The minor spliced LOC100288798 isoforms are exported to the cytoplasm, whereas the major unspliced isoform is nuclear localized. This shows that LOC100288798 RNA biology differs markedly from typical mRNAs. De novo assembly from RNA-seq data suggests that LOC100288798 extends 289kb beyond its annotated 3' end and overlaps the downstream SLC38A4 gene. Three cell lines with independent gene trap insertions in LOC100288798 were available from the KBM7 gene trap collection. RT-qPCR and RNA-seq confirmed successful lncRNA truncation and its extended length. Expression analysis from RNA-seq data shows significant deregulation of 41 protein-coding genes upon LOC100288798 truncation. Our data shows that gene trap collections in human haploid cell lines are useful tools to study lncRNAs, and identifies the previously uncharacterized LOC100288798 as a potential gene regulator.

Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly

PubMed Central

Schneider, Valerie A.; Graves-Lindsay, Tina; Howe, Kerstin; Bouk, Nathan; Chen, Hsiu-Chuan; Kitts, Paul A.; Murphy, Terence D.; Pruitt, Kim D.; Thibaud-Nissen, Françoise; Albracht, Derek; Fulton, Robert S.; Kremitzki, Milinn; Magrini, Vincent; Markovic, Chris; McGrath, Sean; Steinberg, Karyn Meltz; Auger, Kate; Chow, William; Collins, Joanna; Harden, Glenn; Hubbard, Timothy; Pelan, Sarah; Simpson, Jared T.; Threadgold, Glen; Torrance, James; Wood, Jonathan M.; Clarke, Laura; Koren, Sergey; Boitano, Matthew; Peluso, Paul; Li, Heng; Chin, Chen-Shan; Phillippy, Adam M.; Durbin, Richard; Wilson, Richard K.; Flicek, Paul; Eichler, Evan E.; Church, Deanna M.

2017-01-01

The human reference genome assembly plays a central role in nearly all aspects of today's basic and clinical research. GRCh38 is the first coordinate-changing assembly update since 2009; it reflects the resolution of roughly 1000 issues and encompasses modifications ranging from thousands of single base changes to megabase-scale path reorganizations, gap closures, and localization of previously orphaned sequences. We developed a new approach to sequence generation for targeted base updates and used data from new genome mapping technologies and single haplotype resources to identify and resolve larger assembly issues. For the first time, the reference assembly contains sequence-based representations for the centromeres. We also expanded the number of alternate loci to create a reference that provides a more robust representation of human population variation. We demonstrate that the updates render the reference an improved annotation substrate, alter read alignments in unchanged regions, and impact variant interpretation at clinically relevant loci. We additionally evaluated a collection of new de novo long-read haploid assemblies and conclude that although the new assemblies compare favorably to the reference with respect to continuity, error rate, and gene completeness, the reference still provides the best representation for complex genomic regions and coding sequences. We assert that the collected updates in GRCh38 make the newer assembly a more robust substrate for comprehensive analyses that will promote our understanding of human biology and advance our efforts to improve health. PMID:28396521
Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly.

PubMed

Schneider, Valerie A; Graves-Lindsay, Tina; Howe, Kerstin; Bouk, Nathan; Chen, Hsiu-Chuan; Kitts, Paul A; Murphy, Terence D; Pruitt, Kim D; Thibaud-Nissen, Françoise; Albracht, Derek; Fulton, Robert S; Kremitzki, Milinn; Magrini, Vincent; Markovic, Chris; McGrath, Sean; Steinberg, Karyn Meltz; Auger, Kate; Chow, William; Collins, Joanna; Harden, Glenn; Hubbard, Timothy; Pelan, Sarah; Simpson, Jared T; Threadgold, Glen; Torrance, James; Wood, Jonathan M; Clarke, Laura; Koren, Sergey; Boitano, Matthew; Peluso, Paul; Li, Heng; Chin, Chen-Shan; Phillippy, Adam M; Durbin, Richard; Wilson, Richard K; Flicek, Paul; Eichler, Evan E; Church, Deanna M

2017-05-01

The human reference genome assembly plays a central role in nearly all aspects of today's basic and clinical research. GRCh38 is the first coordinate-changing assembly update since 2009; it reflects the resolution of roughly 1000 issues and encompasses modifications ranging from thousands of single base changes to megabase-scale path reorganizations, gap closures, and localization of previously orphaned sequences. We developed a new approach to sequence generation for targeted base updates and used data from new genome mapping technologies and single haplotype resources to identify and resolve larger assembly issues. For the first time, the reference assembly contains sequence-based representations for the centromeres. We also expanded the number of alternate loci to create a reference that provides a more robust representation of human population variation. We demonstrate that the updates render the reference an improved annotation substrate, alter read alignments in unchanged regions, and impact variant interpretation at clinically relevant loci. We additionally evaluated a collection of new de novo long-read haploid assemblies and conclude that although the new assemblies compare favorably to the reference with respect to continuity, error rate, and gene completeness, the reference still provides the best representation for complex genomic regions and coding sequences. We assert that the collected updates in GRCh38 make the newer assembly a more robust substrate for comprehensive analyses that will promote our understanding of human biology and advance our efforts to improve health. © 2017 Schneider et al.; Published by Cold Spring Harbor Laboratory Press.
Identifying structural variation in haploid microbial genomes from short-read resequencing data using breseq.

PubMed

Barrick, Jeffrey E; Colburn, Geoffrey; Deatherage, Daniel E; Traverse, Charles C; Strand, Matthew D; Borges, Jordan J; Knoester, David B; Reba, Aaron; Meyer, Austin G

2014-11-29

Mutations that alter chromosomal structure play critical roles in evolution and disease, including in the origin of new lifestyles and pathogenic traits in microbes. Large-scale rearrangements in genomes are often mediated by recombination events involving new or existing copies of mobile genetic elements, recently duplicated genes, or other repetitive sequences. Most current software programs for predicting structural variation from short-read DNA resequencing data are intended primarily for use on human genomes. They typically disregard information in reads mapping to repeat sequences, and significant post-processing and manual examination of their output is often required to rule out false-positive predictions and precisely describe mutational events. We have implemented an algorithm for identifying structural variation from DNA resequencing data as part of the breseq computational pipeline for predicting mutations in haploid microbial genomes. Our method evaluates the support for new sequence junctions present in a clonal sample from split-read alignments to a reference genome, including matches to repeat sequences. Then, it uses a statistical model of read coverage evenness to accept or reject these predictions. Finally, breseq combines predictions of new junctions and deleted chromosomal regions to output biologically relevant descriptions of mutations and their effects on genes. We demonstrate the performance of breseq on simulated Escherichia coli genomes with deletions generating unique breakpoint sequences, new insertions of mobile genetic elements, and deletions mediated by mobile elements. Then, we reanalyze data from an E. coli K-12 mutation accumulation evolution experiment in which structural variation was not previously identified. Transposon insertions and large-scale chromosomal changes detected by breseq account for ~25% of spontaneous mutations in this strain. In all cases, we find that breseq is able to reliably predict structural variation
Induction of gynogenetic and androgenetic haploid and doubled haploid development in the brown trout (Salmo trutta Linnaeus 1758).

PubMed

Michalik, O; Dobosz, S; Zalewski, T; Sapota, M; Ocalewicz, K

2015-04-01

Gynogenetic and androgenetic brown trout (Salmo trutta Linnaeus 1758) haploids (Hs) and doubled haploids (DHs) were produced in the present research. Haploid development was induced by radiation-induced genetic inactivation of spermatozoa (gynogenesis) or eggs (androgenesis) before insemination. To provide DHs, gynogenetic and androgenetic haploid zygotes were subjected to the high pressure shock to suppress the first mitotic cleavage. Among haploids, gynogenetic embryos were showing lower mortality when compared to the androgenetic embryos; however, most of them die before the first feeding stage. Gynogenetic doubled haploids provided in the course of the brown trout eggs activation performed by homologous and heterologous sperm (rainbow trout) were developing equally showing hatching rates of 14.76 ± 2.4% and 16.14 ± 2.90% and the survival rates at the first feeding stage of 10.48 ± 3.48% and 12.78 ± 2.18%, respectively. Significantly, lower survival rate was observed among androgenetic progenies from the diploid groups with only few specimens that survived to the first feeding stage. Cytogenetic survey showed that among embryos from the diploid variants of the research, only gynogenetic individuals possessed doubled sets of chromosomes. Thus, it is reasonable to assume that radiation employed for the genetic inactivation of the brown trout eggs misaligned mechanism responsible for the cell divisions and might have delayed or even arrested the first mitotic cleavage in the androgenetic brown trout zygotes. Moreover, protocol for the radiation-induced inactivation of the paternal and maternal genome should be adjusted as some of the cytogenetically surveyed gynogenetic and androgenetic embryos exhibited fragments of the irradiated chromosomes. © 2015 Blackwell Verlag GmbH.
Decoding the massive genome of loblolly pine using haploid DNA and novel assembly strategies

PubMed Central

2014-01-01

Background The size and complexity of conifer genomes has, until now, prevented full genome sequencing and assembly. The large research community and economic importance of loblolly pine, Pinus taeda L., made it an early candidate for reference sequence determination. Results We develop a novel strategy to sequence the genome of loblolly pine that combines unique aspects of pine reproductive biology and genome assembly methodology. We use a whole genome shotgun approach relying primarily on next generation sequence generated from a single haploid seed megagametophyte from a loblolly pine tree, 20-1010, that has been used in industrial forest tree breeding. The resulting sequence and assembly was used to generate a draft genome spanning 23.2 Gbp and containing 20.1 Gbp with an N50 scaffold size of 66.9 kbp, making it a significant improvement over available conifer genomes. The long scaffold lengths allow the annotation of 50,172 gene models with intron lengths averaging over 2.7 kbp and sometimes exceeding 100 kbp in length. Analysis of orthologous gene sets identifies gene families that may be unique to conifers. We further characterize and expand the existing repeat library based on the de novo analysis of the repetitive content, estimated to encompass 82% of the genome. Conclusions In addition to its value as a resource for researchers and breeders, the loblolly pine genome sequence and assembly reported here demonstrates a novel approach to sequencing the large and complex genomes of this important group of plants that can now be widely applied. PMID:24647006
A p53-dependent response limits the viability of mammalian haploid cells

PubMed Central

Olbrich, Teresa; Mayor-Ruiz, Cristina; Vega-Sendino, Maria; Gomez, Carmen; Ortega, Sagrario; Ruiz, Sergio; Fernandez-Capetillo, Oscar

2017-01-01

The recent development of haploid cell lines has facilitated forward genetic screenings in mammalian cells. These lines include near-haploid human cell lines isolated from a patient with chronic myelogenous leukemia (KBM7 and HAP1), as well as haploid embryonic stem cells derived from several organisms. In all cases, haploidy was shown to be an unstable state, so that cultures of mammalian haploid cells rapidly become enriched in diploids. Here we show that the observed diploidization is due to a proliferative disadvantage of haploid cells compared with diploid cells. Accordingly, single-cell–sorted haploid mammalian cells maintain the haploid state for prolonged periods, owing to the absence of competing diploids. Although the duration of interphase is similar in haploid and diploid cells, haploid cells spend longer in mitosis, indicative of problems in chromosome segregation. In agreement with this, a substantial proportion of the haploids die at or shortly after the last mitosis through activation of a p53-dependent cytotoxic response. Finally, we show that p53 deletion stabilizes haploidy in human HAP1 cells and haploid mouse embryonic stem cells. We propose that, similar to aneuploidy or tetraploidy, haploidy triggers a p53-dependent response that limits the fitness of mammalian cells. PMID:28808015
Comparative Genomic Analyses of the Human NPHP1 Locus Reveal Complex Genomic Architecture and Its Regional Evolution in Primates

PubMed Central

Yuan, Bo; Liu, Pengfei; Gupta, Aditya; Beck, Christine R.; Tejomurtula, Anusha; Campbell, Ian M.; Gambin, Tomasz; Simmons, Alexandra D.; Withers, Marjorie A.; Harris, R. Alan; Rogers, Jeffrey; Schwartz, David C.; Lupski, James R.

2015-01-01

Many loci in the human genome harbor complex genomic structures that can result in susceptibility to genomic rearrangements leading to various genomic disorders. Nephronophthisis 1 (NPHP1, MIM# 256100) is an autosomal recessive disorder that can be caused by defects of NPHP1; the gene maps within the human 2q13 region where low copy repeats (LCRs) are abundant. Loss of function of NPHP1 is responsible for approximately 85% of the NPHP1 cases—about 80% of such individuals carry a large recurrent homozygous NPHP1 deletion that occurs via nonallelic homologous recombination (NAHR) between two flanking directly oriented ~45 kb LCRs. Published data revealed a non-pathogenic inversion polymorphism involving the NPHP1 gene flanked by two inverted ~358 kb LCRs. Using optical mapping and array-comparative genomic hybridization, we identified three potential novel structural variant (SV) haplotypes at the NPHP1 locus that may protect a haploid genome from the NPHP1 deletion. Inter-species comparative genomic analyses among primate genomes revealed massive genomic changes during evolution. The aggregated data suggest that dynamic genomic rearrangements occurred historically within the NPHP1 locus and generated SV haplotypes observed in the human population today, which may confer differential susceptibility to genomic instability and the NPHP1 deletion within a personal genome. Our study documents diverse SV haplotypes at a complex LCR-laden human genomic region. Comparative analyses provide a model for how this complex region arose during primate evolution, and studies among humans suggest that intra-species polymorphism may potentially modulate an individual’s susceptibility to acquiring disease-associated alleles. PMID:26641089
The resurgence of haploids in higher plants.

PubMed

Forster, Brian P; Heberle-Bors, Erwin; Kasha, Ken J; Touraev, Alisher

2007-08-01

The life cycle of plants proceeds via alternating generations of sporophytes and gametophytes. The dominant and most obvious life form of higher plants is the free-living sporophyte. The sporophyte is the product of fertilization of male and female gametes and contains a set of chromosomes from each parent; its genomic constitution is 2n. Chromosome reduction at meiosis means cells of the gametophytes carry half the sporophytic complement of chromosomes (n). Plant haploid research began with the discovery that sporophytes can be produced in higher plants carrying the gametic chromosome number (n instead of 2n) and that their chromosome number can subsequently be doubled up by colchicine treatment. Recent technological innovations, greater understanding of underlying control mechanisms and an expansion of end-user applications has brought about a resurgence of interest in haploids in higher plants.
Evolution of haploid-diploid life cycles when haploid and diploid fitnesses are not equal.

PubMed

Scott, Michael F; Rescan, Marie

2017-02-01

Many organisms spend a significant portion of their life cycle as haploids and as diploids (a haploid-diploid life cycle). However, the evolutionary processes that could maintain this sort of life cycle are unclear. Most previous models of ploidy evolution have assumed that the fitness effects of new mutations are equal in haploids and homozygous diploids, however, this equivalency is not supported by empirical data. With different mutational effects, the overall (intrinsic) fitness of a haploid would not be equal to that of a diploid after a series of substitution events. Intrinsic fitness differences between haploids and diploids can also arise directly, for example because diploids tend to have larger cell sizes than haploids. Here, we incorporate intrinsic fitness differences into genetic models for the evolution of time spent in the haploid versus diploid phases, in which ploidy affects whether new mutations are masked. Life-cycle evolution can be affected by intrinsic fitness differences between phases, the masking of mutations, or a combination of both. We find parameter ranges where these two selective forces act and show that the balance between them can favor convergence on a haploid-diploid life cycle, which is not observed in the absence of intrinsic fitness differences. © 2016 The Author(s). Evolution © 2016 The Society for the Study of Evolution.
Evaluating droplet digital PCR for the quantification of human genomic DNA: converting copies per nanoliter to nanograms nuclear DNA per microliter.

PubMed

Duewer, David L; Kline, Margaret C; Romsos, Erica L; Toman, Blaza

2018-05-01

The highly multiplexed polymerase chain reaction (PCR) assays used for forensic human identification perform best when used with an accurately determined quantity of input DNA. To help ensure the reliable performance of these assays, we are developing a certified reference material (CRM) for calibrating human genomic DNA working standards. To enable sharing information over time and place, CRMs must provide accurate and stable values that are metrologically traceable to a common reference. We have shown that droplet digital PCR (ddPCR) limiting dilution end-point measurements of the concentration of DNA copies per volume of sample can be traceably linked to the International System of Units (SI). Unlike values assigned using conventional relationships between ultraviolet absorbance and DNA mass concentration, entity-based ddPCR measurements are expected to be stable over time. However, the forensic community expects DNA quantity to be stated in terms of mass concentration rather than entity concentration. The transformation can be accomplished given SI-traceable values and uncertainties for the number of nucleotide bases per human haploid genome equivalent (HHGE) and the average molar mass of a nucleotide monomer in the DNA polymer. This report presents the considerations required to establish the metrological traceability of ddPCR-based mass concentration estimates of human nuclear DNA. Graphical abstract The roots of metrological traceability for human nuclear DNA mass concentration results. Values for the factors in blue must be established experimentally. Values for the factors in red have been established from authoritative source materials. HHGE stands for "haploid human genome equivalent"; there are two HHGE per diploid human genome.
The Small Nuclear Genomes of Selaginella Are Associated with a Low Rate of Genome Size Evolution.

PubMed

Baniaga, Anthony E; Arrigo, Nils; Barker, Michael S

2016-06-03

The haploid nuclear genome size (1C DNA) of vascular land plants varies over several orders of magnitude. Much of this observed diversity in genome size is due to the proliferation and deletion of transposable elements. To date, all vascular land plant lineages with extremely small nuclear genomes represent recently derived states, having ancestors with much larger genome sizes. The Selaginellaceae represent an ancient lineage with extremely small genomes. It is unclear how small nuclear genomes evolved in Selaginella We compared the rates of nuclear genome size evolution in Selaginella and major vascular plant clades in a comparative phylogenetic framework. For the analyses, we collected 29 new flow cytometry estimates of haploid genome size in Selaginella to augment publicly available data. Selaginella possess some of the smallest known haploid nuclear genome sizes, as well as the lowest rate of genome size evolution observed across all vascular land plants included in our analyses. Additionally, our analyses provide strong support for a history of haploid nuclear genome size stasis in Selaginella Our results indicate that Selaginella, similar to other early diverging lineages of vascular land plants, has relatively low rates of genome size evolution. Further, our analyses highlight that a rapid transition to a small genome size is only one route to an extremely small genome. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Ninety-six haploid yeast strains with individual disruptions of open reading frames between YOR097C and YOR192C, constructed for the Saccharomyces genome deletion project, have an additional mutation in the mismatch repair gene MSH3.

PubMed

Lehner, Kevin R; Stone, Megan M; Farber, Rosann A; Petes, Thomas D

2007-11-01

As part of the Saccharomyces Genome Deletion Project, sets of presumably isogenic haploid and diploid strains that differed only by single gene deletions were constructed. We found that one set of 96 strains (containing deletions of ORFs located between YOR097C and YOR192C) in the collection, which was derived from the haploid BY4741, has an additional mutation in the MSH3 mismatch repair gene.
Generation of genetically modified mice using CRISPR/Cas9 and haploid embryonic stem cell systems

PubMed Central

JIN, Li-Fang; LI, Jin-Song

2016-01-01

With the development of high-throughput sequencing technology in the post-genomic era, researchers have concentrated their efforts on elucidating the relationships between genes and their corresponding functions. Recently, important progress has been achieved in the generation of genetically modified mice based on CRISPR/Cas9 and haploid embryonic stem cell (haESC) approaches, which provide new platforms for gene function analysis, human disease modeling, and gene therapy. Here, we review the CRISPR/Cas9 and haESC technology for the generation of genetically modified mice and discuss the key challenges in the application of these approaches. PMID:27469251
Semiconservative quasispecies equations for polysomic genomes: The general case

NASA Astrophysics Data System (ADS)

Itan, Eran; Tannenbaum, Emmanuel

2010-06-01

This paper develops a formulation of the quasispecies equations appropriate for polysomic, semiconservatively replicating genomes. This paper is an extension of previous work on the subject, which considered the case of haploid genomes. Here, we develop a more general formulation of the quasispecies equations that is applicable to diploid and even polyploid genomes. Interestingly, with an appropriate classification of population fractions, we obtain a system of equations that is formally identical to the haploid case. As with the work for haploid genomes, we consider both random and immortal DNA strand chromosome segregation mechanisms. However, in contrast to the haploid case, we have found that an analytical solution for the mean fitness is considerably more difficult to obtain for the polyploid case. Accordingly, whereas for the haploid case we obtained expressions for the mean fitness for the case of an analog of the single-fitness-peak landscape for arbitrary lesion repair probabilities (thereby allowing for noncomplementary genomes), here we solve for the mean fitness for the restricted case of perfect lesion repair.
Use of doubled haploid technology for development of stable drought tolerant bread wheat (Triticum aestivum L.) transgenics.

PubMed

Chauhan, Harsh; Khurana, Paramjit

2011-04-01

Anther culture-derived haploid embryos were used as explants for Agrobacterium-mediated genetic transformation of bread wheat (Triticum aestivum L. cv CPAN1676) using barley HVA1 gene for drought tolerance. Regenerated plantlets were checked for transgene integration in T₀ generation, and positive transgenic haploid plants were doubled by colchicine treatment. Stable transgenic doubled haploid plants were obtained, and transgene expression was monitored till T₄ generation, and no transgene silencing was observed over the generations. Doubled haploid transgenic plants have faster seed germination and seedling establishment and show better drought tolerance in comparison with nontransgenic, doubled haploid plants, as measured by per cent germination, seedling growth and biomass accumulation. Physiological evaluation for abiotic stress by assessing nitrate reductase enzyme activity and plant yield under post-anthesis water limitation revealed a better tolerance of the transgenics over the wild type. This is the first report on the production of double haploid transgenic wheat through anther culture technique in a commercial cultivar for a desirable trait. This method would also be useful in functional genomics of wheat and other allopolyploids of agronomic importance. © 2010 The Authors. Plant Biotechnology Journal © 2010 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd.
Androgenesis, gynogenesis, and parthenogenesis haploids in cucurbit species.

PubMed

Dong, Yan-Qi; Zhao, Wei-Xing; Li, Xiao-Hui; Liu, Xi-Cun; Gao, Ning-Ning; Huang, Jin-Hua; Wang, Wen-Ying; Xu, Xiao-Li; Tang, Zhen-Hai

2016-10-01

Haploids and doubled haploids are critical components of plant breeding. This review is focused on studies on haploids and double haploids inducted in cucurbits through in vitro pollination with irradiated pollen, unfertilized ovule/ovary culture, and anther/microspore culture during the last 30 years, as well as comprehensive analysis of the main factors of each process and comparison between chromosome doubling and ploidy identification methods, with special focus on the application of double haploids in plant breeding and genetics. This review identifies existing problems affecting the efficiency of androgenesis, gynogenesis, and parthenogenesis in cucurbit species. Donor plant genotypes and surrounding environments, developmental stages of explants, culture media, stress factors, and chromosome doubling and ploidy identification are compared at length and discussed as methodologies and protocols for androgenesis, gynogenesis, and parthenogenesis in haploid and double haploid production technologies.
Enterovirus D68 receptor requirements unveiled by haploid genetics

PubMed Central

Baggen, Jim; Thibaut, Hendrik Jan; Staring, Jacqueline; Jae, Lucas T.; Liu, Yue; Guo, Hongbo; Slager, Jasper J.; de Bruin, Jost W.; van Vliet, Arno L. W.; Blomen, Vincent A.; Overduin, Pieter; Sheng, Ju; de Haan, Cornelis A. M.; de Vries, Erik; Meijer, Adam; Rossmann, Michael G.; Brummelkamp, Thijn R.; van Kuppeveld, Frank J. M.

2016-01-01

Enterovirus D68 (EV-D68) is an emerging pathogen that can cause severe respiratory disease and is associated with cases of paralysis, especially among children. Heretofore, information on host factor requirements for EV-D68 infection is scarce. Haploid genetic screening is a powerful tool to reveal factors involved in the entry of pathogens. We performed a genome-wide haploid screen with the EV-D68 prototype Fermon strain to obtain a comprehensive overview of cellular factors supporting EV-D68 infection. We identified and confirmed several genes involved in sialic acid (Sia) biosynthesis, transport, and conjugation to be essential for infection. Moreover, by using knockout cell lines and gene reconstitution, we showed that both α2,6- and α2,3-linked Sia can be used as functional cellular EV-D68 receptors. Importantly, the screen did not reveal a specific protein receptor, suggesting that EV-D68 can use multiple redundant sialylated receptors. Upon testing recent clinical strains, we identified strains that showed a similar Sia dependency, whereas others could infect cells lacking surface Sia, indicating they can use an alternative, nonsialylated receptor. Nevertheless, these Sia-independent strains were still able to bind Sia on human erythrocytes, raising the possibility that these viruses can use multiple receptors. Sequence comparison of Sia-dependent and Sia-independent EV-D68 strains showed that many changes occurred near the canyon that might allow alternative receptor binding. Collectively, our findings provide insights into the identity of the EV-D68 receptor and suggest the possible existence of Sia-independent viruses, which are essential for understanding tropism and disease. PMID:26787879
Fixation Probability in a Haploid-Diploid Population.

PubMed

Bessho, Kazuhiro; Otto, Sarah P

2017-01-01

Classical population genetic theory generally assumes either a fully haploid or fully diploid life cycle. However, many organisms exhibit more complex life cycles, with both free-living haploid and diploid stages. Here we ask what the probability of fixation is for selected alleles in organisms with haploid-diploid life cycles. We develop a genetic model that considers the population dynamics using both the Moran model and Wright-Fisher model. Applying a branching process approximation, we obtain an accurate fixation probability assuming that the population is large and the net effect of the mutation is beneficial. We also find the diffusion approximation for the fixation probability, which is accurate even in small populations and for deleterious alleles, as long as selection is weak. These fixation probabilities from branching process and diffusion approximations are similar when selection is weak for beneficial mutations that are not fully recessive. In many cases, particularly when one phase predominates, the fixation probability differs substantially for haploid-diploid organisms compared to either fully haploid or diploid species. Copyright © 2017 by the Genetics Society of America.
Fixation Probability in a Haploid-Diploid Population

PubMed Central

Bessho, Kazuhiro; Otto, Sarah P.

2017-01-01

Classical population genetic theory generally assumes either a fully haploid or fully diploid life cycle. However, many organisms exhibit more complex life cycles, with both free-living haploid and diploid stages. Here we ask what the probability of fixation is for selected alleles in organisms with haploid-diploid life cycles. We develop a genetic model that considers the population dynamics using both the Moran model and Wright–Fisher model. Applying a branching process approximation, we obtain an accurate fixation probability assuming that the population is large and the net effect of the mutation is beneficial. We also find the diffusion approximation for the fixation probability, which is accurate even in small populations and for deleterious alleles, as long as selection is weak. These fixation probabilities from branching process and diffusion approximations are similar when selection is weak for beneficial mutations that are not fully recessive. In many cases, particularly when one phase predominates, the fixation probability differs substantially for haploid-diploid organisms compared to either fully haploid or diploid species. PMID:27866168
Doubled haploid production in Flax (Linum usitatissimum L.).

PubMed

Obert, Bohus; Zácková, Zuzana; Samaj, Jozef; Pretová, Anna

2009-01-01

There is a requirement of haploid and double haploid material and homozygous lines for cell culture studies and breeding in flax. Anther culture is currently the most successful method producing doubled haploid lines in flax. Recently, ovary culture was also described as a good source of doubled haploids. In this review we focus on tissue and plants regeneration using anther culture, and cultivation of ovaries containing unfertilized ovules. The effect of genotype, physiological status of donor plants, donor material pre-treatment and cultivation conditions for flax anthers and ovaries is discussed here. The process of plant regeneration from anther and ovary derived calli is also in the focus of this review. Attention is paid to the ploidy level of regenerated tissue and to the use of molecular markers for determining of gametic origin of flax plants derived from anther and ovary cultures. Finally, some future prospects on the use of doubled haploids in flax biotechnology are outlined here.

Human DAZL, DAZ and BOULE genes modulate primordial germ cell and haploid gamete formation

PubMed Central

Kee, Kehkooi; Angeles, Vanessa T; Flores, Martha; Nguyen, Ha Nam; Pera, Renee A Reijo

2009-01-01

The leading cause of infertility in men and women is quantitative and qualitative defects in human germ cell (oocyte and sperm) development. Yet, it has not been possible to examine the unique developmental genetics of human germ cell formation and differentiation due to inaccessibility of germ cells during fetal development. Although several studies have shown that germ cells can be differentiated from mouse and human embryonic stem cells, human germ cells differentiated in these studies generally did not develop beyond the earliest stages1-8. Here we used a germ cell reporter to quantitate and isolate primordial germ cells derived from both male and female hESCs. Then, by silencing and overexpressing genes that encode germ cell-specific cytoplasmic RNA-binding proteins (not transcription factors), we modulated human germ cell formation and developmental progression. We observed that human DAZL (Deleted in AZoospermia-Like) functions in primordial germ cell formation, whereas closely-related genes, DAZ and BOULE, promote later stages of meiosis and development of haploid gametes. These results are significant to the generation of gametes for future basic science and potential clinical applications. PMID:19865085
Production of haploids and doubled haploids in oil palm

PubMed Central

2010-01-01

Background Oil palm is the world's most productive oil-food crop despite yielding well below its theoretical maximum. This maximum could be approached with the introduction of elite F1 varieties. The development of such elite lines has thus far been prevented by difficulties in generating homozygous parental types for F1 generation. Results Here we present the first high-throughput screen to identify spontaneously-formed haploid (H) and doubled haploid (DH) palms. We secured over 1,000 Hs and one DH from genetically diverse material and derived further DH/mixoploid palms from Hs using colchicine. We demonstrated viability of pollen from H plants and expect to generate 100% homogeneous F1 seed from intercrosses between DH/mixoploids once they develop female inflorescences. Conclusions This study has generated genetically diverse H/DH palms from which parental clones can be selected in sufficient numbers to enable the commercial-scale breeding of F1 varieties. The anticipated step increase in productivity may help to relieve pressure to extend palm cultivation, and limit further expansion into biodiverse rainforest. PMID:20929530
Genome size variation in the pine fusiform rust pathogen Cronartium quercuum f.sp. fusiforme as determined by flow cytometry

Treesearch

Claire L Anderson; Thomas L Kubisiak; C Dana Nelson; Jason A Smith; John M Davis

2010-01-01

The genome size of the pine fusiform rust pathogen Cronartium quercuum f.sp. fusiforme (Cqf) was determined by flow cytometric analysis of propidium iodide-stained, intact haploid pycniospores with haploid spores of two genetically well characterized fungal species, Sclerotinia sclerotiorum and Puccinia graminis f.sp. tritici, as size standards. The Cqf haploid genome...
Single haplotype assembly of the human genome from a hydatidiform mole

PubMed Central

Steinberg, Karyn Meltz; Schneider, Valerie A.; Graves-Lindsay, Tina A.; Fulton, Robert S.; Agarwala, Richa; Huddleston, John; Shiryev, Sergey A.; Morgulis, Aleksandr; Surti, Urvashi; Warren, Wesley C.; Church, Deanna M.; Eichler, Evan E.; Wilson, Richard K.

2014-01-01

A complete reference assembly is essential for accurately interpreting individual genomes and associating variation with phenotypes. While the current human reference genome sequence is of very high quality, gaps and misassemblies remain due to biological and technical complexities. Large repetitive sequences and complex allelic diversity are the two main drivers of assembly error. Although increasing the length of sequence reads and library fragments can improve assembly, even the longest available reads do not resolve all regions. In order to overcome the issue of allelic diversity, we used genomic DNA from an essentially haploid hydatidiform mole, CHM1. We utilized several resources from this DNA including a set of end-sequenced and indexed BAC clones and 100× Illumina whole-genome shotgun (WGS) sequence coverage. We used the WGS sequence and the GRCh37 reference assembly to create an assembly of the CHM1 genome. We subsequently incorporated 382 finished BAC clone sequences to generate a draft assembly, CHM1_1.1 (NCBI AssemblyDB GCA_000306695.2). Analysis of gene, repetitive element, and segmental duplication content show this assembly to be of excellent quality and contiguity. However, comparison to assembly-independent resources, such as BAC clone end sequences and PacBio long reads, indicate misassembled regions. Most of these regions are enriched for structural variation and segmental duplication, and can be resolved in the future. This publicly available assembly will be integrated into the Genome Reference Consortium curation framework for further improvement, with the ultimate goal being a completely finished gap-free assembly. PMID:25373144
Verification and characterization of chromosome duplication in haploid maize.

PubMed

de Oliveira Couto, E G; Resende Von Pinho, E V; Von Pinho, R G; Veiga, A D; de Carvalho, M R; de Oliveira Bustamante, F; Nascimento, M S

2015-06-26

Doubled haploid technology has been used by various private companies. However, information regarding chromosome duplication methodologies, particularly those concerning techniques used to identify duplication in cells, is limited. Thus, we analyzed and characterized artificially doubled haploids using microsatellites molecular markers, pollen viability, and flow cytometry techniques. Evaluated material was obtained using two different chromosome duplication protocols in maize seeds considered haploids, resulting from the cross between the haploid inducer line KEMS and 4 hybrids (GNS 3225, GNS 3032, GNS 3264, and DKB 393). Fourteen days after duplication, plant samples were collected and assessed by flow cytometry. Further, the plants were transplanted to a field, and samples were collected for DNA analyses using microsatellite markers. The tassels were collected during anthesis for pollen viability analyses. Haploid, diploid, and mixoploid individuals were detected using flow cytometry, demonstrating that this technique was efficient for identifying doubled haploids. The microsatellites markers were also efficient for confirming the ploidies preselected by flow cytometry and for identifying homozygous individuals. Pollen viability showed a significant difference between the evaluated ploidies when the Alexander and propionic-carmin stains were used. The viability rates between the plodies analyzed show potential for fertilization.
Preparation and screening of an arrayed human genomic library generated with the P1 cloning system.

PubMed Central

Shepherd, N S; Pfrogner, B D; Coulby, J N; Ackerman, S L; Vaidyanathan, G; Sauer, R H; Balkenhol, T C; Sternberg, N

1994-01-01

We describe here the construction and initial characterization of a 3-fold coverage genomic library of the human haploid genome that was prepared using the bacteriophage P1 cloning system. The cloned DNA inserts were produced by size fractionation of a Sau3AI partial digest of high molecular weight genomic DNA isolated from primary cells of human foreskin fibroblasts. The inserts were cloned into the pAd10sacBII vector and packaged in vitro into P1 phage. These were used to generate recombinant bacterial clones, each of which was picked robotically from an agar plate into a well of a 96-well microtiter dish, grown overnight, and stored at -70 degrees C. The resulting library, designated DMPC-HFF#1 series A, consists of approximately 130,000-140,000 recombinant clones that were stored in 1500 microtiter dishes. To screen the library, clones were combined in a pooling strategy and specific loci were identified by PCR analysis. On average, the library contains two or three different clones for each locus screened. To date we have identified a total of 17 clones containing the hypoxanthine-guanine phosphoribosyltransferase, human serum albumin-human alpha-fetoprotein, p53, cyclooxygenase I, human apurinic endonuclease, beta-polymerase, and DNA ligase I genes. The cloned inserts average 80 kb in size and range from 70 to 95 kb, with one 49-kb insert and one 62-kb insert. Images PMID:8146166
The Evolution of Haploid Chromosome Numbers in the Sunflower Family

PubMed Central

Mota, Lucie; Torices, Rubén; Loureiro, João

2016-01-01

Chromosome number changes during the evolution of angiosperms are likely to have played a major role in speciation. Their study is of utmost importance, especially now, as a probabilistic model is available to study chromosome evolution within a phylogenetic framework. In the present study, likelihood models of chromosome number evolution were fitted to the largest family of flowering plants, the Asteraceae. Specifically, a phylogenetic supertree of this family was used to reconstruct the ancestral chromosome number and infer genomic events. Our approach inferred that the ancestral chromosome number of the family is n = 9. Also, according to the model that best explained our data, the evolution of haploid chromosome numbers in Asteraceae was a very dynamic process, with genome duplications and descending dysploidy being the most frequent genomic events in the evolution of this family. This model inferred more than one hundred whole genome duplication events; however, it did not find evidence for a paleopolyploidization at the base of this family, which has previously been hypothesized on the basis of sequence data from a limited number of species. The obtained results and potential causes of these discrepancies are discussed. PMID:27797951
Rapid and accurate identification of in vivo-induced haploid seeds based on oil content in maize

PubMed Central

Melchinger, Albrecht E.; Schipprack, Wolfgang; Würschum, Tobias; Chen, Shaojiang; Technow, Frank

2013-01-01

The needs of a growing human population require rapid and efficient development of improved cultivars by plant breeders. The doubled haploid (DH) technology enables generating completely homozygous lines in a single step and, thus, is central to modern genetics and breeding approaches. Rapid and reliable identification of seeds with a haploid embryo after in vivo haploid induction is elementary in the method utilized in maize but current systems have severe shortcomings preventing their use in many germplasm types. Here, we describe an alternative method for discrimination of haploid from diploid seeds based on differences in their oil content stemming from pollination with high oil inducers. After presenting some fundamental theory, we provide a proof-of-concept with experimental results, demonstrating acceptable error rates across different germplasm. Our approach represents a breakthrough in DH technology in maize, because it is amenable to automated high-throughput screening and applicable to any maize germplasm worldwide. PMID:23820577
Single haplotype assembly of the human genome from a hydatidiform mole.

PubMed

Steinberg, Karyn Meltz; Schneider, Valerie A; Graves-Lindsay, Tina A; Fulton, Robert S; Agarwala, Richa; Huddleston, John; Shiryev, Sergey A; Morgulis, Aleksandr; Surti, Urvashi; Warren, Wesley C; Church, Deanna M; Eichler, Evan E; Wilson, Richard K

2014-12-01

A complete reference assembly is essential for accurately interpreting individual genomes and associating variation with phenotypes. While the current human reference genome sequence is of very high quality, gaps and misassemblies remain due to biological and technical complexities. Large repetitive sequences and complex allelic diversity are the two main drivers of assembly error. Although increasing the length of sequence reads and library fragments can improve assembly, even the longest available reads do not resolve all regions. In order to overcome the issue of allelic diversity, we used genomic DNA from an essentially haploid hydatidiform mole, CHM1. We utilized several resources from this DNA including a set of end-sequenced and indexed BAC clones and 100× Illumina whole-genome shotgun (WGS) sequence coverage. We used the WGS sequence and the GRCh37 reference assembly to create an assembly of the CHM1 genome. We subsequently incorporated 382 finished BAC clone sequences to generate a draft assembly, CHM1_1.1 (NCBI AssemblyDB GCA_000306695.2). Analysis of gene, repetitive element, and segmental duplication content show this assembly to be of excellent quality and contiguity. However, comparison to assembly-independent resources, such as BAC clone end sequences and PacBio long reads, indicate misassembled regions. Most of these regions are enriched for structural variation and segmental duplication, and can be resolved in the future. This publicly available assembly will be integrated into the Genome Reference Consortium curation framework for further improvement, with the ultimate goal being a completely finished gap-free assembly. © 2014 Steinberg et al.; Published by Cold Spring Harbor Laboratory Press.
Polyploid titan cells produce haploid and aneuploid progeny to promote stress adaptation.

PubMed

Gerstein, Aleeza C; Fu, Man Shun; Mukaremera, Liliane; Li, Zhongming; Ormerod, Kate L; Fraser, James A; Berman, Judith; Nielsen, Kirsten

2015-10-13

Cryptococcus neoformans is a major life-threatening fungal pathogen. In response to the stress of the host environment, C. neoformans produces large polyploid titan cells. Titan cell production enhances the virulence of C. neoformans, yet whether the polyploid aspect of titan cells is specifically influential remains unknown. We show that titan cells were more likely to survive and produce offspring under multiple stress conditions than typical cells and that even their normally sized daughters maintained an advantage over typical cells in continued exposure to stress. Although polyploid titan cells generated haploid daughter cell progeny upon in vitro replication under nutrient-replete conditions, titan cells treated with the antifungal drug fluconazole produced fluconazole-resistant diploid and aneuploid daughter cells. Interestingly, a single titan mother cell was capable of generating multiple types of aneuploid daughter cells. The increased survival and genomic diversity of titan cell progeny promote rapid adaptation to new or high-stress conditions. The ability to adapt to stress is a key element for survival of pathogenic microbes in the host and thus plays an important role in pathogenesis. Here we investigated the predominantly haploid human fungal pathogen Cryptococcus neoformans, which is capable of ploidy and cell size increases during infection through production of titan cells. The enlarged polyploid titan cells are then able to rapidly undergo ploidy reduction to generate progeny with reduced ploidy and/or aneuploidy. Under stressful conditions, titan cell progeny have a growth and survival advantage over typical cell progeny. Understanding how titan cells enhance the rate of cryptococcal adaptation under stress conditions may assist in the development of novel drugs aimed at blocking ploidy transitions. Copyright © 2015 Gerstein et al.
Parallel or convergent evolution in human population genomic data revealed by genotype networks.

PubMed

R Vahdati, Ali; Wagner, Andreas

2016-08-02

Genotype networks are representations of genetic variation data that are complementary to phylogenetic trees. A genotype network is a graph whose nodes are genotypes (DNA sequences) with the same broadly defined phenotype. Two nodes are connected if they differ in some minimal way, e.g., in a single nucleotide. We analyze human genome variation data from the 1,000 genomes project, and construct haploid genotype (haplotype) networks for 12,235 protein coding genes. The structure of these networks varies widely among genes, indicating different patterns of variation despite a shared evolutionary history. We focus on those genes whose genotype networks show many cycles, which can indicate homoplasy, i.e., parallel or convergent evolution, on the sequence level. For 42 genes, the observed number of cycles is so large that it cannot be explained by either chance homoplasy or recombination. When analyzing possible explanations, we discovered evidence for positive selection in 21 of these genes and, in addition, a potential role for constrained variation and purifying selection. Balancing selection plays at most a small role. The 42 genes with excess cycles are enriched in functions related to immunity and response to pathogens. Genotype networks are representations of genetic variation data that can help understand unusual patterns of genomic variation.
The DNA Methylome of Human Peripheral Blood Mononuclear Cells

PubMed Central

Ye, Mingzhi; Zheng, Hancheng; Yu, Jian; Wu, Honglong; Sun, Jihua; Zhang, Hongyu; Chen, Quan; Luo, Ruibang; Chen, Minfeng; He, Yinghua; Jin, Xin; Zhang, Qinghui; Yu, Chang; Zhou, Guangyu; Sun, Jinfeng; Huang, Yebo; Zheng, Huisong; Cao, Hongzhi; Zhou, Xiaoyu; Guo, Shicheng; Hu, Xueda; Li, Xin; Kristiansen, Karsten; Bolund, Lars; Xu, Jiujin; Wang, Wen; Yang, Huanming; Wang, Jian; Li, Ruiqiang; Beck, Stephan; Wang, Jun; Zhang, Xiuqing

2010-01-01

DNA methylation plays an important role in biological processes in human health and disease. Recent technological advances allow unbiased whole-genome DNA methylation (methylome) analysis to be carried out on human cells. Using whole-genome bisulfite sequencing at 24.7-fold coverage (12.3-fold per strand), we report a comprehensive (92.62%) methylome and analysis of the unique sequences in human peripheral blood mononuclear cells (PBMC) from the same Asian individual whose genome was deciphered in the YH project. PBMC constitute an important source for clinical blood tests world-wide. We found that 68.4% of CpG sites and <0.2% of non-CpG sites were methylated, demonstrating that non-CpG cytosine methylation is minor in human PBMC. Analysis of the PBMC methylome revealed a rich epigenomic landscape for 20 distinct genomic features, including regulatory, protein-coding, non-coding, RNA-coding, and repeat sequences. Integration of our methylome data with the YH genome sequence enabled a first comprehensive assessment of allele-specific methylation (ASM) between the two haploid methylomes of any individual and allowed the identification of 599 haploid differentially methylated regions (hDMRs) covering 287 genes. Of these, 76 genes had hDMRs within 2 kb of their transcriptional start sites of which >80% displayed allele-specific expression (ASE). These data demonstrate that ASM is a recurrent phenomenon and is highly correlated with ASE in human PBMCs. Together with recently reported similar studies, our study provides a comprehensive resource for future epigenomic research and confirms new sequencing technology as a paradigm for large-scale epigenomics studies. PMID:21085693
Chromosomes in a genome-wise order: evidence for metaphase architecture.

PubMed

Weise, Anja; Bhatt, Samarth; Piaszinski, Katja; Kosyakova, Nadezda; Fan, Xiaobo; Altendorf-Hofmann, Annelore; Tanomtong, Alongklod; Chaveerach, Arunrat; de Cioffi, Marcelo Bello; de Oliveira, Edivaldo; Walther, Joachim-U; Liehr, Thomas; Chaudhuri, Jyoti P

2016-01-01

One fundamental finding of the last decade is that, besides the primary DNA sequence information there are several epigenetic "information-layers" like DNA-and histone modifications, chromatin packaging and, last but not least, the position of genes in the nucleus. We postulate that the functional genomic architecture is not restricted to the interphase of the cell cycle but can also be observed in the metaphase stage, when chromosomes are most condensed and microscopically visible. If so, it offers the unique opportunity to directly analyze the functional aspects of genomic architecture in different cells, species and diseases. Another aspect not directly accessible by molecular techniques is the genome merged from two different haploid parental genomes represented by the homologous chromosome sets. Our results show that there is not only a well-known and defined nuclear architecture in interphase but also in metaphase leading to a bilateral organization of the two haploid sets of chromosomes. Moreover, evidence is provided for the parental origin of the haploid grouping. From our findings we postulate an additional epigenetic information layer within the genome including the organization of homologous chromosomes and their parental origin which may now substantially change the landscape of genetics.
Sequencing and assembly of the 22-gb loblolly pine genome.

PubMed

Zimin, Aleksey; Stevens, Kristian A; Crepeau, Marc W; Holtz-Morris, Ann; Koriabine, Maxim; Marçais, Guillaume; Puiu, Daniela; Roberts, Michael; Wegrzyn, Jill L; de Jong, Pieter J; Neale, David B; Salzberg, Steven L; Yorke, James A; Langley, Charles H

2014-03-01

Conifers are the predominant gymnosperm. The size and complexity of their genomes has presented formidable technical challenges for whole-genome shotgun sequencing and assembly. We employed novel strategies that allowed us to determine the loblolly pine (Pinus taeda) reference genome sequence, the largest genome assembled to date. Most of the sequence data were derived from whole-genome shotgun sequencing of a single megagametophyte, the haploid tissue of a single pine seed. Although that constrained the quantity of available DNA, the resulting haploid sequence data were well-suited for assembly. The haploid sequence was augmented with multiple linking long-fragment mate pair libraries from the parental diploid DNA. For the longest fragments, we used novel fosmid DiTag libraries. Sequences from the linking libraries that did not match the megagametophyte were identified and removed. Assembly of the sequence data were aided by condensing the enormous number of paired-end reads into a much smaller set of longer "super-reads," rendering subsequent assembly with an overlap-based assembly algorithm computationally feasible. To further improve the contiguity and biological utility of the genome sequence, additional scaffolding methods utilizing independent genome and transcriptome assemblies were implemented. The combination of these strategies resulted in a draft genome sequence of 20.15 billion bases, with an N50 scaffold size of 66.9 kbp.
Production of haploids from anther culture of banana [Musa balbisiana (BB)].

PubMed

Assani, A; Bakry, F; Kerbellec, F; Haïcour, R; Wenzel, G; Foroughi-Wehr, B

2003-02-01

We report here, for the first time, the production of haploid plants of banana Musa balbisiana (BB). Callus was induced from anthers in which the majority of the microspores were at the uninucleate stage. The frequency of callus induction was 77%. Callus proliferation usually preceded embryo formation. About 8% of the anthers developed androgenic embryos. Of the 147 plantlets obtained, 41 were haploids (n=x=11). The frequency of haploid production depended on genotypes used: 18 haploid plants were produced from genotype Pisang klutuk, 12 from Pisang batu, seven from Pisang klutuk wulung and four from Tani. The frequency of regeneration was 1.1%, which was based on the total number of anthers cultured. Diploid plants (2n=2x=22) were also observed in the regenerated plants. The haploid banana plants that were developed will be important material for the improvement of banana through breeding programmes.
Centromere Size and Its Relationship to Haploid Formation in Plants.

PubMed

Wang, Na; Dawe, R Kelly

2018-03-05

Wide species crosses often result in uniparental genome elimination and visible failures in centromere function. Crosses involving lines with mutated forms of the CENH3 histone variant that organizes the centromere/kinetochore interface have been shown to have similar effects, inducing haploids at high frequencies. Here, we propose a simple centromere size model that endeavors to explain both observations. It is based on the idea of a quantitative centromere architecture where each centromere in an individual is the same size, and the average size is dictated by a natural equilibrium between bound and unbound CENH3 (and its chaperones or binding proteins). While centromere size is determined by the cellular milieu, centromere positions are heritable and defined by the interactions of a small set of proteins that bind to both DNA and CENH3. Lines with defective or mutated CENH3 have a lower loading capacity and support smaller centromeres. In cases where a line with small or defective centromeres is crossed to a line with larger or normal centromeres, the smaller/defective centromeres are selectively degraded or not maintained, resulting in chromosome loss from the small-centromere parent. The model is testable and generalizable, and helps to explain the counterintuitive observation that inducer lines do not induce haploids when crossed to themselves. Copyright © 2017 The Author. Published by Elsevier Inc. All rights reserved.
Induced parthenogenesis by gamma-irradiated pollen in loquat for haploid production.

PubMed

Blasco, Manuel; Badenes, María Luisa; Del Mar Naval, María

2016-09-01

Successful haploid induction in loquat ( Eriobotrya japonica (Thunb.) Lindl.) through in situ-induced parthenogenesis with gamma-ray irradiated pollen has been achieved. Female flowers of cultivar 'Algerie' were pollinated using pollen of cultivars 'Changhong-3', 'Cox' and 'Saval Brasil' irradiated with two doses of gamma rays, 150 and 300 Gy. The fruits were harvested 90, 105 and 120 days after pollination (dap). Four haploid plants were obtained from 'Algerie' pollinated with 300-Gy-treated pollen of 'Saval Brasil' from fruits harvested 105 dap. Haploidy was confirmed by flow cytometry and chromosome count. The haploids showed a very weak development compared to the diploid plants. This result suggests that irradiated pollen can be used to obtain parthenogenetic haploids.
Gametic embryogenesis and haploid technology as valuable support to plant breeding.

PubMed

Germanà, Maria Antonietta

2011-05-01

Plant breeding is focused on continuously increasing crop production to meet the needs of an ever-growing world population, improving food quality to ensure a long and healthy life and address the problems of global warming and environment pollution, together with the challenges of developing novel sources of biofuels. The breeders' search for novel genetic combinations, with which to select plants with improved traits to satisfy both farmers and consumers, is endless. About half of the dramatic increase in crop yield obtained in the second half of the last century has been achieved thanks to the results of genetic improvement, while the residual advance has been due to the enhanced management techniques (pest and disease control, fertilization, and irrigation). Biotechnologies provide powerful tools for plant breeding, and among these ones, tissue culture, particularly haploid and doubled haploid technology, can effectively help to select superior plants. In fact, haploids (Hs), which are plants with gametophytic chromosome number, and doubled haploids (DHs), which are haploids that have undergone chromosome duplication, represent a particularly attractive biotechnological method to accelerate plant breeding. Currently, haploid technology, making possible through gametic embryogenesis the single-step development of complete homozygous lines from heterozygous parents, has already had a huge impact on agricultural systems of many agronomically important crops, representing an integral part in their improvement programmes. The aim of this review was to provide some background, recent advances, and future prospective on the employment of haploid technology through gametic embryogenesis as a powerful tool to support plant breeding.
Induced parthenogenesis by gamma-irradiated pollen in loquat for haploid production

PubMed Central

Blasco, Manuel; Badenes, María Luisa; del Mar Naval, María

2016-01-01

Successful haploid induction in loquat (Eriobotrya japonica (Thunb.) Lindl.) through in situ-induced parthenogenesis with gamma-ray irradiated pollen has been achieved. Female flowers of cultivar ‘Algerie’ were pollinated using pollen of cultivars ‘Changhong-3’, ‘Cox’ and ‘Saval Brasil’ irradiated with two doses of gamma rays, 150 and 300 Gy. The fruits were harvested 90, 105 and 120 days after pollination (dap). Four haploid plants were obtained from ‘Algerie’ pollinated with 300-Gy-treated pollen of ‘Saval Brasil’ from fruits harvested 105 dap. Haploidy was confirmed by flow cytometry and chromosome count. The haploids showed a very weak development compared to the diploid plants. This result suggests that irradiated pollen can be used to obtain parthenogenetic haploids. PMID:27795686
Transcriptome Analysis of Honeybee (Apis Mellifera) Haploid and Diploid Embryos Reveals Early Zygotic Transcription during Cleavage

PubMed Central

Pires, Camilla Valente; Freitas, Flávia Cristina de Paula; Cristino, Alexandre S.; Dearden, Peter K.; Simões, Zilá Luz Paulino

2016-01-01

In honeybees, the haplodiploid sex determination system promotes a unique embryogenesis process wherein females develop from fertilized eggs and males develop from unfertilized eggs. However, the developmental strategies of honeybees during early embryogenesis are virtually unknown. Similar to most animals, the honeybee oocytes are supplied with proteins and regulatory elements that support early embryogenesis. As the embryo develops, the zygotic genome is activated and zygotic products gradually replace the preloaded maternal material. The analysis of small RNA and mRNA libraries of mature oocytes and embryos originated from fertilized and unfertilized eggs has allowed us to explore the gene expression dynamics in the first steps of development and during the maternal-to-zygotic transition (MZT). We localized a short sequence motif identified as TAGteam motif and hypothesized to play a similar role in honeybees as in fruit flies, which includes the timing of early zygotic expression (MZT), a function sustained by the presence of the zelda ortholog, which is the main regulator of genome activation. Predicted microRNA (miRNA)-target interactions indicated that there were specific regulators of haploid and diploid embryonic development and an overlap of maternal and zygotic gene expression during the early steps of embryogenesis. Although a number of functions are highly conserved during the early steps of honeybee embryogenesis, the results showed that zygotic genome activation occurs earlier in honeybees than in Drosophila based on the presence of three primary miRNAs (pri-miRNAs) (ame-mir-375, ame-mir-34 and ame-mir-263b) during the cleavage stage in haploid and diploid embryonic development. PMID:26751956

A haploid system of sex determination in the brown alga Ectocarpus sp.

PubMed

Ahmed, Sophia; Cock, J Mark; Pessia, Eugenie; Luthringer, Remy; Cormier, Alexandre; Robuchon, Marine; Sterck, Lieven; Peters, Akira F; Dittami, Simon M; Corre, Erwan; Valero, Myriam; Aury, Jean-Marc; Roze, Denis; Van de Peer, Yves; Bothwell, John; Marais, Gabriel A B; Coelho, Susana M

2014-09-08

A common feature of most genetic sex-determination systems studied so far is that sex is determined by nonrecombining genomic regions, which can be of various sizes depending on the species. These regions have evolved independently and repeatedly across diverse groups. A number of such sex-determining regions (SDRs) have been studied in animals, plants, and fungi, but very little is known about the evolution of sexes in other eukaryotic lineages. We report here the sequencing and genomic analysis of the SDR of Ectocarpus, a brown alga that has been evolving independently from plants, animals, and fungi for over one giga-annum. In Ectocarpus, sex is expressed during the haploid phase of the life cycle, and both the female (U) and the male (V) sex chromosomes contain nonrecombining regions. The U and V of this species have been diverging for more than 70 mega-annum, yet gene degeneration has been modest, and the SDR is relatively small, with no evidence for evolutionary strata. These features may be explained by the occurrence of strong purifying selection during the haploid phase of the life cycle and the low level of sexual dimorphism. V is dominant over U, suggesting that femaleness may be the default state, adopted when the male haplotype is absent. The Ectocarpus UV system has clearly had a distinct evolutionary trajectory not only to the well-studied XY and ZW systems but also to the UV systems described so far. Nonetheless, some striking similarities exist, indicating remarkable universality of the underlying processes shaping sex chromosome evolution across distant lineages. Copyright © 2014 Elsevier Ltd. All rights reserved.
Production of viable homozygous, doubled haploid channel catfish (Ictalurus punctatus)

USDA-ARS?s Scientific Manuscript database

Production of doubled haploids via mitotic gynogenesis is a useful tool for the creation of completely inbred fish. In order to produce viable doubled haploid channel catfish, we utilized hydrostatic pressure or thermal treatments on eggs fertilized with sperm that had been exposed to ultraviolet l...
Meiosis and Haploid Gametes in the Pathogen Trypanosoma brucei

PubMed Central

Peacock, Lori; Bailey, Mick; Carrington, Mark; Gibson, Wendy

2014-01-01

Summary In eukaryote pathogens, sex is an important driving force in spreading genes for drug resistance, pathogenicity, and virulence [1]. For the parasitic trypanosomes that cause African sleeping sickness, mating occurs during transmission by the tsetse vector [2, 3] and involves meiosis [4], but haploid gametes have not yet been identified. Here, we show that meiosis is a normal part of development in the insect salivary glands for all subspecies of Trypanosoma brucei, including the human pathogens. By observing insect-derived trypanosomes during the window of peak expression of meiosis-specific genes, we identified promastigote-like (PL) cells that interacted with each other via their flagella and underwent fusion, as visualized by the mixing of cytoplasmic red and green fluorescent proteins. PL cells had a short, wide body, a very long anterior flagellum, and either one or two kinetoplasts, but only the anterior kinetoplast was associated with the flagellum. Measurement of nuclear DNA contents showed that PL cells were haploid relative to diploid metacyclics. Trypanosomes are among the earliest diverging eukaryotes, and our results support the hypothesis that meiosis and sexual reproduction are ubiquitous in eukaryotes and likely to have been early innovations [5]. PMID:24388851
Maximization of Markers Linked in Coupling for Tetraploid Potatoes via Monoparental Haploids

PubMed Central

Bartkiewicz, Annette M.; Chilla, Friederike; Terefe-Ayana, Diro; Lübeck, Jens; Strahwald, Josef; Tacke, Eckhard; Hofferbert, Hans-Reinhard; Linde, Marcus; Debener, Thomas

2018-01-01

Haploid potato populations derived from a single tetraploid donor constitute an efficient strategy to analyze markers segregating from a single donor genotype. Analysis of marker segregation in populations derived from crosses between polysomic tetraploids is complicated by a maximum of eight segregating alleles, multiple dosages of the markers and problems related to linkage analysis of marker segregation in repulsion. Here, we present data on two monoparental haploid populations generated by prickle pollination of two tetraploid cultivars with Solanum phureja and genotyped with the 12.8 k SolCAP single nucleotide polymorphism (SNP) array. We show that in a population of monoparental haploids, the number of biallelic SNP markers segregating in linkage to loci from the tetraploid donor genotype is much larger than in putative crosses of this genotype to a diverse selection of 125 tetraploid cultivars. Although this strategy is more laborious than conventional breeding, the generation of haploid progeny for efficient marker analysis is straightforward if morphological markers and flow cytometry are utilized to select true haploid progeny. The level of introgressed fragments from S. phureja, the haploid inducer, is very low, supporting its suitability for genetic analysis. Mapping with single-dose markers allowed the analysis of quantitative trait loci (QTL) for four phenotypic traits. PMID:29868076
A comprehensively molecular haplotype-resolved genome of a European individual

PubMed Central

Suk, Eun-Kyung; McEwen, Gayle K.; Duitama, Jorge; Nowick, Katja; Schulz, Sabrina; Palczewski, Stefanie; Schreiber, Stefan; Holloway, Dustin T.; McLaughlin, Stephen; Peckham, Heather; Lee, Clarence; Huebsch, Thomas; Hoehe, Margret R.

2011-01-01

Independent determination of both haplotype sequences of an individual genome is essential to relate genetic variation to genome function, phenotype, and disease. To address the importance of phase, we have generated the most complete haplotype-resolved genome to date, “Max Planck One” (MP1), by fosmid pool-based next generation sequencing. Virtually all SNPs (>99%) and 80,000 indels were phased into haploid sequences of up to 6.3 Mb (N50 ∼1 Mb). The completeness of phasing allowed determination of the concrete molecular haplotype pairs for the vast majority of genes (81%) including potential regulatory sequences, of which >90% were found to be constituted by two different molecular forms. A subset of 159 genes with potentially severe mutations in either cis or trans configurations exemplified in particular the role of phase for gene function, disease, and clinical interpretation of personal genomes (e.g., BRCA1). Extended genomic regions harboring manifold combinations of physically and/or functionally related genes and regulatory elements were resolved into their underlying “haploid landscapes,” which may define the functional genome. Moreover, the majority of genes and functional sequences were found to contain individual or rare SNPs, which cannot be phased from population data alone, emphasizing the importance of molecular phasing for characterizing a genome in its molecular individuality. Our work provides the foundation to understand that the distinction of molecular haplotypes is essential to resolve the (inherently individual) biology of genes, genomes, and disease, establishing a reference point for “phase-sensitive” personal genomics. MP1's annotated haploid genomes are available as a public resource. PMID:21813624
The human genome contracts again.

PubMed

Pavlichin, Dmitri S; Weissman, Tsachy; Yona, Golan

2013-09-01

The number of human genomes that have been sequenced completely for different individuals has increased rapidly in recent years. Storing and transferring complete genomes between computers for the purpose of applying various applications and analysis tools will soon become a major hurdle, hindering the analysis phase. Therefore, there is a growing need to compress these data efficiently. Here, we describe a technique to compress human genomes based on entropy coding, using a reference genome and known Single Nucleotide Polymorphisms (SNPs). Furthermore, we explore several intrinsic features of genomes and information in other genomic databases to further improve the compression attained. Using these methods, we compress James Watson's genome to 2.5 megabytes (MB), improving on recent work by 37%. Similar compression is obtained for most genomes available from the 1000 Genomes Project. Our biologically inspired techniques promise even greater gains for genomes of lower organisms and for human genomes as more genomic data become available. Code is available at sourceforge.net/projects/genomezip/
Genomic Hypomethylation in the Human Germline Associates with Selective Structural Mutability in the Human Genome

PubMed Central

Li, Jian; Harris, R. Alan; Cheung, Sau Wai; Coarfa, Cristian; Jeong, Mira; Goodell, Margaret A.; White, Lisa D.; Patel, Ankita; Kang, Sung-Hae; Shaw, Chad; Chinault, A. Craig; Gambin, Tomasz; Gambin, Anna; Lupski, James R.; Milosavljevic, Aleksandar

2012-01-01

The hotspots of structural polymorphisms and structural mutability in the human genome remain to be explained mechanistically. We examine associations of structural mutability with germline DNA methylation and with non-allelic homologous recombination (NAHR) mediated by low-copy repeats (LCRs). Combined evidence from four human sperm methylome maps, human genome evolution, structural polymorphisms in the human population, and previous genomic and disease studies consistently points to a strong association of germline hypomethylation and genomic instability. Specifically, methylation deserts, the ∼1% fraction of the human genome with the lowest methylation in the germline, show a tenfold enrichment for structural rearrangements that occurred in the human genome since the branching of chimpanzee and are highly enriched for fast-evolving loci that regulate tissue-specific gene expression. Analysis of copy number variants (CNVs) from 400 human samples identified using a custom-designed array comparative genomic hybridization (aCGH) chip, combined with publicly available structural variation data, indicates that association of structural mutability with germline hypomethylation is comparable in magnitude to the association of structural mutability with LCR–mediated NAHR. Moreover, rare CNVs occurring in the genomes of individuals diagnosed with schizophrenia, bipolar disorder, and developmental delay and de novo CNVs occurring in those diagnosed with autism are significantly more concentrated within hypomethylated regions. These findings suggest a new connection between the epigenome, selective mutability, evolution, and human disease. PMID:22615578
Meiosis and haploid gametes in the pathogen Trypanosoma brucei.

PubMed

Peacock, Lori; Bailey, Mick; Carrington, Mark; Gibson, Wendy

2014-01-20

In eukaryote pathogens, sex is an important driving force in spreading genes for drug resistance, pathogenicity, and virulence. For the parasitic trypanosomes that cause African sleeping sickness, mating occurs during transmission by the tsetse vector and involves meiosis, but haploid gametes have not yet been identified. Here, we show that meiosis is a normal part of development in the insect salivary glands for all subspecies of Trypanosoma brucei, including the human pathogens. By observing insect-derived trypanosomes during the window of peak expression of meiosis-specific genes, we identified promastigote-like (PL) cells that interacted with each other via their flagella and underwent fusion, as visualized by the mixing of cytoplasmic red and green fluorescent proteins. PL cells had a short, wide body, a very long anterior flagellum, and either one or two kinetoplasts, but only the anterior kinetoplast was associated with the flagellum. Measurement of nuclear DNA contents showed that PL cells were haploid relative to diploid metacyclics. Trypanosomes are among the earliest diverging eukaryotes, and our results support the hypothesis that meiosis and sexual reproduction are ubiquitous in eukaryotes and likely to have been early innovations. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
The evolution of sex chromosomes in organisms with separate haploid sexes.

PubMed

Immler, Simone; Otto, Sarah Perin

2015-03-01

The evolution of dimorphic sex chromosomes is driven largely by the evolution of reduced recombination and the subsequent accumulation of deleterious mutations. Although these processes are increasingly well understood in diploid organisms, the evolution of dimorphic sex chromosomes in haploid organisms (U/V) has been virtually unstudied theoretically. We analyze a model to investigate the evolution of linkage between fitness loci and the sex-determining region in U/V species. In a second step, we test how prone nonrecombining regions are to degeneration due to accumulation of deleterious mutations. Our modeling predicts that the decay of recombination on the sex chromosomes and the addition of strata via fusions will be just as much a part of the evolution of haploid sex chromosomes as in diploid sex chromosome systems. Reduced recombination is broadly favored, as long as there is some fitness difference between haploid males and females. The degeneration of the sex-determining region due to the accumulation of deleterious mutations is expected to be slower in haploid organisms because of the absence of masking. Nevertheless, balancing selection often drives greater differentiation between the U/V sex chromosomes than in X/Y and Z/W systems. We summarize empirical evidence for haploid sex chromosome evolution and discuss our predictions in light of these findings. © 2015 The Author(s).
Human Contamination in Public Genome Assemblies.

PubMed

Kryukov, Kirill; Imanishi, Tadashi

2016-01-01

Contamination in genome assembly can lead to wrong or confusing results when using such genome as reference in sequence comparison. Although bacterial contamination is well known, the problem of human-originated contamination received little attention. In this study we surveyed 45,735 available genome assemblies for evidence of human contamination. We used lineage specificity to distinguish between contamination and conservation. We found that 154 genome assemblies contain fragments that with high confidence originate as contamination from human DNA. Majority of contaminating human sequences were present in the reference human genome assembly for over a decade. We recommend that existing contaminated genomes should be revised to remove contaminated sequence, and that new assemblies should be thoroughly checked for presence of human DNA before submitting them to public databases.
PopHuman: the human population genomics browser.

PubMed

Casillas, Sònia; Mulet, Roger; Villegas-Mirón, Pablo; Hervas, Sergi; Sanz, Esteve; Velasco, Daniel; Bertranpetit, Jaume; Laayouni, Hafid; Barbadilla, Antonio

2018-01-04

The 1000 Genomes Project (1000GP) represents the most comprehensive world-wide nucleotide variation data set so far in humans, providing the sequencing and analysis of 2504 genomes from 26 populations and reporting >84 million variants. The availability of this sequence data provides the human lineage with an invaluable resource for population genomics studies, allowing the testing of molecular population genetics hypotheses and eventually the understanding of the evolutionary dynamics of genetic variation in human populations. Here we present PopHuman, a new population genomics-oriented genome browser based on JBrowse that allows the interactive visualization and retrieval of an extensive inventory of population genetics metrics. Efficient and reliable parameter estimates have been computed using a novel pipeline that faces the unique features and limitations of the 1000GP data, and include a battery of nucleotide variation measures, divergence and linkage disequilibrium parameters, as well as different tests of neutrality, estimated in non-overlapping windows along the chromosomes and in annotated genes for all 26 populations of the 1000GP. PopHuman is open and freely available at http://pophuman.uab.cat. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
PopHuman: the human population genomics browser

PubMed Central

Mulet, Roger; Villegas-Mirón, Pablo; Hervas, Sergi; Sanz, Esteve; Velasco, Daniel; Bertranpetit, Jaume; Laayouni, Hafid

2018-01-01

Abstract The 1000 Genomes Project (1000GP) represents the most comprehensive world-wide nucleotide variation data set so far in humans, providing the sequencing and analysis of 2504 genomes from 26 populations and reporting >84 million variants. The availability of this sequence data provides the human lineage with an invaluable resource for population genomics studies, allowing the testing of molecular population genetics hypotheses and eventually the understanding of the evolutionary dynamics of genetic variation in human populations. Here we present PopHuman, a new population genomics-oriented genome browser based on JBrowse that allows the interactive visualization and retrieval of an extensive inventory of population genetics metrics. Efficient and reliable parameter estimates have been computed using a novel pipeline that faces the unique features and limitations of the 1000GP data, and include a battery of nucleotide variation measures, divergence and linkage disequilibrium parameters, as well as different tests of neutrality, estimated in non-overlapping windows along the chromosomes and in annotated genes for all 26 populations of the 1000GP. PopHuman is open and freely available at http://pophuman.uab.cat. PMID:29059408
Human genetics and genomics a decade after the release of the draft sequence of the human genome.

PubMed

Naidoo, Nasheen; Pawitan, Yudi; Soong, Richie; Cooper, David N; Ku, Chee-Seng

2011-10-01

Substantial progress has been made in human genetics and genomics research over the past ten years since the publication of the draft sequence of the human genome in 2001. Findings emanating directly from the Human Genome Project, together with those from follow-on studies, have had an enormous impact on our understanding of the architecture and function of the human genome. Major developments have been made in cataloguing genetic variation, the International HapMap Project, and with respect to advances in genotyping technologies. These developments are vital for the emergence of genome-wide association studies in the investigation of complex diseases and traits. In parallel, the advent of high-throughput sequencing technologies has ushered in the 'personal genome sequencing' era for both normal and cancer genomes, and made possible large-scale genome sequencing studies such as the 1000 Genomes Project and the International Cancer Genome Consortium. The high-throughput sequencing and sequence-capture technologies are also providing new opportunities to study Mendelian disorders through exome sequencing and whole-genome sequencing. This paper reviews these major developments in human genetics and genomics over the past decade.
Human genetics and genomics a decade after the release of the draft sequence of the human genome

PubMed Central

2011-01-01

Substantial progress has been made in human genetics and genomics research over the past ten years since the publication of the draft sequence of the human genome in 2001. Findings emanating directly from the Human Genome Project, together with those from follow-on studies, have had an enormous impact on our understanding of the architecture and function of the human genome. Major developments have been made in cataloguing genetic variation, the International HapMap Project, and with respect to advances in genotyping technologies. These developments are vital for the emergence of genome-wide association studies in the investigation of complex diseases and traits. In parallel, the advent of high-throughput sequencing technologies has ushered in the 'personal genome sequencing' era for both normal and cancer genomes, and made possible large-scale genome sequencing studies such as the 1000 Genomes Project and the International Cancer Genome Consortium. The high-throughput sequencing and sequence-capture technologies are also providing new opportunities to study Mendelian disorders through exome sequencing and whole-genome sequencing. This paper reviews these major developments in human genetics and genomics over the past decade. PMID:22155605
Mapping PrBn and Other Quantitative Trait Loci Responsible for the Control of Homeologous Chromosome Pairing in Oilseed Rape (Brassica napus L.) Haploids

PubMed Central

Liu, Zhiqian; Adamczyk, Katarzyna; Manzanares-Dauleux, Maria; Eber, Frédérique; Lucas, Marie-Odile; Delourme, Régine; Chèvre, Anne Marie; Jenczewski, Eric

2006-01-01

In allopolyploid species, fair meiosis could be challenged by homeologous chromosome pairing and is usually achieved by the action of homeologous pairing suppressor genes. Oilseed rape (Brassica napus) haploids (AC, n = 19) represent an attractive model for studying the mechanisms used by allopolyploids to ensure the diploid-like meiotic pairing pattern. In oilseed rape haploids, homeologous chromosome pairing at metaphase I was found to be genetically based and controlled by a major gene, PrBn, segregating in a background of polygenic variation. In this study, we have mapped PrBn within a 10-cM interval on the C genome linkage group DY15 and shown that PrBn displays incomplete penetrance or variable expressivity. We have identified three to six minor QTL/BTL that have slight additive effects on the amount of pairing at metaphase I but do not interact with PrBn. We have also detected a number of other loci that interact epistatically, notably with PrBn. Our results support the idea that, as in other polyploid species, metaphase I homeologous pairing in oilseed rape haploids is controlled by an integrated system of several genes, which function in a complex manner. PMID:16951054
Competition between the sperm of a single male can increase the evolutionary rate of haploid expressed genes.

PubMed

Ezawa, Kiyoshi; Innan, Hideki

2013-07-01

The population genetic behavior of mutations in sperm genes is theoretically investigated. We modeled the processes at two levels. One is the standard population genetic process, in which the population allele frequencies change generation by generation, depending on the difference in selective advantages. The other is the sperm competition during each genetic transmission from one generation to the next generation. For the sperm competition process, we formulate the situation where a huge number of sperm with alleles A and B, produced by a single heterozygous male, compete to fertilize a single egg. This "minimal model" demonstrates that a very slight difference in sperm performance amounts to quite a large difference between the alleles' winning probabilities. By incorporating this effect of paternity-sharing sperm competition into the standard population genetic process, we show that fierce sperm competition can enhance the fixation probability of a mutation with a very small phenotypic effect at the single-sperm level, suggesting a contribution of sperm competition to rapid amino acid substitutions in haploid-expressed sperm genes. Considering recent genome-wide demonstrations that a substantial fraction of the mammalian sperm genes are haploid expressed, our model could provide a potential explanation of rapid evolution of sperm genes with a wide variety of functions (as long as they are expressed in the haploid phase). Another advantage of our model is that it is applicable to a wide range of species, irrespective of whether the species is externally fertilizing, polygamous, or monogamous. The theoretical result was applied to mammalian data to estimate the selection intensity on nonsynonymous mutations in sperm genes.
Genetic Analysis of Haploids from Industrial Strains of Baker's Yeast

PubMed Central

Oda, Yuji; Ouchi, Kozo

1989-01-01

Strains of baker's yeast conventionally used by the baking industry in Japan were tested for the ability to sporulate and produce viable haploid spores. Three isolates which possessed the properties of baker's yeasts were obtained from single spores. Each strain was a haploid, and one of these strains, YOY34, was characterized. YOY34 fermented maltose and sucrose, but did not utilize galactose, unlike its parental strain. Genetic analysis showed that YOY34 carried two MAL genes, one functional and one cryptic; two SUC genes; and one defective gal gene. The genotype of YOY34 was identified as MATα MAL1 MAL3g SUC2 SUC4 gall. The MAL1 gene from this haploid was constitutively expressed, was dominant over other wild-type MAL tester genes, and gave a weak sucrose fermentation. YOY34 was suitable for both bakery products, like conventional baker's yeasts, and for genetic analysis, like laboratory strains. PMID:16347967
Human Germline Genome Editing.

PubMed

Ormond, Kelly E; Mortlock, Douglas P; Scholes, Derek T; Bombard, Yvonne; Brody, Lawrence C; Faucett, W Andrew; Garrison, Nanibaa' A; Hercher, Laura; Isasi, Rosario; Middleton, Anna; Musunuru, Kiran; Shriner, Daniel; Virani, Alice; Young, Caroline E

2017-08-03

With CRISPR/Cas9 and other genome-editing technologies, successful somatic and germline genome editing are becoming feasible. To respond, an American Society of Human Genetics (ASHG) workgroup developed this position statement, which was approved by the ASHG Board in March 2017. The workgroup included representatives from the UK Association of Genetic Nurses and Counsellors, Canadian Association of Genetic Counsellors, International Genetic Epidemiology Society, and US National Society of Genetic Counselors. These groups, as well as the American Society for Reproductive Medicine, Asia Pacific Society of Human Genetics, British Society for Genetic Medicine, Human Genetics Society of Australasia, Professional Society of Genetic Counselors in Asia, and Southern African Society for Human Genetics, endorsed the final statement. The statement includes the following positions. (1) At this time, given the nature and number of unanswered scientific, ethical, and policy questions, it is inappropriate to perform germline gene editing that culminates in human pregnancy. (2) Currently, there is no reason to prohibit in vitro germline genome editing on human embryos and gametes, with appropriate oversight and consent from donors, to facilitate research on the possible future clinical applications of gene editing. There should be no prohibition on making public funds available to support this research. (3) Future clinical application of human germline genome editing should not proceed unless, at a minimum, there is (a) a compelling medical rationale, (b) an evidence base that supports its clinical use, (c) an ethical justification, and (d) a transparent public process to solicit and incorporate stakeholder input. Copyright © 2017 American Society of Human Genetics. All rights reserved.
Human centromere genomics: now it's personal.

PubMed

Hayden, Karen E

2012-07-01

Advances in human genomics have accelerated studies in evolution, disease, and cellular regulation. However, centromere sequences, defining the chromosomal interface with spindle microtubules, remain largely absent from ongoing genomic studies and disconnected from functional, genome-wide analyses. This disparity results from the challenge of predicting the linear order of multi-megabase-sized regions that are composed almost entirely of near-identical satellite DNA. Acknowledging these challenges, the field of human centromere genomics possesses the potential to rapidly advance given the availability of individual, or personalized, genome projects matched with the promise of long-read sequencing technologies. Here I review the current genomic model of human centromeres in consideration of those studies involving functional datasets that examine the role of sequence in centromere identity.
The Evolution of the Human Genome

PubMed Central

Simonti, Corinne N.; Capra, John A.

2015-01-01

Human genomes hold a record of the evolutionary forces that have shaped our species. Advances in DNA sequencing, functional genomics, and population genetic modeling have deepened our understanding of human demographic history, natural selection, and many other long-studied topics. These advances have also revealed many previously underappreciated factors that influence the evolution of the human genome, including functional modifications to DNA and histones, conserved 3D topological chromatin domains, structural variation, and heterogeneous mutation patterns along the genome. Using evolutionary theory as a lens to study these phenomena will lead to significant breakthroughs in understanding what makes us human and why we get sick. PMID:26338498

The human genome: a multifractal analysis

PubMed Central

2011-01-01

Background Several studies have shown that genomes can be studied via a multifractal formalism. Recently, we used a multifractal approach to study the genetic information content of the Caenorhabditis elegans genome. Here we investigate the possibility that the human genome shows a similar behavior to that observed in the nematode. Results We report here multifractality in the human genome sequence. This behavior correlates strongly on the presence of Alu elements and to a lesser extent on CpG islands and (G+C) content. In contrast, no or low relationship was found for LINE, MIR, MER, LTRs elements and DNA regions poor in genetic information. Gene function, cluster of orthologous genes, metabolic pathways, and exons tended to increase their frequencies with ranges of multifractality and large gene families were located in genomic regions with varied multifractality. Additionally, a multifractal map and classification for human chromosomes are proposed. Conclusions Based on these findings, we propose a descriptive non-linear model for the structure of the human genome, with some biological implications. This model reveals 1) a multifractal regionalization where many regions coexist that are far from equilibrium and 2) this non-linear organization has significant molecular and medical genetic implications for understanding the role of Alu elements in genome stability and structure of the human genome. Given the role of Alu sequences in gene regulation, genetic diseases, human genetic diversity, adaptation and phylogenetic analyses, these quantifications are especially useful. PMID:21999602
Human Genome Research: Decoding DNA

Science.gov Websites

instructions for making all the protein molecules for all the different kinds of cells of the human body dropdown arrow Site Map A-Z Index Menu Synopsis Human Genome Research: Decoding DNA Resources with DeLisi played a pivotal role in proposing and initiating the Human Genome Program in 1986. The U.S
The bonobo genome compared with the chimpanzee and human genomes

PubMed Central

Prüfer, Kay; Munch, Kasper; Hellmann, Ines; Akagi, Keiko; Miller, Jason R.; Walenz, Brian; Koren, Sergey; Sutton, Granger; Kodira, Chinnappa; Winer, Roger; Knight, James R.; Mullikin, James C.; Meader, Stephen J.; Ponting, Chris P.; Lunter, Gerton; Higashino, Saneyuki; Hobolth, Asger; Dutheil, Julien; Karakoç, Emre; Alkan, Can; Sajjadian, Saba; Catacchio, Claudia Rita; Ventura, Mario; Marques-Bonet, Tomas; Eichler, Evan E.; André, Claudine; Atencia, Rebeca; Mugisha, Lawrence; Junhold, Jörg; Patterson, Nick; Siebauer, Michael; Good, Jeffrey M.; Fischer, Anne; Ptak, Susan E.; Lachmann, Michael; Symer, David E.; Mailund, Thomas; Schierup, Mikkel H.; Andrés, Aida M.; Kelso, Janet; Pääbo, Svante

2012-01-01

Two African apes are the closest living relatives of humans: the chimpanzee (Pan troglodytes) and the bonobo (Pan paniscus). Although they are similar in many respects, bonobos and chimpanzees differ strikingly in key social and sexual behaviours1–4, and for some of these traits they show more similarity with humans than with each other. Here we report the sequencing and assembly of the bonobo genome to study its evolutionary relationship with the chimpanzee and human genomes. We find that more than three per cent of the human genome is more closely related to either the bonobo or the chimpanzee genome than these are to each other. These regions allow various aspects of the ancestry of the two ape species to be reconstructed. In addition, many of the regions that overlap genes may eventually help us understand the genetic basis of phenotypes that humans share with one of the two apes to the exclusion of the other. PMID:22722832
Human genome. 1993 Program report

DOE Office of Scientific and Technical Information (OSTI.GOV)

Not Available

1994-03-01

The purpose of this report is to update the Human Genome 1991-92 Program Report and provide new information on the DOE genome program to researchers, program managers, other government agencies, and the interested public. This FY 1993 supplement includes abstracts of 60 new or renewed projects and listings of 112 continuing and 28 completed projects. These two reports, taken together, present the most complete published view of the DOE Human Genome Program through FY 1993. Research is progressing rapidly toward 15-year goals of mapping and sequencing the DNA of each of the 24 different human chromosomes.
Genomic Selection Outperforms Marker Assisted Selection for Grain Yield and Physiological Traits in a Maize Doubled Haploid Population Across Water Treatments.

PubMed

Cerrudo, Diego; Cao, Shiliang; Yuan, Yibing; Martinez, Carlos; Suarez, Edgar Antonio; Babu, Raman; Zhang, Xuecai; Trachsel, Samuel

2018-01-01

To increase genetic gain for tolerance to drought, we aimed to identify environmentally stable QTL in per se and testcross combination under well-watered (WW) and drought stressed (DS) conditions and evaluate the possible deployment of QTL using marker assisted and/or genomic selection (QTL/GS-MAS). A total of 169 doubled haploid lines derived from the cross between CML495 and LPSC7F64 and 190 testcrosses (tester CML494) were evaluated in a total of 11 treatment-by-population combinations under WW and DS conditions. In response to DS, grain yield (GY) and plant height (PHT) were reduced while time to anthesis and the anthesis silking interval (ASI) increased for both lines and hybrids. Forty-eight QTL were detected for a total of nine traits. The allele derived from CML495 generally increased trait values for anthesis, ASI, PHT, the normalized difference vegetative index (NDVI) and the green leaf area duration (GLAD; a composite trait of NDVI, PHT and senescence) while it reduced trait values for leaf rolling and senescence. The LOD scores for all detected QTL ranged from 2.0 to 7.2 explaining 4.4 to 19.4% of the observed phenotypic variance with R 2 ranging from 0 (GY, DS, lines) to 37.3% (PHT, WW, lines). Prediction accuracy of the model used for genomic selection was generally higher than phenotypic variance explained by the sum of QTL for individual traits indicative of the polygenic control of traits evaluated here. We therefore propose to use QTL-MAS in forward breeding to enrich the allelic frequency for a few desired traits with strong additive QTL in early selection cycles while GS-MAS could be used in more mature breeding programs to additionally capture alleles with smaller additive effects.
Competition Between the Sperm of a Single Male Can Increase the Evolutionary Rate of Haploid Expressed Genes

PubMed Central

Ezawa, Kiyoshi; Innan, Hideki

2013-01-01

The population genetic behavior of mutations in sperm genes is theoretically investigated. We modeled the processes at two levels. One is the standard population genetic process, in which the population allele frequencies change generation by generation, depending on the difference in selective advantages. The other is the sperm competition during each genetic transmission from one generation to the next generation. For the sperm competition process, we formulate the situation where a huge number of sperm with alleles A and B, produced by a single heterozygous male, compete to fertilize a single egg. This “minimal model” demonstrates that a very slight difference in sperm performance amounts to quite a large difference between the alleles’ winning probabilities. By incorporating this effect of paternity-sharing sperm competition into the standard population genetic process, we show that fierce sperm competition can enhance the fixation probability of a mutation with a very small phenotypic effect at the single-sperm level, suggesting a contribution of sperm competition to rapid amino acid substitutions in haploid-expressed sperm genes. Considering recent genome-wide demonstrations that a substantial fraction of the mammalian sperm genes are haploid expressed, our model could provide a potential explanation of rapid evolution of sperm genes with a wide variety of functions (as long as they are expressed in the haploid phase). Another advantage of our model is that it is applicable to a wide range of species, irrespective of whether the species is externally fertilizing, polygamous, or monogamous. The theoretical result was applied to mammalian data to estimate the selection intensity on nonsynonymous mutations in sperm genes. PMID:23666936
Maize Haploid Induction and Doubling II – Experience with Exotic and Elite Maize Populations

USDA-ARS?s Scientific Manuscript database

As a follow-up to our previous study, second year information will be presented addressing questions on haploid induction and doubling, utilizing exotic and elite maize. These projects result from collaborations between Iowa State Doubled Haploid Facility (http://www.plantbreeding.iastate.edu/DHF/D...
Maize Haploid Induction and Doubling – Recent Experience with Exotic and Elite Maize Populations

USDA-ARS?s Scientific Manuscript database

Experience from three maize research projects utilizing the haploid inducer RWS x RWK-76 from the University of Hohenheim will be summarized. These projects result from collaborations between Iowa State Doubled Haploid Facility (http://www.plantbreeding.iastate.edu/DHF/DHF.htm) researchers and USDA...
Human Genome Program

DOE Office of Scientific and Technical Information (OSTI.GOV)

Not Available

1993-01-01

The DOE Human Genome program has grown tremendously, as shown by the marked increase in the number of genome-funded projects since the last workshop held in 1991. The abstracts in this book describe the genome research of DOE-funded grantees and contractors and invited guests, and all projects are represented at the workshop by posters. The 3-day meeting includes plenary sessions on ethical, legal, and social issues pertaining to the availability of genetic data; sequencing techniques, informatics support; and chromosome and cDNA mapping and sequencing.
Germplasm enhancement of maize: A look into haploid induction and chromosomal doubling of haploids from temperate-adapted tropical sources

USDA-ARS?s Scientific Manuscript database

Doubled haploid technology is used to develop completely homozygous inbred lines, where each of the chromatids making up a chromosome pair are identical. Two inbred lines, PHB47 and PHZ51, were used to make backcrosses to 18 maize landraces, generating 36 populations. The landraces were chosen bas...
A Genome-Wide Association Study Identifies Genomic Regions for Virulence in the Non-Model Organism Heterobasidion annosum s.s

PubMed Central

Dalman, Kerstin; Himmelstrand, Kajsa; Olson, Åke; Lind, Mårten; Brandström-Durling, Mikael; Stenlid, Jan

2013-01-01

The dense single nucleotide polymorphisms (SNP) panels needed for genome wide association (GWA) studies have hitherto been expensive to establish and use on non-model organisms. To overcome this, we used a next generation sequencing approach to both establish SNPs and to determine genotypes. We conducted a GWA study on a fungal species, analysing the virulence of Heterobasidion annosum s.s., a necrotrophic pathogen, on its hosts Picea abies and Pinus sylvestris. From a set of 33,018 single nucleotide polymorphisms (SNP) in 23 haploid isolates, twelve SNP markers distributed on seven contigs were associated with virulence (P<0.0001). Four of the contigs harbour known virulence genes from other fungal pathogens and the remaining three harbour novel candidate genes. Two contigs link closely to virulence regions recognized previously by QTL mapping in the congeneric hybrid H. irregulare × H. occidentale. Our study demonstrates the efficiency of GWA studies for dissecting important complex traits of small populations of non-model haploid organisms with small genomes. PMID:23341945
Mapping and Sequencing the Human Genome

DOE R&D Accomplishments Database

1988-01-01

Numerous meetings have been held and a debate has developed in the biological community over the merits of mapping and sequencing the human genome. In response a committee to examine the desirability and feasibility of mapping and sequencing the human genome was formed to suggest options for implementing the project. The committee asked many questions. Should the analysis of the human genome be left entirely to the traditionally uncoordinated, but highly successful, support systems that fund the vast majority of biomedical research. Or should a more focused and coordinated additional support system be developed that is limited to encouraging and facilitating the mapping and eventual sequencing of the human genome. If so, how can this be done without distorting the broader goals of biological research that are crucial for any understanding of the data generated in such a human genome project. As the committee became better informed on the many relevant issues, the opinions of its members coalesced, producing a shared consensus of what should be done. This report reflects that consensus.
Origins of the Human Genome Project.

PubMed

Watson, J D; Cook-Deegan, R M

1991-01-01

The Human Genome Project has become a reality. Building on a debate that dates back to 1985, several genome projects are now in full stride around the world, and more are likely to form in the next several years. Italy began its genome program in 1987, and the United Kingdom and U.S.S.R. in 1988. The European communities mounted several genome projects on yeast, bacteria, Drosophila, and Arabidospis thaliana (a rapidly growing plant with a small genome) in 1988, and in 1990 commenced a new 2-year program on the human genome. In the United States, we have completed the first year of operation of the National Center for Human Genome Research at the National Institutes of Health (NIH), now the largest single funding source for genome research in the world. There have been dedicated budgets focused on genome-scale research at NIH, the U.S. Department of Energy, and the Howard Hughes Medical Institute for several years, and results are beginning to accumulate. There were three annual meetings on genome mapping and sequencing at Cold Spring Harbor, New York, in the spring of 1988, 1989, and 1990; the talks have shifted from a discussion about how to approach problems to presenting results from experiments already performed. We have finally begun to work rather than merely talk. The purpose of genome projects is to assemble data on the structure of DNA in human chromosomes and those of other organisms. A second goal is to develop new technologies to perform mapping and sequencing. There have been impressive technical advances in the past 5 years since the debate about the human genome project began. We are on the verge of beginning pilot projects to test several approaches to sequencing long stretches of DNA, using both automation and manual methods. Ordered sets of yeast artificial chromosome and cosmid clones have been assembled to span more than 2 million base pairs of several human chromosomes, and a region of 10 million base pairs has been assembled for
The role of epistatic interactions underpinning resistance to parasitic Varroa mites in haploid honey bee (Apis mellifera) drones.

PubMed

Conlon, Benjamin H; Frey, Eva; Rosenkranz, Peter; Locke, Barbara; Moritz, Robin F A; Routtu, Jarkko

2018-06-01

The Red Queen hypothesis predicts that host-parasite coevolutionary dynamics can select for host resistance through increased genetic diversity, recombination and evolutionary rates. However, in haplodiploid organisms such as the honeybee (Apis mellifera), models suggest the selective pressure is weaker than in diploids. Haplodiploid sex determination, found in A. mellifera, can allow deleterious recessive alleles to persist in the population through the diploid sex with negative effects predominantly expressed in the haploid sex. To overcome these negative effects in haploid genomes, epistatic interactions have been hypothesized to play an important role. Here, we use the interaction between A. mellifera and the parasitic mite Varroa destructor to test epistasis in the expression of resistance, through the inhibition of parasite reproduction, in haploid drones. We find novel loci on three chromosomes which explain over 45% of the resistance phenotype. Two of these loci interact only additively, suggesting their expression is independent of each other, but both loci interact epistatically with the third locus. With drone offspring inheriting only one copy of the queen's chromosomes, the drones will only possess one of two queen alleles throughout the years-long lifetime of the honeybee colony. Varroa, in comparison, completes its highly inbred reproductive cycle in a matter of weeks, allowing it to rapidly evolve resistance. Faced with the rapidly evolving Varroa, a diversity of pathways and epistatic interactions for the inhibition of Varroa reproduction could therefore provide a selective advantage to the high levels of recombination seen in A. mellifera. This allows for the remixing of phenotypes despite a fixed queen genotype. © 2018 European Society For Evolutionary Biology. Journal of Evolutionary Biology © 2018 European Society For Evolutionary Biology.
Schrödinger's Cheshire Cat: Are Haploid Emiliania huxleyi Cells Resistant to Viral Infection or Not?

PubMed

Mordecai, Gideon J; Verret, Frederic; Highfield, Andrea; Schroeder, Declan C

2017-03-18

Emiliania huxleyi is the main calcite producer on Earth and is routinely infected by a virus (EhV); a double stranded DNA (dsDNA) virus belonging to the family Phycodnaviridae . E. huxleyi exhibits a haplodiploid life cycle; the calcified diploid stage is non-motile and forms extensive blooms. The haploid phase is a non-calcified biflagellated cell bearing organic scales. Haploid cells are thought to resist infection, through a process deemed the "Cheshire Cat" escape strategy; however, a recent study detected the presence of viral lipids in the same haploid strain. Here we report on the application of an E. huxleyi CCMP1516 EhV-86 combined tiling array (TA) that further confirms an EhV infection in the RCC1217 haploid strain, which grew without any signs of cell lysis. Reverse transcription polymerase chain reaction (RT-PCR) and PCR verified the presence of viral RNA in the haploid cells, yet indicated an absence of viral DNA, respectively. These infected cells are an alternative stage of the virus life cycle deemed the haplococcolithovirocell. In this instance, the host is both resistant to and infected by EhV, i.e., the viral transcriptome is present in haploid cells whilst there is no evidence of viral lysis. This superimposed state is reminiscent of Schrödinger's cat; of being simultaneously both dead and alive.
The Human Microbiome: Our Second Genome*

PubMed Central

Grice, Elizabeth A.; Segre, Julia A.

2012-01-01

The human genome has been referred to as the blueprint of human biology. In this review we consider an essential but largely ignored overlay to that blueprint, the human microbiome, which is composed of those microbes that live in and on our bodies. The human microbiome is a source of genetic diversity, a modifier of disease, an essential component of immunity, and a functional entity that influences metabolism and modulates drug interactions. Characterization and analysis of the human microbiome have been greatly catalyzed by advances in genomic technologies. We discuss how these technologies have shaped this emerging field of study and advanced our understanding of the human microbiome. We also identify future challenges, many of which are common to human genetic studies, and predict that in the future, analyzing genetic variation and risk of human disease will sometimes necessitate the integration of human and microbial genomic data sets. PMID:22703178
History of the DOE Human Genome Program

Science.gov Websites

History of the DOE Human Genome Program The following history is taken from the U.S. Department of Energy 1991-91 Human Genome Program Report (June 1992). This is an archived item. A brief history of the U.S. Department of Energy (DOE) Human Genome Program will be useful in a discussion of the objectives
Proteomic strategy for the identification of critical actors in reorganization of the post-meiotic male genome.

PubMed

Govin, Jerome; Gaucher, Jonathan; Ferro, Myriam; Debernardi, Alexandra; Garin, Jerome; Khochbin, Saadi; Rousseaux, Sophie

2012-01-01

After meiosis, during the final stages of spermatogenesis, the haploid male genome undergoes major structural changes, resulting in a shift from a nucleosome-based genome organization to the sperm-specific, highly compacted nucleoprotamine structure. Recent data support the idea that region-specific programming of the haploid male genome is of high importance for the post-fertilization events and for successful embryo development. Although these events constitute a unique and essential step in reproduction, the mechanisms by which they occur have remained completely obscure and the factors involved have mostly remained uncharacterized. Here, we sought a strategy to significantly increase our understanding of proteins controlling the haploid male genome reprogramming, based on the identification of proteins in two specific pools: those with the potential to bind nucleic acids (basic proteins) and proteins capable of binding basic proteins (acidic proteins). For the identification of acidic proteins, we developed an approach involving a transition-protein (TP)-based chromatography, which has the advantage of retaining not only acidic proteins due to the charge interactions, but also potential TP-interacting factors. A second strategy, based on an in-depth bioinformatic analysis of the identified proteins, was then applied to pinpoint within the lists obtained, male germ cells expressed factors relevant to the post-meiotic genome organization. This approach reveals a functional network of DNA-packaging proteins and their putative chaperones and sheds a new light on the way the critical transitions in genome organizations could take place. This work also points to a new area of research in male infertility and sperm quality assessments.
Standing at the Gateway to Europe - The Genetic Structure of Western Balkan Populations Based on Autosomal and Haploid Markers

PubMed Central

Kovacevic, Lejla; Tambets, Kristiina; Ilumäe, Anne-Mai; Kushniarevich, Alena; Yunusbayev, Bayazit; Solnik, Anu; Bego, Tamer; Primorac, Dragan; Skaro, Vedrana; Leskovac, Andreja; Jakovski, Zlatko; Drobnic, Katja; Tolk, Helle-Viivi; Kovacevic, Sandra; Rudan, Pavao; Metspalu, Ene; Marjanovic, Damir

2014-01-01

Contemporary inhabitants of the Balkan Peninsula belong to several ethnic groups of diverse cultural background. In this study, three ethnic groups from Bosnia and Herzegovina - Bosniacs, Bosnian Croats and Bosnian Serbs - as well as the populations of Serbians, Croatians, Macedonians from the former Yugoslav Republic of Macedonia, Montenegrins and Kosovars have been characterized for the genetic variation of 660 000 genome-wide autosomal single nucleotide polymorphisms and for haploid markers. New autosomal data of the 70 individuals together with previously published data of 20 individuals from the populations of the Western Balkan region in a context of 695 samples of global range have been analysed. Comparison of the variation data of autosomal and haploid lineages of the studied Western Balkan populations reveals a concordance of the data in both sets and the genetic uniformity of the studied populations, especially of Western South-Slavic speakers. The genetic variation of Western Balkan populations reveals the continuity between the Middle East and Europe via the Balkan region and supports the scenario that one of the major routes of ancient gene flows and admixture went through the Balkan Peninsula. PMID:25148043
Standing at the gateway to Europe--the genetic structure of Western balkan populations based on autosomal and haploid markers.

PubMed

Kovacevic, Lejla; Tambets, Kristiina; Ilumäe, Anne-Mai; Kushniarevich, Alena; Yunusbayev, Bayazit; Solnik, Anu; Bego, Tamer; Primorac, Dragan; Skaro, Vedrana; Leskovac, Andreja; Jakovski, Zlatko; Drobnic, Katja; Tolk, Helle-Viivi; Kovacevic, Sandra; Rudan, Pavao; Metspalu, Ene; Marjanovic, Damir

2014-01-01

Contemporary inhabitants of the Balkan Peninsula belong to several ethnic groups of diverse cultural background. In this study, three ethnic groups from Bosnia and Herzegovina - Bosniacs, Bosnian Croats and Bosnian Serbs - as well as the populations of Serbians, Croatians, Macedonians from the former Yugoslav Republic of Macedonia, Montenegrins and Kosovars have been characterized for the genetic variation of 660 000 genome-wide autosomal single nucleotide polymorphisms and for haploid markers. New autosomal data of the 70 individuals together with previously published data of 20 individuals from the populations of the Western Balkan region in a context of 695 samples of global range have been analysed. Comparison of the variation data of autosomal and haploid lineages of the studied Western Balkan populations reveals a concordance of the data in both sets and the genetic uniformity of the studied populations, especially of Western South-Slavic speakers. The genetic variation of Western Balkan populations reveals the continuity between the Middle East and Europe via the Balkan region and supports the scenario that one of the major routes of ancient gene flows and admixture went through the Balkan Peninsula.

Oilseed rape seeds with ablated defence cells of the glucosinolate–myrosinase system. Production and characteristics of double haploid MINELESS plants of Brassica napus L.

PubMed Central

Ahuja, Ishita; Borgen, Birgit Hafeld; Hansen, Magnor; Honne, Bjørn Ivar; Müller, Caroline; Rohloff, Jens; Rossiter, John Trevor; Bones, Atle Magnar

2011-01-01

Oilseed rape and other crop plants of the family Brassicaceae contain a unique defence system known as the glucosinolate–myrosinase system or the ‘mustard oil bomb’. The ‘mustard oil bomb’ which includes myrosinase and glucosinolates is triggered by abiotic and biotic stress, resulting in the formation of toxic products such as nitriles and isothiocyanates. Myrosinase is present in specialist cells known as ‘myrosin cells’ and can also be known as toxic mines. The myrosin cell idioblasts of Brassica napus were genetically reprogrammed to undergo controlled cell death (ablation) during seed development. These myrosin cell-free plants have been named MINELESS as they lack toxic mines. This has led to the production of oilseed rape with a significant reduction both in myrosinase levels and in the hydrolysis of glucosinolates. Even though the myrosinase activity in MINELESS was very low compared with the wild type, variation was observed. This variability was overcome by producing homozygous seeds. A microspore culture technique involving non-fertile haploid MINELESS plants was developed and these plants were treated with colchicine to produce double haploid MINELESS plants with full fertility. Double haploid MINELESS plants had significantly reduced myrosinase levels and glucosinolate hydrolysis products. Wild-type and MINELESS plants exhibited significant differences in growth parameters such as plant height, leaf traits, matter accumulation, and yield parameters. The growth and developmental pattern of MINELESS plants was relatively slow compared with the wild type. The characteristics of the pure double haploid MINELESS plant are described and its importance for future biochemical, agricultural, dietary, functional genomics, and plant defence studies is discussed. PMID:21778185
In vitro propagation of the microsporidian pathogen Brachiola algerae and studies of its chromosome and ribosomal DNA organization in the context of the complete genome sequencing project.

PubMed

Belkorchia, Abdel; Biderre, Corinne; Militon, Cécile; Polonais, Valérie; Wincker, Patrick; Jubin, Claire; Delbac, Frédéric; Peyretaillade, Eric; Peyret, Pierre

2008-03-01

Brachiola algerae has a broad host spectrum from human to mosquitoes. The successful infection of two mosquito cell lines (Mos55: embryonic cells and Sua 4.0: hemocyte-like cells) and a human cell line (HFF) highlights the efficient adaptive capacity of this microsporidian pathogen. The molecular karyotype of this microsporidian species was determined in the context of the B. algerae genome sequencing project, showing that its haploid genome consists of 30 chromosomal-sized DNAs ranging from 160 to 2240 kbp giving an estimated genome size of 23 Mbp. A contig of 12,269 bp including the DNA sequence of the B. algerae ribosomal transcription unit has been built from initial genomic sequences and the secondary structure of the large subunit rRNA constructed. The data obtained indicate that B. algerae should be an excellent parasitic model to understand genome evolution in relation to infectious capacity.
Scanning the human genome at kilobase resolution.

PubMed

Chen, Jun; Kim, Yeong C; Jung, Yong-Chul; Xuan, Zhenyu; Dworkin, Geoff; Zhang, Yanming; Zhang, Michael Q; Wang, San Ming

2008-05-01

Normal genome variation and pathogenic genome alteration frequently affect small regions in the genome. Identifying those genomic changes remains a technical challenge. We report here the development of the DGS (Ditag Genome Scanning) technique for high-resolution analysis of genome structure. The basic features of DGS include (1) use of high-frequent restriction enzymes to fractionate the genome into small fragments; (2) collection of two tags from two ends of a given DNA fragment to form a ditag to represent the fragment; (3) application of the 454 sequencing system to reach a comprehensive ditag sequence collection; (4) determination of the genome origin of ditags by mapping to reference ditags from known genome sequences; (5) use of ditag sequences directly as the sense and antisense PCR primers to amplify the original DNA fragment. To study the relationship between ditags and genome structure, we performed a computational study by using the human genome reference sequences as a model, and analyzed the ditags experimentally collected from the well-characterized normal human DNA GM15510 and the leukemic human DNA of Kasumi-1 cells. Our studies show that DGS provides a kilobase resolution for studying genome structure with high specificity and high genome coverage. DGS can be applied to validate genome assembly, to compare genome similarity and variation in normal populations, and to identify genomic abnormality including insertion, inversion, deletion, translocation, and amplification in pathological genomes such as cancer genomes.
Transcriptome analysis of functional differentiation between haploid and diploid cells of Emiliania huxleyi, a globally significant photosynthetic calcifying cell.

PubMed

von Dassow, Peter; Ogata, Hiroyuki; Probert, Ian; Wincker, Patrick; Da Silva, Corinne; Audic, Stéphane; Claverie, Jean-Michel; de Vargas, Colomban

2009-01-01

Eukaryotes are classified as either haplontic, diplontic, or haplo-diplontic, depending on which ploidy levels undergo mitotic cell division in the life cycle. Emiliania huxleyi is one of the most abundant phytoplankton species in the ocean, playing an important role in global carbon fluxes, and represents haptophytes, an enigmatic group of unicellular organisms that diverged early in eukaryotic evolution. This species is haplo-diplontic. Little is known about the haploid cells, but they have been hypothesized to allow persistence of the species between the yearly blooms of diploid cells. We sequenced over 38,000 expressed sequence tags from haploid and diploid E. huxleyi normalized cDNA libraries to identify genes involved in important processes specific to each life phase (2N calcification or 1N motility), and to better understand the haploid phase of this prominent haplo-diplontic organism. The haploid and diploid transcriptomes showed a dramatic differentiation, with approximately 20% greater transcriptome richness in diploid cells than in haploid cells and only haploids included signal transduction and motility genes. Diploid-specific transcripts included Ca2+, H+, and HCO3- pumps. Potential factors differentiating the transcriptomes included haploid-specific Myb transcription factor homologs and an unusual diploid-specific histone H4 homolog. This study permitted the identification of genes likely involved in diploid-specific biomineralization, haploid-specific motility, and transcriptional control. Greater transcriptome richness in diploid cells suggests they may be more versatile for exploiting a diversity of rich environments whereas haploid cells are intrinsically more streamlined.
Transcriptome analysis of functional differentiation between haploid and diploid cells of Emiliania huxleyi, a globally significant photosynthetic calcifying cell

PubMed Central

2009-01-01

Background Eukaryotes are classified as either haplontic, diplontic, or haplo-diplontic, depending on which ploidy levels undergo mitotic cell division in the life cycle. Emiliania huxleyi is one of the most abundant phytoplankton species in the ocean, playing an important role in global carbon fluxes, and represents haptophytes, an enigmatic group of unicellular organisms that diverged early in eukaryotic evolution. This species is haplo-diplontic. Little is known about the haploid cells, but they have been hypothesized to allow persistence of the species between the yearly blooms of diploid cells. We sequenced over 38,000 expressed sequence tags from haploid and diploid E. huxleyi normalized cDNA libraries to identify genes involved in important processes specific to each life phase (2N calcification or 1N motility), and to better understand the haploid phase of this prominent haplo-diplontic organism. Results The haploid and diploid transcriptomes showed a dramatic differentiation, with approximately 20% greater transcriptome richness in diploid cells than in haploid cells and only ≤ 50% of transcripts estimated to be common between the two phases. The major functional category of transcripts differentiating haploids included signal transduction and motility genes. Diploid-specific transcripts included Ca2+, H+, and HCO3- pumps. Potential factors differentiating the transcriptomes included haploid-specific Myb transcription factor homologs and an unusual diploid-specific histone H4 homolog. Conclusions This study permitted the identification of genes likely involved in diploid-specific biomineralization, haploid-specific motility, and transcriptional control. Greater transcriptome richness in diploid cells suggests they may be more versatile for exploiting a diversity of rich environments whereas haploid cells are intrinsically more streamlined. PMID:19832986
Widespread of horizontal gene transfer in the human genome.

PubMed

Huang, Wenze; Tsai, Lillian; Li, Yulong; Hua, Nan; Sun, Chen; Wei, Chaochun

2017-04-04

A fundamental concept in biology is that heritable material is passed from parents to offspring, a process called vertical gene transfer. An alternative mechanism of gene acquisition is through horizontal gene transfer (HGT), which involves movement of genetic materials between different species. Horizontal gene transfer has been found prevalent in prokaryotes but very rare in eukaryote. In this paper, we investigate horizontal gene transfer in the human genome. From the pair-wise alignments between human genome and 53 vertebrate genomes, 1,467 human genome regions (2.6 M bases) from all chromosomes were found to be more conserved with non-mammals than with most mammals. These human genome regions involve 642 known genes, which are enriched with ion binding. Compared to known horizontal gene transfer regions in the human genome, there were few overlapping regions, which indicated horizontal gene transfer is more common than we expected in the human genome. Horizontal gene transfer impacts hundreds of human genes and this study provided insight into potential mechanisms of HGT in the human genome.
Implementing genomics and pharmacogenomics in the clinic: The National Human Genome Research Institute's genomic medicine portfolio.

PubMed

Manolio, Teri A

2016-10-01

Increasing knowledge about the influence of genetic variation on human health and growing availability of reliable, cost-effective genetic testing have spurred the implementation of genomic medicine in the clinic. As defined by the National Human Genome Research Institute (NHGRI), genomic medicine uses an individual's genetic information in his or her clinical care, and has begun to be applied effectively in areas such as cancer genomics, pharmacogenomics, and rare and undiagnosed diseases. In 2011 NHGRI published its strategic vision for the future of genomic research, including an ambitious research agenda to facilitate and promote the implementation of genomic medicine. To realize this agenda, NHGRI is consulting and facilitating collaborations with the external research community through a series of "Genomic Medicine Meetings," under the guidance and leadership of the National Advisory Council on Human Genome Research. These meetings have identified and begun to address significant obstacles to implementation, such as lack of evidence of efficacy, limited availability of genomics expertise and testing, lack of standards, and difficulties in integrating genomic results into electronic medical records. The six research and dissemination initiatives comprising NHGRI's genomic research portfolio are designed to speed the evaluation and incorporation, where appropriate, of genomic technologies and findings into routine clinical care. Actual adoption of successful approaches in clinical care will depend upon the willingness, interest, and energy of professional societies, practitioners, patients, and payers to promote their responsible use and share their experiences in doing so. Published by Elsevier Ireland Ltd.
Fully-Automated High-Throughput NMR System for Screening of Haploid Kernels of Maize (Corn) by Measurement of Oil Content

PubMed Central

Xu, Xiaoping; Huang, Qingming; Chen, Shanshan; Yang, Peiqiang; Chen, Shaojiang; Song, Yiqiao

2016-01-01

One of the modern crop breeding techniques uses doubled haploid plants that contain an identical pair of chromosomes in order to accelerate the breeding process. Rapid haploid identification method is critical for large-scale selections of double haploids. The conventional methods based on the color of the endosperm and embryo seeds are slow, manual and prone to error. On the other hand, there exists a significant difference between diploid and haploid seeds generated by high oil inducer, which makes it possible to use oil content to identify the haploid. This paper describes a fully-automated high-throughput NMR screening system for maize haploid kernel identification. The system is comprised of a sampler unit to select a single kernel to feed for measurement of NMR and weight, and a kernel sorter to distribute the kernel according to the measurement result. Tests of the system show a consistent accuracy of 94% with an average screening time of 4 seconds per kernel. Field test result is described and the directions for future improvement are discussed. PMID:27454427
Polyploid Titan Cells Produce Haploid and Aneuploid Progeny To Promote Stress Adaptation

PubMed Central

Gerstein, Aleeza C.; Fu, Man Shun; Mukaremera, Liliane; Li, Zhongming; Ormerod, Kate L.; Fraser, James A.; Berman, Judith

2015-01-01

ABSTRACT Cryptococcus neoformans is a major life-threatening fungal pathogen. In response to the stress of the host environment, C. neoformans produces large polyploid titan cells. Titan cell production enhances the virulence of C. neoformans, yet whether the polyploid aspect of titan cells is specifically influential remains unknown. We show that titan cells were more likely to survive and produce offspring under multiple stress conditions than typical cells and that even their normally sized daughters maintained an advantage over typical cells in continued exposure to stress. Although polyploid titan cells generated haploid daughter cell progeny upon in vitro replication under nutrient-replete conditions, titan cells treated with the antifungal drug fluconazole produced fluconazole-resistant diploid and aneuploid daughter cells. Interestingly, a single titan mother cell was capable of generating multiple types of aneuploid daughter cells. The increased survival and genomic diversity of titan cell progeny promote rapid adaptation to new or high-stress conditions. PMID:26463162
Schrödinger’s Cheshire Cat: Are Haploid Emiliania huxleyi Cells Resistant to Viral Infection or Not?

PubMed Central

Mordecai, Gideon J.; Verret, Frederic; Highfield, Andrea; Schroeder, Declan C.

2017-01-01

Emiliania huxleyi is the main calcite producer on Earth and is routinely infected by a virus (EhV); a double stranded DNA (dsDNA) virus belonging to the family Phycodnaviridae. E. huxleyi exhibits a haplodiploid life cycle; the calcified diploid stage is non-motile and forms extensive blooms. The haploid phase is a non-calcified biflagellated cell bearing organic scales. Haploid cells are thought to resist infection, through a process deemed the “Cheshire Cat” escape strategy; however, a recent study detected the presence of viral lipids in the same haploid strain. Here we report on the application of an E. huxleyi CCMP1516 EhV-86 combined tiling array (TA) that further confirms an EhV infection in the RCC1217 haploid strain, which grew without any signs of cell lysis. Reverse transcription polymerase chain reaction (RT-PCR) and PCR verified the presence of viral RNA in the haploid cells, yet indicated an absence of viral DNA, respectively. These infected cells are an alternative stage of the virus life cycle deemed the haplococcolithovirocell. In this instance, the host is both resistant to and infected by EhV, i.e., the viral transcriptome is present in haploid cells whilst there is no evidence of viral lysis. This superimposed state is reminiscent of Schrödinger’s cat; of being simultaneously both dead and alive. PMID:28335465
The genome of the fire ant Solenopsis invicta

USDA-ARS?s Scientific Manuscript database

Ants have evolved very complex societies and are key ecosystem members. Some of them are also major pests, as exemplified by the fire ant Solenopsis invicta. We present here the draft genome of S. invicta, assembled from 454 and Illumina reads obtained from a focal haploid male and his brothers. In ...
Minimal Absent Words in Four Human Genome Assemblies

PubMed Central

Garcia, Sara P.; Pinho, Armando J.

2011-01-01

Minimal absent words have been computed in genomes of organisms from all domains of life. Here, we aim to contribute to the catalogue of human genomic variation by investigating the variation in number and content of minimal absent words within a species, using four human genome assemblies. We compare the reference human genome GRCh37 assembly, the HuRef assembly of the genome of Craig Venter, the NA12878 assembly from cell line GM12878, and the YH assembly of the genome of a Han Chinese individual. We find the variation in number and content of minimal absent words between assemblies more significant for large and very large minimal absent words, where the biases of sequencing and assembly methodologies become more pronounced. Moreover, we find generally greater similarity between the human genome assemblies sequenced with capillary-based technologies (GRCh37 and HuRef) than between the human genome assemblies sequenced with massively parallel technologies (NA12878 and YH). Finally, as expected, we find the overall variation in number and content of minimal absent words within a species to be generally smaller than the variation between species. PMID:22220210
Natural mutagenesis of human genomes by endogenous retrotransposons.

PubMed

Iskow, Rebecca C; McCabe, Michael T; Mills, Ryan E; Torene, Spencer; Pittard, W Stephen; Neuwald, Andrew F; Van Meir, Erwin G; Vertino, Paula M; Devine, Scott E

2010-06-25

Two abundant classes of mobile elements, namely Alu and L1 elements, continue to generate new retrotransposon insertions in human genomes. Estimates suggest that these elements have generated millions of new germline insertions in individual human genomes worldwide. Unfortunately, current technologies are not capable of detecting most of these young insertions, and the true extent of germline mutagenesis by endogenous human retrotransposons has been difficult to examine. Here, we describe technologies for detecting these young retrotransposon insertions and demonstrate that such insertions indeed are abundant in human populations. We also found that new somatic L1 insertions occur at high frequencies in human lung cancer genomes. Genome-wide analysis suggests that altered DNA methylation may be responsible for the high levels of L1 mobilization observed in these tumors. Our data indicate that transposon-mediated mutagenesis is extensive in human genomes and is likely to have a major impact on human biology and diseases.
All about the Human Genome Project (HGP)

MedlinePlus

... CSER), and Genome Sequencing Informatics Tools (GS-IT) Comparative Genomics Background information prepared for the media on ... other species to the human sequence. Background on Comparative Genomic Analysis New Process to Prioritize Animal Genomes ...
Plasmodium copy number variation scan: gene copy numbers evaluation in haploid genomes.

PubMed

Beghain, Johann; Langlois, Anne-Claire; Legrand, Eric; Grange, Laura; Khim, Nimol; Witkowski, Benoit; Duru, Valentine; Ma, Laurence; Bouchier, Christiane; Ménard, Didier; Paul, Richard E; Ariey, Frédéric

2016-04-12

In eukaryotic genomes, deletion or amplification rates have been estimated to be a thousand more frequent than single nucleotide variation. In Plasmodium falciparum, relatively few transcription factors have been identified, and the regulation of transcription is seemingly largely influenced by gene amplification events. Thus copy number variation (CNV) is a major mechanism enabling parasite genomes to adapt to new environmental changes. Currently, the detection of CNVs is based on quantitative PCR (qPCR), which is significantly limited by the relatively small number of genes that can be analysed at any one time. Technological advances that facilitate whole-genome sequencing, such as next generation sequencing (NGS) enable deeper analyses of the genomic variation to be performed. Because the characteristics of Plasmodium CNVs need special consideration in algorithms and strategies for which classical CNV detection programs are not suited a dedicated algorithm to detect CNVs across the entire exome of P. falciparum was developed. This algorithm is based on a custom read depth strategy through NGS data and called PlasmoCNVScan. The analysis of CNV identification on three genes known to have different levels of amplification and which are located either in the nuclear, apicoplast or mitochondrial genomes is presented. The results are correlated with the qPCR experiments, usually used for identification of locus specific amplification/deletion. This tool will facilitate the study of P. falciparum genomic adaptation in response to ecological changes: drug pressure, decreased transmission, reduction of the parasite population size (transition to pre-elimination endemic area).
A decade of human genome project conclusion: Scientific diffusion about our genome knowledge.

PubMed

Moraes, Fernanda; Góes, Andréa

2016-05-06

The Human Genome Project (HGP) was initiated in 1990 and completed in 2003. It aimed to sequence the whole human genome. Although it represented an advance in understanding the human genome and its complexity, many questions remained unanswered. Other projects were launched in order to unravel the mysteries of our genome, including the ENCyclopedia of DNA Elements (ENCODE). This review aims to analyze the evolution of scientific knowledge related to both the HGP and ENCODE projects. Data were retrieved from scientific articles published in 1990-2014, a period comprising the development and the 10 years following the HGP completion. The fact that only 20,000 genes are protein and RNA-coding is one of the most striking HGP results. A new concept about the organization of genome arose. The ENCODE project was initiated in 2003 and targeted to map the functional elements of the human genome. This project revealed that the human genome is pervasively transcribed. Therefore, it was determined that a large part of the non-protein coding regions are functional. Finally, a more sophisticated view of chromatin structure emerged. The mechanistic functioning of the genome has been redrafted, revealing a much more complex picture. Besides, a gene-centric conception of the organism has to be reviewed. A number of criticisms have emerged against the ENCODE project approaches, raising the question of whether non-conserved but biochemically active regions are truly functional. Thus, HGP and ENCODE projects accomplished a great map of the human genome, but the data generated still requires further in depth analysis. © 2016 by The International Union of Biochemistry and Molecular Biology, 44:215-223, 2016. © 2016 The International Union of Biochemistry and Molecular Biology.
Genome editing for human gene therapy.

PubMed

Meissner, Torsten B; Mandal, Pankaj K; Ferreira, Leonardo M R; Rossi, Derrick J; Cowan, Chad A

2014-01-01

The rapid advancement of genome-editing techniques holds much promise for the field of human gene therapy. From bacteria to model organisms and human cells, genome editing tools such as zinc-finger nucleases (ZNFs), TALENs, and CRISPR/Cas9 have been successfully used to manipulate the respective genomes with unprecedented precision. With regard to human gene therapy, it is of great interest to test the feasibility of genome editing in primary human hematopoietic cells that could potentially be used to treat a variety of human genetic disorders such as hemoglobinopathies, primary immunodeficiencies, and cancer. In this chapter, we explore the use of the CRISPR/Cas9 system for the efficient ablation of genes in two clinically relevant primary human cell types, CD4+ T cells and CD34+ hematopoietic stem and progenitor cells. By using two guide RNAs directed at a single locus, we achieve highly efficient and predictable deletions that ablate gene function. The use of a Cas9-2A-GFP fusion protein allows FACS-based enrichment of the transfected cells. The ease of designing, constructing, and testing guide RNAs makes this dual guide strategy an attractive approach for the efficient deletion of clinically relevant genes in primary human hematopoietic stem and effector cells and enables the use of CRISPR/Cas9 for gene therapy.
Implementing genomics and pharmacogenomics in the clinic: The National Human Genome Research Institute’s genomic medicine portfolio

PubMed Central

Manolio, Teri A.

2016-01-01

Increasing knowledge about the influence of genetic variation on human health and growing availability of reliable, cost-effective genetic testing have spurred the implementation of genomic medicine in the clinic. As defined by the National Human Genome Research Institute (NHGRI), genomic medicine uses an individual’s genetic information in his or her clinical care, and has begun to be applied effectively in areas such as cancer genomics, pharmacogenomics, and rare and undiagnosed diseases. In 2011 NHGRI published its strategic vision for the future of genomic research, including an ambitious research agenda to facilitate and promote the implementation of genomic medicine. To realize this agenda, NHGRI is consulting and facilitating collaborations with the external research community through a series of “Genomic Medicine Meetings,” under the guidance and leadership of the National Advisory Council on Human Genome Research. These meetings have identified and begun to address significant obstacles to implementation, such as lack of evidence of efficacy, limited availability of genomics expertise and testing, lack of standards, and diffficulties in integrating genomic results into electronic medical records. The six research and dissemination initiatives comprising NHGRI’s genomic research portfolio are designed to speed the evaluation and incorporation, where appropriate, of genomic technologies and findings into routine clinical care. Actual adoption of successful approaches in clinical care will depend upon the willingness, interest, and energy of professional societies, practitioners, patients, and payers to promote their responsible use and share their experiences in doing so. PMID:27612677
Opening plenary speaker: Human genomics, precision medicine, and advancing human health.

PubMed

Green, Eric D

2016-08-01

Starting with the launch of the Human Genome Project in 1990, the past quarter-century has brought spectacular achievements in genomics that dramatically empower the study of human biology and disease. The human genomics enterprise is now in the midst of an important transition, as the growing foundation of genomic knowledge is being used by researchers and clinicians to tackle increasingly complex problems in biomedicine. Of particular prominence is the use of revolutionary new DNA sequencing technologies for generating prodigious amounts of DNA sequence data to elucidate the complexities of genome structure, function, and evolution, as well as to unravel the genomic bases of rare and common diseases. Together, these developments are ushering in the era of genomic medicine. Augmenting the advances in human genomics have been innovations in technologies for measuring environmental and lifestyle information, electronic health records, and data science; together, these provide opportunities of unprecedented scale and scope for investigating the underpinnings of health and disease. To capitalize on these opportunities, U.S. President Barack Obama recently announced a major new research endeavor - the U.S. Precision Medicine Initiative. This bold effort will be framed around several key aims, which include accelerating the use of genomically informed approaches to cancer care, making important policy and regulatory changes, and establishing a large research cohort of >1 million volunteers to facilitate precision medicine research. The latter will include making the partnership with all participants a centerpiece feature in the cohort's design and development. The Precision Medicine Initiative represents a broad-based research program that will allow new approaches for individualized medical care to be rigorously tested, so as to establish a new evidence base for advancing clinical practice and, eventually, human health.
A new rainbow trout (Oncorhynchus mykiss) reference genome assembly

USDA-ARS?s Scientific Manuscript database

In an effort to improve the rainbow trout reference genome assembly, we have re-sequenced the doubled-haploid Swanson line using the longest available reads from the Illumina technology. Overall we generated over 510 million 260nt paired-end shotgun reads, and 1 billion 160nt mate-pair reads from f...

Human genome and philosophy: what ethical challenge will human genome studies bring to the medical practices in the 21st century?

PubMed

Renzong, Q

2001-12-01

A human being or person cannot be reduced to a set of human genes, or human genome. Genetic essentialism is wrong, because as a person the entity should have self-conscious and social interaction capacity which is grown in an interpersonal relationship. Genetic determinism is wrong too, the relationship between a gene and a trait is not a linear model of causation, but rather a non-linear one. Human genome is a complexity system and functions in a complexity system of human body and a complexity of systems of natural/social environment. Genetic determinism also caused the issue of how much responsibility an agent should take for her/his action, and how much degrees of freedom will a human being have. Human genome research caused several conceptual issues. Can we call a gene 'good' or 'bad', 'superior' of 'inferior'? Is a boy who is detected to have the gene of Huntington's chorea or Alzheimer disease a patient? What should the term 'eugenics' mean? What do the terms such as 'gene therapy', 'treatment' and 'enhancement' and 'human cloning' mean etc.? The research of human genome and its application caused and will cause ethical issues. Can human genome research and its application be used for eugenics, or only for the treatment and prevention of diseases? Must the principle of informed consent/choice be insisted in human genome research and its application? How to protecting gene privacy and combating the discrimination on the basis of genes? How to promote the quality between persons, harmony between ethnic groups and peace between countries? How to establish a fair, just, equal and equitable relationship between developing and developed countries in regarding to human genome research and its application?
Justice and the Human Genome Project

DOE Office of Scientific and Technical Information (OSTI.GOV)

Murphy, T.F.; Lappe, M.

1992-01-01

Most of the essays gathered in this volume were first presented at a conference, Justice and the Human Genome, in Chicago in early November, 1991. The goal of the, conference was to consider questions of justice as they are and will be raised by the Human Genome Project. To achieve its goal of identifying and elucidating the challenges of justice inherent in genomic research and its social applications the conference drew together in one forum members from academia, medicine, and industry with interests divergent as rate-setting for insurance, the care of newborns, and the history of ethics. The essays inmore » this volume address a number of theoretical and practical concerns relative to the meaning of genomic research.« less
Justice and the Human Genome Project

DOE Office of Scientific and Technical Information (OSTI.GOV)

Murphy, T.F.; Lappe, M.

1992-12-31

Most of the essays gathered in this volume were first presented at a conference, Justice and the Human Genome, in Chicago in early November, 1991. The goal of the, conference was to consider questions of justice as they are and will be raised by the Human Genome Project. To achieve its goal of identifying and elucidating the challenges of justice inherent in genomic research and its social applications the conference drew together in one forum members from academia, medicine, and industry with interests divergent as rate-setting for insurance, the care of newborns, and the history of ethics. The essays inmore » this volume address a number of theoretical and practical concerns relative to the meaning of genomic research.« less
Genome Editing: A New Approach to Human Therapeutics.

PubMed

Porteus, Matthew

2016-01-01

The ability to manipulate the genome with precise spatial and nucleotide resolution (genome editing) has been a powerful research tool. In the past decade, the tools and expertise for using genome editing in human somatic cells and pluripotent cells have increased to such an extent that the approach is now being developed widely as a strategy to treat human disease. The fundamental process depends on creating a site-specific DNA double-strand break (DSB) in the genome and then allowing the cell's endogenous DSB repair machinery to fix the break such that precise nucleotide changes are made to the DNA sequence. With the development and discovery of several different nuclease platforms and increasing knowledge of the parameters affecting different genome editing outcomes, genome editing frequencies now reach therapeutic relevance for a wide variety of diseases. Moreover, there is a series of complementary approaches to assessing the safety and toxicity of any genome editing process, irrespective of the underlying nuclease used. Finally, the development of genome editing has raised the issue of whether it should be used to engineer the human germline. Although such an approach could clearly prevent the birth of people with devastating and destructive genetic diseases, questions remain about whether human society is morally responsible enough to use this tool.
Human genomics projects and precision medicine.

PubMed

Carrasco-Ramiro, F; Peiró-Pastor, R; Aguado, B

2017-09-01

The completion of the Human Genome Project (HGP) in 2001 opened the floodgates to a deeper understanding of medicine. There are dozens of HGP-like projects which involve from a few tens to several million genomes currently in progress, which vary from having specialized goals or a more general approach. However, data generation, storage, management and analysis in public and private cloud computing platforms have raised concerns about privacy and security. The knowledge gained from further research has changed the field of genomics and is now slowly permeating into clinical medicine. The new precision (personalized) medicine, where genome sequencing and data analysis are essential components, allows tailored diagnosis and treatment according to the information from the patient's own genome and specific environmental factors. P4 (predictive, preventive, personalized and participatory) medicine is introducing new concepts, challenges and opportunities. This review summarizes current sequencing technologies, concentrates on ongoing human genomics projects, and provides some examples in which precision medicine has already demonstrated clinical impact in diagnosis and/or treatment.
De novo assembly of a haplotype-resolved human genome.

PubMed

Cao, Hongzhi; Wu, Honglong; Luo, Ruibang; Huang, Shujia; Sun, Yuhui; Tong, Xin; Xie, Yinlong; Liu, Binghang; Yang, Hailong; Zheng, Hancheng; Li, Jian; Li, Bo; Wang, Yu; Yang, Fang; Sun, Peng; Liu, Siyang; Gao, Peng; Huang, Haodong; Sun, Jing; Chen, Dan; He, Guangzhu; Huang, Weihua; Huang, Zheng; Li, Yue; Tellier, Laurent C A M; Liu, Xiao; Feng, Qiang; Xu, Xun; Zhang, Xiuqing; Bolund, Lars; Krogh, Anders; Kristiansen, Karsten; Drmanac, Radoje; Drmanac, Snezana; Nielsen, Rasmus; Li, Songgang; Wang, Jian; Yang, Huanming; Li, Yingrui; Wong, Gane Ka-Shu; Wang, Jun

2015-06-01

The human genome is diploid, and knowledge of the variants on each chromosome is important for the interpretation of genomic information. Here we report the assembly of a haplotype-resolved diploid genome without using a reference genome. Our pipeline relies on fosmid pooling together with whole-genome shotgun strategies, based solely on next-generation sequencing and hierarchical assembly methods. We applied our sequencing method to the genome of an Asian individual and generated a 5.15-Gb assembled genome with a haplotype N50 of 484 kb. Our analysis identified previously undetected indels and 7.49 Mb of novel coding sequences that could not be aligned to the human reference genome, which include at least six predicted genes. This haplotype-resolved genome represents the most complete de novo human genome assembly to date. Application of our approach to identify individual haplotype differences should aid in translating genotypes to phenotypes for the development of personalized medicine.
Human evolution: a tale from ancient genomes

PubMed Central

2017-01-01

The field of human ancient DNA (aDNA) has moved from mitochondrial sequencing that suffered from contamination and provided limited biological insights, to become a fully genomic discipline that is changing our conception of human history. Recent successes include the sequencing of extinct hominins, and true population genomic studies of Bronze Age populations. Among the emerging areas of aDNA research, the analysis of past epigenomes is set to provide more new insights into human adaptation and disease susceptibility through time. Starting as a mere curiosity, ancient human genetics has become a major player in the understanding of our evolutionary history. This article is part of the themed issue ‘Evo-devo in the genomics era, and the origins of morphological diversity’. PMID:27994125
Genomic Flexibility of Human Endogenous Retrovirus Type K

PubMed Central

Dube, Derek; Contreras-Galindo, Rafael; He, Shirley; King, Steven R.; Gonzalez-Hernandez, Marta J.; Gitlin, Scott D.; Kaplan, Mark H.

2014-01-01

ABSTRACT Human endogenous retrovirus type K (HERV-K) proviruses are scattered throughout the human genome, but as no infectious HERV-K virus has been detected to date, the mechanism by which these viruses replicated and populated the genome remains unresolved. Here, we provide evidence that, in addition to the RNA genomes that canonical retroviruses package, modern HERV-K viruses can contain reverse-transcribed DNA (RT-DNA) genomes. Indeed, reverse transcription of genomic HERV-K RNA into the DNA form is able to occur in three distinct times and locations: (i) in the virus-producing cell prior to viral release, yielding a DNA-containing extracellular virus particle similar to the spumaviruses; (ii) within the extracellular virus particle itself, transitioning from an RNA-containing particle to a DNA-containing particle; and (iii) after entry of the RNA-containing virus into the target cell, similar to canonical retroviruses, such as murine leukemia virus and HIV. Moreover, using a resuscitated HERV-K virus construct, we show that both viruses with RNA genomes and viruses with DNA genomes are capable of infecting target cells. This high level of genomic flexibility historically could have permitted these viruses to replicate in various host cell environments, potentially assisting in their many integration events and resulting in their high prevalence in the human genome. Moreover, the ability of modern HERV-K viruses to proceed through reverse transcription and package RT-DNA genomes suggests a higher level of replication competency than was previously understood, and it may be relevant in HERV-K-associated human diseases. IMPORTANCE Retroviral elements comprise at least 8% of the human genome. Of all the endogenous retroviruses, HERV-K viruses are the most intact and biologically active. While a modern infectious HERV-K has yet to be found, HERV-K activation has been associated with cancers, autoimmune diseases, and HIV-1 infection. Thus, determining how this
A nine-scaffold genome assembly of the nine chromosome sugar beet

USDA-ARS?s Scientific Manuscript database

Over the course of 20 months, we assembled a sugar beet genome (700 - 800 Mb) into a close representation of the nine haploid chromosomes of beet. This result was obtained by sequentially assembling sequences >40 kb in length, orienting these assemblies via optical mapping, and scaffolding with in v...
Initial Genomics of the Human Nucleolus

PubMed Central

Németh, Attila; Conesa, Ana; Santoyo-Lopez, Javier; Medina, Ignacio; Montaner, David; Péterfia, Bálint; Solovei, Irina; Cremer, Thomas; Dopazo, Joaquin; Längst, Gernot

2010-01-01

We report for the first time the genomics of a nuclear compartment of the eukaryotic cell. 454 sequencing and microarray analysis revealed the pattern of nucleolus-associated chromatin domains (NADs) in the linear human genome and identified different gene families and certain satellite repeats as the major building blocks of NADs, which constitute about 4% of the genome. Bioinformatic evaluation showed that NAD–localized genes take part in specific biological processes, like the response to other organisms, odor perception, and tissue development. 3D FISH and immunofluorescence experiments illustrated the spatial distribution of NAD–specific chromatin within interphase nuclei and its alteration upon transcriptional changes. Altogether, our findings describe the nature of DNA sequences associated with the human nucleolus and provide insights into the function of the nucleolus in genome organization and establishment of nuclear architecture. PMID:20361057
Toward the 1,000 dollars human genome.

PubMed

Bennett, Simon T; Barnes, Colin; Cox, Anthony; Davies, Lisa; Brown, Clive

2005-06-01

Revolutionary new technologies, capable of transforming the economics of sequencing, are providing an unparalleled opportunity to analyze human genetic variation comprehensively at the whole-genome level within a realistic timeframe and at affordable costs. Current estimates suggest that it would cost somewhere in the region of 30 million US dollars to sequence an entire human genome using Sanger-based sequencing, and on one machine it would take about 60 years. Solexa is widely regarded as a company with the necessary disruptive technology to be the first to achieve the ultimate goal of the so-called 1,000 dollars human genome - the conceptual cost-point needed for routine analysis of individual genomes. Solexa's technology is based on completely novel sequencing chemistry capable of sequencing billions of individual DNA molecules simultaneously, a base at a time, to enable highly accurate, low cost analysis of an entire human genome in a single experiment. When applied over a large enough genomic region, these new approaches to resequencing will enable the simultaneous detection and typing of known, as well as unknown, polymorphisms, and will also offer information about patterns of linkage disequilibrium in the population being studied. Technological progress, leading to the advent of single-molecule-based approaches, is beginning to dramatically drive down costs and increase throughput to unprecedented levels, each being several orders of magnitude better than that which is currently available. A new sequencing paradigm based on single molecules will be faster, cheaper and more sensitive, and will permit routine analysis at the whole-genome level.
The zebrafish reference genome sequence and its relationship to the human genome.

PubMed

Howe, Kerstin; Clark, Matthew D; Torroja, Carlos F; Torrance, James; Berthelot, Camille; Muffato, Matthieu; Collins, John E; Humphray, Sean; McLaren, Karen; Matthews, Lucy; McLaren, Stuart; Sealy, Ian; Caccamo, Mario; Churcher, Carol; Scott, Carol; Barrett, Jeffrey C; Koch, Romke; Rauch, Gerd-Jörg; White, Simon; Chow, William; Kilian, Britt; Quintais, Leonor T; Guerra-Assunção, José A; Zhou, Yi; Gu, Yong; Yen, Jennifer; Vogel, Jan-Hinnerk; Eyre, Tina; Redmond, Seth; Banerjee, Ruby; Chi, Jianxiang; Fu, Beiyuan; Langley, Elizabeth; Maguire, Sean F; Laird, Gavin K; Lloyd, David; Kenyon, Emma; Donaldson, Sarah; Sehra, Harminder; Almeida-King, Jeff; Loveland, Jane; Trevanion, Stephen; Jones, Matt; Quail, Mike; Willey, Dave; Hunt, Adrienne; Burton, John; Sims, Sarah; McLay, Kirsten; Plumb, Bob; Davis, Joy; Clee, Chris; Oliver, Karen; Clark, Richard; Riddle, Clare; Elliot, David; Eliott, David; Threadgold, Glen; Harden, Glenn; Ware, Darren; Begum, Sharmin; Mortimore, Beverley; Mortimer, Beverly; Kerry, Giselle; Heath, Paul; Phillimore, Benjamin; Tracey, Alan; Corby, Nicole; Dunn, Matthew; Johnson, Christopher; Wood, Jonathan; Clark, Susan; Pelan, Sarah; Griffiths, Guy; Smith, Michelle; Glithero, Rebecca; Howden, Philip; Barker, Nicholas; Lloyd, Christine; Stevens, Christopher; Harley, Joanna; Holt, Karen; Panagiotidis, Georgios; Lovell, Jamieson; Beasley, Helen; Henderson, Carl; Gordon, Daria; Auger, Katherine; Wright, Deborah; Collins, Joanna; Raisen, Claire; Dyer, Lauren; Leung, Kenric; Robertson, Lauren; Ambridge, Kirsty; Leongamornlert, Daniel; McGuire, Sarah; Gilderthorp, Ruth; Griffiths, Coline; Manthravadi, Deepa; Nichol, Sarah; Barker, Gary; Whitehead, Siobhan; Kay, Michael; Brown, Jacqueline; Murnane, Clare; Gray, Emma; Humphries, Matthew; Sycamore, Neil; Barker, Darren; Saunders, David; Wallis, Justene; Babbage, Anne; Hammond, Sian; Mashreghi-Mohammadi, Maryam; Barr, Lucy; Martin, Sancha; Wray, Paul; Ellington, Andrew; Matthews, Nicholas; Ellwood, Matthew; Woodmansey, Rebecca; Clark, Graham; Cooper, James D; Cooper, James; Tromans, Anthony; Grafham, Darren; Skuce, Carl; Pandian, Richard; Andrews, Robert; Harrison, Elliot; Kimberley, Andrew; Garnett, Jane; Fosker, Nigel; Hall, Rebekah; Garner, Patrick; Kelly, Daniel; Bird, Christine; Palmer, Sophie; Gehring, Ines; Berger, Andrea; Dooley, Christopher M; Ersan-Ürün, Zübeyde; Eser, Cigdem; Geiger, Horst; Geisler, Maria; Karotki, Lena; Kirn, Anette; Konantz, Judith; Konantz, Martina; Oberländer, Martina; Rudolph-Geiger, Silke; Teucke, Mathias; Lanz, Christa; Raddatz, Günter; Osoegawa, Kazutoyo; Zhu, Baoli; Rapp, Amanda; Widaa, Sara; Langford, Cordelia; Yang, Fengtang; Schuster, Stephan C; Carter, Nigel P; Harrow, Jennifer; Ning, Zemin; Herrero, Javier; Searle, Steve M J; Enright, Anton; Geisler, Robert; Plasterk, Ronald H A; Lee, Charles; Westerfield, Monte; de Jong, Pieter J; Zon, Leonard I; Postlethwait, John H; Nüsslein-Volhard, Christiane; Hubbard, Tim J P; Roest Crollius, Hugues; Rogers, Jane; Stemple, Derek L

2013-04-25

Zebrafish have become a popular organism for the study of vertebrate gene function. The virtually transparent embryos of this species, and the ability to accelerate genetic studies by gene knockdown or overexpression, have led to the widespread use of zebrafish in the detailed investigation of vertebrate gene function and increasingly, the study of human genetic disease. However, for effective modelling of human genetic disease it is important to understand the extent to which zebrafish genes and gene structures are related to orthologous human genes. To examine this, we generated a high-quality sequence assembly of the zebrafish genome, made up of an overlapping set of completely sequenced large-insert clones that were ordered and oriented using a high-resolution high-density meiotic map. Detailed automatic and manual annotation provides evidence of more than 26,000 protein-coding genes, the largest gene set of any vertebrate so far sequenced. Comparison to the human reference genome shows that approximately 70% of human genes have at least one obvious zebrafish orthologue. In addition, the high quality of this genome assembly provides a clearer understanding of key genomic features such as a unique repeat content, a scarcity of pseudogenes, an enrichment of zebrafish-specific genes on chromosome 4 and chromosomal regions that influence sex determination.
A genomic storm in critically injured humans

PubMed Central

Xiao, Wenzhong; Mindrinos, Michael N.; Seok, Junhee; Cuschieri, Joseph; Cuenca, Alex G.; Gao, Hong; Hayden, Douglas L.; Hennessy, Laura; Moore, Ernest E.; Minei, Joseph P.; Bankey, Paul E.; Johnson, Jeffrey L.; Sperry, Jason; Nathens, Avery B.; Billiar, Timothy R.; West, Michael A.; Brownstein, Bernard H.; Mason, Philip H.; Baker, Henry V.; Finnerty, Celeste C.; Jeschke, Marc G.; López, M. Cecilia; Klein, Matthew B.; Gamelli, Richard L.; Gibran, Nicole S.; Arnoldo, Brett; Xu, Weihong; Zhang, Yuping; Calvano, Steven E.; McDonald-Smith, Grace P.; Schoenfeld, David A.; Storey, John D.; Cobb, J. Perren; Warren, H. Shaw; Moldawer, Lyle L.; Herndon, David N.; Lowry, Stephen F.; Maier, Ronald V.; Davis, Ronald W.

2011-01-01

Human survival from injury requires an appropriate inflammatory and immune response. We describe the circulating leukocyte transcriptome after severe trauma and burn injury, as well as in healthy subjects receiving low-dose bacterial endotoxin, and show that these severe stresses produce a global reprioritization affecting >80% of the cellular functions and pathways, a truly unexpected “genomic storm.” In severe blunt trauma, the early leukocyte genomic response is consistent with simultaneously increased expression of genes involved in the systemic inflammatory, innate immune, and compensatory antiinflammatory responses, as well as in the suppression of genes involved in adaptive immunity. Furthermore, complications like nosocomial infections and organ failure are not associated with any genomic evidence of a second hit and differ only in the magnitude and duration of this genomic reprioritization. The similarities in gene expression patterns between different injuries reveal an apparently fundamental human response to severe inflammatory stress, with genomic signatures that are surprisingly far more common than different. Based on these transcriptional data, we propose a new paradigm for the human immunological response to severe injury. PMID:22110166
Agrobacterium-mediated transformation of the haploid liverwort Marchantia polymorpha L., an emerging model for plant biology.

PubMed

Ishizaki, Kimitsune; Chiyoda, Shota; Yamato, Katsuyuki T; Kohchi, Takayuki

2008-07-01

Agrobacterium-mediated transformation has not been practical in pteridophytes, bryophytes and algae to date, although it is commonly used in model plants including Arabidopsis and rice. Here we present a rapid Agrobacterium-mediated transformation system for the haploid liverwort Marchantia polymorpha L. using immature thalli developed from spores. Hundreds of hygromycin-resistant plants per sporangium were obtained by co-cultivation of immature thalli with Agrobacterium carrying the binary vector that contains a reporter, the beta-glucuronidase (GUS) gene with an intron, and a selection marker, the hygromycin phosphotransferase (hpt) gene. In this system, individual gemmae, which arise asexually from single initial cells, were analyzed as isogenic transformants. GUS activity staining showed that all hygromycin-resistant plants examined expressed the GUS transgene in planta. DNA analyses verified random integration of 1-5 copies of the intact T-DNA between the right and the left borders into the M. polymorpha genome. The efficient and rapid Agrobacterium-mediated transformation of M. polymorpha should provide molecular techniques to facilitate comparative genomics, taking advantage of this unique model plant that retains many features of the common ancestor of land plants.
Unexplored therapeutic opportunities in the human genome.

PubMed

Oprea, Tudor I; Bologa, Cristian G; Brunak, Søren; Campbell, Allen; Gan, Gregory N; Gaulton, Anna; Gomez, Shawn M; Guha, Rajarshi; Hersey, Anne; Holmes, Jayme; Jadhav, Ajit; Jensen, Lars Juhl; Johnson, Gary L; Karlson, Anneli; Leach, Andrew R; Ma'ayan, Avi; Malovannaya, Anna; Mani, Subramani; Mathias, Stephen L; McManus, Michael T; Meehan, Terrence F; von Mering, Christian; Muthas, Daniel; Nguyen, Dac-Trung; Overington, John P; Papadatos, George; Qin, Jun; Reich, Christian; Roth, Bryan L; Schürer, Stephan C; Simeonov, Anton; Sklar, Larry A; Southall, Noel; Tomita, Susumu; Tudose, Ilinca; Ursu, Oleg; Vidovic, Dušica; Waller, Anna; Westergaard, David; Yang, Jeremy J; Zahoránszky-Köhalmi, Gergely

2018-05-01

A large proportion of biomedical research and the development of therapeutics is focused on a small fraction of the human genome. In a strategic effort to map the knowledge gaps around proteins encoded by the human genome and to promote the exploration of currently understudied, but potentially druggable, proteins, the US National Institutes of Health launched the Illuminating the Druggable Genome (IDG) initiative in 2014. In this article, we discuss how the systematic collection and processing of a wide array of genomic, proteomic, chemical and disease-related resource data by the IDG Knowledge Management Center have enabled the development of evidence-based criteria for tracking the target development level (TDL) of human proteins, which indicates a substantial knowledge deficit for approximately one out of three proteins in the human proteome. We then present spotlights on the TDL categories as well as key drug target classes, including G protein-coupled receptors, protein kinases and ion channels, which illustrate the nature of the unexplored opportunities for biomedical research and therapeutic development.
Human genome project and sickle cell disease.

PubMed

Norman, Brenda J; Miller, Sheila D

2011-01-01

Sickle cell disease is one of the most common genetic blood disorders in the United States that affects 1 in every 375 African Americans. Sickle cell disease is an inherited condition caused by abnormal hemoglobin in the red blood cells. The Human Genome Project has provided valuable insight and extensive research advances in the understanding of the human genome and sickle cell disease. Significant progress in genetic knowledge has led to an increase in the ability for researchers to map and sequence genes for diagnosis, treatment, and prevention of sickle cell disease and other chronic illnesses. This article explores some of the recent knowledge and advances about sickle cell disease and the Human Genome Project.
Development and application of Human Genome Epidemiology

NASA Astrophysics Data System (ADS)

Xu, Jingwen

2017-12-01

Epidemiology is a science that studies distribution of diseases and health in population and its influencing factors, it also studies how to prevent and cure disease and promote health strategies and measures. Epidemiology has developed rapidly in recent years and it is an intercross subject with various other disciplines to form a series of branch disciplines such as Genetic epidemiology, molecular epidemiology, drug epidemiology and tumor epidemiology. With the implementation and completion of Human Genome Project (HGP), Human Genome Epidemiology (HuGE) has emerged at this historic moment. In this review, the development of Human Genome Epidemiology, research content, the construction and structure of relevant network, research standards, as well as the existing results and problems are briefly outlined.
Precise detection of de novo single nucleotide variants in human genomes.

PubMed

Gómez-Romero, Laura; Palacios-Flores, Kim; Reyes, José; García, Delfino; Boege, Margareta; Dávila, Guillermo; Flores, Margarita; Schatz, Michael C; Palacios, Rafael

2018-05-22

The precise determination of de novo genetic variants has enormous implications across different fields of biology and medicine, particularly personalized medicine. Currently, de novo variations are identified by mapping sample reads from a parent-offspring trio to a reference genome, allowing for a certain degree of differences. While widely used, this approach often introduces false-positive (FP) results due to misaligned reads and mischaracterized sequencing errors. In a previous study, we developed an alternative approach to accurately identify single nucleotide variants (SNVs) using only perfect matches. However, this approach could be applied only to haploid regions of the genome and was computationally intensive. In this study, we present a unique approach, coverage-based single nucleotide variant identification (COBASI), which allows the exploration of the entire genome using second-generation short sequence reads without extensive computing requirements. COBASI identifies SNVs using changes in coverage of exactly matching unique substrings, and is particularly suited for pinpointing de novo SNVs. Unlike other approaches that require population frequencies across hundreds of samples to filter out any methodological biases, COBASI can be applied to detect de novo SNVs within isolated families. We demonstrate this capability through extensive simulation studies and by studying a parent-offspring trio we sequenced using short reads. Experimental validation of all 58 candidate de novo SNVs and a selection of non-de novo SNVs found in the trio confirmed zero FP calls. COBASI is available as open source at https://github.com/Laura-Gomez/COBASI for any researcher to use. Copyright © 2018 the Author(s). Published by PNAS.
The Past, Present, and Future of Human Centromere Genomics

PubMed Central

Aldrup-MacDonald, Megan E.; Sullivan, Beth A.

2014-01-01

The centromere is the chromosomal locus essential for chromosome inheritance and genome stability. Human centromeres are located at repetitive alpha satellite DNA arrays that compose approximately 5% of the genome. Contiguous alpha satellite DNA sequence is absent from the assembled reference genome, limiting current understanding of centromere organization and function. Here, we review the progress in centromere genomics spanning the discovery of the sequence to its molecular characterization and the work done during the Human Genome Project era to elucidate alpha satellite structure and sequence variation. We discuss exciting recent advances in alpha satellite sequence assembly that have provided important insight into the abundance and complex organization of this sequence on human chromosomes. In light of these new findings, we offer perspectives for future studies of human centromere assembly and function. PMID:24683489
The Human Genome Project: how do we protect Australians?

PubMed

Stott Despoja, N

It is the moon landing of the nineties: the ambitious Human Genome Project--identifying the up to 100,000 genes that make up human DNA and the sequences of the three billion base-pairs that comprise the human genome. However, unlike the moon landing, the effects of the genome project will have a fundamental impact on the way we see ourselves and each other.

Genomic clones for human cholinesterase

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kott, M.; Venta, P.J.; Larsen, J.

1987-05-01

A human genomic library was prepared from peripheral white blood cells from a single donor by inserting an MboI partial digest into BamHI poly-linker sites of EMBL3. This library was screened using an oligolabeled human cholinesterase cDNA probe over 700 bp long. The latter probe was obtained from a human basal ganglia cDNA library. Of approximately 2 million clones screened with high stringency conditions several positive clones were identified; two have been plaque purified. One of these clones has been partially mapped using restriction enzymes known to cut within the coded region of the cDNA for human serum cholinesterase. Hybridizationmore » of the fragments and their sizes are as expected if the genomic clone is cholinesterase. Sequencing of the DNA fragments in M13 is in progress to verify the identify of the clone and the location of introns.« less
The Human Genome Initiative of the Department of Energy

DOE R&D Accomplishments Database

1988-01-01

The structural characterization of genes and elucidation of their encoded functions have become a cornerstone of modern health research, biology and biotechnology. A genome program is an organized effort to locate and identify the functions of all the genes of an organism. Beginning with the DOE-sponsored, 1986 human genome workshop at Santa Fe, the value of broadly organized efforts supporting total genome characterization became a subject of intensive study. There is now national recognition that benefits will rapidly accrue from an effective scientific infrastructure for total genome research. In the US genome research is now receiving dedicated funds. Several other nations are implementing genome programs. Supportive infrastructure is being improved through both national and international cooperation. The Human Genome Initiative of the Department of Energy (DOE) is a focused program of Resource and Technology Development, with objectives of speeding and bringing economies to the national human genome effort. This report relates the origins and progress of the Initiative.
Epigenetic silencing by the HUSH complex mediates position-effect variegation in human cells*

PubMed Central

Matheson, Nicholas J.; Wals, Kim; Antrobus, Robin; Göttgens, Berthold; Dougan, Gordon; Dawson, Mark A.; Lehner, Paul J.

2015-01-01

Forward genetic screens in Drosophila melanogaster for modifiers of position-effect variegation have revealed the basis of much of our understanding of heterochromatin. We took an analogous approach to identify genes required for epigenetic repression in human cells. A non-lethal forward genetic screen in near-haploid KBM7 cells identified the Human Silencing Hub (HUSH), a complex of three poorly-characterised proteins, TASOR, MPP8, and periphilin, which is absent from Drosophila but conserved from fish to humans. Loss of HUSH subunits resulted in decreased H3K9me3 at both endogenous genomic loci and retroviruses integrated into heterochromatin. Our results suggest that the HUSH complex is recruited to genomic loci rich in H3K9me3, where subsequent recruitment of the methyltransferase SETDB1 is required for further H3K9me3 deposition to maintain transcriptional silencing. PMID:26022416
Recurrent DNA inversion rearrangements in the human genome

PubMed Central

Flores, Margarita; Morales, Lucía; Gonzaga-Jauregui, Claudia; Domínguez-Vidaña, Rocío; Zepeda, Cinthya; Yañez, Omar; Gutiérrez, María; Lemus, Tzitziki; Valle, David; Avila, Ma. Carmen; Blanco, Daniel; Medina-Ruiz, Sofía; Meza, Karla; Ayala, Erandi; García, Delfino; Bustos, Patricia; González, Víctor; Girard, Lourdes; Tusie-Luna, Teresa; Dávila, Guillermo; Palacios, Rafael

2007-01-01

Several lines of evidence suggest that reiterated sequences in the human genome are targets for nonallelic homologous recombination (NAHR), which facilitates genomic rearrangements. We have used a PCR-based approach to identify breakpoint regions of rearranged structures in the human genome. In particular, we have identified intrachromosomal identical repeats that are located in reverse orientation, which may lead to chromosomal inversions. A bioinformatic workflow pathway to select appropriate regions for analysis was developed. Three such regions overlapping with known human genes, located on chromosomes 3, 15, and 19, were analyzed. The relative proportion of wild-type to rearranged structures was determined in DNA samples from blood obtained from different, unrelated individuals. The results obtained indicate that recurrent genomic rearrangements occur at relatively high frequency in somatic cells. Interestingly, the rearrangements studied were significantly more abundant in adults than in newborn individuals, suggesting that such DNA rearrangements might start to appear during embryogenesis or fetal life and continue to accumulate after birth. The relevance of our results in regard to human genomic variation is discussed. PMID:17389356
Gene expansion shapes genome architecture in the human pathogen Lichtheimia corymbifera: an evolutionary genomics analysis in the ancient terrestrial mucorales (Mucoromycotina).

PubMed

Schwartze, Volker U; Winter, Sascha; Shelest, Ekaterina; Marcet-Houben, Marina; Horn, Fabian; Wehner, Stefanie; Linde, Jörg; Valiante, Vito; Sammeth, Michael; Riege, Konstantin; Nowrousian, Minou; Kaerger, Kerstin; Jacobsen, Ilse D; Marz, Manja; Brakhage, Axel A; Gabaldón, Toni; Böcker, Sebastian; Voigt, Kerstin

2014-08-01

Lichtheimia species are the second most important cause of mucormycosis in Europe. To provide broader insights into the molecular basis of the pathogenicity-associated traits of the basal Mucorales, we report the full genome sequence of L. corymbifera and compared it to the genome of Rhizopus oryzae, the most common cause of mucormycosis worldwide. The genome assembly encompasses 33.6 MB and 12,379 protein-coding genes. This study reveals four major differences of the L. corymbifera genome to R. oryzae: (i) the presence of an highly elevated number of gene duplications which are unlike R. oryzae not due to whole genome duplication (WGD), (ii) despite the relatively high incidence of introns, alternative splicing (AS) is not frequently observed for the generation of paralogs and in response to stress, (iii) the content of repetitive elements is strikingly low (<5%), (iv) L. corymbifera is typically haploid. Novel virulence factors were identified which may be involved in the regulation of the adaptation to iron-limitation, e.g. LCor01340.1 encoding a putative siderophore transporter and LCor00410.1 involved in the siderophore metabolism. Genes encoding the transcription factors LCor08192.1 and LCor01236.1, which are similar to GATA type regulators and to calcineurin regulated CRZ1, respectively, indicating an involvement of the calcineurin pathway in the adaption to iron limitation. Genes encoding MADS-box transcription factors are elevated up to 11 copies compared to the 1-4 copies usually found in other fungi. More findings are: (i) lower content of tRNAs, but unique codons in L. corymbifera, (ii) Over 25% of the proteins are apparently specific for L. corymbifera. (iii) L. corymbifera contains only 2/3 of the proteases (known to be essential virulence factors) in comparison to R. oryzae. On the other hand, the number of secreted proteases, however, is roughly twice as high as in R. oryzae.
Defining functional DNA elements in the human genome

PubMed Central

Kellis, Manolis; Wold, Barbara; Snyder, Michael P.; Bernstein, Bradley E.; Kundaje, Anshul; Marinov, Georgi K.; Ward, Lucas D.; Birney, Ewan; Crawford, Gregory E.; Dekker, Job; Dunham, Ian; Elnitski, Laura L.; Farnham, Peggy J.; Feingold, Elise A.; Gerstein, Mark; Giddings, Morgan C.; Gilbert, David M.; Gingeras, Thomas R.; Green, Eric D.; Guigo, Roderic; Hubbard, Tim; Kent, Jim; Lieb, Jason D.; Myers, Richard M.; Pazin, Michael J.; Ren, Bing; Stamatoyannopoulos, John A.; Weng, Zhiping; White, Kevin P.; Hardison, Ross C.

2014-01-01

With the completion of the human genome sequence, attention turned to identifying and annotating its functional DNA elements. As a complement to genetic and comparative genomics approaches, the Encyclopedia of DNA Elements Project was launched to contribute maps of RNA transcripts, transcriptional regulator binding sites, and chromatin states in many cell types. The resulting genome-wide data reveal sites of biochemical activity with high positional resolution and cell type specificity that facilitate studies of gene regulation and interpretation of noncoding variants associated with human disease. However, the biochemically active regions cover a much larger fraction of the genome than do evolutionarily conserved regions, raising the question of whether nonconserved but biochemically active regions are truly functional. Here, we review the strengths and limitations of biochemical, evolutionary, and genetic approaches for defining functional DNA segments, potential sources for the observed differences in estimated genomic coverage, and the biological implications of these discrepancies. We also analyze the relationship between signal intensity, genomic coverage, and evolutionary conservation. Our results reinforce the principle that each approach provides complementary information and that we need to use combinations of all three to elucidate genome function in human biology and disease. PMID:24753594
Human genome project: revolutionizing biology through leveraging technology

NASA Astrophysics Data System (ADS)

Dahl, Carol A.; Strausberg, Robert L.

1996-04-01

The Human Genome Project (HGP) is an international project to develop genetic, physical, and sequence-based maps of the human genome. Since the inception of the HGP it has been clear that substantially improved technology would be required to meet the scientific goals, particularly in order to acquire the complete sequence of the human genome, and that these technologies coupled with the information forthcoming from the project would have a dramatic effect on the way biomedical research is performed in the future. In this paper, we discuss the state-of-the-art for genomic DNA sequencing, technological challenges that remain, and the potential technological paths that could yield substantially improved genomic sequencing technology. The impact of the technology developed from the HGP is broad-reaching and a discussion of other research and medical applications that are leveraging HGP-derived DNA analysis technologies is included. The multidisciplinary approach to the development of new technologies that has been successful for the HGP provides a paradigm for facilitating new genomic approaches toward understanding the biological role of functional elements and systems within the cell, including those encoded within genomic DNA and their molecular products.
The zebrafish reference genome sequence and its relationship to the human genome

PubMed Central

Howe, Kerstin; Clark, Matthew D.; Torroja, Carlos F.; Torrance, James; Berthelot, Camille; Muffato, Matthieu; Collins, John E.; Humphray, Sean; McLaren, Karen; Matthews, Lucy; McLaren, Stuart; Sealy, Ian; Caccamo, Mario; Churcher, Carol; Scott, Carol; Barrett, Jeffrey C.; Koch, Romke; Rauch, Gerd-Jörg; White, Simon; Chow, William; Kilian, Britt; Quintais, Leonor T.; Guerra-Assunção, José A.; Zhou, Yi; Gu, Yong; Yen, Jennifer; Vogel, Jan-Hinnerk; Eyre, Tina; Redmond, Seth; Banerjee, Ruby; Chi, Jianxiang; Fu, Beiyuan; Langley, Elizabeth; Maguire, Sean F.; Laird, Gavin K.; Lloyd, David; Kenyon, Emma; Donaldson, Sarah; Sehra, Harminder; Almeida-King, Jeff; Loveland, Jane; Trevanion, Stephen; Jones, Matt; Quail, Mike; Willey, Dave; Hunt, Adrienne; Burton, John; Sims, Sarah; McLay, Kirsten; Plumb, Bob; Davis, Joy; Clee, Chris; Oliver, Karen; Clark, Richard; Riddle, Clare; Eliott, David; Threadgold, Glen; Harden, Glenn; Ware, Darren; Mortimer, Beverly; Kerry, Giselle; Heath, Paul; Phillimore, Benjamin; Tracey, Alan; Corby, Nicole; Dunn, Matthew; Johnson, Christopher; Wood, Jonathan; Clark, Susan; Pelan, Sarah; Griffiths, Guy; Smith, Michelle; Glithero, Rebecca; Howden, Philip; Barker, Nicholas; Stevens, Christopher; Harley, Joanna; Holt, Karen; Panagiotidis, Georgios; Lovell, Jamieson; Beasley, Helen; Henderson, Carl; Gordon, Daria; Auger, Katherine; Wright, Deborah; Collins, Joanna; Raisen, Claire; Dyer, Lauren; Leung, Kenric; Robertson, Lauren; Ambridge, Kirsty; Leongamornlert, Daniel; McGuire, Sarah; Gilderthorp, Ruth; Griffiths, Coline; Manthravadi, Deepa; Nichol, Sarah; Barker, Gary; Whitehead, Siobhan; Kay, Michael; Brown, Jacqueline; Murnane, Clare; Gray, Emma; Humphries, Matthew; Sycamore, Neil; Barker, Darren; Saunders, David; Wallis, Justene; Babbage, Anne; Hammond, Sian; Mashreghi-Mohammadi, Maryam; Barr, Lucy; Martin, Sancha; Wray, Paul; Ellington, Andrew; Matthews, Nicholas; Ellwood, Matthew; Woodmansey, Rebecca; Clark, Graham; Cooper, James; Tromans, Anthony; Grafham, Darren; Skuce, Carl; Pandian, Richard; Andrews, Robert; Harrison, Elliot; Kimberley, Andrew; Garnett, Jane; Fosker, Nigel; Hall, Rebekah; Garner, Patrick; Kelly, Daniel; Bird, Christine; Palmer, Sophie; Gehring, Ines; Berger, Andrea; Dooley, Christopher M.; Ersan-Ürün, Zübeyde; Eser, Cigdem; Geiger, Horst; Geisler, Maria; Karotki, Lena; Kirn, Anette; Konantz, Judith; Konantz, Martina; Oberländer, Martina; Rudolph-Geiger, Silke; Teucke, Mathias; Osoegawa, Kazutoyo; Zhu, Baoli; Rapp, Amanda; Widaa, Sara; Langford, Cordelia; Yang, Fengtang; Carter, Nigel P.; Harrow, Jennifer; Ning, Zemin; Herrero, Javier; Searle, Steve M. J.; Enright, Anton; Geisler, Robert; Plasterk, Ronald H. A.; Lee, Charles; Westerfield, Monte; de Jong, Pieter J.; Zon, Leonard I.; Postlethwait, John H.; Nüsslein-Volhard, Christiane; Hubbard, Tim J. P.; Crollius, Hugues Roest; Rogers, Jane; Stemple, Derek L.

2013-01-01

Zebrafish have become a popular organism for the study of vertebrate gene function1,2. The virtually transparent embryos of this species, and the ability to accelerate genetic studies by gene knockdown or overexpression, have led to the widespread use of zebrafish in the detailed investigation of vertebrate gene function and increasingly, the study of human genetic disease3–5. However, for effective modelling of human genetic disease it is important to understand the extent to which zebrafish genes and gene structures are related to orthologous human genes. To examine this, we generated a high-quality sequence assembly of the zebrafish genome, made up of an overlapping set of completely sequenced large-insert clones that were ordered and oriented using a high-resolution high-density meiotic map. Detailed automatic and manual annotation provides evidence of more than 26,000 protein-coding genes6, the largest gene set of any vertebrate so far sequenced. Comparison to the human reference genome shows that approximately 70% of human genes have at least one obvious zebrafish orthologue. In addition, the high quality of this genome assembly provides a clearer understanding of key genomic features such as a unique repeat content, a scarcity of pseudogenes, an enrichment of zebrafish-specific genes on chromosome 4 and chromosomal regions that influence sex determination. PMID:23594743
True-breeding targeted gene knock-out in barley using designer TALE-nuclease in haploid cells.

PubMed

Gurushidze, Maia; Hensel, Goetz; Hiekel, Stefan; Schedel, Sindy; Valkov, Vladimir; Kumlehn, Jochen

2014-01-01

Transcription activator-like effector nucleases (TALENs) are customizable fusion proteins able to cleave virtually any genomic DNA sequence of choice, and thereby to generate site-directed genetic modifications in a wide range of cells and organisms. In the present study, we expressed TALENs in pollen-derived, regenerable cells to establish the generation of instantly true-breeding mutant plants. A gfp-specific TALEN pair was expressed via Agrobacterium-mediated transformation in embryogenic pollen of transgenic barley harboring a functional copy of gfp. Thanks to the haploid nature of the target cells, knock-out mutations were readily detected, and homozygous primary mutant plants obtained following genome duplication. In all, 22% of the TALEN transgenics proved knocked out with respect to gfp, and the loss of function could be ascribed to the deletions of between four and 36 nucleotides in length. The altered gfp alleles were transmitted normally through meiosis, and the knock-out phenotype was consistently shown by the offspring of two independent mutants. Thus, here we describe the efficient production of TALEN-mediated gene knock-outs in barley that are instantaneously homozygous and non-chimeric in regard to the site-directed mutations induced. This TALEN approach has broad applicability for both elucidating gene function and tailoring the phenotype of barley and other crop species.
Convergent evolution of a fused sexual cycle promotes the haploid lifestyle

NASA Astrophysics Data System (ADS)

Sherwood, Racquel Kim; Scaduto, Christine M.; Torres, Sandra E.; Bennett, Richard J.

2014-02-01

Sexual reproduction is restricted to eukaryotic species and involves the fusion of haploid gametes to form a diploid cell that subsequently undergoes meiosis to generate recombinant haploid forms. This process has been extensively studied in the unicellular yeast Saccharomyces cerevisiae, which exhibits separate regulatory control over mating and meiosis. Here we address the mechanism of sexual reproduction in the related hemiascomycete species Candida lusitaniae. We demonstrate that, in contrast to S. cerevisiae, C. lusitaniae exhibits a highly integrated sexual program in which the programs regulating mating and meiosis have fused. Profiling of the C. lusitaniae sexual cycle revealed that gene expression patterns during mating and meiosis were overlapping, indicative of co-regulation. This was particularly evident for genes involved in pheromone MAPK signalling, which were highly induced throughout the sexual cycle of C. lusitaniae. Furthermore, genetic analysis showed that the orthologue of IME2, a `diploid-specific' factor in S. cerevisiae, and STE12, the master regulator of S. cerevisiae mating, were each required for progression through both mating and meiosis in C. lusitaniae. Together, our results establish that sexual reproduction has undergone significant rewiring between S. cerevisiae and C. lusitaniae, and that a concerted sexual cycle operates in C. lusitaniae that is more reminiscent of the distantly related ascomycete, Schizosaccharomyces pombe. We discuss these results in light of the evolution of sexual reproduction in yeast, and propose that regulatory coupling of mating and meiosis has evolved multiple times as an adaptation to promote the haploid lifestyle.
A New and Improved Rainbow Trout (Oncorhynchus mykiss) Reference Genome Assembly

USDA-ARS?s Scientific Manuscript database

In an effort to improve the rainbow trout reference genome assembly, we re-sequenced the doubled-haploid Swanson line using the longest available reads from the Illumina technology; generating over 510 million paired-end shotgun reads (2x260nt), and 1 billion mate-pair reads (2x160nt) from four sequ...
Understanding the Human Genome Project -- A Fact Sheet

MedlinePlus

... cost of sequencing whole exomes or genomes, groundbreaking comparative genomic studies are now identifiying the causes of ... the role of ethical, legal, and social implications research more important than ever. National Human Genome Research ...
77 FR 2304 - National Human Genome Research Institute; Notice of Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2012-01-17

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome....S.C. 281(d)(4)), notice is hereby given that the National Human Genome Research Institute (NHGRI... meeting of the National Advisory Council for Human Genome Research. Background materials on the proposed...
The human genome as public: Justifications and implications.

PubMed

Bayefsky, Michelle J

2017-03-01

Since the human genome was decoded, great emphasis has been placed on the unique, personal nature of the genome, along with the benefits that personalized medicine can bring to individuals and the importance of safeguarding genetic privacy. As a result, an equally important aspect of the human genome - its common nature - has been underappreciated and underrepresented in the ethics literature and policy dialogue surrounding genetics and genomics. This article will argue that, just as the personal nature of the genome has been used to reinforce individual rights and justify important privacy protections, so too the common nature of the genome can be employed to support protections of the genome at a population level and policies designed to promote the public's wellbeing. In order for public health officials to have the authority to develop genetics policies for the sake of the public good, the genome must have not only a common, but also a public, dimension. This article contends that DNA carries a public dimension through the use of two conceptual frameworks: the common heritage (CH) framework and the common resource (CR) framework. Both frameworks establish a public interest in the human genome, but the CH framework can be used to justify policies aimed at preserving and protecting the genome, while the CR framework can be employed to justify policies for utilizing the genome for the public benefit. A variety of possible policy implications are discussed, with special attention paid to the use of large-scale genomics databases for public health research. © Published 2016. This article is a U.S. Government work and is in the public domain in the USA.
"Orphan" retrogenes in the human genome.

PubMed

Ciomborowska, Joanna; Rosikiewicz, Wojciech; Szklarczyk, Damian; Makałowski, Wojciech; Makałowska, Izabela

2013-02-01

Gene duplicates generated via retroposition were long thought to be pseudogenized and consequently decayed. However, a significant number of these genes escaped their evolutionary destiny and evolved into functional genes. Despite multiple studies, the number of functional retrogenes in human and other genomes remains unclear. We performed a comparative analysis of human, chicken, and worm genomes to identify "orphan" retrogenes, that is, retrogenes that have replaced their progenitors. We located 25 such candidates in the human genome. All of these genes were previously known, and the majority has been intensively studied. Despite this, they have never been recognized as retrogenes. Analysis revealed that the phenomenon of replacing parental genes with their retrocopies has been taking place over the entire span of animal evolution. This process was often species specific and contributed to interspecies differences. Surprisingly, these retrogenes, which should evolve in a more relaxed mode, are subject to a very strong purifying selection, which is, on average, two and a half times stronger than other human genes. Also, for retrogenes, they do not show a typical overall tendency for a testis-specific expression. Notably, seven of them are associated with human diseases. Recognizing them as "orphan" retrocopies, which have different regulatory machinery than their parents, is important for any disease studies in model organisms, especially when discoveries made in one species are transferred to humans.
[Novel bidirectional promoter from human genome].

PubMed

Orekhova, A S; Sverdlova, P S; Spirin, P V; Leonova, O G; Popenko, V I; Prasolov, V S; Rubtsov, P M

2011-01-01

In human and other mammalian genomes a number of closely linked gene pairs transcribed in opposite directions are found. According to bioinformatic analysis up to 10% of human genes are arranged in this way. In present work the fragment of human genome was cloned that separates genes localized at 2p13.1 and oriented "head-to-head", coding for hypothetical proteins with unknown functions--CCDC (Coiled Coil Domain Containing) 142 and TTC (TetraTricopeptide repeat Containing) 31. Intergenic CCDC142-TTC31 region overlaps with CpG-island and contains a number of potential binding sites for transcription factors. This fragment functions as bidirectional promoter in the system ofluciferase reporter gene expression upon transfection of human embryonic kidney (HEK293) cells. The vectors containing genes of two fluorescent proteins--green (EGFP) and red (DsRed2) in opposite orientations separated by the fragment of CCDC142-TTC31 intergenic region were constructed. In HEK293 cells transfected with these vectors simultaneous expression of two fluorescent proteins is observed. Truncated versions of intergenic region were obtained and their promoter activity measured. Minimal promoter fragment contains elements Inr, BRE, DPE characteristic for TATA-less promoters. Thus, from the human genome the novel bidirectional promoter was cloned that can be used for simultaneous constitutive expression of two genes in human cells.
Mapping the human genome

DOE Office of Scientific and Technical Information (OSTI.GOV)

Annas, G.C.; Elias, S.

1992-01-01

This article is a review of the book Mapping the Human Genome: Using Law and Ethics as Guides, edited by George C. Annas and Sherman Elias. The book is a collection of essays on the subject of using ethics and laws as guides to justify human gene mapping. It addresses specific issues such problems related to eugenics, patents, insurance as well as broad issues such as the societal definitions of normality.
Development of a CRISPR-Cas9 System for Efficient Genome Editing of Candida lusitaniae.

PubMed

Norton, Emily L; Sherwood, Racquel K; Bennett, Richard J

2017-01-01

Candida lusitaniae is a member of the Candida clade that includes a diverse group of fungal species relevant to both human health and biotechnology. This species exhibits a full sexual cycle to undergo interconversion between haploid and diploid forms. C. lusitaniae is also an emerging opportunistic pathogen that can cause serious bloodstream infections in the clinic and yet has often proven to be refractory to facile genetic manipulations. In this work, we develop a clustered regularly interspaced short palindromic repeat (CRISPR) and CRISPR-associated gene 9 (Cas9) system to enable genome editing of C. lusitaniae . We demonstrate that expression of CRISPR-Cas9 components under species-specific promoters is necessary for efficient gene targeting and can be successfully applied to multiple genes in both haploid and diploid isolates. Gene deletion efficiencies with CRISPR-Cas9 were further enhanced in C. lusitaniae strains lacking the established nonhomologous end joining (NHEJ) factors Ku70 and DNA ligase 4. These results indicate that NHEJ plays an important role in directing the repair of DNA double-strand breaks (DSBs) in C. lusitaniae and that removal of this pathway increases integration of gene deletion templates by homologous recombination. The described approaches significantly enhance the ability to perform genetic studies in, and promote understanding of, this emerging human pathogen and model sexual species. IMPORTANCE The ability to perform efficient genome editing is a key development for detailed mechanistic studies of a species. Candida lusitaniae is an important member of the Candida clade and is relevant both as an emerging human pathogen and as a model for understanding mechanisms of sexual reproduction. We highlight the development of a CRISPR-Cas9 system for efficient genome manipulation in C. lusitaniae and demonstrate the importance of species-specific promoters for expression of CRISPR components. We also demonstrate that the NHEJ
Megabase replication domains along the human genome: relation to chromatin structure and genome organisation.

PubMed

Audit, Benjamin; Zaghloul, Lamia; Baker, Antoine; Arneodo, Alain; Chen, Chun-Long; d'Aubenton-Carafa, Yves; Thermes, Claude

2013-01-01

In higher eukaryotes, the absence of specific sequence motifs, marking the origins of replication has been a serious hindrance to the understanding of (i) the mechanisms that regulate the spatio-temporal replication program, and (ii) the links between origins activation, chromatin structure and transcription. In this chapter, we review the partitioning of the human genome into megabased-size replication domains delineated as N-shaped motifs in the strand compositional asymmetry profiles. They collectively span 28.3% of the genome and are bordered by more than 1,000 putative replication origins. We recapitulate the comparison of this partition of the human genome with high-resolution experimental data that confirms that replication domain borders are likely to be preferential replication initiation zones in the germline. In addition, we highlight the specific distribution of experimental and numerical chromatin marks along replication domains. Domain borders correspond to particular open chromatin regions, possibly encoded in the DNA sequence, and around which replication and transcription are highly coordinated. These regions also present a high evolutionary breakpoint density, suggesting that susceptibility to breakage might be linked to local open chromatin fiber state. Altogether, this chapter presents a compartmentalization of the human genome into replication domains that are landmarks of the human genome organization and are likely to play a key role in genome dynamics during evolution and in pathological situations.
Insights from Human/Mouse genome comparisons

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pennacchio, Len A.

2003-03-30

Large-scale public genomic sequencing efforts have provided a wealth of vertebrate sequence data poised to provide insights into mammalian biology. These include deep genomic sequence coverage of human, mouse, rat, zebrafish, and two pufferfish (Fugu rubripes and Tetraodon nigroviridis) (Aparicio et al. 2002; Lander et al. 2001; Venter et al. 2001; Waterston et al. 2002). In addition, a high-priority has been placed on determining the genomic sequence of chimpanzee, dog, cow, frog, and chicken (Boguski 2002). While only recently available, whole genome sequence data have provided the unique opportunity to globally compare complete genome contents. Furthermore, the shared evolutionary ancestrymore » of vertebrate species has allowed the development of comparative genomic approaches to identify ancient conserved sequences with functionality. Accordingly, this review focuses on the initial comparison of available mammalian genomes and describes various insights derived from such analysis.« less

Helminth Genomics: The Implications for Human Health

PubMed Central

Brindley, Paul J.; Mitreva, Makedonka; Ghedin, Elodie; Lustigman, Sara

2009-01-01

More than two billion people (one-third of humanity) are infected with parasitic roundworms or flatworms, collectively known as helminth parasites. These infections cause diseases that are responsible for enormous levels of morbidity and mortality, delays in the physical development of children, loss of productivity among the workforce, and maintenance of poverty. Genomes of the major helminth species that affect humans, and many others of agricultural and veterinary significance, are now the subject of intensive genome sequencing and annotation. Draft genome sequences of the filarial worm Brugia malayi and two of the human schistosomes, Schistosoma japonicum and S. mansoni, are now available, among others. These genome data will provide the basis for a comprehensive understanding of the molecular mechanisms involved in helminth nutrition and metabolism, host-dependent development and maturation, immune evasion, and evolution. They are likely also to predict new potential vaccine candidates and drug targets. In this review, we present an overview of these efforts and emphasize the potential impact and importance of these new findings. PMID:19855829
Tempo and mode of genomic mutations unveil human evolutionary history.

PubMed

Hara, Yuichiro

2015-01-01

Mutations that have occurred in human genomes provide insight into various aspects of evolutionary history such as speciation events and degrees of natural selection. Comparing genome sequences between human and great apes or among humans is a feasible approach for inferring human evolutionary history. Recent advances in high-throughput or so-called 'next-generation' DNA sequencing technologies have enabled the sequencing of thousands of individual human genomes, as well as a variety of reference genomes of hominids, many of which are publicly available. These sequence data can help to unveil the detailed demographic history of the lineage leading to humans as well as the explosion of modern human population size in the last several thousand years. In addition, high-throughput sequencing illustrates the tempo and mode of de novo mutations, which are producing human genetic variation at this moment. Pedigree-based human genome sequencing has shown that mutation rates vary significantly across the human genome. These studies have also provided an improved timescale of human evolution, because the mutation rate estimated from pedigree analysis is half that estimated from traditional analyses based on molecular phylogeny. Because of the dramatic reduction in sequencing cost, sequencing on-demand samples designed for specific studies is now also becoming popular. To produce data of sufficient quality to meet the requirements of the study, it is necessary to set an explicit sequencing plan that includes the choice of sample collection methods, sequencing platforms, and number of sequence reads.
Viral symbiosis and the holobiontic nature of the human genome.

PubMed

Ryan, Francis Patrick

2016-01-01

The human genome is a holobiontic union of the mammalian nuclear genome, the mitochondrial genome and large numbers of endogenized retroviral genomes. This article defines and explores this symbiogenetic pattern of evolution, looking at the implications for human genetics, epigenetics, embryogenesis, physiology and the pathogenesis of inborn errors of metabolism and many other diseases. © 2016 APMIS. Published by John Wiley & Sons Ltd.
The Human Genome Project: big science transforms biology and medicine.

PubMed

Hood, Leroy; Rowen, Lee

2013-01-01

The Human Genome Project has transformed biology through its integrated big science approach to deciphering a reference human genome sequence along with the complete sequences of key model organisms. The project exemplifies the power, necessity and success of large, integrated, cross-disciplinary efforts - so-called 'big science' - directed towards complex major objectives. In this article, we discuss the ways in which this ambitious endeavor led to the development of novel technologies and analytical tools, and how it brought the expertise of engineers, computer scientists and mathematicians together with biologists. It established an open approach to data sharing and open-source software, thereby making the data resulting from the project accessible to all. The genome sequences of microbes, plants and animals have revolutionized many fields of science, including microbiology, virology, infectious disease and plant biology. Moreover, deeper knowledge of human sequence variation has begun to alter the practice of medicine. The Human Genome Project has inspired subsequent large-scale data acquisition initiatives such as the International HapMap Project, 1000 Genomes, and The Cancer Genome Atlas, as well as the recently announced Human Brain Project and the emerging Human Proteome Project.
The Human Genome Project: big science transforms biology and medicine

PubMed Central

2013-01-01

The Human Genome Project has transformed biology through its integrated big science approach to deciphering a reference human genome sequence along with the complete sequences of key model organisms. The project exemplifies the power, necessity and success of large, integrated, cross-disciplinary efforts - so-called ‘big science’ - directed towards complex major objectives. In this article, we discuss the ways in which this ambitious endeavor led to the development of novel technologies and analytical tools, and how it brought the expertise of engineers, computer scientists and mathematicians together with biologists. It established an open approach to data sharing and open-source software, thereby making the data resulting from the project accessible to all. The genome sequences of microbes, plants and animals have revolutionized many fields of science, including microbiology, virology, infectious disease and plant biology. Moreover, deeper knowledge of human sequence variation has begun to alter the practice of medicine. The Human Genome Project has inspired subsequent large-scale data acquisition initiatives such as the International HapMap Project, 1000 Genomes, and The Cancer Genome Atlas, as well as the recently announced Human Brain Project and the emerging Human Proteome Project. PMID:24040834
Sequencing of diverse mandarin, pummelo and orange genomes reveals complex history of admixture during citrus domestication

USDA-ARS?s Scientific Manuscript database

Cultivated citrus are selections from, or hybrids of, wild progenitor species whose identities and contributions to citrus domestication remain controversial. Here we sequence and compare citrus genomes—a high-quality reference haploid clementine genome and mandarin, pummelo, sweet-orange and sour-o...
In the Beginning was the Genome: Genomics and the Bi-textuality of Human Existence.

PubMed

Zwart, H A E Hub

2018-04-01

This paper addresses the cultural impact of genomics and the Human Genome Project (HGP) on human self-understanding. Notably, it addresses the claim made by Francis Collins (director of the HGP) that the genome is the language of God and the claim made by Max Delbrück (founding father of molecular life sciences research) that Aristotle must be credited with having predicted DNA as the soul that organises bio-matter. From a continental philosophical perspective I will argue that human existence results from a dialectical interaction between two types of texts: the language of molecular biology and the language of civilisation; the language of the genome and the language of our socio-cultural, symbolic ambiance. Whereas the former ultimately builds on the alphabets of genes and nucleotides, the latter is informed by primordial texts such as the Bible and the Quran. In applied bioethics deliberations on genomics, science is easily framed as liberating and progressive, religious world-views as conservative and restrictive (Zwart 1993). This paper focusses on the broader cultural ambiance of the debate to discern how the bi-textuality of human existence is currently undergoing a transition, as not only the physiological, but also the normative dimension is being reframed in biomolecular and terabyte terms.
From hacking the human genome to editing organs.

PubMed

Tobita, Takamasa; Guzman-Lepe, Jorge; Collin de l'Hortet, Alexandra

2015-01-01

In the recent decades, human genome engineering has been one of the major interesting research subjects, essentially because it raises new possibilities for personalized medicine and biotechnologies. With the development of engineered nucleases such as the Zinc Finger Nucleases (ZFNs), the Transcription activator-like effector nucleases (TALENs) and more recently the Clustered Regularly Interspaced short Palindromic Repeats (CRISPR), the field of human genome edition has evolved very rapidly. Every new genetic tool is broadening the scope of applications on human tissues, even before we can completely master each of these tools. In this review, we will present the recent advances regarding human genome edition tools, we will discuss the numerous implications they have in research and medicine, and we will mention the limits and concerns about such technologies.
From hacking the human genome to editing organs

PubMed Central

Tobita, Takamasa; Guzman-Lepe, Jorge; Collin de l'Hortet, Alexandra

2015-01-01

ABSTRACT In the recent decades, human genome engineering has been one of the major interesting research subjects, essentially because it raises new possibilities for personalized medicine and biotechnologies. With the development of engineered nucleases such as the Zinc Finger Nucleases (ZFNs), the Transcription activator-like effector nucleases (TALENs) and more recently the Clustered Regularly Interspaced short Palindromic Repeats (CRISPR), the field of human genome edition has evolved very rapidly. Every new genetic tool is broadening the scope of applications on human tissues, even before we can completely master each of these tools. In this review, we will present the recent advances regarding human genome edition tools, we will discuss the numerous implications they have in research and medicine, and we will mention the limits and concerns about such technologies PMID:26588350
77 FR 59933 - National Human Genome Research Institute; Notice of Closed Meetings

Federal Register 2010, 2011, 2012, 2013, 2014

2012-10-01

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... clearly unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research....D., Scientific Review Officer, Scientific Review Branch, National Human Genome Research Institute...
The genome of melon (Cucumis melo L.)

PubMed Central

Garcia-Mas, Jordi; Benjak, Andrej; Sanseverino, Walter; Bourgeois, Michael; Mir, Gisela; González, Víctor M.; Hénaff, Elizabeth; Câmara, Francisco; Cozzuto, Luca; Lowy, Ernesto; Alioto, Tyler; Capella-Gutiérrez, Salvador; Blanca, Jose; Cañizares, Joaquín; Ziarsolo, Pello; Gonzalez-Ibeas, Daniel; Rodríguez-Moreno, Luis; Droege, Marcus; Du, Lei; Alvarez-Tejado, Miguel; Lorente-Galdos, Belen; Melé, Marta; Yang, Luming; Weng, Yiqun; Navarro, Arcadi; Marques-Bonet, Tomas; Aranda, Miguel A.; Nuez, Fernando; Picó, Belén; Gabaldón, Toni; Roma, Guglielmo; Guigó, Roderic; Casacuberta, Josep M.; Arús, Pere; Puigdomènech, Pere

2012-01-01

We report the genome sequence of melon, an important horticultural crop worldwide. We assembled 375 Mb of the double-haploid line DHL92, representing 83.3% of the estimated melon genome. We predicted 27,427 protein-coding genes, which we analyzed by reconstructing 22,218 phylogenetic trees, allowing mapping of the orthology and paralogy relationships of sequenced plant genomes. We observed the absence of recent whole-genome duplications in the melon lineage since the ancient eudicot triplication, and our data suggest that transposon amplification may in part explain the increased size of the melon genome compared with the close relative cucumber. A low number of nucleotide-binding site–leucine-rich repeat disease resistance genes were annotated, suggesting the existence of specific defense mechanisms in this species. The DHL92 genome was compared with that of its parental lines allowing the quantification of sequence variability in the species. The use of the genome sequence in future investigations will facilitate the understanding of evolution of cucurbits and the improvement of breeding strategies. PMID:22753475
Templated sequence insertion polymorphisms in the human genome

NASA Astrophysics Data System (ADS)

Onozawa, Masahiro; Aplan, Peter

2016-11-01

Templated Sequence Insertion Polymorphism (TSIP) is a recently described form of polymorphism recognized in the human genome, in which a sequence that is templated from a distant genomic region is inserted into the genome, seemingly at random. TSIPs can be grouped into two classes based on nucleotide sequence features at the insertion junctions; Class 1 TSIPs show features of insertions that are mediated via the LINE-1 ORF2 protein, including 1) target-site duplication (TSD), 2) polyadenylation 10-30 nucleotides downstream of a “cryptic” polyadenylation signal, and 3) preference for insertion at a 5’-TTTT/A-3’ sequence. In contrast, class 2 TSIPs show features consistent with repair of a DNA double-strand break via insertion of a DNA “patch” that is derived from a distant genomic region. Survey of a large number of normal human volunteers demonstrates that most individuals have 25-30 TSIPs, and that these TSIPs track with specific geographic regions. Similar to other forms of human polymorphism, we suspect that these TSIPs may be important for the generation of human diversity and genetic diseases.
Population genetic inference from personal genome data: impact of ancestry and admixture on human genomic variation.

PubMed

Kidd, Jeffrey M; Gravel, Simon; Byrnes, Jake; Moreno-Estrada, Andres; Musharoff, Shaila; Bryc, Katarzyna; Degenhardt, Jeremiah D; Brisbin, Abra; Sheth, Vrunda; Chen, Rong; McLaughlin, Stephen F; Peckham, Heather E; Omberg, Larsson; Bormann Chung, Christina A; Stanley, Sarah; Pearlstein, Kevin; Levandowsky, Elizabeth; Acevedo-Acevedo, Suehelay; Auton, Adam; Keinan, Alon; Acuña-Alonzo, Victor; Barquera-Lozano, Rodrigo; Canizales-Quinteros, Samuel; Eng, Celeste; Burchard, Esteban G; Russell, Archie; Reynolds, Andy; Clark, Andrew G; Reese, Martin G; Lincoln, Stephen E; Butte, Atul J; De La Vega, Francisco M; Bustamante, Carlos D

2012-10-05

Full sequencing of individual human genomes has greatly expanded our understanding of human genetic variation and population history. Here, we present a systematic analysis of 50 human genomes from 11 diverse global populations sequenced at high coverage. Our sample includes 12 individuals who have admixed ancestry and who have varying degrees of recent (within the last 500 years) African, Native American, and European ancestry. We found over 21 million single-nucleotide variants that contribute to a 1.75-fold range in nucleotide heterozygosity across diverse human genomes. This heterozygosity ranged from a high of one heterozygous site per kilobase in west African genomes to a low of 0.57 heterozygous sites per kilobase in segments inferred to have diploid Native American ancestry from the genomes of Mexican and Puerto Rican individuals. We show evidence of all three continental ancestries in the genomes of Mexican, Puerto Rican, and African American populations, and the genome-wide statistics are highly consistent across individuals from a population once ancestry proportions have been accounted for. Using a generalized linear model, we identified subtle variations across populations in the proportion of neutral versus deleterious variation and found that genome-wide statistics vary in admixed populations even once ancestry proportions have been factored in. We further infer that multiple periods of gene flow shaped the diversity of admixed populations in the Americas-70% of the European ancestry in today's African Americans dates back to European gene flow happening only 7-8 generations ago. Copyright © 2012 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
78 FR 56905 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2013-09-16

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... clearly unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research....m. Agenda: To review and evaluate grant applications. Place: National Human Genome Research...
77 FR 5035 - National Human Genome Research Institute; Notice of Closed Meetings

Federal Register 2010, 2011, 2012, 2013, 2014

2012-02-01

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... clearly unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research... Officer, Scientific Review Branch, National Human Genome Research Institute, National Institutes of Health...
77 FR 58402 - National Human Genome Research Institute; Notice of Closed Meetings

Federal Register 2010, 2011, 2012, 2013, 2014

2012-09-20

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... clearly unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research...: To review and evaluate grant applications. Place: National Human Genome Research Institute, 5635...
78 FR 107 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2013-01-02

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... evaluate grant applications. Place: National Human Genome Research Institute, 3rd Floor Conference Room....D., Scientific Review Officer, Scientific Review Branch, National Human Genome Research Institute...
First moves of the USSR Human Genome Project

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bayev, A.A.

1991-01-01

The USSR Human Genome Project is an intrinsic part of genetic research that still has to recover from the hard ordeal of the past. The imperious influence of Trofim Lysenko and his concepts inhibited the progress of genetics, which had been developing quite successfully before him, and suppressed and often physically destroyed many of our outstanding scientists. Human genome studies were discussed for the first time at a general meeting of the USSR Academy of Sciences in 1988. As early as December 1988, the USSR Council of Ministers adopted a resolution on the creation of a Human Genome Project, whichmore » since 1989 exists in the USSR as one of the national projects.« less
75 FR 19984 - National Human Genome Research Institute; Notice of Closed Meetings

Federal Register 2010, 2011, 2012, 2013, 2014

2010-04-16

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome..., National Human Genome Research Institute, National Institutes of Health, 5635 Fishers Lane, Suite 4075... Nakamura, PhD, Scientific Review Officer, Scientific Review Branch, National Human Genome Research...
76 FR 58023 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2011-09-19

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Initial..., Scientific Review Officer, Office of Scientific Review, National Human Genome Research Institute, National...

76 FR 3642 - National Human Genome Research Institute; Notice of Closed Meetings

Federal Register 2010, 2011, 2012, 2013, 2014

2011-01-20

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... clearly unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research....nih.gov . Name of Committee: National Human Genome Research Institute Special Emphasis Panel eMERGE...
75 FR 53703 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2010-09-01

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome..., Scientific Review Branch, National Human Genome Research Institute, National Institutes of Health, 5635.... (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes of...
78 FR 9707 - National Human Genome Research Institute; Notice of Closed Meetings

Federal Register 2010, 2011, 2012, 2013, 2014

2013-02-11

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... clearly unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research... Officer, Scientific Review Branch, National Human Genome Research Institute, 5635 Fishers Lane, Suite 4076...
77 FR 71604 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2012-12-03

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special..., Scientific Review Branch, National Human Genome Research Institute, National Institutes of Health, 5635...
76 FR 17930 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2011-03-31

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special... Review Officer, Scientific Review Branch, National Human Genome Research Institute, 5635 Fishers Lane...
77 FR 28888 - National Human Genome Research Institute Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2012-05-16

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Initial...: To review and evaluate grant applications. Place: National Human Genome Research Institute, 3635...
75 FR 32957 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2010-06-10

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special... funding cycle. (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research...
77 FR 8268 - National Human Genome Research Institute; Notice of Closed Meetings

Federal Register 2010, 2011, 2012, 2013, 2014

2012-02-14

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... applications. Place: National Human Genome Research Institute, 5635 Fisher's Lane, Room 4076, Rockville, MD..., CIDR, National Human Genome Research Institute, National Institutes of Health, 5635 Fishers Lane, Suite...
78 FR 70063 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2013-11-22

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Counselors, National Human Genome Research Institute. The meeting will be closed to the public as indicated... NATIONAL HUMAN GENOME RESEARCH INSTITUTE, including consideration of personnel qualifications and...
77 FR 20646 - National Human Genome Research Institute; Notice of Closed Meetings

Federal Register 2010, 2011, 2012, 2013, 2014

2012-04-05

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... clearly unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research.... Agenda: To review and evaluate grant applications. Place: National Human Genome Research Institute, 5635...
78 FR 64222 - National Human Genome Research Institute; Notice of Closed Meetings

Federal Register 2010, 2011, 2012, 2013, 2014

2013-10-28

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... clearly unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research... Review, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, 301...
78 FR 20933 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2013-04-08

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special... review and evaluate grant applications. Place: National Human Genome Research Institute, Room 3055, 5635...
78 FR 14806 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2013-03-07

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special... p.m. Agenda: To review and evaluate grant applications. Place: National Human Genome Research...
78 FR 21382 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2013-04-10

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... applications. Place: National Human Genome Research Institute, Suite 4076, 5635 Fisher's Lane, Bethesda, MD..., National Human Genome Research Institute, National Institutes of Health, 5635 Fishers Lane, Suite 4075...
77 FR 60706 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2012-10-04

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special.... Nakamura, Ph.D., Scientific Review Officer, Scientific Review Branch, National Human Genome Research...
77 FR 22332 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2012-04-13

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special.... Agenda: To review and evaluate grant applications. Place: National Human Genome Research Institute, 5635...
77 FR 12604 - National Human Genome Research Institute; Notice of Closed Meetings

Federal Register 2010, 2011, 2012, 2013, 2014

2012-03-01

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... clearly unwarranted invasion of personal privacy. >Name of Committee: National Human Genome Research... review and evaluate contract proposals. Place: National Human Genome Reseach Institute, 5635 Fishers Lane...
76 FR 65204 - National Human Genome Research Institute; Notice of Closed Meetings

Federal Register 2010, 2011, 2012, 2013, 2014

2011-10-20

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... constitute a clearly unwarranted invasion of personal privacy. Name of Committee: National Human Genome... Review Officer, Scientific Review Branch, National Human Genome Research Institute, 5635 Fishers Lane...
78 FR 31953 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2013-05-28

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special... review and evaluate grant applications. Place: National Human Genome Research Institute, 3rd Floor...
Consensus generation and variant detection by Celera Assembler.

PubMed

Denisov, Gennady; Walenz, Brian; Halpern, Aaron L; Miller, Jason; Axelrod, Nelson; Levy, Samuel; Sutton, Granger

2008-04-15

We present an algorithm to identify allelic variation given a Whole Genome Shotgun (WGS) assembly of haploid sequences, and to produce a set of haploid consensus sequences rather than a single consensus sequence. Existing WGS assemblers take a column-by-column approach to consensus generation, and produce a single consensus sequence which can be inconsistent with the underlying haploid alleles, and inconsistent with any of the aligned sequence reads. Our new algorithm uses a dynamic windowing approach. It detects alleles by simultaneously processing the portions of aligned reads spanning a region of sequence variation, assigns reads to their respective alleles, phases adjacent variant alleles and generates a consensus sequence corresponding to each confirmed allele. This algorithm was used to produce the first diploid genome sequence of an individual human. It can also be applied to assemblies of multiple diploid individuals and hybrid assemblies of multiple haploid organisms. Being applied to the individual human genome assembly, the new algorithm detects exactly two confirmed alleles and reports two consensus sequences in 98.98% of the total number 2,033311 detected regions of sequence variation. In 33,269 out of 460,373 detected regions of size >1 bp, it fixes the constructed errors of a mosaic haploid representation of a diploid locus as produced by the original Celera Assembler consensus algorithm. Using an optimized procedure calibrated against 1 506 344 known SNPs, it detects 438 814 new heterozygous SNPs with false positive rate 12%. The open source code is available at: http://wgs-assembler.cvs.sourceforge.net/wgs-assembler/

Draft genome sequence of the rubber tree Hevea brasiliensis.

PubMed

Rahman, Ahmad Yamin Abdul; Usharraj, Abhilash O; Misra, Biswapriya B; Thottathil, Gincy P; Jayasekaran, Kandakumar; Feng, Yun; Hou, Shaobin; Ong, Su Yean; Ng, Fui Ling; Lee, Ling Sze; Tan, Hock Siew; Sakaff, Muhd Khairul Luqman Muhd; Teh, Beng Soon; Khoo, Bee Feong; Badai, Siti Suriawati; Aziz, Nurohaida Ab; Yuryev, Anton; Knudsen, Bjarne; Dionne-Laporte, Alexandre; Mchunu, Nokuthula P; Yu, Qingyi; Langston, Brennick J; Freitas, Tracey Allen K; Young, Aaron G; Chen, Rui; Wang, Lei; Najimudin, Nazalan; Saito, Jennifer A; Alam, Maqsudul

2013-02-02

Hevea brasiliensis, a member of the Euphorbiaceae family, is the major commercial source of natural rubber (NR). NR is a latex polymer with high elasticity, flexibility, and resilience that has played a critical role in the world economy since 1876. Here, we report the draft genome sequence of H. brasiliensis. The assembly spans ~1.1 Gb of the estimated 2.15 Gb haploid genome. Overall, ~78% of the genome was identified as repetitive DNA. Gene prediction shows 68,955 gene models, of which 12.7% are unique to Hevea. Most of the key genes associated with rubber biosynthesis, rubberwood formation, disease resistance, and allergenicity have been identified. The knowledge gained from this genome sequence will aid in the future development of high-yielding clones to keep up with the ever increasing need for natural rubber.
76 FR 35224 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2011-06-16

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome...). Contact Person: Camilla E. Day, PhD, Scientific Review Officer, CIR, National Human Genome Research..., [email protected] . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research...
76 FR 35223 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2011-06-16

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special... Person: Rudy O. Pozzatti, PhD, Scientific Review Officer, Scientific Review Branch, National Human Genome...
76 FR 66731 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2011-10-27

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special... Program Nos. 93.172, Human Genome Research, National Institutes of Health, HHS) Dated: October 21, 2011...
75 FR 67380 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2010-11-02

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Review Branch, National Human Genome Research Institute, National Institutes of Health, 5635 Fishers Lane.... (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes of...
75 FR 26762 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2010-05-12

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Initial... . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes of...
76 FR 63932 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2011-10-14

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special... Assistance Program Nos. 93.172, Human Genome Research, National Institutes of Health, HHS) Dated: October 7...
76 FR 3917 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2011-01-21

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special... Branch, National Human Genome Research Institute, 5635 Fishers Lane, Suite 4076, MSC 9306, Rockville, MD...
77 FR 61770 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2012-10-11

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special... Assistance Program Nos. 93.172, Human Genome Research, National Institutes of Health, HHS) [[Page 61771...
76 FR 3643 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2011-01-20

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Initial... . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes of...
77 FR 35991 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2012-06-15

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Human Genome Research Institute, National Institutes of Health, 5635 Fishers Lane, Suite 4075, Bethesda.... 93.172, Human Genome Research, National Institutes of Health, HHS) Dated: June 8, 2012. Jennifer S...
75 FR 35821 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2010-06-23

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Person: Camilla E. Day, PhD, Scientific Review Officer, CIDR, National Human Genome Research [email protected] . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research...
75 FR 56115 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2010-09-15

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special... Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes of Health, HHS...
76 FR 19780 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2011-04-08

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... E. Day, PhD, Scientific Review Officer, CIDR, National Human Genome Research Institute, National... . (Catalogue of Federal Domestic Assistance Program No. 93.172, Human Genome Research, National Institutes of...
75 FR 48977 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2010-08-12

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome.... Contact Person: Camilla E. Day, PhD, Scientific Review Officer, CIDR, National Human Genome Research..., [email protected] . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research...
77 FR 74676 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2012-12-17

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Human Genome Research Institute, National Institutes of Health, 5635 Fishers Lane, Suite 4075, Bethesda.... 93.172, Human Genome Research, National Institutes of Health, HHS) Dated: December 11, 2012. David...
78 FR 47715 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2013-08-06

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Person: Camilla E. Day, Ph.D., Scientific Review Officer, CIDR, National Human Genome Research [email protected] . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research...
76 FR 79199 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2011-12-21

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome.... Contact Person: Camilla E. Day, Ph.D., Scientific Review Officer, CIDR, National Human Genome Research..., [email protected] . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research...
75 FR 8977 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2010-02-26

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Human Genome Research Institute, National Institutes of Health, 5635 Fishers Lane, Suite 4075, Bethesda.... 93.172, Human Genome Research, National Institutes of Health, HHS) Dated: February 18, 2010. Jennifer...
75 FR 8977 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2010-02-26

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome..., National Human Genome Research Institute, National Institutes of Health, 5635 Fishers Lane, Suite 4076, MSC..., Human Genome Research, National Institutes of Health, HHS) Dated: February 18, 2010. Jennifer Spaeth...

75 FR 52538 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2010-08-26

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special... Person: Ken D. Nakamura, PhD, Scientific Review Officer, Scientific Review Branch, National Human Genome...
76 FR 36930 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2011-06-23

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special..., Human Genome Research, National Institutes of Health, HHS) Dated: June 17, 2011. Jennifer S. Spaeth...
76 FR 10909 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2011-02-28

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome..., National Human Genome Research Institute, National Institutes of Health, 5635 Fishers Lane, Suite 4076, MSC..., Human Genome Research, National Institutes of Health, HHS). Dated: February 18, 2011. Jennifer S. Spaeth...
78 FR 24223 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2013-04-24

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Initial...: To review and evaluate grant applications. Place: National Human Genome Research Institute, 3rd floor...
76 FR 22407 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2011-04-21

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special.... (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes of...
A New Single Nucleotide Polymorphism Database for Rainbow Trout Generated Through Whole Genome Resequencing.

PubMed

Gao, Guangtu; Nome, Torfinn; Pearse, Devon E; Moen, Thomas; Naish, Kerry A; Thorgaard, Gary H; Lien, Sigbjørn; Palti, Yniv

2018-01-01

Single-nucleotide polymorphisms (SNPs) are highly abundant markers, which are broadly distributed in animal genomes. For rainbow trout ( Oncorhynchus mykiss ), SNP discovery has been previously done through sequencing of restriction-site associated DNA (RAD) libraries, reduced representation libraries (RRL) and RNA sequencing. Recently we have performed high coverage whole genome resequencing with 61 unrelated samples, representing a wide range of rainbow trout and steelhead populations, with 49 new samples added to 12 aquaculture samples from AquaGen (Norway) that we previously used for SNP discovery. Of the 49 new samples, 11 were double-haploid lines from Washington State University (WSU) and 38 represented wild and hatchery populations from a wide range of geographic distribution and with divergent migratory phenotypes. We then mapped the sequences to the new rainbow trout reference genome assembly (GCA_002163495.1) which is based on the Swanson YY doubled haploid line. Variant calling was conducted with FreeBayes and SAMtools mpileup , followed by filtering of SNPs based on quality score, sequence complexity, read depth on the locus, and number of genotyped samples. Results from the two variant calling programs were compared and genotypes of the double haploid samples were used for detecting and filtering putative paralogous sequence variants (PSVs) and multi-sequence variants (MSVs). Overall, 30,302,087 SNPs were identified on the rainbow trout genome 29 chromosomes and 1,139,018 on unplaced scaffolds, with 4,042,723 SNPs having high minor allele frequency (MAF > 0.25). The average SNP density on the chromosomes was one SNP per 64 bp, or 15.6 SNPs per 1 kb. Results from the phylogenetic analysis that we conducted indicate that the SNP markers contain enough population-specific polymorphisms for recovering population relationships despite the small sample size used. Intra-Population polymorphism assessment revealed high level of polymorphism and heterozygosity
Characters that differ between diploid and haploid honey bee (Apis mellifera) drones.

PubMed

Herrmann, Matthias; Trenzcek, Tina; Fahrenhorst, Hartmut; Engels, Wolf

2005-12-30

Diploid males have long been considered a curiosity contradictory to the haplo-diploid mode of sex determination in the Hymenoptera. In Apis mellifera, 'false' diploid male larvae are eliminated by worker cannibalism immediately after hatching. A 'cannibalism substance' produced by diploid drone larvae to induce worker-assisted suicide has been hypothesized, but it has never been detected. Diploid drones are only removed some hours after hatching. Older larvae are evidently not regarded as 'false males' and instead are regularly nursed by the brood-attending worker bees. As the pheromonal cues presumably are located on the surface of newly hatched bee larvae, we extracted the cuticular secretions and analyzed their chemical composition by gas chromatograph-mass spectrometry (GC-MS) analyses. Larvae were sexed and then reared in vitro for up to three days. The GC-MS pattern that was obtained, with alkanes as the major compounds, was compared between diploid and haploid drone larvae. We also examined some physical parameters of adult drones. There was no difference between diploid and haploid males in their weight at the day of emergence. The diploid adult drones had fewer wing hooks and smaller testes. The sperm DNA content was 0.30 and 0.15 pg per nucleus, giving an exact 2:1 ratio for the gametocytes of diploid and haploid drones, respectively. Vitellogenin was found in the hemolymph of both types of imaginal drones at 5 to 6 days, with a significantly lower titer in the diploids.
Centromere reference models for human chromosomes X and Y satellite arrays

PubMed Central

Miga, Karen H.; Newton, Yulia; Jain, Miten; Altemose, Nicolas; Willard, Huntington F.; Kent, W. James

2014-01-01

The human genome sequence remains incomplete, with multimegabase-sized gaps representing the endogenous centromeres and other heterochromatic regions. Available sequence-based studies within these sites in the genome have demonstrated a role in centromere function and chromosome pairing, necessary to ensure proper chromosome segregation during cell division. A common genomic feature of these regions is the enrichment of long arrays of near-identical tandem repeats, known as satellite DNAs, which offer a limited number of variant sites to differentiate individual repeat copies across millions of bases. This substantial sequence homogeneity challenges available assembly strategies and, as a result, centromeric regions are omitted from ongoing genomic studies. To address this problem, we utilize monomer sequence and ordering information obtained from whole-genome shotgun reads to model two haploid human satellite arrays on chromosomes X and Y, resulting in an initial characterization of 3.83 Mb of centromeric DNA within an individual genome. To further expand the utility of each centromeric reference sequence model, we evaluate sites within the arrays for short-read mappability and chromosome specificity. Because satellite DNAs evolve in a concerted manner, we use these centromeric assemblies to assess the extent of sequence variation among 366 individuals from distinct human populations. We thus identify two satellite array variants in both X and Y centromeres, as determined by array length and sequence composition. This study provides an initial sequence characterization of a regional centromere and establishes a foundation to extend genomic characterization to these sites as well as to other repeat-rich regions within complex genomes. PMID:24501022
76 FR 66076 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2011-10-25

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Call). Contact Person: Camilla E. Day, PhD, Scientific Review Officer, CIDR, National Human Genome... Assistance Program Nos. 93.172, Human Genome Research, National Institutes of Health, HHS) Dated: October 19...
75 FR 80509 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2010-12-22

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Call). Contact Person: Camilla E. Day, PhD, Scientific Review Officer, CIDR, National Human Genome... Assistance Program Nos. 93.172, Human Genome Research, National Institutes of Health, HHS) Dated: December 16...
78 FR 61851 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2013-10-04

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special... a.m. to 4:00 p.m. Agenda: To review and evaluate grant applications. Place: National Human Genome...
Revealing the missing expressed genes beyond the human reference genome by RNA-Seq.

PubMed

Chen, Geng; Li, Ruiyuan; Shi, Leming; Qi, Junyi; Hu, Pengzhan; Luo, Jian; Liu, Mingyao; Shi, Tieliu

2011-12-02

The complete and accurate human reference genome is important for functional genomics researches. Therefore, the incomplete reference genome and individual specific sequences have significant effects on various studies. we used two RNA-Seq datasets from human brain tissues and 10 mixed cell lines to investigate the completeness of human reference genome. First, we demonstrated that in previously identified ~5 Mb Asian and ~5 Mb African novel sequences that are absent from the human reference genome of NCBI build 36, ~211 kb and ~201 kb of them could be transcribed, respectively. Our results suggest that many of those transcribed regions are not specific to Asian and African, but also present in Caucasian. Then, we found that the expressions of 104 RefSeq genes that are unalignable to NCBI build 37 in brain and cell lines are higher than 0.1 RPKM. 55 of them are conserved across human, chimpanzee and macaque, suggesting that there are still a significant number of functional human genes absent from the human reference genome. Moreover, we identified hundreds of novel transcript contigs that cannot be aligned to NCBI build 37, RefSeq genes and EST sequences. Some of those novel transcript contigs are also conserved among human, chimpanzee and macaque. By positioning those contigs onto the human genome, we identified several large deletions in the reference genome. Several conserved novel transcript contigs were further validated by RT-PCR. Our findings demonstrate that a significant number of genes are still absent from the incomplete human reference genome, highlighting the importance of further refining the human reference genome and curating those missing genes. Our study also shows the importance of de novo transcriptome assembly. The comparative approach between reference genome and other related human genomes based on the transcriptome provides an alternative way to refine the human reference genome.
78 FR 66752 - National Human Genome Research Institute; Amended Notice of Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2013-11-06

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... National Human Genome Research Institute Special Emphasis Panel, October 15, 2013, 01:00 p.m. to October 15, 2013, 02:30 p.m., National Human Genome Research Institute, 5635 Fishers Lane, Suite 3055, Rockville...
The first genome sequences of human bocaviruses from Vietnam

PubMed Central

Thanh, Tran Tan; Van, Hoang Minh Tu; Hong, Nguyen Thi Thu; Nhu, Le Nguyen Truc; Anh, Nguyen To; Tuan, Ha Manh; Hien, Ho Van; Tuong, Nguyen Manh; Kien, Trinh Trung; Khanh, Truong Huu; Nhan, Le Nguyen Thanh; Hung, Nguyen Thanh; Chau, Nguyen Van Vinh; Thwaites, Guy; van Doorn, H. Rogier; Tan, Le Van

2017-01-01

As part of an ongoing effort to generate complete genome sequences of hand, foot and mouth disease-causing enteroviruses directly from clinical specimens, two complete coding sequences and two partial genomic sequences of human bocavirus 1 (n=3) and 2 (n=1) were co-amplified and sequenced, representing the first genome sequences of human bocaviruses from Vietnam. The sequences may aid future study aiming at understanding the evolution of the virus. PMID:28090592
Diallel crossing among doubled haploids of cucumber reveals significant reciprocal-cross differences

USDA-ARS?s Scientific Manuscript database

Cucumber is an excellent plant for studying organellar effects on phenotypes because chloroplasts show maternal and mitochondria paternal transmission. We produced doubled haploids (DH) from divergent cucumber populations, generated reciprocal crosses in a diallel mating scheme, measured fresh and d...
Functional assessment of human enhancer activities using whole-genome STARR-sequencing.

PubMed

Liu, Yuwen; Yu, Shan; Dhiman, Vineet K; Brunetti, Tonya; Eckart, Heather; White, Kevin P

2017-11-20

Genome-wide quantification of enhancer activity in the human genome has proven to be a challenging problem. Recent efforts have led to the development of powerful tools for enhancer quantification. However, because of genome size and complexity, these tools have yet to be applied to the whole human genome. In the current study, we use a human prostate cancer cell line, LNCaP as a model to perform whole human genome STARR-seq (WHG-STARR-seq) to reliably obtain an assessment of enhancer activity. This approach builds upon previously developed STARR-seq in the fly genome and CapSTARR-seq techniques in targeted human genomic regions. With an improved library preparation strategy, our approach greatly increases the library complexity per unit of starting material, which makes it feasible and cost-effective to explore the landscape of regulatory activity in the much larger human genome. In addition to our ability to identify active, accessible enhancers located in open chromatin regions, we can also detect sequences with the potential for enhancer activity that are located in inaccessible, closed chromatin regions. When treated with the histone deacetylase inhibitor, Trichostatin A, genes nearby this latter class of enhancers are up-regulated, demonstrating the potential for endogenous functionality of these regulatory elements. WHG-STARR-seq provides an improved approach to current pipelines for analysis of high complexity genomes to gain a better understanding of the intricacies of transcriptional regulation.
Mapping and Sequencing the Human Genome: Science, Ethics, and Public Policy.

ERIC Educational Resources Information Center

Cutter, Mary Ann G.; Drexler, Edward; McCullough, Laurence B.; McInerney, Joseph D.; Murray, Jeffrey C.; Rossiter, Belinda; Zola, John

The human genome project started in 1989 with the collaboration of the National Institutes of Health (NIH) and the U.S. Department of Energy (DOE). This document aims to develop an understanding among students of the human genome project and relevant issues. Topics include the science and technology of the human genome project, and the ethical and…
The draft genome and transcriptome of Cannabis sativa.

PubMed

van Bakel, Harm; Stout, Jake M; Cote, Atina G; Tallon, Carling M; Sharpe, Andrew G; Hughes, Timothy R; Page, Jonathan E

2011-10-20

Cannabis sativa has been cultivated throughout human history as a source of fiber, oil and food, and for its medicinal and intoxicating properties. Selective breeding has produced cannabis plants for specific uses, including high-potency marijuana strains and hemp cultivars for fiber and seed production. The molecular biology underlying cannabinoid biosynthesis and other traits of interest is largely unexplored. We sequenced genomic DNA and RNA from the marijuana strain Purple Kush using shortread approaches. We report a draft haploid genome sequence of 534 Mb and a transcriptome of 30,000 genes. Comparison of the transcriptome of Purple Kush with that of the hemp cultivar 'Finola' revealed that many genes encoding proteins involved in cannabinoid and precursor pathways are more highly expressed in Purple Kush than in 'Finola'. The exclusive occurrence of Δ9-tetrahydrocannabinolic acid synthase in the Purple Kush transcriptome, and its replacement by cannabidiolic acid synthase in 'Finola', may explain why the psychoactive cannabinoid Δ9-tetrahydrocannabinol (THC) is produced in marijuana but not in hemp. Resequencing the hemp cultivars 'Finola' and 'USO-31' showed little difference in gene copy numbers of cannabinoid pathway enzymes. However, single nucleotide variant analysis uncovered a relatively high level of variation among four cannabis types, and supported a separation of marijuana and hemp. The availability of the Cannabis sativa genome enables the study of a multifunctional plant that occupies a unique role in human culture. Its availability will aid the development of therapeutic marijuana strains with tailored cannabinoid profiles and provide a basis for the breeding of hemp with improved agronomic characteristics.
GENE SILENCING. Epigenetic silencing by the HUSH complex mediates position-effect variegation in human cells.

PubMed

Tchasovnikarova, Iva A; Timms, Richard T; Matheson, Nicholas J; Wals, Kim; Antrobus, Robin; Göttgens, Berthold; Dougan, Gordon; Dawson, Mark A; Lehner, Paul J

2015-06-26

Forward genetic screens in Drosophila melanogaster for modifiers of position-effect variegation have revealed the basis of much of our understanding of heterochromatin. We took an analogous approach to identify genes required for epigenetic repression in human cells. A nonlethal forward genetic screen in near-haploid KBM7 cells identified the HUSH (human silencing hub) complex, comprising three poorly characterized proteins, TASOR, MPP8, and periphilin; this complex is absent from Drosophila but is conserved from fish to humans. Loss of HUSH components resulted in decreased H3K9me3 both at endogenous genomic loci and at retroviruses integrated into heterochromatin. Our results suggest that the HUSH complex is recruited to genomic loci rich in H3K9me3, where subsequent recruitment of the methyltransferase SETDB1 is required for further H3K9me3 deposition to maintain transcriptional silencing. Copyright © 2015, American Association for the Advancement of Science.
Alu repeat discovery and characterization within human genomes

PubMed Central

Hormozdiari, Fereydoun; Alkan, Can; Ventura, Mario; Hajirasouliha, Iman; Malig, Maika; Hach, Faraz; Yorukoglu, Deniz; Dao, Phuong; Bakhshi, Marzieh; Sahinalp, S. Cenk; Eichler, Evan E.

2011-01-01

Human genomes are now being rapidly sequenced, but not all forms of genetic variation are routinely characterized. In this study, we focus on Alu retrotransposition events and seek to characterize differences in the pattern of mobile insertion between individuals based on the analysis of eight human genomes sequenced using next-generation sequencing. Applying a rapid read-pair analysis algorithm, we discover 4342 Alu insertions not found in the human reference genome and show that 98% of a selected subset (63/64) experimentally validate. Of these new insertions, 89% correspond to AluY elements, suggesting that they arose by retrotransposition. Eighty percent of the Alu insertions have not been previously reported and more novel events were detected in Africans when compared with non-African samples (76% vs. 69%). Using these data, we develop an experimental and computational screen to identify ancestry informative Alu retrotransposition events among different human populations. PMID:21131385

75 FR 44800 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2010-07-29

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... for Human Genome Research. The meeting will be closed to the public in accordance with the provisions... Committee: National Advisory Council for Human Genome Research. Date: August 18, 2010. Time: 1 p.m. to 3 p.m...
A 1463 Gene Cattle–Human Comparative Map With Anchor Points Defined by Human Genome Sequence Coordinates

PubMed Central

Everts-van der Wind, Annelie; Kata, Srinivas R.; Band, Mark R.; Rebeiz, Mark; Larkin, Denis M.; Everts, Robin E.; Green, Cheryl A.; Liu, Lei; Natarajan, Shreedhar; Goldammer, Tom; Lee, Jun Heon; McKay, Stephanie; Womack, James E.; Lewin, Harris A.

2004-01-01

A second-generation 5000 rad radiation hybrid (RH) map of the cattle genome was constructed primarily using cattle ESTs that were targeted to gaps in the existing cattle–human comparative map, as well as to sparsely populated map intervals. A total of 870 targeted markers were added, bringing the number of markers mapped on the RH5000 panel to 1913. Of these, 1463 have significant BLASTN hits (E < e–5) against the human genome sequence. A cattle–human comparative map was created using human genome sequence coordinates of the paired orthologs. One-hundred and ninety-five conserved segments (defined by two or more genes) were identified between the cattle and human genomes, of which 31 are newly discovered and 34 were extended singletons on the first-generation map. The new map represents an improvement of 20% genome-wide comparative coverage compared with the first-generation map. Analysis of gene content within human genome regions where there are gaps in the comparative map revealed gaps with both significantly greater and significantly lower gene content. The new, more detailed cattle–human comparative map provides an improved resource for the analysis of mammalian chromosome evolution, the identification of candidate genes for economically important traits, and for proper alignment of sequence contigs on cattle chromosomes. PMID:15231756
Implications of the Human Genome Project

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kitcher, P.

The Human Genome Project (HGP), launched in 1991, aims to map and sequence the human genome by 2006. During the fifteen-year life of the project, it is projected that $3 billion in federal funds will be allocated to it. The ultimate aims of spending this money are to analyze the structure of human DNA, to identify all human genes, to recognize the functions of those genes, and to prepare for the biology and medicine of the twenty-first century. The following summary examines some of the implications of the program, concentrating on its scientific import and on the ethical and socialmore » problems that it raises. Its aim is to expose principles that might be used in applying the information which the HGP will generate. There is no attempt here to translate the principles into detailed proposals for legislation. Arguments and discussion can be found in the full report, but, like this summary, that report does not contain any legislative proposals.« less
The genome sequence of the colonial chordate, Botryllus schlosseri

PubMed Central

Voskoboynik, Ayelet; Neff, Norma F; Sahoo, Debashis; Newman, Aaron M; Pushkarev, Dmitry; Koh, Winston; Passarelli, Benedetto; Fan, H Christina; Mantalas, Gary L; Palmeri, Karla J; Ishizuka, Katherine J; Gissi, Carmela; Griggio, Francesca; Ben-Shlomo, Rachel; Corey, Daniel M; Penland, Lolita; White, Richard A; Weissman, Irving L; Quake, Stephen R

2013-01-01

Botryllus schlosseri is a colonial urochordate that follows the chordate plan of development following sexual reproduction, but invokes a stem cell-mediated budding program during subsequent rounds of asexual reproduction. As urochordates are considered to be the closest living invertebrate relatives of vertebrates, they are ideal subjects for whole genome sequence analyses. Using a novel method for high-throughput sequencing of eukaryotic genomes, we sequenced and assembled 580 Mbp of the B. schlosseri genome. The genome assembly is comprised of nearly 14,000 intron-containing predicted genes, and 13,500 intron-less predicted genes, 40% of which could be confidently parceled into 13 (of 16 haploid) chromosomes. A comparison of homologous genes between B. schlosseri and other diverse taxonomic groups revealed genomic events underlying the evolution of vertebrates and lymphoid-mediated immunity. The B. schlosseri genome is a community resource for studying alternative modes of reproduction, natural transplantation reactions, and stem cell-mediated regeneration. DOI: http://dx.doi.org/10.7554/eLife.00569.001 PMID:23840927
The human genome and the human control of natural evolution.

PubMed

Sakamoto, H

2001-10-01

Recent advances in research on the Human Genome are provoking many critical problems in the global policy regarding the future status of human beings as well as in that of the whole life system on the earth, and consequently, these advances provoke the serious bioethical and philosophical questions. Firstly, how can we comprehend that we are going to have the complete technology to manipulate the system of the human genome and other non-human genomes? Though no science and technology can be complete, we will, I believe, take possession of an almost complete gene technology in the early stage of the next Century. Gene technology will soon fall into the hands of human beings instead of rendering in the province of God. Secondly, which gene technologies will we actually realize and utilize in the early stages of the 21st Century? Most probably, we will adopt these technologies to health care to treat some apparent bodily diseases, for instance, cancer, hemophilia, ADA deficiency, and so forth, and sooner or later we will adopt gene therapy to germ lines, which, in the long run, suggests the possibility of a future "artificial evolution" instead of the "natural evolution" of the past. Thirdly, how is the new concept of "artificial evolution" justified ethically? I believe this kind of manmade evolution is the only way for human beings to survive into the future global environment. There cannot be any serious ethical objection against the idea of artificial evolution. Fourthly, what is the background philosophy for the concept of "artificial evolution"? I will discuss the nature of modern European humanism with individual dignity and fundamental human rights which has led the philosophy of modern culture and modern society, and I will conclude by suggesting that we should abolish an essential part of modern humanism and newly devise some alternative philosophy to fit the new Millennium.
Genome size of 14 species of fireflies (Insecta, Coleoptera, Lampyridae)

PubMed Central

Liu, Gui-Chun; Dong, Zhi-Wei; He, Jin-Wu; Zhao, Ruo-Ping; Wang, Wen; Li, Xue-Yan

2017-01-01

Eukaryotic genome size data are important both as the basis for comparative research into genome evolution and as estimators of the cost and difficulty of genome sequencing programs for non-model organisms. In this study, the genome size of 14 species of fireflies (Lampyridae) (two genera in Lampyrinae, three genera in Luciolinae, and one genus in subfamily incertae sedis) were estimated by propidium iodide (PI)-based flow cytometry. The haploid genome sizes of Lampyridae ranged from 0. 42 to 1. 31 pg, a 3. 1-fold span. Genome sizes of the fireflies varied within the tested subfamilies and genera. Lamprigera and Pyrocoelia species had large and small genome sizes, respectively. No correlation was found between genome size and morphological traits such as body length, body width, eye width, and antennal length. Our data provide additional information on genome size estimation of the firefly family Lampyridae. Furthermore, this study will help clarify the cost and difficulty of genome sequencing programs for non-model organisms and will help promote studies on firefly genome evolution. PMID:29280364
NAHR-mediated copy-number variants in a clinical population: mechanistic insights into both genomic disorders and Mendelizing traits.

PubMed

Dittwald, Piotr; Gambin, Tomasz; Szafranski, Przemyslaw; Li, Jian; Amato, Stephen; Divon, Michael Y; Rodríguez Rojas, Lisa Ximena; Elton, Lindsay E; Scott, Daryl A; Schaaf, Christian P; Torres-Martinez, Wilfredo; Stevens, Abby K; Rosenfeld, Jill A; Agadi, Satish; Francis, David; Kang, Sung-Hae L; Breman, Amy; Lalani, Seema R; Bacino, Carlos A; Bi, Weimin; Milosavljevic, Aleksandar; Beaudet, Arthur L; Patel, Ankita; Shaw, Chad A; Lupski, James R; Gambin, Anna; Cheung, Sau Wai; Stankiewicz, Pawel

2013-09-01

We delineated and analyzed directly oriented paralogous low-copy repeats (DP-LCRs) in the most recent version of the human haploid reference genome. The computationally defined DP-LCRs were cross-referenced with our chromosomal microarray analysis (CMA) database of 25,144 patients subjected to genome-wide assays. This computationally guided approach to the empirically derived large data set allowed us to investigate genomic rearrangement relative frequencies and identify new loci for recurrent nonallelic homologous recombination (NAHR)-mediated copy-number variants (CNVs). The most commonly observed recurrent CNVs were NPHP1 duplications (233), CHRNA7 duplications (175), and 22q11.21 deletions (DiGeorge/velocardiofacial syndrome, 166). In the ∼25% of CMA cases for which parental studies were available, we identified 190 de novo recurrent CNVs. In this group, the most frequently observed events were deletions of 22q11.21 (48), 16p11.2 (autism, 34), and 7q11.23 (Williams-Beuren syndrome, 11). Several features of DP-LCRs, including length, distance between NAHR substrate elements, DNA sequence identity (fraction matching), GC content, and concentration of the homologous recombination (HR) hot spot motif 5'-CCNCCNTNNCCNC-3', correlate with the frequencies of the recurrent CNVs events. Four novel adjacent DP-LCR-flanked and NAHR-prone regions, involving 2q12.2q13, were elucidated in association with novel genomic disorders. Our study quantitates genome architectural features responsible for NAHR-mediated genomic instability and further elucidates the role of NAHR in human disease.
Gene Expansion Shapes Genome Architecture in the Human Pathogen Lichtheimia corymbifera: An Evolutionary Genomics Analysis in the Ancient Terrestrial Mucorales (Mucoromycotina)

PubMed Central

Wehner, Stefanie; Linde, Jörg; Valiante, Vito; Sammeth, Michael; Riege, Konstantin; Nowrousian, Minou; Kaerger, Kerstin; Jacobsen, Ilse D.; Marz, Manja; Brakhage, Axel A.; Gabaldón, Toni; Böcker, Sebastian; Voigt, Kerstin

2014-01-01

Lichtheimia species are the second most important cause of mucormycosis in Europe. To provide broader insights into the molecular basis of the pathogenicity-associated traits of the basal Mucorales, we report the full genome sequence of L. corymbifera and compared it to the genome of Rhizopus oryzae, the most common cause of mucormycosis worldwide. The genome assembly encompasses 33.6 MB and 12,379 protein-coding genes. This study reveals four major differences of the L. corymbifera genome to R. oryzae: (i) the presence of an highly elevated number of gene duplications which are unlike R. oryzae not due to whole genome duplication (WGD), (ii) despite the relatively high incidence of introns, alternative splicing (AS) is not frequently observed for the generation of paralogs and in response to stress, (iii) the content of repetitive elements is strikingly low (<5%), (iv) L. corymbifera is typically haploid. Novel virulence factors were identified which may be involved in the regulation of the adaptation to iron-limitation, e.g. LCor01340.1 encoding a putative siderophore transporter and LCor00410.1 involved in the siderophore metabolism. Genes encoding the transcription factors LCor08192.1 and LCor01236.1, which are similar to GATA type regulators and to calcineurin regulated CRZ1, respectively, indicating an involvement of the calcineurin pathway in the adaption to iron limitation. Genes encoding MADS-box transcription factors are elevated up to 11 copies compared to the 1–4 copies usually found in other fungi. More findings are: (i) lower content of tRNAs, but unique codons in L. corymbifera, (ii) Over 25% of the proteins are apparently specific for L. corymbifera. (iii) L. corymbifera contains only 2/3 of the proteases (known to be essential virulence factors) in comparision to R. oryzae. On the other hand, the number of secreted proteases, however, is roughly twice as high as in R. oryzae. PMID:25121733
Population Genetic Inference from Personal Genome Data: Impact of Ancestry and Admixture on Human Genomic Variation

PubMed Central

Kidd, Jeffrey M.; Gravel, Simon; Byrnes, Jake; Moreno-Estrada, Andres; Musharoff, Shaila; Bryc, Katarzyna; Degenhardt, Jeremiah D.; Brisbin, Abra; Sheth, Vrunda; Chen, Rong; McLaughlin, Stephen F.; Peckham, Heather E.; Omberg, Larsson; Bormann Chung, Christina A.; Stanley, Sarah; Pearlstein, Kevin; Levandowsky, Elizabeth; Acevedo-Acevedo, Suehelay; Auton, Adam; Keinan, Alon; Acuña-Alonzo, Victor; Barquera-Lozano, Rodrigo; Canizales-Quinteros, Samuel; Eng, Celeste; Burchard, Esteban G.; Russell, Archie; Reynolds, Andy; Clark, Andrew G.; Reese, Martin G.; Lincoln, Stephen E.; Butte, Atul J.; De La Vega, Francisco M.; Bustamante, Carlos D.

2012-01-01

Full sequencing of individual human genomes has greatly expanded our understanding of human genetic variation and population history. Here, we present a systematic analysis of 50 human genomes from 11 diverse global populations sequenced at high coverage. Our sample includes 12 individuals who have admixed ancestry and who have varying degrees of recent (within the last 500 years) African, Native American, and European ancestry. We found over 21 million single-nucleotide variants that contribute to a 1.75-fold range in nucleotide heterozygosity across diverse human genomes. This heterozygosity ranged from a high of one heterozygous site per kilobase in west African genomes to a low of 0.57 heterozygous sites per kilobase in segments inferred to have diploid Native American ancestry from the genomes of Mexican and Puerto Rican individuals. We show evidence of all three continental ancestries in the genomes of Mexican, Puerto Rican, and African American populations, and the genome-wide statistics are highly consistent across individuals from a population once ancestry proportions have been accounted for. Using a generalized linear model, we identified subtle variations across populations in the proportion of neutral versus deleterious variation and found that genome-wide statistics vary in admixed populations even once ancestry proportions have been factored in. We further infer that multiple periods of gene flow shaped the diversity of admixed populations in the Americas—70% of the European ancestry in today’s African Americans dates back to European gene flow happening only 7–8 generations ago. PMID:23040495
Recombination and genetic variance among maize doubled haploids induced from F1 and F2 plants.

PubMed

Sleper, Joshua A; Bernardo, Rex

2016-12-01

Inducing maize doubled haploids from F 2 plants (DHF2) instead of F 1 plants (DHF1) led to more recombination events. However, the best DHF2 lines did not outperform the best DHF1 lines. Maize (Zea mays L.) breeders rely on doubled haploid (DH) technology for fast and efficient production of inbreds. Breeders can induce DH lines most quickly from F 1 plants (DHF1), or induce DH lines from F 2 plants (DHF2) to allow selection prior to DH induction and have more recombinations. Our objective was to determine if the additional recombinations in maize DHF2 lines lead to a larger genetic variance and a superior mean of the best lines. A total of 311 DHF1 and 241 DHF2 lines, derived from the same biparental cross, were crossed to two testers and evaluated in multilocation trials in Europe and the US. The mean number of recombinations per genome was 14.48 among the DHF1 lines and 21.38 among the DHF1 lines. The means of the DHF1 and DHF2 lines did not differ for yield, moisture, and plant height. The genetic variance was higher among DHF2 lines than among DHF1 lines for moisture, but not for yield and plant height. The ratio of repulsion to coupling linkages, which was estimated from genomewide marker effects, was higher among DHF1 lines than among DHF2 lines for moisture, but not for yield and plant height. The higher genetic variance for moisture among DHF2 lines did not lead to lower moisture of the best 10 % of the lines. Our results indicated that the decision of inducing DH lines from F 1 or F 2 plants needs to be made from considerations other than the performance of the resulting DHF1 or DHF2 lines.
The first genetic map of a synthesized allohexaploid Brassica with A, B and C genomes based on simple sequence repeat markers.

PubMed

Yang, S; Chen, S; Geng, X X; Yan, G; Li, Z Y; Meng, J L; Cowling, W A; Zhou, W J

2016-04-01

We present the first genetic map of an allohexaploid Brassica species, based on segregating microsatellite markers in a doubled haploid mapping population generated from a hybrid between two hexaploid parents. This study reports the first genetic map of trigenomic Brassica. A doubled haploid mapping population consisting of 189 lines was obtained via microspore culture from a hybrid H16-1 derived from a cross between two allohexaploid Brassica lines (7H170-1 and Y54-2). Simple sequence repeat primer pairs specific to the A genome (107), B genome (44) and C genome (109) were used to construct a genetic linkage map of the population. Twenty-seven linkage groups were resolved from 274 polymorphic loci on the A genome (109), B genome (49) and C genome (116) covering a total genetic distance of 3178.8 cM with an average distance between markers of 11.60 cM. This is the first genetic framework map for the artificially synthesized Brassica allohexaploids. The linkage groups represent the expected complement of chromosomes in the A, B and C genomes from the original diploid and tetraploid parents. This framework linkage map will be valuable for QTL analysis and future genetic improvement of a new allohexaploid Brassica species, and in improving our understanding of the genetic control of meiosis in new polyploids.
Extensive sequencing of seven human genomes to characterize benchmark reference materials

PubMed Central

Zook, Justin M.; Catoe, David; McDaniel, Jennifer; Vang, Lindsay; Spies, Noah; Sidow, Arend; Weng, Ziming; Liu, Yuling; Mason, Christopher E.; Alexander, Noah; Henaff, Elizabeth; McIntyre, Alexa B.R.; Chandramohan, Dhruva; Chen, Feng; Jaeger, Erich; Moshrefi, Ali; Pham, Khoa; Stedman, William; Liang, Tiffany; Saghbini, Michael; Dzakula, Zeljko; Hastie, Alex; Cao, Han; Deikus, Gintaras; Schadt, Eric; Sebra, Robert; Bashir, Ali; Truty, Rebecca M.; Chang, Christopher C.; Gulbahce, Natali; Zhao, Keyan; Ghosh, Srinka; Hyland, Fiona; Fu, Yutao; Chaisson, Mark; Xiao, Chunlin; Trow, Jonathan; Sherry, Stephen T.; Zaranek, Alexander W.; Ball, Madeleine; Bobe, Jason; Estep, Preston; Church, George M.; Marks, Patrick; Kyriazopoulou-Panagiotopoulou, Sofia; Zheng, Grace X.Y.; Schnall-Levin, Michael; Ordonez, Heather S.; Mudivarti, Patrice A.; Giorda, Kristina; Sheng, Ying; Rypdal, Karoline Bjarnesdatter; Salit, Marc

2016-01-01

The Genome in a Bottle Consortium, hosted by the National Institute of Standards and Technology (NIST) is creating reference materials and data for human genome sequencing, as well as methods for genome comparison and benchmarking. Here, we describe a large, diverse set of sequencing data for seven human genomes; five are current or candidate NIST Reference Materials. The pilot genome, NA12878, has been released as NIST RM 8398. We also describe data from two Personal Genome Project trios, one of Ashkenazim Jewish ancestry and one of Chinese ancestry. The data come from 12 technologies: BioNano Genomics, Complete Genomics paired-end and LFR, Ion Proton exome, Oxford Nanopore, Pacific Biosciences, SOLiD, 10X Genomics GemCode WGS, and Illumina exome and WGS paired-end, mate-pair, and synthetic long reads. Cell lines, DNA, and data from these individuals are publicly available. Therefore, we expect these data to be useful for revealing novel information about the human genome and improving sequencing technologies, SNP, indel, and structural variant calling, and de novo assembly. PMID:27271295
Genome editing: a robust technology for human stem cells.

PubMed

Chandrasekaran, Arun Pandian; Song, Minjung; Ramakrishna, Suresh

2017-09-01

Human pluripotent stem cells comprise induced pluripotent and embryonic stem cells, which have tremendous potential for biological and therapeutic applications. The development of efficient technologies for the targeted genome alteration of stem cells in disease models is a prerequisite for utilizing stem cells to their full potential. Genome editing of stem cells is possible with the help of synthetic nucleases that facilitate site-specific modification of a gene of interest. Recent advances in genome editing techniques have improved the efficiency and speed of the development of stem cells for human disease models. Zinc finger nucleases, transcription activator-like effector nucleases, and clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated system are powerful tools for editing DNA at specific loci. Here, we discuss recent technological advances in genome editing with site-specific nucleases in human stem cells.
Draft genome sequence of the rubber tree Hevea brasiliensis

PubMed Central

2013-01-01

Background Hevea brasiliensis, a member of the Euphorbiaceae family, is the major commercial source of natural rubber (NR). NR is a latex polymer with high elasticity, flexibility, and resilience that has played a critical role in the world economy since 1876. Results Here, we report the draft genome sequence of H. brasiliensis. The assembly spans ~1.1 Gb of the estimated 2.15 Gb haploid genome. Overall, ~78% of the genome was identified as repetitive DNA. Gene prediction shows 68,955 gene models, of which 12.7% are unique to Hevea. Most of the key genes associated with rubber biosynthesis, rubberwood formation, disease resistance, and allergenicity have been identified. Conclusions The knowledge gained from this genome sequence will aid in the future development of high-yielding clones to keep up with the ever increasing need for natural rubber. PMID:23375136
Comparison of phasing strategies for whole human genomes

PubMed Central

Kirkness, Ewen; Schork, Nicholas J.

2018-01-01

Humans are a diploid species that inherit one set of chromosomes paternally and one homologous set of chromosomes maternally. Unfortunately, most human sequencing initiatives ignore this fact in that they do not directly delineate the nucleotide content of the maternal and paternal copies of the 23 chromosomes individuals possess (i.e., they do not ‘phase’ the genome) often because of the costs and complexities of doing so. We compared 11 different widely-used approaches to phasing human genomes using the publicly available ‘Genome-In-A-Bottle’ (GIAB) phased version of the NA12878 genome as a gold standard. The phasing strategies we compared included laboratory-based assays that prepare DNA in unique ways to facilitate phasing as well as purely computational approaches that seek to reconstruct phase information from general sequencing reads and constructs or population-level haplotype frequency information obtained through a reference panel of haplotypes. To assess the performance of the 11 approaches, we used metrics that included, among others, switch error rates, haplotype block lengths, the proportion of fully phase-resolved genes, phasing accuracy and yield between pairs of SNVs. Our comparisons suggest that a hybrid or combined approach that leverages: 1. population-based phasing using the SHAPEIT software suite, 2. either genome-wide sequencing read data or parental genotypes, and 3. a large reference panel of variant and haplotype frequencies, provides a fast and efficient way to produce highly accurate phase-resolved individual human genomes. We found that for population-based approaches, phasing performance is enhanced with the addition of genome-wide read data; e.g., whole genome shotgun and/or RNA sequencing reads. Further, we found that the inclusion of parental genotype data within a population-based phasing strategy can provide as much as a ten-fold reduction in phasing errors. We also considered a majority voting scheme for the construction
Genome Editing in Human Pluripotent Stem Cells.

PubMed

Carlson-Stevermer, Jared; Saha, Krishanu

2017-01-01

Genome editing in human pluripotent stem cells (hPSCs) enables the generation of reporter lines and knockout cell lines. Zinc finger nucleases, transcription activator-like effector nucleases (TALENs), and CRISPR/Cas9 technology have recently increased the efficiency of proper gene editing by creating double strand breaks (DSB) at defined sequences in the human genome. These systems typically use plasmids to transiently transcribe nucleases within the cell. Here, we describe the process for preparing hPSCs for transient expression of nucleases via electroporation and subsequent analysis to create genetically modified stem cell lines.
76 FR 65204 - National Human Genome Research Institute; Notice of Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2011-10-20

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome..., National Human Genome Research Institute. The meeting will be open to the public as indicated below, with..., discussion, and evaluation of individual intramural programs and projects conducted by the National Human...
75 FR 60467 - National Human Genome Research Institute; Notice of Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2010-09-30

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome..., National Human Genome Research Institute. The meeting will be open to the public as indicated below, with..., discussion, and evaluation of individual intramural programs and projects conducted by the National Human...
75 FR 2147 - National Human Genome Research Institute; Notice of Meetings

Federal Register 2010, 2011, 2012, 2013, 2014

2010-01-14

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Council for Human Genome Research. The meetings will be open to the public as indicated below, with... clearly unwarranted invasion of personal privacy. Name of Committee: National Advisory Council for Human...
Microsatellite Interruptions Stabilize Primate Genomes and Exist as Population-Specific Single Nucleotide Polymorphisms within Individual Human Genomes

PubMed Central

Ananda, Guruprasad; Hile, Suzanne E.; Breski, Amanda; Wang, Yanli; Kelkar, Yogeshwar; Makova, Kateryna D.; Eckert, Kristin A.

2014-01-01

Interruptions of microsatellite sequences impact genome evolution and can alter disease manifestation. However, human polymorphism levels at interrupted microsatellites (iMSs) are not known at a genome-wide scale, and the pathways for gaining interruptions are poorly understood. Using the 1000 Genomes Phase-1 variant call set, we interrogated mono-, di-, tri-, and tetranucleotide repeats up to 10 units in length. We detected ∼26,000–40,000 iMSs within each of four human population groups (African, European, East Asian, and American). We identified population-specific iMSs within exonic regions, and discovered that known disease-associated iMSs contain alleles present at differing frequencies among the populations. By analyzing longer microsatellites in primate genomes, we demonstrate that single interruptions result in a genome-wide average two- to six-fold reduction in microsatellite mutability, as compared with perfect microsatellites. Centrally located interruptions lowered mutability dramatically, by two to three orders of magnitude. Using a biochemical approach, we tested directly whether the mutability of a specific iMS is lower because of decreased DNA polymerase strand slippage errors. Modeling the adenomatous polyposis coli tumor suppressor gene sequence, we observed that a single base substitution interruption reduced strand slippage error rates five- to 50-fold, relative to a perfect repeat, during synthesis by DNA polymerases α, β, or η. Computationally, we demonstrate that iMSs arise primarily by base substitution mutations within individual human genomes. Our biochemical survey of human DNA polymerase α, β, δ, κ, and η error rates within certain microsatellites suggests that interruptions are created most frequently by low fidelity polymerases. Our combined computational and biochemical results demonstrate that iMSs are abundant in human genomes and are sources of population-specific genetic variation that may affect genome stability. The

Sequence space coverage, entropy of genomes and the potential to detect non-human DNA in human samples.

PubMed

Liu, Zhandong; Venkatesh, Santosh S; Maley, Carlo C

2008-10-30

Genomes store information for building and maintaining organisms. Complete sequencing of many genomes provides the opportunity to study and compare global information properties of those genomes. We have analyzed aspects of the information content of Homo sapiens, Mus musculus, Drosophila melanogaster, Caenorhabditis elegans, Arabidopsis thaliana, Saccharomyces cerevisiae, and Escherichia coli (K-12) genomes. Virtually all possible (> 98%) 12 bp oligomers appear in vertebrate genomes while < 2% of 19 bp oligomers are present. Other species showed different ranges of > 98% to < 2% of possible oligomers in D. melanogaster (12-17 bp), C. elegans (11-17 bp), A. thaliana (11-17 bp), S. cerevisiae (10-16 bp) and E. coli (9-15 bp). Frequencies of unique oligomers in the genomes follow similar patterns. We identified a set of 2.6 M 15-mers that are more than 1 nucleotide different from all 15-mers in the human genome and so could be used as probes to detect microbes in human samples. In a human sample, these probes would detect 100% of the 433 currently fully sequenced prokaryotes and 75% of the 3065 fully sequenced viruses. The human genome is significantly more compact in sequence space than a random genome. We identified the most frequent 5- to 20-mers in the human genome, which may prove useful as PCR primers. We also identified a bacterium, Anaeromyxobacter dehalogenans, which has an exceptionally low diversity of oligomers given the size of its genome and its GC content. The entropy of coding regions in the human genome is significantly higher than non-coding regions and chromosomes. However chromosomes 1, 2, 9, 12 and 14 have a relatively high proportion of coding DNA without high entropy, and chromosome 20 is the opposite with a low frequency of coding regions but relatively high entropy. Measures of the frequency of oligomers are useful for designing PCR assays and for identifying chromosomes and organisms with hidden structure that had not been previously
Point mutation impairs centromeric CENH3 loading and induces haploid plants.

PubMed

Karimi-Ashtiyani, Raheleh; Ishii, Takayoshi; Niessen, Markus; Stein, Nils; Heckmann, Stefan; Gurushidze, Maia; Banaei-Moghaddam, Ali Mohammad; Fuchs, Jörg; Schubert, Veit; Koch, Kerstin; Weiss, Oda; Demidov, Dmitri; Schmidt, Klaus; Kumlehn, Jochen; Houben, Andreas

2015-09-08

The chromosomal position of the centromere-specific histone H3 variant CENH3 (also called "CENP-A") is the assembly site for the kinetochore complex of active centromeres. Any error in transcription, translation, modification, or incorporation can affect the ability to assemble intact CENH3 chromatin and can cause centromere inactivation [Allshire RC, Karpen GH (2008) Nat Rev Genet 9 (12):923-937]. Here we show that a single-point amino acid exchange in the centromere-targeting domain of CENH3 leads to reduced centromere loading of CENH3 in barley, sugar beet, and Arabidopsis thaliana. Haploids were obtained after cenh3 L130F-complemented cenh3-null mutant plants were crossed with wild-type A. thaliana. In contrast, in a noncompeting situation (i.e., centromeres possessing only mutated or only wild-type CENH3), no uniparental chromosome elimination occurs during early embryogenesis. The high degree of evolutionary conservation of the identified mutation site offers promising opportunities for application in a wide range of crop species in which haploid technology is of interest.
Point mutation impairs centromeric CENH3 loading and induces haploid plants

PubMed Central

Karimi-Ashtiyani, Raheleh; Ishii, Takayoshi; Niessen, Markus; Stein, Nils; Heckmann, Stefan; Gurushidze, Maia; Banaei-Moghaddam, Ali Mohammad; Fuchs, Jörg; Schubert, Veit; Koch, Kerstin; Weiss, Oda; Demidov, Dmitri; Schmidt, Klaus; Kumlehn, Jochen; Houben, Andreas

2015-01-01

The chromosomal position of the centromere-specific histone H3 variant CENH3 (also called “CENP-A”) is the assembly site for the kinetochore complex of active centromeres. Any error in transcription, translation, modification, or incorporation can affect the ability to assemble intact CENH3 chromatin and can cause centromere inactivation [Allshire RC, Karpen GH (2008) Nat Rev Genet 9 (12):923–937]. Here we show that a single-point amino acid exchange in the centromere-targeting domain of CENH3 leads to reduced centromere loading of CENH3 in barley, sugar beet, and Arabidopsis thaliana. Haploids were obtained after cenh3 L130F-complemented cenh3-null mutant plants were crossed with wild-type A. thaliana. In contrast, in a noncompeting situation (i.e., centromeres possessing only mutated or only wild-type CENH3), no uniparental chromosome elimination occurs during early embryogenesis. The high degree of evolutionary conservation of the identified mutation site offers promising opportunities for application in a wide range of crop species in which haploid technology is of interest. PMID:26294252
Mobile Interspersed Repeats Are Major Structural Variants in the Human Genome

PubMed Central

Huang, Cheng Ran Lisa; Schneider, Anna M.; Lu, Yunqi; Niranjan, Tejasvi; Shen, Peilin; Robinson, Matoya A.; Steranka, Jared P.; Valle, David; Civin, Curt I.; Wang, Tao; Wheelan, Sarah J.; Ji, Hongkai; Boeke, Jef D.; Burns, Kathleen H.

2010-01-01

Summary Characterizing structural variants in the human genome is of great importance, but a genome wide analysis to detect interspersed repeats has not been done. Thus, the degree to which mobile DNAs contribute to genetic diversity, heritable disease, and oncogenesis remains speculative. We perform transposon insertion profiling by microarray (TIP-chip) to map human L1(Ta) retrotransposons (LINE-1 s) genome-wide. This identified numerous novel human L1(Ta) insertional polymorphisms with highly variant allelic frequencies. We also explored TIP-chip's usefulness to identify candidate alleles associated with different phenotypes in clinical cohorts. Our data suggest that the occurrence of new insertions is twice as high as previously estimated, and that these repeats are under-recognized as sources of human genomic and phenotypic diversity. We have just begun to probe the universe of human L1(Ta) polymorphisms, and as TIP-chip is applied to other insertions such as Alu SINEs, it will expand the catalog of genomic variants even further. PMID:20602999
A method for determining haploid and triploid genotypes and their association with vascular phenotypes in Williams syndrome and 7q11.23 duplication syndrome.

PubMed

Gregory, Michael D; Kolachana, Bhaskar; Yao, Yin; Nash, Tiffany; Dickinson, Dwight; Eisenberg, Daniel P; Mervis, Carolyn B; Berman, Karen F

2018-04-04

Williams syndrome ([WS], 7q11.23 hemideletion) and 7q11.23 duplication syndrome (Dup7) show contrasting syndromic symptoms. However, within each group there is considerable interindividual variability in the degree to which these phenotypes are expressed. Though software exists to identify areas of copy number variation (CNV) from commonly-available SNP-chip data, this software does not provide non-diploid genotypes in CNV regions. Here, we describe a method for identifying haploid and triploid genotypes in CNV regions, and then, as a proof-of-concept for applying this information to explain clinical variability, we test for genotype-phenotype associations. Blood samples for 25 individuals with WS and 13 individuals with Dup7 were genotyped with Illumina-HumanOmni5M SNP-chips. PennCNV and in-house code were used to make genotype calls for each SNP in the 7q11.23 locus. We tested for association between the presence of aortic arteriopathy and genotypes of the remaining (haploid in WS) or duplicated (triploid in Dup7) alleles. Haploid calls in the 7q11.23 region were made for 99.0% of SNPs in the WS group, and triploid calls for 98.8% of SNPs in those with Dup7. The G allele of SNP rs2528795 in the ELN gene was associated with aortic stenosis in WS participants (p < 0.0049) while the A allele of the same SNP was associated with aortic dilation in Dup7. Commonly available SNP-chip information can be used to make haploid and triploid calls in individuals with CNVs and then to relate variability in specific genes to variability in syndromic phenotypes, as demonstrated here using aortic arteriopathy. This work sets the stage for similar genotype-phenotype analyses in CNVs where phenotypes may be more complex and/or where there is less information about genetic mechanisms.
The banana (Musa acuminata) genome and the evolution of monocotyledonous plants.

PubMed

D'Hont, Angélique; Denoeud, France; Aury, Jean-Marc; Baurens, Franc-Christophe; Carreel, Françoise; Garsmeur, Olivier; Noel, Benjamin; Bocs, Stéphanie; Droc, Gaëtan; Rouard, Mathieu; Da Silva, Corinne; Jabbari, Kamel; Cardi, Céline; Poulain, Julie; Souquet, Marlène; Labadie, Karine; Jourda, Cyril; Lengellé, Juliette; Rodier-Goud, Marguerite; Alberti, Adriana; Bernard, Maria; Correa, Margot; Ayyampalayam, Saravanaraj; Mckain, Michael R; Leebens-Mack, Jim; Burgess, Diane; Freeling, Mike; Mbéguié-A-Mbéguié, Didier; Chabannes, Matthieu; Wicker, Thomas; Panaud, Olivier; Barbosa, Jose; Hribova, Eva; Heslop-Harrison, Pat; Habas, Rémy; Rivallan, Ronan; Francois, Philippe; Poiron, Claire; Kilian, Andrzej; Burthia, Dheema; Jenny, Christophe; Bakry, Frédéric; Brown, Spencer; Guignon, Valentin; Kema, Gert; Dita, Miguel; Waalwijk, Cees; Joseph, Steeve; Dievart, Anne; Jaillon, Olivier; Leclercq, Julie; Argout, Xavier; Lyons, Eric; Almeida, Ana; Jeridi, Mouna; Dolezel, Jaroslav; Roux, Nicolas; Risterucci, Ange-Marie; Weissenbach, Jean; Ruiz, Manuel; Glaszmann, Jean-Christophe; Quétier, Francis; Yahiaoui, Nabila; Wincker, Patrick

2012-08-09

Bananas (Musa spp.), including dessert and cooking types, are giant perennial monocotyledonous herbs of the order Zingiberales, a sister group to the well-studied Poales, which include cereals. Bananas are vital for food security in many tropical and subtropical countries and the most popular fruit in industrialized countries. The Musa domestication process started some 7,000 years ago in Southeast Asia. It involved hybridizations between diverse species and subspecies, fostered by human migrations, and selection of diploid and triploid seedless, parthenocarpic hybrids thereafter widely dispersed by vegetative propagation. Half of the current production relies on somaclones derived from a single triploid genotype (Cavendish). Pests and diseases have gradually become adapted, representing an imminent danger for global banana production. Here we describe the draft sequence of the 523-megabase genome of a Musa acuminata doubled-haploid genotype, providing a crucial stepping-stone for genetic improvement of banana. We detected three rounds of whole-genome duplications in the Musa lineage, independently of those previously described in the Poales lineage and the one we detected in the Arecales lineage. This first monocotyledon high-continuity whole-genome sequence reported outside Poales represents an essential bridge for comparative genome analysis in plants. As such, it clarifies commelinid-monocotyledon phylogenetic relationships, reveals Poaceae-specific features and has led to the discovery of conserved non-coding sequences predating monocotyledon-eudicotyledon divergence.
Mating system and gene flow in the red seaweed Gracilaria gracilis: effect of haploid-diploid life history and intertidal rocky shore landscape on fine-scale genetic structure.

PubMed

Engel, C R; Destombe, C; Valero, M

2004-04-01

The impact of haploid-diploidy and the intertidal landscape on a fine-scale genetic structure was explored in a red seaweed Gracilaria gracilis. The pattern of genetic structure was compared in haploid and diploid stages at a microgeographic scale (< 5 km): a total of 280 haploid and 296 diploid individuals located in six discrete, scattered rock pools were genotyped using seven microsatellite loci. Contrary to the theoretical expectation of predominantly endogamous mating systems in haploid-diploid organisms, G. gracilis showed a clearly allogamous mating system. Although within-population allele frequencies were similar between haploids and diploids, genetic differentiation among haploids was more than twice that of diploids, suggesting that there may be a lag between migration and (local) breeding due to the long generation times in G. gracilis. Weak, but significant, population differentiation was detected in both haploids and diploids and varied with landscape features, and not with geographic distance. Using an assignment test, we establish that effective migration rates varied according to height on the shore. In this intertidal species, biased spore dispersal may occur during the transport of spores and gametes at low tide when small streams flow from high- to lower-shore pools. The longevity of both haploid and diploid free-living stages and the long generation times typical of G. gracilis populations may promote the observed pattern of high genetic diversity within populations relative to that among populations.
Child Development and Structural Variation in the Human Genome

ERIC Educational Resources Information Center

Zhang, Ying; Haraksingh, Rajini; Grubert, Fabian; Abyzov, Alexej; Gerstein, Mark; Weissman, Sherman; Urban, Alexander E.

2013-01-01

Structural variation of the human genome sequence is the insertion, deletion, or rearrangement of stretches of DNA sequence sized from around 1,000 to millions of base pairs. Over the past few years, structural variation has been shown to be far more common in human genomes than previously thought. Very little is currently known about the effects…
Human Genome Editing and Ethical Considerations.

PubMed

Krishan, Kewal; Kanchan, Tanuj; Singh, Bahadur

2016-04-01

Editing human germline genes may act as boon in some genetic and other disorders. Recent editing of the genome of the human embryo with the CRISPR/Cas9 editing tool generated a debate amongst top scientists of the world for the ethical considerations regarding its effect on the future generations. It needs to be seen as to what transformation human gene editing brings to humankind in the times to come.
A mobile threat to genome stability: The impact of non-LTR retrotransposons upon the human genome

PubMed Central

Konkel, Miriam K.; Batzer, Mark A.

2010-01-01

It is now commonly agreed that the human genome is not the stable entity originally presumed. Deletions, duplications, inversions, and insertions are common, and contribute significantly to genomic structural variations (SVs). Their collective impact generates much of the inter-individual genomic diversity observed among humans. Not only do these variations change the structure of the genome; they may also have functional implications, e.g. altered gene expression. Some SVs have been identified as the cause of genetic disorders, including cancer predisposition. Cancer cells are notorious for their genomic instability, and often show genomic rearrangements at the microscopic and submicroscopic level to which transposable elements (TEs) contribute. Here, we review the role of TEs in genome instability, with particular focus on non-LTR retrotransposons. Currently, three non-LTR retrotransposon families – long interspersed element 1 (L1), SVA (short interspersed element (SINE-R), variable number of tandem repeats (VNTR), and Alu), and Alu (a SINE) elements – mobilize in the human genome, and cause genomic instability through both insertion- and post-insertion-based mutagenesis. Due to the abundance and high sequence identity of TEs, they frequently mislead the homologous recombination repair pathway into non-allelic homologous recombination, causing deletions, duplications, and inversions. While less comprehensively studied, non-LTR retrotransposon insertions and TE-mediated rearrangements are probably more common in cancer cells than in healthy tissue. This may be at least partially attributed to the commonly seen global hypomethylation as well as general epigenetic dysfunction of cancer cells. Where possible, we provide examples that impact cancer predisposition and/or development. PMID:20307669
Novel Approaches to Breast Cancer Prevention and Inhibition of Metastases

DTIC Science & Technology

2013-10-01

allow a functional characterization of human candidate breast cancer genes. The transgenic RNAi library is covering the whole Drosophila genome ...W81XWH-12-1-0093 / Penninger 15. SUBJECT TERMS Genome wide functional genetics, haploid stem cells, Drosophila cancer modeling...With the advent of modern genomics hundreds of candidate genes have been associated with breast cancer both in GWAS studies as well as by cancer genome
Comparative genome map of human and cattle

DOE Office of Scientific and Technical Information (OSTI.GOV)

Solinas-Toldo, S.; Fries, R.; Lengauer, C.

Chromosomal homologies between individual human chromosomes and the bovine karyotype have been established by using a new approach termed Zoo-FISH. Labeled DNA libraries from flow-sorted human chromosomes were used as probes for fluorescence in situ hybridization on cattle chromosomes. All human DNA libraries, except the Y chromosome library, hybridized to one or more cattle chromosomes, identifying and delineating 50 segments of homology, most of them corresponding to the regions of homology as identified by the previous mapping of individual conserved loci. However, Zoo-FISH refines the comparative maps constructed by molecular gene mapping of individual loci by providing information on themore » boundaries of conserved regions in the absence of obvious cytogenetic homologies of human and bovine chromosomes. It allows study of karyotypic evolution and opens new avenues for genomic analysis by facilitating the extrapolation of results from the human genome initiative. 50 refs., 3 figs., 1 tab.« less
Insights into Modern Human Prehistory Using Ancient Genomes.

PubMed

Yang, Melinda A; Fu, Qiaomei

2018-03-01

The genetic relationship of past modern humans to today's populations and each other was largely unknown until recently, when advances in ancient DNA sequencing allowed for unprecedented analysis of the genomes of these early people. These ancient genomes reveal new insights into human prehistory not always observed studying present-day populations, including greater details on the genetic diversity, population structure, and gene flow that characterized past human populations, particularly in early Eurasia, as well as increased insight on the relationship between archaic and modern humans. Here, we review genetic studies on ∼45000- to 7500-year-old individuals associated with mainly preagricultural cultures found in Eurasia, the Americas, and Africa. Copyright © 2017 Elsevier Ltd. All rights reserved.
The Diversity Present in 5140 Human Mitochondrial Genomes

PubMed Central

Pereira, Luísa; Freitas, Fernando; Fernandes, Verónica; Pereira, Joana B.; Costa, Marta D.; Costa, Stephanie; Máximo, Valdemar; Macaulay, Vincent; Rocha, Ricardo; Samuels, David C.

2009-01-01

We analyzed the current status (as of the end of August 2008) of human mitochondrial genomes deposited in GenBank, amounting to 5140 complete or coding-region sequences, in order to present an overall picture of the diversity present in the mitochondrial DNA of the global human population. To perform this task, we developed mtDNA-GeneSyn, a computer tool that identifies and exhaustedly classifies the diversity present in large genetic data sets. The diversity observed in the 5140 human mitochondrial genomes was compared with all possible transitions and transversions from the standard human mitochondrial reference genome. This comparison showed that tRNA and rRNA secondary structures have a large effect in limiting the diversity of the human mitochondrial sequences, whereas for the protein-coding genes there is a bias toward less variation at the second codon positions. The analysis of the observed amino acid variations showed a tolerance of variations that convert between the amino acids V, I, A, M, and T. This defines a group of amino acids with similar chemical properties that can interconvert by a single transition. PMID:19426953
Transposition of a Ds element from a plasmid into the plant genome in Nicotiana plumbaginifolia protoplast-derived cells.

PubMed

Houba-Hérin, N; Domin, M; Pédron, J

1994-07-01

Nicotiana plumbaginifolia haploid protoplasts were co-transformed with two plasmids, one with a NPT-II/Ds element and one with a gene encoding an amino-terminal truncated Ac transposase. It is shown that Ds can efficiently transpose from extrachromosomal DNA to N. plumbaginifolia chromosomes when the Ac transposase gene is present in trans. Ds has been shown to have transposed into the plant genome in a limited number of copies (1.9 copies per genome), for 21/32 transgenic lines tested. The flanking sequences present in the original plasmid are missing in these 21 plants. In only two of 21 plants was part of the transposase construct integrated. By segregation analysis of transgenic progeny, Ds was shown to be present in the heterozygous state in 10 lines even though haploid protoplasts had been originally transformed. This observation could indicate that integration occurred after or during DNA replication that leads to protoplast diploidization.
The spotted gar genome illuminates vertebrate evolution and facilitates human-teleost comparisons.

PubMed

Braasch, Ingo; Gehrke, Andrew R; Smith, Jeramiah J; Kawasaki, Kazuhiko; Manousaki, Tereza; Pasquier, Jeremy; Amores, Angel; Desvignes, Thomas; Batzel, Peter; Catchen, Julian; Berlin, Aaron M; Campbell, Michael S; Barrell, Daniel; Martin, Kyle J; Mulley, John F; Ravi, Vydianathan; Lee, Alison P; Nakamura, Tetsuya; Chalopin, Domitille; Fan, Shaohua; Wcisel, Dustin; Cañestro, Cristian; Sydes, Jason; Beaudry, Felix E G; Sun, Yi; Hertel, Jana; Beam, Michael J; Fasold, Mario; Ishiyama, Mikio; Johnson, Jeremy; Kehr, Steffi; Lara, Marcia; Letaw, John H; Litman, Gary W; Litman, Ronda T; Mikami, Masato; Ota, Tatsuya; Saha, Nil Ratan; Williams, Louise; Stadler, Peter F; Wang, Han; Taylor, John S; Fontenot, Quenton; Ferrara, Allyse; Searle, Stephen M J; Aken, Bronwen; Yandell, Mark; Schneider, Igor; Yoder, Jeffrey A; Volff, Jean-Nicolas; Meyer, Axel; Amemiya, Chris T; Venkatesh, Byrappa; Holland, Peter W H; Guiguen, Yann; Bobe, Julien; Shubin, Neil H; Di Palma, Federica; Alföldi, Jessica; Lindblad-Toh, Kerstin; Postlethwait, John H

2016-04-01

To connect human biology to fish biomedical models, we sequenced the genome of spotted gar (Lepisosteus oculatus), whose lineage diverged from teleosts before teleost genome duplication (TGD). The slowly evolving gar genome has conserved in content and size many entire chromosomes from bony vertebrate ancestors. Gar bridges teleosts to tetrapods by illuminating the evolution of immunity, mineralization and development (mediated, for example, by Hox, ParaHox and microRNA genes). Numerous conserved noncoding elements (CNEs; often cis regulatory) undetectable in direct human-teleost comparisons become apparent using gar: functional studies uncovered conserved roles for such cryptic CNEs, facilitating annotation of sequences identified in human genome-wide association studies. Transcriptomic analyses showed that the sums of expression domains and expression levels for duplicated teleost genes often approximate the patterns and levels of expression for gar genes, consistent with subfunctionalization. The gar genome provides a resource for understanding evolution after genome duplication, the origin of vertebrate genomes and the function of human regulatory sequences.
Sequences Associated with Centromere Competency in the Human Genome

PubMed Central

Hayden, Karen E.; Strome, Erin D.; Merrett, Stephanie L.; Lee, Hye-Ran; Rudd, M. Katharine

2013-01-01

Centromeres, the sites of spindle attachment during mitosis and meiosis, are located in specific positions in the human genome, normally coincident with diverse subsets of alpha satellite DNA. While there is strong evidence supporting the association of some subfamilies of alpha satellite with centromere function, the basis for establishing whether a given alpha satellite sequence is or is not designated a functional centromere is unknown, and attempts to understand the role of particular sequence features in establishing centromere identity have been limited by the near identity and repetitive nature of satellite sequences. Utilizing a broadly applicable experimental approach to test sequence competency for centromere specification, we have carried out a genomic and epigenetic functional analysis of endogenous human centromere sequences available in the current human genome assembly. The data support a model in which functionally competent sequences confer an opportunity for centromere specification, integrating genomic and epigenetic signals and promoting the concept of context-dependent centromere inheritance. PMID:23230266
Segmental Duplications and Copy-Number Variation in the Human Genome

PubMed Central

Sharp, Andrew J. ; Locke, Devin P. ; McGrath, Sean D. ; Cheng, Ze ; Bailey, Jeffrey A. ; Vallente, Rhea U. ; Pertz, Lisa M. ; Clark, Royden A. ; Schwartz, Stuart ; Segraves, Rick ; Oseroff, Vanessa V. ; Albertson, Donna G. ; Pinkel, Daniel ; Eichler, Evan E.

2005-01-01

The human genome contains numerous blocks of highly homologous duplicated sequence. This higher-order architecture provides a substrate for recombination and recurrent chromosomal rearrangement associated with genomic disease. However, an assessment of the role of segmental duplications in normal variation has not yet been made. On the basis of the duplication architecture of the human genome, we defined a set of 130 potential rearrangement hotspots and constructed a targeted bacterial artificial chromosome (BAC) microarray (with 2,194 BACs) to assess copy-number variation in these regions by array comparative genomic hybridization. Using our segmental duplication BAC microarray, we screened a panel of 47 normal individuals, who represented populations from four continents, and we identified 119 regions of copy-number polymorphism (CNP), 73 of which were previously unreported. We observed an equal frequency of duplications and deletions, as well as a 4-fold enrichment of CNPs within hotspot regions, compared with control BACs (P < .000001), which suggests that segmental duplications are a major catalyst of large-scale variation in the human genome. Importantly, segmental duplications themselves were also significantly enriched >4-fold within regions of CNP. Almost without exception, CNPs were not confined to a single population, suggesting that these either are recurrent events, having occurred independently in multiple founders, or were present in early human populations. Our study demonstrates that segmental duplications define hotspots of chromosomal rearrangement, likely acting as mediators of normal variation as well as genomic disease, and it suggests that the consideration of genomic architecture can significantly improve the ascertainment of large-scale rearrangements. Our specialized segmental duplication BAC microarray and associated database of structural polymorphisms will provide an important resource for the future characterization of human genomic
78 FR 68856 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2013-11-15

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Nakamura, Ph.D., Scientific Review Officer, Scientific Review Branch, National Human Genome Research...-402-0838. [[Page 68857
Expression pattern of G protein-coupled receptor 30 in human seminiferous tubular cells.

PubMed

Oliveira, Pedro F; Alves, Marco G; Martins, Ana D; Correia, Sara; Bernardino, Raquel L; Silva, Joaquina; Barros, Alberto; Sousa, Mário; Cavaco, José E; Socorro, Sílvia

2014-05-15

The role of estrogens in male reproductive physiology has been intensively studied over the last few years. Yet, the involvement of their specific receptors has long been a matter of debate. The selective testicular expression of the classic nuclear estrogen receptors (ERα and ERβ) argues in favor of ER-specific functions in the spermatogenic event. Recently, the existence of a G protein-coupled estrogen receptor (GPR30) mediating non-genomic effects of estrogens has also been described. However, little is known about the specific testicular expression pattern of GPR30, as well as on its participation in the control of male reproductive function. Herein, by means of immunohistochemical and molecular biology techniques (RT-PCR and Western blot), we aimed to present the first exhaustive evaluation of GPR30 expression in non-neoplastic human testicular cells. Indeed, we were able to demonstrate that GPR30 was expressed in human testicular tissue and that the staining pattern was consistent with its cytoplasmic localization. Additionally, by using cultured human Sertoli cells (SCs) and isolated haploid and diploid germ cells fractions, we confirmed that GPR30 is expressed in SCs and diploid germ cells but not in haploid germ cells. This specific expression pattern suggests a role for GPR30 in spermatogenesis. Copyright © 2014 Elsevier Inc. All rights reserved.

Sequence space coverage, entropy of genomes and the potential to detect non-human DNA in human samples

PubMed Central

Liu, Zhandong; Venkatesh, Santosh S; Maley, Carlo C

2008-01-01

Background Genomes store information for building and maintaining organisms. Complete sequencing of many genomes provides the opportunity to study and compare global information properties of those genomes. Results We have analyzed aspects of the information content of Homo sapiens, Mus musculus, Drosophila melanogaster, Caenorhabditis elegans, Arabidopsis thaliana, Saccharomyces cerevisiae, and Escherichia coli (K-12) genomes. Virtually all possible (> 98%) 12 bp oligomers appear in vertebrate genomes while < 2% of 19 bp oligomers are present. Other species showed different ranges of > 98% to < 2% of possible oligomers in D. melanogaster (12–17 bp), C. elegans (11–17 bp), A. thaliana (11–17 bp), S. cerevisiae (10–16 bp) and E. coli (9–15 bp). Frequencies of unique oligomers in the genomes follow similar patterns. We identified a set of 2.6 M 15-mers that are more than 1 nucleotide different from all 15-mers in the human genome and so could be used as probes to detect microbes in human samples. In a human sample, these probes would detect 100% of the 433 currently fully sequenced prokaryotes and 75% of the 3065 fully sequenced viruses. The human genome is significantly more compact in sequence space than a random genome. We identified the most frequent 5- to 20-mers in the human genome, which may prove useful as PCR primers. We also identified a bacterium, Anaeromyxobacter dehalogenans, which has an exceptionally low diversity of oligomers given the size of its genome and its GC content. The entropy of coding regions in the human genome is significantly higher than non-coding regions and chromosomes. However chromosomes 1, 2, 9, 12 and 14 have a relatively high proportion of coding DNA without high entropy, and chromosome 20 is the opposite with a low frequency of coding regions but relatively high entropy. Conclusion Measures of the frequency of oligomers are useful for designing PCR assays and for identifying chromosomes and organisms with hidden
Doubled haploid production from Spanish onion (Allium cepa L.) germplasm: embryogenesis induction, plant regeneration and chromosome doubling.

PubMed

Fayos, Oreto; Vallés, María P; Garcés-Claver, Ana; Mallor, Cristina; Castillo, Ana M

2015-01-01

The use of doubled haploids in onion breeding is limited due to the low gynogenesis efficiency of this species. Gynogenesis capacity from Spanish germplasm, including the sweet cultivar Fuentes de Ebro, the highly pungent landrace BGHZ1354 and the two Valenciana type commercial varieties Recas and Rita, was evaluated and optimized in this study. The OH-1 population, characterized by a high gynogenesis induction, was used as control. Growing conditions of the donor plants were tested with a one-step protocol and field plants produced a slightly higher percentage of embryogenesis induction than growth chamber plants. A one-step protocol was compared with a two-step protocol for embryogenesis induction. Spanish germplasm produced a 2-3 times higher percentage of embryogenesis with the two-step protocol, Recas showing the highest percentage (2.09%) and Fuentes de Ebro the lowest (0.53%). These percentages were significantly lower than those from the OH-1 population, with an average of 15% independently of the protocol used. The effect of different containers on plant regeneration was tested using both protocols. The highest percentage of acclimated plants was obtained with the two-step protocol in combination with Eco2box (70%), whereas the lowest percentage was observed with glass tubes in the two protocols (20-23%). Different amiprofos-methyl (APM) treatments were applied to embryos for chromosome doubling. A similar number of doubled haploid plants were recovered with 25 or 50 μM APM in liquid medium. However, the application of 25 μM in solid medium for 24 h produced the highest number of doubled haploid plants. Somatic regeneration from flower buds of haploid and mixoploid plants proved to be a successful approach for chromosome doubling, since diploid plants were obtained from the four regenerated lines. In this study, doubled haploid plants were produced from the four Spanish cultivars, however further improvements are needed to increase their gynogenesis
Pervasive sequence patents cover the entire human genome.

PubMed

Rosenfeld, Jeffrey A; Mason, Christopher E

2013-01-01

The scope and eligibility of patents for genetic sequences have been debated for decades, but a critical case regarding gene patents (Association of Molecular Pathologists v. Myriad Genetics) is now reaching the US Supreme Court. Recent court rulings have supported the assertion that such patents can provide intellectual property rights on sequences as small as 15 nucleotides (15mers), but an analysis of all current US patent claims and the human genome presented here shows that 15mer sequences from all human genes match at least one other gene. The average gene matches 364 other genes as 15mers; the breast-cancer-associated gene BRCA1 has 15mers matching at least 689 other genes. Longer sequences (1,000 bp) still showed extensive cross-gene matches. Furthermore, 15mer-length claims from bovine and other animal patents could also claim as much as 84% of the genes in the human genome. In addition, when we expanded our analysis to full-length patent claims on DNA from all US patents to date, we found that 41% of the genes in the human genome have been claimed. Thus, current patents for both short and long nucleotide sequences are extraordinarily non-specific and create an uncertain, problematic liability for genomic medicine, especially in regard to targeted re-sequencing and other sequence diagnostic assays.
Scientific Goals of the Human Genome Project.

ERIC Educational Resources Information Center

Wills, Christopher

1993-01-01

The Human Genome Project, an effort to sequence all the DNA of a human cell, is needed to better understand the behavior of chromosomes during cell division, with the ultimate goal of understanding the specific genes contributing to specific diseases and disabilities. (MSE)
Genomic signatures of diet-related shifts during human origins

PubMed Central

Babbitt, Courtney C.; Warner, Lisa R.; Fedrigo, Olivier; Wall, Christine E.; Wray, Gregory A.

2011-01-01

There are numerous anthropological analyses concerning the importance of diet during human evolution. Diet is thought to have had a profound influence on the human phenotype, and dietary differences have been hypothesized to contribute to the dramatic morphological changes seen in modern humans as compared with non-human primates. Here, we attempt to integrate the results of new genomic studies within this well-developed anthropological context. We then review the current evidence for adaptation related to diet, both at the level of sequence changes and gene expression. Finally, we propose some ways in which new technologies can help identify specific genomic adaptations that have resulted in metabolic and morphological differences between humans and non-human primates. PMID:21177690
The draft genome and transcriptome of Cannabis sativa

PubMed Central

2011-01-01

Background Cannabis sativa has been cultivated throughout human history as a source of fiber, oil and food, and for its medicinal and intoxicating properties. Selective breeding has produced cannabis plants for specific uses, including high-potency marijuana strains and hemp cultivars for fiber and seed production. The molecular biology underlying cannabinoid biosynthesis and other traits of interest is largely unexplored. Results We sequenced genomic DNA and RNA from the marijuana strain Purple Kush using shortread approaches. We report a draft haploid genome sequence of 534 Mb and a transcriptome of 30,000 genes. Comparison of the transcriptome of Purple Kush with that of the hemp cultivar 'Finola' revealed that many genes encoding proteins involved in cannabinoid and precursor pathways are more highly expressed in Purple Kush than in 'Finola'. The exclusive occurrence of Δ9-tetrahydrocannabinolic acid synthase in the Purple Kush transcriptome, and its replacement by cannabidiolic acid synthase in 'Finola', may explain why the psychoactive cannabinoid Δ9-tetrahydrocannabinol (THC) is produced in marijuana but not in hemp. Resequencing the hemp cultivars 'Finola' and 'USO-31' showed little difference in gene copy numbers of cannabinoid pathway enzymes. However, single nucleotide variant analysis uncovered a relatively high level of variation among four cannabis types, and supported a separation of marijuana and hemp. Conclusions The availability of the Cannabis sativa genome enables the study of a multifunctional plant that occupies a unique role in human culture. Its availability will aid the development of therapeutic marijuana strains with tailored cannabinoid profiles and provide a basis for the breeding of hemp with improved agronomic characteristics. PMID:22014239
A mobile threat to genome stability: The impact of non-LTR retrotransposons upon the human genome.

PubMed

Konkel, Miriam K; Batzer, Mark A

2010-08-01

It is now commonly agreed that the human genome is not the stable entity originally presumed. Deletions, duplications, inversions, and insertions are common, and contribute significantly to genomic structural variations (SVs). Their collective impact generates much of the inter-individual genomic diversity observed among humans. Not only do these variations change the structure of the genome; they may also have functional implications, e.g. altered gene expression. Some SVs have been identified as the cause of genetic disorders, including cancer predisposition. Cancer cells are notorious for their genomic instability, and often show genomic rearrangements at the microscopic and submicroscopic level to which transposable elements (TEs) contribute. Here, we review the role of TEs in genome instability, with particular focus on non-LTR retrotransposons. Currently, three non-LTR retrotransposon families - long interspersed element 1 (L1), SVA (short interspersed element (SINE-R), variable number of tandem repeats (VNTR), and Alu), and Alu (a SINE) elements - mobilize in the human genome, and cause genomic instability through both insertion- and post-insertion-based mutagenesis. Due to the abundance and high sequence identity of TEs, they frequently mislead the homologous recombination repair pathway into non-allelic homologous recombination, causing deletions, duplications, and inversions. While less comprehensively studied, non-LTR retrotransposon insertions and TE-mediated rearrangements are probably more common in cancer cells than in healthy tissue. This may be at least partially attributed to the commonly seen global hypomethylation as well as general epigenetic dysfunction of cancer cells. Where possible, we provide examples that impact cancer predisposition and/or development. Copyright © 2010 Elsevier Ltd. All rights reserved.
75 FR 62548 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2010-10-12

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Call). Contact Person: Camilla E. Day, PhD, Scientific Review Officer, CIDR, National Human Genome...- 402-8837, [email protected] . Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human...
76 FR 9031 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2011-02-16

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Call). Contact Person: Camilla E. Day, PhD, Scientific Review Officer, CIDR, National Human Genome...- 402-8837, [email protected] . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human...
78 FR 11898 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2013-02-20

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Conference Call). Contact Person: Camilla E. Day, Ph.D., Scientific Review Officer CIDR, National Human....172, Human Genome Research, National Institutes of Health, HHS) Dated: February 13, 2013. David Clary...
78 FR 77477 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2013-12-23

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Conference Call). Contact Person: Camilla E. Day, Ph.D., Scientific Review Officer, CIDR, National Human..., Human Genome Research, National Institutes of Health, HHS). Dated: December 17, 2013. David Clary...
77 FR 64816 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2012-10-23

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Conference Call). Contact Person: Camilla E. Day, Ph.D., Scientific Review Officer, CIDR, National Human..., Human Genome Research, National Institutes of Health, HHS) Dated: October 16, 2012. David Clary, Program...
A Secure Alignment Algorithm for Mapping Short Reads to Human Genome.

PubMed

Zhao, Yongan; Wang, Xiaofeng; Tang, Haixu

2018-05-09

The elastic and inexpensive computing resources such as clouds have been recognized as a useful solution to analyzing massive human genomic data (e.g., acquired by using next-generation sequencers) in biomedical researches. However, outsourcing human genome computation to public or commercial clouds was hindered due to privacy concerns: even a small number of human genome sequences contain sufficient information for identifying the donor of the genomic data. This issue cannot be directly addressed by existing security and cryptographic techniques (such as homomorphic encryption), because they are too heavyweight to carry out practical genome computation tasks on massive data. In this article, we present a secure algorithm to accomplish the read mapping, one of the most basic tasks in human genomic data analysis based on a hybrid cloud computing model. Comparing with the existing approaches, our algorithm delegates most computation to the public cloud, while only performing encryption and decryption on the private cloud, and thus makes the maximum use of the computing resource of the public cloud. Furthermore, our algorithm reports similar results as the nonsecure read mapping algorithms, including the alignment between reads and the reference genome, which can be directly used in the downstream analysis such as the inference of genomic variations. We implemented the algorithm in C++ and Python on a hybrid cloud system, in which the public cloud uses an Apache Spark system.
A Targeted Capture Linkage Map Anchors the Genome of the Schistosomiasis Vector Snail, Biomphalaria glabrata.

PubMed

Tennessen, Jacob A; Bollmann, Stephanie R; Blouin, Michael S

2017-07-05

The aquatic planorbid snail Biomphalaria glabrata is one of the most intensively-studied mollusks due to its role in the transmission of schistosomiasis. Its 916 Mb genome has recently been sequenced and annotated, but it remains poorly assembled. Here, we used targeted capture markers to map over 10,000 B. glabrata scaffolds in a linkage cross of 94 F1 offspring, generating 24 linkage groups (LGs). We added additional scaffolds to these LGs based on linkage disequilibrium (LD) analysis of targeted capture and whole-genome sequences of 96 unrelated snails. Our final linkage map consists of 18,613 scaffolds comprising 515 Mb, representing 56% of the genome and 75% of genic and nonrepetitive regions. There are 18 large (> 10 Mb) LGs, likely representing the expected 18 haploid chromosomes, and > 50% of the genome has been assigned to LGs of at least 17 Mb. Comparisons with other gastropod genomes reveal patterns of synteny and chromosomal rearrangements. Linkage relationships of key immune-relevant genes may help clarify snail-schistosome interactions. By focusing on linkage among genic and nonrepetitive regions, we have generated a useful resource for associating snail phenotypes with causal genes, even in the absence of a complete genome assembly. A similar approach could potentially improve numerous poorly-assembled genomes in other taxa. This map will facilitate future work on this host of a serious human parasite. Copyright © 2017 Tennessen et al.
The human genome project: an historical perspective for social workers.

PubMed

Saunders, Marlene

2011-01-01

Having mapped the human genome, the Human Genome Project maintains that certain genes can be linked to specific diseases and certain forms of human behavior. This breakthrough, it is hoped, will lead to the effective treatment, even the elimination of serious, debilitating illnesses for all groups of people. However, because the project conjures up memories of eugenics, the project raises concerns about its potential for identifying and linking diseases and social conditions (e.g., criminal behavior) to certain groups. This article places the Human Genome Project in historical context in terms of its resemblance to the eugenics movement in America and a period in social work history when the profession embraced eugenics and was guided by the movement's premises in its response to poor people.
Efficient genome editing of differentiated renal epithelial cells.

PubMed

Hofherr, Alexis; Busch, Tilman; Huber, Nora; Nold, Andreas; Bohn, Albert; Viau, Amandine; Bienaimé, Frank; Kuehn, E Wolfgang; Arnold, Sebastian J; Köttgen, Michael

2017-02-01

Recent advances in genome editing technologies have enabled the rapid and precise manipulation of genomes, including the targeted introduction, alteration, and removal of genomic sequences. However, respective methods have been described mainly in non-differentiated or haploid cell types. Genome editing of well-differentiated renal epithelial cells has been hampered by a range of technological issues, including optimal design, efficient expression of multiple genome editing constructs, attainable mutation rates, and best screening strategies. Here, we present an easily implementable workflow for the rapid generation of targeted heterozygous and homozygous genomic sequence alterations in renal cells using transcription activator-like effector nucleases (TALENs) and the clustered regularly interspaced short palindromic repeat (CRISPR) system. We demonstrate the versatility of established protocols by generating novel cellular models for studying autosomal dominant polycystic kidney disease (ADPKD). Furthermore, we show that cell culture-validated genetic modifications can be readily applied to mouse embryonic stem cells (mESCs) for the generation of corresponding mouse models. The described procedure for efficient genome editing can be applied to any cell type to study physiological and pathophysiological functions in the context of precisely engineered genotypes.
Characterization of noncoding regulatory DNA in the human genome.

PubMed

Elkon, Ran; Agami, Reuven

2017-08-08

Genetic variants associated with common diseases are usually located in noncoding parts of the human genome. Delineation of the full repertoire of functional noncoding elements, together with efficient methods for probing their biological roles, is therefore of crucial importance. Over the past decade, DNA accessibility and various epigenetic modifications have been associated with regulatory functions. Mapping these features across the genome has enabled researchers to begin to document the full complement of putative regulatory elements. High-throughput reporter assays to probe the functions of regulatory regions have also been developed but these methods separate putative regulatory elements from the chromosome so that any effects of chromatin context and long-range regulatory interactions are lost. Definitive assignment of function(s) to putative cis-regulatory elements requires perturbation of these elements. Genome-editing technologies are now transforming our ability to perturb regulatory elements across entire genomes. Interpretation of high-throughput genetic screens that incorporate genome editors might enable the construction of an unbiased map of functional noncoding elements in the human genome.
77 FR 50140 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2012-08-20

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome..., Human Genome Research, National Institutes of Health, HHS) Dated: August 13, 2012. Anna Snouffer, Deputy..., Bethesda, MD 20892. Contact Person: Camilla E. Day, Ph.D., Scientific Review Officer, CIDR, National Human...
76 FR 50486 - National Human Genome Research Institute; Notice of Closed Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2011-08-15

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Conference Call). Contact Person: Camilla E. Day, PhD, Scientific Review Officer, CIDR, National Human Genome...- 402-8837, [email protected] . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human...
Explaining human uniqueness: genome interactions with environment, behaviour and culture.

PubMed

Varki, Ajit; Geschwind, Daniel H; Eichler, Evan E

2008-10-01

What makes us human? Specialists in each discipline respond through the lens of their own expertise. In fact, 'anthropogeny' (explaining the origin of humans) requires a transdisciplinary approach that eschews such barriers. Here we take a genomic and genetic perspective towards molecular variation, explore systems analysis of gene expression and discuss an organ-systems approach. Rejecting any 'genes versus environment' dichotomy, we then consider genome interactions with environment, behaviour and culture, finally speculating that aspects of human uniqueness arose because of a primate evolutionary trend towards increasing and irreversible dependence on learned behaviours and culture - perhaps relaxing allowable thresholds for large-scale genomic diversity.

Explaining human uniqueness: genome interactions with environment, behaviour and culture

PubMed Central

Varki, Ajit; Geschwind, Daniel H.; Eichler, Evan E.

2009-01-01

What makes us human? Specialists in each discipline respond through the lens of their own expertise. In fact, ‘anthropogeny’ (explaining the origin of humans) requires a transdisciplinary approach that eschews such barriers. Here we take a genomic and genetic perspective towards molecular variation, explore systems analysis of gene expression and discuss an organ-systems approach. Rejecting any ‘genes versus environment’ dichotomy, we then consider genome interactions with environment, behaviour and culture, finally speculating that aspects of human uniqueness arose because of a primate evolutionary trend towards increasing and irreversible dependence on learned behaviours and culture — perhaps relaxing allowable thresholds for large-scale genomic diversity. PMID:18802414
The Emerging Field of Human Social Genomics

PubMed Central

Slavich, George M.; Cole, Steven W.

2013-01-01

Although we generally experience our bodies as being biologically stable across time and situations, an emerging field of research is demonstrating that external social conditions, especially our subjective perceptions of those conditions, can influence our most basic internal biological processes—namely, the expression of our genes. This research on human social genomics has begun to identify the types of genes that are subject to social-environmental regulation, the neural and molecular mechanisms that mediate the effects of social processes on gene expression, and the genetic polymorphisms that moderate individual differences in genomic sensitivity to social context. The molecular models resulting from this research provide new opportunities for understanding how social and genetic factors interact to shape complex behavioral phenotypes and susceptibility to disease. This research also sheds new light on the evolution of the human genome and challenges the fundamental belief that our molecular makeup is relatively stable and impermeable to social-environmental influence. PMID:23853742
The Complete Sequence of a Human Parainfluenzavirus 4 Genome

PubMed Central

Yea, Carmen; Cheung, Rose; Collins, Carol; Adachi, Dena; Nishikawa, John; Tellier, Raymond

2009-01-01

Although the human parainfluenza virus 4 (HPIV4) has been known for a long time, its genome, alone among the human paramyxoviruses, has not been completely sequenced to date. In this study we obtained the first complete genomic sequence of HPIV4 from a clinical isolate named SKPIV4 obtained at the Hospital for Sick Children in Toronto (Ontario, Canada). The coding regions for the N, P/V, M, F and HN proteins show very high identities (95% to 97%) with previously available partial sequences for HPIV4B. The sequence for the L protein and the non-coding regions represent new information. A surprising feature of the genome is its length, more than 17 kb, making it the longest genome within the genus Rubulavirus, although the length is well within the known range of 15 kb to 19 kb for the subfamily Paramyxovirinae. The availability of a complete genomic sequence will facilitate investigations on a respiratory virus that is still not completely characterized. PMID:21994536
A comprehensive transcript index of the human genome generated using microarrays and computational approaches

PubMed Central

Schadt, Eric E; Edwards, Stephen W; GuhaThakurta, Debraj; Holder, Dan; Ying, Lisa; Svetnik, Vladimir; Leonardson, Amy; Hart, Kyle W; Russell, Archie; Li, Guoya; Cavet, Guy; Castle, John; McDonagh, Paul; Kan, Zhengyan; Chen, Ronghua; Kasarskis, Andrew; Margarint, Mihai; Caceres, Ramon M; Johnson, Jason M; Armour, Christopher D; Garrett-Engele, Philip W; Tsinoremas, Nicholas F; Shoemaker, Daniel D

2004-01-01

Background Computational and microarray-based experimental approaches were used to generate a comprehensive transcript index for the human genome. Oligonucleotide probes designed from approximately 50,000 known and predicted transcript sequences from the human genome were used to survey transcription from a diverse set of 60 tissues and cell lines using ink-jet microarrays. Further, expression activity over at least six conditions was more generally assessed using genomic tiling arrays consisting of probes tiled through a repeat-masked version of the genomic sequence making up chromosomes 20 and 22. Results The combination of microarray data with extensive genome annotations resulted in a set of 28,456 experimentally supported transcripts. This set of high-confidence transcripts represents the first experimentally driven annotation of the human genome. In addition, the results from genomic tiling suggest that a large amount of transcription exists outside of annotated regions of the genome and serves as an example of how this activity could be measured on a genome-wide scale. Conclusions These data represent one of the most comprehensive assessments of transcriptional activity in the human genome and provide an atlas of human gene expression over a unique set of gene predictions. Before the annotation of the human genome is considered complete, however, the previously unannotated transcriptional activity throughout the genome must be fully characterized. PMID:15461792
Recruiting Human Microbiome Shotgun Data to Site-Specific Reference Genomes

PubMed Central

Xie, Gary; Lo, Chien-Chi; Scholz, Matthew; Chain, Patrick S. G.

2014-01-01

The human body consists of innumerable multifaceted environments that predispose colonization by a number of distinct microbial communities, which play fundamental roles in human health and disease. In addition to community surveys and shotgun metagenomes that seek to explore the composition and diversity of these microbiomes, there are significant efforts to sequence reference microbial genomes from many body sites of healthy adults. To illustrate the utility of reference genomes when studying more complex metagenomes, we present a reference-based analysis of sequence reads generated from 55 shotgun metagenomes, selected from 5 major body sites, including 16 sub-sites. Interestingly, between 13% and 92% (62.3% average) of these shotgun reads were aligned to a then-complete list of 2780 reference genomes, including 1583 references for the human microbiome. However, no reference genome was universally found in all body sites. For any given metagenome, the body site-specific reference genomes, derived from the same body site as the sample, accounted for an average of 58.8% of the mapped reads. While different body sites did differ in abundant genera, proximal or symmetrical body sites were found to be most similar to one another. The extent of variation observed, both between individuals sampled within the same microenvironment, or at the same site within the same individual over time, calls into question comparative studies across individuals even if sampled at the same body site. This study illustrates the high utility of reference genomes and the need for further site-specific reference microbial genome sequencing, even within the already well-sampled human microbiome. PMID:24454771
The Oxytricha trifallax Macronuclear Genome: A Complex Eukaryotic Genome with 16,000 Tiny Chromosomes

PubMed Central

Swart, Estienne C.; Bracht, John R.; Magrini, Vincent; Minx, Patrick; Chen, Xiao; Zhou, Yi; Khurana, Jaspreet S.; Goldman, Aaron D.; Nowacki, Mariusz; Schotanus, Klaas; Jung, Seolkyoung; Fulton, Robert S.; Ly, Amy; McGrath, Sean; Haub, Kevin; Wiggins, Jessica L.; Storton, Donna; Matese, John C.; Parsons, Lance; Chang, Wei-Jen; Bowen, Michael S.; Stover, Nicholas A.; Jones, Thomas A.; Eddy, Sean R.; Herrick, Glenn A.; Doak, Thomas G.; Wilson, Richard K.; Mardis, Elaine R.; Landweber, Laura F.

2013-01-01

The macronuclear genome of the ciliate Oxytricha trifallax displays an extreme and unique eukaryotic genome architecture with extensive genomic variation. During sexual genome development, the expressed, somatic macronuclear genome is whittled down to the genic portion of a small fraction (∼5%) of its precursor “silent” germline micronuclear genome by a process of “unscrambling” and fragmentation. The tiny macronuclear “nanochromosomes” typically encode single, protein-coding genes (a small portion, 10%, encode 2–8 genes), have minimal noncoding regions, and are differentially amplified to an average of ∼2,000 copies. We report the high-quality genome assembly of ∼16,000 complete nanochromosomes (∼50 Mb haploid genome size) that vary from 469 bp to 66 kb long (mean ∼3.2 kb) and encode ∼18,500 genes. Alternative DNA fragmentation processes ∼10% of the nanochromosomes into multiple isoforms that usually encode complete genes. Nucleotide diversity in the macronucleus is very high (SNP heterozygosity is ∼4.0%), suggesting that Oxytricha trifallax may have one of the largest known effective population sizes of eukaryotes. Comparison to other ciliates with nonscrambled genomes and long macronuclear chromosomes (on the order of 100 kb) suggests several candidate proteins that could be involved in genome rearrangement, including domesticated MULE and IS1595-like DDE transposases. The assembly of the highly fragmented Oxytricha macronuclear genome is the first completed genome with such an unusual architecture. This genome sequence provides tantalizing glimpses into novel molecular biology and evolution. For example, Oxytricha maintains tens of millions of telomeres per cell and has also evolved an intriguing expansion of telomere end-binding proteins. In conjunction with the micronuclear genome in progress, the O. trifallax macronuclear genome will provide an invaluable resource for investigating programmed genome rearrangements, complementing
Genome-Wide Prediction and Analysis of 3D-Domain Swapped Proteins in the Human Genome from Sequence Information.

PubMed

Upadhyay, Atul Kumar; Sowdhamini, Ramanathan

2016-01-01

3D-domain swapping is one of the mechanisms of protein oligomerization and the proteins exhibiting this phenomenon have many biological functions. These proteins, which undergo domain swapping, have acquired much attention owing to their involvement in human diseases, such as conformational diseases, amyloidosis, serpinopathies, proteionopathies etc. Early realisation of proteins in the whole human genome that retain tendency to domain swap will enable many aspects of disease control management. Predictive models were developed by using machine learning approaches with an average accuracy of 78% (85.6% of sensitivity, 87.5% of specificity and an MCC value of 0.72) to predict putative domain swapping in protein sequences. These models were applied to many complete genomes with special emphasis on the human genome. Nearly 44% of the protein sequences in the human genome were predicted positive for domain swapping. Enrichment analysis was performed on the positively predicted sequences from human genome for their domain distribution, disease association and functional importance based on Gene Ontology (GO). Enrichment analysis was also performed to infer a better understanding of the functional importance of these sequences. Finally, we developed hinge region prediction, in the given putative domain swapped sequence, by using important physicochemical properties of amino acids.
Personal genomes in progress: from the human genome project to the personal genome project.

PubMed

Lunshof, Jeantine E; Bobe, Jason; Aach, John; Angrist, Misha; Thakuria, Joseph V; Vorhaus, Daniel B; Hoehe, Margret R; Church, George M

2010-01-01

The cost of a diploid human genome sequence has dropped from about $70M to $2000 since 2007--even as the standards for redundancy have increased from 7x to 40x in order to improve call rates. Coupled with the low return on investment for common single-nucleotide polylmorphisms, this has caused a significant rise in interest in correlating genome sequences with comprehensive environmental and trait data (GET). The cost of electronic health records, imaging, and microbial, immunological, and behavioral data are also dropping quickly. Sharing such integrated GET datasets and their interpretations with a diversity of researchers and research subjects highlights the need for informed-consent models capable of addressing novel privacy and other issues, as well as for flexible data-sharing resources that make materials and data available with minimum restrictions on use. This article examines the Personal Genome Project's effort to develop a GET database as a public genomics resource broadly accessible to both researchers and research participants, while pursuing the highest standards in research ethics.
GenPlay Multi-Genome, a tool to compare and analyze multiple human genomes in a graphical interface.

PubMed

Lajugie, Julien; Fourel, Nicolas; Bouhassira, Eric E

2015-01-01

Parallel visualization of multiple individual human genomes is a complex endeavor that is rapidly gaining importance with the increasing number of personal, phased and cancer genomes that are being generated. It requires the display of variants such as SNPs, indels and structural variants that are unique to specific genomes and the introduction of multiple overlapping gaps in the reference sequence. Here, we describe GenPlay Multi-Genome, an application specifically written to visualize and analyze multiple human genomes in parallel. GenPlay Multi-Genome is ideally suited for the comparison of allele-specific expression and functional genomic data obtained from multiple phased genomes in a graphical interface with access to multiple-track operation. It also allows the analysis of data that have been aligned to custom genomes rather than to a standard reference and can be used as a variant calling format file browser and as a tool to compare different genome assembly, such as hg19 and hg38. GenPlay is available under the GNU public license (GPL-3) from http://genplay.einstein.yu.edu. The source code is available at https://github.com/JulienLajugie/GenPlay. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Brewing characteristics of haploid strains isolated from sake yeast Kyokai No. 7.

PubMed

Katou, Taku; Kitagaki, Hiroshi; Akao, Takeshi; Shimoi, Hitoshi

2008-11-01

Sake yeast exhibit various characteristics that make them more suitable for sake brewing compared to other yeast strains. Since sake yeast strains are Saccharomyces cerevisiae heterothallic diploid strains, it is likely that they have heterozygous alleles on homologous chromosomes (heterozygosity) due to spontaneous mutations. If this is the case, segregation of phenotypic traits in haploid strains after sporulation and concomitant meiosis of sake yeast strains would be expected to occur. To examine this hypothesis, we isolated 100 haploid strains from Kyokai No. 7 (K7), a typical sake yeast strain in Japan, and compared their brewing characteristics in small-scale sake-brewing tests. Analyses of the resultant sake samples showed a smooth and continuous distribution of analytical values for brewing characteristics, suggesting that K7 has multiple heterozygosities that affect brewing characteristics and that these heterozygous alleles do segregate after sporulation. Correlation and principal component analyses suggested that the analytical parameters could be classified into two groups, indicating fermentation ability and sake flavour. (c) 2008 John Wiley & Sons, Ltd.
Learning about the Human Genome. Part 2: Resources for Science Educators. ERIC Digest.

ERIC Educational Resources Information Center

Haury, David L.

This ERIC Digest identifies how the human genome project fits into the "National Science Education Standards" and lists Human Genome Project Web sites found on the World Wide Web. It is a resource companion to "Learning about the Human Genome. Part 1: Challenge to Science Educators" (Haury 2001). The Web resources and…
Segmenting the human genome based on states of neutral genetic divergence.

PubMed

Kuruppumullage Don, Prabhani; Ananda, Guruprasad; Chiaromonte, Francesca; Makova, Kateryna D

2013-09-03

Many studies have demonstrated that divergence levels generated by different mutation types vary and covary across the human genome. To improve our still-incomplete understanding of the mechanistic basis of this phenomenon, we analyze several mutation types simultaneously, anchoring their variation to specific regions of the genome. Using hidden Markov models on insertion, deletion, nucleotide substitution, and microsatellite divergence estimates inferred from human-orangutan alignments of neutrally evolving genomic sequences, we segment the human genome into regions corresponding to different divergence states--each uniquely characterized by specific combinations of divergence levels. We then parsed the mutagenic contributions of various biochemical processes associating divergence states with a broad range of genomic landscape features. We find that high divergence states inhabit guanine- and cytosine (GC)-rich, highly recombining subtelomeric regions; low divergence states cover inner parts of autosomes; chromosome X forms its own state with lowest divergence; and a state of elevated microsatellite mutability is interspersed across the genome. These general trends are mirrored in human diversity data from the 1000 Genomes Project, and departures from them highlight the evolutionary history of primate chromosomes. We also find that genes and noncoding functional marks [annotations from the Encyclopedia of DNA Elements (ENCODE)] are concentrated in high divergence states. Our results provide a powerful tool for biomedical data analysis: segmentations can be used to screen personal genome variants--including those associated with cancer and other diseases--and to improve computational predictions of noncoding functional elements.
Evolution and maintenance of haploid-diploid life cycles in natural populations: The case of the marine brown alga Ectocarpus.

PubMed

Couceiro, Lucía; Le Gac, Mickael; Hunsperger, Heather M; Mauger, Stéphane; Destombe, Christophe; Cock, J Mark; Ahmed, Sophia; Coelho, Susana M; Valero, Myriam; Peters, Akira F

2015-07-01

The evolutionary stability of haploid-diploid life cycles is still controversial. Mathematical models indicate that niche differences between ploidy phases may be a necessary condition for the evolution and maintenance of these life cycles. Nevertheless, experimental support for this prediction remains elusive. In the present work, we explored this hypothesis in natural populations of the brown alga Ectocarpus. Consistent with the life cycle described in culture, Ectocarpus crouaniorum in NW France and E. siliculosus in SW Italy exhibited an alternation between haploid gametophytes and diploid sporophytes. Our field data invalidated, however, the long-standing view of an isomorphic alternation of generations. Gametophytes and sporophytes displayed marked differences in size and, conforming to theoretical predictions, occupied different spatiotemporal niches. Gametophytes were found almost exclusively on the alga Scytosiphon lomentaria during spring whereas sporophytes were present year-round on abiotic substrata. Paradoxically, E. siliculosus in NW France exhibited similar habitat usage despite the absence of alternation of ploidy phases. Diploid sporophytes grew both epilithically and epiphytically, and this mainly asexual population gained the same ecological advantage postulated for haploid-diploid populations. Consequently, an ecological interpretation of the niche differences between haploid and diploid individuals does not seem to satisfactorily explain the evolution of the Ectocarpus life cycle. © 2015 The Author(s). Evolution © 2015 The Society for the Study of Evolution.
Dissecting the human microbiome with single-cell genomics.

PubMed

Tolonen, Andrew C; Xavier, Ramnik J

2017-06-14

Recent advances in genome sequencing of single microbial cells enable the assignment of functional roles to members of the human microbiome that cannot currently be cultured. This approach can reveal the genomic basis of phenotypic variation between closely related strains and can be applied to the targeted study of immunogenic bacteria in disease.
HOWDY: an integrated database system for human genome research

PubMed Central

Hirakawa, Mika

2002-01-01

HOWDY is an integrated database system for accessing and analyzing human genomic information (http://www-alis.tokyo.jst.go.jp/HOWDY/). HOWDY stores information about relationships between genetic objects and the data extracted from a number of databases. HOWDY consists of an Internet accessible user interface that allows thorough searching of the human genomic databases using the gene symbols and their aliases. It also permits flexible editing of the sequence data. The database can be searched using simple words and the search can be restricted to a specific cytogenetic location. Linear maps displaying markers and genes on contig sequences are available, from which an object can be chosen. Any search starting point identifies all the information matching the query. HOWDY provides a convenient search environment of human genomic data for scientists unsure which database is most appropriate for their search. PMID:11752279
Human genome-microbiome interaction: metagenomics frontiers for the aetiopathology of autoimmune diseases.

PubMed

Gundogdu, Aycan; Nalbantoglu, Ufuk

2017-04-01

A short while ago, the human genome and microbiome were analysed simultaneously for the first time as a multi-omic approach. The analyses of heterogeneous population cohorts showed that microbiome components were associated with human genome variations. In-depth analysis of these results reveals that the majority of those relationships are between immune pathways and autoimmune disease-associated microbiome components. Thus, it can be hypothesized that autoimmunity may be associated with homeostatic disequilibrium of the human-microbiome interactome. Further analysis of human genome-human microbiome relationships in disease contexts with tailored systems biology approaches may yield insights into disease pathogenesis and prognosis.
Segregation distortion causes large-scale differences between male and female genomes in hybrid ants.

PubMed

Kulmuni, Jonna; Seifert, Bernhard; Pamilo, Pekka

2010-04-20

Hybridization in isolated populations can lead either to hybrid breakdown and extinction or in some cases to speciation. The basis of hybrid breakdown lies in genetic incompatibilities between diverged genomes. In social Hymenoptera, the consequences of hybridization can differ from those in other animals because of haplodiploidy and sociality. Selection pressures differ between sexes because males are haploid and females are diploid. Furthermore, sociality and group living may allow survival of hybrid genotypes. We show that hybridization in Formica ants has resulted in a stable situation in which the males form two highly divergent gene pools whereas all the females are hybrids. This causes an exceptional situation with large-scale differences between male and female genomes. The genotype differences indicate strong transmission ratio distortion depending on offspring sex, whereby the mother transmits some alleles exclusively to her daughters and other alleles exclusively to her sons. The genetic differences between the sexes and the apparent lack of multilocus hybrid genotypes in males can be explained by recessive incompatibilities which cause the elimination of hybrid males because of their haploid genome. Alternatively, differentiation between sexes could be created by prezygotic segregation into male-forming and female-forming gametes in diploid females. Differentiation between sexes is stable and maintained throughout generations. The present study shows a unique outcome of hybridization and demonstrates that hybridization has the potential of generating evolutionary novelties in animals.
What Does it Mean to be Genomically Literate? National Human Genome Research Institute Meeting Report

PubMed Central

Hurle, Belen; Citrin, Toby; Jenkins, Jean F.; Kaphingst, Kimberly A.; Lamb, Neil; Roseman, Jo Ellen; Bonham, Vence L.

2014-01-01

Genomic discoveries will increasingly advance the science of medicine. Limited genomic literacy may adversely impact the public’s understanding and use of the power of genetics and genomics in health care and public health. In November 2011, a meeting was held by the National Human Genome Research Institute to examine the challenge of achieving genomic literacy for the general public, from K-12 to adult education. The role of the media in disseminating scientific messages and in perpetuating, or reducing, misconceptions was also discussed. Workshop participants agreed that genomic literacy will only be achieved through active engagement between genomics experts and the varied constituencies that comprise the public. This report summarizes the background, content, and outcomes from this meeting, including recommendations for a research agenda to inform decisions about how to advance genomic literacy in our society. PMID:23448722
Human evolutionary genomics: ethical and interpretive issues.

PubMed

Vitti, Joseph J; Cho, Mildred K; Tishkoff, Sarah A; Sabeti, Pardis C

2012-03-01

Genome-wide computational studies can now identify targets of natural selection. The unique information about humans these studies reveal, and the media attention they attract, indicate the need for caution and precision in communicating results. This need is exacerbated by ways in which evolutionary and genetic considerations have been misapplied to support discriminatory policies, by persistent misconceptions of these fields and by the social sensitivity surrounding discussions of racial ancestry. We discuss the foundations, accomplishments and future directions of human evolutionary genomics, attending to ways in which the interpretation of good science can go awry, and offer suggestions for researchers to prevent misapplication of their work. Copyright Â© 2011 Elsevier Ltd. All rights reserved.
The Yeast Deletion Collection: A Decade of Functional Genomics

PubMed Central

Giaever, Guri; Nislow, Corey

2014-01-01

The yeast deletion collections comprise >21,000 mutant strains that carry precise start-to-stop deletions of ∼6000 open reading frames. This collection includes heterozygous and homozygous diploids, and haploids of both MATa and MATα mating types. The yeast deletion collection, or yeast knockout (YKO) set, represents the first and only complete, systematically constructed deletion collection available for any organism. Conceived during the Saccharomyces cerevisiae sequencing project, work on the project began in 1998 and was completed in 2002. The YKO strains have been used in numerous laboratories in >1000 genome-wide screens. This landmark genome project has inspired development of numerous genome-wide technologies in organisms from yeast to man. Notable spinoff technologies include synthetic genetic array and HIPHOP chemogenomics. In this retrospective, we briefly describe the yeast deletion project and some of its most noteworthy biological contributions and the impact that these collections have had on the yeast research community and on genomics in general. PMID:24939991

Deorphanizing the human transmembrane genome: A landscape of uncharacterized membrane proteins.

PubMed

Babcock, Joseph J; Li, Min

2014-01-01

The sequencing of the human genome has fueled the last decade of work to functionally characterize genome content. An important subset of genes encodes membrane proteins, which are the targets of many drugs. They reside in lipid bilayers, restricting their endogenous activity to a relatively specialized biochemical environment. Without a reference phenotype, the application of systematic screens to profile candidate membrane proteins is not immediately possible. Bioinformatics has begun to show its effectiveness in focusing the functional characterization of orphan proteins of a particular functional class, such as channels or receptors. Here we discuss integration of experimental and bioinformatics approaches for characterizing the orphan membrane proteome. By analyzing the human genome, a landscape reference for the human transmembrane genome is provided.
The oxytocin receptor gene (OXTR) localizes to human chromosome 3p25 by fluorescence in situ hybridization and PCR analysis of somatic cell hybrids

DOE Office of Scientific and Technical Information (OSTI.GOV)

Simmons, C.F. Jr.; Clancy, T.E.; Quan, R.

1995-04-10

The human oxytocin receptor regulates parturition and myometrial contractility, breast milk let-down, and reproductive behaviors in the mammalian central nervous system. Kimura et al. recently identified a human oxytocin receptor cDNA by means of expression cloning from a human myometrial cDNA library. To elucidate further the molecular mechanisms that regulate oxytocin receptor gene expression and to define the expected Mendelian inheritance of possible human disease states, we must determine the number of genes, their localization, and their organization and structure. We summarize below our data indicating that the human oxytocin receptor gene is localized to 3p25 and exists as amore » single copy in the haploid genome. 9 refs., 2 figs.« less
The spotted gar genome illuminates vertebrate evolution and facilitates human-to-teleost comparisons

PubMed Central

Braasch, Ingo; Gehrke, Andrew R.; Smith, Jeramiah J.; Kawasaki, Kazuhiko; Manousaki, Tereza; Pasquier, Jeremy; Amores, Angel; Desvignes, Thomas; Batzel, Peter; Catchen, Julian; Berlin, Aaron M.; Campbell, Michael S.; Barrell, Daniel; Martin, Kyle J.; Mulley, John F.; Ravi, Vydianathan; Lee, Alison P.; Nakamura, Tetsuya; Chalopin, Domitille; Fan, Shaohua; Wcisel, Dustin; Cañestro, Cristian; Sydes, Jason; Beaudry, Felix E. G.; Sun, Yi; Hertel, Jana; Beam, Michael J.; Fasold, Mario; Ishiyama, Mikio; Johnson, Jeremy; Kehr, Steffi; Lara, Marcia; Letaw, John H.; Litman, Gary W.; Litman, Ronda T.; Mikami, Masato; Ota, Tatsuya; Saha, Nil Ratan; Williams, Louise; Stadler, Peter F.; Wang, Han; Taylor, John S.; Fontenot, Quenton; Ferrara, Allyse; Searle, Stephen M. J.; Aken, Bronwen; Yandell, Mark; Schneider, Igor; Yoder, Jeffrey A.; Volff, Jean-Nicolas; Meyer, Axel; Amemiya, Chris T.; Venkatesh, Byrappa; Holland, Peter W. H.; Guiguen, Yann; Bobe, Julien; Shubin, Neil H.; Di Palma, Federica; Alföldi, Jessica; Lindblad-Toh, Kerstin; Postlethwait, John H.

2016-01-01

To connect human biology to fish biomedical models, we sequenced the genome of spotted gar (Lepisosteus oculatus), whose lineage diverged from teleosts before the teleost genome duplication (TGD). The slowly evolving gar genome conserved in content and size many entire chromosomes from bony vertebrate ancestors. Gar bridges teleosts to tetrapods by illuminating the evolution of immunity, mineralization, and development (e.g., Hox, ParaHox, and miRNA genes). Numerous conserved non-coding elements (CNEs, often cis-regulatory) undetectable in direct human-teleost comparisons become apparent using gar: functional studies uncovered conserved roles of such cryptic CNEs, facilitating annotation of sequences identified in human genome-wide association studies. Transcriptomic analyses revealed that the sum of expression domains and levels from duplicated teleost genes often approximate patterns and levels of gar genes, consistent with subfunctionalization. The gar genome provides a resource for understanding evolution after genome duplication, the origin of vertebrate genomes, and the function of human regulatory sequences. PMID:26950095
A draft annotation and overview of the human genome

PubMed Central

Wright, Fred A; Lemon, William J; Zhao, Wei D; Sears, Russell; Zhuo, Degen; Wang, Jian-Ping; Yang, Hee-Yung; Baer, Troy; Stredney, Don; Spitzner, Joe; Stutz, Al; Krahe, Ralf; Yuan, Bo

2001-01-01

Background The recent draft assembly of the human genome provides a unified basis for describing genomic structure and function. The draft is sufficiently accurate to provide useful annotation, enabling direct observations of previously inferred biological phenomena. Results We report here a functionally annotated human gene index placed directly on the genome. The index is based on the integration of public transcript, protein, and mapping information, supplemented with computational prediction. We describe numerous global features of the genome and examine the relationship of various genetic maps with the assembly. In addition, initial sequence analysis reveals highly ordered chromosomal landscapes associated with paralogous gene clusters and distinct functional compartments. Finally, these annotation data were synthesized to produce observations of gene density and number that accord well with historical estimates. Such a global approach had previously been described only for chromosomes 21 and 22, which together account for 2.2% of the genome. Conclusions We estimate that the genome contains 65,000-75,000 transcriptional units, with exon sequences comprising 4%. The creation of a comprehensive gene index requires the synthesis of all available computational and experimental evidence. PMID:11516338
Repetitive Elements May Comprise Over Two-Thirds of the Human Genome

PubMed Central

de Koning, A. P. Jason; Gu, Wanjun; Castoe, Todd A.; Batzer, Mark A.; Pollock, David D.

2011-01-01

Transposable elements (TEs) are conventionally identified in eukaryotic genomes by alignment to consensus element sequences. Using this approach, about half of the human genome has been previously identified as TEs and low-complexity repeats. We recently developed a highly sensitive alternative de novo strategy, P-clouds, that instead searches for clusters of high-abundance oligonucleotides that are related in sequence space (oligo “clouds”). We show here that P-clouds predicts >840 Mbp of additional repetitive sequences in the human genome, thus suggesting that 66%–69% of the human genome is repetitive or repeat-derived. To investigate this remarkable difference, we conducted detailed analyses of the ability of both P-clouds and a commonly used conventional approach, RepeatMasker (RM), to detect different sized fragments of the highly abundant human Alu and MIR SINEs. RM can have surprisingly low sensitivity for even moderately long fragments, in contrast to P-clouds, which has good sensitivity down to small fragment sizes (∼25 bp). Although short fragments have a high intrinsic probability of being false positives, we performed a probabilistic annotation that reflects this fact. We further developed “element-specific” P-clouds (ESPs) to identify novel Alu and MIR SINE elements, and using it we identified ∼100 Mb of previously unannotated human elements. ESP estimates of new MIR sequences are in good agreement with RM-based predictions of the amount that RM missed. These results highlight the need for combined, probabilistic genome annotation approaches and suggest that the human genome consists of substantially more repetitive sequence than previously believed. PMID:22144907
Breeding of a xylose-fermenting hybrid strain by mating genetically engineered haploid strains derived from industrial Saccharomyces cerevisiae.

PubMed

Inoue, Hiroyuki; Hashimoto, Seitaro; Matsushika, Akinori; Watanabe, Seiya; Sawayama, Shigeki

2014-12-01

The industrial Saccharomyces cerevisiae IR-2 is a promising host strain to genetically engineer xylose-utilizing yeasts for ethanol fermentation from lignocellulosic hydrolysates. Two IR-2-based haploid strains were selected based upon the rate of xylulose fermentation, and hybrids were obtained by mating recombinant haploid strains harboring heterogeneous xylose dehydrogenase (XDH) (wild-type NAD(+)-dependent XDH or engineered NADP(+)-dependent XDH, ARSdR), xylose reductase (XR) and xylulose kinase (XK) genes. ARSdR in the hybrids selected for growth rates on yeast extract-peptone-dextrose (YPD) agar and YP-xylose agar plates typically had a higher activity than NAD(+)-dependent XDH. Furthermore, the xylose-fermenting performance of the hybrid strain SE12 with the same level of heterogeneous XDH activity was similar to that of a recombinant strain of IR-2 harboring a single set of genes, XR/ARSdR/XK. These results suggest not only that the recombinant haploid strains retain the appropriate genetic background of IR-2 for ethanol production from xylose but also that ARSdR is preferable for xylose fermentation.
Human genome and open source: balancing ethics and business.

PubMed

Marturano, Antonio

2011-01-01

The Human Genome Project has been completed thanks to a massive use of computer techniques, as well as the adoption of the open-source business and research model by the scientists involved. This model won over the proprietary model and allowed a quick propagation and feedback of research results among peers. In this paper, the author will analyse some ethical and legal issues emerging by the use of such computer model in the Human Genome property rights. The author will argue that the Open Source is the best business model, as it is able to balance business and human rights perspectives.
Recent advances in understanding the role of nutrition in human genome evolution.

PubMed

Ye, Kaixiong; Gu, Zhenglong

2011-11-01

Dietary transitions in human history have been suggested to play important roles in the evolution of mankind. Genetic variations caused by adaptation to diet during human evolution could have important health consequences in current society. The advance of sequencing technologies and the rapid accumulation of genome information provide an unprecedented opportunity to comprehensively characterize genetic variations in human populations and unravel the genetic basis of human evolution. Series of selection detection methods, based on various theoretical models and exploiting different aspects of selection signatures, have been developed. Their applications at the species and population levels have respectively led to the identification of human specific selection events that distinguish human from nonhuman primates and local adaptation events that contribute to human diversity. Scrutiny of candidate genes has revealed paradigms of adaptations to specific nutritional components and genome-wide selection scans have verified the prevalence of diet-related selection events and provided many more candidates awaiting further investigation. Understanding the role of diet in human evolution is fundamental for the development of evidence-based, genome-informed nutritional practices in the era of personal genomics.
[Genetic system for maintaining the mitochondrial human genome in yeast Yarrowia lipolytica].

PubMed

Isakova, E P; Deryabina, Yu I; Velyakova, A V; Biryukova, J K; Teplova, V V; Shevelev, A B

2016-01-01

For the first time, the possibility of maintaining an intact human mitochondrial genome in a heterologous system in the mitochondria of yeast Yarrowia lipolytica is shown. A method for introducing directional changes into the structure of the mitochondrial human genome replicating in Y. lipolytica by an artificially induced ability of yeast mitochondria for homologous recombination is proposed. A method of introducing and using phenotypic selection markers for the presence or absence of defects in genes tRNA-Lys and tRNA-Leu of the mitochondrial genome is developed. The proposed system can be used to correct harmful mutations of the human mitochondrial genome associated with mitochondrial diseases and for preparative amplification of intact mitochondrial DNA with an adjusted sequence in yeast cells. The applicability of the new system for the correction of mutations in the genes of Lys- and Leu-specific tRNAs of the human mitochondrial genome associated with serious and widespread human mitochondrial diseases such as myoclonic epilepsy with lactic acidosis (MELAS) and myoclonic epilepsy with ragged-red fibers (MERRF) is shown.
Genome-wide scans for loci under selection in humans

PubMed Central

2005-01-01

Natural selection, which can be defined as the differential contribution of genetic variants to future generations, is the driving force of Darwinian evolution. Identifying regions of the human genome that have been targets of natural selection is an important step in clarifying human evolutionary history and understanding how genetic variation results in phenotypic diversity, it may also facilitate the search for complex disease genes. Technological advances in high-throughput DNA sequencing and single nucleotide polymorphism genotyping have enabled several genome-wide scans of natural selection to be undertaken. Here, some of the observations that are beginning to emerge from these studies will be reviewed, including evidence for geographically restricted selective pressures (ie local adaptation) and a relationship between genes subject to natural selection and human disease. In addition, the paper will highlight several important problems that need to be addressed in future genome-wide studies of natural selection. PMID:16004726
Human genomic disease variants: a neutral evolutionary explanation.

PubMed

Dudley, Joel T; Kim, Yuseob; Liu, Li; Markov, Glenn J; Gerold, Kristyn; Chen, Rong; Butte, Atul J; Kumar, Sudhir

2012-08-01

Many perspectives on the role of evolution in human health include nonempirical assumptions concerning the adaptive evolutionary origins of human diseases. Evolutionary analyses of the increasing wealth of clinical and population genomic data have begun to challenge these presumptions. In order to systematically evaluate such claims, the time has come to build a common framework for an empirical and intellectual unification of evolution and modern medicine. We review the emerging evidence and provide a supporting conceptual framework that establishes the classical neutral theory of molecular evolution (NTME) as the basis for evaluating disease- associated genomic variations in health and medicine. For over a decade, the NTME has already explained the origins and distribution of variants implicated in diseases and has illuminated the power of evolutionary thinking in genomic medicine. We suggest that a majority of disease variants in modern populations will have neutral evolutionary origins (previously neutral), with a relatively smaller fraction exhibiting adaptive evolutionary origins (previously adaptive). This pattern is expected to hold true for common as well as rare disease variants. Ultimately, a neutral evolutionary perspective will provide medicine with an informative and actionable framework that enables objective clinical assessment beyond convenient tendencies to invoke past adaptive events in human history as a root cause of human disease.
Human genomic disease variants: A neutral evolutionary explanation

PubMed Central

Dudley, Joel T.; Kim, Yuseob; Liu, Li; Markov, Glenn J.; Gerold, Kristyn; Chen, Rong; Butte, Atul J.; Kumar, Sudhir

2012-01-01

Many perspectives on the role of evolution in human health include nonempirical assumptions concerning the adaptive evolutionary origins of human diseases. Evolutionary analyses of the increasing wealth of clinical and population genomic data have begun to challenge these presumptions. In order to systematically evaluate such claims, the time has come to build a common framework for an empirical and intellectual unification of evolution and modern medicine. We review the emerging evidence and provide a supporting conceptual framework that establishes the classical neutral theory of molecular evolution (NTME) as the basis for evaluating disease- associated genomic variations in health and medicine. For over a decade, the NTME has already explained the origins and distribution of variants implicated in diseases and has illuminated the power of evolutionary thinking in genomic medicine. We suggest that a majority of disease variants in modern populations will have neutral evolutionary origins (previously neutral), with a relatively smaller fraction exhibiting adaptive evolutionary origins (previously adaptive). This pattern is expected to hold true for common as well as rare disease variants. Ultimately, a neutral evolutionary perspective will provide medicine with an informative and actionable framework that enables objective clinical assessment beyond convenient tendencies to invoke past adaptive events in human history as a root cause of human disease. PMID:22665443
O father where art thou? Paternity analyses in a natural population of the haploid-diploid seaweed Chondrus crispus.

PubMed

Krueger-Hadfield, S A; Roze, D; Correa, J A; Destombe, C; Valero, M

2015-02-01

The link between life history traits and mating systems in diploid organisms has been extensively addressed in the literature, whereas the degree of selfing and/or inbreeding in natural populations of haploid-diploid organisms, in which haploid gametophytes alternate with diploid sporophytes, has been rarely measured. Dioecy has often been used as a proxy for the mating system in these organisms. Yet, dioecy does not prevent the fusion of gametes from male and female gametophytes originating from the same sporophyte. This is likely a common occurrence when spores from the same parent are dispersed in clumps and recruit together. This pattern of clumped spore dispersal has been hypothesized to explain significant heterozygote deficiency in the dioecious haploid-diploid seaweed Chondrus crispus. Fronds and cystocarps (structures in which zygotes are mitotically amplified) were sampled in two 25 m(2) plots located within a high and a low intertidal zone and genotyped at 5 polymorphic microsatellite loci in order to explore the mating system directly using paternity analyses. Multiple males sired cystocarps on each female, but only one of the 423 paternal genotypes corresponded to a field-sampled gametophyte. Nevertheless, larger kinship coefficients were detected between males siring cystocarps on the same female in comparison with males in the entire population, confirming restricted spermatial and clumped spore dispersal. Such dispersal mechanisms may be a mode of reproductive assurance due to nonmotile gametes associated with putatively reduced effects of inbreeding depression because of the free-living haploid stage in C. crispus.
Insights into the Dekkera bruxellensis Genomic Landscape: Comparative Genomics Reveals Variations in Ploidy and Nutrient Utilisation Potential amongst Wine Isolates

PubMed Central

Borneman, Anthony R.; Zeppel, Ryan; Chambers, Paul J.; Curtin, Chris D.

2014-01-01

The yeast Dekkera bruxellensis is a major contaminant of industrial fermentations, such as those used for the production of biofuel and wine, where it outlasts and, under some conditions, outcompetes the major industrial yeast Saccharomyces cerevisiae. In order to investigate the level of inter-strain variation that is present within this economically important species, the genomes of four diverse D. bruxellensis isolates were compared. While each of the four strains was shown to contain a core diploid genome, which is clearly sufficient for survival, two of the four isolates have a third haploid complement of chromosomes. The sequences of these additional haploid genomes were both highly divergent from those comprising the diploid core and divergent between the two triploid strains. Similar to examples in the Saccharomyces spp. clade, where some allotriploids have arisen on the basis of enhanced ability to survive a range of environmental conditions, it is likely these strains are products of two independent hybridisation events that may have involved multiple species or distinct sub-species of Dekkera. Interestingly these triploid strains represent the vast majority (92%) of isolates from across the Australian wine industry, suggesting that the additional set of chromosomes may confer a selective advantage in winery environments that has resulted in these hybrid strains all-but replacing their diploid counterparts in Australian winery settings. In addition to the apparent inter-specific hybridisation events, chromosomal aberrations such as strain-specific insertions and deletions and loss-of-heterozygosity by gene conversion were also commonplace. While these events are likely to have affected many phenotypes across these strains, we have been able to link a specific deletion to the inability to utilise nitrate by some strains of D. bruxellensis, a phenotype that may have direct impacts in the ability for these strains to compete with S. cerevisiae. PMID:24550744
Learning about human population history from ancient and modern genomes.

PubMed

Stoneking, Mark; Krause, Johannes

2011-08-18

Genome-wide data, both from SNP arrays and from complete genome sequencing, are becoming increasingly abundant and are now even available from extinct hominins. These data are providing new insights into population history; in particular, when combined with model-based analytical approaches, genome-wide data allow direct testing of hypotheses about population history. For example, genome-wide data from both contemporary populations and extinct hominins strongly support a single dispersal of modern humans from Africa, followed by two archaic admixture events: one with Neanderthals somewhere outside Africa and a second with Denisovans that (so far) has only been detected in New Guinea. These new developments promise to reveal new stories about human population history, without having to resort to storytelling.
76 FR 65738 - National Human Genome Research Institute; Amended Notice of Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2011-10-24

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Amended Notice of Meeting Notice is hereby given of a change in the meeting of the National Human Genome Research Institute Special Emphasis Panel, November 29, 2011, 8 a.m. to November 29...
77 FR 67385 - National Human Genome Research Institute; Amended Notice of Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2012-11-09

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Amended Notice of Meeting Notice is hereby given of a change in the meeting of the National Human Genome Research Institute Special Emphasis Panel, October 29, 2012, 8:00 a.m. to October 30...
78 FR 65342 - National Human Genome Research Institute; Amended Notice of Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2013-10-31

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Amended Notice of Meeting Notice is hereby given of a change in the meeting of the National Human Genome Research Institute Special Emphasis Panel, October 17, 2013, 08:00 a.m. to October 17...
76 FR 71581 - National Human Genome Research Institute; Amended Notice of Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2011-11-18

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Amended Notice of Meeting Notice is hereby given of a change in the meeting of the National Human Genome Research Institute Special Emphasis Panel, November 22, 2011, 12 p.m. to November 22...
Reconstruction of genome-scale human metabolic models using omics data.

PubMed

Ryu, Jae Yong; Kim, Hyun Uk; Lee, Sang Yup

2015-08-01

The impact of genome-scale human metabolic models on human systems biology and medical sciences is becoming greater, thanks to increasing volumes of model building platforms and publicly available omics data. The genome-scale human metabolic models started with Recon 1 in 2007, and have since been used to describe metabolic phenotypes of healthy and diseased human tissues and cells, and to predict therapeutic targets. Here we review recent trends in genome-scale human metabolic modeling, including various generic and tissue/cell type-specific human metabolic models developed to date, and methods, databases and platforms used to construct them. For generic human metabolic models, we pay attention to Recon 2 and HMR 2.0 with emphasis on data sources used to construct them. Draft and high-quality tissue/cell type-specific human metabolic models have been generated using these generic human metabolic models. Integration of tissue/cell type-specific omics data with the generic human metabolic models is the key step, and we discuss omics data and their integration methods to achieve this task. The initial version of the tissue/cell type-specific human metabolic models can further be computationally refined through gap filling, reaction directionality assignment and the subcellular localization of metabolic reactions. We review relevant tools for this model refinement procedure as well. Finally, we suggest the direction of further studies on reconstructing an improved human metabolic model.

Application of toxicogenomic profiling to evaluate effects of benzene and formaldehyde: from yeast to human

PubMed Central

McHale, Cliona M.; Smith, Martyn T.; Zhang, Luoping

2014-01-01

Genetic variation underlies a significant proportion of the individual variation in human susceptibility to toxicants. The primary current approaches to identify gene–environment (GxE) associations, genome-wide association studies (GWAS) and candidate gene association studies, require large exposed and control populations and an understanding of toxicity genes and pathways, respectively. This limits their application in the study of GxE associations for the leukemogens benzene and formaldehyde, whose toxicity has long been a focus of our research. As an alternative approach, we applied innovative in vitro functional genomics testing systems, including unbiased functional screening assays in yeast and a near-haploid human bone marrow cell line (KBM7). Through comparative genomic and computational analyses of the resulting data, we have identified human genes and pathways that may modulate susceptibility to benzene and formaldehyde. We have validated the roles of several genes in mammalian cell models. In populations occupationally exposed to low levels of benzene, we applied peripheral blood mononuclear cell transcriptomics and chromosome-wide aneuploidy studies (CWAS) in lymphocytes. In this review of the literature, we describe our comprehensive toxicogenomic approach and the potential mechanisms of toxicity and susceptibility genes identified for benzene and formaldehyde, as well as related studies conducted by other researchers. PMID:24571325
The Human Genome Initiative: First Steps.

ERIC Educational Resources Information Center

Newman, Alan R.

1990-01-01

Described is the basic biology involved in mapping chromosomes as presented at a symposium at a recent meeting of the American Chemical Association which focused on the Human Genome Initiative. Different types of gene maps and techniques used to produce gene maps are discussed. (CW)
National human genome projects: an update and an agenda.

PubMed

An, Joon Yong

2017-01-01

Population genetic and human genetic studies are being accelerated with genome technology and data sharing. Accordingly, in the past 10 years, several countries have initiated genetic research using genome technology and identified the genetic architecture of the ethnic groups living in the corresponding country or suggested the genetic foundation of a social phenomenon. Genetic research has been conducted from epidemiological studies that previously described the health or disease conditions in defined population. This perspective summarizes national genome projects conducted in the past 10 years and introduces case studies to utilize genomic data in genetic research.
The effects of quantitative fecundity in the haploid stage on reproductive success and diploid fitness in the aquatic peat moss Sphagnum macrophyllum

PubMed Central

Johnson, M G; Shaw, A J

2016-01-01

A major question in evolutionary biology is how mating patterns affect the fitness of offspring. However, in animals and seed plants it is virtually impossible to investigate the effects of specific gamete genotypes. In bryophytes, haploid gametophytes grow via clonal propagation and produce millions of genetically identical gametes throughout a population. The main goal of this research was to test whether gamete identity has an effect on the fitness of their diploid offspring in a population of the aquatic peat moss Sphagnum macrophyllum. We observed a heavily male-biased sex ratio in gametophyte plants (ramets) and in multilocus microsatellite genotypes (genets). There was a steeper relationship between mating success (number of different haploid mates) and fecundity (number of diploid offspring) for male genets compared with female genets. At the sporophyte level, we observed a weak effect of inbreeding on offspring fitness, but no effect of brood size (number of sporophytes per maternal ramet). Instead, the identities of the haploid male and haploid female parents were significant contributors to variance in fitness of sporophyte offspring in the population. Our results suggest that intrasexual gametophyte/gamete competition may play a role in determining mating success in this population. PMID:26905464
The effects of quantitative fecundity in the haploid stage on reproductive success and diploid fitness in the aquatic peat moss Sphagnum macrophyllum.

PubMed

Johnson, M G; Shaw, A J

2016-06-01

A major question in evolutionary biology is how mating patterns affect the fitness of offspring. However, in animals and seed plants it is virtually impossible to investigate the effects of specific gamete genotypes. In bryophytes, haploid gametophytes grow via clonal propagation and produce millions of genetically identical gametes throughout a population. The main goal of this research was to test whether gamete identity has an effect on the fitness of their diploid offspring in a population of the aquatic peat moss Sphagnum macrophyllum. We observed a heavily male-biased sex ratio in gametophyte plants (ramets) and in multilocus microsatellite genotypes (genets). There was a steeper relationship between mating success (number of different haploid mates) and fecundity (number of diploid offspring) for male genets compared with female genets. At the sporophyte level, we observed a weak effect of inbreeding on offspring fitness, but no effect of brood size (number of sporophytes per maternal ramet). Instead, the identities of the haploid male and haploid female parents were significant contributors to variance in fitness of sporophyte offspring in the population. Our results suggest that intrasexual gametophyte/gamete competition may play a role in determining mating success in this population.
77 FR 55853 - National Human Genome Research Institute; Amended Notice of Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2012-09-11

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Amended Notice of Meeting Notice is hereby given of a change in the meeting of the National Advisory Council for Human Genome Research, September 10, 2012, 8:30 a.m. to September 11, 2012, 5...
Phenotypic diversification by enhanced genome restructuring after induction of multiple DNA double-strand breaks.

PubMed

Muramoto, Nobuhiko; Oda, Arisa; Tanaka, Hidenori; Nakamura, Takahiro; Kugou, Kazuto; Suda, Kazuki; Kobayashi, Aki; Yoneda, Shiori; Ikeuchi, Akinori; Sugimoto, Hiroki; Kondo, Satoshi; Ohto, Chikara; Shibata, Takehiko; Mitsukawa, Norihiro; Ohta, Kunihiro

2018-05-18

DNA double-strand break (DSB)-mediated genome rearrangements are assumed to provide diverse raw genetic materials enabling accelerated adaptive evolution; however, it remains unclear about the consequences of massive simultaneous DSB formation in cells and their resulting phenotypic impact. Here, we establish an artificial genome-restructuring technology by conditionally introducing multiple genomic DSBs in vivo using a temperature-dependent endonuclease TaqI. Application in yeast and Arabidopsis thaliana generates strains with phenotypes, including improved ethanol production from xylose at higher temperature and increased plant biomass, that are stably inherited to offspring after multiple passages. High-throughput genome resequencing revealed that these strains harbor diverse rearrangements, including copy number variations, translocations in retrotransposons, and direct end-joinings at TaqI-cleavage sites. Furthermore, large-scale rearrangements occur frequently in diploid yeasts (28.1%) and tetraploid plants (46.3%), whereas haploid yeasts and diploid plants undergo minimal rearrangement. This genome-restructuring system (TAQing system) will enable rapid genome breeding and aid genome-evolution studies.
Multimedia presentations on the human genome: Implementation and assessment of a teaching program for the introduction to genome science using a poster and animations.

PubMed

Kano, Kei; Yahata, Saiko; Muroi, Kaori; Kawakami, Masahiro; Tomoda, Mari; Miyaki, Koichi; Nakayama, Takeo; Kosugi, Shinji; Kato, Kazuto

2008-11-01

Genome science, including topics such as gene recombination, cloning, genetic tests, and gene therapy, is now an established part of our daily lives; thus we need to learn genome science to better equip ourselves for the present day. Learning from topics directly related to the human has been suggested to be more effective than learning from Mendel's peas not only because many students do not understand that plants are organisms, but also because human biology contains important social and health issues. Therefore, we have developed a teaching program for the introduction to genome science, whose subjects are focused on the human genome. This program comprises mixed multimedia presentations: a large poster with illustrations and text on the human genome (a human genome map for every home), and animations on the basics of genome science. We implemented and assessed this program at four high schools. Our results indicate that students felt that they learned about the human genome from the program and some increases in students' understanding were observed with longer exposure to the mixed multimedia presentations. Copyright © 2008 International Union of Biochemistry and Molecular Biology, Inc.
A periodic pattern of SNPs in the human genome

PubMed Central

Madsen, Bo Eskerod; Villesen, Palle; Wiuf, Carsten

2007-01-01

By surveying a filtered, high-quality set of SNPs in the human genome, we have found that SNPs positioned 1, 2, 4, 6, or 8 bp apart are more frequent than SNPs positioned 3, 5, 7, or 9 bp apart. The observed pattern is not restricted to genomic regions that are known to cause sequencing or alignment errors, for example, transposable elements (SINE, LINE, and LTR), tandem repeats, and large duplicated regions. However, we found that the pattern is almost entirely confined to what we define as “periodic DNA.” Periodic DNA is a genomic region with a high degree of periodicity in nucleotide usage. It turned out that periodic DNA is mainly small regions (average length 16.9 bp), widely distributed in the genome. Furthermore, periodic DNA has a 1.8 times higher SNP density than the rest of the genome and SNPs inside periodic DNA have a significantly higher genotyping error rate than SNPs outside periodic DNA. Our results suggest that not all SNPs in the human genome are created by independent single nucleotide mutations, and that care should be taken in analysis of SNPs from periodic DNA. The latter may have important consequences for SNP and association studies. PMID:17673700
Genome size of termites (Insecta, Dictyoptera, Isoptera) and wood roaches (Insecta, Dictyoptera, Cryptocercidae)

NASA Astrophysics Data System (ADS)

Koshikawa, Shigeyuki; Miyazaki, Satoshi; Cornette, Richard; Matsumoto, Tadao; Miura, Toru

2008-09-01

The evolution of genome size has been discussed in relation to the evolution of various biological traits. In the present study, the genome sizes of 22 dictyopteran species were estimated by Feulgen image analysis densitometry and 6-diamidino-2-phenylindole (DAPI)-based flow cytometry. The haploid genome sizes ( C-values) of termites (Isoptera) ranged from 0.58 to 1.90 pg, and those of Cryptocercus wood roaches (Cryptocercidae) were 1.16 to 1.32 pg. Compared to known values of other cockroaches (Blattaria) and mantids (Mantodea), these values are low. A relatively small genome size appears to be a (syn)apomorphy of Isoptera + Cryptocercus, together with their sociality. In some phylogenetic groups, genome size evolution is thought to be influenced by selective pressure on a particular trait, such as cell size or rate of development. The present results raise the possibility that genome size is influenced by selective pressures on traits associated with the evolution of sociality.
Genome size of termites (Insecta, Dictyoptera, Isoptera) and wood roaches (Insecta, Dictyoptera, Cryptocercidae).

PubMed

Koshikawa, Shigeyuki; Miyazaki, Satoshi; Cornette, Richard; Matsumoto, Tadao; Miura, Toru

2008-09-01

The evolution of genome size has been discussed in relation to the evolution of various biological traits. In the present study, the genome sizes of 22 dictyopteran species were estimated by Feulgen image analysis densitometry and 6-diamidino-2-phenylindole (DAPI)-based flow cytometry. The haploid genome sizes (C-values) of termites (Isoptera) ranged from 0.58 to 1.90 pg, and those of Cryptocercus wood roaches (Cryptocercidae) were 1.16 to 1.32 pg. Compared to known values of other cockroaches (Blattaria) and mantids (Mantodea), these values are low. A relatively small genome size appears to be a (syn)apomorphy of Isoptera + Cryptocercus, together with their sociality. In some phylogenetic groups, genome size evolution is thought to be influenced by selective pressure on a particular trait, such as cell size or rate of development. The present results raise the possibility that genome size is influenced by selective pressures on traits associated with the evolution of sociality.
New Markers for Predicting Fertility of the Male Gametes in the Post Genomic Age.

PubMed

Dipresa, Savina; De Toni, Luca; Foresta, Carlo; Garolla, Andrea

2018-04-18

A number of test have been proposed to assess male fertility potential, ranging from routine testing by light microscopic method for evaluating semen samples, to screening test for DNA integrity aimed to look at sperm chromatin abnormalities. Spermatozoa are an extremely differentiated cell, they have critical functions for embryo development and heredity, in addiction to delivering a haploid paternal genome to the oocyte. Towards this goal certain requirements must always be met. The ability of spermatozoa to perform its reproductive function taking place in the spermatogenesis, a highly specialized process depending on multiple factors with effect on male fertility. In the past 30 years, large-scale analyses of transcriptomic and genome expression in mammals have generated a large amount of informations on numberless biomolecules involved in spermatogenesis and male germ cell reproductive function. Sperm proteome represents the protein content that spermatozoa needs to survive and work correctly and modifications of sperm proteome play a role in determining functional changes leading to a decrease of reproductive competence into affected spermatozoa. The post-genomic approach consists of different methodologies for concurrently testicular transcriptome studies, protein compositional analysis and metabolomics findings of the spermatozoa in humans. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Production of haploid plantlets in anther cultures of Albizzia lebbeck L.

PubMed

Gharyal, P K; Rashid, A; Maheshwari, S C

1983-12-01

Anthers of Albizzia lebbeck on B5 medium (BM) supplemented with kinetin (2 mg/l) and 2, 4-D (0.5 mg/l) showed callus initiation from microspores. Differentiation of embryoids and shoots was obtained on BM + BAP (1 mg/l) + IAA (0.5 mg/l) and of roots on BM. Root tip squashes of the regenerated plantlets showed the haploid chromosome number (n=13), confirming the microspore origin of the regenerants.
Crossed wires: 3D genome misfolding in human disease.

PubMed

Norton, Heidi K; Phillips-Cremins, Jennifer E

2017-11-06

Mammalian genomes are folded into unique topological structures that undergo precise spatiotemporal restructuring during healthy development. Here, we highlight recent advances in our understanding of how the genome folds inside the 3D nucleus and how these folding patterns are miswired during the onset and progression of mammalian disease states. We discuss potential mechanisms underlying the link among genome misfolding, genome dysregulation, and aberrant cellular phenotypes. We also discuss cases in which the endogenous 3D genome configurations in healthy cells might be particularly susceptible to mutation or translocation. Together, these data support an emerging model in which genome folding and misfolding is critically linked to the onset and progression of a broad range of human diseases. © 2017 Norton and Phillips-Cremins.
77 FR 27471 - National Human Genome Research Institute Amended Notice of Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2012-05-10

... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome Research Institute Amended Notice of Meeting Notice is hereby given of a change in the meeting of the National Advisory Council for Human Genome Research, May 21, 2012, 8:30 a.m. to May 22, 2012, 5:00 p.m...
Attitudes towards the Human Genome Project.

ERIC Educational Resources Information Center

Shahroudi, Julie; Shaw, Geraldine

Attitudes concerning the Human Genome Project were reported by faculty (N=40) and students (N=66) from a liberal arts college. Positive attitudes toward the project involved privacy, insurance and health, economic purposes, reproductive purposes, genetic counseling, religion and overall opinions. Negative attitudes were expressed regarding…
Human Genomic Loci Important in Common Infectious Diseases: Role of High-Throughput Sequencing and Genome-Wide Association Studies

PubMed Central

Sserwadda, Ivan; Amujal, Marion; Namatovu, Norah

2018-01-01

HIV/AIDS, tuberculosis (TB), and malaria are 3 major global public health threats that undermine development in many resource-poor settings. Recently, the notion that positive selection during epidemics or longer periods of exposure to common infectious diseases may have had a major effect in modifying the constitution of the human genome is being interrogated at a large scale in many populations around the world. This positive selection from infectious diseases increases power to detect associations in genome-wide association studies (GWASs). High-throughput sequencing (HTS) has transformed both the management of infectious diseases and continues to enable large-scale functional characterization of host resistance/susceptibility alleles and loci; a paradigm shift from single candidate gene studies. Application of genome sequencing technologies and genomics has enabled us to interrogate the host-pathogen interface for improving human health. Human populations are constantly locked in evolutionary arms races with pathogens; therefore, identification of common infectious disease-associated genomic variants/markers is important in therapeutic, vaccine development, and screening susceptible individuals in a population. This review describes a range of host-pathogen genomic loci that have been associated with disease susceptibility and resistant patterns in the era of HTS. We further highlight potential opportunities for these genetic markers. PMID:29755620
The dynamics of genome replication using deep sequencing

PubMed Central

Müller, Carolin A.; Hawkins, Michelle; Retkute, Renata; Malla, Sunir; Wilson, Ray; Blythe, Martin J.; Nakato, Ryuichiro; Komata, Makiko; Shirahige, Katsuhiko; de Moura, Alessandro P.S.; Nieduszynski, Conrad A.

2014-01-01

Eukaryotic genomes are replicated from multiple DNA replication origins. We present complementary deep sequencing approaches to measure origin location and activity in Saccharomyces cerevisiae. Measuring the increase in DNA copy number during a synchronous S-phase allowed the precise determination of genome replication. To map origin locations, replication forks were stalled close to their initiation sites; therefore, copy number enrichment was limited to origins. Replication timing profiles were generated from asynchronous cultures using fluorescence-activated cell sorting. Applying this technique we show that the replication profiles of haploid and diploid cells are indistinguishable, indicating that both cell types use the same cohort of origins with the same activities. Finally, increasing sequencing depth allowed the direct measure of replication dynamics from an exponentially growing culture. This is the first time this approach, called marker frequency analysis, has been successfully applied to a eukaryote. These data provide a high-resolution resource and methodological framework for studying genome biology. PMID:24089142
Mapping and annotating obesity-related genes in pig and human genomes.

PubMed

Martelli, Pier Luigi; Fontanesi, Luca; Piovesan, Damiano; Fariselli, Piero; Casadio, Rita

2014-01-01

Background. Obesity is a major health problem in both developed and emerging countries. Obesity is a complex disease whose etiology involves genetic factors in strong interplay with environmental determinants and lifestyle. The discovery of genetic factors and biological pathways underlying human obesity is hampered by the difficulty in controlling the genetic background of human cohorts. Animal models are then necessary to further dissect the genetics of obesity. Pig has emerged as one of the most attractive models, because of the similarity with humans in the mechanisms regulating the fat deposition. Results. We collected the genes related to obesity in humans and to fat deposition traits in pig. We localized them on both human and pig genomes, building a map useful to interpret comparative studies on obesity. We characterized the collected genes structurally and functionally with BAR+ and mapped them on KEGG pathways and on STRING protein interaction network. Conclusions. The collected set consists of 361 obesity related genes in human and pig genomes. All genes were mapped on the human genome, and 54 could not be localized on the pig genome (release 2012). Only for 3 human genes there is no counterpart in pig, confirming that this animal is a good model for human obesity studies. Obesity related genes are mostly involved in regulation and signaling processes/pathways and relevant connection emerges between obesity-related genes and diseases such as cancer and infectious diseases.
Genomics and the Ark: an ecocentric perspective on human history.

PubMed

Zwart, Hub; Penders, Bart

2011-01-01

Views of ourselves in relationship to the rest of the biosphere are changing. Theocentric and anthropocentric perspectives are giving way to more ecocentric views on the history, present, and future of humankind. Novel sciences, such as genomics, have deepened and broadened our understanding of the process of anthropogenesis, the coming into being of humans. Genomics suggests that early human history must be regarded as a complex narrative of evolving ecosystems, in which human evolution both influenced and was influenced by the evolution of companion species. During the agricultural revolution, human beings designed small-scale artificial ecosystems or evolutionary "Arks," in which networks of plants, animals, and microorganisms coevolved. Currently, our attitude towards this process seems subject to a paradoxical reversal. The boundaries of the Ark have dramatically broadened, and genomics is not only being used to increase our understanding of our ecological past, but may also help us to conserve, reconstruct, or even revivify species and ecosystems to whose degradation or (near) extinction we have contributed. This article explores the role of genomics in the elaboration of a more ecocentric view of ourselves with the help of two examples, namely the renaissance of Paleolithic diets and of Pleistocene parks. It argues that an understanding of the world in ecocentric terms requires new partnerships and mutually beneficial forms of collaboration and convergence between life sciences, social sciences, and the humanities.

CRISPR Genome Engineering for Human Pluripotent Stem Cell Research

PubMed Central

Chaterji, Somali; Ahn, Eun Hyun; Kim, Deok-Ho

2017-01-01

The emergence of targeted and efficient genome editing technologies, such as repurposed bacterial programmable nucleases (e.g., CRISPR-Cas systems), has abetted the development of cell engineering approaches. Lessons learned from the development of RNA-interference (RNA-i) therapies can spur the translation of genome editing, such as those enabling the translation of human pluripotent stem cell engineering. In this review, we discuss the opportunities and the challenges of repurposing bacterial nucleases for genome editing, while appreciating their roles, primarily at the epigenomic granularity. First, we discuss the evolution of high-precision, genome editing technologies, highlighting CRISPR-Cas9. They exist in the form of programmable nucleases, engineered with sequence-specific localizing domains, and with the ability to revolutionize human stem cell technologies through precision targeting with greater on-target activities. Next, we highlight the major challenges that need to be met prior to bench-to-bedside translation, often learning from the path-to-clinic of complementary technologies, such as RNA-i. Finally, we suggest potential bioinformatics developments and CRISPR delivery vehicles that can be deployed to circumvent some of the challenges confronting genome editing technologies en route to the clinic. PMID:29158838
Primer on Molecular Genetics; DOE Human Genome Program

DOE R&D Accomplishments Database

1992-04-01

This report is taken from the April 1992 draft of the DOE Human Genome 1991--1992 Program Report, which is expected to be published in May 1992. The primer is intended to be an introduction to basic principles of molecular genetics pertaining to the genome project. The material contained herein is not final and may be incomplete. Techniques of genetic mapping and DNA sequencing are described.
Cryptic Fitness Advantage: Diploids Invade Haploid Populations Despite Lacking Any Apparent Advantage as Measured by Standard Fitness Assays

PubMed Central

Gerstein, Aleeza C.; Otto, Sarah P.

2011-01-01

Ploidy varies tremendously within and between species, yet the factors that influence when or why ploidy variants are adaptive remains poorly understood. Our previous work found that diploid individuals repeatedly arose within ten replicate haploid populations of Saccharomyces cerevisiae, and in each case we witnessed diploid takeover within 1800 asexual generations of batch culture evolution in the lab. The character that allowed diploids to rise in frequency within haploid populations remains unknown. Here we present a number of experiments conducted with the goal to determine what this trait (or traits) might have been. Experiments were conducted both by sampling a small number of colonies from the stocks frozen every two weeks (93 generations) during the original experiment, as well through sampling a larger number of colonies at the two time points where polymorphism for ploidy was most prevalent. Surprisingly, none of our fitness component measures (lag phase, growth rate, biomass production) indicated an advantage to diploidy. Similarly, competition assays against a common competitor and direct competition between haploid and diploid colonies isolated from the same time point failed to indicate a diploid advantage. Furthermore, we uncovered a tremendous amount of trait variation among colonies of the same ploidy level. Only late-appearing diploids showed a competitive advantage over haploids, indicating that the fitness advantage that allowed eventual takeover was not diploidy per se but an attribute of a subset of diploid lineages. Nevertheless, the initial rise in diploids to intermediate frequency cannot be explained by any of the fitness measures used; we suggest that the resolution to this mystery is negative frequency-dependent selection, which is ignored in the standard fitness measures used. PMID:22174734
Indigenous peoples and the morality of the Human Genome Diversity Project.

PubMed

Dodson, M; Williamson, R

1999-04-01

In addition to the aim of mapping and sequencing one human's genome, the Human Genome Project also intends to characterise the genetic diversity of the world's peoples. The Human Genome Diversity Project raises political, economic and ethical issues. These intersect clearly when the genomes under study are those of indigenous peoples who are already subject to serious economic, legal and/or social disadvantage and discrimination. The fact that some individuals associated with the project have made dismissive comments about indigenous peoples has confused rather than illuminated the deeper issues involved, as well as causing much antagonism among indigenous peoples. There are more serious ethical issues raised by the project for all geneticists, including those who are sympathetic to the problems of indigenous peoples. With particular attention to the history and attitudes of Australian indigenous peoples, we argue that the Human Genome Diversity Project can only proceed if those who further its objectives simultaneously: respect the cultural beliefs of indigenous peoples; publicly support the efforts of indigenous peoples to achieve respect and equality; express respect by a rigorous understanding of the meaning of equitable negotiation of consent, and ensure that both immediate and long term economic benefits from the research flow back to the groups taking part.
Genome-Wide Identification of Regulatory Sequences Undergoing Accelerated Evolution in the Human Genome

PubMed Central

Dong, Xinran; Wang, Xiao; Zhang, Feng; Tian, Weidong

2016-01-01

Accelerated evolution of regulatory sequence can alter the expression pattern of target genes, and cause phenotypic changes. In this study, we used DNase I hypersensitive sites (DHSs) to annotate putative regulatory sequences in the human genome, and conducted a genome-wide analysis of the effects of accelerated evolution on regulatory sequences. Working under the assumption that local ancient repeat elements of DHSs are under neutral evolution, we discovered that ∼0.44% of DHSs are under accelerated evolution (ace-DHSs). We found that ace-DHSs tend to be more active than background DHSs, and are strongly associated with epigenetic marks of active transcription. The target genes of ace-DHSs are significantly enriched in neuron-related functions, and their expression levels are positively selected in the human brain. Thus, these lines of evidences strongly suggest that accelerated evolution on regulatory sequences plays important role in the evolution of human-specific phenotypes. PMID:27401230
Modelling Human Regulatory Variation in Mouse: Finding the Function in Genome-Wide Association Studies and Whole-Genome Sequencing

PubMed Central

Schmouth, Jean-François; Bonaguro, Russell J.; Corso-Diaz, Ximena; Simpson, Elizabeth M.

2012-01-01

An increasing body of literature from genome-wide association studies and human whole-genome sequencing highlights the identification of large numbers of candidate regulatory variants of potential therapeutic interest in numerous diseases. Our relatively poor understanding of the functions of non-coding genomic sequence, and the slow and laborious process of experimental validation of the functional significance of human regulatory variants, limits our ability to fully benefit from this information in our efforts to comprehend human disease. Humanized mouse models (HuMMs), in which human genes are introduced into the mouse, suggest an approach to this problem. In the past, HuMMs have been used successfully to study human disease variants; e.g., the complex genetic condition arising from Down syndrome, common monogenic disorders such as Huntington disease and β-thalassemia, and cancer susceptibility genes such as BRCA1. In this commentary, we highlight a novel method for high-throughput single-copy site-specific generation of HuMMs entitled High-throughput Human Genes on the X Chromosome (HuGX). This method can be applied to most human genes for which a bacterial artificial chromosome (BAC) construct can be derived and a mouse-null allele exists. This strategy comprises (1) the use of recombineering technology to create a human variant–harbouring BAC, (2) knock-in of this BAC into the mouse genome using Hprt docking technology, and (3) allele comparison by interspecies complementation. We demonstrate the throughput of the HuGX method by generating a series of seven different alleles for the human NR2E1 gene at Hprt. In future challenges, we consider the current limitations of experimental approaches and call for a concerted effort by the genetics community, for both human and mouse, to solve the challenge of the functional analysis of human regulatory variation. PMID:22396661
The Human Genome Project: An Imperative for International Collaboration.

ERIC Educational Resources Information Center

Allende, J. E.

1989-01-01

Discussed is the Human Genome Project which aims to decipher the totality of the human genetic information. The historical background, the objectives, international cooperation, ethical discussion, and the role of UNESCO are included. (KR)
[Manipulation of the human genome: ethics and law].

PubMed

Goulart, Maria Carolina Vaz; Iano, Flávia Godoy; Silva, Paulo Maurício; Sales-Peres, Silvia Helena de Carvalho; Sales-Peres, Arsênio

2010-06-01

The molecular biology has provided the basic tool for geneticists deepening in the molecular mechanisms that influence different diseases. It should be noted the scientific and moral responsibility of the researchers, because the scientists should imagine the moral consequences of the commercial application of genetic tests, since this fact involves not only the individual and their families, but the entire population. Besides being also necessary to make a reflection on how this information from the human genome will be used, for good or bad. The objective of this review was to bring the light of knowledge, data on characteristics of the ethical application of molecular biology, linking it with the rights of human beings. After studying literature, it might be observed that the Human Genome Project has generated several possibilities, such as the identification of genes associated with diseases with synergistic properties, but sometimes modifying behavior to genetically intervene in humans, bringing benefits or social harm. The big challenge is to decide what humanity wants on this giant leap.
Short template switch events explain mutation clusters in the human genome.

PubMed

Löytynoja, Ari; Goldman, Nick

2017-06-01

Resequencing efforts are uncovering the extent of genetic variation in humans and provide data to study the evolutionary processes shaping our genome. One recurring puzzle in both intra- and inter-species studies is the high frequency of complex mutations comprising multiple nearby base substitutions or insertion-deletions. We devised a generalized mutation model of template switching during replication that extends existing models of genome rearrangement and used this to study the role of template switch events in the origin of short mutation clusters. Applied to the human genome, our model detects thousands of template switch events during the evolution of human and chimp from their common ancestor and hundreds of events between two independently sequenced human genomes. Although many of these are consistent with a template switch mechanism previously proposed for bacteria, our model also identifies new types of mutations that create short inversions, some flanked by paired inverted repeats. The local template switch process can create numerous complex mutation patterns, including hairpin loop structures, and explains multinucleotide mutations and compensatory substitutions without invoking positive selection, speculative mechanisms, or implausible coincidence. Clustered sequence differences are challenging for current mapping and variant calling methods, and we show that many erroneous variant annotations exist in human reference data. Local template switch events may have been neglected as an explanation for complex mutations because of biases in commonly used analyses. Incorporation of our model into reference-based analysis pipelines and comparisons of de novo assembled genomes will lead to improved understanding of genome variation and evolution. © 2017 Löytynoja and Goldman; Published by Cold Spring Harbor Laboratory Press.
Genome-Wide Landscapes of Human Local Adaptation in Asia

PubMed Central

Lu, Dongsheng; Xu, Shuhua

2013-01-01

Genetic studies of human local adaptation have been facilitated greatly by recent advances in high-throughput genotyping and sequencing technologies. However, few studies have investigated local adaptation in Asian populations on a genome-wide scale and with a high geographic resolution. In this study, taking advantage of the dense population coverage in Southeast Asia, which is the part of the world least studied in term of natural selection, we depicted genome-wide landscapes of local adaptations in 63 Asian populations representing the majority of linguistic and ethnic groups in Asia. Using genome-wide data analysis, we discovered many genes showing signs of local adaptation or natural selection. Notable examples, such as FOXQ1, MAST2, and CDH4, were found to play a role in hair follicle development and human cancer, signal transduction, and tumor repression, respectively. These showed strong indications of natural selection in Philippine Negritos, a group of aboriginal hunter-gatherers living in the Philippines. MTTP, which has associations with metabolic syndrome, body mass index, and insulin regulation, showed a strong signature of selection in Southeast Asians, including Indonesians. Functional annotation analysis revealed that genes and genetic variants underlying natural selections were generally enriched in the functional category of alternative splicing. Specifically, many genes showing significant difference with respect to allele frequency between northern and southern Asian populations were found to be associated with human height and growth and various immune pathways. In summary, this study contributes to the overall understanding of human local adaptation in Asia and has identified both known and novel signatures of natural selection in the human genome. PMID:23349834
EGASP: the human ENCODE Genome Annotation Assessment Project

PubMed Central

Guigó, Roderic; Flicek, Paul; Abril, Josep F; Reymond, Alexandre; Lagarde, Julien; Denoeud, France; Antonarakis, Stylianos; Ashburner, Michael; Bajic, Vladimir B; Birney, Ewan; Castelo, Robert; Eyras, Eduardo; Ucla, Catherine; Gingeras, Thomas R; Harrow, Jennifer; Hubbard, Tim; Lewis, Suzanna E; Reese, Martin G

2006-01-01

Background We present the results of EGASP, a community experiment to assess the state-of-the-art in genome annotation within the ENCODE regions, which span 1% of the human genome sequence. The experiment had two major goals: the assessment of the accuracy of computational methods to predict protein coding genes; and the overall assessment of the completeness of the current human genome annotations as represented in the ENCODE regions. For the computational prediction assessment, eighteen groups contributed gene predictions. We evaluated these submissions against each other based on a 'reference set' of annotations generated as part of the GENCODE project. These annotations were not available to the prediction groups prior to the submission deadline, so that their predictions were blind and an external advisory committee could perform a fair assessment. Results The best methods had at least one gene transcript correctly predicted for close to 70% of the annotated genes. Nevertheless, the multiple transcript accuracy, taking into account alternative splicing, reached only approximately 40% to 50% accuracy. At the coding nucleotide level, the best programs reached an accuracy of 90% in both sensitivity and specificity. Programs relying on mRNA and protein sequences were the most accurate in reproducing the manually curated annotations. Experimental validation shows that only a very small percentage (3.2%) of the selected 221 computationally predicted exons outside of the existing annotation could be verified. Conclusion This is the first such experiment in human DNA, and we have followed the standards established in a similar experiment, GASP1, in Drosophila melanogaster. We believe the results presented here contribute to the value of ongoing large-scale annotation projects and should guide further experimental methods when being scaled up to the entire human genome sequence. PMID:16925836
78 FR 55752 - National Human Genome Research Institute; Notice of Closed Meetings

Federal Register 2010, 2011, 2012, 2013, 2014

2013-09-11

... applications. Place: National Human Genome Research Institute, 4th Floor Library, 5635 Fishers Lane, Rockville... Research Institute; Notice of Closed Meetings Pursuant to section 10(d) of the Federal Advisory Committee... clearly unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research...
[Efficient genome editing in human pluripotent stem cells through CRISPR/Cas9].

PubMed

Liu, Gai-gai; Li, Shuang; Wei, Yu-da; Zhang, Yong-xian; Ding, Qiu-rong

2015-11-01

The RNA-guided CRISPR (clustered regularly interspaced short palindromic repeat)-associated Cas9 nuclease has offered a new platform for genome editing with high efficiency. Here, we report the use of CRISPR/Cas9 technology to target a specific genomic region in human pluripotent stem cells. We show that CRISPR/Cas9 can be used to disrupt a gene by introducing frameshift mutations to gene coding region; to knock in specific sequences (e.g. FLAG tag DNA sequence) to targeted genomic locus via homology directed repair; to induce large genomic deletion through dual-guide multiplex. Our results demonstrate the versatile application of CRISPR/Cas9 in stem cell genome editing, which can be widely utilized for functional studies of genes or genome loci in human pluripotent stem cells.
Lineage-specific genomics: Frequent birth and death in the human genome: The human genome contains many lineage-specific elements created by both sequence and functional turnover.

PubMed

Young, Robert S

2016-07-01

Frequent evolutionary birth and death events have created a large quantity of biologically important, lineage-specific DNA within mammalian genomes. The birth and death of DNA sequences is so frequent that the total number of these insertions and deletions in the human population remains unknown, although there are differences between these groups, e.g. transposable elements contribute predominantly to sequence insertion. Functional turnover - where the activity of a locus is specific to one lineage, but the underlying DNA remains conserved - can also drive birth and death. However, this does not appear to be a major driver of divergent transcriptional regulation. Both sequence and functional turnover have contributed to the birth and death of thousands of functional promoters in the human and mouse genomes. These findings reveal the pervasive nature of evolutionary birth and death and suggest that lineage-specific regions may play an important but previously underappreciated role in human biology and disease. © 2016 The Authors BioEssays Published by WILEY Periodicals, Inc.
Combinations of chromosome transfer and genome editing for the development of cell/animal models of human disease and humanized animal models.

PubMed

Uno, Narumi; Abe, Satoshi; Oshimura, Mitsuo; Kazuki, Yasuhiro

2018-02-01

Chromosome transfer technology, including chromosome modification, enables the introduction of Mb-sized or multiple genes to desired cells or animals. This technology has allowed innovative developments to be made for models of human disease and humanized animals, including Down syndrome model mice and humanized transchromosomic (Tc) immunoglobulin mice. Genome editing techniques are developing rapidly, and permit modifications such as gene knockout and knockin to be performed in various cell lines and animals. This review summarizes chromosome transfer-related technologies and the combined technologies of chromosome transfer and genome editing mainly for the production of cell/animal models of human disease and humanized animal models. Specifically, these include: (1) chromosome modification with genome editing in Chinese hamster ovary cells and mouse A9 cells for efficient transfer to desired cell types; (2) single-nucleotide polymorphism modification in humanized Tc mice with genome editing; and (3) generation of a disease model of Down syndrome-associated hematopoiesis abnormalities by the transfer of human chromosome 21 to normal human embryonic stem cells and the induction of mutation(s) in the endogenous gene(s) with genome editing. These combinations of chromosome transfer and genome editing open up new avenues for drug development and therapy as well as for basic research.
Origins of the Human Genome Project

DOE R&D Accomplishments Database

Cook-Deegan, Robert (Affiliation: Institute of Medicine, National Academy of Sciences)

1993-07-01

The human genome project was borne of technology, grew into a science bureaucracy in the United States and throughout the world, and is now being transformed into a hybrid academic and commercial enterprise. The next phase of the project promises to veer more sharply toward commercial application, harnessing both the technical prowess of molecular biology and the rapidly growing body of knowledge about DNA structure to the pursuit of practical benefits. Faith that the systematic analysis of DNA structure will prove to be a powerful research tool underlies the rationale behind the genome project. The notion that most genetic information is embedded in the sequence of CNA base pairs comprising chromosomes is a central tenet. A rough analogy is to liken an organism's genetic code to computer code. The coal of the genome project, in this parlance, is to identify and catalog 75,000 or more files (genes) in the software that directs construction of a self-modifying and self-replicating system -- a living organism.
PATENTS IN GENOMICS AND HUMAN GENETICS

PubMed Central

Cook-Deegan, Robert; Heaney, Christopher

2010-01-01

Genomics and human genetics are scientifically fundamental and commercially valuable. These fields grew to prominence in an era of growth in government and nonprofit research funding, and of even greater growth of privately funded research and development in biotechnology and pharmaceuticals. Patents on DNA technologies are a central feature of this story, illustrating how patent law adapts---and sometimes fails to adapt---to emerging genomic technologies. In instrumentation and for therapeutic proteins, patents have largely played their traditional role of inducing investment in engineering and product development, including expensive postdiscovery clinical research to prove safety and efficacy. Patents on methods and DNA sequences relevant to clinical genetic testing show less evidence of benefits and more evidence of problems and impediments, largely attributable to university exclusive licensing practices. Whole-genome sequencing will confront uncertainty about infringing granted patents but jurisprudence trends away from upholding the broadest and potentially most troublesome patent claims. PMID:20590431
Landscape of Insertion Polymorphisms in the Human Genome

PubMed Central

Onozawa, Masahiro; Goldberg, Liat; Aplan, Peter D.

2015-01-01

Nucleotide substitutions, small (<50 bp) insertions or deletions (indels), and large (>50 bp) deletions are well-known causes of genetic variation within the human genome. We recently reported a previously unrecognized form of polymorphic insertions, termed templated sequence insertion polymorphism (TSIP), in which the inserted sequence was templated from a distant genomic region, and was inserted in the genome through reverse transcription of an RNA intermediate. TSIPs can be grouped into two classes based on nucleotide sequence features at the insertion junctions; class 1 TSIPs show target site duplication, polyadenylation, and preference for insertion at a 5′-TTTT/A-3′ sequence, suggesting a LINE-1 based insertion mechanism, whereas class 2 TSIPs show features consistent with repair of a DNA double strand break by nonhomologous end joining. To gain a more complete picture of TSIPs throughout the human population, we evaluated whole-genome sequence from 52 individuals, and identified 171 TSIPs. Most individuals had 25–30 TSIPs, and common (present in >20% of individuals) TSIPs were found in individuals throughout the world, whereas rare TSIPs tended to cluster in specific geographic regions. The number of rare TSIPs was greater than the number of common TSIPs, suggesting that TSIP generation is an ongoing process. Intriguingly, mitochondrial sequences were a frequent template for class 2 insertions, used more commonly than any nuclear chromosome. Similar to single nucleotide polymorphisms and indels, we suspect that these TSIPs may be important for the generation of human diversity and genetic diseases, and can be useful in tracking historical migration of populations. PMID:25745018
Multi-scale structural community organisation of the human genome.

PubMed

Boulos, Rasha E; Tremblay, Nicolas; Arneodo, Alain; Borgnat, Pierre; Audit, Benjamin

2017-04-11

Structural interaction frequency matrices between all genome loci are now experimentally achievable thanks to high-throughput chromosome conformation capture technologies. This ensues a new methodological challenge for computational biology which consists in objectively extracting from these data the structural motifs characteristic of genome organisation. We deployed the fast multi-scale community mining algorithm based on spectral graph wavelets to characterise the networks of intra-chromosomal interactions in human cell lines. We observed that there exist structural domains of all sizes up to chromosome length and demonstrated that the set of structural communities forms a hierarchy of chromosome segments. Hence, at all scales, chromosome folding predominantly involves interactions between neighbouring sites rather than the formation of links between distant loci. Multi-scale structural decomposition of human chromosomes provides an original framework to question structural organisation and its relationship to functional regulation across the scales. By construction the proposed methodology is independent of the precise assembly of the reference genome and is thus directly applicable to genomes whose assembly is not fully determined.
The human genome: Some assembly required. Final report

DOE Office of Scientific and Technical Information (OSTI.GOV)

NONE

1994-12-31

The Human Genome Project promises to be one of the most rewarding endeavors in modern biology. The cost and the ethical and social implications, however, have made this project the source of considerable debate both in the scientific community and in the public at large. The 1994 Graduate Student Symposium addresses the scientific merits of the project, the technical issues involved in accomplishing the task, as well as the medical and social issues which stem from the wealth of knowledge which the Human Genome Project will help create. To this end, speakers were brought together who represent the diverse areasmore » of expertise characteristic of this multidisciplinary project. The keynote speaker addresses the project`s motivations and goals in the larger context of biological and medical sciences. The first two sessions address relevant technical issues, data collection with a focus on high-throughput sequencing methods and data analysis with an emphasis on identification of coding sequences. The third session explores recent advances in the understanding of genetic diseases and possible routes to treatment. Finally, the last session addresses some of the ethical, social and legal issues which will undoubtedly arise from having a detailed knowledge of the human genome.« less

Simulation and estimation of gene number in a biological pathway using almost complete saturation mutagenesis screening of haploid mouse cells.

PubMed

Tokunaga, Masahiro; Kokubu, Chikara; Maeda, Yusuke; Sese, Jun; Horie, Kyoji; Sugimoto, Nakaba; Kinoshita, Taroh; Yusa, Kosuke; Takeda, Junji

2014-11-24

Genome-wide saturation mutagenesis and subsequent phenotype-driven screening has been central to a comprehensive understanding of complex biological processes in classical model organisms such as flies, nematodes, and plants. The degree of "saturation" (i.e., the fraction of possible target genes identified) has been shown to be a critical parameter in determining all relevant genes involved in a biological function, without prior knowledge of their products. In mammalian model systems, however, the relatively large scale and labor intensity of experiments have hampered the achievement of actual saturation mutagenesis, especially for recessive traits that require biallelic mutations to manifest detectable phenotypes. By exploiting the recently established haploid mouse embryonic stem cells (ESCs), we present an implementation of almost complete saturation mutagenesis in a mammalian system. The haploid ESCs were mutagenized with the chemical mutagen N-ethyl-N-nitrosourea (ENU) and processed for the screening of mutants defective in various steps of the glycosylphosphatidylinositol-anchor biosynthetic pathway. The resulting 114 independent mutant clones were characterized by a functional complementation assay, and were shown to be defective in any of 20 genes among all 22 known genes essential for this well-characterized pathway. Ten mutants were further validated by whole-exome sequencing. The predominant generation of single-nucleotide substitutions by ENU resulted in a gene mutation rate proportional to the length of the coding sequence, which facilitated the experimental design of saturation mutagenesis screening with the aid of computational simulation. Our study enables mammalian saturation mutagenesis to become a realistic proposition. Computational simulation, combined with a pilot mutagenesis experiment, could serve as a tool for the estimation of the number of genes essential for biological processes such as drug target pathways when a positive selection of
Genome-to-genome analysis highlights the effect of the human innate and adaptive immune systems on the hepatitis C virus.

PubMed

Ansari, M Azim; Pedergnana, Vincent; L C Ip, Camilla; Magri, Andrea; Von Delft, Annette; Bonsall, David; Chaturvedi, Nimisha; Bartha, Istvan; Smith, David; Nicholson, George; McVean, Gilean; Trebes, Amy; Piazza, Paolo; Fellay, Jacques; Cooke, Graham; Foster, Graham R; Hudson, Emma; McLauchlan, John; Simmonds, Peter; Bowden, Rory; Klenerman, Paul; Barnes, Eleanor; Spencer, Chris C A

2017-05-01

Outcomes of hepatitis C virus (HCV) infection and treatment depend on viral and host genetic factors. Here we use human genome-wide genotyping arrays and new whole-genome HCV viral sequencing technologies to perform a systematic genome-to-genome study of 542 individuals who were chronically infected with HCV, predominantly genotype 3. We show that both alleles of genes encoding human leukocyte antigen molecules and genes encoding components of the interferon lambda innate immune system drive viral polymorphism. Additionally, we show that IFNL4 genotypes determine HCV viral load through a mechanism dependent on a specific amino acid residue in the HCV NS5A protein. These findings highlight the interplay between the innate immune system and the viral genome in HCV control.
Implications of the Human Genome Project for medical science.

PubMed

Collins, F S; McKusick, V A

2001-02-07

The year 2000 marked both the start of the new millennium and the announcement that the vast majority of the human genome had been sequenced. Much work remains to understand how this "instruction book for human biology" carries out its multitudes of functions. But the consequences for the practice of medicine are likely to be profound. Genetic prediction of individual risks of disease and responsiveness to drugs will reach the medical mainstream in the next decade or so. The development of designer drugs, based on a genomic approach to targeting molecular pathways that are disrupted in disease, will follow soon after. Potential misuses of genetic information, such as discrimination in obtaining health insurance and in the workplace, will need to be dealt with swiftly and effectively. Genomic medicine holds the ultimate promise of revolutionizing the diagnosis and treatment of many illnesses.
Trial and error: how the unclonable human mitochondrial genome was cloned in yeast.

PubMed

Bigger, Brian W; Liao, Ai-Yin; Sergijenko, Ana; Coutelle, Charles

2011-11-01

Development of a human mitochondrial gene delivery vector is a critical step in the ability to treat diseases arising from mutations in mitochondrial DNA. Although we have previously cloned the mouse mitochondrial genome in its entirety and developed it as a mitochondrial gene therapy vector, the human mitochondrial genome has been dubbed unclonable in E. coli, due to regions of instability in the D-loop and tRNA(Thr) gene. We tested multi- and single-copy vector systems for cloning human mitochondrial DNA in E. coli and Saccharomyces cerevisiae, including transformation-associated recombination. Human mitochondrial DNA is unclonable in E. coli and cannot be retained in multi- or single-copy vectors under any conditions. It was, however, possible to clone and stably maintain the entire human mitochondrial genome in yeast as long as a single-copy centromeric plasmid was used. D-loop and tRNA(Thr) were both stable and unmutated. This is the first report of cloning the entire human mitochondrial genome and the first step in developing a gene delivery vehicle for human mitochondrial gene therapy.
Identification of cis-suppression of human disease mutations by comparative genomics

PubMed Central

Jordan, Daniel M.; Frangakis, Stephan G.; Golzio, Christelle; Cassa, Christopher A.; Kurtzberg, Joanne; Davis, Erica E.; Sunyaev, Shamil R.; Katsanis, Nicholas

2015-01-01

Patterns of amino acid conservation have served as a tool for understanding protein evolution1. The same principles have also found broad application in human genomics, driven by the need to interpret the pathogenic potential of variants in patients2. Here we performed a systematic comparative genomics analysis of human disease-causing missense variants. We found that an appreciable fraction of disease-causing alleles are fixed in the genomes of other species, suggesting a role for genomic context. We developed a model of genetic interactions that predicts most of these to be simple pairwise compensations. Functional testing of this model on two known human disease genes3,4 revealed discrete cis amino acid residues that, although benign on their own, could rescue the human mutations in vivo. This approach was also applied to ab initio gene discovery to support the identification of a de novo disease driver in BTG2 that is subject to protective cis-modification in more than 50 species. Finally, on the basis of our data and models, we developed a computational tool to predict candidate residues subject to compensation. Taken together, our data highlight the importance of cis-genomic context as a contributor to protein evolution; they provide an insight into the complexity of allele effect on phenotype; and they are likely to assist methods for predicting allele pathogenicity5,6. PMID:26123021
Identification of cis-suppression of human disease mutations by comparative genomics.

PubMed

Jordan, Daniel M; Frangakis, Stephan G; Golzio, Christelle; Cassa, Christopher A; Kurtzberg, Joanne; Davis, Erica E; Sunyaev, Shamil R; Katsanis, Nicholas

2015-08-13

Patterns of amino acid conservation have served as a tool for understanding protein evolution. The same principles have also found broad application in human genomics, driven by the need to interpret the pathogenic potential of variants in patients. Here we performed a systematic comparative genomics analysis of human disease-causing missense variants. We found that an appreciable fraction of disease-causing alleles are fixed in the genomes of other species, suggesting a role for genomic context. We developed a model of genetic interactions that predicts most of these to be simple pairwise compensations. Functional testing of this model on two known human disease genes revealed discrete cis amino acid residues that, although benign on their own, could rescue the human mutations in vivo. This approach was also applied to ab initio gene discovery to support the identification of a de novo disease driver in BTG2 that is subject to protective cis-modification in more than 50 species. Finally, on the basis of our data and models, we developed a computational tool to predict candidate residues subject to compensation. Taken together, our data highlight the importance of cis-genomic context as a contributor to protein evolution; they provide an insight into the complexity of allele effect on phenotype; and they are likely to assist methods for predicting allele pathogenicity.
A BAC clone fingerprinting approach to the detection of human genome rearrangements

PubMed Central

Krzywinski, Martin; Bosdet, Ian; Mathewson, Carrie; Wye, Natasja; Brebner, Jay; Chiu, Readman; Corbett, Richard; Field, Matthew; Lee, Darlene; Pugh, Trevor; Volik, Stas; Siddiqui, Asim; Jones, Steven; Schein, Jacquie; Collins, Collin; Marra, Marco

2007-01-01

We present a method, called fingerprint profiling (FPP), that uses restriction digest fingerprints of bacterial artificial chromosome clones to detect and classify rearrangements in the human genome. The approach uses alignment of experimental fingerprint patterns to in silico digests of the sequence assembly and is capable of detecting micro-deletions (1-5 kb) and balanced rearrangements. Our method has compelling potential for use as a whole-genome method for the identification and characterization of human genome rearrangements. PMID:17953769
Genetic mapping of centromeres in the nine Citrus clementina chromosomes using half-tetrad analysis and recombination patterns in unreduced and haploid gametes.

PubMed

Aleza, Pablo; Cuenca, José; Hernández, María; Juárez, José; Navarro, Luis; Ollitrault, Patrick

2015-03-08

Mapping centromere locations in plant species provides essential information for the analysis of genetic structures and population dynamics. The centromere's position affects the distribution of crossovers along a chromosome and the parental heterozygosity restitution by 2n gametes is a direct function of the genetic distance to the centromere. Sexual polyploidisation is relatively frequent in Citrus species and is widely used to develop new seedless triploid cultivars. The study's objectives were to (i) map the positions of the centromeres of the nine Citrus clementina chromosomes; (ii) analyse the crossover interference in unreduced gametes; and (iii) establish the pattern of genetic recombination in haploid clementine gametes along each chromosome and its relationship with the centromere location and distribution of genic sequences. Triploid progenies were derived from unreduced megagametophytes produced by second-division restitution. Centromere positions were mapped genetically for all linkage groups using half-tetrad analysis. Inference of the physical locations of centromeres revealed one acrocentric, four metacentric and four submetacentric chromosomes. Crossover interference was observed in unreduced gametes, with variation seen between chromosome arms. For haploid gametes, a strong decrease in the recombination rate occurred in centromeric and pericentromeric regions, which contained a low density of genic sequences. In chromosomes VIII and IX, these low recombination rates extended beyond the pericentromeric regions. The genomic region corresponding to a genetic distance < 5cM from a centromere represented 47% of the genome and 23% of the genic sequences. The centromere positions of the nine citrus chromosomes were genetically mapped. Their physical locations, inferred from the genetic ones, were consistent with the sequence constitution and recombination pattern along each chromosome. However, regions with low recombination rates extended beyond the
Research on the human genome and patentability--the ethical consequences.

PubMed Central

Pompidou, A

1995-01-01

The genome is one of the primordial elements of the human being and is responsible for human identity and its transmission to descendants. The gene as such ought not be appropriated or owned by man. However, any sufficiently complete description of a gene should be capable of being protected as intellectual property. Furthermore, all utilisations of a gene or its elements that permit development of processes or new products should be patentable. Ethics, in the sense of moral action, should come into play from the very first stages of research into the human genome. Protection of intellectual and industrial property is of purely legal concern and need not provoke ethical consideration. By contrast, the use of the results of, and in particular the commercialisation of products deriving from, research into the human genome, ought to be subjected to ethical consideration and control. Considering the economic and societal stakes of such research, ethical analysis ought to be at an international level if mistakes and unforeseen risks of conflict are to be avoided. PMID:7608941
Efficient CRISPR/Cas9-Based Genome Engineering in Human Pluripotent Stem Cells.

PubMed

Kime, Cody; Mandegar, Mohammad A; Srivastava, Deepak; Yamanaka, Shinya; Conklin, Bruce R; Rand, Tim A

2016-01-01

Human pluripotent stem cells (hPS cells) are rapidly emerging as a powerful tool for biomedical discovery. The advent of human induced pluripotent stem cells (hiPS cells) with human embryonic stem (hES)-cell-like properties has led to hPS cells with disease-specific genetic backgrounds for in vitro disease modeling and drug discovery as well as mechanistic and developmental studies. To fully realize this potential, it will be necessary to modify the genome of hPS cells with precision and flexibility. Pioneering experiments utilizing site-specific double-strand break (DSB)-mediated genome engineering tools, including zinc finger nucleases (ZFNs) and transcription activator-like effector nucleases (TALENs), have paved the way to genome engineering in previously recalcitrant systems such as hPS cells. However, these methods are technically cumbersome and require significant expertise, which has limited adoption. A major recent advance involving the clustered regularly interspaced short palindromic repeats (CRISPR) endonuclease has dramatically simplified the effort required for genome engineering and will likely be adopted widely as the most rapid and flexible system for genome editing in hPS cells. In this unit, we describe commonly practiced methods for CRISPR endonuclease genomic editing of hPS cells into cell lines containing genomes altered by insertion/deletion (indel) mutagenesis or insertion of recombinant genomic DNA. Copyright © 2016 John Wiley & Sons, Inc.
Heteroplasmy in the Mitochondrial Genomes of Human Lice and Ticks Revealed by High Throughput Sequencing

PubMed Central

Xiong, Haoyu; Barker, Stephen C.; Burger, Thomas D.; Raoult, Didier; Shao, Renfu

2013-01-01

The typical mitochondrial (mt) genomes of bilateral animals consist of 37 genes on a single circular chromosome. The mt genomes of the human body louse, Pediculus humanus, and the human head louse, Pediculus capitis, however, are extensively fragmented and contain 20 minichromosomes, with one to three genes on each minichromosome. Heteroplasmy, i.e. nucleotide polymorphisms in the mt genome within individuals, has been shown to be significantly higher in the mt cox1 gene of human lice than in humans and other animals that have the typical mt genomes. To understand whether the extent of heteroplasmy in human lice is associated with mt genome fragmentation, we sequenced the entire coding regions of all of the mt minichromosomes of six human body lice and six human head lice from Ethiopia, China and France with an Illumina HiSeq platform. For comparison, we also sequenced the entire coding regions of the mt genomes of seven species of ticks, which have the typical mitochondrial genome organization of bilateral animals. We found that the level of heteroplasmy varies significantly both among the human lice and among the ticks. The human lice from Ethiopia have significantly higher level of heteroplasmy than those from China and France (Pt<0.05). The tick, Amblyomma cajennense, has significantly higher level of heteroplasmy than other ticks (Pt<0.05). Our results indicate that heteroplasmy level can be substantially variable within a species and among closely related species, and does not appear to be determined by single factors such as genome fragmentation. PMID:24058467
Heteroplasmy in the mitochondrial genomes of human lice and ticks revealed by high throughput sequencing.

PubMed

Xiong, Haoyu; Barker, Stephen C; Burger, Thomas D; Raoult, Didier; Shao, Renfu

2013-01-01

The typical mitochondrial (mt) genomes of bilateral animals consist of 37 genes on a single circular chromosome. The mt genomes of the human body louse, Pediculus humanus, and the human head louse, Pediculus capitis, however, are extensively fragmented and contain 20 minichromosomes, with one to three genes on each minichromosome. Heteroplasmy, i.e. nucleotide polymorphisms in the mt genome within individuals, has been shown to be significantly higher in the mt cox1 gene of human lice than in humans and other animals that have the typical mt genomes. To understand whether the extent of heteroplasmy in human lice is associated with mt genome fragmentation, we sequenced the entire coding regions of all of the mt minichromosomes of six human body lice and six human head lice from Ethiopia, China and France with an Illumina HiSeq platform. For comparison, we also sequenced the entire coding regions of the mt genomes of seven species of ticks, which have the typical mitochondrial genome organization of bilateral animals. We found that the level of heteroplasmy varies significantly both among the human lice and among the ticks. The human lice from Ethiopia have significantly higher level of heteroplasmy than those from China and France (Pt<0.05). The tick, Amblyomma cajennense, has significantly higher level of heteroplasmy than other ticks (Pt<0.05). Our results indicate that heteroplasmy level can be substantially variable within a species and among closely related species, and does not appear to be determined by single factors such as genome fragmentation.
Microbial genome-wide association studies: lessons from human GWAS.

PubMed

Power, Robert A; Parkhill, Julian; de Oliveira, Tulio

2017-01-01

The reduced costs of sequencing have led to whole-genome sequences for a large number of microorganisms, enabling the application of microbial genome-wide association studies (GWAS). Given the successes of human GWAS in understanding disease aetiology and identifying potential drug targets, microbial GWAS are likely to further advance our understanding of infectious diseases. These advances include insights into pressing global health problems, such as antibiotic resistance and disease transmission. In this Review, we outline the methodologies of GWAS, the current state of the field of microbial GWAS, and how lessons from human GWAS can direct the future of the field.
Mammalian genomic regulatory regions predicted by utilizing human genomics, transcriptomics, and epigenetics data

PubMed Central

Nguyen, Quan H; Tellam, Ross L; Naval-Sanchez, Marina; Porto-Neto, Laercio R; Barendse, William; Reverter, Antonio; Hayes, Benjamin; Kijas, James; Dalrymple, Brian P

2018-01-01

Abstract Genome sequences for hundreds of mammalian species are available, but an understanding of their genomic regulatory regions, which control gene expression, is only beginning. A comprehensive prediction of potential active regulatory regions is necessary to functionally study the roles of the majority of genomic variants in evolution, domestication, and animal production. We developed a computational method to predict regulatory DNA sequences (promoters, enhancers, and transcription factor binding sites) in production animals (cows and pigs) and extended its broad applicability to other mammals. The method utilizes human regulatory features identified from thousands of tissues, cell lines, and experimental assays to find homologous regions that are conserved in sequences and genome organization and are enriched for regulatory elements in the genome sequences of other mammalian species. Importantly, we developed a filtering strategy, including a machine learning classification method, to utilize a very small number of species-specific experimental datasets available to select for the likely active regulatory regions. The method finds the optimal combination of sensitivity and accuracy to unbiasedly predict regulatory regions in mammalian species. Furthermore, we demonstrated the utility of the predicted regulatory datasets in cattle for prioritizing variants associated with multiple production and climate change adaptation traits and identifying potential genome editing targets. PMID:29618048
Mammalian genomic regulatory regions predicted by utilizing human genomics, transcriptomics, and epigenetics data.

PubMed

Nguyen, Quan H; Tellam, Ross L; Naval-Sanchez, Marina; Porto-Neto, Laercio R; Barendse, William; Reverter, Antonio; Hayes, Benjamin; Kijas, James; Dalrymple, Brian P

2018-03-01

Genome sequences for hundreds of mammalian species are available, but an understanding of their genomic regulatory regions, which control gene expression, is only beginning. A comprehensive prediction of potential active regulatory regions is necessary to functionally study the roles of the majority of genomic variants in evolution, domestication, and animal production. We developed a computational method to predict regulatory DNA sequences (promoters, enhancers, and transcription factor binding sites) in production animals (cows and pigs) and extended its broad applicability to other mammals. The method utilizes human regulatory features identified from thousands of tissues, cell lines, and experimental assays to find homologous regions that are conserved in sequences and genome organization and are enriched for regulatory elements in the genome sequences of other mammalian species. Importantly, we developed a filtering strategy, including a machine learning classification method, to utilize a very small number of species-specific experimental datasets available to select for the likely active regulatory regions. The method finds the optimal combination of sensitivity and accuracy to unbiasedly predict regulatory regions in mammalian species. Furthermore, we demonstrated the utility of the predicted regulatory datasets in cattle for prioritizing variants associated with multiple production and climate change adaptation traits and identifying potential genome editing targets.
CRISPR/Cas9 genome editing in human pluripotent stem cells: Harnessing human genetics in a dish.

PubMed

González, Federico

2016-07-01

Because of their extraordinary differentiation potential, human pluripotent stem cells (hPSCs) can differentiate into virtually any cell type of the human body, providing a powerful platform not only for generating relevant cell types useful for cell replacement therapies, but also for modeling human development and disease. Expanding this potential, structures resembling human organs, termed organoids, have been recently obtained from hPSCs through tissue engineering. Organoids exhibit multiple cell types self-organizing into structures recapitulating in part the physiology and the cellular interactions observed in the organ in vivo, offering unprecedented opportunities for human disease modeling. To fulfill this promise, tissue engineering in hPSCs needs to be supported by robust and scalable genome editing technologies. With the advent of the CRISPR/Cas9 technology, manipulating the genome of hPSCs has now become an easy task, allowing modifying their genome with superior precision, speed, and throughput. Here we review current and potential applications of the CRISPR/Cas9 technology in hPSCs and how they contribute to establish hPSCs as a model of choice for studying human genetics. Developmental Dynamics 245:788-806, 2016. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
The emergence of human-evolutionary medical genomics

PubMed Central

Crespi, Bernard J

2011-01-01

In this review, I describe how evolutionary genomics is uniquely suited to spearhead advances in understanding human disease risk, owing to the privileged position of genes as fundamental causes of phenotypic variation, and the ability of population genetic and phylogenetic methods to robustly infer processes of natural selection, drift, and mutation from genetic variation at the levels of family, population, species, and clade. I first provide an overview of models for the origins and maintenance of genetically based disease risk in humans. I then discuss how analyses of genetic disease risk can be dovetailed with studies of positive and balancing selection, to evaluate the degree to which the ‘genes that make us human’ also represent the genes that mediate risk of polygenic disease. Finally, I present four basic principles for the nascent field of human evolutionary medical genomics, each of which represents a process that is nonintuitive from a proximate perspective. Joint consideration of these principles compels novel forms of interdisciplinary analyses, most notably studies that (i) analyze tradeoffs at the level of molecular genetics, and (ii) identify genetic variants that are derived in the human lineage or in specific populations, and then compare individuals with derived versus ancestral alleles. PMID:25567974
Genome-Wide Identification of Regulatory Sequences Undergoing Accelerated Evolution in the Human Genome.

PubMed

Dong, Xinran; Wang, Xiao; Zhang, Feng; Tian, Weidong

2016-10-01

Accelerated evolution of regulatory sequence can alter the expression pattern of target genes, and cause phenotypic changes. In this study, we used DNase I hypersensitive sites (DHSs) to annotate putative regulatory sequences in the human genome, and conducted a genome-wide analysis of the effects of accelerated evolution on regulatory sequences. Working under the assumption that local ancient repeat elements of DHSs are under neutral evolution, we discovered that ∼0.44% of DHSs are under accelerated evolution (ace-DHSs). We found that ace-DHSs tend to be more active than background DHSs, and are strongly associated with epigenetic marks of active transcription. The target genes of ace-DHSs are significantly enriched in neuron-related functions, and their expression levels are positively selected in the human brain. Thus, these lines of evidences strongly suggest that accelerated evolution on regulatory sequences plays important role in the evolution of human-specific phenotypes. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Persistence and breakdown of strand symmetry in the human genome.

PubMed

Zhang, Shang-Hong

2015-04-07

Afreixo, V., Bastos, C.A.C., Garcia, S.P., Rodrigues, J.M.O.S., Pinho, A.J., Ferreira, P.J.S.G., 2013. The breakdown of the word symmetry in the human genome. J. Theor. Biol. 335, 153-159 analyzed the word symmetry (strand symmetry or the second parity rule) in the human genome. They concluded that strand symmetry holds for oligonucleotides up to 6 nt and is no longer statistically significant for oligonucleotides of higher orders. However, although they provided some new results for the issue, their interpretation would not be fully justified. Also, their conclusion needs to be further evaluated. Further analysis of their results, especially those of equivalence tests and word symmetry distance, shows that strand symmetry would persist for higher-order oligonucleotides up to 9 nt in the human genome, at least for its overall frequency framework (oligonucleotide frequency pattern). Copyright © 2015 Elsevier Ltd. All rights reserved.
Inferring Selective Constraint from Population Genomic Data Suggests Recent Regulatory Turnover in the Human Brain

PubMed Central

Schrider, Daniel R.; Kern, Andrew D.

2015-01-01

The comparative genomics revolution of the past decade has enabled the discovery of functional elements in the human genome via sequence comparison. While that is so, an important class of elements, those specific to humans, is entirely missed by searching for sequence conservation across species. Here we present an analysis based on variation data among human genomes that utilizes a supervised machine learning approach for the identification of human-specific purifying selection in the genome. Using only allele frequency information from the complete low-coverage 1000 Genomes Project data set in conjunction with a support vector machine trained from known functional and nonfunctional portions of the genome, we are able to accurately identify portions of the genome constrained by purifying selection. Our method identifies previously known human-specific gains or losses of function and uncovers many novel candidates. Candidate targets for gain and loss of function along the human lineage include numerous putative regulatory regions of genes essential for normal development of the central nervous system, including a significant enrichment of gain of function events near neurotransmitter receptor genes. These results are consistent with regulatory turnover being a key mechanism in the evolution of human-specific characteristics of brain development. Finally, we show that the majority of the genome is unconstrained by natural selection currently, in agreement with what has been estimated from phylogenetic methods but in sharp contrast to estimates based on transcriptomics or other high-throughput functional methods. PMID:26590212

Understanding the Human Genome Project: Using Stations to Provide a Comprehensive Overview

ERIC Educational Resources Information Center

Soto, Julio G.

2005-01-01

A lesson was designed for lower division general education, non-major biology lecture-only course that included the historical and scientific context, some of the skills used to study the human genome, results, conclusions and ethical consideration. Students learn to examine and compare the published Human Genome maps, and employ the strategies…
The Human Genome Diversity Project

DOE Office of Scientific and Technical Information (OSTI.GOV)

Cavalli-Sforza, L.

1994-12-31

The Human Genome Diversity Project (HGD Project) is an international anthropology project that seeks to study the genetic richness of the entire human species. This kind of genetic information can add a unique thread to the tapestry knowledge of humanity. Culture, environment, history, and other factors are often more important, but humanity`s genetic heritage, when analyzed with recent technology, brings another type of evidence for understanding species` past and present. The Project will deepen the understanding of this genetic richness and show both humanity`s diversity and its deep and underlying unity. The HGD Project is still largely in its planningmore » stages, seeking the best ways to reach its goals. The continuing discussions of the Project, throughout the world, should improve the plans for the Project and their implementation. The Project is as global as humanity itself; its implementation will require the kinds of partnerships among different nations and cultures that make the involvement of UNESCO and other international organizations particularly appropriate. The author will briefly discuss the Project`s history, describe the Project, set out the core principles of the Project, and demonstrate how the Project will help combat the scourge of racism.« less
Pooled-DNA Sequencing for Elucidating New Genomic Risk Factors, Rare Variants Underlying Alzheimer's Disease.

PubMed

Jin, Sheng Chih; Benitez, Bruno A; Deming, Yuetiva; Cruchaga, Carlos

2016-01-01

Analyses of genome-wide association studies (GWAS) for complex disorders usually identify common variants with a relatively small effect size that only explain a small proportion of phenotypic heritability. Several studies have suggested that a significant fraction of heritability may be explained by low-frequency (minor allele frequency (MAF) of 1-5 %) and rare-variants that are not contained in the commercial GWAS genotyping arrays (Schork et al., Curr Opin Genet Dev 19:212, 2009). Rare variants can also have relatively large effects on risk for developing human diseases or disease phenotype (Cruchaga et al., PLoS One 7:e31039, 2012). However, it is necessary to perform next-generation sequencing (NGS) studies in a large population (>4,000 samples) to detect a significant rare-variant association. Several NGS methods, such as custom capture sequencing and amplicon-based sequencing, are designed to screen a small proportion of the genome, but most of these methods are limited in the number of samples that can be multiplexed (i.e. most sequencing kits only provide 96 distinct index). Additionally, the sequencing library preparation for 4,000 samples remains expensive and thus conducting NGS studies with the aforementioned methods are not feasible for most research laboratories.The need for low-cost large scale rare-variant detection makes pooled-DNA sequencing an ideally efficient and cost-effective technique to identify rare variants in target regions by sequencing hundreds to thousands of samples. Our recent work has demonstrated that pooled-DNA sequencing can accurately detect rare variants in targeted regions in multiple DNA samples with high sensitivity and specificity (Jin et al., Alzheimers Res Ther 4:34, 2012). In these studies we used a well-established pooled-DNA sequencing approach and a computational package, SPLINTER (short indel prediction by large deviation inference and nonlinear true frequency estimation by recursion) (Vallania et al., Genome Res
The Evolution and Functional Impact of Human Deletion Variants Shared with Archaic Hominin Genomes

PubMed Central

Lin, Yen-Lung; Pavlidis, Pavlos; Karakoc, Emre; Ajay, Jerry; Gokcumen, Omer

2015-01-01

Allele sharing between modern and archaic hominin genomes has been variously interpreted to have originated from ancestral genetic structure or through non-African introgression from archaic hominins. However, evolution of polymorphic human deletions that are shared with archaic hominin genomes has yet to be studied. We identified 427 polymorphic human deletions that are shared with archaic hominin genomes, approximately 87% of which originated before the Human–Neandertal divergence (ancient) and only approximately 9% of which have been introgressed from Neandertals (introgressed). Recurrence, incomplete lineage sorting between human and chimp lineages, and hominid-specific insertions constitute the remaining approximately 4% of allele sharing between humans and archaic hominins. We observed that ancient deletions correspond to more than 13% of all common (>5% allele frequency) deletion variation among modern humans. Our analyses indicate that the genomic landscapes of both ancient and introgressed deletion variants were primarily shaped by purifying selection, eliminating large and exonic variants. We found 17 exonic deletions that are shared with archaic hominin genomes, including those leading to three fusion transcripts. The affected genes are involved in metabolism of external and internal compounds, growth and sperm formation, as well as susceptibility to psoriasis and Crohn’s disease. Our analyses suggest that these “exonic” deletion variants have evolved through different adaptive forces, including balancing and population-specific positive selection. Our findings reveal that genomic structural variants that are shared between humans and archaic hominin genomes are common among modern humans and can influence biomedically and evolutionarily important phenotypes. PMID:25556237
Using populations of human and microbial genomes for organism detection in metagenomes

DOE PAGES

Ames, Sasha K.; Gardner, Shea N.; Marti, Jose Manuel; ...

2015-04-29

Identifying causative disease agents in human patients from shotgun metagenomic sequencing (SMS) presents a powerful tool to apply when other targeted diagnostics fail. Numerous technical challenges remain, however, before SMS can move beyond the role of research tool. Accurately separating the known and unknown organism content remains difficult, particularly when SMS is applied as a last resort. The true amount of human DNA that remains in a sample after screening against the human reference genome and filtering nonbiological components left from library preparation has previously been underreported. In this study, we create the most comprehensive collection of microbial and reference-freemore » human genetic variation available in a database optimized for efficient metagenomic search by extracting sequences from GenBank and the 1000 Genomes Project. The results reveal new human sequences found in individual Human Microbiome Project (HMP) samples. Individual samples contain up to 95% human sequence, and 4% of the individual HMP samples contain 10% or more human reads. In conclusion, left unidentified, human reads can complicate and slow down further analysis and lead to inaccurately labeled microbial taxa and ultimately lead to privacy concerns as more human genome data is collected.« less
Using populations of human and microbial genomes for organism detection in metagenomes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ames, Sasha K.; Gardner, Shea N.; Marti, Jose Manuel

Identifying causative disease agents in human patients from shotgun metagenomic sequencing (SMS) presents a powerful tool to apply when other targeted diagnostics fail. Numerous technical challenges remain, however, before SMS can move beyond the role of research tool. Accurately separating the known and unknown organism content remains difficult, particularly when SMS is applied as a last resort. The true amount of human DNA that remains in a sample after screening against the human reference genome and filtering nonbiological components left from library preparation has previously been underreported. In this study, we create the most comprehensive collection of microbial and reference-freemore » human genetic variation available in a database optimized for efficient metagenomic search by extracting sequences from GenBank and the 1000 Genomes Project. The results reveal new human sequences found in individual Human Microbiome Project (HMP) samples. Individual samples contain up to 95% human sequence, and 4% of the individual HMP samples contain 10% or more human reads. In conclusion, left unidentified, human reads can complicate and slow down further analysis and lead to inaccurately labeled microbial taxa and ultimately lead to privacy concerns as more human genome data is collected.« less
Genome-Wide Mutation Avalanches Induced in Diploid Yeast Cells by a Base Analog or an APOBEC Deaminase

PubMed Central

Lada, Artem G.; Stepchenkova, Elena I.; Waisertreiger, Irina S. R.; Noskov, Vladimir N.; Dhar, Alok; Eudy, James D.; Boissy, Robert J.; Hirano, Masayuki; Rogozin, Igor B.; Pavlov, Youri I.

2013-01-01

Genetic information should be accurately transmitted from cell to cell; conversely, the adaptation in evolution and disease is fueled by mutations. In the case of cancer development, multiple genetic changes happen in somatic diploid cells. Most classic studies of the molecular mechanisms of mutagenesis have been performed in haploids. We demonstrate that the parameters of the mutation process are different in diploid cell populations. The genomes of drug-resistant mutants induced in yeast diploids by base analog 6-hydroxylaminopurine (HAP) or AID/APOBEC cytosine deaminase PmCDA1 from lamprey carried a stunning load of thousands of unselected mutations. Haploid mutants contained almost an order of magnitude fewer mutations. To explain this, we propose that the distribution of induced mutation rates in the cell population is uneven. The mutants in diploids with coincidental mutations in the two copies of the reporter gene arise from a fraction of cells that are transiently hypersensitive to the mutagenic action of a given mutagen. The progeny of such cells were never recovered in haploids due to the lethality caused by the inactivation of single-copy essential genes in cells with too many induced mutations. In diploid cells, the progeny of hypersensitive cells survived, but their genomes were saturated by heterozygous mutations. The reason for the hypermutability of cells could be transient faults of the mutation prevention pathways, like sanitization of nucleotide pools for HAP or an elevated expression of the PmCDA1 gene or the temporary inability of the destruction of the deaminase. The hypothesis on spikes of mutability may explain the sudden acquisition of multiple mutational changes during evolution and carcinogenesis. PMID:24039593
Human Genome Replication Proceeds through Four Chromatin States

PubMed Central

Julienne, Hanna; Zoufir, Azedine; Audit, Benjamin; Arneodo, Alain

2013-01-01

Advances in genomic studies have led to significant progress in understanding the epigenetically controlled interplay between chromatin structure and nuclear functions. Epigenetic modifications were shown to play a key role in transcription regulation and genome activity during development and differentiation or in response to the environment. Paradoxically, the molecular mechanisms that regulate the initiation and the maintenance of the spatio-temporal replication program in higher eukaryotes, and in particular their links to epigenetic modifications, still remain elusive. By integrative analysis of the genome-wide distributions of thirteen epigenetic marks in the human cell line K562, at the 100 kb resolution of corresponding mean replication timing (MRT) data, we identify four major groups of chromatin marks with shared features. These states have different MRT, namely from early to late replicating, replication proceeds though a transcriptionally active euchromatin state (C1), a repressive type of chromatin (C2) associated with polycomb complexes, a silent state (C3) not enriched in any available marks, and a gene poor HP1-associated heterochromatin state (C4). When mapping these chromatin states inside the megabase-sized U-domains (U-shaped MRT profile) covering about 50% of the human genome, we reveal that the associated replication fork polarity gradient corresponds to a directional path across the four chromatin states, from C1 at U-domains borders followed by C2, C3 and C4 at centers. Analysis of the other genome half is consistent with early and late replication loci occurring in separate compartments, the former correspond to gene-rich, high-GC domains of intermingled chromatin states C1 and C2, whereas the latter correspond to gene-poor, low-GC domains of alternating chromatin states C3 and C4 or long C4 domains. This new segmentation sheds a new light on the epigenetic regulation of the spatio-temporal replication program in human and provides a
Quantification of GC-biased gene conversion in the human genome

PubMed Central

Glémin, Sylvain; Arndt, Peter F.; Messer, Philipp W.; Petrov, Dmitri; Galtier, Nicolas; Duret, Laurent

2015-01-01

Much evidence indicates that GC-biased gene conversion (gBGC) has a major impact on the evolution of mammalian genomes. However, a detailed quantification of the process is still lacking. The strength of gBGC can be measured from the analysis of derived allele frequency spectra (DAF), but this approach is sensitive to a number of confounding factors. In particular, we show by simulations that the inference is pervasively affected by polymorphism polarization errors and by spatial heterogeneity in gBGC strength. We propose a new general method to quantify gBGC from DAF spectra, incorporating polarization errors, taking spatial heterogeneity into account, and jointly estimating mutation bias. Applying it to human polymorphism data from the 1000 Genomes Project, we show that the strength of gBGC does not differ between hypermutable CpG sites and non-CpG sites, suggesting that in humans gBGC is not caused by the base-excision repair machinery. Genome-wide, the intensity of gBGC is in the nearly neutral area. However, given that recombination occurs primarily within recombination hotspots, 1%–2% of the human genome is subject to strong gBGC. On average, gBGC is stronger in African than in non-African populations, reflecting differences in effective population sizes. However, due to more heterogeneous recombination landscapes, the fraction of the genome affected by strong gBGC is larger in non-African than in African populations. Given that the location of recombination hotspots evolves very rapidly, our analysis predicts that, in the long term, a large fraction of the genome is affected by short episodes of strong gBGC. PMID:25995268
Meraculous: De Novo Genome Assembly with Short Paired-End Reads

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chapman, Jarrod A.; Ho, Isaac; Sunkara, Sirisha

2011-08-18

We describe a new algorithm, meraculous, for whole genome assembly of deep paired-end short reads, and apply it to the assembly of a dataset of paired 75-bp Illumina reads derived from the 15.4 megabase genome of the haploid yeast Pichia stipitis. More than 95% of the genome is recovered, with no errors; half the assembled sequence is in contigs longer than 101 kilobases and in scaffolds longer than 269 kilobases. Incorporating fosmid ends recovers entire chromosomes. Meraculous relies on an efficient and conservative traversal of the subgraph of the k-mer (deBruijn) graph of oligonucleotides with unique high quality extensions inmore » the dataset, avoiding an explicit error correction step as used in other short-read assemblers. A novel memory-efficient hashing scheme is introduced. The resulting contigs are ordered and oriented using paired reads separated by ~280 bp or ~3.2 kbp, and many gaps between contigs can be closed using paired-end placements. Practical issues with the dataset are described, and prospects for assembling larger genomes are discussed.« less
Metabolic engineering of a haploid strain derived from a triploid industrial yeast for producing cellulosic ethanol.

PubMed

Kim, Soo Rin; Skerker, Jeffrey M; Kong, In Iok; Kim, Heejin; Maurer, Matthew J; Zhang, Guo-Chang; Peng, Dairong; Wei, Na; Arkin, Adam P; Jin, Yong-Su

2017-03-01

Many desired phenotypes for producing cellulosic biofuels are often observed in industrial Saccharomyces cerevisiae strains. However, many industrial yeast strains are polyploid and have low spore viability, making it difficult to use these strains for metabolic engineering applications. We selected the polyploid industrial strain S. cerevisiae ATCC 4124 exhibiting rapid glucose fermentation capability, high ethanol productivity, strong heat and inhibitor tolerance in order to construct an optimal yeast strain for producing cellulosic ethanol. Here, we focused on developing a general approach and high-throughput screening method to isolate stable haploid segregants derived from a polyploid parent, such as triploid ATCC 4124 with a poor spore viability. Specifically, we deleted the HO genes, performed random sporulation, and screened the resulting segregants based on growth rate, mating type, and ploidy. Only one stable haploid derivative (4124-S60) was isolated, while 14 other segregants with a stable mating type were aneuploid. The 4124-S60 strain inherited only a subset of desirable traits present in the parent strain, same as other aneuploids, suggesting that glucose fermentation and specific ethanol productivity are likely to be genetically complex traits and/or they might depend on ploidy. Nonetheless, the 4124-60 strain did inherit the ability to tolerate fermentation inhibitors. When additional genetic perturbations known to improve xylose fermentation were introduced into the 4124-60 strain, the resulting engineered strain (IIK1) was able to ferment a Miscanthus hydrolysate better than a previously engineered laboratory strain (SR8), built by making the same genetic changes. However, the IIK1 strain showed higher glycerol and xylitol yields than the SR8 strain. In order to decrease glycerol and xylitol production, an NADH-dependent acetate reduction pathway was introduced into the IIK1 strain. By consuming 2.4g/L of acetate, the resulting strain (IIK1A
Origins of the Human Genome Project

DOE Office of Scientific and Technical Information (OSTI.GOV)

Cook-Deegan, Robert

1993-07-01

The human genome project was borne of technology, grew into a science bureaucracy in the US and throughout the world, and is now being transformed into a hybrid academic and commercial enterprise. The next phase of the project promises to veer more sharply toward commercial application, harnessing both the technical prowess of molecular biology and the rapidly growing body of knowledge about DNA structure to the pursuit of practical benefits. Faith that the systematic analysis of DNA structure will prove to be a powerful research tool underlies the rationale behind the genome project. The notion that most genetic information ismore » embedded in the sequence of CNA base pairs comprising chromosomes is a central tenet. A rough analogy is to liken an organism's genetic code to computer code. The coal of the genome project, in this parlance, is to identify and catalog 75,000 or more files (genes) in the software that directs construction of a self-modifying and self-replicating system -- a living organism.« less
Genome sequence of the highly weak-acid-tolerant Zygosaccharomyces bailii IST302, amenable to genetic manipulations and physiological studies.

PubMed

Palma, Margarida; Münsterkötter, Martin; Peça, João; Güldener, Ulrich; Sá-Correia, Isabel

2017-06-01

Zygosaccharomyces bailii is one of the most problematic spoilage yeast species found in the food and beverage industry particularly in acidic products, due to its exceptional resistance to weak acid stress. This article describes the annotation of the genome sequence of Z. bailii IST302, a strain recently proven to be amenable to genetic manipulations and physiological studies. The work was based on the annotated genomes of strain ISA1307, an interspecies hybrid between Z. bailii and a closely related species, and the Z. bailii reference strain CLIB 213T. The resulting genome sequence of Z. bailii IST302 is distributed through 105 scaffolds, comprising a total of 5142 genes and a size of 10.8 Mb. Contrasting with CLIB 213T, strain IST302 does not form cell aggregates, allowing its manipulation in the laboratory for genetic and physiological studies. Comparative cell cycle analysis with the haploid and diploid Saccharomyces cerevisiae strains BY4741 and BY4743, respectively, suggests that Z. bailii IST302 is haploid. This is an additional trait that makes this strain attractive for the functional analysis of non-essential genes envisaging the elucidation of mechanisms underlying its high tolerance to weak acid food preservatives, or the investigation and exploitation of the potential of this resilient yeast species as cell factory. © FEMS 2017.
CRISPR/Cas9 for Human Genome Engineering and Disease Research.

PubMed

Xiong, Xin; Chen, Meng; Lim, Wendell A; Zhao, Dehua; Qi, Lei S

2016-08-31

The clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated 9 (Cas9) system, a versatile RNA-guided DNA targeting platform, has been revolutionizing our ability to modify, manipulate, and visualize the human genome, which greatly advances both biological research and therapeutics development. Here, we review the current development of CRISPR/Cas9 technologies for gene editing, transcription regulation, genome imaging, and epigenetic modification. We discuss the broad application of this system to the study of functional genomics, especially genome-wide genetic screening, and to therapeutics development, including establishing disease models, correcting defective genetic mutations, and treating diseases.
The d4 gene family in the human genome

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chestkov, A.V.; Baka, I.D.; Kost, M.V.

1996-08-15

The d4 domain, a novel zinc finger-like structural motif, was first revealed in the rat neuro-d4 protein. Here we demonstrate that the d4 domain is conserved in evolution and that three related genes form a d4 family in the human genome. The human neuro-d4 is very similar to rat neuro-d4 at both the amino acid and the nucleotide levels. Moreover, the same splice variants have been detected among rat and human neuro-d4 transcripts. This gene has been localized on chromosome 19, and two other genes, members of the d4 family isolated by screening of the human genomic library at lowmore » stringency, have been mapped to chromosomes 11 and 14. The gene on chromosome 11 is the homolog of the ubiquitously expressed mouse gene ubi-d4/requiem, which is required for cell death after deprivation of trophic factors. A gene with a conserved d4 domain has been found in the genome of the nematode Caenorhabditis elegans. The conservation of d4 proteins from nematodes to vertebrates suggests that they have a general importance, but a diversity of d4 proteins expressed in vertebrate nervous systems suggests that some family members have special functions. 11 refs., 2 figs.« less
Novel mouse model recapitulates genome and transcriptome alterations in human colorectal carcinomas.

PubMed

McNeil, Nicole E; Padilla-Nash, Hesed M; Buishand, Floryne O; Hue, Yue; Ried, Thomas

2017-03-01

Human colorectal carcinomas are defined by a nonrandom distribution of genomic imbalances that are characteristic for this disease. Often, these imbalances affect entire chromosomes. Understanding the role of these aneuploidies for carcinogenesis is of utmost importance. Currently, established transgenic mice do not recapitulate the pathognonomic genome aberration profile of human colorectal carcinomas. We have developed a novel model based on the spontaneous transformation of murine colon epithelial cells. During this process, cells progress through stages of pre-immortalization, immortalization and, finally, transformation, and result in tumors when injected into immunocompromised mice. We analyzed our model for genome and transcriptome alterations using ArrayCGH, spectral karyotyping (SKY), and array based gene expression profiling. ArrayCGH revealed a recurrent pattern of genomic imbalances. These results were confirmed by SKY. Comparing these imbalances with orthologous maps of human chromosomes revealed a remarkable overlap. We observed focal deletions of the tumor suppressor genes Trp53 and Cdkn2a/p16. High-level focal genomic amplification included the locus harboring the oncogene Mdm2, which was confirmed by FISH in the form of double minute chromosomes. Array-based global gene expression revealed distinct differences between the sequential steps of spontaneous transformation. Gene expression changes showed significant similarities with human colorectal carcinomas. Pathways most prominently affected included genes involved in chromosomal instability and in epithelial to mesenchymal transition. Our novel mouse model therefore recapitulates the most prominent genome and transcriptome alterations in human colorectal cancer, and might serve as a valuable tool for understanding the dynamic process of tumorigenesis, and for preclinical drug testing. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
The genomics of preterm birth: from animal models to human studies

PubMed Central

2013-01-01

Preterm birth (delivery at less than 37 weeks of gestation) is the leading cause of infant mortality worldwide. So far, the application of animal models to understand human birth timing has not substantially revealed mechanisms that could be used to prevent prematurity. However, with amassing data implicating an important role for genetics in the timing of the onset of human labor, the use of modern genomic approaches, such as genome-wide association studies, rare variant analyses using whole-exome or genome sequencing, and family-based designs, holds enormous potential. Although some progress has been made in the search for causative genes and variants associated with preterm birth, the major genetic determinants remain to be identified. Here, we review insights from and limitations of animal models for understanding the physiology of parturition, recent human genetic and genomic studies to identify genes involved in preterm birth, and emerging areas that are likely to be informative in future investigations. Further advances in understanding fundamental mechanisms, and the development of preventative measures, will depend upon the acquisition of greater numbers of carefully phenotyped pregnancies, large-scale informatics approaches combining genomic information with information on environmental exposures, and new conceptual models for studying the interaction between the maternal and fetal genomes to personalize therapies for mothers and infants. Information emerging from these advances will help us to identify new biomarkers for earlier detection of preterm labor, develop more effective therapeutic agents, and/or promote prophylactic measures even before conception. PMID:23673148
Meta genome-wide network from functional linkages of genes in human gut microbial ecosystems.

PubMed

Ji, Yan; Shi, Yixiang; Wang, Chuan; Dai, Jianliang; Li, Yixue

2013-03-01

The human gut microbial ecosystem (HGME) exerts an important influence on the human health. In recent researches, meta-genomics provided deep insights into the HGME in terms of gene contents, metabolic processes and genome constitutions of meta-genome. Here we present a novel methodology to investigate the HGME on the basis of a set of functionally coupled genes regardless of their genome origins when considering the co-evolution properties of genes. By analyzing these coupled genes, we showed some basic properties of HGME significantly associated with each other, and further constructed a protein interaction map of human gut meta-genome to discover some functional modules that may relate with essential metabolic processes. Compared with other studies, our method provides a new idea to extract basic function elements from meta-genome systems and investigate complex microbial environment by associating its biological traits with co-evolutionary fingerprints encoded in it.
RNA-programmed genome editing in human cells

PubMed Central

Jinek, Martin; East, Alexandra; Cheng, Aaron; Lin, Steven; Ma, Enbo; Doudna, Jennifer

2013-01-01

Type II CRISPR immune systems in bacteria use a dual RNA-guided DNA endonuclease, Cas9, to cleave foreign DNA at specific sites. We show here that Cas9 assembles with hybrid guide RNAs in human cells and can induce the formation of double-strand DNA breaks (DSBs) at a site complementary to the guide RNA sequence in genomic DNA. This cleavage activity requires both Cas9 and the complementary binding of the guide RNA. Experiments using extracts from transfected cells show that RNA expression and/or assembly into Cas9 is the limiting factor for Cas9-mediated DNA cleavage. In addition, we find that extension of the RNA sequence at the 3′ end enhances DNA targeting activity in vivo. These results show that RNA-programmed genome editing is a facile strategy for introducing site-specific genetic changes in human cells. DOI: http://dx.doi.org/10.7554/eLife.00471.001 PMID:23386978
Human gut microbiome: the second genome of human body.

PubMed

Zhu, Baoli; Wang, Xin; Li, Lanjuan

2010-08-01

The human body is actually a super-organism that is composed of 10 times more microbial cells than our body cells. Metagenomic study of the human microbiome has demonstrated that there are 3.3 million unique genes in human gut, 150 times more genes than our own genome, and the bacterial diversity analysis showed that about 1000 bacterial species are living in our gut and a majority of them belongs to the divisions of Firmicutes and Bacteriodetes. In addition, most people share a core microbiota that comprises 50-100 bacterial species when the frequency of abundance at phylotype level is not considered, and a core microbiome harboring more than 6000 functional gene groups is present in the majority of human gut surveyed till now. Gut bacteria are not only critical for regulating gut metabolism, but also important for host immune system as revealed by animal studies.

The Human Genome Project: applications in the diagnosis and treatment of neurologic disease.

PubMed

Evans, G A

1998-10-01

The Human Genome Project (HGP), an international program to decode the entire DNA sequence of the human genome in 15 years, represents the largest biological experiment ever conducted. This set of information will contain the blueprint for the construction and operation of a human being. While the primary driving force behind the genome project is the potential to vastly expand the amount of genetic information available for biomedical research, the ramifications for other fields of study in biological research, the biotechnology and pharmaceutical industry, our understanding of evolution, effects on agriculture, and implications for bioethics are likely to be profound.
CHPA, a Cysteine- and Histidine-Rich-Domain-Containing Protein, Contributes to Maintenance of the Diploid State in Aspergillus nidulans

PubMed Central

Sadanandom, Ari; Findlay, Kim; Doonan, John H.; Schulze-Lefert, Paul; Shirasu, Ken

2004-01-01

The alternation of eukaryotic life cycles between haploid and diploid phases is crucial for maintaining genetic diversity. In some organisms, the growth and development of haploid and diploid phases are nearly identical, and one might suppose that all genes required for one phase are likely to be critical for the other phase. Here, we show that targeted disruption of the chpA (cysteine- and histidine-rich-domain- [CHORD]-containing protein A) gene in haploid Aspergillus nidulans strains gives rise to chpA knockout haploids and heterozygous diploids but no chpA knockout diploids. A. nidulans chpA heterozygous diploids showed impaired conidiophore development and reduced conidiation. Deletion of chpA from diploid A. nidulans resulted in genome instability and reversion to a haploid state. Thus, our data suggest a vital role for chpA in maintenance of the diploid phase in A. nidulans. Furthermore, the human chpA homolog, Chp-1, was able to complement haploinsufficiency in A. nidulans chpA heterozygotes, suggesting that the function of CHORD-containing proteins is highly conserved in eukaryotes. PMID:15302831
Clan Genomics and the Complex Architecture of Human Disease

PubMed Central

Belmont, John W.; Boerwinkle, Eric

2013-01-01

Human diseases are caused by alleles that encompass the full range of variant types, from single-nucleotide changes to copy-number variants, and these variations span a broad frequency spectrum, from the very rare to the common. The picture emerging from analysis of whole-genome sequences, the 1000 Genomes Project pilot studies, and targeted genomic sequencing derived from very large sample sizes reveals an abundance of rare and private variants. One implication of this realization is that recent mutation may have a greater influence on disease susceptibility or protection than is conferred by variations that arose in distant ancestors. PMID:21962505
Characterization of canine osteosarcoma by array comparative genomic hybridization and RT-qPCR: signatures of genomic imbalance in canine osteosarcoma parallel the human counterpart.

PubMed

Angstadt, Andrea Y; Motsinger-Reif, Alison; Thomas, Rachael; Kisseberth, William C; Guillermo Couto, C; Duval, Dawn L; Nielsen, Dahlia M; Modiano, Jaime F; Breen, Matthew

2011-11-01

Osteosarcoma (OS) is the most commonly diagnosed malignant bone tumor in humans and dogs, characterized in both species by extremely complex karyotypes exhibiting high frequencies of genomic imbalance. Evaluation of genomic signatures in human OS using array comparative genomic hybridization (aCGH) has assisted in uncovering genetic mechanisms that result in disease phenotype. Previous low-resolution (10-20 Mb) aCGH analysis of canine OS identified a wide range of recurrent DNA copy number aberrations, indicating extensive genomic instability. In this study, we profiled 123 canine OS tumors by 1 Mb-resolution aCGH to generate a dataset for direct comparison with current data for human OS, concluding that several high frequency aberrations in canine and human OS are orthologous. To ensure complete coverage of gene annotation, we identified the human refseq genes that map to these orthologous aberrant dog regions and found several candidate genes warranting evaluation for OS involvement. Specifically, subsequenct FISH and qRT-PCR analysis of RUNX2, TUSC3, and PTEN indicated that expression levels correlated with genomic copy number status, showcasing RUNX2 as an OS associated gene and TUSC3 as a possible tumor suppressor candidate. Together these data demonstrate the ability of genomic comparative oncology to identify genetic abberations which may be important for OS progression. Large scale screening of genomic imbalance in canine OS further validates the use of the dog as a suitable model for human cancers, supporting the idea that dysregulation discovered in canine cancers will provide an avenue for complementary study in human counterparts. Copyright © 2011 Wiley-Liss, Inc.
Signatures of Long-Term Balancing Selection in Human Genomes

PubMed Central

de Filippo, Cesare; Teixeira, João C; Schmidt, Joshua M; Kleinert, Philip; Meyer, Diogo; Andrés, Aida M

2018-01-01

Abstract Balancing selection maintains advantageous diversity in populations through various mechanisms. Although extensively explored from a theoretical perspective, an empirical understanding of its prevalence and targets lags behind our knowledge of positive selection. Here, we describe the Non-central Deviation (NCD), a simple yet powerful statistic to detect long-term balancing selection (LTBS) that quantifies how close frequencies are to expectations under LTBS, and provides the basis for a neutrality test. NCD can be applied to a single locus or genomic data, and can be implemented considering only polymorphisms (NCD1) or also considering fixed differences with respect to an outgroup (NCD2) species. Incorporating fixed differences improves power, and NCD2 has higher power to detect LTBS in humans under different frequencies of the balanced allele(s) than other available methods. Applied to genome-wide data from African and European human populations, in both cases using chimpanzee as an outgroup, NCD2 shows that, albeit not prevalent, LTBS affects a sizable portion of the genome: ∼0.6% of analyzed genomic windows and 0.8% of analyzed positions. Significant windows (P < 0.0001) contain 1.6% of SNPs in the genome, which disproportionally fall within exons and change protein sequence, but are not enriched in putatively regulatory sites. These windows overlap ∼8% of the protein-coding genes, and these have larger number of transcripts than expected by chance even after controlling for gene length. Our catalog includes known targets of LTBS but a majority of them (90%) are novel. As expected, immune-related genes are among those with the strongest signatures, although most candidates are involved in other biological functions, suggesting that LTBS potentially influences diverse human phenotypes. PMID:29608730
Novel mechanism of conjoined gene formation in the human genome.

PubMed

Kim, Ryong Nam; Kim, Aeri; Choi, Sang-Haeng; Kim, Dae-Soo; Nam, Seong-Hyeuk; Kim, Dae-Won; Kim, Dong-Wook; Kang, Aram; Kim, Min-Young; Park, Kun-Hyang; Yoon, Byoung-Ha; Lee, Kang Seon; Park, Hong-Seog

2012-03-01

Recently, conjoined genes (CGs) have emerged as important genetic factors necessary for understanding the human genome. However, their formation mechanism and precise structures have remained mysterious. Based on a detailed structural analysis of 57 human CG transcript variants (CGTVs, discovered in this study) and all (833) known CGs in the human genome, we discovered that the poly(A) signal site from the upstream parent gene region is completely removed via the skipping or truncation of the final exon; consequently, CG transcription is terminated at the poly(A) signal site of the downstream parent gene. This result led us to propose a novel mechanism of CG formation: the complete removal of the poly(A) signal site from the upstream parent gene is a prerequisite for the CG transcriptional machinery to continue transcribing uninterrupted into the intergenic region and downstream parent gene. The removal of the poly(A) signal sequence from the upstream gene region appears to be caused by a deletion or truncation mutation in the human genome rather than post-transcriptional trans-splicing events. With respect to the characteristics of CG sequence structures, we found that intergenic regions are hot spots for novel exon creation during CGTV formation and that exons farther from the intergenic regions are more highly conserved in the CGTVs. Interestingly, many novel exons newly created within the intergenic and intragenic regions originated from transposable element sequences. Additionally, the CGTVs showed tumor tissue-biased expression. In conclusion, our study provides novel insights into the CG formation mechanism and expands the present concepts of the genetic structural landscape, gene regulation, and gene formation mechanisms in the human genome.
The Human Genome Project and Biology Education.

ERIC Educational Resources Information Center

McInerney, Joseph D.

1996-01-01

Highlights the importance of the Human Genome Project in educating the public about genetics. Discusses four challenges that science educators must address: teaching for conceptual understanding, the nature of science, the personal and social impact of science and technology, and the principles of technology. Contains 45 references. (JRH)
A decade after the first full human genome sequencing: when will we understand our own genome?

PubMed

Eisenhaber, Frank

2012-10-01

The contrast between the pomp of celebrating the first full human genome sequencing in 2000 and the cautious tone of recollections a decade thereafter could hardly be greater. The promises with regard to medical cures and biotechnology applications have been realized not even nearly to the expectations. Understanding the human genomes means knowing the genes' and proteins' functions and their interconnectedness via biomolecular mechanisms. This articles estimates how long will it take to achieve this goal if we extrapolate from the previous decade (indeed, a century!) and the possible disruptive trends in science, technology and society that may accelerate the pace of progress dramatically.
77 FR 64816 - National Human Genome Research Institute; Notice of Meeting

Federal Register 2010, 2011, 2012, 2013, 2014

2012-10-23

... sign language interpretation or other reasonable accommodations, should notify the Contact Person... relevance. Place: National Human Genome Research Institute, 5635 Fishers Lane, Terrace Level Conference Room... Genome Research Institute, 5635 Fishers Lane, Terrace Level Conference Room, Rockville, MD 20892. Contact...
Using populations of human and microbial genomes for organism detection in metagenomes.

PubMed

Ames, Sasha K; Gardner, Shea N; Marti, Jose Manuel; Slezak, Tom R; Gokhale, Maya B; Allen, Jonathan E

2015-07-01

Identifying causative disease agents in human patients from shotgun metagenomic sequencing (SMS) presents a powerful tool to apply when other targeted diagnostics fail. Numerous technical challenges remain, however, before SMS can move beyond the role of research tool. Accurately separating the known and unknown organism content remains difficult, particularly when SMS is applied as a last resort. The true amount of human DNA that remains in a sample after screening against the human reference genome and filtering nonbiological components left from library preparation has previously been underreported. In this study, we create the most comprehensive collection of microbial and reference-free human genetic variation available in a database optimized for efficient metagenomic search by extracting sequences from GenBank and the 1000 Genomes Project. The results reveal new human sequences found in individual Human Microbiome Project (HMP) samples. Individual samples contain up to 95% human sequence, and 4% of the individual HMP samples contain 10% or more human reads. Left unidentified, human reads can complicate and slow down further analysis and lead to inaccurately labeled microbial taxa and ultimately lead to privacy concerns as more human genome data is collected. © 2015 Ames et al.; Published by Cold Spring Harbor Laboratory Press.
Transcript levels of ten caste-related genes in adult diploid males of Melipona quadrifasciata (Hymenoptera, Apidae) - A comparison with haploid males, queens and workers

PubMed Central

Borges, Andreia A.; Humann, Fernanda C.; Oliveira Campos, Lucio A.; Tavares, Mara G.; Hartfelder, Klaus

2011-01-01

In Hymenoptera, homozygosity at the sex locus results in the production of diploid males. In social species, these pose a double burden by having low fitness and drawing resources normally spent for increasing the work force of a colony. Yet, diploid males are of academic interest as they can elucidate effects of ploidy (normal males are haploid, whereas the female castes, the queens and workers, are diploid) on morphology and life history. Herein we investigated expression levels of ten caste-related genes in the stingless bee Melipona quadrifasciata, comparing newly emerged and 5-day-old diploid males with haploid males, queens and workers. In diploid males, transcript levels for dunce and paramyosin were increased during the first five days of adult life, while those for diacylglycerol kinase and the transcriptional co-repressor groucho diminished. Two general trends were apparent, (i) gene expression patterns in diploid males were overall more similar to haploid ones and workers than to queens, and (ii) in queens and workers, more genes were up-regulated after emergence until day five, whereas in diploid and especially so in haploid males more genes were down-regulated. This difference between the sexes may be related to longevity, which is much longer in females than in males. PMID:22215977
Transcript levels of ten caste-related genes in adult diploid males of Melipona quadrifasciata (Hymenoptera, Apidae) - A comparison with haploid males, queens and workers.

PubMed

Borges, Andreia A; Humann, Fernanda C; Oliveira Campos, Lucio A; Tavares, Mara G; Hartfelder, Klaus

2011-10-01

In Hymenoptera, homozygosity at the sex locus results in the production of diploid males. In social species, these pose a double burden by having low fitness and drawing resources normally spent for increasing the work force of a colony. Yet, diploid males are of academic interest as they can elucidate effects of ploidy (normal males are haploid, whereas the female castes, the queens and workers, are diploid) on morphology and life history. Herein we investigated expression levels of ten caste-related genes in the stingless bee Melipona quadrifasciata, comparing newly emerged and 5-day-old diploid males with haploid males, queens and workers. In diploid males, transcript levels for dunce and paramyosin were increased during the first five days of adult life, while those for diacylglycerol kinase and the transcriptional co-repressor groucho diminished. Two general trends were apparent, (i) gene expression patterns in diploid males were overall more similar to haploid ones and workers than to queens, and (ii) in queens and workers, more genes were up-regulated after emergence until day five, whereas in diploid and especially so in haploid males more genes were down-regulated. This difference between the sexes may be related to longevity, which is much longer in females than in males.
The Human Genome Project: Information access, management, and regulation. Final report

DOE Office of Scientific and Technical Information (OSTI.GOV)

McInerney, J.D.; Micikas, L.B.

The Human Genome Project is a large, internationally coordinated effort in biological research directed at creating a detailed map of human DNA. This report describes the access of information, management, and regulation of the project. The project led to the development of an instructional module titled The Human Genome Project: Biology, Computers, and Privacy, designed for use in high school biology classes. The module consists of print materials and both Macintosh and Windows versions of related computer software-Appendix A contains a copy of the print materials and discs containing the two versions of the software.
Human-specific protein isoforms produced by novel splice sites in the human genome after the human-chimpanzee divergence.

PubMed

Kim, Dong Seon; Hahn, Yoonsoo

2012-11-13

Evolution of splice sites is a well-known phenomenon that results in transcript diversity during human evolution. Many novel splice sites are derived from repetitive elements and may not contribute to protein products. Here, we analyzed annotated human protein-coding exons and identified human-specific splice sites that arose after the human-chimpanzee divergence. We analyzed multiple alignments of the annotated human protein-coding exons and their respective orthologous mammalian genome sequences to identify 85 novel splice sites (50 splice acceptors and 35 donors) in the human genome. The novel protein-coding exons, which are expressed either constitutively or alternatively, produce novel protein isoforms by insertion, deletion, or frameshift. We found three cases in which the human-specific isoform conferred novel molecular function in the human cells: the human-specific IMUP protein isoform induces apoptosis of the trophoblast and is implicated in pre-eclampsia; the intronization of a part of SMOX gene exon produces inactive spermine oxidase; the human-specific NUB1 isoform shows reduced interaction with ubiquitin-like proteins, possibly affecting ubiquitin pathways. Although the generation of novel protein isoforms does not equate to adaptive evolution, we propose that these cases are useful candidates for a molecular functional study to identify proteomic changes that might bring about novel phenotypes during human evolution.
An Upper Limit on the Functional Fraction of the Human Genome.

PubMed

Graur, Dan

2017-07-01

For the human population to maintain a constant size from generation to generation, an increase in fertility must compensate for the reduction in the mean fitness of the population caused, among others, by deleterious mutations. The required increase in fertility due to this mutational load depends on the number of sites in the genome that are functional, the mutation rate, and the fraction of deleterious mutations among all mutations in functional regions. These dependencies and the fact that there exists a maximum tolerable replacement level fertility can be used to put an upper limit on the fraction of the human genome that can be functional. Mutational load considerations lead to the conclusion that the functional fraction within the human genome cannot exceed 25%, and is probably considerably lower. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Human Cancer Models Initiative | Office of Cancer Genomics

Cancer.gov

The Human Cancer Models Initiative (HCMI) is an international consortium that is generating novel human tumor-derived culture models, which are annotated with genomic and clinical data. In an effort to advance cancer research and more fully understand how in vitro findings are related to clinical biology, HCMI-developed models and related data will be available as a community resource for cancer and other research.
Human Cancer Models Initiative | Office of Cancer Genomics

Cancer.gov

The Human Cancer Models Initiative (HCMI) is an international consortium that is generating novel human tumor-derived culture models, which are annotated with genomic and clinical data. In an effort to advance cancer research and more fully understand how in vitro findings are related to clinical biology, HCMI-developed models and related data will be available as a community resource for cancer research.
Human genome-microbiome interaction: metagenomics frontiers for the aetiopathology of autoimmune diseases

PubMed Central

Nalbantoglu, Ufuk

2017-01-01

A short while ago, the human genome and microbiome were analysed simultaneously for the first time as a multi-omic approach. The analyses of heterogeneous population cohorts showed that microbiome components were associated with human genome variations. In-depth analysis of these results reveals that the majority of those relationships are between immune pathways and autoimmune disease-associated microbiome components. Thus, it can be hypothesized that autoimmunity may be associated with homeostatic disequilibrium of the human-microbiome interactome. Further analysis of human genome–human microbiome relationships in disease contexts with tailored systems biology approaches may yield insights into disease pathogenesis and prognosis. PMID:28785422
Xenopus laevis ribosomal protein genes: isolation of recombinant cDNA clones and study of the genomic organization.

PubMed Central

Bozzoni, I; Beccari, E; Luo, Z X; Amaldi, F

1981-01-01

Poly-A+ mRNA from Xenopus laevis oocytes, partially enriched for r-protein coding capacity has been used as starting material for preparing a cDNA bank in plasmid pBR322. The clones containing sequences specific for r-proteins have been selected by translation of the complementary mRNAs. Clones for six different r-proteins have been identified and utilized as probes for studying their genomic organization. Two gene copies per haploid genome were found for r-proteins L1, L14, S19, and four-five for protein S1, S8 and L32. Moreover a population polymorphism has been observed for the genomic regions containing sequences for r-protein S1, S8 and L14. Images PMID:6112733
ENGINES: exploring single nucleotide variation in entire human genomes.

PubMed

Amigo, Jorge; Salas, Antonio; Phillips, Christopher

2011-04-19

Next generation ultra-sequencing technologies are starting to produce extensive quantities of data from entire human genome or exome sequences, and therefore new software is needed to present and analyse this vast amount of information. The 1000 Genomes project has recently released raw data for 629 complete genomes representing several human populations through their Phase I interim analysis and, although there are certain public tools available that allow exploration of these genomes, to date there is no tool that permits comprehensive population analysis of the variation catalogued by such data. We have developed a genetic variant site explorer able to retrieve data for Single Nucleotide Variation (SNVs), population by population, from entire genomes without compromising future scalability and agility. ENGINES (ENtire Genome INterface for Exploring SNVs) uses data from the 1000 Genomes Phase I to demonstrate its capacity to handle large amounts of genetic variation (>7.3 billion genotypes and 28 million SNVs), as well as deriving summary statistics of interest for medical and population genetics applications. The whole dataset is pre-processed and summarized into a data mart accessible through a web interface. The query system allows the combination and comparison of each available population sample, while searching by rs-number list, chromosome region, or genes of interest. Frequency and FST filters are available to further refine queries, while results can be visually compared with other large-scale Single Nucleotide Polymorphism (SNP) repositories such as HapMap or Perlegen. ENGINES is capable of accessing large-scale variation data repositories in a fast and comprehensive manner. It allows quick browsing of whole genome variation, while providing statistical information for each variant site such as allele frequency, heterozygosity or FST values for genetic differentiation. Access to the data mart generating scripts and to the web interface is granted from

In Silico Pattern-Based Analysis of the Human Cytomegalovirus Genome

PubMed Central

Rigoutsos, Isidore; Novotny, Jiri; Huynh, Tien; Chin-Bow, Stephen T.; Parida, Laxmi; Platt, Daniel; Coleman, David; Shenk, Thomas

2003-01-01

More than 200 open reading frames (ORFs) from the human cytomegalovirus genome have been reported as potentially coding for proteins. We have used two pattern-based in silico approaches to analyze this set of putative viral genes. With the help of an objective annotation method that is based on the Bio-Dictionary, a comprehensive collection of amino acid patterns that describes the currently known natural sequence space of proteins, we have reannotated all of the previously reported putative genes of the human cytomegalovirus. Also, with the help of MUSCA, a pattern-based multiple sequence alignment algorithm, we have reexamined the original human cytomegalovirus gene family definitions. Our analysis of the genome shows that many of the coded proteins comprise amino acid combinations that are unique to either the human cytomegalovirus or the larger group of herpesviruses. We have confirmed that a surprisingly large portion of the analyzed ORFs encode membrane proteins, and we have discovered a significant number of previously uncharacterized proteins that are predicted to be G-protein-coupled receptor homologues. The analysis also indicates that many of the encoded proteins undergo posttranslational modifications such as hydroxylation, phosphorylation, and glycosylation. ORFs encoding proteins with similar functional behavior appear in neighboring regions of the human cytomegalovirus genome. All of the results of the present study can be found and interactively explored online (http://cbcsrv.watson.ibm.com/virus/). PMID:12634390
In silico pattern-based analysis of the human cytomegalovirus genome.

PubMed

Rigoutsos, Isidore; Novotny, Jiri; Huynh, Tien; Chin-Bow, Stephen T; Parida, Laxmi; Platt, Daniel; Coleman, David; Shenk, Thomas

2003-04-01

More than 200 open reading frames (ORFs) from the human cytomegalovirus genome have been reported as potentially coding for proteins. We have used two pattern-based in silico approaches to analyze this set of putative viral genes. With the help of an objective annotation method that is based on the Bio-Dictionary, a comprehensive collection of amino acid patterns that describes the currently known natural sequence space of proteins, we have reannotated all of the previously reported putative genes of the human cytomegalovirus. Also, with the help of MUSCA, a pattern-based multiple sequence alignment algorithm, we have reexamined the original human cytomegalovirus gene family definitions. Our analysis of the genome shows that many of the coded proteins comprise amino acid combinations that are unique to either the human cytomegalovirus or the larger group of herpesviruses. We have confirmed that a surprisingly large portion of the analyzed ORFs encode membrane proteins, and we have discovered a significant number of previously uncharacterized proteins that are predicted to be G-protein-coupled receptor homologues. The analysis also indicates that many of the encoded proteins undergo posttranslational modifications such as hydroxylation, phosphorylation, and glycosylation. ORFs encoding proteins with similar functional behavior appear in neighboring regions of the human cytomegalovirus genome. All of the results of the present study can be found and interactively explored online (http://cbcsrv.watson.ibm.com/virus/).
The first near-complete assembly of the hexaploid bread wheat genome, Triticum aestivum.

PubMed

Zimin, Aleksey V; Puiu, Daniela; Hall, Richard; Kingan, Sarah; Clavijo, Bernardo J; Salzberg, Steven L

2017-11-01

Common bread wheat, Triticum aestivum, has one of the most complex genomes known to science, with 6 copies of each chromosome, enormous numbers of near-identical sequences scattered throughout, and an overall haploid size of more than 15 billion bases. Multiple past attempts to assemble the genome have produced assemblies that were well short of the estimated genome size. Here we report the first near-complete assembly of T. aestivum, using deep sequencing coverage from a combination of short Illumina reads and very long Pacific Biosciences reads. The final assembly contains 15 344 693 583 bases and has a weighted average (N50) contig size of 232 659 bases. This represents by far the most complete and contiguous assembly of the wheat genome to date, providing a strong foundation for future genetic studies of this important food crop. We also report how we used the recently published genome of Aegilops tauschii, the diploid ancestor of the wheat D genome, to identify 4 179 762 575 bp of T. aestivum that correspond to its D genome components. © The Author 2017. Published by Oxford University Press.
De-Novo Assembly and Analysis of the Heterozygous Triploid Genome of the Wine Spoilage Yeast Dekkera bruxellensis AWRI1499

PubMed Central

Chambers, Paul J.; Pretorius, Isak S.

2012-01-01

Despite its industrial importance, the yeast species Dekkera (Brettanomyces) bruxellensis has remained poorly understood at the genetic level. In this study we describe whole genome sequencing and analysis for a prevalent wine spoilage strain, AWRI1499. The 12.7 Mb assembly, consisting of 324 contigs in 99 scaffolds (super-contigs) at 26-fold coverage, exhibits a relatively high density of single nucleotide polymorphisms (SNPs). Haplotype sampling for 1.2% of open reading frames suggested that the D. bruxellensis AWRI1499 genome is comprised of a moderately heterozygous diploid genome, in combination with a divergent haploid genome. Gene content analysis revealed enrichment in membrane proteins, particularly transporters, along with oxidoreductase enzymes. Availability of this assembly and annotation provides a resource for further investigation of genomic organization in this species, and functional characterization of genes that may confer important phenotypic traits. PMID:22470482
Tripolar mitosis and partitioning of the genome arrests human preimplantation development in vitro.

PubMed

Ottolini, Christian S; Kitchen, John; Xanthopoulou, Leoni; Gordon, Tony; Summers, Michael C; Handyside, Alan H

2017-08-29

Following in vitro fertilisation (IVF), only about half of normally fertilised human embryos develop beyond cleavage and morula stages to form a blastocyst in vitro. Although many human embryos are aneuploid and genomically imbalanced, often as a result of meiotic errors inherited in the oocyte, these aneuploidies persist at the blastocyst stage and the reasons for the high incidence of developmental arrest remain unknown. Here we use genome-wide SNP genotyping and meiomapping of both polar bodies to identify maternal meiotic errors and karyomapping to fingerprint the parental chromosomes in single cells from disaggregated arrested embryos and excluded cells from blastocysts. Combined with time lapse imaging of development in culture, we demonstrate that tripolar mitoses in early cleavage cause chromosome dispersal to clones of cells with identical or closely related sub-diploid chromosome profiles resulting in intercellular partitioning of the genome. We hypothesise that following zygotic genome activation (ZGA), the combination of genomic imbalance and partial genome loss disrupts the normal pattern of embryonic gene expression blocking development at the morula-blastocyst transition. Failure to coordinate the cell cycle in early cleavage and regulate centrosome duplication is therefore a major cause of human preimplantation developmental arrest in vitro.
The genome in three dimensions: a new frontier in human brain research.

PubMed

Mitchell, Amanda C; Bharadwaj, Rahul; Whittle, Catheryne; Krueger, Winfried; Mirnics, Karoly; Hurd, Yasmin; Rasmussen, Theodore; Akbarian, Schahram

2014-06-15

Less than 1.5% of the human genome encodes protein. However, vast portions of the human genome are subject to transcriptional and epigenetic regulation, and many noncoding regulatory DNA elements are thought to regulate the spatial organization of interphase chromosomes. For example, chromosomal "loopings" are pivotal for the orderly process of gene expression, by enabling distal regulatory enhancer or silencer elements to directly interact with proximal promoter and transcription start sites, potentially bypassing hundreds of kilobases of interspersed sequence on the linear genome. To date, however, epigenetic studies in the human brain are mostly limited to the exploration of DNA methylation and posttranslational modifications of the nucleosome core histones. In contrast, very little is known about the regulation of supranucleosomal structures. Here, we show that chromosome conformation capture, a widely used approach to study higher-order chromatin, is applicable to tissue collected postmortem, thereby informing about genome organization in the human brain. We introduce chromosome conformation capture protocols for brain and compare higher-order chromatin structures at the chromosome 6p22.2-22.1 schizophrenia and bipolar disorder susceptibility locus, and additional neurodevelopmental risk genes, (DPP10, MCPH1) in adult prefrontal cortex and various cell culture systems, including neurons derived from reprogrammed skin cells. We predict that the exploration of three-dimensional genome architectures and function will open up new frontiers in human brain research and psychiatric genetics and provide novel insights into the epigenetic risk architectures of regulatory noncoding DNA. Copyright © 2014 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.
The human genome project: Prospects and implications for clinical medicine

DOE Office of Scientific and Technical Information (OSTI.GOV)

Green, E.D.; Waterston, R.H.

1991-10-09

The recently initiated human genome project is a large international effort to elucidate the genetic architecture of the genomes of man and several model organisms. The initial phases of this endeavor involve the establishment of rough blueprints (maps) of the genetic landscape of these genomes, with the long-term goal of determining their precise nucleotide sequences and identifying the genes. The knowledge gained by these studies will provide a vital tool for the study of many biologic processes and will have a profound impact on clinical medicine.
New bioinformatic tool for quick identification of functionally relevant endogenous retroviral inserts in human genome.

PubMed

Garazha, Andrew; Ivanova, Alena; Suntsova, Maria; Malakhova, Galina; Roumiantsev, Sergey; Zhavoronkov, Alex; Buzdin, Anton

2015-01-01

Endogenous retroviruses (ERVs) and LTR retrotransposons (LRs) occupy ∼8% of human genome. Deep sequencing technologies provide clues to understanding of functional relevance of individual ERVs/LRs by enabling direct identification of transcription factor binding sites (TFBS) and other landmarks of functional genomic elements. Here, we performed the genome-wide identification of human ERVs/LRs containing TFBS according to the ENCODE project. We created the first interactive ERV/LRs database that groups the individual inserts according to their familial nomenclature, number of mapped TFBS and divergence from their consensus sequence. Information on any particular element can be easily extracted by the user. We also created a genome browser tool, which enables quick mapping of any ERV/LR insert according to genomic coordinates, known human genes and TFBS. These tools can be used to easily explore functionally relevant individual ERV/LRs, and for studying their impact on the regulation of human genes. Overall, we identified ∼110,000 ERV/LR genomic elements having TFBS. We propose a hypothesis of "domestication" of ERV/LR TFBS by the genome milieu including subsequent stages of initial epigenetic repression, partial functional release, and further mutation-driven reshaping of TFBS in tight coevolution with the enclosing genomic loci.
Monkeypox Virus Host Factor Screen Using Haploid Cells Identifies Essential Role of GARP Complex in Extracellular Virus Formation.

PubMed

Realegeno, Susan; Puschnik, Andreas S; Kumar, Amrita; Goldsmith, Cynthia; Burgado, Jillybeth; Sambhara, Suryaprakash; Olson, Victoria A; Carroll, Darin; Damon, Inger; Hirata, Tetsuya; Kinoshita, Taroh; Carette, Jan E; Satheshkumar, Panayampalli Subbian

2017-06-01

Monkeypox virus (MPXV) is a human pathogen that is a member of the Orthopoxvirus genus, which includes Vaccinia virus and Variola virus (the causative agent of smallpox). Human monkeypox is considered an emerging zoonotic infectious disease. To identify host factors required for MPXV infection, we performed a genome-wide insertional mutagenesis screen in human haploid cells. The screen revealed several candidate genes, including those involved in Golgi trafficking, glycosaminoglycan biosynthesis, and glycosylphosphatidylinositol (GPI)-anchor biosynthesis. We validated the role of a set of vacuolar protein sorting (VPS) genes during infection, VPS51 to VPS54 (VPS51-54), which comprise the Golgi-associated retrograde protein (GARP) complex. The GARP complex is a tethering complex involved in retrograde transport of endosomes to the trans -Golgi apparatus. Our data demonstrate that VPS52 and VPS54 were dispensable for mature virion (MV) production but were required for extracellular virus (EV) formation. For comparison, a known antiviral compound, ST-246, was used in our experiments, demonstrating that EV titers in VPS52 and VPS54 knockout (KO) cells were comparable to levels exhibited by ST-246-treated wild-type cells. Confocal microscopy was used to examine actin tail formation, one of the viral egress mechanisms for cell-to-cell dissemination, and revealed an absence of actin tails in VPS52KO- or VPS54KO-infected cells. Further evaluation of these cells by electron microscopy demonstrated a decrease in levels of wrapped viruses (WVs) compared to those seen with the wild-type control. Collectively, our data demonstrate the role of GARP complex genes in double-membrane wrapping of MVs necessary for EV formation, implicating the host endosomal trafficking pathway in orthopoxvirus infection. IMPORTANCE Human monkeypox is an emerging zoonotic infectious disease caused by Monkeypox virus (MPXV). Of the two MPXV clades, the Congo Basin strain is associated with severe
ARKS: chromosome-scale scaffolding of human genome drafts with linked read kmers.

PubMed

Coombe, Lauren; Zhang, Jessica; Vandervalk, Benjamin P; Chu, Justin; Jackman, Shaun D; Birol, Inanc; Warren, René L

2018-06-20

The long-range sequencing information captured by linked reads, such as those available from 10× Genomics (10xG), helps resolve genome sequence repeats, and yields accurate and contiguous draft genome assemblies. We introduce ARKS, an alignment-free linked read genome scaffolding methodology that uses linked reads to organize genome assemblies further into contiguous drafts. Our approach departs from other read alignment-dependent linked read scaffolders, including our own (ARCS), and uses a kmer-based mapping approach. The kmer mapping strategy has several advantages over read alignment methods, including better usability and faster processing, as it precludes the need for input sequence formatting and draft sequence assembly indexing. The reliance on kmers instead of read alignments for pairing sequences relaxes the workflow requirements, and drastically reduces the run time. Here, we show how linked reads, when used in conjunction with Hi-C data for scaffolding, improve a draft human genome assembly of PacBio long-read data five-fold (baseline vs. ARKS NG50 = 4.6 vs. 23.1 Mbp, respectively). We also demonstrate how the method provides further improvements of a megabase-scale Supernova human genome assembly (NG50 = 14.74 Mbp vs. 25.94 Mbp before and after ARKS), which itself exclusively uses linked read data for assembly, with an execution speed six to nine times faster than competitive linked read scaffolders (~ 10.5 h compared to 75.7 h, on average). Following ARKS scaffolding of a human genome 10xG Supernova assembly (of cell line NA12878), fewer than 9 scaffolds cover each chromosome, except the largest (chromosome 1, n = 13). ARKS uses a kmer mapping strategy instead of linked read alignments to record and associate the barcode information needed to order and orient draft assembly sequences. The simplified workflow, when compared to that of our initial implementation, ARCS, markedly improves run time performances on experimental human genome
Insights into Land Plant Evolution Garnered from the Marchantia polymorpha Genome.

PubMed

Bowman, John L; Kohchi, Takayuki; Yamato, Katsuyuki T; Jenkins, Jerry; Shu, Shengqiang; Ishizaki, Kimitsune; Yamaoka, Shohei; Nishihama, Ryuichi; Nakamura, Yasukazu; Berger, Frédéric; Adam, Catherine; Aki, Shiori Sugamata; Althoff, Felix; Araki, Takashi; Arteaga-Vazquez, Mario A; Balasubrmanian, Sureshkumar; Barry, Kerrie; Bauer, Diane; Boehm, Christian R; Briginshaw, Liam; Caballero-Perez, Juan; Catarino, Bruno; Chen, Feng; Chiyoda, Shota; Chovatia, Mansi; Davies, Kevin M; Delmans, Mihails; Demura, Taku; Dierschke, Tom; Dolan, Liam; Dorantes-Acosta, Ana E; Eklund, D Magnus; Florent, Stevie N; Flores-Sandoval, Eduardo; Fujiyama, Asao; Fukuzawa, Hideya; Galik, Bence; Grimanelli, Daniel; Grimwood, Jane; Grossniklaus, Ueli; Hamada, Takahiro; Haseloff, Jim; Hetherington, Alexander J; Higo, Asuka; Hirakawa, Yuki; Hundley, Hope N; Ikeda, Yoko; Inoue, Keisuke; Inoue, Shin-Ichiro; Ishida, Sakiko; Jia, Qidong; Kakita, Mitsuru; Kanazawa, Takehiko; Kawai, Yosuke; Kawashima, Tomokazu; Kennedy, Megan; Kinose, Keita; Kinoshita, Toshinori; Kohara, Yuji; Koide, Eri; Komatsu, Kenji; Kopischke, Sarah; Kubo, Minoru; Kyozuka, Junko; Lagercrantz, Ulf; Lin, Shih-Shun; Lindquist, Erika; Lipzen, Anna M; Lu, Chia-Wei; De Luna, Efraín; Martienssen, Robert A; Minamino, Naoki; Mizutani, Masaharu; Mizutani, Miya; Mochizuki, Nobuyoshi; Monte, Isabel; Mosher, Rebecca; Nagasaki, Hideki; Nakagami, Hirofumi; Naramoto, Satoshi; Nishitani, Kazuhiko; Ohtani, Misato; Okamoto, Takashi; Okumura, Masaki; Phillips, Jeremy; Pollak, Bernardo; Reinders, Anke; Rövekamp, Moritz; Sano, Ryosuke; Sawa, Shinichiro; Schmid, Marc W; Shirakawa, Makoto; Solano, Roberto; Spunde, Alexander; Suetsugu, Noriyuki; Sugano, Sumio; Sugiyama, Akifumi; Sun, Rui; Suzuki, Yutaka; Takenaka, Mizuki; Takezawa, Daisuke; Tomogane, Hirokazu; Tsuzuki, Masayuki; Ueda, Takashi; Umeda, Masaaki; Ward, John M; Watanabe, Yuichiro; Yazaki, Kazufumi; Yokoyama, Ryusuke; Yoshitake, Yoshihiro; Yotsui, Izumi; Zachgo, Sabine; Schmutz, Jeremy

2017-10-05

The evolution of land flora transformed the terrestrial environment. Land plants evolved from an ancestral charophycean alga from which they inherited developmental, biochemical, and cell biological attributes. Additional biochemical and physiological adaptations to land, and a life cycle with an alternation between multicellular haploid and diploid generations that facilitated efficient dispersal of desiccation tolerant spores, evolved in the ancestral land plant. We analyzed the genome of the liverwort Marchantia polymorpha, a member of a basal land plant lineage. Relative to charophycean algae, land plant genomes are characterized by genes encoding novel biochemical pathways, new phytohormone signaling pathways (notably auxin), expanded repertoires of signaling pathways, and increased diversity in some transcription factor families. Compared with other sequenced land plants, M. polymorpha exhibits low genetic redundancy in most regulatory pathways, with this portion of its genome resembling that predicted for the ancestral land plant. PAPERCLIP. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.
Multiplicity of genome equivalents in the radiation-resistant bacterium Micrococcus radiodurans.

PubMed Central

Hansen, M T

1978-01-01

The complexity of the genome of Micrococcus radiodurans was determined to be (2.0 +/- 0.3) X 10(9) daltons by DNA renaturation kinetics. The number of genome equivalents of DNA per cell was calculated from the complexity and the content of DNA. A lower limit of four genome equivalents per cell was approached with decreasing growth rate. Thus, no haploid stage appeared to be realized in this organism. The replication time was estimated from the kinetics and amount of residual DNA synthesis after inhibiting initiation of new rounds of replication. From this, the redundancy of terminal genetic markers was calculated to vary with growth rate from four to approximately eight copies per cell. All genetic material, including the least abundant, is thus multiply represented in each cell. The potential significance of the maintenance in each cell of multiple gene copies is discussed in relation to the extreme radiation resistance of M. radiodurans. PMID:649572
Evolutionary interrogation of human biology in well-annotated genomic framework of rhesus macaque.

PubMed

Zhang, Shi-Jian; Liu, Chu-Jun; Yu, Peng; Zhong, Xiaoming; Chen, Jia-Yu; Yang, Xinzhuang; Peng, Jiguang; Yan, Shouyu; Wang, Chenqu; Zhu, Xiaotong; Xiong, Jingwei; Zhang, Yong E; Tan, Bertrand Chin-Ming; Li, Chuan-Yun

2014-05-01

With genome sequence and composition highly analogous to human, rhesus macaque represents a unique reference for evolutionary studies of human biology. Here, we developed a comprehensive genomic framework of rhesus macaque, the RhesusBase2, for evolutionary interrogation of human genes and the associated regulations. A total of 1,667 next-generation sequencing (NGS) data sets were processed, integrated, and evaluated, generating 51.2 million new functional annotation records. With extensive NGS annotations, RhesusBase2 refined the fine-scale structures in 30% of the macaque Ensembl transcripts, reporting an accurate, up-to-date set of macaque gene models. On the basis of these annotations and accurate macaque gene models, we further developed an NGS-oriented Molecular Evolution Gateway to access and visualize macaque annotations in reference to human orthologous genes and associated regulations (www.rhesusbase.org/molEvo). We highlighted the application of this well-annotated genomic framework in generating hypothetical link of human-biased regulations to human-specific traits, by using mechanistic characterization of the DIEXF gene as an example that provides novel clues to the understanding of digestive system reduction in human evolution. On a global scale, we also identified a catalog of 9,295 human-biased regulatory events, which may represent novel elements that have a substantial impact on shaping human transcriptome and possibly underpin recent human phenotypic evolution. Taken together, we provide an NGS data-driven, information-rich framework that will broadly benefit genomics research in general and serves as an important resource for in-depth evolutionary studies of human biology.
A novel tandem repeat sequence located on human chromosome 4p: isolation and characterization.

PubMed

Kogi, M; Fukushige, S; Lefevre, C; Hadano, S; Ikeda, J E

1997-06-01

In an effort to analyze the genomic region of the distal half of human chromosome 4p, to where Huntington disease and other diseases have been mapped, we have isolated the cosmid clone (CRS447) that was likely to contain a region with specific repeat sequences. Clone CRS447 was subjected to detailed analysis, including chromosome mapping, restriction mapping, and DNA sequencing. Chromosome mapping by both a human-CHO hybrid cell panel and FISH revealed that CRS447 was predominantly located in the 4p15.1-15.3 region. CRS447 was shown to consist of tandem repeats of 4.7-kb units present on chromosome 4p. A single EcoRI unit was subcloned (pRS447), and the complete sequence was determined as 4752 nucleotides. When pRS447 was used as a probe, the number of copies of this repeat per haploid genome was estimated to be 50-70. Sequence analysis revealed that it contained two internal CA repeats and one putative ORF. Database search established that this sequence was unreported. However, two homologous STS markers were found in the database. We concluded that CRS447/pRS447 is a novel tandem repeat sequence that is mainly specific to human chromosome 4p.
Predicting human genetic interactions from cancer genome evolution.

PubMed

Lu, Xiaowen; Megchelenbrink, Wout; Notebaart, Richard A; Huynen, Martijn A

2015-01-01

Synthetic Lethal (SL) genetic interactions play a key role in various types of biological research, ranging from understanding genotype-phenotype relationships to identifying drug-targets against cancer. Despite recent advances in empirical measuring SL interactions in human cells, the human genetic interaction map is far from complete. Here, we present a novel approach to predict this map by exploiting patterns in cancer genome evolution. First, we show that empirically determined SL interactions are reflected in various gene presence, absence, and duplication patterns in hundreds of cancer genomes. The most evident pattern that we discovered is that when one member of an SL interaction gene pair is lost, the other gene tends not to be lost, i.e. the absence of co-loss. This observation is in line with expectation, because the loss of an SL interacting pair will be lethal to the cancer cell. SL interactions are also reflected in gene expression profiles, such as an under representation of cases where the genes in an SL pair are both under expressed, and an over representation of cases where one gene of an SL pair is under expressed, while the other one is over expressed. We integrated the various previously unknown cancer genome patterns and the gene expression patterns into a computational model to identify SL pairs. This simple, genome-wide model achieves a high prediction power (AUC = 0.75) for known genetic interactions. It allows us to present for the first time a comprehensive genome-wide list of SL interactions with a high estimated prediction precision, covering up to 591,000 gene pairs. This unique list can potentially be used in various application areas ranging from biotechnology to medical genetics.
Hybrid origin of gynogenetic clones and the introgression of their mitochondrial genome into sexual diploids through meiotic hybridogenesis in the loach, Misgurnus anguillicuadatus.

PubMed

Yamada, Aya; Kodo, Yukihiro; Murakami, Masaru; Kuroda, Masamichi; Aoki, Takao; Fujimoto, Takafumi; Arai, Katsutoshi

2015-11-01

In a few Japanese populations of the loach Misgurnus anguillicaudatus (Teleostei: Cobitidae), clonal diploid lineages produce unreduced diploid eggs that normally undergo gynogenetic reproduction; however the origin of these clones remains elusive. Here, we show the presence of two diverse clades, A and B, within this loach species from sequence analyses of two nuclear genes RAG1 (recombination activating gene 1) and IRBP2 (interphotoreceptor retinoid-binding protein, 2) and then demonstrate heterozygous genotypes fixed at the two loci as the evidence of the hybrid nature of clonal lineages. All the clonal individuals were identified by clone-specific mitochondrial DNA haplotypes, microsatellite genotypes, and random amplified polymorphic DNA fingerprints; they commonly showed two alleles, one from clade A and another from clade B, whereas other wild-type diploids possessed alleles from either clade A or B. However, we also found wild-type diploids with clone-specific mitochondrial DNA and nuclear genes from clade B. One possible explanation is an introgression of a clone-specific mitochondrial genome from clonal to these wild-type loaches. These individuals likely arose by a cross between haploid sperm from bisexual B clade males and haploid eggs with clone-specific mtDNA and clade B nuclear genome, produced by meiotic hybridogenesis (elimination of unmatched A genome followed by meiosis after preferential pairing between two matched B genomes) in clone-origin triploid individual (ABB). © 2015 Wiley Periodicals, Inc.
A Single Multiplex crRNA Array for FnCpf1-Mediated Human Genome Editing.

PubMed

Sun, Huihui; Li, Fanfan; Liu, Jie; Yang, Fayu; Zeng, Zhenhai; Lv, Xiujuan; Tu, Mengjun; Liu, Yeqing; Ge, Xianglian; Liu, Changbao; Zhao, Junzhao; Zhang, Zongduan; Qu, Jia; Song, Zongming; Gu, Feng

2018-06-15

Cpf1 has been harnessed as a tool for genome manipulation in various species because of its simplicity and high efficiency. Our recent study demonstrated that FnCpf1 could be utilized for human genome editing with notable advantages for target sequence selection due to the flexibility of the protospacer adjacent motif (PAM) sequence. Multiplex genome editing provides a powerful tool for targeting members of multigene families, dissecting gene networks, modeling multigenic disorders in vivo, and applying gene therapy. However, there are no reports at present that show FnCpf1-mediated multiplex genome editing via a single customized CRISPR RNA (crRNA) array. In the present study, we utilize a single customized crRNA array to simultaneously target multiple genes in human cells. In addition, we also demonstrate that a single customized crRNA array to target multiple sites in one gene could be achieved. Collectively, FnCpf1, a powerful genome-editing tool for multiple genomic targets, can be harnessed for effective manipulation of the human genome. Copyright © 2018 The American Society of Gene and Cell Therapy. Published by Elsevier Inc. All rights reserved.
Human-specific protein isoforms produced by novel splice sites in the human genome after the human-chimpanzee divergence

PubMed Central

2012-01-01

Background Evolution of splice sites is a well-known phenomenon that results in transcript diversity during human evolution. Many novel splice sites are derived from repetitive elements and may not contribute to protein products. Here, we analyzed annotated human protein-coding exons and identified human-specific splice sites that arose after the human-chimpanzee divergence. Results We analyzed multiple alignments of the annotated human protein-coding exons and their respective orthologous mammalian genome sequences to identify 85 novel splice sites (50 splice acceptors and 35 donors) in the human genome. The novel protein-coding exons, which are expressed either constitutively or alternatively, produce novel protein isoforms by insertion, deletion, or frameshift. We found three cases in which the human-specific isoform conferred novel molecular function in the human cells: the human-specific IMUP protein isoform induces apoptosis of the trophoblast and is implicated in pre-eclampsia; the intronization of a part of SMOX gene exon produces inactive spermine oxidase; the human-specific NUB1 isoform shows reduced interaction with ubiquitin-like proteins, possibly affecting ubiquitin pathways. Conclusions Although the generation of novel protein isoforms does not equate to adaptive evolution, we propose that these cases are useful candidates for a molecular functional study to identify proteomic changes that might bring about novel phenotypes during human evolution. PMID:23148531
An Integrated Encyclopedia of DNA Elements in the Human Genome

PubMed Central

2012-01-01

Summary The human genome encodes the blueprint of life, but the function of the vast majority of its nearly three billion bases is unknown. The Encyclopedia of DNA Elements (ENCODE) project has systematically mapped regions of transcription, transcription factor association, chromatin structure, and histone modification. These data enabled us to assign biochemical functions for 80% of the genome, in particular outside of the well-studied protein-coding regions. Many discovered candidate regulatory elements are physically associated with one another and with expressed genes, providing new insights into the mechanisms of gene regulation. The newly identified elements also show a statistical correspondence to sequence variants linked to human disease, and can thereby guide interpretation of this variation. Overall the project provides new insights into the organization and regulation of our genes and genome, and an expansive resource of functional annotations for biomedical research. PMID:22955616
Discovery and Characterization of Chromatin States for Systematic Annotation of the Human Genome

NASA Astrophysics Data System (ADS)

Ernst, Jason; Kellis, Manolis

A plethora of epigenetic modifications have been described in the human genome and shown to play diverse roles in gene regulation, cellular differentiation and the onset of disease. Although individual modifications have been linked to the activity levels of various genetic functional elements, their combinatorial patterns are still unresolved and their potential for systematic de novo genome annotation remains untapped. Here, we use a multivariate Hidden Markov Model to reveal chromatin states in human T cells, based on recurrent and spatially coherent combinations of chromatin marks.We define 51 distinct chromatin states, including promoter-associated, transcription-associated, active intergenic, largescale repressed and repeat-associated states. Each chromatin state shows specific enrichments in functional annotations, sequence motifs and specific experimentally observed characteristics, suggesting distinct biological roles. This approach provides a complementary functional annotation of the human genome that reveals the genome-wide locations of diverse classes of epigenetic function.

Genetic recombination pathways and their application for genome modification of human embryonic stem cells.

PubMed

Nieminen, Mikko; Tuuri, Timo; Savilahti, Harri

2010-10-01

Human embryonic stem cells are pluripotent cells derived from early human embryo and retain a potential to differentiate into all adult cell types. They provide vast opportunities in cell replacement therapies and are expected to become significant tools in drug discovery as well as in the studies of cellular and developmental functions of human genes. The progress in applying different types of DNA recombination reactions for genome modification in a variety of eukaryotic cell types has provided means to utilize recombination-based strategies also in human embryonic stem cells. Homologous recombination-based methods, particularly those utilizing extended homologous regions and those employing zinc finger nucleases to boost genomic integration, have shown their usefulness in efficient genome modification. Site-specific recombination systems are potent genome modifiers, and they can be used to integrate DNA into loci that contain an appropriate recombination signal sequence, either naturally occurring or suitably pre-engineered. Non-homologous recombination can be used to generate random integrations in genomes relatively effortlessly, albeit with a moderate efficiency and precision. DNA transposition-based strategies offer substantially more efficient random strategies and provide means to generate single-copy insertions, thus potentiating the generation of genome-wide insertion libraries applicable in genetic screens. 2010 Elsevier Inc. All rights reserved.
Towards the delineation of the ancestral eutherian genome organization: comparative genome maps of human and the African elephant (Loxodonta africana) generated by chromosome painting.

PubMed Central

Frönicke, Lutz; Wienberg, Johannes; Stone, Gary; Adams, Lisa; Stanyon, Roscoe

2003-01-01

This study presents a whole-genome comparison of human and a representative of the Afrotherian clade, the African elephant, generated by reciprocal Zoo-FISH. An analysis of Afrotheria genomes is of special interest, because recent DNA sequence comparisons identify them as the oldest placental mammalian clade. Complete sets of whole-chromosome specific painting probes for the African elephant and human were constructed by degenerate oligonucleotide-primed PCR amplification of flow-sorted chromosomes. Comparative genome maps are presented based on their hybridization patterns. These maps show that the elephant has a moderately rearranged chromosome complement when compared to humans. The human paint probes identified 53 evolutionary conserved segments on the 27 autosomal elephant chromosomes and the X chromosome. Reciprocal experiments with elephant probes delineated 68 conserved segments in the human genome. The comparison with a recent aardvark and elephant Zoo-FISH study delineates new chromosomal traits which link the two Afrotherian species phylogenetically. In the absence of any morphological evidence the chromosome painting data offer the first non-DNA sequence support for an Afrotherian clade. The comparative human and elephant genome maps provide new insights into the karyotype organization of the proto-afrotherian, the ancestor of extant placental mammals, which most probably consisted of 2n=46 chromosomes. PMID:12965023
Functional Genomic Screening Approaches in Mechanistic Toxicology and Potential Future Applications of CRISPR-Cas9

PubMed Central

Shen, Hua; McHale, Cliona M.; Smith, Martyn T; Zhang, Luoping

2015-01-01

Characterizing variability in the extent and nature of responses to environmental exposures is a critical aspect of human health risk assessment. Chemical toxicants act by many different mechanisms, however, and the genes involved in adverse outcome pathways (AOPs) and AOP networks are not yet characterized. Functional genomic approaches can reveal both toxicity pathways and susceptibility genes, through knockdown or knockout of all non-essential genes in a cell of interest, and identification of genes associated with a toxicity phenotype following toxicant exposure. Screening approaches in yeast and human near-haploid leukemic KBM7 cells, have identified roles for genes and pathways involved in response to many toxicants but are limited by partial homology among yeast and human genes and limited relevance to normal diploid cells. RNA interference (RNAi) suppresses mRNA expression level but is limited by off-target effects (OTEs) and incomplete knockdown. The recently developed gene editing approach called clustered regularly interspaced short palindrome repeats-associated nuclease (CRISPR)-Cas9, can precisely knock-out most regions of the genome at the DNA level with fewer OTEs than RNAi, in multiple human cell types, thus overcoming the limitations of the other approaches. It has been used to identify genes involved in the response to chemical and microbial toxicants in several human cell types and could readily be extended to the systematic screening of large numbers of environmental chemicals. CRISPR-Cas9 can also repress and activate gene expression, including that of non-coding RNA, with near-saturation, thus offering the potential to more fully characterize AOPs and AOP networks. Finally, CRISPR-Cas9 can generate complex animal models in which to conduct preclinical toxicity testing at the level of individual genotypes or haplotypes. Therefore, CRISPR-Cas9 is a powerful and flexible functional genomic screening approach that can be harnessed to provide
Analysis of the full genome of human group C rotaviruses reveals lineage diversification and reassortment.

PubMed

Medici, Maria Cristina; Tummolo, Fabio; Martella, Vito; Arcangeletti, Maria Cristina; De Conto, Flora; Chezzi, Carlo; Fehér, Enikő; Marton, Szilvia; Calderaro, Adriana; Bányai, Krisztián

2016-08-01

Group C rotaviruses (RVC) are enteric pathogens of humans and animals. Whole-genome sequences are available only for few RVCs, leaving gaps in our knowledge about their genetic diversity. We determined the full-length genome sequence of two human RVCs (PR2593/2004 and PR713/2012), detected in Italy from hospital-based surveillance for rotavirus infection in 2004 and 2012. In the 11 RNA genomic segments, the two Italian RVCs segregated within separate intra-genotypic lineages showed variation ranging from 1.9 % (VP6) to 15.9 % (VP3) at the nucleotide level. Comprehensive analysis of human RVC sequences available in the databases allowed us to reveal the existence of at least two major genome configurations, defined as type I and type II. Human RVCs of type I were all associated with the M3 VP3 genotype, including the Italian strain PR2593/2004. Conversely, human RVCs of type II were all associated with the M2 VP3 genotype, including the Italian strain PR713/2012. Reassortant RVC strains between these major genome configurations were identified. Although only a few full-genome sequences of human RVCs, mostly of Asian origin, are available, the analysis of human RVC sequences retrieved from the databases indicates that at least two intra-genotypic RVC lineages circulate in European countries. Gathering more sequence data is necessary to develop a standardized genotype and intra-genotypic lineage classification system useful for epidemiological investigations and avoiding confusion in the literature.
Identification and Classification of Conserved RNA Secondary Structures in the Human Genome

PubMed Central

Pedersen, Jakob Skou; Bejerano, Gill; Siepel, Adam; Rosenbloom, Kate; Lindblad-Toh, Kerstin; Lander, Eric S; Kent, Jim; Miller, Webb; Haussler, David

2006-01-01

The discoveries of microRNAs and riboswitches, among others, have shown functional RNAs to be biologically more important and genomically more prevalent than previously anticipated. We have developed a general comparative genomics method based on phylogenetic stochastic context-free grammars for identifying functional RNAs encoded in the human genome and used it to survey an eight-way genome-wide alignment of the human, chimpanzee, mouse, rat, dog, chicken, zebra-fish, and puffer-fish genomes for deeply conserved functional RNAs. At a loose threshold for acceptance, this search resulted in a set of 48,479 candidate RNA structures. This screen finds a large number of known functional RNAs, including 195 miRNAs, 62 histone 3′UTR stem loops, and various types of known genetic recoding elements. Among the highest-scoring new predictions are 169 new miRNA candidates, as well as new candidate selenocysteine insertion sites, RNA editing hairpins, RNAs involved in transcript auto regulation, and many folds that form singletons or small functional RNA families of completely unknown function. While the rate of false positives in the overall set is difficult to estimate and is likely to be substantial, the results nevertheless provide evidence for many new human functional RNAs and present specific predictions to facilitate their further characterization. PMID:16628248
The humankind genome: from genetic diversity to the origin of human diseases.

PubMed

Belizário, Jose E

2013-12-01

Genome-wide association studies have failed to establish common variant risk for the majority of common human diseases. The underlying reasons for this failure are explained by recent studies of resequencing and comparison of over 1200 human genomes and 10 000 exomes, together with the delineation of DNA methylation patterns (epigenome) and full characterization of coding and noncoding RNAs (transcriptome) being transcribed. These studies have provided the most comprehensive catalogues of functional elements and genetic variants that are now available for global integrative analysis and experimental validation in prospective cohort studies. With these datasets, researchers will have unparalleled opportunities for the alignment, mining, and testing of hypotheses for the roles of specific genetic variants, including copy number variations, single nucleotide polymorphisms, and indels as the cause of specific phenotypes and diseases. Through the use of next-generation sequencing technologies for genotyping and standardized ontological annotation to systematically analyze the effects of genomic variation on humans and model organism phenotypes, we will be able to find candidate genes and new clues for disease's etiology and treatment. This article describes essential concepts in genetics and genomic technologies as well as the emerging computational framework to comprehensively search websites and platforms available for the analysis and interpretation of genomic data.
Human Ageing Genomic Resources: new and updated databases

PubMed Central

Tacutu, Robi; Thornton, Daniel; Johnson, Emily; Budovsky, Arie; Barardo, Diogo; Craig, Thomas; Diana, Eugene; Lehmann, Gilad; Toren, Dmitri; Wang, Jingwei; Fraifeld, Vadim E

2018-01-01

Abstract In spite of a growing body of research and data, human ageing remains a poorly understood process. Over 10 years ago we developed the Human Ageing Genomic Resources (HAGR), a collection of databases and tools for studying the biology and genetics of ageing. Here, we present HAGR’s main functionalities, highlighting new additions and improvements. HAGR consists of six core databases: (i) the GenAge database of ageing-related genes, in turn composed of a dataset of >300 human ageing-related genes and a dataset with >2000 genes associated with ageing or longevity in model organisms; (ii) the AnAge database of animal ageing and longevity, featuring >4000 species; (iii) the GenDR database with >200 genes associated with the life-extending effects of dietary restriction; (iv) the LongevityMap database of human genetic association studies of longevity with >500 entries; (v) the DrugAge database with >400 ageing or longevity-associated drugs or compounds; (vi) the CellAge database with >200 genes associated with cell senescence. All our databases are manually curated by experts and regularly updated to ensure a high quality data. Cross-links across our databases and to external resources help researchers locate and integrate relevant information. HAGR is freely available online (http://genomics.senescence.info/). PMID:29121237
Dose dependence of the excision of ultraviolet-induced pyrimidine dimers from nuclear deoxyribonucleic acids of haploid and diploid Saccharomyces cerevisiae.

PubMed Central

Waters, R; Moustacchi, E

1975-01-01

The yield of ultraviolet-induced dimers is similar for a fixed dose in both haploid and diploid Saccharomyces cerevisiae. The excision of these photo-products from the nuclear deoxyribonucleic acids of cells of both ploidies after ultraviolet incident doses of 2 times 10-3 to 4 times 10-3 ergs/mm2 decreased with the corresponding increasing dose. Postirradiation incubation in saline followed by a further incubation in nutrient medium increases the excision as compared to that seen in either nutrient medium or saline alone. Previous data regarding both pyrimidine dimer removal and the survival of haploid and diploid cells after ultraviolet irradiation and either immediate or delayed plating are discussed. PMID:1090608
One haploid parent contributes 100% of the gene pool for a widespread species in northwest North America.

PubMed

Karlin, E F; Andrus, R E; Boles, S B; Shaw, A J

2011-02-01

The monoicous peatmoss Sphagnum subnitens has a tripartite distribution that includes disjunct population systems in Europe (including the Azores), northwestern North America and New Zealand. Regional genetic diversity was highest in European S. subnitens but in northwestern North America, a single microsatellite-based multilocus haploid genotype was detected across 16 sites ranging from Coos County, Oregon, to Kavalga Island in the Western Aleutians (a distance of some 4115 km). Two multilocus haploid genotypes were detected across 14 sites on South Island, New Zealand. The microsatellite-based regional genetic diversity detected in New Zealand and North American S. subnitens is the lowest reported for any Sphagnum. The low genetic diversity detected in both of these regions most likely resulted from a founder event associated with vegetative propagation and complete selfing, with one founding haploid plant in northwest North America and two in New Zealand. Thus, one plant appears to have contributed 100% of the gene pool for the population systems of S. subnitens occurring in northwest North America, and this is arguably the most genetically uniform group of plants having a widespread distribution yet detected. Although having a distribution spanning 12.5° of latitude and 56° of longitude, there was no evidence of any genetic diversification in S. subnitens in northwest North America. No genetic structure was detected among the three regions, and it appears that European plants of S. subnitens provided the source for New Zealand and northwest North American populations. © 2010 Blackwell Publishing Ltd.
Combination of reversible male sterility and doubled haploid production by targeted inactivation of cytoplasmic glutamine synthetase in developing anthers and pollen.

PubMed

Ribarits, Alexandra; Mamun, A N K; Li, Shipeng; Resch, Tatiana; Fiers, Martijn; Heberle-Bors, Erwin; Liu, Chun-Ming; Touraev, Alisher

2007-07-01

Reversible male sterility and doubled haploid plant production are two valuable technologies in F(1)-hybrid breeding. F(1)-hybrids combine uniformity with high yield and improved agronomic traits, and provide self-acting intellectual property protection. We have developed an F(1)-hybrid seed technology based on the metabolic engineering of glutamine in developing tobacco anthers and pollen. Cytosolic glutamine synthetase (GS1) was inactivated in tobacco by introducing mutated tobacco GS genes fused to the tapetum-specific TA29 and microspore-specific NTM19 promoters. Pollen in primary transformants aborted close to the first pollen mitosis, resulting in male sterility. A non-segregating population of homozygous doubled haploid male-sterile plants was generated through microspore embryogenesis. Fertility restoration was achieved by spraying plants with glutamine, or by pollination with pollen matured in vitro in glutamine-containing medium. The combination of reversible male sterility with doubled haploid production results in an innovative environmentally friendly breeding technology. Tapetum-mediated sporophytic male sterility is of use in foliage crops, whereas microspore-specific gametophytic male sterility can be applied to any field crop. Both types of sterility preclude the release of transgenic pollen into the environment.
An ancient genome duplication contributed to the abundance of metabolic genes in the moss Physcomitrella patens

PubMed Central

Rensing, Stefan A; Ick, Julia; Fawcett, Jeffrey A; Lang, Daniel; Zimmer, Andreas; Van de Peer, Yves; Reski, Ralf

2007-01-01

Background: Analyses of complete genomes and large collections of gene transcripts have shown that most, if not all seed plants have undergone one or more genome duplications in their evolutionary past. Results: In this study, based on a large collection of EST sequences, we provide evidence that the haploid moss Physcomitrella patens is a paleopolyploid as well. Based on the construction of linearized phylogenetic trees we infer the genome duplication to have occurred between 30 and 60 million years ago. Gene Ontology and pathway association of the duplicated genes in P. patens reveal different biases of gene retention compared with seed plants. Conclusion: Metabolic genes seem to have been retained in excess following the genome duplication in P. patens. This might, at least partly, explain the versatility of metabolism, as described for P. patens and other mosses, in comparison to other land plants. PMID:17683536
Human genome education model project. Ethical, legal, and social implications of the human genome project: Education of interdisciplinary professionals

DOE Office of Scientific and Technical Information (OSTI.GOV)

Weiss, J.O.; Lapham, E.V.

1996-12-31

This meeting was held June 10, 1996 at Georgetown University. The purpose of this meeting was to provide a multidisciplinary forum for exchange of state-of-the-art information on the human genome education model. Topics of discussion include the following: psychosocial issues; ethical issues for professionals; legislative issues and update; and education issues.
Genome-scale modeling of human metabolism - a systems biology approach.

PubMed

Mardinoglu, Adil; Gatto, Francesco; Nielsen, Jens

2013-09-01

Altered metabolism is linked to the appearance of various human diseases and a better understanding of disease-associated metabolic changes may lead to the identification of novel prognostic biomarkers and the development of new therapies. Genome-scale metabolic models (GEMs) have been employed for studying human metabolism in a systematic manner, as well as for understanding complex human diseases. In the past decade, such metabolic models - one of the fundamental aspects of systems biology - have started contributing to the understanding of the mechanistic relationship between genotype and phenotype. In this review, we focus on the construction of the Human Metabolic Reaction database, the generation of healthy cell type- and cancer-specific GEMs using different procedures, and the potential applications of these developments in the study of human metabolism and in the identification of metabolic changes associated with various disorders. We further examine how in silico genome-scale reconstructions can be employed to simulate metabolic flux distributions and how high-throughput omics data can be analyzed in a context-dependent fashion. Insights yielded from this mechanistic modeling approach can be used for identifying new therapeutic agents and drug targets as well as for the discovery of novel biomarkers. Finally, recent advancements in genome-scale modeling and the future challenge of developing a model of whole-body metabolism are presented. The emergent contribution of GEMs to personalized and translational medicine is also discussed. Copyright © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Genome duplication and mutations in ACE2 cause multicellular, fast-sedimenting phenotypes in evolved Saccharomyces cerevisiae

PubMed Central

Oud, Bart; Guadalupe-Medina, Victor; Nijkamp, Jurgen F.; de Ridder, Dick; Pronk, Jack T.; van Maris, Antonius J. A.; Daran, Jean-Marc

2013-01-01

Laboratory evolution of the yeast Saccharomyces cerevisiae in bioreactor batch cultures yielded variants that grow as multicellular, fast-sedimenting clusters. Knowledge of the molecular basis of this phenomenon may contribute to the understanding of natural evolution of multicellularity and to manipulating cell sedimentation in laboratory and industrial applications of S. cerevisiae. Multicellular, fast-sedimenting lineages obtained from a haploid S. cerevisiae strain in two independent evolution experiments were analyzed by whole genome resequencing. The two evolved cell lines showed different frameshift mutations in a stretch of eight adenosines in ACE2, which encodes a transcriptional regulator involved in cell cycle control and mother-daughter cell separation. Introduction of the two ace2 mutant alleles into the haploid parental strain led to slow-sedimenting cell clusters that consisted of just a few cells, thus representing only a partial reconstruction of the evolved phenotype. In addition to single-nucleotide mutations, a whole-genome duplication event had occurred in both evolved multicellular strains. Construction of a diploid reference strain with two mutant ace2 alleles led to complete reconstruction of the multicellular-fast sedimenting phenotype. This study shows that whole-genome duplication and a frameshift mutation in ACE2 are sufficient to generate a fast-sedimenting, multicellular phenotype in S. cerevisiae. The nature of the ace2 mutations and their occurrence in two independent evolution experiments encompassing fewer than 500 generations of selective growth suggest that switching between unicellular and multicellular phenotypes may be relevant for competitiveness of S. cerevisiae in natural environments. PMID:24145419
Linkage disequilibrium between STRPs and SNPs across the human genome.

PubMed

Payseur, Bret A; Place, Michael; Weber, James L

2008-05-01

Patterns of linkage disequilibrium (LD) reveal the action of evolutionary processes and provide crucial information for association mapping of disease genes. Although recent studies have described the landscape of LD among single nucleotide polymorphisms (SNPs) from across the human genome, associations involving other classes of molecular variation remain poorly understood. In addition to recombination and population history, mutation rate and process are expected to shape LD. To test this idea, we measured associations between short-tandem-repeat polymorphisms (STRPs), which can mutate rapidly and recurrently, and SNPs in 721 regions across the human genome. We directly compared STRP-SNP LD with SNP-SNP LD from the same genomic regions in the human HapMap populations. The intensity of STRP-SNP LD, measured by the average of D', was reduced, consistent with the action of recurrent mutation. Nevertheless, a higher fraction of STRP-SNP pairs than SNP-SNP pairs showed significant LD, on both short (up to 50 kb) and long (cM) scales. These results reveal the substantial effects of mutational processes on LD at STRPs and provide important measures of the potential of STRPs for association mapping of disease genes.
Evolution and Diversity of the Human Hepatitis D Virus Genome

PubMed Central

Huang, Chi-Ruei; Lo, Szecheng J.

2010-01-01

Human hepatitis delta virus (HDV) is the smallest RNA virus in genome. HDV genome is divided into a viroid-like sequence and a protein-coding sequence which could have originated from different resources and the HDV genome was eventually constituted through RNA recombination. The genome subsequently diversified through accumulation of mutations selected by interactions between the mutated RNA and proteins with host factors to successfully form the infectious virions. Therefore, we propose that the conservation of HDV nucleotide sequence is highly related with its functionality. Genome analysis of known HDV isolates shows that the C-terminal coding sequences of large delta antigen (LDAg) are the highest diversity than other regions of protein-coding sequences but they still retain biological functionality to interact with the heavy chain of clathrin can be selected and maintained. Since viruses interact with many host factors, including escaping the host immune response, how to design a program to predict RNA genome evolution is a great challenging work. PMID:20204073
The genome sequence of avian pathogenic Escherichia coli strain O1:K1:H7 shares strong similarities with human extraintestinal pathogenic E. coli genomes.

PubMed

Johnson, Timothy J; Kariyawasam, Subhashinie; Wannemuehler, Yvonne; Mangiamele, Paul; Johnson, Sara J; Doetkott, Curt; Skyberg, Jerod A; Lynne, Aaron M; Johnson, James R; Nolan, Lisa K

2007-04-01

Escherichia coli strains that cause disease outside the intestine are known as extraintestinal pathogenic E. coli (ExPEC) and include human uropathogenic E. coli (UPEC) and avian pathogenic E. coli (APEC). Regardless of host of origin, ExPEC strains share many traits. It has been suggested that these commonalities may enable APEC to cause disease in humans. Here, we begin to test the hypothesis that certain APEC strains possess potential to cause human urinary tract infection through virulence genotyping of 1,000 APEC and UPEC strains, generation of the first complete genomic sequence of an APEC (APEC O1:K1:H7) strain, and comparison of this genome to all available human ExPEC genomic sequences. The genomes of APEC O1 and three human UPEC strains were found to be remarkably similar, with only 4.5% of APEC O1's genome not found in other sequenced ExPEC genomes. Also, use of multilocus sequence typing showed that some of the sequenced human ExPEC strains were more like APEC O1 than other human ExPEC strains. This work provides evidence that at least some human and avian ExPEC strains are highly similar to one another, and it supports the possibility that a food-borne link between some APEC and UPEC strains exists. Future studies are necessary to assess the ability of APEC to overcome the hurdles necessary for such a food-borne transmission, and epidemiological studies are required to confirm that such a phenomenon actually occurs.
Read clouds uncover variation in complex regions of the human genome

PubMed Central

Bishara, Alex; Liu, Yuling; Weng, Ziming; Kashef-Haghighi, Dorna; Newburger, Daniel E.; West, Robert; Sidow, Arend; Batzoglou, Serafim

2015-01-01

Although an increasing amount of human genetic variation is being identified and recorded, determining variants within repeated sequences of the human genome remains a challenge. Most population and genome-wide association studies have therefore been unable to consider variation in these regions. Core to the problem is the lack of a sequencing technology that produces reads with sufficient length and accuracy to enable unique mapping. Here, we present a novel methodology of using read clouds, obtained by accurate short-read sequencing of DNA derived from long fragment libraries, to confidently align short reads within repeat regions and enable accurate variant discovery. Our novel algorithm, Random Field Aligner (RFA), captures the relationships among the short reads governed by the long read process via a Markov Random Field. We utilized a modified version of the Illumina TruSeq synthetic long-read protocol, which yielded shallow-sequenced read clouds. We test RFA through extensive simulations and apply it to discover variants on the NA12878 human sample, for which shallow TruSeq read cloud sequencing data are available, and on an invasive breast carcinoma genome that we sequenced using the same method. We demonstrate that RFA facilitates accurate recovery of variation in 155 Mb of the human genome, including 94% of 67 Mb of segmental duplication sequence and 96% of 11 Mb of transcribed sequence, that are currently hidden from short-read technologies. PMID:26286554
Read clouds uncover variation in complex regions of the human genome.

PubMed

Bishara, Alex; Liu, Yuling; Weng, Ziming; Kashef-Haghighi, Dorna; Newburger, Daniel E; West, Robert; Sidow, Arend; Batzoglou, Serafim

2015-10-01

Although an increasing amount of human genetic variation is being identified and recorded, determining variants within repeated sequences of the human genome remains a challenge. Most population and genome-wide association studies have therefore been unable to consider variation in these regions. Core to the problem is the lack of a sequencing technology that produces reads with sufficient length and accuracy to enable unique mapping. Here, we present a novel methodology of using read clouds, obtained by accurate short-read sequencing of DNA derived from long fragment libraries, to confidently align short reads within repeat regions and enable accurate variant discovery. Our novel algorithm, Random Field Aligner (RFA), captures the relationships among the short reads governed by the long read process via a Markov Random Field. We utilized a modified version of the Illumina TruSeq synthetic long-read protocol, which yielded shallow-sequenced read clouds. We test RFA through extensive simulations and apply it to discover variants on the NA12878 human sample, for which shallow TruSeq read cloud sequencing data are available, and on an invasive breast carcinoma genome that we sequenced using the same method. We demonstrate that RFA facilitates accurate recovery of variation in 155 Mb of the human genome, including 94% of 67 Mb of segmental duplication sequence and 96% of 11 Mb of transcribed sequence, that are currently hidden from short-read technologies. © 2015 Bishara et al.; Published by Cold Spring Harbor Laboratory Press.
Whole genome duplication and transposable element proliferation drive genome expansion in Corydoradinae catfishes.

PubMed

Marburger, Sarah; Alexandrou, Markos A; Taggart, John B; Creer, Simon; Carvalho, Gary; Oliveira, Claudio; Taylor, Martin I

2018-02-14

Genome size varies significantly across eukaryotic taxa and the largest changes are typically driven by macro-mutations such as whole genome duplications (WGDs) and proliferation of repetitive elements. These two processes may affect the evolutionary potential of lineages by increasing genetic variation and changing gene expression. Here, we elucidate the evolutionary history and mechanisms underpinning genome size variation in a species-rich group of Neotropical catfishes (Corydoradinae) with extreme variation in genome size-0.6 to 4.4 pg per haploid cell. First, genome size was quantified in 65 species and mapped onto a novel fossil-calibrated phylogeny. Two evolutionary shifts in genome size were identified across the tree-the first between 43 and 49 Ma (95% highest posterior density (HPD) 36.2-68.1 Ma) and the second at approximately 19 Ma (95% HPD 15.3-30.14 Ma). Second, restriction-site-associated DNA (RAD) sequencing was used to identify potential WGD events and quantify transposable element (TE) abundance in different lineages. Evidence of two lineage-scale WGDs was identified across the phylogeny, the first event occurring between 54 and 66 Ma (95% HPD 42.56-99.5 Ma) and the second at 20-30 Ma (95% HPD 15.3-45 Ma) based on haplotype numbers per contig and between 35 and 44 Ma (95% HPD 30.29-64.51 Ma) and 20-30 Ma (95% HPD 15.3-45 Ma) based on SNP read ratios. TE abundance increased considerably in parallel with genome size, with a single TE-family (TC1-IS630-Pogo) showing several increases across the Corydoradinae, with the most recent at 20-30 Ma (95% HPD 15.3-45 Ma) and an older event at 35-44 Ma (95% HPD 30.29-64.51 Ma). We identified signals congruent with two WGD duplication events, as well as an increase in TE abundance across different lineages, making the Corydoradinae an excellent model system to study the effects of WGD and TEs on genome and organismal evolution. © 2018 The Authors.

Nuclear fusion and genome encounter during yeast zygote formation.

PubMed

Tartakoff, Alan Michael; Jaiswal, Purnima

2009-06-01

When haploid cells of Saccharomyces cerevisiae are crossed, parental nuclei congress and fuse with each other. To investigate underlying mechanisms, we have developed assays that evaluate the impact of drugs and mutations. Nuclear congression is inhibited by drugs that perturb the actin and tubulin cytoskeletons. Nuclear envelope (NE) fusion consists of at least five steps in which preliminary modifications are followed by controlled flux of first outer and then inner membrane proteins, all before visible dilation of the waist of the nucleus or coalescence of the parental spindle pole bodies. Flux of nuclear pore complexes occurs after dilation. Karyogamy requires both the Sec18p/NSF ATPase and ER/NE luminal homeostasis. After fusion, chromosome tethering keeps tagged parental genomes separate from each other. The process of NE fusion and evidence of genome independence in yeast provide a prototype for understanding related events in higher eukaryotes.
Exact Markov chains versus diffusion theory for haploid random mating.

PubMed

Tyvand, Peder A; Thorvaldsen, Steinar

2010-05-01

Exact discrete Markov chains are applied to the Wright-Fisher model and the Moran model of haploid random mating. Selection and mutations are neglected. At each discrete value of time t there is a given number n of diploid monoecious organisms. The evolution of the population distribution is given in diffusion variables, to compare the two models of random mating with their common diffusion limit. Only the Moran model converges uniformly to the diffusion limit near the boundary. The Wright-Fisher model allows the population size to change with the generations. Diffusion theory tends to under-predict the loss of genetic information when a population enters a bottleneck. 2010 Elsevier Inc. All rights reserved.
The Core and Accessory Genomes of Burkholderia pseudomallei: Implications for Human Melioidosis

PubMed Central

Lin, Chi Ho; Karuturi, R. Krishna M.; Wuthiekanun, Vanaporn; Tuanyok, Apichai; Chua, Hui Hoon; Ong, Catherine; Paramalingam, Sivalingam Suppiah; Tan, Gladys; Tang, Lynn; Lau, Gary; Ooi, Eng Eong; Woods, Donald; Feil, Edward; Peacock, Sharon J.; Tan, Patrick

2008-01-01

Natural isolates of Burkholderia pseudomallei (Bp), the causative agent of melioidosis, can exhibit significant ecological flexibility that is likely reflective of a dynamic genome. Using whole-genome Bp microarrays, we examined patterns of gene presence and absence across 94 South East Asian strains isolated from a variety of clinical, environmental, or animal sources. 86% of the Bp K96243 reference genome was common to all the strains representing the Bp “core genome”, comprising genes largely involved in essential functions (eg amino acid metabolism, protein translation). In contrast, 14% of the K96243 genome was variably present across the isolates. This Bp accessory genome encompassed multiple genomic islands (GIs), paralogous genes, and insertions/deletions, including three distinct lipopolysaccharide (LPS)-related gene clusters. Strikingly, strains recovered from cases of human melioidosis clustered on a tree based on accessory gene content, and were significantly more likely to harbor certain GIs compared to animal and environmental isolates. Consistent with the inference that the GIs may contribute to pathogenesis, experimental mutation of BPSS2053, a GI gene, reduced microbial adherence to human epithelial cells. Our results suggest that the Bp accessory genome is likely to play an important role in microbial adaptation and virulence. PMID:18927621
Discovery of common sequences absent in the human reference genome using pooled samples from next generation sequencing.

PubMed

Liu, Yu; Koyutürk, Mehmet; Maxwell, Sean; Xiang, Min; Veigl, Martina; Cooper, Richard S; Tayo, Bamidele O; Li, Li; LaFramboise, Thomas; Wang, Zhenghe; Zhu, Xiaofeng; Chance, Mark R

2014-08-16

Sequences up to several megabases in length have been found to be present in individual genomes but absent in the human reference genome. These sequences may be common in populations, and their absence in the reference genome may indicate rare variants in the genomes of individuals who served as donors for the human genome project. As the reference genome is used in probe design for microarray technology and mapping short reads in next generation sequencing (NGS), this missing sequence could be a source of bias in functional genomic studies and variant analysis. One End Anchor (OEA) and/or orphan reads from paired-end sequencing have been used to identify novel sequences that are absent in reference genome. However, there is no study to investigate the distribution, evolution and functionality of those sequences in human populations. To systematically identify and study the missing common sequences (micSeqs), we extended the previous method by pooling OEA reads from large number of individuals and applying strict filtering methods to remove false sequences. The pipeline was applied to data from phase 1 of the 1000 Genomes Project. We identified 309 micSeqs that are present in at least 1% of the human population, but absent in the reference genome. We confirmed 76% of these 309 micSeqs by comparison to other primate genomes, individual human genomes, and gene expression data. Furthermore, we randomly selected fifteen micSeqs and confirmed their presence using PCR validation in 38 additional individuals. Functional analysis using published RNA-seq and ChIP-seq data showed that eleven micSeqs are highly expressed in human brain and three micSeqs contain transcription factor (TF) binding regions, suggesting they are functional elements. In addition, the identified micSeqs are absent in non-primates and show dynamic acquisition during primate evolution culminating with most micSeqs being present in Africans, suggesting some micSeqs may be important sources of human
Genome-wide prediction of vaccine targets for human herpes simplex viruses using Vaxign reverse vaccinology

PubMed Central

2013-01-01

Herpes simplex virus (HSV) types 1 and 2 (HSV-1 and HSV-2) are the most common infectious agents of humans. No safe and effective HSV vaccines have been licensed. Reverse vaccinology is an emerging and revolutionary vaccine development strategy that starts with the prediction of vaccine targets by informatics analysis of genome sequences. Vaxign (http://www.violinet.org/vaxign) is the first web-based vaccine design program based on reverse vaccinology. In this study, we used Vaxign to analyze 52 herpesvirus genomes, including 3 HSV-1 genomes, one HSV-2 genome, 8 other human herpesvirus genomes, and 40 non-human herpesvirus genomes. The HSV-1 strain 17 genome that contains 77 proteins was used as the seed genome. These 77 proteins are conserved in two other HSV-1 strains (strain F and strain H129). Two envelope glycoproteins gJ and gG do not have orthologs in HSV-2 or 8 other human herpesviruses. Seven HSV-1 proteins (including gJ and gG) do not have orthologs in all 40 non-human herpesviruses. Nineteen proteins are conserved in all human herpesviruses, including capsid scaffold protein UL26.5 (NP_044628.1). As the only HSV-1 protein predicted to be an adhesin, UL26.5 is a promising vaccine target. The MHC Class I and II epitopes were predicted by the Vaxign Vaxitop prediction program and IEDB prediction programs recently installed and incorporated in Vaxign. Our comparative analysis found that the two programs identified largely the same top epitopes but also some positive results predicted from one program might not be positive from another program. Overall, our Vaxign computational prediction provides many promising candidates for rational HSV vaccine development. The method is generic and can also be used to predict other viral vaccine targets. PMID:23514126
Whole genome analysis of selected human and animal rotaviruses identified in Uganda from 2012 to 2014 reveals complex genome reassortment events between human, bovine, caprine and porcine strains.

PubMed

Bwogi, Josephine; Jere, Khuzwayo C; Karamagi, Charles; Byarugaba, Denis K; Namuwulya, Prossy; Baliraine, Frederick N; Desselberger, Ulrich; Iturriza-Gomara, Miren

2017-01-01

Rotaviruses of species A (RVA) are a common cause of diarrhoea in children and the young of various other mammals and birds worldwide. To investigate possible interspecies transmission of RVAs, whole genomes of 18 human and 6 domestic animal RVA strains identified in Uganda between 2012 and 2014 were sequenced using the Illumina HiSeq platform. The backbone of the human RVA strains had either a Wa- or a DS-1-like genetic constellation. One human strain was a Wa-like mono-reassortant containing a DS-1-like VP2 gene of possible animal origin. All eleven genes of one bovine RVA strain were closely related to those of human RVAs. One caprine strain had a mixed genotype backbone, suggesting that it emerged from multiple reassortment events involving different host species. The porcine RVA strains had mixed genotype backbones with possible multiple reassortant events with strains of human and bovine origin.Overall, whole genome characterisation of rotaviruses found in domestic animals in Uganda strongly suggested the presence of human-to animal RVA transmission, with concomitant circulation of multi-reassortant strains potentially derived from complex interspecies transmission events. However, whole genome data from the human RVA strains causing moderate and severe diarrhoea in under-fives in Uganda indicated that they were primarily transmitted from person-to-person.
Mutation induction in haploid yeast after split-dose radiation-exposure. I. Fractionated UV-irradiation.

PubMed

Schenk, K; Zölzer, F; Kiefer, J

1989-01-01

Mutation induction was investigated in wild-type haploid yeast Saccharomyces cerevisiae after split-dose UV-irradiation. Cells were exposed to fractionated 254 nm-UV-doses separated by intervals from 0 to 6 h with incubation either on non-nutrient or nutrient agar between. The test parameter was resistance to canavanine. If modifications of sensitivity due to incubation are appropriately taken into account there is no change of mutation frequency.
Zaba: a novel miniature transposable element present in genomes of legume plants.

PubMed

Macas, J; Neumann, P; Pozárková, D

2003-08-01

A novel family of miniature transposable elements, named Zaba, was identified in pea (Pisum sativum) and subsequently also in other legume species using computer analysis of their DNA sequences. Zaba elements are 141-190 bp long, generate 10-bp target site duplications, and their terminal inverted repeats make up most of the sequence. Zaba elements thus resemble class 3 foldback transposons. The elements are only moderately repetitive in pea (tens to hundreds copies per haploid genome), but they are present in up to thousands of copies in the genomes of several Medicago and Vicia species. More detailed analysis of the elements from pea, including isolation of new sequences from a genomic library, revealed that a fraction of these elements are truncated, and that their last transposition probably did not occur recently. A search for Zaba sequences in EST databases showed that at least some elements are transcribed, most probably due to their association with genic regions.
Citrus sinensis annotation project (CAP): a comprehensive database for sweet orange genome.

PubMed

Wang, Jia; Chen, Dijun; Lei, Yang; Chang, Ji-Wei; Hao, Bao-Hai; Xing, Feng; Li, Sen; Xu, Qiang; Deng, Xiu-Xin; Chen, Ling-Ling

2014-01-01

Citrus is one of the most important and widely grown fruit crop with global production ranking firstly among all the fruit crops in the world. Sweet orange accounts for more than half of the Citrus production both in fresh fruit and processed juice. We have sequenced the draft genome of a double-haploid sweet orange (C. sinensis cv. Valencia), and constructed the Citrus sinensis annotation project (CAP) to store and visualize the sequenced genomic and transcriptome data. CAP provides GBrowse-based organization of sweet orange genomic data, which integrates ab initio gene prediction, EST, RNA-seq and RNA-paired end tag (RNA-PET) evidence-based gene annotation. Furthermore, we provide a user-friendly web interface to show the predicted protein-protein interactions (PPIs) and metabolic pathways in sweet orange. CAP provides comprehensive information beneficial to the researchers of sweet orange and other woody plants, which is freely available at http://citrus.hzau.edu.cn/.
Genome-wide association studies in Africans and African Americans: Expanding the Framework of the Genomics of Human Traits and Disease

PubMed Central

Peprah, Emmanuel; Xu, Huichun; Tekola-Ayele, Fasil; Royal, Charmaine D.

2014-01-01

Genomic research is one of the tools for elucidating the pathogenesis of diseases of global health relevance, and paving the research dimension to clinical and public health translation. Recent advances in genomic research and technologies have increased our understanding of human diseases, genes associated with these disorders, and the relevant mechanisms. Genome-wide association studies (GWAS) have proliferated since the first studies were published several years ago, and have become an important tool in helping researchers comprehend human variation and the role genetic variants play in disease. However, the need to expand the diversity of populations in GWAS has become increasingly apparent as new knowledge is gained about genetic variation. Inclusion of diverse populations in genomic studies is critical to a more complete understanding of human variation and elucidation of the underpinnings of complex diseases. In this review, we summarize the available data on GWAS in recent-African ancestry populations within the western hemisphere (i.e. African Americans and peoples of the Caribbean) and continental African populations. Furthermore, we highlight ways in which genomic studies in populations of recent African ancestry have led to advances in the areas of malaria, HIV, prostate cancer, and other diseases. Finally, we discuss the advantages of conducting GWAS in recent African ancestry populations in the context of addressing existing and emerging global health conditions. PMID:25427668
From Human Genetics and Genomics to Pharmacogenetics and Pharmacogenomics: Past Lessons, Future Directions

PubMed Central

Nebert, Daniel W.; Zhang, Ge; Vesell, Elliot S.

2009-01-01

A brief history of human genetics and genomics is provided, comparing recent progress in those fields with that in pharmacogenetics and pharmacogenomics, which are subsets of genetics and genomics, respectively. Sequencing of the entire human genome, the mapping of common haplotypes of single-nucleotide polymorphisms (SNPs), and cost-effective genotyping technologies leading to genome-wide association (GWA) studies—have combined convincingly in the past several years to demonstrate the requirements needed to separate true associations from the plethora of false positives. While research in human genetics has moved from monogenic to oligogenic to complex diseases, its pharmacogenetics branch has followed, usually a few years behind. The continuous discoveries, even today, of new surprises about our genome cause us to question reviews declaring that “personalized medicine is almost here” or that “individualized drug therapy will soon be a reality.” As summarized herein, numerous reasons exist to show that an “unequivocal genotype” or even an “unequivocal phenotype” is virtually impossible to achieve in current limited-size studies of human populations. This problem (of insufficiently stringent criteria) leads to a decrease in statistical power and, consequently, equivocal interpretation of most genotype-phenotype association studies. It remains unclear whether personalized medicine or individualized drug therapy will ever be achievable by means of DNA testing alone. PMID:18464043
Genomic signatures of positive selection in humans and the limits of outlier approaches.

PubMed

Kelley, Joanna L; Madeoy, Jennifer; Calhoun, John C; Swanson, Willie; Akey, Joshua M

2006-08-01

Identifying regions of the human genome that have been targets of positive selection will provide important insights into recent human evolutionary history and may facilitate the search for complex disease genes. However, the confounding effects of population demographic history and selection on patterns of genetic variation complicate inferences of selection when a small number of loci are studied. To this end, identifying outlier loci from empirical genome-wide distributions of genetic variation is a promising strategy to detect targets of selection. Here, we evaluate the power and efficiency of a simple outlier approach and describe a genome-wide scan for positive selection using a dense catalog of 1.58 million SNPs that were genotyped in three human populations. In total, we analyzed 14,589 genes, 385 of which possess patterns of genetic variation consistent with the hypothesis of positive selection. Furthermore, several extended genomic regions were found, spanning >500 kb, that contained multiple contiguous candidate selection genes. More generally, these data provide important practical insights into the limits of outlier approaches in genome-wide scans for selection, provide strong candidate selection genes to study in greater detail, and may have important implications for disease related research.
Distinct p53 genomic binding patterns in normal and cancer-derived human cells

PubMed Central

McCorkle, Sean R; McCombie, WR; Dunn, John J

2011-01-01

Here, we report genome-wide analysis of the tumor suppressor p53 binding sites in normal human cells. 743 high-confidence ChIP-seq peaks representing putative genomic binding sites were identified in normal IMR90 fibroblasts using a reference chromatin sample. More than 40% were located within 2 kb of a transcription start site (TSS), a distribution similar to that documented for individually studied, functional p53 binding sites and, to date, not observed by previous p53 genome-wide studies. Nearly half of the high-confidence binding sites in the IMR90 cells reside in CpG islands in marked contrast to sites reported in cancer-derived cells. The distinct genomic features of the IMR90 binding sites do not reflect a distinct preference for specific sequences, since the de novo developed p53 motif based on our study is similar to those reported by genome-wide studies of cancer cells. More likely, the different chromatin landscape in normal, compared with cancer-derived cells, influences p53 binding via modulating availability of the sites. We compared the IMR90 ChIP-seq peaks to the recently published IMR90 methylome1 and demonstrated that they are enriched at hypomethylated DNA. Our study represents the first genome-wide, de novo mapping of p53 binding sites in normal human cells and reveals that p53 binding sites reside in distinct genomic landscapes in normal and cancer-derived human cells. PMID:22127205
Genomic landscape of human diversity across Madagascar

PubMed Central

Pierron, Denis; Heiske, Margit; Razafindrazaka, Harilanto; Rakoto, Ignace; Rabetokotany, Nelly; Ravololomanga, Bodo; Rakotozafy, Lucien M.-A.; Rakotomalala, Mireille Mialy; Razafiarivony, Michel; Rasoarifetra, Bako; Raharijesy, Miakabola Andriamampianina; Razafindralambo, Lolona; Ramilisonina; Fanony, Fulgence; Lejamble, Sendra; Thomas, Olivier; Mohamed Abdallah, Ahmed; Rocher, Christophe; Arachiche, Amal; Tonaso, Laure; Pereda-loth, Veronica; Schiavinato, Stéphanie; Brucato, Nicolas; Ricaut, Francois-Xavier; Kusuma, Pradiptajati; Sudoyo, Herawati; Ni, Shengyu; Boland, Anne; Deleuze, Jean-Francois; Beaujard, Philippe; Grange, Philippe; Adelaar, Sander; Stoneking, Mark; Rakotoarisoa, Jean-Aimé; Radimilahy, Chantal; Letellier, Thierry

2017-01-01

Although situated ∼400 km from the east coast of Africa, Madagascar exhibits cultural, linguistic, and genetic traits from both Southeast Asia and Eastern Africa. The settlement history remains contentious; we therefore used a grid-based approach to sample at high resolution the genomic diversity (including maternal lineages, paternal lineages, and genome-wide data) across 257 villages and 2,704 Malagasy individuals. We find a common Bantu and Austronesian descent for all Malagasy individuals with a limited paternal contribution from Europe and the Middle East. Admixture and demographic growth happened recently, suggesting a rapid settlement of Madagascar during the last millennium. However, the distribution of African and Asian ancestry across the island reveals that the admixture was sex biased and happened heterogeneously across Madagascar, suggesting independent colonization of Madagascar from Africa and Asia rather than settlement by an already admixed population. In addition, there are geographic influences on the present genomic diversity, independent of the admixture, showing that a few centuries is sufficient to produce detectable genetic structure in human populations. PMID:28716916
Sequence Analysis and Characterization of Active Human Alu Subfamilies Based on the 1000 Genomes Pilot Project.

PubMed

Konkel, Miriam K; Walker, Jerilyn A; Hotard, Ashley B; Ranck, Megan C; Fontenot, Catherine C; Storer, Jessica; Stewart, Chip; Marth, Gabor T; Batzer, Mark A

2015-08-29

The goal of the 1000 Genomes Consortium is to characterize human genome structural variation (SV), including forms of copy number variations such as deletions, duplications, and insertions. Mobile element insertions, particularly Alu elements, are major contributors to genomic SV among humans. During the pilot phase of the project we experimentally validated 645 (611 intergenic and 34 exon targeted) polymorphic "young" Alu insertion events, absent from the human reference genome. Here, we report high resolution sequencing of 343 (322 unique) recent Alu insertion events, along with their respective target site duplications, precise genomic breakpoint coordinates, subfamily assignment, percent divergence, and estimated A-rich tail lengths. All the sequenced Alu loci were derived from the AluY lineage with no evidence of retrotransposition activity involving older Alu families (e.g., AluJ and AluS). AluYa5 is currently the most active Alu subfamily in the human lineage, followed by AluYb8, and many others including three newly identified subfamilies we have termed AluYb7a3, AluYb8b1, and AluYa4a1. This report provides the structural details of 322 unique Alu variants from individual human genomes collectively adding about 100 kb of genomic variation. Many Alu subfamilies are currently active in human populations, including a surprising level of AluY retrotransposition. Human Alu subfamilies exhibit continuous evolution with potential drivers sprouting new Alu lineages. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Genome-wide analysis identifies 12 loci influencing human reproductive behavior.

PubMed

Barban, Nicola; Jansen, Rick; de Vlaming, Ronald; Vaez, Ahmad; Mandemakers, Jornt J; Tropf, Felix C; Shen, Xia; Wilson, James F; Chasman, Daniel I; Nolte, Ilja M; Tragante, Vinicius; van der Laan, Sander W; Perry, John R B; Kong, Augustine; Ahluwalia, Tarunveer S; Albrecht, Eva; Yerges-Armstrong, Laura; Atzmon, Gil; Auro, Kirsi; Ayers, Kristin; Bakshi, Andrew; Ben-Avraham, Danny; Berger, Klaus; Bergman, Aviv; Bertram, Lars; Bielak, Lawrence F; Bjornsdottir, Gyda; Bonder, Marc Jan; Broer, Linda; Bui, Minh; Barbieri, Caterina; Cavadino, Alana; Chavarro, Jorge E; Turman, Constance; Concas, Maria Pina; Cordell, Heather J; Davies, Gail; Eibich, Peter; Eriksson, Nicholas; Esko, Tõnu; Eriksson, Joel; Falahi, Fahimeh; Felix, Janine F; Fontana, Mark Alan; Franke, Lude; Gandin, Ilaria; Gaskins, Audrey J; Gieger, Christian; Gunderson, Erica P; Guo, Xiuqing; Hayward, Caroline; He, Chunyan; Hofer, Edith; Huang, Hongyan; Joshi, Peter K; Kanoni, Stavroula; Karlsson, Robert; Kiechl, Stefan; Kifley, Annette; Kluttig, Alexander; Kraft, Peter; Lagou, Vasiliki; Lecoeur, Cecile; Lahti, Jari; Li-Gao, Ruifang; Lind, Penelope A; Liu, Tian; Makalic, Enes; Mamasoula, Crysovalanto; Matteson, Lindsay; Mbarek, Hamdi; McArdle, Patrick F; McMahon, George; Meddens, S Fleur W; Mihailov, Evelin; Miller, Mike; Missmer, Stacey A; Monnereau, Claire; van der Most, Peter J; Myhre, Ronny; Nalls, Mike A; Nutile, Teresa; Kalafati, Ioanna Panagiota; Porcu, Eleonora; Prokopenko, Inga; Rajan, Kumar B; Rich-Edwards, Janet; Rietveld, Cornelius A; Robino, Antonietta; Rose, Lynda M; Rueedi, Rico; Ryan, Kathleen A; Saba, Yasaman; Schmidt, Daniel; Smith, Jennifer A; Stolk, Lisette; Streeten, Elizabeth; Tönjes, Anke; Thorleifsson, Gudmar; Ulivi, Sheila; Wedenoja, Juho; Wellmann, Juergen; Willeit, Peter; Yao, Jie; Yengo, Loic; Zhao, Jing Hua; Zhao, Wei; Zhernakova, Daria V; Amin, Najaf; Andrews, Howard; Balkau, Beverley; Barzilai, Nir; Bergmann, Sven; Biino, Ginevra; Bisgaard, Hans; Bønnelykke, Klaus; Boomsma, Dorret I; Buring, Julie E; Campbell, Harry; Cappellani, Stefania; Ciullo, Marina; Cox, Simon R; Cucca, Francesco; Toniolo, Daniela; Davey-Smith, George; Deary, Ian J; Dedoussis, George; Deloukas, Panos; van Duijn, Cornelia M; de Geus, Eco J C; Eriksson, Johan G; Evans, Denis A; Faul, Jessica D; Sala, Cinzia Felicita; Froguel, Philippe; Gasparini, Paolo; Girotto, Giorgia; Grabe, Hans-Jörgen; Greiser, Karin Halina; Groenen, Patrick J F; de Haan, Hugoline G; Haerting, Johannes; Harris, Tamara B; Heath, Andrew C; Heikkilä, Kauko; Hofman, Albert; Homuth, Georg; Holliday, Elizabeth G; Hopper, John; Hyppönen, Elina; Jacobsson, Bo; Jaddoe, Vincent W V; Johannesson, Magnus; Jugessur, Astanand; Kähönen, Mika; Kajantie, Eero; Kardia, Sharon L R; Keavney, Bernard; Kolcic, Ivana; Koponen, Päivikki; Kovacs, Peter; Kronenberg, Florian; Kutalik, Zoltan; La Bianca, Martina; Lachance, Genevieve; Iacono, William G; Lai, Sandra; Lehtimäki, Terho; Liewald, David C; Lindgren, Cecilia M; Liu, Yongmei; Luben, Robert; Lucht, Michael; Luoto, Riitta; Magnus, Per; Magnusson, Patrik K E; Martin, Nicholas G; McGue, Matt; McQuillan, Ruth; Medland, Sarah E; Meisinger, Christa; Mellström, Dan; Metspalu, Andres; Traglia, Michela; Milani, Lili; Mitchell, Paul; Montgomery, Grant W; Mook-Kanamori, Dennis; de Mutsert, Renée; Nohr, Ellen A; Ohlsson, Claes; Olsen, Jørn; Ong, Ken K; Paternoster, Lavinia; Pattie, Alison; Penninx, Brenda W J H; Perola, Markus; Peyser, Patricia A; Pirastu, Mario; Polasek, Ozren; Power, Chris; Kaprio, Jaakko; Raffel, Leslie J; Räikkönen, Katri; Raitakari, Olli; Ridker, Paul M; Ring, Susan M; Roll, Kathryn; Rudan, Igor; Ruggiero, Daniela; Rujescu, Dan; Salomaa, Veikko; Schlessinger, David; Schmidt, Helena; Schmidt, Reinhold; Schupf, Nicole; Smit, Johannes; Sorice, Rossella; Spector, Tim D; Starr, John M; Stöckl, Doris; Strauch, Konstantin; Stumvoll, Michael; Swertz, Morris A; Thorsteinsdottir, Unnur; Thurik, A Roy; Timpson, Nicholas J; Tung, Joyce Y; Uitterlinden, André G; Vaccargiu, Simona; Viikari, Jorma; Vitart, Veronique; Völzke, Henry; Vollenweider, Peter; Vuckovic, Dragana; Waage, Johannes; Wagner, Gert G; Wang, Jie Jin; Wareham, Nicholas J; Weir, David R; Willemsen, Gonneke; Willeit, Johann; Wright, Alan F; Zondervan, Krina T; Stefansson, Kari; Krueger, Robert F; Lee, James J; Benjamin, Daniel J; Cesarini, David; Koellinger, Philipp D; den Hoed, Marcel; Snieder, Harold; Mills, Melinda C

2016-12-01

The genetic architecture of human reproductive behavior-age at first birth (AFB) and number of children ever born (NEB)-has a strong relationship with fitness, human development, infertility and risk of neuropsychiatric disorders. However, very few genetic loci have been identified, and the underlying mechanisms of AFB and NEB are poorly understood. We report a large genome-wide association study of both sexes including 251,151 individuals for AFB and 343,072 individuals for NEB. We identified 12 independent loci that are significantly associated with AFB and/or NEB in a SNP-based genome-wide association study and 4 additional loci associated in a gene-based effort. These loci harbor genes that are likely to have a role, either directly or by affecting non-local gene expression, in human reproduction and infertility, thereby increasing understanding of these complex traits.
Reference quality assembly of the 3.5-Gb genome of Capsicum annuum from a single linked-read library.

PubMed

Hulse-Kemp, Amanda M; Maheshwari, Shamoni; Stoffel, Kevin; Hill, Theresa A; Jaffe, David; Williams, Stephen R; Weisenfeld, Neil; Ramakrishnan, Srividya; Kumar, Vijay; Shah, Preyas; Schatz, Michael C; Church, Deanna M; Van Deynze, Allen

2018-01-01

Linked-Read sequencing technology has recently been employed successfully for de novo assembly of human genomes, however, the utility of this technology for complex plant genomes is unproven. We evaluated the technology for this purpose by sequencing the 3.5-gigabase (Gb) diploid pepper ( Capsicum annuum ) genome with a single Linked-Read library. Plant genomes, including pepper, are characterized by long, highly similar repetitive sequences. Accordingly, significant effort is used to ensure that the sequenced plant is highly homozygous and the resulting assembly is a haploid consensus. With a phased assembly approach, we targeted a heterozygous F 1 derived from a wide cross to assess the ability to derive both haplotypes and characterize a pungency gene with a large insertion/deletion. The Supernova software generated a highly ordered, more contiguous sequence assembly than all currently available C. annuum reference genomes. Over 83% of the final assembly was anchored and oriented using four publicly available de novo linkage maps. A comparison of the annotation of conserved eukaryotic genes indicated the completeness of assembly. The validity of the phased assembly is further demonstrated with the complete recovery of both 2.5-Kb insertion/deletion haplotypes of the PUN1 locus in the F 1 sample that represents pungent and nonpungent peppers, as well as nearly full recovery of the BUSCO2 gene set within each of the two haplotypes. The most contiguous pepper genome assembly to date has been generated which demonstrates that Linked-Read library technology provides a tool to de novo assemble complex highly repetitive heterozygous plant genomes. This technology can provide an opportunity to cost-effectively develop high-quality genome assemblies for other complex plants and compare structural and gene differences through accurate haplotype reconstruction.
CRISPR/Cas9-mediated genome editing of Epstein-Barr virus in human cells.

PubMed

Yuen, Kit-San; Chan, Chi-Ping; Wong, Nok-Hei Mickey; Ho, Chau-Ha; Ho, Ting-Hin; Lei, Ting; Deng, Wen; Tsao, Sai Wah; Chen, Honglin; Kok, Kin-Hang; Jin, Dong-Yan

2015-03-01

The CRISPR (clustered regularly interspaced short palindromic repeats)/Cas9 (CRISPR-associated 9) system is a highly efficient and powerful tool for RNA-guided editing of the cellular genome. Whether CRISPR/Cas9 can also cleave the genome of DNA viruses such as Epstein-Barr virus (EBV), which undergo episomal replication in human cells, remains to be established. Here, we reported on CRISPR/Cas9-mediated editing of the EBV genome in human cells. Two guide RNAs (gRNAs) were used to direct a targeted deletion of 558 bp in the promoter region of BART (BamHI A rightward transcript) which encodes viral microRNAs (miRNAs). Targeted editing was achieved in several human epithelial cell lines latently infected with EBV, including nasopharyngeal carcinoma C666-1 cells. CRISPR/Cas9-mediated editing of the EBV genome was efficient. A recombinant virus with the desired deletion was obtained after puromycin selection of cells expressing Cas9 and gRNAs. No off-target cleavage was found by deep sequencing. The loss of BART miRNA expression and activity was verified, supporting the BART promoter as the major promoter of BART RNA. Although CRISPR/Cas9-mediated editing of the multicopy episome of EBV in infected HEK293 cells was mostly incomplete, viruses could be recovered and introduced into other cells at low m.o.i. Recombinant viruses with an edited genome could be further isolated through single-cell sorting. Finally, a DsRed selectable marker was successfully introduced into the EBV genome during the course of CRISPR/Cas9-mediated editing. Taken together, our work provided not only the first genetic evidence that the BART promoter drives the expression of the BART transcript, but also a new and efficient method for targeted editing of EBV genome in human cells. © 2015 The Authors.
Complete Genome Sequence of Treponema paraluiscuniculi, Strain Cuniculi A: The Loss of Infectivity to Humans Is Associated with Genome Decay

PubMed Central

Šmajs, David; Zobaníková, Marie; Strouhal, Michal; Čejková, Darina; Dugan-Rocha, Shannon; Pospíšilová, Petra; Norris, Steven J.; Albert, Tom; Qin, Xiang; Hallsworth-Pepin, Kym; Buhay, Christian; Muzny, Donna M.; Chen, Lei; Gibbs, Richard A.; Weinstock, George M.

2011-01-01

Treponema paraluiscuniculi is the causative agent of rabbit venereal spirochetosis. It is not infectious to humans, although its genome structure is very closely related to other pathogenic Treponema species including Treponema pallidum subspecies pallidum, the etiological agent of syphilis. In this study, the genome sequence of Treponema paraluiscuniculi, strain Cuniculi A, was determined by a combination of several high-throughput sequencing strategies. Whereas the overall size (1,133,390 bp), arrangement, and gene content of the Cuniculi A genome closely resembled those of the T. pallidum genome, the T. paraluiscuniculi genome contained a markedly higher number of pseudogenes and gene fragments (51). In addition to pseudogenes, 33 divergent genes were also found in the T. paraluiscuniculi genome. A set of 32 (out of 84) affected genes encoded proteins of known or predicted function in the Nichols genome. These proteins included virulence factors, gene regulators and components of DNA repair and recombination. The majority (52 or 61.9%) of the Cuniculi A pseudogenes and divergent genes were of unknown function. Our results indicate that T. paraluiscuniculi has evolved from a T. pallidum-like ancestor and adapted to a specialized host-associated niche (rabbits) during loss of infectivity to humans. The genes that are inactivated or altered in T. paraluiscuniculi are candidates for virulence factors important in the infectivity and pathogenesis of T. pallidum subspecies. PMID:21655244
REGULATION OF GEOGRAPHIC VARIABILITY IN HAPLOID:DIPLOD RATIOS OF BIPHASIC SEAWEED LIFE CYCLES(1).

PubMed

da Silva Vieira, Vasco Manuel Nobre de Carvalho; Santos, Rui Orlando Pimenta

2012-08-01

The relative abundance of haploid and diploid individuals (H:D) in isomorphic marine algal biphasic cycles varies spatially, but only if vital rates of haploid and diploid phases vary differently with environmental conditions (i.e. conditional differentiation between phases). Vital rates of isomorphic phases in particular environments may be determined by subtle morphological or physiological differences. Herein, we test numerically how geographic variability in H:D is regulated by conditional differentiation between isomorphic life phases and the type of life strategy of populations (i.e. life cycles dominated by reproduction, survival or growth). Simulation conditions were selected using available data on H:D spatial variability in seaweeds. Conditional differentiation between ploidy phases had a small effect on the H:D variability for species with life strategies that invest either in fertility or in growth. Conversely, species with life strategies that invest mainly in survival, exhibited high variability in H:D through a conditional differentiation in stasis (the probability of staying in the same size class), breakage (the probability of changing to a smaller size class) or growth (the probability of changing to a bigger size class). These results were consistent with observed geographic variability in H:D of natural marine algae populations. © 2012 Phycological Society of America.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.