Sample records for snp frequency haplotype

  1. TNF-alpha SNP haplotype frequencies in equidae.

    PubMed

    Brown, J J; Ollier, W E R; Thomson, W; Matthews, J B; Carter, S D; Binns, M; Pinchbeck, G; Clegg, P D

    2006-05-01

    Tumour necrosis factor alpha (TNF-alpha) is a pro-inflammatory cytokine that plays a crucial role in the regulation of inflammatory and immune responses. In all vertebrate species the genes encoding TNF-alpha are located within the major histocompatibility complex. In the horse TNF-alpha has been ascribed a role in a variety of important disease processes. Previously two single nucleotide polymorphisms (SNPs) have been reported within the 5' untranslated region of the equine TNF-alpha gene. We have examined the equine TNF-alpha promoter region further for additional SNPs by analysing DNA from 131 horses (Equus caballus), 19 donkeys (E. asinus), 2 Grant's zebras (E. burchellii boehmi) and one onager (E. hemionus). Two further SNPs were identified at nucleotide positions 24 (T/G) and 452 (T/C) relative to the first nucleotide of the 522 bp polymerase chain reaction product. A sequence variant at position 51 was observed between equidae. SNaPSHOT genotyping assays for these and the two previously reported SNPs were performed on 457 horses comprising seven different breeds and 23 donkeys to determine the gene frequencies. SNP frequencies varied considerably between different horse breeds and also between the equine species. In total, nine different TNF-alpha promoter SNP haplotypes and their frequencies were established amongst the various equidae examined, with some haplotypes being found only in horses and others only in donkeys or zebras. The haplotype frequencies observed varied greatly between different horse breeds. Such haplotypes may relate to levels of TNF-alpha production and disease susceptibility and further investigation is required to identify associations between particular haplotypes and altered risk of disease.
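
    As a rough illustration of how haplotype frequencies like those above can be tabulated per group, here is a minimal Python sketch. The haplotype strings and group samples are invented for illustration and are not data from the study; the function name is also an assumption.

```python
from collections import Counter


def haplotype_frequencies(haplotypes):
    """Return {haplotype: relative frequency} for a list of phased haplotype strings."""
    counts = Counter(haplotypes)
    total = sum(counts.values())
    return {hap: n / total for hap, n in counts.items()}


# Hypothetical phased four-SNP haplotypes, invented for illustration only.
horses = ["TTCA", "TTCA", "GTCA", "TCCA", "TTCA", "GTCA"]
donkeys = ["TTTG", "TTTG", "TTCA", "TTTG"]

horse_freq = haplotype_frequencies(horses)
donkey_freq = haplotype_frequencies(donkeys)

# Haplotypes seen in the horse sample but not in the donkey sample.
horse_only = set(horse_freq) - set(donkey_freq)
```

    Comparing the key sets of the two dictionaries is one simple way to flag haplotypes that appear in only one group.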

  2. A phased SNP-based classification of sickle cell anemia HBB haplotypes.

    PubMed

    Shaikho, Elmutaz M; Farrell, John J; Alsultan, Abdulrahman; Qutub, Hatem; Al-Ali, Amein K; Figueiredo, Maria Stella; Chui, David H K; Farrer, Lindsay A; Murphy, George J; Mostoslavsky, Gustavo; Sebastiani, Paola; Steinberg, Martin H

    2017-08-11

    Sickle cell anemia causes severe complications and premature death. Five common β-globin gene cluster haplotypes are each associated with characteristic fetal hemoglobin (HbF) levels. As HbF is the major modulator of disease severity, classifying patients according to haplotype is useful. The first method of haplotype classification used restriction fragment length polymorphisms (RFLPs) to detect single nucleotide polymorphisms (SNPs) in the β-globin gene cluster. This is labor intensive, and error prone. We used genome-wide SNP data imputed to the 1000 Genomes reference panel to obtain phased data distinguishing parental alleles. We successfully haplotyped 813 sickle cell anemia patients previously classified by RFLPs with a concordance >98%. Four SNPs (rs3834466, rs28440105, rs10128556, and rs968857) marking four different restriction enzyme sites unequivocally defined most haplotypes. We were able to assign a haplotype to 86% of samples that were either partially or misclassified using RFLPs. Phased data using only four SNPs allowed unequivocal assignment of a haplotype that was not always possible using a larger number of RFLPs. Given the availability of genome-wide SNP data, our method is rapid and does not require high computational resources.

  3. Computational intelligence in bioinformatics: SNP/haplotype data in genetic association study for common diseases.

    PubMed

    Kelemen, Arpad; Vasilakos, Athanasios V; Liang, Yulan

    2009-09-01

    Comprehensive evaluation of common genetic variations through association of single-nucleotide polymorphism (SNP) structure with common complex disease on a genome-wide scale is currently a hot area in human genome research due to the recent development of the Human Genome Project and HapMap Project. Computational science, which includes computational intelligence (CI), has recently become the third method of scientific enquiry besides theory and experimentation. There has been fast-growing interest in developing and applying CI in disease mapping using SNP and haplotype data. Some recent studies have demonstrated the promise and importance of CI for common complex diseases in genomic association studies using SNP/haplotype data, especially for tackling challenges such as gene-gene and gene-environment interactions and the notorious "curse of dimensionality" problem. This review covers recent developments of CI approaches for complex diseases in genetic association studies with SNP/haplotype data.

  4. Honey bee-inspired algorithms for SNP haplotype reconstruction problem

    NASA Astrophysics Data System (ADS)

    PourkamaliAnaraki, Maryam; Sadeghi, Mehdi

    2016-03-01

    Reconstructing haplotypes from SNP fragments is an important problem in computational biology. There has been a lot of interest in this field because haplotypes have been shown to contain promising data for disease association research. Haplotype reconstruction under the Minimum Error Correction model has been proved to be an NP-hard problem. Therefore, several methods such as clustering techniques, evolutionary algorithms, neural networks and swarm intelligence approaches have been proposed in order to solve this problem in reasonable time. In this paper, we focus on various evolutionary clustering techniques and try to find an efficient technique for solving the haplotype reconstruction problem. It can be inferred from our experiments that the clustering methods relying on the behaviour of honey bee colonies in nature, specifically the bees algorithm and artificial bee colony methods, are expected to result in more efficient solutions. An application program implementing the methods is available at the following link: http://www.bioinf.cs.ipm.ir/software/haprs/
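
    The Minimum Error Correction (MEC) objective mentioned above can be sketched in a few lines of Python. This shows only how a candidate haplotype pair is scored, not the bee-inspired search itself; the fragment encoding ('0'/'1' alleles, '-' for uncovered sites) and the example data are assumptions made for illustration.

```python
def mismatches(fragment, haplotype):
    """Count covered positions where the fragment disagrees with the haplotype."""
    return sum(1 for f, h in zip(fragment, haplotype) if f != "-" and f != h)


def mec_score(fragments, h1, h2):
    """Sum, over fragments, of the corrections needed to match the closer haplotype."""
    return sum(min(mismatches(f, h1), mismatches(f, h2)) for f in fragments)


# Hypothetical SNP fragments; '-' marks a site the fragment does not cover.
fragments = ["01-0", "0110", "10-1", "1001", "0100"]
h1, h2 = "0110", "1001"

score = mec_score(fragments, h1, h2)
```

    A search method of any kind (clustering, evolutionary or swarm-based) would then look for the haplotype pair that minimises this score.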

  5. High-Density SNP Genotyping to Define β-Globin Locus Haplotypes

    PubMed Central

    Liu, Li; Muralidhar, Shalini; Singh, Manisha; Sylvan, Caprice; Kalra, Inderdeep S.; Quinn, Charles T.; Onyekwere, Onyinye C.; Pace, Betty S.

    2014-01-01

    Five major β-globin locus haplotypes have been established in individuals with sickle cell disease (SCD) from the Benin, Bantu, Senegal, Cameroon, and Arab-Indian populations. Historically, β-haplotypes were established using restriction fragment length polymorphism (RFLP) analysis across the β-locus, which consists of five functional β-like globin genes located on chromosome 11. Previous attempts to correlate these haplotypes as robust predictors of clinical phenotypes observed in SCD have not been successful. We speculate that the coverage and distribution of the RFLP sites located proximal to or within the globin genes are not sufficiently dense to accurately reflect the complexity of this region. To test our hypothesis, we performed RFLP analysis and high-density single nucleotide polymorphism (SNP) genotyping across the β-locus using DNA samples from either healthy African Americans with normal hemoglobin A (HbAA) or individuals with homozygous SS (HbSS) disease. Using the genotyping data from 88 SNPs and Haploview analysis, we generated a greater number of haplotypes than that observed with RFLP analysis alone. Furthermore, a unique pattern of long-range linkage disequilibrium between the locus control region and the β-like globin genes was observed in the HbSS group. Interestingly, we observed multiple SNPs within the HindIII restriction site located in the Gγ-globin intervening sequence II which produced the same RFLP pattern. These findings illustrated the inability of RFLP analysis to decipher the complexity of sequence variations that impacts genomic structure in this region. Our data suggest that high density SNP mapping may be required to accurately define β-haplotypes that correlate with the different clinical phenotypes observed in SCD. PMID:18829352

  6. The clinical application of single-sperm-based SNP haplotyping for PGD of osteogenesis imperfecta.

    PubMed

    Chen, Linjun; Diao, Zhenyu; Xu, Zhipeng; Zhou, Jianjun; Yan, Guijun; Sun, Haixiang

    2018-05-15

    Osteogenesis imperfecta (OI) is a genetically heterogeneous disorder, presenting either autosomal dominant, autosomal recessive or X-linked inheritance patterns. The majority of OI cases are autosomal dominant and are caused by heterozygous mutations in either the COL1A1 or COL1A2 gene. In these dominant disorders, allele dropout (ADO) can lead to misdiagnosis in preimplantation genetic diagnosis (PGD). Polymorphic markers linked to the mutated genes have been used to establish haplotypes for identifying ADO and ensuring the accuracy of PGD. However, the haplotype of male patients cannot be determined without data from affected relatives. Here, we developed a method for single-sperm-based single-nucleotide polymorphism (SNP) haplotyping via next-generation sequencing (NGS) for the PGD of OI. After NGS, 10 informative polymorphic SNP markers located upstream and downstream of the COL1A1 gene and its pathogenic mutation site were linked to individual alleles in a single sperm from an affected male. After haplotyping, a normal blastocyst was transferred to the uterus for a subsequent frozen embryo transfer cycle. The accuracy of PGD was confirmed by amniocentesis at 19 weeks of gestation. A healthy infant weighing 4,250 g was born via vaginal delivery at the 40th week of gestation. Single-sperm-based SNP haplotyping can be applied for PGD of any monogenic disorders or de novo mutations in males in whom the haplotype of paternal mutations cannot be determined due to a lack of affected relatives. ADO: allele dropout; DI: dentinogenesis imperfecta; ESHRE: European Society of Human Reproduction and Embryology; FET: frozen embryo transfer; gDNA: genomic DNA; ICSI: intracytoplasmic sperm injection; IVF: in vitro fertilization; MDA: multiple displacement amplification; NGS: next-generation sequencing; OI: osteogenesis imperfecta; PBS: phosphate buffer saline; PCR: polymerase chain reaction; PGD: preimplantation genetic diagnosis; SNP: single-nucleotide polymorphism; STR

  7. Single nucleotide polymorphisms and haplotype frequencies of CYP3A5 in a Japanese population.

    PubMed

    Saeki, Mayumi; Saito, Yoshiro; Nakamura, Takahiro; Murayama, Norie; Kim, Su-Ryang; Ozawa, Shogo; Komamura, Kazuo; Ueno, Kazuyuki; Kamakura, Shiro; Nakajima, Toshiharu; Saito, Hirohisa; Kitamura, Yutaka; Kamatani, Naoyuki; Sawada, Jun-ichi

    2003-06-01

    In order to identify single nucleotide polymorphisms (SNPs) and haplotype frequencies of CYP3A5 in a Japanese population, we sequenced the proximal promoter region, all exons, and the surrounding intronic regions using genomic DNA from 187 Japanese subjects. Thirteen SNPs, including seven novel ones: 13108T>C, 16025A>G, 16903A>G, 16993C>G, 27448C>A, 29782A>G, and 31551T>C (A of the translational start codon of GenBank Accession # NG_000004.2 is numbered 1 according to the CYP Allele Nomenclature), were identified. The most common SNP was 6986A>G (key SNP for CYP3A5*3), with a 0.759 frequency. Two novel SNPs, 29782A>G (I456V) and 31551T>C (I488T), as well as 12952T>C (*5 marker) were found, but these alterations were always associated with the *3A marker SNPs, 6986A>G and 31611C>T. Using these 13 SNPs, haplotype analysis was performed and five novel *1 haplotypes (subtypes) (*1e to *1i) and six novel *3 haplotypes (subtypes) (*3d to *3i) were identified. Our findings suggest that CYP3A5*3 is the major defective allele and that other functional exonic SNPs are rare in the Japanese. Copyright 2003 Wiley-Liss, Inc.

  8. Extensive population structure in San, Khoe, and mixed ancestry populations from southern Africa revealed by 44 short 5-SNP haplotypes.

    PubMed

    Schlebusch, Carina M; Soodyall, Himla

    2012-12-01

    The San and Khoe people currently represent remnant groups of a much larger and widely distributed population of hunter-gatherers and pastoralists who had exclusive occupation of southern Africa before the arrival of Bantu-speaking groups in the past 1,200 years and sea-borne immigrants within the last 350 years. Genetic studies [mitochondrial deoxyribonucleic acid (DNA) and Y-chromosome] conducted on San and Khoe groups revealed that they harbor some of the most divergent lineages found in living peoples throughout the world. Recently, high-density, autosomal, single-nucleotide polymorphism (SNP)-array studies confirmed the early divergence of Khoe-San population groups from all other human populations. The present study made use of 220 autosomal SNP markers (in the format of both haplotypes and genotypes) to examine the population structure of various San and Khoe groups and their relationship to other neighboring groups. Whereas analyses based on the genotypic SNP data only supported the division of the included populations into three main groups (Khoe-San, Bantu-speakers, and non-African populations), haplotype analyses revealed finer structure within Khoe-San populations. By the use of only 44 short SNP haplotypes (compiled from a total of 220 SNPs), most of the Khoe-San groups could be resolved as separate groups by applying STRUCTURE analyses. Therefore, by carefully selecting a few SNPs and combining them into haplotypes, we were able to achieve the same level of population distinction that was achieved previously in high-density SNP studies on the same population groups. Using haplotypes proved to be a very efficient and cost-effective way to study population structure. Copyright © 2013 Wayne State University Press, Detroit, Michigan 48201-1309.

  9. Genetic differences in the two main groups of the Japanese population based on autosomal SNPs and haplotypes.

    PubMed

    Yamaguchi-Kabata, Yumi; Tsunoda, Tatsuhiko; Kumasaka, Natsuhiko; Takahashi, Atsushi; Hosono, Naoya; Kubo, Michiaki; Nakamura, Yusuke; Kamatani, Naoyuki

    2012-05-01

    Although the Japanese population has a rather low genetic diversity, we recently confirmed the presence of two main clusters (the Hondo and Ryukyu clusters) through principal component analysis of genome-wide single-nucleotide polymorphism (SNP) genotypes. Understanding the genetic differences between the two main clusters requires further genome-wide analyses based on a dense SNP set and comparison of haplotype frequencies. In the present study, we determined haplotypes for the Hondo cluster of the Japanese population by detecting SNP homozygotes with 388,591 autosomal SNPs from 18,379 individuals and estimated the haplotype frequencies. Haplotypes for the Ryukyu cluster were inferred by a statistical approach using the genotype data from 504 individuals. We then compared the haplotype frequencies between the Hondo and Ryukyu clusters. In most genomic regions, the haplotype frequencies in the Hondo and Ryukyu clusters were very similar. However, in addition to the human leukocyte antigen region on chromosome 6, other genomic regions (chromosomes 3, 4, 5, 7, 10 and 12) showed dissimilarities in haplotype frequency. These regions were enriched for genes involved in the immune system, cell-cell adhesion and the intracellular signaling cascade. These differentiated genomic regions between the Hondo and Ryukyu clusters are of interest because they (1) should be examined carefully in association studies and (2) likely contain genes responsible for morphological or physiological differences between the two groups.

  10. Haplotype diversity in 11 candidate genes across four populations.

    PubMed

    Beaty, T H; Fallin, M D; Hetmanski, J B; McIntosh, I; Chong, S S; Ingersoll, R; Sheng, X; Chakraborty, R; Scott, A F

    2005-09-01

    Analysis of haplotypes based on multiple single-nucleotide polymorphisms (SNP) is becoming common for both candidate gene and fine-mapping studies. Before embarking on studies of haplotypes from genetically distinct populations, however, it is important to consider variation both in linkage disequilibrium (LD) and in haplotype frequencies within and across populations, as both vary. Such diversity will influence the choice of "tagging" SNPs for candidate gene or whole-genome association studies because some markers will not be polymorphic in all samples and some haplotypes will be poorly represented or completely absent. Here we analyze 11 genes, originally chosen as candidate genes for oral clefts, where multiple markers were genotyped on individuals from four populations. Estimated haplotype frequencies, measures of pairwise LD, and genetic diversity were computed for 135 European-Americans, 57 Chinese-Singaporeans, 45 Malay-Singaporeans, and 46 Indian-Singaporeans. Patterns of pairwise LD were compared across these four populations and haplotype frequencies were used to assess genetic variation. Although these populations are fairly similar in allele frequencies and overall patterns of LD, both haplotype frequencies and genetic diversity varied significantly across populations. Such haplotype diversity has implications for designing studies of association involving samples from genetically distinct populations.
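
    For readers unfamiliar with the pairwise LD measures referred to above, the following Python sketch computes D, D' and r² for two biallelic SNPs from haplotype counts. The counts are invented for illustration, and the function name is an assumption, not something taken from the study.

```python
def pairwise_ld(n_AB, n_Ab, n_aB, n_ab):
    """Return (D, D', r^2) for two biallelic SNPs given the four haplotype counts."""
    total = n_AB + n_Ab + n_aB + n_ab
    p_AB = n_AB / total
    p_A = (n_AB + n_Ab) / total
    p_B = (n_AB + n_aB) / total
    denom = p_A * (1 - p_A) * p_B * (1 - p_B)
    if denom == 0:
        raise ValueError("both SNPs must be polymorphic in the sample")
    d = p_AB - p_A * p_B
    # D' rescales D by its maximum attainable magnitude given the allele frequencies.
    if d >= 0:
        d_max = min(p_A * (1 - p_B), (1 - p_A) * p_B)
    else:
        d_max = min(p_A * p_B, (1 - p_A) * (1 - p_B))
    return d, d / d_max, d * d / denom


# Hypothetical haplotype counts (AB, Ab, aB, ab), invented for illustration.
d, d_prime, r2 = pairwise_ld(40, 10, 10, 40)
```

    The explicit check for monomorphic markers reflects the point made in the abstract that some markers will not be polymorphic in every sample.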

  11. Y chromosomal haplotype characteristics of domestic sheep (Ovis aries) in China.

    PubMed

    Wang, Yutao; Xu, Lei; Yan, Wei; Li, Shaobin; Wang, Jiqing; Liu, Xiu; Hu, Jiang; Luo, Yuzhu

    2015-07-10

    Investigations on the variation present at the male-specific Y chromosome region provide strong information to understand the origin and evolution of domestic sheep. One SNP, OY1 (g.88A>G), in the upstream region of the SRY gene, and the microsatellite SRYM18 locus within the ovine Y chromosome were analyzed in one hundred and forty five samples collected from eleven breeds in China. SNP OY1 was analyzed using the PCR-SSCP method and sequencing. Two different PCR-SSCP patterns representing two specific sequences were observed, with sequence analysis revealing SNP OY1 (g.88A>G); allele A-OY1 was the most common (82.8%). Sequencing of the SRYM18 region revealed one novel size fragment (A2) with different repetitive units. Seven haplotypes (H4, H5, H6, H7, H8, H9 and H12) and two novel haplotypes (Ha and Hb) were established using combined genotype analysis. H6 showed the highest frequency (43.4%) across all breeds, and H8 showed the second highest frequency (24.1%). Ha was only found in one breed (Tan), while Hb was present in three breeds (Gansu alpine, White Suffolk and Duolang). Our findings reveal one novel allele in the SRYM18 region and two novel male haplotypes of domestic sheep in China. Copyright © 2015 Elsevier B.V. All rights reserved.

  12. Intricacies in arrangement of SNP haplotypes suggest "Great Admixture" that created modern humans.

    PubMed

    Dutta, Rajib; Mainsah, Joseph; Yatskiv, Yuriy; Chakrabortty, Sharmistha; Brennan, Patrick; Khuder, Basil; Qiu, Shuhao; Fedorova, Larisa; Fedorov, Alexei

    2017-06-05

    Inferring history from genomic sequences is challenging and problematic because chromosomes are mosaics of thousands of small identical-by-descent (IBD) fragments, each of them having their own unique story. However, the main events in recent evolution might be deciphered from comparative analysis of numerous loci. A paradox of why humans, whose effective population size is only 10^4, have nearly three million frequent SNPs is formulated and examined. We studied 5398 loci evenly covering all human autosomes. Common haplotypes built from frequent SNPs that are present in people from various populations have been examined. We demonstrated highly non-random arrangement of alleles in common haplotypes. Abundance of mutually exclusive pairs of common haplotypes that have different alleles at every polymorphic position (so-called Yin/Yang haplotypes) was found in 56% of loci. A novel widely spread category of common haplotypes named Mosaic has been described. Mosaic consists of numerous pieces of Yin/Yang haplotypes and represents an ancestral stage of one of them. Scenarios of possible appearance of large number of frequent human SNPs and their habitual arrangement in Yin/Yang common haplotypes have been evaluated with an advanced genomic simulation algorithm. Computer modeling demonstrated that the observed arrangement of 2.9 million frequent SNPs could not originate from a sole stand-alone population. A "Great Admixture" event has been proposed that can explain peculiarities with frequent SNP distributions. This Great Admixture presumably occurred 100-300 thousand years ago between two ancestral populations that had been separated from each other about a million years ago. Our programs and algorithms can be applied to other species to perform evolutionary and comparative genomics.
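
    The Yin/Yang definition given above (two common haplotypes that differ at every polymorphic position) lends itself to a short check. The Python sketch below is an illustration under stated assumptions, not the authors' program: haplotypes are assumed to be equal-length strings listing alleles at polymorphic positions only, and the example haplotypes are invented.

```python
from itertools import combinations


def is_yin_yang(hap_a, hap_b):
    """True if the two haplotypes carry different alleles at every position."""
    return len(hap_a) == len(hap_b) and all(a != b for a, b in zip(hap_a, hap_b))


def yin_yang_pairs(common_haplotypes):
    """List every pair of haplotypes at a locus that is fully mismatched."""
    return [(a, b) for a, b in combinations(common_haplotypes, 2) if is_yin_yang(a, b)]


# Hypothetical common haplotypes at one locus, invented for illustration.
locus_haplotypes = ["ACGT", "GTAC", "ACAC", "GTGT"]
pairs = yin_yang_pairs(locus_haplotypes)
```

    Running such a check per locus and counting the loci with at least one pair would give a proportion comparable in kind to the 56% reported above.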

  13. Integrating sequence and array data to create an improved 1000 Genomes Project haplotype reference panel.

    PubMed

    Delaneau, Olivier; Marchini, Jonathan

    2014-06-13

    A major use of the 1000 Genomes Project (1000 GP) data is genotype imputation in genome-wide association studies (GWAS). Here we develop a method to estimate haplotypes from low-coverage sequencing data that can take advantage of single-nucleotide polymorphism (SNP) microarray genotypes on the same samples. First the SNP array data are phased to build a backbone (or 'scaffold') of haplotypes across each chromosome. We then phase the sequence data 'onto' this haplotype scaffold. This approach can take advantage of relatedness between sequenced and non-sequenced samples to improve accuracy. We use this method to create a new 1000 GP haplotype reference set for use by the human genetic community. Using a set of validation genotypes at SNP and bi-allelic indels we show that these haplotypes have lower genotype discordance and improved imputation performance into downstream GWAS samples, especially at low-frequency variants.

  14. Elucidation of the ‘Honeycrisp’ pedigree through haplotype analysis with a multi-family integrated SNP linkage map and a large apple (Malus×domestica) pedigree-connected SNP data set

    PubMed Central

    Howard, Nicholas P; van de Weg, Eric; Bedford, David S; Peace, Cameron P; Vanderzande, Stijn; Clark, Matthew D; Teh, Soon Li; Cai, Lichun; Luby, James J

    2017-01-01

    The apple (Malus×domestica) cultivar Honeycrisp has become important economically and as a breeding parent. An earlier study with SSR markers indicated the original recorded pedigree of ‘Honeycrisp’ was incorrect and ‘Keepsake’ was identified as one putative parent, the other being unknown. The objective of this study was to verify ‘Keepsake’ as a parent and identify and genetically describe the unknown parent and its grandparents. A multi-family based dense and high-quality integrated SNP map was created using the apple 8 K Illumina Infinium SNP array. This map was used alongside a large pedigree-connected data set from the RosBREED project to build extended SNP haplotypes and to identify pedigree relationships. ‘Keepsake’ was verified as one parent of ‘Honeycrisp’ and ‘Duchess of Oldenburg’ and ‘Golden Delicious’ were identified as grandparents through the unknown parent. Following this finding, siblings of ‘Honeycrisp’ were identified using the SNP data. Breeding records from several of these siblings suggested that the previously unreported parent is a University of Minnesota selection, MN1627. This selection is no longer available, but now is genetically described through imputed SNP haplotypes. We also present the mosaic grandparental composition of ‘Honeycrisp’ for each of its 17 chromosome pairs. This new pedigree and genetic information will be useful in future pedigree-based genetic studies to connect ‘Honeycrisp’ with other cultivars used widely in apple breeding programs. The created SNP linkage map will benefit future research using the data from the Illumina apple 8 and 20 K and Affymetrix 480 K SNP arrays. PMID:28243452

  15. A combined long-range phasing and long haplotype imputation method to impute phase for SNP genotypes

    PubMed Central

    2011-01-01

    Background: Knowing the phase of marker genotype data can be useful in genome-wide association studies, because it makes it possible to use analysis frameworks that account for identity by descent or parent of origin of alleles and it can lead to a large increase in data quantities via genotype or sequence imputation. Long-range phasing and haplotype library imputation constitute a fast and accurate method to impute phase for SNP data. Methods: A long-range phasing and haplotype library imputation algorithm was developed. It combines information from surrogate parents and long haplotypes to resolve phase in a manner that is not dependent on the family structure of a dataset or on the presence of pedigree information. Results: The algorithm performed well in both simulated and real livestock and human datasets in terms of both phasing accuracy and computation efficiency. The percentage of alleles that could be phased in both simulated and real datasets of varying size generally exceeded 98% while the percentage of alleles incorrectly phased in simulated data was generally less than 0.5%. The accuracy of phasing was affected by dataset size, with lower accuracy for dataset sizes less than 1000, but was not affected by effective population size, family data structure, presence or absence of pedigree information, and SNP density. The method was computationally fast. In comparison to a commonly used statistical method (fastPHASE), the current method made about 8% fewer phasing mistakes and ran about 26 times faster for a small dataset. For larger datasets, the differences in computational time are expected to be even greater. A computer program implementing these methods has been made available. Conclusions: The algorithm and software developed in this study make feasible the routine phasing of high-density SNP chips in large datasets. PMID:21388557

  16. Haplotype-Based Genotyping in Polyploids.

    PubMed

    Clevenger, Josh P; Korani, Walid; Ozias-Akins, Peggy; Jackson, Scott

    2018-01-01

    Accurate identification of polymorphisms from sequence data is crucial to unlocking the potential of high throughput sequencing for genomics. Single nucleotide polymorphisms (SNPs) are difficult to accurately identify in polyploid crops due to the duplicative nature of polyploid genomes leading to low confidence in the true alignment of short reads. Implementing a haplotype-based method in contrasting subgenome-specific sequences leads to higher accuracy of SNP identification in polyploids. To test this method, a large-scale 48K SNP array (Axiom Arachis2) was developed for Arachis hypogaea (peanut), an allotetraploid, in which 1,674 haplotype-based SNPs were included. Results of the array show that 74% of the haplotype-based SNP markers could be validated, which is considerably higher than previous methods used for peanut. The haplotype method has been implemented in a standalone program, HAPLOSWEEP, which takes as input bam files and a vcf file and identifies haplotype-based markers. Haplotype discovery can be made within single reads or span paired reads, and can leverage long read technology by targeting any length of haplotype. Haplotype-based genotyping is applicable in all allopolyploid genomes and provides confidence in marker identification and in silico-based genotyping for polyploid genomics.

  17. Linear reduction methods for tag SNP selection.

    PubMed

    He, Jingwu; Zelikovsky, Alex

    2004-01-01

    It is widely hoped that constructing a complete human haplotype map will help to associate complex diseases with certain SNP's. Unfortunately, the number of SNP's is huge and it is very costly to sequence many individuals. Therefore, it is desirable to reduce the number of SNP's that should be sequenced to a considerably smaller number of informative representatives, so-called tag SNP's. In this paper, we propose a new linear algebra based method for selecting and using tag SNP's. Our method is purely combinatorial and can be combined with linkage disequilibrium (LD) and block based methods. We measure the quality of our tag SNP selection algorithm by comparing actual SNP's with SNP's linearly predicted from linearly chosen tag SNP's. We obtain extremely good compression and prediction rates. For example, for long haplotypes (>25000 SNP's), knowing only 0.4% of all SNP's we predict the entire unknown haplotype with 2% accuracy while the prediction method is based on a 10% sample of the population.

  18. Improved imputation of low-frequency and rare variants using the UK10K haplotype reference panel.

    PubMed

    Huang, Jie; Howie, Bryan; McCarthy, Shane; Memari, Yasin; Walter, Klaudia; Min, Josine L; Danecek, Petr; Malerba, Giovanni; Trabetti, Elisabetta; Zheng, Hou-Feng; Gambaro, Giovanni; Richards, J Brent; Durbin, Richard; Timpson, Nicholas J; Marchini, Jonathan; Soranzo, Nicole

    2015-09-14

    Imputing genotypes from reference panels created by whole-genome sequencing (WGS) provides a cost-effective strategy for augmenting the single-nucleotide polymorphism (SNP) content of genome-wide arrays. The UK10K Cohorts project has generated a data set of 3,781 whole genomes sequenced at low depth (average 7x), aiming to exhaustively characterize genetic variation down to 0.1% minor allele frequency in the British population. Here we demonstrate the value of this resource for improving imputation accuracy at rare and low-frequency variants in both a UK and an Italian population. We show that large increases in imputation accuracy can be achieved by re-phasing WGS reference panels after initial genotype calling. We also present a method for combining WGS panels to improve variant coverage and downstream imputation accuracy, which we illustrate by integrating 7,562 WGS haplotypes from the UK10K project with 2,184 haplotypes from the 1000 Genomes Project. Finally, we introduce a novel approximation that maintains speed without sacrificing imputation accuracy for rare variants.

  19. Haplotypes of CYP3A4 and their close linkage with CYP3A5 haplotypes in a Japanese population.

    PubMed

    Fukushima-Uesaka, Hiromi; Saito, Yoshiro; Watanabe, Hidemi; Shiseki, Kisho; Saeki, Mayumi; Nakamura, Takahiro; Kurose, Kouichi; Sai, Kimie; Komamura, Kazuo; Ueno, Kazuyuki; Kamakura, Shiro; Kitakaze, Masafumi; Hanai, Sotaro; Nakajima, Toshiharu; Matsumoto, Kenji; Saito, Hirohisa; Goto, Yu-ichi; Kimura, Hideo; Katoh, Masaaki; Sugai, Kenji; Minami, Narihiro; Shirao, Kuniaki; Tamura, Tomohide; Yamamoto, Noboru; Minami, Hironobu; Ohtsu, Atsushi; Yoshida, Teruhiko; Saijo, Nagahiro; Kitamura, Yutaka; Kamatani, Naoyuki; Ozawa, Shogo; Sawada, Jun-ichi

    2004-01-01

    In order to identify single nucleotide polymorphisms (SNPs) and haplotype frequencies of CYP3A4 in a Japanese population, the distal enhancer and proximal promoter regions, all exons, and the surrounding introns were sequenced from genomic DNA of 416 Japanese subjects. We found 24 SNPs, including 17 novel ones: two in the distal enhancer, four in the proximal promoter, one in the 5'-untranslated region (UTR), seven in the introns, and three in the 3'-UTR. The most common SNP was c.1026+12G>A (IVS10+12G>A), with a 0.249 frequency. Four non-synonymous SNPs, c.554C>G (p.T185S, CYP3A4*16), c.830_831insA (p.E277fsX8, *6), c.878T>C (p.L293P, *18), and c.1088C>T (p.T363M, *11) were found with frequencies of 0.014, 0.001, 0.028, and 0.002, respectively. No SNP was found in the known nuclear transcriptional factor-binding sites in the enhancer and promoter regions. Using these 24 SNPs, 16 haplotypes were unambiguously identified, and nine haplotypes were inferred with the aid of an expectation-maximization-based program. In addition, using data from 186 subjects enabled a close linkage to be found between CYP3A4 and CYP3A5 SNPs, especially among the SNPs at c.1026+12 in CYP3A4 and c.219-237 (IVS3-237, a key SNP site for CYP3A5*3), c.865+77 (IVS9+77) and c.1523 in CYP3A5. This result suggested that CYP3A4 and CYP3A5 are within the same gene block. Haplotype analysis between CYP3A4 and CYP3A5 revealed several major haplotype combinations in the CYP3A4-CYP3A5 block. Our findings provide fundamental and useful information for genotyping CYP3A4 (and CYP3A5) in the Japanese, and probably Asian populations. Copyright 2003 Wiley-Liss, Inc.
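
    The abstract does not say which expectation-maximization program was used, so the following is only a generic two-SNP sketch of the idea in Python. Genotypes are coded as counts of the alternative allele (0, 1 or 2) at each SNP; only double heterozygotes have ambiguous phase. The function names, the coding and the example genotypes are all assumptions made for illustration.

```python
def alleles(genotype):
    """Expand an alt-allele count (0, 1 or 2) into the two alleles carried."""
    return {0: (0, 0), 1: (0, 1), 2: (1, 1)}[genotype]


def em_haplotype_frequencies(genotypes, iterations=50):
    """Estimate two-SNP haplotype frequencies from unphased genotypes by EM."""
    haps = ("00", "01", "10", "11")
    freqs = {h: 0.25 for h in haps}
    n_chromosomes = 2 * len(genotypes)
    for _ in range(iterations):
        counts = {h: 0.0 for h in haps}
        for g1, g2 in genotypes:
            if g1 == 1 and g2 == 1:
                # E-step: split the double heterozygote between its two phasings.
                cis = freqs["00"] * freqs["11"]
                trans = freqs["01"] * freqs["10"]
                w = cis / (cis + trans) if cis + trans > 0 else 0.5
                counts["00"] += w
                counts["11"] += w
                counts["01"] += 1 - w
                counts["10"] += 1 - w
            else:
                # Phase is unambiguous when at most one SNP is heterozygous.
                for a, b in zip(alleles(g1), alleles(g2)):
                    counts[f"{a}{b}"] += 1
        # M-step: re-estimate frequencies from the expected counts.
        freqs = {h: counts[h] / n_chromosomes for h in haps}
    return freqs


# Hypothetical genotypes, invented for illustration.
genotypes = [(0, 0)] * 4 + [(2, 2)] * 4 + [(1, 1)] * 2
estimated = em_haplotype_frequencies(genotypes)
```

    In this toy sample the homozygotes carry only the "00" and "11" haplotypes, so the EM iterations assign the double heterozygotes almost entirely to that phasing.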

  20. [Relationship between genetic polymorphisms of 3 SNP loci in 5-HTT gene and paranoid schizophrenia].

    PubMed

    Xuan, Jin-Feng; Ding, Mei; Pang, Hao; Xing, Jia-Xin; Sun, Yi-Hua; Yao, Jun; Zhao, Yi; Li, Chun-Mei; Wang, Bao-Jie

    2012-12-01

    This study investigated the population genetic data of three SNP loci (rs25533, rs34388196 and rs1042173) of the 5-hydroxytryptamine transporter (5-HTT) gene and their association with paranoid schizophrenia. The three SNP loci were examined in 132 paranoid schizophrenia patients and 150 unrelated healthy individuals of the Northern Chinese Han population by the PCR-RFLP technique. Hardy-Weinberg equilibrium was tested using the chi-square test, and the haplotype frequencies and population genetics parameters were statistically analyzed. Four haplotypes were obtained from these three SNP loci. There were no statistically significant differences between the patient group and the control group (P > 0.05). The DP values of the three SNP loci were 0.276, 0.502 and 0.502. Their PIC values were 0.151, 0.281 and 0.281. Their PE values were 0.014, 0.072 and 0.072. The three SNP loci and four haplotypes of the 5-HTT gene showed no association with paranoid schizophrenia, although the polymorphisms still have high potential for application in forensic practice.
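
    The Hardy-Weinberg chi-square test named in this abstract can be sketched as follows for a single biallelic locus. The function name and the genotype counts are illustrative assumptions and are not data from the study. The p-value uses the identity that the upper tail of a chi-square distribution with one degree of freedom equals erfc(sqrt(x/2)).

```python
import math


def hwe_chi_square(n_aa, n_ab, n_bb):
    """Chi-square test of Hardy-Weinberg equilibrium for one biallelic locus.

    n_aa, n_ab, n_bb are observed genotype counts (hypothetical inputs).
    Returns (chi2, p_value) with one degree of freedom.
    """
    n = n_aa + n_ab + n_bb
    p = (2 * n_aa + n_ab) / (2 * n)   # frequency of allele A
    q = 1.0 - p
    observed = (n_aa, n_ab, n_bb)
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    # Survival function of chi-square with 1 df.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


# Made-up counts for illustration only.
print(hwe_chi_square(25, 50, 25))   # genotype counts match HWE proportions
print(hwe_chi_square(50, 0, 50))    # complete absence of heterozygotes
```

    With small expected counts an exact test is usually preferred over the chi-square approximation; the sketch does not cover that case.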

  1. Haplotype-based approach to known MS-associated regions increases the amount of explained risk

    PubMed Central

    Khankhanian, Pouya; Gourraud, Pierre-Antoine; Lizee, Antoine; Goodin, Douglas S

    2015-01-01

    Genome-wide association studies (GWAS), using single nucleotide polymorphisms (SNPs), have yielded 110 non-human leucocyte antigen genomic regions that are associated with multiple sclerosis (MS). Despite this large number of associations, however, only 28% of MS-heritability can currently be explained. Here we compare the use of multi-SNP-haplotypes to the use of single-SNPs as alternative methods to describe MS genetic risk. SNP-haplotypes (of various lengths from 1 up to 15 contiguous SNPs) were constructed at each of the 110 previously identified, MS-associated, genomic regions. Even after correcting for the larger number of statistical comparisons made when using the haplotype-method, in 32 of the regions, the SNP-haplotype based model was markedly more significant than the single-SNP based model. By contrast, in no region was the single-SNP based model similarly more significant than the SNP-haplotype based model. Moreover, when we included the 932 MS-associated SNP-haplotypes (that we identified from 102 regions) as independent variables into a logistic linear model, the amount of MS-heritability, as assessed by Nagelkerke's R-squared, was 38%, which was considerably better than 29%, which was obtained by using only single-SNPs. This study demonstrates that SNP-haplotypes can be used to fine-map the genetic associations within regions of interest previously identified by single-SNP GWAS. Moreover, the amount of the MS genetic risk explained by the SNP-haplotype associations in the 110 MS-associated genomic regions was considerably greater when using SNP-haplotypes than when using single-SNPs. Also, the use of SNP-haplotypes can lead to the discovery of new regions of interest, which have not been identified by a single-SNP GWAS. PMID:26185143
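
    The abstract reports explained risk as Nagelkerke's R-squared. As a hedged sketch of how that statistic is computed from the log-likelihoods of a fitted logistic model and of the intercept-only model, the Python below uses hypothetical function names and made-up sample sizes; it does not reproduce the study's analysis.

```python
import math


def null_log_likelihood(n_cases, n_controls):
    """Log-likelihood of an intercept-only logistic model."""
    n = n_cases + n_controls
    p = n_cases / n
    return n_cases * math.log(p) + n_controls * math.log(1.0 - p)


def nagelkerke_r2(ll_null, ll_model, n):
    """Nagelkerke's R-squared from null and fitted log-likelihoods.

    Cox-Snell R2 = 1 - exp(2 * (ll_null - ll_model) / n), rescaled by its
    maximum attainable value 1 - exp(2 * ll_null / n).
    """
    cox_snell = 1.0 - math.exp(2.0 * (ll_null - ll_model) / n)
    max_cox_snell = 1.0 - math.exp(2.0 * ll_null / n)
    return cox_snell / max_cox_snell


# Illustrative numbers only: 50 cases, 50 controls, and an assumed fitted
# log-likelihood halfway between the null model and a perfect fit.
ll0 = null_log_likelihood(50, 50)
print(nagelkerke_r2(ll0, ll0 / 2.0, 100))
```

    The rescaling is what lets the statistic reach 1 for a perfectly fitting model, which the unscaled Cox-Snell version cannot do for a binary outcome.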

  2. Haplotype analysis of sucrose synthase gene family in three Saccharum species

    PubMed Central

    2013-01-01

    Background: Sugarcane is an economically important crop contributing about 80% and 40% to the world sugar and ethanol production, respectively. The complicated genetics consequential to its complex polyploid genome, however, have impeded efforts to improve sugar yield and related important agronomic traits. Modern sugarcane cultivars are complex hybrids derived mainly from crosses among its progenitor species, S. officinarum and S. spontaneum, and to a lesser degree, S. robustum. Atypical of higher plants, sugarcane stores its photoassimilates as sucrose rather than as starch in its parenchymous stalk cells. In the sugar biosynthesis pathway, sucrose synthase (SuSy, UDP-glucose: D-fructose 2-a-D-glucosyltransferase, EC 2.4.1.13) is a key enzyme in the regulation of sucrose accumulation and partitioning by catalyzing the reversible conversion of sucrose and UDP into UDP-glucose and fructose. However, little is known about the sugarcane SuSy gene family members and hence no definitive studies have been reported regarding allelic diversity of SuSy gene families in Saccharum species. Results: We identified and characterized a total of five sucrose synthase genes in the three sugarcane progenitor species through gene annotation and PCR haplotype analysis by analyzing 70 to 119 PCR fragments amplified from intron-containing target regions. We detected all but one (i.e. ScSuSy5) of ScSuSy transcripts in five tissue types of three Saccharum species. The average SNP frequency was one SNP per 108 bp, 81 bp, and 72 bp in S. officinarum, S. robustum, and S. spontaneum respectively. The average number of shared SNPs is 15 between S. officinarum and S. robustum, 7 between S. officinarum and S. spontaneum, and 11 between S. robustum and S. spontaneum. We identified 27, 35, and 32 haplotypes from the five ScSuSy genes in S. officinarum, S. robustum, and S. spontaneum respectively. Also, 12, 11, and 9 protein sequences were translated from the haplotypes in S. officinarum, S. robustum, S

  3. Haplotype analysis of sucrose synthase gene family in three Saccharum species.

    PubMed

    Zhang, Jisen; Arro, Jie; Chen, Youqiang; Ming, Ray

    2013-05-10

    Sugarcane is an economically important crop contributing about 80% and 40% to the world sugar and ethanol production, respectively. The complicated genetics consequential to its complex polyploid genome, however, have impeded efforts to improve sugar yield and related important agronomic traits. Modern sugarcane cultivars are complex hybrids derived mainly from crosses among its progenitor species, S. officinarum and S. spontaneum, and to a lesser degree, S. robustum. Atypical of higher plants, sugarcane stores its photoassimilates as sucrose rather than as starch in its parenchymous stalk cells. In the sugar biosynthesis pathway, sucrose synthase (SuSy, UDP-glucose: D-fructose 2-a-D-glucosyltransferase, EC 2.4.1.13) is a key enzyme in the regulation of sucrose accumulation and partitioning by catalyzing the reversible conversion of sucrose and UDP into UDP-glucose and fructose. However, little is known about the sugarcane SuSy gene family members and hence no definitive studies have been reported regarding allelic diversity of SuSy gene families in Saccharum species. We identified and characterized a total of five sucrose synthase genes in the three sugarcane progenitor species through gene annotation and PCR haplotype analysis by analyzing 70 to 119 PCR fragments amplified from intron-containing target regions. We detected all but one (i.e. ScSuSy5) of ScSuSy transcripts in five tissue types of three Saccharum species. The average SNP frequency was one SNP per 108 bp, 81 bp, and 72 bp in S. officinarum, S. robustum, and S. spontaneum respectively. The average number of shared SNPs is 15 between S. officinarum and S. robustum, 7 between S. officinarum and S. spontaneum, and 11 between S. robustum and S. spontaneum. We identified 27, 35, and 32 haplotypes from the five ScSuSy genes in S. officinarum, S. robustum, and S. spontaneum respectively. Also, 12, 11, and 9 protein sequences were translated from the haplotypes in S. officinarum, S. robustum, S. spontaneum

  4. [Genetic Variability and Structure of SNP Haplotypes in the DMPK Gene in Yakuts and Other Ethnic Groups of Northern Eurasia in Relation to Myotonic Dystrophy].

    PubMed

    Swarovskaya, M G; Stepanova, S K; Marussin, A V; Sukhomyasova, A L; Maximova, N R; Stepanov, V A

    2015-06-01

    The genetic variability of the DMPK locus has been studied in relation to six SNP markers (rs2070736, rs572634, rs1799894, rs527221, rs915915, and rs10415988) in Yakuts with myotonic dystrophy (MD), in the Yakut population, and in populations of northern Eurasia. Significant differences were observed in the allele frequencies between patients and a population sample of Yakuts for three SNP loci (rs915915, rs1799894, and rs10415988) associated with a high chance of disease manifestation. The odds ratios (OR) of MD development in representatives of the Yakut population for these three loci were 2.59 (95% CI, p = 0.004), 4.99 (95% CI, p = 0.000), and 3.15 (95% CI, p = 0.01), respectively. Haplotype TTTCTC, which is associated with MD, and haplotype GTCCTT, which was observed only in Yakut MD patients (never in MD patients of non-Yakut origin), were revealed. A low level of variability in the locus of the DMPK gene in Yakuts (H(e) = 0.283) compared with other examined populations was noted. An analysis of pairwise genetic relationships between populations revealed their significant differentiation for all the examined loci. In addition, a low level of differentiation in territorial groups of Yakut populations (F(ST) = 0.79%), which was related to the high subdivision of the northern Eurasian population (F(ST) = 11.83%), was observed.
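
    The abstract gives odds ratios with "95% CI" but the interval bounds are not present in this record. As a sketch of how an odds ratio and its confidence interval are commonly computed from a 2x2 allele-count table, the following Python uses the log-scale (Woolf) approximation. The counts are invented for illustration, and whether the authors used this method is not stated.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio and approximate 95% CI from a 2x2 table (Woolf method).

    a: cases carrying the allele      b: cases not carrying it
    c: controls carrying the allele   d: controls not carrying it
    All four counts are hypothetical inputs and must be non-zero.
    """
    odds_ratio = (a * d) / (b * c)
    se_log_or = math.sqrt(1.0 / a + 1.0 / b + 1.0 / c + 1.0 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Invented counts, not taken from the study.
print(odds_ratio_ci(20, 10, 10, 20))
```

    An interval that excludes 1 corresponds to a significant association at roughly the 5% level under this approximation.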

  5. SNP-VISTA: An interactive SNP visualization tool

    PubMed Central

    Shah, Nameeta; Teplitsky, Michael V; Minovitsky, Simon; Pennacchio, Len A; Hugenholtz, Philip; Hamann, Bernd; Dubchak, Inna L

    2005-01-01

    Background: Recent advances in sequencing technologies promise to provide a better understanding of the genetics of human disease as well as the evolution of microbial populations. Single Nucleotide Polymorphisms (SNPs) are established genetic markers that aid in the identification of loci affecting quantitative traits and/or disease in a wide variety of eukaryotic species. With today's technological capabilities, it has become possible to re-sequence a large set of appropriate candidate genes in individuals with a given disease in an attempt to identify causative mutations. In addition, SNPs have been used extensively in efforts to study the evolution of microbial populations, and the recent application of random shotgun sequencing to environmental samples enables more extensive SNP analysis of co-occurring and co-evolving microbial populations. The program is available at [1]. Results: We have developed and present two modifications of an interactive visualization tool, SNP-VISTA, to aid in the analyses of the following types of data: A. Large-scale re-sequence data of disease-related genes for discovery of associated and/or causative alleles (GeneSNP-VISTA). B. Massive amounts of ecogenomics data for studying homologous recombination in microbial populations (EcoSNP-VISTA). The main features and capabilities of SNP-VISTA are: 1) mapping of SNPs to gene structure; 2) classification of SNPs, based on their location in the gene, frequency of occurrence in samples and allele composition; 3) clustering, based on user-defined subsets of SNPs, highlighting haplotypes as well as recombinant sequences; 4) integration of protein evolutionary conservation visualization; and 5) display of automatically calculated recombination points that are user-editable. Conclusion: The main strength of SNP-VISTA is its graphical interface and use of visual representations, which support interactive exploration and hence better understanding of large-scale SNP data by the user. PMID

  6. The analysis of APOL1 genetic variation and haplotype diversity provided by 1000 Genomes project.

    PubMed

    Peng, Ting; Wang, Li; Li, Guisen

    2017-08-11

    APOL1 gene variants have been shown to be associated with an increased risk of multiple kinds of diseases, particularly in African Americans, but not in Caucasians and Asians. In this study, we explored the single nucleotide polymorphism (SNP) and haplotype diversity of the APOL1 gene in different races using data provided by the 1000 Genomes Project. Variants of the APOL1 gene in the 1000 Genomes Project were obtained, and SNPs located in the regulatory region or coding region were selected for genetic variation analysis. A total of 2504 individuals from 26 populations were classified into four groups: African, European, Asian and Admixed populations. Tag SNPs were selected to evaluate the haplotype diversity in the four populations using HaploStats software. The APOL1 gene is surrounded by some of the most polymorphic genes in the human genome. Variation in the APOL1 gene was common, with up to 613 SNPs reported by the 1000 Genomes Project, 99 of them (16.2%) with MAF ≥ 1%. There were 79 SNPs in the URR and 92 SNPs in the 3'UTR. A total of 12 SNPs in the URR and 24 SNPs in the 3'UTR were considered common variants with MAF ≥ 1%. Notably, URR-1 was present at lower frequencies in European populations, while the other three haplotypes showed the opposite pattern. The 3'UTR presented several high-frequency variation sites in a short segment, and the differences in its haplotypes among populations were significant (P < 0.01): UTR-1 and UTR-5 presented much higher frequencies in the African population, while UTR-2, UTR-3 and UTR-4 were much lower. In the APOL1 coding region, the two higher-frequency G1 SNPs pulled down the frequency of haplotype H-1 when all populations were pooled together, and the two G1 mutations widened the diversity among the four populations (P1 = 3.33E-4 vs P2 = 3.61E-30). The distributions of APOL1 gene variants and haplotypes were significantly different among the different populations, in either regulatory or coding regions. It could provide
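
    The abstract classifies variants as common when MAF ≥ 1%. A minimal sketch of that filter is shown below, assuming each variant is summarized by its alternate-allele count out of 5008 chromosomes (2504 diploid individuals). The variant names and counts are placeholders, not APOL1 data.

```python
def minor_allele_frequency(alt_count, total_alleles):
    """Minor allele frequency from an alternate-allele count."""
    freq = alt_count / total_alleles
    return min(freq, 1.0 - freq)


def common_variants(allele_counts, total_alleles, threshold=0.01):
    """Names of variants whose MAF is at or above the threshold."""
    return [
        name
        for name, alt_count in allele_counts.items()
        if minor_allele_frequency(alt_count, total_alleles) >= threshold
    ]


# Placeholder variants; 2504 diploid individuals give 5008 chromosomes.
counts = {"snp_a": 51, "snp_b": 5000, "snp_c": 2504, "snp_d": 12}
print(common_variants(counts, total_alleles=2 * 2504))
```

    Taking the minimum of the frequency and its complement matters for variants such as the placeholder snp_b, where the alternate allele is the major allele and the reference allele is rare.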

  7. Haplotypes composed of minor frequency single nucleotide polymorphisms of the TNF gene protect from progression into sepsis: A study using the new sepsis classification.

    PubMed

    Retsas, Theodoros; Huse, Klaus; Lazaridis, Lazaros-Dimitrios; Karampela, Niki; Bauer, Michael; Platzer, Matthias; Kolonia, Virginia; Papageorgiou, Eirini; Giamarellos-Bourboulis, Evangelos J; Dimopoulos, George

    2018-02-01

    Several articles have provided conflicting results regarding the role of single nucleotide polymorphisms (SNPs) in the promoter region of the TNF gene in susceptibility to sepsis. Former articles have been based on previous definitions of sepsis. This study investigated the influence of TNF haplotypes on the development of sepsis using the new Sepsis-3 definitions. DNA was isolated from patients suffering from infection and systemic inflammatory response syndrome. Haplotyping was performed for six SNPs of TNF. The serum levels of tumour necrosis factor alpha (TNF-α) of these patients were measured using an enzyme immunosorbent assay. Patients were classified into infection and sepsis categories using the Sepsis-3 definitions. Associations between the TNF haplotypes and the clinical characteristics and serum TNF-α levels of the patients were examined. The most common TNF haplotype h1 was composed of major alleles of the studied SNPs. Carriage of haplotypes composed of minor frequency alleles was associated with a lower risk of developing sepsis (odds ratio 0.41, 95% confidence interval 0.19-0.88, p=0.022), but this did not affect the 28-day outcome. Serum TNF-α levels were significantly higher among patients homozygous for h1 haplotypes who developed sepsis compared to infection (p=0.032); a similar result was not observed for patients carrying other haplotypes. Haplotypes containing minor frequency SNP alleles of TNF protect against the development of sepsis without affecting the outcome. Copyright © 2017 The Author(s). Published by Elsevier Ltd.. All rights reserved.

  8. Genomic-assisted haplotype analysis and the development of high-throughput SNP markers for salinity tolerance in soybean

    PubMed Central

    Patil, Gunvant; Do, Tuyen; Vuong, Tri D.; Valliyodan, Babu; Lee, Jeong-Dong; Chaudhary, Juhi; Shannon, J. Grover; Nguyen, Henry T.

    2016-01-01

    Soil salinity is a limiting factor of crop yield. The soybean is sensitive to soil salinity, and a dominant gene, Glyma03g32900, is primarily responsible for salt-tolerance. The identification of high throughput and robust markers as well as the deployment of salt-tolerant cultivars are effective approaches to minimize yield loss under saline conditions. We utilized high quality (15x) whole-genome resequencing (WGRS) on 106 diverse soybean lines and identified three major structural variants and allelic variation in the promoter and genic regions of the GmCHX1 gene. The discovery of single nucleotide polymorphisms (SNPs) associated with structural variants facilitated the design of six KASPar assays. Additionally, haplotype analysis and pedigree tracking of 93 U.S. ancestral lines were performed using publicly available WGRS datasets. Identified SNP markers were validated, and a strong correlation was observed between the genotype and salt treatment phenotype (leaf scorch, chlorophyll content and Na+ accumulation) using a panel of 104 soybean lines and an interspecific bi-parental population (F8) from PI483463 x Hutcheson. These markers precisely identified salt-tolerant/sensitive genotypes (>91%), and different structural-variants (>98%). These SNP assays, supported by accurate phenotyping, haplotype analyses and pedigree tracking information, will accelerate marker-assisted selection programs to enhance the development of salt-tolerant soybean cultivars. PMID:26781337

  9. Population Structure With Localized Haplotype Clusters

    PubMed Central

    Browning, Sharon R.; Weir, Bruce S.

    2010-01-01

    We propose a multilocus version of FST and a measure of haplotype diversity using localized haplotype clusters. Specifically, we use haplotype clusters identified with BEAGLE, which is a program implementing a hidden Markov model for localized haplotype clustering and performing several functions including inference of haplotype phase. We apply this methodology to HapMap phase 3 data. With this haplotype-cluster approach, African populations have highest diversity and lowest divergence from the ancestral population, East Asian populations have lowest diversity and highest divergence, and other populations (European, Indian, and Mexican) have intermediate levels of diversity and divergence. These relationships accord with expectation based on other studies and accepted models of human history. In contrast, the population-specific FST estimates obtained directly from single-nucleotide polymorphisms (SNPs) do not reflect such expected relationships. We show that ascertainment bias of SNPs has less impact on the proposed haplotype-cluster-based FST than on the SNP-based version, which provides a potential explanation for these results. Thus, these new measures of FST and haplotype-cluster diversity provide an important new tool for population genetic analysis of high-density SNP data. PMID:20457877
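
    For orientation, the single-SNP quantity that the proposed measure generalizes can be sketched as Wright's FST computed from expected heterozygosities. This is a deliberately simple textbook form that assumes equally weighted subpopulations; it is not the multilocus haplotype-cluster estimator described in the abstract, and the frequencies are made up.

```python
def fst_biallelic(subpop_freqs):
    """Wright's FST for one biallelic SNP: (H_T - H_S) / H_T.

    subpop_freqs: allele frequency of the same allele in each subpopulation,
    all subpopulations weighted equally (a simplifying assumption).
    """
    k = len(subpop_freqs)
    p_bar = sum(subpop_freqs) / k
    h_total = 2.0 * p_bar * (1.0 - p_bar)                      # pooled
    h_within = sum(2.0 * p * (1.0 - p) for p in subpop_freqs) / k
    if h_total == 0.0:
        return 0.0   # monomorphic everywhere: no differentiation to measure
    return (h_total - h_within) / h_total


# Made-up frequencies for illustration.
print(fst_biallelic([0.5, 0.5]))   # identical subpopulations
print(fst_biallelic([0.2, 0.8]))   # moderately diverged
print(fst_biallelic([0.0, 1.0]))   # fixed for different alleles
```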

  10. Novel strategies to mine alcoholism-related haplotypes and genes by combining existing knowledge framework.

    PubMed

    Zhang, RuiJie; Li, Xia; Jiang, YongShuai; Liu, GuiYou; Li, ChuanXing; Zhang, Fan; Xiao, Yun; Gong, BinSheng

    2009-02-01

    High-throughput single nucleotide polymorphism detection technology and the existing knowledge provide strong support for mining the disease-related haplotypes and genes. In this study, first, we apply four kinds of haplotype identification methods (Confidence Intervals, Four Gamete Tests, Solid Spine of LD and fusing method of haplotype block) to high-throughput SNP genotype data to identify blocks, then use cluster analysis to verify the effectiveness of the four methods, and select the alcoholism-related SNP haplotypes through risk analysis. Second, we establish a mapping from haplotypes to alcoholism-related genes. Third, we query the NCBI SNP and gene databases to locate the blocks and identify the candidate genes. Finally, we perform gene function annotation using the KEGG, Biocarta, and GO databases. We find 159 haplotype blocks most likely related to alcoholism on chromosomes 1-22, including 227 haplotypes, of which 102 SNP haplotypes may increase the risk of alcoholism. We obtain 121 alcoholism-related genes and verify their reliability by biological functional annotation. In summary, by combining our novel strategies for mining alcoholism-related haplotypes and genes with the existing knowledge framework, we can not only handle the SNP data easily but also locate the disease-related genes precisely.
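
    Of the four block-identification methods named above, the four-gamete test is the simplest to illustrate. The sketch below assumes phased, biallelic haplotypes given as strings of 0/1 characters and splits blocks greedily wherever a SNP shows all four allele combinations with an earlier SNP in the current block. Function names and data are hypothetical, and the sketch omits the minimum gamete-frequency threshold that block-finding software typically applies.

```python
def four_gamete_test(haplotypes, i, j):
    """True if all four allele combinations occur at SNPs i and j.

    Observing all four gametes at two biallelic sites is taken as evidence
    of historical recombination (or recurrent mutation) between them.
    """
    gametes = {(h[i], h[j]) for h in haplotypes}
    return len(gametes) == 4


def four_gamete_blocks(haplotypes):
    """Greedy left-to-right partition of SNP indices into blocks.

    Returns a list of (first_index, last_index) tuples. A new block starts
    at SNP j when j fails the test against any SNP of the current block.
    """
    n_snps = len(haplotypes[0])
    blocks, start = [], 0
    for j in range(1, n_snps):
        if any(four_gamete_test(haplotypes, i, j) for i in range(start, j)):
            blocks.append((start, j - 1))
            start = j
    blocks.append((start, n_snps - 1))
    return blocks


# Hypothetical phased haplotypes over three SNPs.
haps = ["000", "001", "110", "111"]
print(four_gamete_blocks(haps))
```

    In the example, SNPs 0 and 1 show only two gametes and stay together, while SNP 2 shows all four combinations with SNP 0 and therefore opens a new block.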

  11. HLA-A*02 allele frequencies and haplotypic associations in Koreans.

    PubMed

    Park, M H; Whang, D H; Kang, S J; Han, K S

    2000-03-01

    We have investigated the frequencies of HLA-A*02 alleles and their haplotypic associations with HLA-B and -DRB1 loci in 439 healthy unrelated Koreans, including 214 parents from 107 families. All of the 227 samples (51.7%) typed as A2 by serology were analyzed for A*02 alleles using the polymerase chain reaction (PCR)-low ionic strength-single-strand conformation polymorphism (LIS-SSCP) method. A total of six different A*02 alleles were detected (A*02 allele frequency 29.6%): A*0201/9 (16.6%), *0203 (0.5%), *0206 (9.3%), *0207 (3.0%), and one case each of *0210 and an undetermined *02 type. Two characteristic haplotypes showing the strongest linkage disequilibrium were A*0203-B38-DRB1*1502 and A*0207-B46-DRB1*0803. Besides these strong associations, significant two-locus associations (P<0.001) were observed for A*0201 with B61, DRB1*0901 and DRB1*1401, and for A*0206 with B48 and B61. HLA haplotypes carrying HLA-A2 showed a variable distribution of A*02 alleles, and all of the eight most common A2-B-DR haplotypes occurring at frequencies of ≥1% were variably associated with two different A*02 alleles. These results demonstrate that substantial heterogeneity is present in the distribution of HLA-A*02 alleles and related haplotypes in Koreans.

  12. Patterns of linkage disequilibrium and haplotype distribution in disease candidate genes.

    PubMed

    Long, Ji-Rong; Zhao, Lan-Juan; Liu, Peng-Yuan; Lu, Yan; Dvornyk, Volodymyr; Shen, Hui; Liu, Yong-Jun; Zhang, Yuan-Yuan; Xiong, Dong-Hai; Xiao, Peng; Deng, Hong-Wen

    2004-05-24

    The adequacy of association studies for complex diseases depends critically on the existence of linkage disequilibrium (LD) between functional alleles and surrounding SNP markers. We examined the patterns of LD and haplotype distribution in eight candidate genes for osteoporosis and/or obesity using 31 SNPs in 1,873 subjects. These eight genes are apolipoprotein E (APOE), type I collagen alpha1 (COL1A1), estrogen receptor-alpha (ER-alpha), leptin receptor (LEPR), parathyroid hormone (PTH)/PTH-related peptide receptor type 1 (PTHR1), transforming growth factor-beta1 (TGF-beta1), uncoupling protein 3 (UCP3), and vitamin D (1,25-dihydroxyvitamin D3) receptor (VDR). Yin yang haplotypes, two high-frequency haplotypes composed of completely mismatching SNP alleles, were examined. To quantify LD patterns, two common measures of LD, D' and r2, were calculated for the SNPs within the genes. The haplotype distribution varied in the different genes. Yin yang haplotypes were observed only in PTHR1 and UCP3. D' ranged from 0.020 to 1.000 with the average of 0.475, whereas the average r2 was 0.158 (ranging from 0.000 to 0.883). A decay of LD was observed as the intermarker distance increased, however, there was a great difference in LD characteristics of different genes or even in different regions within gene. The differences in haplotype distributions and LD patterns among the genes underscore the importance of characterizing genomic regions of interest prior to association studies.
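
    The two LD measures named in the abstract, D' and r2, follow from the haplotype frequency and the two allele frequencies. A minimal sketch under the standard definitions is given below; the function name and the input frequencies are illustrative assumptions, not values from the study.

```python
def ld_measures(p_ab, p_a, p_b):
    """Return (D, D', r2) for two biallelic loci.

    p_ab: frequency of the haplotype carrying allele A and allele B
    p_a, p_b: frequencies of allele A and allele B (each strictly in 0..1)
    """
    d = p_ab - p_a * p_b
    if d >= 0:
        d_max = min(p_a * (1.0 - p_b), (1.0 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1.0 - p_a) * (1.0 - p_b))
    d_prime = abs(d) / d_max if d_max > 0 else 0.0
    r2 = d * d / (p_a * (1.0 - p_a) * p_b * (1.0 - p_b))
    return d, d_prime, r2


# Illustrative frequencies: allele B occurs only on A-bearing haplotypes.
print(ld_measures(p_ab=0.2, p_a=0.5, p_b=0.2))
```

    In this example D' is at its maximum while r2 is only 0.25, because D' is insensitive to a mismatch in allele frequencies whereas r2 is not. That difference is one reason the two measures can have very different averages over the same SNP pairs, as in the values reported above.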

  13. New HLA haplotype frequency reference standards: high-resolution and large sample typing of HLA DR-DQ haplotypes in a sample of European Americans.

    PubMed

    Klitz, W; Maiers, M; Spellman, S; Baxter-Lowe, L A; Schmeckpeper, B; Williams, T M; Fernandez-Viña, M

    2003-10-01

    A collaborative study involving a large sample of European Americans was typed for the histocompatibility loci of the HLA DR-DQ region and subjected to intensive typing validation measures in order to accurately determine haplotype composition and frequency. The resulting tables have immediate application to HLA typing and allogeneic transplantation. The loci within the DR-DQ region are especially valuable for such an undertaking because of their tight linkage and high linkage disequilibrium. The 3798 haplotypes, derived from 1899 unrelated individuals, had a total of 75 distinct DRB1-DQA1-DQB1 haplotypes. The frequency distribution of the haplotypes was right skewed with haplotypes occurring at a frequency of less than 1% numbering 59 and yet constituting less than 12% of the total sample. Given DRB1 typing, it was possible to infer the exact DQA1 and DQB1 composition of a haplotype with high confidence (>90% likelihood) in 21 of the 35 high-resolution DRB1 alleles present in the sample. Of the DRB1 alleles without high reliability for DQ haplotype inference, only *0401, *0701 and *1302 were common, the remaining 11 DRB1 alleles constituting less than 5% of the total sample. This approach failed for the 13 serologically equivalent DR alleles in which only 33% of DQ haplotypes could be reliably inferred. The 36 DQA1-DQB1 haplotypes present in the total sample conformed to the known pattern of permissible heterodimers. Four DQA1-DQB1 haplotypes, all rare, are reported here for the first time. The haplotype frequency tables are suitable as a reference standard for HLA typing of the DR and DQ loci in European Americans.

  14. Modeling coverage gaps in haplotype frequencies via Bayesian inference to improve stem cell donor selection.

    PubMed

    Louzoun, Yoram; Alter, Idan; Gragert, Loren; Albrecht, Mark; Maiers, Martin

    2018-05-01

    Regardless of sampling depth, accurate genotype imputation is limited in regions of high polymorphism which often have a heavy-tailed haplotype frequency distribution. Many rare haplotypes are thus unobserved. Statistical methods to improve imputation by extending reference haplotype distributions using linkage disequilibrium patterns that relate allele and haplotype frequencies have not yet been explored. In the field of unrelated stem cell transplantation, imputation of highly polymorphic human leukocyte antigen (HLA) genes has an important application in identifying the best-matched stem cell donor when searching large registries totaling over 28,000,000 donors worldwide. Despite these large registry sizes, a significant proportion of searched patients present novel HLA haplotypes. Supporting this observation, HLA population genetic models have indicated that many extant HLA haplotypes remain unobserved. The absent haplotypes are a significant cause of error in haplotype matching. We have applied a Bayesian inference methodology for extending haplotype frequency distributions, using a model where new haplotypes are created by recombination of observed alleles. Applications of this joint probability model offer significant improvement in frequency distribution estimates over the best existing alternative methods, as we illustrate using five-locus HLA frequency data from the National Marrow Donor Program registry. Transplant matching algorithms and disease association studies involving phasing and imputation of rare variants may benefit from this statistical inference framework.

  15. Linkage disequilibrium, SNP frequency change due to selection, and association mapping in popcorn chromosome regions containing QTLs for quality traits

    PubMed Central

    Paes, Geísa Pinheiro; Viana, José Marcelo Soriano; Silva, Fabyano Fonseca e; Mundim, Gabriel Borges

    2016-01-01

    The objectives of this study were to assess linkage disequilibrium (LD) and selection-induced changes in single nucleotide polymorphism (SNP) frequency, and to perform association mapping in popcorn chromosome regions containing quantitative trait loci (QTLs) for quality traits. Seven tropical and two temperate popcorn populations were genotyped for 96 SNPs chosen in chromosome regions containing QTLs for quality traits. The populations were phenotyped for expansion volume, 100-kernel weight, kernel sphericity, and kernel density. The LD statistics were the difference between the observed and expected haplotype frequencies (D), the proportion of D relative to the expected maximum value in the population, and the square of the correlation between the values of alleles at two loci. Association mapping was based on least squares and Bayesian approaches. In the tropical populations, D-values greater than 0.10 were observed for SNPs separated by 100-150 Mb, while most of the D-values in the temperate populations were less than 0.05. Selection for expansion volume indirectly led to increase in LD values, population differentiation, and significant changes in SNP frequency. Some associations were observed for expansion volume and the other quality traits. The candidate genes are involved with starch, storage protein, lipid, and cell wall polysaccharides synthesis. PMID:27007903

  16. Linkage disequilibrium, SNP frequency change due to selection, and association mapping in popcorn chromosome regions containing QTLs for quality traits.

    PubMed

    Paes, Geísa Pinheiro; Viana, José Marcelo Soriano; Silva, Fabyano Fonseca E; Mundim, Gabriel Borges

    2016-03-01

    The objectives of this study were to assess linkage disequilibrium (LD) and selection-induced changes in single nucleotide polymorphism (SNP) frequency, and to perform association mapping in popcorn chromosome regions containing quantitative trait loci (QTLs) for quality traits. Seven tropical and two temperate popcorn populations were genotyped for 96 SNPs chosen in chromosome regions containing QTLs for quality traits. The populations were phenotyped for expansion volume, 100-kernel weight, kernel sphericity, and kernel density. The LD statistics were the difference between the observed and expected haplotype frequencies (D), the proportion of D relative to the expected maximum value in the population, and the square of the correlation between the values of alleles at two loci. Association mapping was based on least squares and Bayesian approaches. In the tropical populations, D-values greater than 0.10 were observed for SNPs separated by 100-150 Mb, while most of the D-values in the temperate populations were less than 0.05. Selection for expansion volume indirectly led to increase in LD values, population differentiation, and significant changes in SNP frequency. Some associations were observed for expansion volume and the other quality traits. The candidate genes are involved with starch, storage protein, lipid, and cell wall polysaccharides synthesis.

  17. [Frequency distribution of HLA antigens and haplotypes in newly arrived inhabitants of Magadan].

    PubMed

    Solovenchuk, L L; Pereverzeva, V V; Nevretdinova, Z G

    1994-09-01

    Peculiarities of the frequency distribution of antigens and haplotypes of A, B, and Cw subloci of the HLA system in 924 Slavic inhabitants of Magadan are described. Significant differences in gene and haplotype frequencies between inhabitants of Magadan and those of Moscow, Odessa, Poles'e, Latvia, and England were revealed, which could not be attributed solely to the specificity of migration processes. On the basis of an analysis of gamete associations of the A and B subloci, an attempt was made to explain the specificity of the frequency distribution of HLA system alleles and haplotypes in the investigated sample from an ecological point of view.

  18. Association analysis of calpain 10 gene variants/haplotypes with gestational diabetes mellitus among Mexican women.

    PubMed

    Castro-Martínez, Anna Gabriela; Sánchez-Corona, José; Vázquez-Vargas, Adriana Patricia; García-Zapién, Alejandra Guadalupe; López-Quintero, Andres; Villalpando-Velazco, Héctor Javier; Flores-Martínez, Silvia Esperanza

    2018-02-28

    Gestational diabetes mellitus (GDM) is a metabolically complex disease with major genetic determinants. GDM has been associated with insulin resistance and dysfunction of pancreatic beta cells, so the GDM candidate genes are those that encode proteins modulating the function and secretion of insulin, such as that for calpain 10 (CAPN10). This study aimed to assess whether single nucleotide polymorphism (SNP)-43, SNP-44, SNP-63, and the indel-19 variant, and specific haplotypes of the CAPN10 gene were associated with gestational diabetes mellitus. We studied 116 patients with gestational diabetes mellitus and 83 women with normal glucose tolerance. Measurements of anthropometric and biochemical parameters were performed. SNP-43, SNP-44, and SNP-63 were identified by polymerase chain reaction (PCR)-restriction fragment length polymorphisms, while the indel-19 variant was detected by TaqMan qPCR assays. The allele, genotype, and haplotype frequencies of the four variants did not differ significantly between women with gestational diabetes mellitus and controls. However, among women with gestational diabetes mellitus, glucose levels were significantly higher in carriers of the 3R/3R genotype than in carriers of the 3R/2R genotype of the indel-19 variant (p = 0.006). In conclusion, the 3R/3R genotype of the indel-19 variant of the CAPN10 gene was associated with increased glucose levels in these Mexican women with gestational diabetes mellitus.

  19. Construction of a versatile SNP array for pyramiding useful genes of rice.

    PubMed

    Kurokawa, Yusuke; Noda, Tomonori; Yamagata, Yoshiyuki; Angeles-Shim, Rosalyn; Sunohara, Hidehiko; Uehara, Kanako; Furuta, Tomoyuki; Nagai, Keisuke; Jena, Kshirod Kumar; Yasui, Hideshi; Yoshimura, Atsushi; Ashikari, Motoyuki; Doi, Kazuyuki

    2016-01-01

    DNA marker-assisted selection (MAS) has become an indispensable component of breeding. Single nucleotide polymorphisms (SNPs) are the most frequent polymorphism in the rice genome. However, SNP markers are not readily employed in MAS because of limitations in genotyping platforms. Here the authors report a Golden Gate SNP array that targets specific genes controlling yield-related traits and biotic stress resistance in rice. As a first step, the SNP genotypes were surveyed in 31 parental varieties using the Affymetrix Rice 44K SNP microarray. The haplotype information for 16 target genes was then converted to the Golden Gate platform with 143-plex markers. Haplotypes for the 14 useful alleles are unique and can discriminate among all other varieties. The genotyping consistency between the Affymetrix microarray and the Golden Gate array was 92.8%, and the accuracy of the Golden Gate array was confirmed in 3 F2 segregating populations. The concept of haplotype-based selection using the constructed SNP array was proven. Copyright © 2015 The Authors. Published by Elsevier Ireland Ltd. All rights reserved.

  20. Hapl-o-Mat: open-source software for HLA haplotype frequency estimation from ambiguous and heterogeneous data.

    PubMed

    Schäfer, Christian; Schmidt, Alexander H; Sauter, Jürgen

    2017-05-30

    Knowledge of HLA haplotypes is helpful in many settings, such as disease association studies, population genetics, or hematopoietic stem cell transplantation. Regarding the recruitment of unrelated hematopoietic stem cell donors, HLA haplotype frequencies of specific populations are used to optimize both donor searches for individual patients and strategic donor registry planning. However, the estimation of haplotype frequencies from HLA genotyping data is challenged by the large amount of genotype data, the complex HLA nomenclature, and the heterogeneous and ambiguous nature of typing records. To meet these challenges, we have developed the open-source software Hapl-o-Mat. It estimates haplotype frequencies from population data including an arbitrary number of loci using an expectation-maximization algorithm. Its key features are the processing of different HLA typing resolutions within a given population sample and the handling of ambiguities recorded via multiple allele codes or genotype list strings. Implemented in C++, Hapl-o-Mat facilitates efficient haplotype frequency estimation from large amounts of genotype data. We demonstrate its accuracy and performance on the basis of artificial and real genotype data. Hapl-o-Mat is a versatile and efficient software for HLA haplotype frequency estimation. Its capability of processing various forms of HLA genotype data allows for a straightforward haplotype frequency estimation from typing records usually found in stem cell donor registries.
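
    To illustrate the general idea of expectation-maximization haplotype frequency estimation, here is a minimal sketch for the simplest case of two biallelic SNPs with unphased genotypes. It is not Hapl-o-Mat's implementation and does not handle HLA nomenclature, multiple allele codes or genotype list strings. The function name `em_two_snp` and the 0/1/2 genotype coding are assumptions made for this example.

```python
from collections import Counter

def em_two_snp(genotypes, n_iter=100, tol=1e-10):
    """Estimate two-SNP haplotype frequencies from unphased genotypes.

    genotypes: list of (g1, g2), each the count (0, 1 or 2) of allele '1'
    at SNP 1 and SNP 2. Inputs are assumed valid; no checking is done.
    Returns a dict mapping haplotypes (a1, a2) to estimated frequencies.
    """
    haps = [(0, 0), (0, 1), (1, 0), (1, 1)]
    freq = {h: 0.25 for h in haps}          # uniform starting point
    counts = Counter(genotypes)
    n_chrom = 2 * len(genotypes)

    for _ in range(n_iter):
        expected = {h: 0.0 for h in haps}
        for (g1, g2), n in counts.items():
            if g1 == 1 and g2 == 1:
                # E-step: the double heterozygote is the only ambiguous case.
                # It is either 11/00 (cis) or 10/01 (trans).
                w_cis = freq[(1, 1)] * freq[(0, 0)]
                w_trans = freq[(1, 0)] * freq[(0, 1)]
                total = w_cis + w_trans
                p_cis = w_cis / total if total > 0 else 0.5
                expected[(1, 1)] += n * p_cis
                expected[(0, 0)] += n * p_cis
                expected[(1, 0)] += n * (1 - p_cis)
                expected[(0, 1)] += n * (1 - p_cis)
            else:
                # At least one homozygous locus: the phase is determined.
                hap_a = ((g1 + 1) // 2, (g2 + 1) // 2)
                hap_b = (g1 // 2, g2 // 2)
                expected[hap_a] += n
                expected[hap_b] += n

        # M-step: new frequencies are expected haplotype counts per chromosome
        new_freq = {h: expected[h] / n_chrom for h in haps}
        delta = max(abs(new_freq[h] - freq[h]) for h in haps)
        freq = new_freq
        if delta < tol:
            break
    return freq
```

    For example, `em_two_snp([(2, 2)] * 4 + [(0, 0)] * 4 + [(1, 1)] * 2)` assigns the two ambiguous individuals almost entirely to the 11/00 phase, because those haplotypes are already common among the unambiguous individuals.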

  1. Estimating haplotype frequencies by combining data from large DNA pools with database information.

    PubMed

    Gasbarra, Dario; Kulathinal, Sangita; Pirinen, Matti; Sillanpää, Mikko J

    2011-01-01

    We assume that allele frequency data have been extracted from several large DNA pools, each containing genetic material of up to hundreds of sampled individuals. Our goal is to estimate the haplotype frequencies among the sampled individuals by combining the pooled allele frequency data with prior knowledge about the set of possible haplotypes. Such prior information can be obtained, for example, from a database such as HapMap. We present a Bayesian haplotyping method for pooled DNA based on a continuous approximation of the multinomial distribution. The proposed method is applicable when the sizes of the DNA pools and/or the number of considered loci exceed the limits of several earlier methods. In the example analyses, the proposed model clearly outperforms a deterministic greedy algorithm on real data from the HapMap database. With a small number of loci, the performance of the proposed method is similar to that of an EM-algorithm, which uses a multinormal approximation for the pooled allele frequencies, but which does not utilize prior information about the haplotypes. The method has been implemented using Matlab and the code is available upon request from the authors.

  2. Haplotype combination of the bovine CFL2 gene sequence variants and association with growth traits in Qinchuan cattle.

    PubMed

    Sun, Yujia; Lan, Xianyong; Lei, Chuzhao; Zhang, Chunlei; Chen, Hong

    2015-06-01

    The aim of this study was to examine the association of cofilin2 (CFL2) gene polymorphisms with growth traits in Chinese Qinchuan cattle. Three single nucleotide polymorphisms (SNPs) were identified in the bovine CFL2 gene using DNA sequencing and (forced) PCR-RFLP methods. These polymorphisms included a missense mutation (NC_007319.5: g. C 2213 G) in exon 4, one synonymous mutation (NC_007319.5: g. T 1694 A) in exon 4, and a mutation (NC_007319.5: g. G 1500 A) in intron 2, respectively. In addition, we evaluated the haplotype frequency and linkage disequilibrium coefficient of three sequence variants in 488 individuals in QC cattle. All the three SNPs in QC cattle belonged to an intermediate level of genetic diversity (0.25 < PIC < 0.5). Haplotype analysis of three SNPs showed that 8 different haplotypes were identified in all, but only 5 haplotypes were listed except for those with a frequency of <0.03. Hap4 (-GTC-) had the highest haplotype frequency (34.70%). However, for the three SNPs there were no significant associations between the 13 combined genotypes of the CFL2 gene and growth traits. LD analysis showed that the SNP T 1694 A and C 2213 G loci had a strong linkage (r² > 0.33). Association analysis indicated that SNP G 1500 A, T 1694 A and C 2213 G were significantly associated with growth traits in the QC population. The results of our study suggest that the CFL2 gene may be a strong candidate gene that affects growth traits in the QC cattle breeding program. Copyright © 2015 Elsevier B.V. All rights reserved.

  3. Following the footprints of polymorphic inversions on SNP data: from detection to association tests

    PubMed Central

    Cáceres, Alejandro; González, Juan R.

    2015-01-01

    Inversion polymorphisms have important phenotypic and evolutionary consequences in humans. Two different methodologies have been used to infer inversions from SNP dense data, enabling the use of large cohorts for their study. One approach relies on the differences in linkage disequilibrium across breakpoints; the other one captures the internal haplotype groups that tag the inversion status of chromosomes. In this article, we assessed the convergence of the two methods in the detection of 20 human inversions that have been reported in the literature. The methods converged in four inversions including inv-8p23, for which we studied its association with low-BMI in American children. Using a novel haplotype tagging method with control on inversion ancestry, we computed the frequency of inv-8p23 in two American cohorts and observed inversion haplotype admixture. Accounting for haplotype ancestry, we found that the European inverted allele in children carries a recessive risk of underweight, validated in an independent Spanish cohort (combined: OR= 2.00, P = 0.001). While the footprints of inversions on SNP data are complex, we show that systematic analyses, such as convergence of different methods and controlling for ancestry, can reveal the contribution of inversions to the ancestral composition of populations and to the heritability of human disease. PMID:25672393

  4. Imputation of microsatellite alleles from dense SNP genotypes for parentage verification across multiple Bos taurus and Bos indicus breeds

    PubMed Central

    McClure, Matthew C.; Sonstegard, Tad S.; Wiggans, George R.; Van Eenennaam, Alison L.; Weber, Kristina L.; Penedo, Cecilia T.; Berry, Donagh P.; Flynn, John; Garcia, Jose F.; Carmo, Adriana S.; Regitano, Luciana C. A.; Albuquerque, Milla; Silva, Marcos V. G. B.; Machado, Marco A.; Coffey, Mike; Moore, Kirsty; Boscher, Marie-Yvonne; Genestout, Lucie; Mazza, Raffaele; Taylor, Jeremy F.; Schnabel, Robert D.; Simpson, Barry; Marques, Elisa; McEwan, John C.; Cromie, Andrew; Coutinho, Luiz L.; Kuehn, Larry A.; Keele, John W.; Piper, Emily K.; Cook, Jim; Williams, Robert; Van Tassell, Curtis P.

    2013-01-01

    To help cattle producers transition from microsatellite (MS) to single nucleotide polymorphism (SNP) genotyping for parental verification we previously devised an effective and inexpensive method to impute MS alleles from SNP haplotypes. While the reported method was verified with only a limited data set (N = 479) from Brown Swiss, Guernsey, Holstein, and Jersey cattle, some of the MS-SNP haplotype associations were concordant across these phylogenetically diverse breeds. This implied that some haplotypes predate modern breed formation and remain in strong linkage disequilibrium. To expand the utility of MS allele imputation across breeds, MS and SNP data from more than 8000 animals representing 39 breeds (Bos taurus and B. indicus) were used to predict 9410 SNP haplotypes, incorporating an average of 73 SNPs per haplotype, for which alleles from 12 MS markers could accurately be imputed. Approximately 25% of the MS-SNP haplotypes were present in multiple breeds (N = 2 to 36 breeds). These shared haplotypes allowed for MS imputation in breeds that were not represented in the reference population with only a small increase in Mendelian inheritance inconsistencies. Our reported reference haplotypes can be used for any cattle breed and the reported methods can be applied to any species to aid the transition from MS to SNP genetic markers. While ~91% of the animals with imputed alleles for 12 MS markers had ≤1 Mendelian inheritance conflicts with their parents' reported MS genotypes, this figure was 96% for our reference animals, indicating potential errors in the reported MS genotypes. The workflow we suggest autocorrects for genotyping errors and rare haplotypes by MS genotyping animals whose imputed MS alleles fail parentage verification, and then incorporating those animals into the reference dataset. PMID:24065982

  5. Mapping of HLA-DQ haplotypes in a group of Danish patients with celiac disease.

    PubMed

    Lund, Flemming; Hermansen, Mette N; Pedersen, Merete F; Hillig, Thore; Toft-Hansen, Henrik; Sölétormos, György

    2015-10-01

    A cost-effective identification of HLA-DQ risk haplotypes using the single nucleotide polymorphism (SNP) technique has recently been applied in the diagnosis of celiac disease (CD) in four European populations. The objective of the study was to map risk HLA-DQ haplotypes in a group of Danish CD patients using the SNP technique. Cohort A: Among 65 patients with gastrointestinal symptoms we compared the HLA-DQ2 and HLA-DQ8 risk haplotypes obtained by the SNP technique (method 1) with results based on a sequence specific primer amplification technique (method 2) and a technique used in an assay from BioDiagene (method 3). Cohort B: 128 patients with histologically verified CD were tested for CD risk haplotypes (method 1). Patients with negative results were further tested for sub-haplotypes of HLA-DQ2 (methods 2 and 3). Cohort A: The three applied methods provided the same HLA-DQ2 and HLA-DQ8 results among 61 patients. Four patients were negative for the HLA-DQ2 and HLA-DQ8 haplotypes (method 1) but were positive for the HLA-DQ2.5-trans and HLA-DQ2.2 haplotypes (methods 2 and 3). Cohort B: A total of 120 patients were positive for the HLA-DQ2.5-cis and HLA-DQ8 haplotypes (method 1). The remaining seven patients were positive for HLA-DQ2.5-trans or HLA-DQ2.2 haplotypes (methods 2 and 3). One patient was negative with all three HLA methods. The HLA-DQ risk haplotypes were detected in 93.8% of the CD patients using the SNP technique (method 1). The sensitivity increased to 99.2% by combining methods 1-3.
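
    The two percentages reported for Cohort B follow directly from the patient counts in the abstract. The short sketch below reproduces that arithmetic; the helper name `sensitivity` and the variable names are illustrative only.

```python
def sensitivity(detected, total_cases):
    """Fraction of verified cases that test positive."""
    return detected / total_cases

cohort_b = 128             # patients with histologically verified CD
positive_method_1 = 120    # positive with the SNP technique alone
rescued_methods_2_3 = 7    # negative with method 1, positive with methods 2 and 3

sens_method_1 = sensitivity(positive_method_1, cohort_b)
sens_combined = sensitivity(positive_method_1 + rescued_methods_2_3, cohort_b)

# 120/128 = 0.9375 and 127/128 = 0.9921875, matching the reported
# 93.8% and 99.2% after rounding to one decimal place.
print(f"method 1: {sens_method_1:.4f}, methods 1-3: {sens_combined:.4f}")
```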

  6. The iSelect 9 K SNP analysis revealed polyploidization induced revolutionary changes and intense human selection causing strong haplotype blocks in wheat.

    PubMed

    Hao, Chenyang; Wang, Yuquan; Chao, Shiaoman; Li, Tian; Liu, Hongxia; Wang, Lanfen; Zhang, Xueyong

    2017-01-30

    A Chinese wheat mini core collection was genotyped using the wheat 9 K iSelect SNP array. A total of 2420 and 2396 polymorphic SNPs were detected on the A and the B genome chromosomes, respectively, which formed 878 haplotype blocks. There were more blocks in the B genome, but the average block size was significantly (P < 0.05) smaller than that in the A genome. Intense selection (domestication and breeding) had a stronger effect on the A than on the B genome chromosomes. Based on the genetic pedigrees, many blocks can be traced back to a well-known Strampelli cross, which was made one century ago. Furthermore, polyploidization of wheat (both tetraploidization and hexaploidization) induced revolutionary changes in both the A and the B genomes, with a greater increase of gene diversity compared to their diploid ancestors. Modern breeding has dramatically increased diversity in the gene coding regions, though obvious blocks were formed on most of the chromosomes in both tetraploid and hexaploid wheats. Tag-SNP markers identified in this study can be used for marker assisted selection using haplotype blocks as a wheat breeding strategy. This strategy can also be employed to facilitate genome selection in other self-pollinating crop species.

  7. Mapping a New Spontaneous Preterm Birth Susceptibility Gene, IGF1R, Using Linkage, Haplotype Sharing, and Association Analysis

    PubMed Central

    Luukkonen, Aino; Teramo, Kari; Puttonen, Hilkka; Ojaniemi, Marja; Varilo, Teppo; Chaudhari, Bimal P.; Plunkett, Jevon; Murray, Jeffrey C.; McCarroll, Steven A.; Muglia, Louis J.; Palotie, Aarno; Hallman, Mikko

    2011-01-01

    Preterm birth is the major cause of neonatal death and serious morbidity. Most preterm births are due to spontaneous onset of labor without a known cause or effective prevention. Both maternal and fetal genomes influence the predisposition to spontaneous preterm birth (SPTB), but the susceptibility loci remain to be defined. We utilized a combination of unique population structures, family-based linkage analysis, and subsequent case-control association to identify a susceptibility haplotype for SPTB. Clinically well-characterized SPTB families from northern Finland, a subisolate founded by a relatively small founder population that has subsequently experienced a number of bottlenecks, were selected for the initial discovery sample. Genome-wide linkage analysis using a high-density single-nucleotide polymorphism (SNP) array in seven large northern Finnish non-consanguineous families identified a locus on 15q26.3 (HLOD 4.68). This region contains the IGF1R gene, which encodes the type 1 insulin-like growth factor receptor IGF-1R. Haplotype segregation analysis revealed that a 55 kb 12-SNP core segment within the IGF1R gene was shared identical-by-state (IBS) in five families. A follow-up case-control study in an independent sample representing the more general Finnish population showed an association of a 6-SNP IGF1R haplotype with SPTB in the fetuses, providing further evidence for IGF1R as a SPTB predisposition gene (frequency in cases versus controls 0.11 versus 0.05, P = 0.001, odds ratio 2.3). This study demonstrates the identification of a predisposing, low-frequency haplotype in a multifactorial trait using a well-characterized population and a combination of family and case-control designs. Our findings support the identification of the novel susceptibility gene IGF1R for predisposition by the fetal genome to being born preterm. PMID:21304894
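
    The reported odds ratio can be approximately recovered from the two haplotype frequencies given in the abstract. The sketch below shows the allelic odds-ratio arithmetic only. It assumes the published value was not adjusted for covariates, which the abstract does not state, and the function name is illustrative.

```python
def odds_ratio_from_frequencies(freq_cases, freq_controls):
    """Odds ratio comparing carriage odds in cases versus controls."""
    odds_cases = freq_cases / (1.0 - freq_cases)
    odds_controls = freq_controls / (1.0 - freq_controls)
    return odds_cases / odds_controls

# Haplotype frequencies taken from the abstract: 0.11 in cases, 0.05 in controls
or_estimate = odds_ratio_from_frequencies(0.11, 0.05)
print(round(or_estimate, 2))  # about 2.35, consistent with the reported 2.3
```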

  8. Genome-Wide SNP Detection, Validation, and Development of an 8K SNP Array for Apple

    PubMed Central

    Chagné, David; Crowhurst, Ross N.; Troggio, Michela; Davey, Mark W.; Gilmore, Barbara; Lawley, Cindy; Vanderzande, Stijn; Hellens, Roger P.; Kumar, Satish; Cestaro, Alessandro; Velasco, Riccardo; Main, Dorrie; Rees, Jasper D.; Iezzoni, Amy; Mockler, Todd; Wilhelm, Larry; Van de Weg, Eric; Gardiner, Susan E.; Bassil, Nahla; Peace, Cameron

    2012-01-01

    As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide evaluation of allelic variation in apple (Malus×domestica) breeding germplasm. For genome-wide SNP discovery, 27 apple cultivars were chosen to represent worldwide breeding germplasm and re-sequenced at low coverage with the Illumina Genome Analyzer II. Following alignment of these sequences to the whole genome sequence of ‘Golden Delicious’, SNPs were identified using SoapSNP. A total of 2,113,120 SNPs were detected, corresponding to one SNP to every 288 bp of the genome. The Illumina GoldenGate® assay was then used to validate a subset of 144 SNPs with a range of characteristics, using a set of 160 apple accessions. This validation assay enabled fine-tuning of the final subset of SNPs for the Illumina Infinium® II system. The set of stringent filtering criteria developed allowed choice of a set of SNPs that not only exhibited an even distribution across the apple genome and a range of minor allele frequencies to ensure utility across germplasm, but also were located in putative exonic regions to maximize genotyping success rate. A total of 7867 apple SNPs was established for the IRSC apple 8K SNP array v1, of which 5554 were polymorphic after evaluation in segregating families and a germplasm collection. This publicly available genomics resource will provide an unprecedented resolution of SNP haplotypes, which will enable marker-locus-trait association discovery, description of the genetic architecture of quantitative traits, investigation of genetic variation (neutral and functional), and genomic selection in apple. PMID:22363718

  9. Single Nucleotide Polymorphism (SNP)-Strings: An Alternative Method for Assessing Genetic Associations

    PubMed Central

    Goodin, Douglas S.; Khankhanian, Pouya

    2014-01-01

    Background Genome-wide association studies (GWAS) identify disease-associations for single-nucleotide-polymorphisms (SNPs) from scattered genomic-locations. However, SNPs frequently reside on several different SNP-haplotypes, only some of which may be disease-associated. This circumstance lowers the observed odds-ratio for disease-association. Methodology/Principal Findings Here we develop a method to identify the two SNP-haplotypes, which combine to produce each person’s SNP-genotype over specified chromosomal segments. Two multiple sclerosis (MS)-associated genetic regions were modeled; DRB1 (a Class II molecule of the major histocompatibility complex) and MMEL1 (an endopeptidase that degrades both neuropeptides and β-amyloid). For each locus, we considered sets of eleven adjacent SNPs, surrounding the putative disease-associated gene and spanning ∼200 kb of DNA. The SNP-information was converted into an ordered-set of eleven-numbers (subject-vectors) based on whether a person had zero, one, or two copies of particular SNP-variant at each sequential SNP-location. SNP-strings were defined as those ordered-combinations of eleven-numbers (0 or 1), representing a haplotype, two of which combined to form the observed subject-vector. Subject-vectors were resolved using probabilistic methods. In both regions, only a small number of SNP-strings were present. We compared our method to the SHAPEIT-2 phasing-algorithm. When the SNP-information spanning 200 kb was used, SHAPEIT-2 was inaccurate. When the SHAPEIT-2 window was increased to 2,000 kb, the concordance between the two methods, in both of these eleven-SNP regions, was over 99%, suggesting that, in these regions, both methods were quite accurate. Nevertheless, correspondence was not uniformly high over the entire DNA-span but, rather, was characterized by alternating peaks and valleys of concordance. Moreover, in the valleys of poor-correspondence, SHAPEIT-2 was also inconsistent with itself, suggesting that

  10. HIGH-THROUGHPUT IDENTIFICATION OF THE PREDOMINANT MALARIA PARASITE CLONE IN COMPLEX BLOOD STAGE INFECTIONS USING A MULTI-SNP MOLECULAR HAPLOTYPING ASSAY

    PubMed Central

    COLE-TOBIAN, JENNIFER L.; ZIMMERMAN, PETER A.; KING, CHRISTOPHER L.

    2013-01-01

    Individuals living in malaria endemic areas are often infected with multiple parasite clones. Currently used single nucleotide polymorphism (SNP) genotyping methods for malaria parasites are cumbersome; furthermore, few methods currently exist that can rapidly determine the most abundant clone in these complex infections. Here we describe an oligonucleotide ligation assay (OLA) to distinguish SNPs in the Plasmodium vivax Duffy binding protein gene (Pvdbp) at 14 polymorphic residues simultaneously. Allele abundance is determined by the highest mean fluorescent intensity of each allele. Using mixtures of plasmids encoding known haplotypes of the Pvdbp, single clones of P. vivax parasites from infected Aotus monkeys, and well-defined mixed infections from field samples, we were able to identify the predominant Pvdbp genotype with > 93% accuracy when the dominant clone is twice as abundant as a lesser genotype and > 97% of the time if the ratio was 5:1 or greater. Thus, the OLA can accurately, reproducibly, and rapidly determine the predominant parasite haplotype in complex blood stage infections. PMID:17255222

  11. Probability distribution of haplotype frequencies under the two-locus Wright-Fisher model by diffusion approximation.

    PubMed

    Boitard, Simon; Loisel, Patrice

    2007-05-01

    The probability distribution of haplotype frequencies in a population, and the way it is influenced by genetic forces such as recombination, selection, random drift, ... is a question of fundamental interest in population genetics. For large populations, the distribution of haplotype frequencies for two linked loci under the classical Wright-Fisher model is almost impossible to compute for numerical reasons. However, the Wright-Fisher process can in such cases be approximated by a diffusion process, and the transition density can then be deduced from the Kolmogorov equations. As no exact solution has been found for these equations, we developed a numerical method based on finite differences to solve them. It applies to transient states and models including selection or mutations. We show by several tests that this method is accurate for computing the conditional joint density of haplotype frequencies given that no haplotype has been lost. We also prove that it is far less time consuming than other methods such as Monte Carlo simulations.
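
    For orientation, the Monte Carlo alternative mentioned at the end of this abstract can be sketched as a direct simulation of the neutral two-locus Wright-Fisher model. This is not the authors' finite-difference method. The function names, the neutral (no selection, no mutation) setting and the parameter choices are assumptions made for this example.

```python
import random
from collections import Counter

def wf_two_locus_step(freqs, n_chrom, r, rng):
    """One generation of the neutral two-locus Wright-Fisher model.

    freqs: [x_AB, x_Ab, x_aB, x_ab], haplotype frequencies summing to 1
    n_chrom: number of chromosomes sampled for the next generation
    r: recombination fraction between the two loci (0 <= r <= 0.5)
    rng: a random.Random instance
    """
    x_ab_cap, x_a_b, x_a_cap_b, x_ab = freqs
    # Linkage disequilibrium D = x_AB * x_ab - x_Ab * x_aB
    d = x_ab_cap * x_ab - x_a_b * x_a_cap_b
    # Deterministic change due to recombination (clamped against round-off)
    expected = [
        max(0.0, x_ab_cap - r * d),
        max(0.0, x_a_b + r * d),
        max(0.0, x_a_cap_b + r * d),
        max(0.0, x_ab - r * d),
    ]
    # Random drift: multinomial sampling of n_chrom chromosomes
    draws = rng.choices(range(4), weights=expected, k=n_chrom)
    counts = Counter(draws)
    return [counts[i] / n_chrom for i in range(4)]

def simulate(freqs, n_chrom, r, generations, seed=0):
    """Return the haplotype frequencies after the given number of generations."""
    rng = random.Random(seed)
    for _ in range(generations):
        freqs = wf_two_locus_step(freqs, n_chrom, r, rng)
    return freqs
```

    Repeating `simulate` over many seeds gives an empirical distribution of haplotype frequencies. The cost of doing so for large populations is the kind of computational burden the abstract contrasts with the diffusion approach.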

  12. SNP analyses of growth factor genes EGF, TGFβ-1, and HGF reveal haplotypic association of EGF with autism

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Toyoda, Takao; Thanseem, Ismail; Kawai, Masayoshi

    Autism is a pervasive neurodevelopmental disorder diagnosed in early childhood. Growth factors have been found to play a key role in the cellular differentiation and proliferation of the central and peripheral nervous systems. Epidermal growth factor (EGF) is detected in several regions of the developing and adult brain, where it enhances the differentiation, maturation, and survival of a variety of neurons. Transforming growth factor-β (TGFβ) isoforms play an important role in neuronal survival, and the hepatocyte growth factor (HGF) has been shown to exhibit neurotrophic activity. We examined the association of EGF, TGFβ1, and HGF genes with autism, in a trio association study, using DNA samples from families recruited to the Autism Genetic Resource Exchange; 252 trios with a male offspring scored for autism were selected for the study. Transmission disequilibrium test revealed significant haplotypic association of EGF with autism. No significant SNP or haplotypic associations were observed for TGFβ1 or HGF. Given the role of EGF in brain and neuronal development, we suggest a possible role of EGF in the pathogenesis of autism.
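
    As background for the transmission disequilibrium test (TDT) named in this abstract, here is a minimal sketch of the classical single-marker, biallelic TDT statistic. The study's haplotypic TDT is more involved and is not reproduced here. The function name and the example counts are hypothetical.

```python
import math

def tdt(transmitted, untransmitted):
    """Classical TDT for one allele among heterozygous parents.

    transmitted: times heterozygous parents passed the allele to the affected child
    untransmitted: times heterozygous parents did not pass it on
    Returns (chi-square statistic with 1 degree of freedom, p-value).
    """
    total = transmitted + untransmitted
    if total == 0:
        raise ValueError("no informative (heterozygous) parents")
    chi2 = (transmitted - untransmitted) ** 2 / total
    # Upper tail of a chi-square with 1 df: P(|Z| > sqrt(chi2))
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value

# Hypothetical counts, not taken from the study
chi2, p = tdt(60, 40)
```

    Under the null hypothesis of no linkage or association, each heterozygous parent transmits the allele with probability 0.5, so a large imbalance between the two counts yields a large statistic.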

  13. Single nucleotide polymorphisms and haplotypes associated with feed efficiency in beef cattle

    PubMed Central

    2013-01-01

    Background General, breed- and diet-dependent associations between feed efficiency in beef cattle and single nucleotide polymorphisms (SNPs) or haplotypes were identified in a population of 1321 steers using a 50 K SNP panel. Genomic associations with traditional two-step indicators of feed efficiency – residual feed intake (RFI), residual average daily gain (RADG), and residual intake gain (RIG) – were compared to associations with two complementary one-step indicators of feed efficiency: efficiency of intake (EI) and efficiency of gain (EG). Associations uncovered in a training data set were evaluated on an independent validation data set. A multi-SNP model was developed to predict feed efficiency. Functional analysis of genes harboring SNPs significantly associated with feed efficiency and network visualization aided in the interpretation of the results. Results For the five feed efficiency indicators, the numbers of general, breed-dependent, and diet-dependent associations with SNPs (P-value < 0.0001) were 31, 40, and 25, and with haplotypes were six, ten, and nine, respectively. Of these, 20 SNP and six haplotype associations overlapped between RFI and EI, and five SNP and one haplotype associations overlapped between RADG and EG. This result confirms the complementary value of the one- and two-step indicators. The multi-SNP models included 89 SNPs and offered a precise prediction of the five feed efficiency indicators. The associations of 17 SNPs and 7 haplotypes with feed efficiency were confirmed on the validation data set. Nine clusters of Gene Ontology and KEGG pathway categories (mean P-value < 0.001), including nucleotide binding, ion transport, phosphorous metabolic process, and the MAPK signaling pathway, were overrepresented among the genes harboring the SNPs associated with feed efficiency. Conclusions The general SNP associations suggest that a single panel of genomic variants can be used regardless of breed and diet. The breed- and diet

  14. Variation analysis and gene annotation of eight MHC haplotypes: The MHC Haplotype Project

    PubMed Central

    Horton, Roger; Gibson, Richard; Coggill, Penny; Miretti, Marcos; Allcock, Richard J.; Almeida, Jeff; Forbes, Simon; Gilbert, James G. R.; Halls, Karen; Harrow, Jennifer L.; Hart, Elizabeth; Howe, Kevin; Jackson, David K.; Palmer, Sophie; Roberts, Anne N.; Sims, Sarah; Stewart, C. Andrew; Traherne, James A.; Trevanion, Steve; Wilming, Laurens; Rogers, Jane; de Jong, Pieter J.; Elliott, John F.; Sawcer, Stephen; Todd, John A.; Trowsdale, John

    2008-01-01

    The human major histocompatibility complex (MHC) is contained within about 4 Mb on the short arm of chromosome 6 and is recognised as the most variable region in the human genome. The primary aim of the MHC Haplotype Project was to provide a comprehensively annotated reference sequence of a single, human leukocyte antigen-homozygous MHC haplotype and to use it as a basis against which variations could be assessed from seven other similarly homozygous cell lines, representative of the most common MHC haplotypes in the European population. Comparison of the haplotype sequences, including four haplotypes not previously analysed, resulted in the identification of >44,000 variations, both substitutions and indels (insertions and deletions), which have been submitted to the dbSNP database. The gene annotation uncovered haplotype-specific differences and confirmed the presence of more than 300 loci, including over 160 protein-coding genes. Combined analysis of the variation and annotation datasets revealed 122 gene loci with coding substitutions of which 97 were non-synonymous. The haplotype (A3-B7-DR15; PGF cell line) designated as the new MHC reference sequence, has been incorporated into the human genome assembly (NCBI35 and subsequent builds), and constitutes the largest single-haplotype sequence of the human genome to date. The extensive variation and annotation data derived from the analysis of seven further haplotypes have been made publicly available and provide a framework and resource for future association studies of all MHC-associated diseases and transplant medicine. PMID:18193213

  15. Congruence as a measurement of extended haplotype structure across the genome

    PubMed Central

    2012-01-01

    Background Historically, extended haplotypes have been defined using only a few data points, such as alleles for several HLA genes in the MHC. High-density SNP data, and the increasing affordability of whole genome SNP typing, creates the opportunity to define higher resolution extended haplotypes. This drives the need for new tools that support quantification and visualization of extended haplotypes as defined by as many as 2000 SNPs. Confronted with high-density SNP data across the major histocompatibility complex (MHC) for 2,300 complete families, compiled by the Type 1 Diabetes Genetics Consortium (T1DGC), we developed software for studying extended haplotypes. Methods The software, called ExHap (Extended Haplotype), uses a similarity measurement we term congruence to identify and quantify long-range allele identity. Using ExHap, we analyzed congruence in both the T1DGC data and family-phased data from the International HapMap Project. Results Congruent chromosomes from the T1DGC data have between 96.5% and 99.9% allele identity over 1,818 SNPs spanning 2.64 megabases of the MHC (HLA-DRB1 to HLA-A). Thirty-three of 132 DQ-DR-B-A defined haplotype groups have > 50% congruent chromosomes in this region. For example, 92% of chromosomes within the DR3-B8-A1 haplotype are congruent from HLA-DRB1 to HLA-A (99.8% allele identity). We also applied ExHap to all 22 autosomes for both CEU and YRI cohorts from the International HapMap Project, identifying multiple candidate extended haplotypes. Conclusions Long-range congruence is not unique to the MHC region. Patterns of allele identity on phased chromosomes provide a simple, straightforward approach to visually and quantitatively inspect complex long-range structural patterns in the genome. Such patterns aid the biologist in appreciating genetic similarities and differences across cohorts, and can lead to hypothesis generation for subsequent studies. PMID:22369243
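
    The abstract reports allele identity between phased chromosomes but does not give the exact definition of congruence used by ExHap. The sketch below therefore only illustrates a plain allele-identity fraction. The function names, the handling of missing calls and the 0.965 cut-off (borrowed from the lower end of the reported 96.5% to 99.9% range) are assumptions, not the ExHap algorithm.

```python
def allele_identity(chrom_a, chrom_b):
    """Fraction of SNP positions at which two phased chromosomes carry the same allele.

    chrom_a, chrom_b: equal-length sequences of alleles; None marks a missing call.
    Positions with a missing call on either chromosome are skipped.
    """
    if len(chrom_a) != len(chrom_b):
        raise ValueError("chromosomes must cover the same SNP positions")
    compared = 0
    matches = 0
    for a, b in zip(chrom_a, chrom_b):
        if a is None or b is None:
            continue
        compared += 1
        if a == b:
            matches += 1
    return matches / compared if compared else float("nan")

def looks_congruent(chrom_a, chrom_b, threshold=0.965):
    """Hypothetical rule: call two chromosomes congruent above an identity cut-off."""
    return allele_identity(chrom_a, chrom_b) >= threshold
```

    For instance, `allele_identity(list("ACGT"), list("ACGA"))` compares four positions and finds three matches, giving 0.75.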

  16. MADD-FOLH1 Polymorphisms and Their Haplotypes with Serum Lipid Levels and the Risk of Coronary Heart Disease and Ischemic Stroke in a Chinese Han Population.

    PubMed

    Wu, Dong-Feng; Yin, Rui-Xing; Cao, Xiao-Li; Huang, Feng; Wu, Jin-Zhen; Chen, Wu-Xian

    2016-04-08

    This study aimed to detect the association of the MADD-FOLH1 single nucleotide polymorphisms (SNPs) and their haplotypes with the risk of coronary heart disease (CHD) and ischemic stroke (IS) in a Chinese Han population. Six SNPs of rs7395662, rs326214, rs326217, rs1051006, rs3736101, and rs7120118 were genotyped in 584 CHD and 555 IS patients, and 596 healthy controls. The genotypic and allelic frequencies of the rs7395662 SNP were different between controls and patients, and the genotypes of the rs7395662 SNP were associated with the risk of CHD and IS in different genetic models. Six main haplotypes among the rs1051006, rs326214, rs326217, rs3736101, and rs7120118 SNPs were detected in our study population, the haplotypes of G-G-T-G-C and G-A-T-G-T were associated with an increased risk of CHD and IS, respectively. The subjects with rs7395662GG genotype in controls had higher triglyceride (TG) and lower high-density lipoprotein cholesterol (HDL-C) levels than the subjects with AA/AG genotypes. Several SNPs interacted with alcohol consumption to influence serum TG (rs326214, rs326217, and rs7120118) and HDL-C (rs7395662) levels. The SNP of rs3736101 interacted with cigarette smoking to modify serum HDL-C levels. The SNP of rs1051006 interacted with body mass index ≥24 kg/m² to modulate serum low-density lipoprotein cholesterol levels. The interactions of several haplotypes and alcohol consumption on the risk of CHD and IS were also observed.

  17. Haplotype analysis of the apolipoprotein gene cluster on human chromosome 11

    PubMed Central

    Olivier, Michael; Wang, Xujing; Cole, Regina; Gau, Brian; Kim, Jessica; Rubin, Edward M.; Pennacchio, Len A.

    2009-01-01

    Members of the apolipoprotein gene cluster (APOA1/C3/A4/A5) on human chromosome 11q23 play an important role in lipid metabolism. Polymorphisms in both APOA5 and APOC3 are strongly associated with plasma triglyceride concentrations. The close genomic locations of these two genes as well as their functional similarity have hindered efforts to define whether each gene independently influences human triglyceride concentrations. In this study, we examined the linkage disequilibrium and haplotype structure of 49 SNPs in a 150-kb region spanning the gene cluster. We identified a total of five common APOA5 haplotypes with a frequency of greater than 8% in samples of northern European origin. The APOA5 haplotype block did not extend past the 7 SNPs in the gene and was separated from the other apolipoprotein gene in the cluster by a region of significantly increased recombination. Furthermore, one previously identified triglyceride risk haplotype of APOA5 (APOA5*3) showed no association with three APOC3 SNPs previously associated with triglyceride concentrations, in contrast to the other risk haplotype (APOA5*2), which was associated with all three minor APOC3 SNP alleles. These results highlight the complex genetic relationship between APOA5 and APOC3 and support the notion that APOA5 represents an independent risk gene affecting plasma triglyceride concentrations in humans. PMID:15081120

  18. The discrete Laplace exponential family and estimation of Y-STR haplotype frequencies.

    PubMed

    Andersen, Mikkel Meyer; Eriksen, Poul Svante; Morling, Niels

    2013-07-21

    Estimating haplotype frequencies is important in e.g. forensic genetics, where the frequencies are needed to calculate the likelihood ratio for the evidential weight of a DNA profile found at a crime scene. Estimation is naturally based on a population model, motivating the investigation of the Fisher-Wright model of evolution for haploid lineage DNA markers. An exponential family (a class of probability distributions that is well understood in probability theory such that inference is easily made by using existing software) called the 'discrete Laplace distribution' is described. We illustrate how well the discrete Laplace distribution approximates a more complicated distribution that arises by investigating the well-known population genetic Fisher-Wright model of evolution by a single-step mutation process. It was shown how the discrete Laplace distribution can be used to estimate haplotype frequencies for haploid lineage DNA markers (such as Y-chromosomal short tandem repeats), which in turn can be used to assess the evidential weight of a DNA profile found at a crime scene. This was done by making inference in a mixture of multivariate, marginally independent, discrete Laplace distributions using the EM algorithm to estimate the probabilities of membership of a set of unobserved subpopulations. The discrete Laplace distribution can be used to estimate haplotype frequencies with lower prediction error than other existing estimators. Furthermore, the calculations could be performed on a normal computer. This method was implemented in the freely available open source software R that is supported on Linux, MacOS and MS Windows. Copyright © 2013 Elsevier Ltd. All rights reserved.
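
    The mixture model described in this abstract can be sketched as follows, assuming the usual parameterization of the discrete Laplace distribution, P(X = x) = (1 - p) / (1 + p) * p^|x - mu| with 0 < p < 1 and integer location mu. The function names and all parameter values are hypothetical, and the EM fitting step is omitted; the sketch only evaluates a haplotype probability for given parameters.

```python
import math


def discrete_laplace_pmf(x, p, mu):
    """P(X = x) for a discrete Laplace variable with dispersion p and location mu."""
    if not 0 < p < 1:
        raise ValueError("p must lie strictly between 0 and 1")
    return (1 - p) / (1 + p) * p ** abs(x - mu)


def haplotype_probability(haplotype, weights, centers, dispersions):
    """Probability of a Y-STR haplotype under a mixture of subpopulations.

    Within each subpopulation the loci are treated as independent, so the
    per-locus probabilities are multiplied; the subpopulation terms are then
    averaged using the mixture weights.
    """
    total = 0.0
    for weight, mu_vec, p_vec in zip(weights, centers, dispersions):
        per_locus = (
            discrete_laplace_pmf(x, p, mu)
            for x, p, mu in zip(haplotype, p_vec, mu_vec)
        )
        total += weight * math.prod(per_locus)
    return total


# Hypothetical two-subpopulation model over three loci (repeat counts).
weights = [0.6, 0.4]
centers = [(14, 13, 29), (15, 12, 30)]
dispersions = [(0.3, 0.2, 0.25), (0.35, 0.3, 0.2)]
prob = haplotype_probability((14, 13, 30), weights, centers, dispersions)
print(f"estimated haplotype frequency: {prob:.5f}")
```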

  19. Identification and genetic effect of haplotype in the bovine BMP7 gene.

    PubMed

    Huang, Yong-Zhen; Wang, Xin-Lei; He, Hua; Lan, Xian-Yong; Lei, Chu-Zhao; Zhang, Chun-Lei; Chen, Hong

    2013-12-15

    Bone morphogenetic proteins (BMPs) are peptide growth factors belonging to the transforming growth factor-beta (TGF-β) superfamily, and some members of the BMP family support white adipocyte differentiation. In this study, we focused on BMP7, which singularly promotes the differentiation of brown preadipocytes. Haplotypes involving 5 single nucleotide polymorphism (SNP) sites in the bovine BMP7 gene were identified and their effect on body weight was analyzed. 16 haplotypes and 18 combined haplotypes were revealed and the linkage disequilibrium was assessed in a cattle population of 602 individuals representing three main cattle breeds from China. The results showed that haplotypes 3, 10 and 14 were predominant and accounted for 75.64%, 69.85%, and 83.36% in Nanyang, Qinchuan and Jiaxian cattle breeds, respectively. The statistical analyses indicated that SNPs 1, 4, and 5 are associated with body weight, body length, and heart girth at 12 and 24 months in the Nanyang cattle population (P<0.05), whereas no significant association was found for the 16 haplotypes and 18 combined haplotypes. Our results provide evidence that some SNPs and haplotypes in BMP7 are associated with growth traits, and may be utilized as genetic markers in marker-assisted selection for beef cattle breeding programs. Copyright © 2013. Published by Elsevier B.V.

  20. APC Yin-Yang haplotype associated with colorectal cancer risk

    PubMed Central

    GARRE, P.; DE LA HOYA, M.; INIESTA, P.; ROMERA, A.; LLOVET, P.; GONZALEZ, S.; PEREZ-SEGURA, P.; CAPELLA, G.; DIAZ-RUBIO, E.; CALDES, T.

    2010-01-01

    The Yin-Yang haplotype is defined as two mismatched haplotypes (Yin and Yang) representing the majority of the existing haplotypes in a particular genomic region. The human adenomatous polyposis coli (APC) gene shows a Yin-Yang haplotype pattern accounting for 84% of all of the haplotypes existing in the Spanish population. Several association studies have been published regarding APC gene variants (SNPs and haplotypes) and colorectal cancer (CRC) risk. However, no studies concerning diplotype structure and CRC risk have been conducted. The aim of the present study was to investigate whether the APC Yin-Yang homozygote diplotype is over-represented in patients with sporadic CRC when compared to its distribution in controls, and its association with CRC risk. TaqMan® assays were used to genotype three tagSNPs selected across the APC Yin-Yang region. Frequencies of the APC Yin-Yang tagSNP alleles, haplotype and diplotype of 378 CRC cases and 642 controls were compared. Two Spanish CRC group samples were included [Hospital Clínico San Carlos in Madrid (HCSC) and Instituto Catalán de Oncología in Barcelona (ICO)]. Analysis of 157 consecutive CRC patients and 405 control subjects from HCSC showed a significant effect on the risk of CRC (OR=1.93; 95% CI 1.32–2.81; P=0.001). However, this effect was not confirmed in 221 CRC patients and 237 control subjects from ICO (OR=0.89; 95% CI 0.61–1.28; P=0.521). We found a significant association between the APC homozygote Yin-Yang diplotype and the risk of colorectal cancer in the HCSC samples. However, we did not observe this association in the ICO samples. These observations suggest that a study with a larger Spanish cohort is necessary to confirm the effects of the APC Yin-Yang diplotype on the risk of CRC. PMID:22993613

  1. APC Yin-Yang haplotype associated with colorectal cancer risk.

    PubMed

    Garre, P; de la Hoya, M; Iniesta, P; Romera, A; Llovet, P; Gonzalez, S; Perez-Segura, P; Capella, G; Diaz-Rubio, E; Caldes, T

    2010-09-01

    The Yin-Yang haplotype is defined as two mismatched haplotypes (Yin and Yang) representing the majority of the existing haplotypes in a particular genomic region. The human adenomatous polyposis coli (APC) gene shows a Yin-Yang haplotype pattern accounting for 84% of all of the haplotypes existing in the Spanish population. Several association studies have been published regarding APC gene variants (SNPs and haplotypes) and colorectal cancer (CRC) risk. However, no studies concerning diplotype structure and CRC risk have been conducted. The aim of the present study was to investigate whether the APC Yin-Yang homozygote diplotype is over-represented in patients with sporadic CRC when compared to its distribution in controls, and its association with CRC risk. TaqMan® assays were used to genotype three tagSNPs selected across the APC Yin-Yang region. Frequencies of the APC Yin-Yang tagSNP alleles, haplotype and diplotype of 378 CRC cases and 642 controls were compared. Two Spanish CRC group samples were included [Hospital Clínico San Carlos in Madrid (HCSC) and Instituto Catalán de Oncología in Barcelona (ICO)]. Analysis of 157 consecutive CRC patients and 405 control subjects from HCSC showed a significant effect on the risk of CRC (OR=1.93; 95% CI 1.32-2.81; P=0.001). However, this effect was not confirmed in 221 CRC patients and 237 control subjects from ICO (OR=0.89; 95% CI 0.61-1.28; P=0.521). We found a significant association between the APC homozygote Yin-Yang diplotype and the risk of colorectal cancer in the HCSC samples. However, we did not observe this association in the ICO samples. These observations suggest that a study with a larger Spanish cohort is necessary to confirm the effects of the APC Yin-Yang diplotype on the risk of CRC.
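
    Odds ratios with 95% confidence intervals, as reported in this abstract, are commonly computed from a 2x2 table of carriers and non-carriers among cases and controls. The sketch below uses the standard log-odds (Woolf) interval; the abstract does not state which method the authors used, and the counts are hypothetical because the diplotype counts are not given.

```python
import math


def odds_ratio_ci(carrier_cases, other_cases, carrier_controls, other_controls, z=1.96):
    """Odds ratio and Woolf (log-based) confidence interval from a 2x2 table."""
    a, b, c, d = carrier_cases, other_cases, carrier_controls, other_controls
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cells must be positive for this estimator")
    estimate = (a * d) / (b * c)
    # Standard error of log(OR) is the root of the summed reciprocal counts.
    se_log = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(estimate) - z * se_log)
    upper = math.exp(math.log(estimate) + z * se_log)
    return estimate, lower, upper


# Hypothetical counts: 20 of 100 cases and 10 of 100 controls carry the diplotype.
estimate, lower, upper = odds_ratio_ci(20, 80, 10, 90)
print(f"OR={estimate:.2f}; 95% CI {lower:.2f}-{upper:.2f}")
```

    An interval that excludes 1, as in the HCSC sample, indicates a nominally significant association; an interval that spans 1, as in the ICO sample, does not.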

  2. Haplotype Frequency Distribution in Northeastern European Saduria entomon (Crustacea: Isopoda) Populations. A Phylogeographic Approach

    NASA Astrophysics Data System (ADS)

    Sell, Jerzy

    2003-11-01

    The distribution pattern of mtDNA haplotypes in distinct populations of the glacial relict crustacean Saduria entomon was examined to assess phylogeographic relationships among them. Populations from the Baltic, the White Sea and the Barents Sea were screened for mtDNA variation using PCR-based RFLP analysis of a 1150 bp fragment containing part of the CO I and CO II genes. Five mtDNA haplotypes were recorded. An analysis of geographical heterogeneity in haplotype frequency distributions revealed significant differences among populations. The isolated populations of S. entomon have diverged since the retreat of the last glaciation. The geographical pattern of variation is most likely the result of stochastic (founder effect, genetic drift) mechanisms and suggests that the haplotype differentiation observed is probably older than the isolation of the Baltic and Arctic seas.

  3. [Association between the methylenetetrahydrofolate reductase gene polymorphisms and haplotype with toxicity response of high dose methotrexate chemotherapy].

    PubMed

    Liao, Qing-Chuan; Li, Xiao-Lei; Liu, Si-Ting; Zhang, Yong; Li, Tian-Yuan; Qiu, Jin-Chun

    2012-07-01

    To investigate the association between single nucleotide polymorphisms (SNPs) of the methylenetetrahydrofolate reductase (MTHFR) gene, and their haplotypes, with high dose methotrexate (HDMTX)-induced toxicity in children with acute lymphoblastic leukemia (ALL). HDMTX-treated children with ALL (1.2 to 14 years old) were selected from inpatients and followed for a retrospective study. The toxicity response of HDMTX chemotherapy was evaluated using WHO common toxicity criteria. Sixty-one patients with therapy-related toxicity and 36 patients without therapy-related toxicity were genotyped for 2 SNPs (677C > T and 1298A > C) of the MTHFR gene by polymerase chain reaction-restriction fragment length polymorphism. Haplotype frequencies and linkage disequilibrium of the MTHFR gene were analyzed with the SHEsis program. The distribution of the MTHFR 677C > T polymorphism did not differ between groups with and without toxicity response (χ² = 4.609, P = 0.100), but that of the 1298A > C polymorphism differed significantly (χ² = 10.192, P = 0.006). Individuals who carried the C allele (AC + CC genotype) had a decreased risk of toxicity response compared to the AA genotype (OR = 0.245, 95% CI: 0.099 - 0.607, P = 0.002). The 677C > T and 1298A > C polymorphisms showed strong linkage disequilibrium (D' = 0.895). The CC haplotype was significantly associated with decreased risk of toxicity response (OR = 0.338, 95% CI: 0.155 - 0.738, P = 0.005), while the TA haplotype was significantly associated with increased risk of toxicity response (OR = 1.907, 95% CI: 1.045 - 3.482, P = 0.035). The MTHFR 1298C allele and CC haplotype might serve as protective factors, and the TA haplotype as a risk factor, for susceptibility to toxicity response of HDMTX chemotherapy in children with ALL.
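
    The D' statistic quoted in this abstract is the linkage disequilibrium coefficient D scaled by its maximum attainable magnitude given the allele frequencies. A minimal sketch for two biallelic loci follows; it uses the textbook definition, not the SHEsis implementation, and the haplotype frequencies are hypothetical.

```python
def d_prime(p_AB, p_Ab, p_aB, p_ab):
    """Normalized linkage disequilibrium D' from four haplotype frequencies.

    The sign of the result follows the sign of D; many reports quote |D'|.
    """
    total = p_AB + p_Ab + p_aB + p_ab
    if abs(total - 1.0) > 1e-9:
        raise ValueError("haplotype frequencies must sum to 1")
    p_A = p_AB + p_Ab  # frequency of allele A at the first locus
    p_B = p_AB + p_aB  # frequency of allele B at the second locus
    d = p_AB - p_A * p_B
    if d >= 0:
        d_max = min(p_A * (1 - p_B), (1 - p_A) * p_B)
    else:
        d_max = min(p_A * p_B, (1 - p_A) * (1 - p_B))
    if d_max == 0:
        raise ValueError("D' is undefined when a locus is monomorphic")
    return d / d_max


# Hypothetical frequencies in which haplotype Ab is never observed.
value = d_prime(0.5, 0.0, 0.1, 0.4)
print(f"D' = {value:.3f}")
```

    When one of the four haplotypes is absent, as in this example, D' reaches its maximum magnitude of 1.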

  4. Tag SNP selection via a genetic algorithm.

    PubMed

    Mahdevar, Ghasem; Zahiri, Javad; Sadeghi, Mehdi; Nowzari-Dalini, Abbas; Ahrabian, Hayedeh

    2010-10-01

    Single Nucleotide Polymorphisms (SNPs) provide valuable information on human evolutionary history and may lead us to identify genetic variants responsible for human complex diseases. Unfortunately, molecular haplotyping methods are costly, laborious, and time consuming; therefore, computational methods that reconstruct full haplotype patterns from a small amount of available data, known as the tag SNP selection problem, are convenient and attractive. This problem has been proved to be NP-hard, so heuristic methods may be useful. In this paper we present a heuristic method based on a genetic algorithm to find a reasonable solution within acceptable time. The algorithm was tested on a variety of simulated and experimental data. In comparison with the exact algorithm, which is based on a brute-force approach, results show that our method can obtain optimal solutions in almost all cases and runs much faster than the exact algorithm when the number of SNP sites is large. Our software is available upon request to the corresponding author.
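
    The brute-force baseline mentioned in this abstract can be sketched under one common formulation of the problem: find the smallest set of SNP sites whose alleles still distinguish every distinct haplotype. The abstract does not state the authors' exact objective, so this formulation, the function name and the toy haplotypes are assumptions, and the sketch is not the genetic algorithm itself.

```python
from itertools import combinations


def smallest_tag_set(haplotypes):
    """Exhaustively search for the fewest SNP sites that separate all haplotypes.

    The number of candidate subsets grows exponentially with the number of
    sites, which is why heuristics are attractive for large SNP panels.
    """
    n_sites = len(haplotypes[0])
    if any(len(h) != n_sites for h in haplotypes):
        raise ValueError("all haplotypes must cover the same SNP sites")
    distinct = set(haplotypes)
    for size in range(1, n_sites + 1):
        for sites in combinations(range(n_sites), size):
            projected = {tuple(h[i] for i in sites) for h in distinct}
            if len(projected) == len(distinct):
                return sites  # first hit is minimal because sizes increase
    return tuple(range(n_sites))


# Hypothetical haplotypes over three biallelic SNPs (0/1 coded alleles).
tags = smallest_tag_set(["000", "011", "101", "110"])
print("tag SNP sites:", tags)
```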

  5. Reducing animal sequencing redundancy by preferentially selecting animals with low-frequency haplotypes

    USDA-ARS's Scientific Manuscript database

    Many studies leverage targeted whole genome sequencing (WGS) experiments in order to identify rare and causal variants within populations. As a natural consequence of experimental design, many of these surveys tend to sequence redundant haplotype segments due to high frequency in the base population...

  6. No association between polymorphisms and haplotypes of COL1A1 and COL1A2 genes and osteoporotic fracture in postmenopausal Chinese women

    PubMed Central

    Hu, Wei-wei; He, Jin-wei; Zhang, Hao; Wang, Chun; Gu, Jie-mei; Yue, Hua; Ke, Yao-hua; Hu, Yun-qiu; Fu, Wen-zhen; Li, Miao; Liu, Yu-juan; Zhang, Zhen-lin

    2011-01-01

    Aim: To study whether genetic polymorphisms of COL1A1 and COL1A2 genes affected the onset of fracture in postmenopausal Chinese women. Methods: SNPs in COL1A1 and COL1A2 genes were identified via direct sequencing in 32 unrelated postmenopausal Chinese women. Ten SNPs were genotyped in 1252 postmenopausal Chinese women. The associations were examined using both single-SNP and haplotype tests using logistic regression. Results: Twenty four (4 novel) and 28 (7 novel) SNPs were identified in the COL1A1 and COL1A2 genes, respectively. The distribution frequencies of 2 SNPs in COL1A1 (rs2075554 and rs2586494) and 3 SNPs in COL1A2 (rs42517, rs1801182, and rs42524) were significantly different from those documented for the European Caucasian population. No significant difference was observed between fracture and control groups with respect to allele frequency or genotype distribution in 9 selected SNPs and haplotype. No significant association was found between fragility fracture and each SNP or haplotype. The results remained the same after additional corrections for other risk factors such as weight, height, and bone mineral density. Conclusion: Our results show no association between common genetic variations of COL1A1 and COL1A2 genes and fracture, suggesting the complex genetic background of osteoporotic fractures. PMID:21602843

  7. Human leukocyte antigen alleles, genotypes and haplotypes frequencies in renal transplant donors and recipients from West Central India.

    PubMed

    Patel, Jaina S; Patel, Manisha M; Koringa, Prakash G; Shah, Tejas M; Patel, Amrutlal K; Tripathi, Ajai K; Mathew, Anila; Rajapurkar, Mohan M; Joshi, Chaitanya G

    2013-04-01

    Human leukocyte antigen (HLA) is comprised of a highly polymorphic set of genes which determines the histocompatibility of organ transplantation. The present study was undertaken to identify HLA class I and class II allele, genotype and haplotype frequencies in renal transplant recipients and donors from West Central India. HLA typing was carried out using Polymerase Chain Reaction-Sequence Specific Primer in 552 live related and unrelated renal transplant recipients and donors. The most frequent HLA class I and class II alleles and their frequencies in recipients were HLA-A*01 (0.1685) and A*02 (0.1649), HLA-B*35 (0.1322), and HLA-DR beta 1 (DRB1)*15 (0.2192), whereas in donors, these were HLA-A*02 (0.1848) and A*01 (0.1667), HLA-B*35 (0.1359), and HLA-DRB1*15 (0.2409). The two-locus haplotype statistical analysis revealed HLA-A*02-B61 as the most common haplotype with the frequency of 0.0487 and 0.0510 in recipients and donors, respectively. Further, among the three-locus haplotypes, HLA-A*33-B*44-DRB1*07 and HLA-A*02-B*61-DRB1*15 were the most common haplotypes with frequencies 0.0362 and 0.0326, respectively in recipients and 0.0236 and 0.0323, respectively in donors. Genotype frequency revealed a higher prevalence of genotype HLA-A*02/A*24 in recipients (0.058) than in donors (0.0109) and a lower prevalence of HLA-A*01/A*02 in recipients (0.0435) than in donors (0.0797). The phylogenetic and principal component analysis of HLA allele and haplotype frequency distribution revealed genetic similarities of various ethnic groups. Further, case control analysis provides preliminary evidence of association of HLA-A genotype (P < 0.05) with renal failure. This study will be helpful in suitable donor search besides providing valuable information for population genetics and HLA disease association analysis.

  8. HLA Type Inference via Haplotypes Identical by Descent

    NASA Astrophysics Data System (ADS)

    Setty, Manu N.; Gusev, Alexander; Pe'Er, Itsik

    The Human Leukocyte Antigen (HLA) genes play a major role in adaptive immune response and are used to differentiate self antigens from non-self ones. HLA genes are hypervariable, with nearly every locus harboring over a dozen alleles. This variation plays an important role in susceptibility to multiple autoimmune diseases and needs to be matched on for organ transplantation. Unfortunately, HLA typing by serological methods is time consuming and expensive compared to high throughput Single Nucleotide Polymorphism (SNP) data. We present a new computational method to infer per-locus HLA types using shared segments Identical By Descent (IBD), inferred from SNP genotype data. IBD information is modeled as a graph where shared haplotypes are explored among clusters of individuals with known and unknown HLA types to identify the latter. We analyze performance of the method in a previously typed subset of the HapMap population, achieving accuracy of 96% in HLA-A, 94% in HLA-B, 95% in HLA-C, 77% in HLA-DR1, 93% in HLA-DQA1 and 90% in HLA-DQB1 genes. We compare our method to a tag SNP based approach and demonstrate higher sensitivity and specificity. Our method demonstrates the power of using shared haplotype segments for large-scale imputation at the HLA locus.

  9. [Analysis of HLA haplotype frequency and linkage disequilibrium in patients with acute lymphoblastic leukemia from Northern Chinese Han].

    PubMed

    Gao, Su-qing; Cheng, Liang-hong; Lu, Liang; Jing, Shi-zheng; Cheng, Xi; Zhang, Yin-ze; Zou, Hong-yan; Deng, Zhi-hui

    2009-02-01

    To analyze the differences in the frequencies of HLA-A-B, B-DRB1 and A-B-DRB1 haplotypes, as well as their linkage disequilibrium patterns, between patients with acute lymphoblastic leukemia (ALL) and healthy controls from Northern Chinese Han. The frequencies of HLA-A-B, B-DRB1 and A-B-DR haplotypes and their linkage disequilibrium were estimated by the Expectation Maximization method based on the genotypes of 643 patients with ALL and 20,359 unrelated healthy donors, and the statistical significance of differences between the two groups was estimated by chi-square test. Linkage disequilibrium was analyzed with population genetic methods. The most common HLA-A-B, B-DRB1, and A-B-DR haplotypes were A30-B13, A2-B46, A33-B58, B13-DR7, B46-DR9, B52-DR15, B58-DR17, A30-B13-DR7, A33-B58-DR17 and A1-B37-DR10 in both groups. The frequencies and linkage disequilibrium values of the A30-B13, A2-B46, A33-B44, B13-DR7, A30-B13-DR7 and A2-B46-DR9 haplotypes were significantly lower (P<0.05) in the patient group than in the control group. On the other hand, the frequencies and linkage disequilibrium values of the A2-B52, A31-B61, A24-B8, B60-DR9, B27-DR4, B52-DR14, B44-DR17, B27-DR12 and A11-B27-DR12 haplotypes were significantly higher (P<0.05) in the patient group than in the control group. There are some common haplotypes with positive linkage disequilibrium in both the ALL patients and the healthy donors in Northern Chinese Han. Interestingly, some haplotypes and their linkage disequilibrium patterns had significantly different distributions between the two groups. The study provided basic data for the relationship of ALL and HLA haplotype and for finding HLA-A, B, DR matching donors.
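
    Expectation Maximization estimation of haplotype frequencies from unphased genotypes can be illustrated with a deliberately simplified sketch for two biallelic loci, where only the double heterozygote has ambiguous phase. HLA loci are multi-allelic and the study's software is not named in the abstract, so this is an illustration of the principle under that simplifying assumption; the function name and genotype counts are hypothetical.

```python
def em_two_locus(counts, iterations=100):
    """EM estimates of haplotype frequencies for two biallelic loci.

    counts[i][j] is the number of individuals carrying i copies of allele A
    at the first locus and j copies of allele B at the second locus.
    """
    n = sum(sum(row) for row in counts)
    if n == 0:
        raise ValueError("at least one genotyped individual is required")
    # Haplotype copies that are certain regardless of phase.
    base = {
        "AB": 2 * counts[2][2] + counts[2][1] + counts[1][2],
        "Ab": 2 * counts[2][0] + counts[2][1] + counts[1][0],
        "aB": 2 * counts[0][2] + counts[1][2] + counts[0][1],
        "ab": 2 * counts[0][0] + counts[1][0] + counts[0][1],
    }
    double_het = counts[1][1]  # phase is AB/ab or Ab/aB
    freq = {h: 0.25 for h in base}
    for _ in range(iterations):
        # E-step: probability that a double heterozygote is AB/ab.
        cis = freq["AB"] * freq["ab"]
        trans = freq["Ab"] * freq["aB"]
        w = cis / (cis + trans) if cis + trans > 0 else 0.5
        # M-step: re-estimate frequencies from expected haplotype counts.
        freq = {
            "AB": (base["AB"] + w * double_het) / (2 * n),
            "Ab": (base["Ab"] + (1 - w) * double_het) / (2 * n),
            "aB": (base["aB"] + (1 - w) * double_het) / (2 * n),
            "ab": (base["ab"] + w * double_het) / (2 * n),
        }
    return freq


# Hypothetical sample: 40 ab/ab, 20 double heterozygotes, 40 AB/AB.
genotype_counts = [[40, 0, 0], [0, 20, 0], [0, 0, 40]]
freqs = em_two_locus(genotype_counts)
print({h: round(f, 4) for h, f in freqs.items()})
```

    In this example every unambiguous individual carries only AB or ab haplotypes, so the iterations assign the double heterozygotes almost entirely to the AB/ab phase.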

  10. Exploring and Harnessing Haplotype Diversity to Improve Yield Stability in Crops.

    PubMed

    Qian, Lunwen; Hickey, Lee T; Stahl, Andreas; Werner, Christian R; Hayes, Ben; Snowdon, Rod J; Voss-Fels, Kai P

    2017-01-01

    In order to meet future food, feed, fiber, and bioenergy demands, global yields of all major crops need to be increased significantly. At the same time, the increasing frequency of extreme weather events such as heat and drought necessitates improvements in the environmental resilience of modern crop cultivars. Achieving sustainably increased yields implies rapid improvement of quantitative traits with a very complex genetic architecture and strong environmental interaction. Latest advances in genome analysis technologies today provide molecular information at an ultrahigh resolution, revolutionizing crop genomic research, and paving the way for advanced quantitative genetic approaches. These include highly detailed assessment of population structure and genotypic diversity, facilitating the identification of selective sweeps and signatures of directional selection, dissection of genetic variants that underlie important agronomic traits, and genomic selection (GS) strategies that not only consider major-effect genes. Single-nucleotide polymorphism (SNP) markers today represent the genotyping system of choice for crop genetic studies because they occur abundantly in plant genomes and are easy to detect. SNPs are typically biallelic, however, hence their information content compared to multiallelic markers is low, limiting the resolution at which SNP-trait relationships can be delineated. An efficient way to overcome this limitation is to construct haplotypes based on linkage disequilibrium, one of the most important features influencing genetic analyses of crop genomes. Here, we give an overview of the latest advances in genomics-based haplotype analyses in crops, highlighting their importance in the context of polyploidy and genome evolution, linkage drag, and co-selection. We provide examples of how haplotype analyses can complement well-established quantitative genetics frameworks, such as quantitative trait analysis and GS, ultimately providing an effective tool

  11. HLA-A, -B, -DRB1 allele and haplotype frequencies of 920 cord blood units from Central Chile.

    PubMed

    Schäfer, Christian; Sauter, Jürgen; Riethmüller, Tobias; Kashi, Zahra Mehdizadeh; Schmidt, Alexander H; Barriga, Francisco J

    2016-08-01

    We present human leukocyte antigen (HLA) haplotype and allele/antigenic group frequencies derived from a data set of 920 umbilical cord blood units collected in Central Chile. HLA-A and -B genotypes were typed using sequence specific oligonucleotide probe methods while HLA-DRB1 genotypes were obtained from sequencing-based typing. The most frequent haplotype is A*29~B*44~DRB1*07:01 with an estimated frequency of 2.1%. Copyright © 2016 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.

  12. Haplotype diversity in the equine myostatin gene with focus on variants associated with race distance propensity and muscle fiber type proportions

    PubMed Central

    Petersen, Jessica L; Valberg, Stephanie J; Mickelson, James R; McCue, Molly E

    2014-01-01

    Summary Two variants in the equine myostatin gene (MSTN), including a T/C SNP substitution in the first intron and a 227-bp SINE insertion in the promoter, are associated with muscle fiber type proportions in the Quarter Horse (QH) and with the prediction of race distance propensity in the Thoroughbred (TB). Genotypes from these loci, along with 18 additional variants surrounding MSTN, were examined in 301 horses of 14 breeds to evaluate haplotype relationships and diversity. The C allele of intron 1 was found in 12 of 14 breeds at a frequency of 0.27; the SINE was observed in five breeds, but common in only the TB and QH (0.73 and 0.48 respectively). Haplotype data suggest the SINE insertion is contemporary to and arose upon a haplotype containing the intron 1 C allele. Gluteal muscle biopsies of TBs showed a significant association of the intron 1 C allele and SINE with a higher proportion of Type 2B and lower proportion of Type 1 fibers. However, in the Belgian horse, in which the SINE is not present, the intron 1 SNP was not associated with fiber type proportions, and evaluation of fiber type proportions across the Belgian, TB and QH breeds shows the significant effect of breed on fiber type proportions is negated when evaluating horses without the SINE variant. These data suggest the SINE, rather than the intron 1 SNP, is driving the observed muscle fiber type characteristics and is the variant targeted by selection for short-distance racing. PMID:25160752

  13. FamLBL: detecting rare haplotype disease association based on common SNPs using case-parent triads.

    PubMed

    Wang, Meng; Lin, Shili

    2014-09-15

    In recent years, there has been an increasing interest in using common single-nucleotide polymorphisms (SNPs) amassed in genome-wide association studies to investigate rare haplotype effects on complex diseases. Evidence has suggested that rare haplotypes may tag rare causal single-nucleotide variants, making SNP-based rare haplotype analysis not only cost effective, but also more valuable for detecting causal variants. Although a number of methods for detecting rare haplotype association have been proposed in recent years, they are population based and thus susceptible to population stratification. We propose family-triad-based logistic Bayesian Lasso (famLBL) for estimating effects of haplotypes on complex diseases using SNP data. By choosing an appropriate prior distribution, effect sizes of unassociated haplotypes can be shrunk toward zero, allowing for more precise estimation of associated haplotypes, especially those that are rare, thereby achieving greater detection power. We evaluate famLBL using simulation to gauge its type I error and power. Comparison with its population counterpart, LBL, highlights famLBL's robustness in the presence of population substructure. Further investigation by comparing famLBL with Family-Based Association Test (FBAT) reveals its advantage for detecting rare haplotype association. famLBL is implemented as an R-package available at http://www.stat.osu.edu/~statgen/SOFTWARE/LBL/. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  14. Genetic analysis of autoimmune regulator haplotypes in alopecia areata.

    PubMed

    Wengraf, D A; McDonagh, A J G; Lovewell, T R J; Vasilopoulos, Y; Macdonald-Hull, S P; Cork, M J; Messenger, A G; Tazi-Ahnini, R

    2008-03-01

    Alopecia areata is an immune-mediated disorder, occurring with the highest observed frequency in the rare recessive autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED) syndrome caused by mutations of the autoimmune regulator (AIRE) gene on chromosome 21q22.3. We have previously detected association between alopecia areata and a single nucleotide polymorphism (SNP) in the AIRE gene in patients without APECED, and we now report the findings of an extended examination of the association of alopecia areata with haplotype analysis including six SNPs in the AIRE gene: C-103T, C4144G, T5238C, G6528A, T7215C and T11787C. In Caucasian groups of 295 patients and 363 controls, we found strong association between the AIRE 7215C allele and AA [P = 3.8 × 10⁻⁸, OR (95% CI): 2.69 (1.8-4.0)]. The previously reported association between AA and the AIRE 4144G allele was no longer significant on correction for multiple testing. The AIRE haplotypes CCTGCT and CGTGCC showed a highly significant association with AA [P = 6.05 × 10⁻⁶, 9.47 (2.91-30.8) and P = 0.001, 3.51 (1.55-7.95), respectively]. To select the haplotypes most informative for analysis, we tagged the polymorphisms using SNPTag software. Employing AIRE C-103T, G6528A, T7215C and T11787C as tag SNPs, two haplotypes were associated with AA; AIRE CGCT and AIRE CGCC [P = 3.84 × 10⁻⁷, 11.40 (3.53-36.9) and P = 3.94 × 10⁻⁴, 2.13 (1.39-3.24) respectively]. The AIRE risk haplotypes identified in this study potentially account for a major component of the genetic risk of developing alopecia areata.

  15. Common PCSK1 haplotypes are associated with obesity in the Chinese population.

    PubMed

    Chang, Yi-Cheng; Chiu, Yen-Feng; Shih, Kuang-Chung; Lin, Ming-Wei; Sheu, Wayne Huey-Herng; Donlon, Timothy; Curb, Jess David; Jou, Yuh-Shan; Chang, Tien-Jyun; Li, Hung-Yuan; Chuang, Lee-Ming

    2010-07-01

    Prohormone convertase subtilisin/kexin type 1 (PCSK1) genetic polymorphisms have recently been associated with obesity in European populations. This study aimed to examine whether common PCSK1 genetic variation is associated with obesity and related metabolic phenotypes in the Chinese population. We genotyped nine common tag single-nucleotide polymorphisms (tagSNPs) of the PCSK1 gene in 1,094 subjects of Chinese origin from the Stanford Asia-Pacific Program for Hypertension and Insulin Resistance (SAPPHIRe) family study. One SNP in the PCSK1 gene (rs155971) was nominally associated with risk of obesity in the SAPPHIRe cohort (P = 0.01). A common protective haplotype was associated with reduced risk of obesity (23.79% vs. 32.89%, P = 0.01) and smaller waist circumference (81.71 ± 10.22 vs. 84.75 ± 10.48 cm, P = 0.02). Another common haplotype was significantly associated with increased risk of obesity (37.07% vs. 23.84%, P = 0.005). The global P value for haplotype association with obesity was 0.02. We also identified a suggestive association of another PCSK1 SNP (rs3811951) with fasting glucose, fasting insulin, homeostasis model assessment of insulin resistance (HOMA-IR), triglycerides, and high-density lipoprotein cholesterol (P = 0.05, 0.003, 0.001, 0.04, and 0.04, respectively). These data indicate common PCSK1 genetic variants are associated with obesity in the Chinese population.

  16. PAX6 Haplotypes Are Associated with High Myopia in Han Chinese

    PubMed Central

    Jiang, Bo; Yap, Maurice K. H.; Leung, Kim Hung; Ng, Po Wah; Fung, Wai Yan; Lam, Wai Wa; Gu, Yang-shun; Yip, Shea Ping

    2011-01-01

    Background The paired box 6 (PAX6) gene is considered a master gene for eye development. Linkage of myopia to the PAX6 region on chromosome 11p13 was shown in several studies, but the results for association between myopia and PAX6 have so far been inconsistent. Methodology/Principal Findings We genotyped 16 single nucleotide polymorphisms (SNPs) in the PAX6 gene and its regulatory regions in an initial study for 300 high myopia cases and 300 controls (Group 1), and successfully replicated the positive results with another independent group of 299 high myopia cases and 299 controls (Group 2). Five SNPs were genotyped in the replication study. The spherical equivalent of subjects with high myopia was ≤−8.0 dioptres. The PLINK package was used for genetic data analysis. No association was found between each of the SNPs and high myopia. However, exhaustive sliding-window haplotype analysis highlighted an important role for rs12421026 because haplotypes containing this SNP were found to be associated with high myopia. The most significant results were given by the 4-SNP haplotype window consisting of rs2071754, rs3026393, rs1506 and rs12421026 (P = 3.54×10−10, 4.06×10−11 and 1.56×10−18 for Group 1, Group 2 and Combined Group, respectively) and the 3-SNP haplotype window composed of rs3026393, rs1506 and rs12421026 (P = 5.48×10−10, 7.93×10−12 and 6.28×10−23 for the three respective groups). The results remained significant after correction for multiple comparisons by permutations. The associated haplotypes found in a previous study were also successfully replicated in this study. Conclusions/Significance PAX6 haplotypes are associated with susceptibility to the development of high myopia in Chinese. The PAX6 locus plays a role in high myopia. PMID:21589860
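
    To make the "exhaustive sliding-window" idea concrete, the sketch below enumerates every contiguous window of adjacent SNPs, each of which would then be tested as a haplotype. It assumes "exhaustive" means all window sizes at all start positions; it does not reproduce PLINK's own implementation, and the function name and the four-SNP list in the usage lines are illustrative.

```python
def sliding_windows(snps, min_size=2, max_size=None):
    """Yield every contiguous window of SNP IDs as a tuple.

    Windows are produced by increasing size and, within a size,
    by increasing start position along the ordered SNP list.
    """
    if max_size is None:
        max_size = len(snps)
    for size in range(min_size, max_size + 1):
        for start in range(len(snps) - size + 1):
            yield tuple(snps[start:start + size])


# The four SNPs named in the abstract, in the order given there.
snps = ["rs2071754", "rs3026393", "rs1506", "rs12421026"]
windows = list(sliding_windows(snps))  # 6 windows: three 2-SNP, two 3-SNP, one 4-SNP
```

    The number of windows grows quadratically with the number of SNPs, which is one reason such analyses are followed by a permutation-based correction for multiple comparisons, as the abstract describes.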

  17. The IGF1 small dog haplotype is derived from Middle Eastern grey wolves.

    PubMed

    Gray, Melissa M; Sutter, Nathan B; Ostrander, Elaine A; Wayne, Robert K

    2010-02-24

    A selective sweep containing the insulin-like growth factor 1 (IGF1) gene is associated with size variation in domestic dogs. Intron 2 of IGF1 contains a SINE element and single nucleotide polymorphism (SNP) found in all small dog breeds that is almost entirely absent from large breeds. In this study, we surveyed a large sample of grey wolf populations to better understand the ancestral pattern of variation at IGF1 with a particular focus on the distribution of the small dog haplotype and its relationship to the origin of the dog. We present DNA sequence data that confirms the absence of the derived small SNP allele in the intron 2 region of IGF1 in a large sample of grey wolves and further establishes the absence of a small dog associated SINE element in all wild canids and most large dog breeds. Grey wolf haplotypes from the Middle East have higher nucleotide diversity suggesting an origin there. Additionally, PCA and phylogenetic analyses suggest a closer kinship of the small domestic dog IGF1 haplotype with those from Middle Eastern grey wolves. The absence of both the SINE element and SNP allele in grey wolves suggests that the mutation for small body size post-dates the domestication of dogs. However, because all small dogs possess these diagnostic mutations, the mutations likely arose early in the history of domestic dogs. Our results show that the small dog haplotype is closely related to those in Middle Eastern wolves and is consistent with an ancient origin of the small dog haplotype there. Thus, in concordance with past archeological studies, our molecular analysis is consistent with the early evolution of small size in dogs from the Middle East. See associated opinion by Driscoll and Macdonald: http://jbiol.com/content/9/2/10.

  18. No association between polymorphisms/haplotypes of the vascular endothelial growth factor gene and preeclampsia

    PubMed Central

    2011-01-01

    Background Preeclampsia (PE) is the first worldwide cause of death in pregnant women, intra-uterine growth retardation, and fetal prematurity. Some vascular endothelial growth factor gene (VEGF) polymorphisms have been associated with PE and other pregnancy disturbances. We evaluated the associations between VEGF genotypes/haplotypes and PE in Mexican women. Methods 164 pregnant women were enrolled in a case-control study (78 cases and 86 normotensive pregnant controls). The rs699947 (-2578C/A), rs1570360 (-1154G/A), rs2010963 (+405G/C), and rs25648 (-7C/T) VEGF variants were discriminated using Polymerase Chain Reaction - Restriction Fragment Length Polymorphism (PCR-RFLP) methods or Taqman single nucleotide polymorphism (SNP) assays. Results The proportions of the minor allele for rs699947, rs1570360, rs2010963, and rs25648 VEGF SNPs were 0.33, 0.2, 0.39, and 0.17 in controls, and 0.39, 0.23, 0.41, and 0.15 in cases, respectively (P values > 0.05). The most frequent haplotypes of rs699947, rs1570360, rs2010963, and rs25648 VEGF SNPs were C-G-C-C and C-G-G-C, with frequencies of 0.39, 0.21 in cases and 0.37, 0.25 in controls, respectively (P values > 0.05). Conclusion There was no evidence of an association between VEGF allele, genotype, or haplotype frequencies and PE in our study. PMID:21575227

  19. Haplotype frequency distribution for 7 microsatellites in chromosome 8 and 11 in relation to the metabolic syndrome in four ethnic groups: Tehran Lipid and Glucose Study.

    PubMed

    Daneshpour, Maryam Sadat; Hosseinzadeh, Nima; Zarkesh, Maryam; Azizi, Fereidoun

    2012-03-01

    Different variants of haplotype frequencies may lead to various frequencies of the same variants in individuals with drug resistance and disease susceptibility at the population level. In this study, the haplotype frequencies of 4 STR loci including the D8S1132, D8S1779, D8S514 and D8S1743, and 3 STR loci including D11S1304, D11S1998 and D11S934 were investigated in 563 individuals of four Iranian ethnic groups in the capital city of Iran, Tehran. One hundred thirty subjects had the metabolic syndrome. Haplotype frequencies of all markers were calculated. There were significant differences in the haplotype frequencies in short and long alleles between the metabolic affected subjects and controls. In addition, haplotype frequencies were significant in the four ethnic groups in both chromosomes 8 and 11. Our findings show a relation between the short allele of D8S1743 in all related haplotype frequencies of subjects with metabolic syndrome. These findings may require more studies of some candidate genes, including the lipoprotein lipase gene, in this chromosomal region. Copyright © 2011. Published by Elsevier B.V.

  20. Modeling and E-M estimation of haplotype-specific relative risks from genotype data for a case-control study of unrelated individuals.

    PubMed

    Stram, Daniel O; Leigh Pearce, Celeste; Bretsky, Phillip; Freedman, Matthew; Hirschhorn, Joel N; Altshuler, David; Kolonel, Laurence N; Henderson, Brian E; Thomas, Duncan C

    2003-01-01

    The US National Cancer Institute has recently sponsored the formation of a Cohort Consortium (http://2002.cancer.gov/scpgenes.htm) to facilitate the pooling of data on very large numbers of people, concerning the effects of genes and environment on cancer incidence. One likely goal of these efforts will be to generate a large population-based case-control series for which a number of candidate genes will be investigated using SNP haplotype as well as genotype analysis. The goal of this paper is to outline the issues involved in choosing a method of estimating haplotype-specific risk estimates for such data that is technically appropriate and yet attractive to epidemiologists who are already comfortable with odds ratios and logistic regression. Our interest is to develop and evaluate extensions of methods, based on haplotype imputation, that have been recently described (Schaid et al., Am J Hum Genet, 2002, and Zaykin et al., Hum Hered, 2002) as providing score tests of the null hypothesis of no effect of SNP haplotypes upon risk, which may be used for more complex tasks, such as providing confidence intervals, and tests of equivalence of haplotype-specific risks in two or more separate populations. In order to do so we (1) develop a cohort approach towards odds ratio analysis by expanding the E-M algorithm to provide maximum likelihood estimates of haplotype-specific odds ratios as well as genotype frequencies; (2) show how to correct the cohort approach, to give essentially unbiased estimates for population-based or nested case-control studies by incorporating the probability of selection as a case or control into the likelihood, based on a simplified model of case and control selection, and (3) finally, in an example data set (CYP17 and breast cancer, from the Multiethnic Cohort Study) we compare likelihood-based confidence interval estimates from the two methods with each other, and with the use of the single-imputation approach of Zaykin et al. applied under both
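
    The E-M algorithm that the abstract extends can be illustrated in its simplest textbook form: estimating haplotype frequencies for two biallelic SNPs from unphased genotypes. The sketch below is not the authors' method (it estimates frequencies only, not odds ratios, and ignores case-control sampling), and the function name and input layout are assumptions made for this example.

```python
def em_haplotype_freqs(geno_counts, max_iter=200, tol=1e-12):
    """Estimate two-SNP haplotype frequencies from unphased genotype counts.

    geno_counts maps (g1, g2) to a number of individuals, where g1 and g2
    are the counts (0, 1 or 2) of allele '1' at SNP 1 and SNP 2.
    Returns frequencies for haplotypes '00', '01', '10' and '11'.
    """
    haps = ("00", "01", "10", "11")
    n_double_het = geno_counts.get((1, 1), 0)

    # Every genotype except the double heterozygote has a known phase.
    known = dict.fromkeys(haps, 0.0)
    for (g1, g2), count in geno_counts.items():
        if (g1, g2) == (1, 1):
            continue
        first = f"{int(g1 == 2)}{int(g2 == 2)}"
        second = f"{int(g1 >= 1)}{int(g2 >= 1)}"
        known[first] += count
        known[second] += count

    total = 2 * sum(geno_counts.values())  # chromosomes in the sample
    freqs = dict.fromkeys(haps, 0.25)      # uninformative starting point
    for _ in range(max_iter):
        # E-step: probability that a double heterozygote is 00/11 (cis)
        # rather than 01/10 (trans), given the current frequencies.
        cis = freqs["00"] * freqs["11"]
        trans = freqs["01"] * freqs["10"]
        p_cis = cis / (cis + trans) if cis + trans > 0 else 0.5
        expected = dict(known)
        expected["00"] += n_double_het * p_cis
        expected["11"] += n_double_het * p_cis
        expected["01"] += n_double_het * (1 - p_cis)
        expected["10"] += n_double_het * (1 - p_cis)
        # M-step: re-estimate frequencies from expected haplotype counts.
        updated = {h: expected[h] / total for h in haps}
        change = max(abs(updated[h] - freqs[h]) for h in haps)
        freqs = updated
        if change < tol:
            break
    return freqs


# Hypothetical genotype counts in which the two SNPs are in complete LD.
freqs = em_haplotype_freqs({(0, 0): 25, (1, 1): 50, (2, 2): 25})
```

    Only the double heterozygotes carry phase uncertainty in the two-SNP case, so the E-step reduces to a single probability; with more SNPs the same idea applies over all haplotype pairs compatible with each genotype.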

  1. The effect of using genealogy-based haplotypes for genomic prediction.

    PubMed

    Edriss, Vahid; Fernando, Rohan L; Su, Guosheng; Lund, Mogens S; Guldbrandtsen, Bernt

    2013-03-06

    Genomic prediction uses two sources of information: linkage disequilibrium between markers and quantitative trait loci, and additive genetic relationships between individuals. One way to increase the accuracy of genomic prediction is to capture more linkage disequilibrium by regression on haplotypes instead of regression on individual markers. The aim of this study was to investigate the accuracy of genomic prediction using haplotypes based on local genealogy information. A total of 4429 Danish Holstein bulls were genotyped with the 50K SNP chip. Haplotypes were constructed using local genealogical trees. Effects of haplotype covariates were estimated with two types of prediction models: (1) assuming that effects had the same distribution for all haplotype covariates, i.e. the GBLUP method and (2) assuming that a large proportion (π) of the haplotype covariates had zero effect, i.e. a Bayesian mixture method. About 7.5 times more covariate effects were estimated when fitting haplotypes based on local genealogical trees compared to fitting individual markers. Genealogy-based haplotype clustering slightly increased the accuracy of genomic prediction and, in some cases, decreased the bias of prediction. With the Bayesian method, accuracy of prediction was less sensitive to parameter π when fitting haplotypes compared to fitting markers. Use of haplotypes based on genealogy can slightly increase the accuracy of genomic prediction. Improved methods to cluster the haplotypes constructed from local genealogy could lead to additional gains in accuracy.

  2. Bootstrap study of genome-enabled prediction reliabilities using haplotype blocks across Nordic Red cattle breeds.

    PubMed

    Cuyabano, B C D; Su, G; Rosa, G J M; Lund, M S; Gianola, D

    2015-10-01

    This study compared the accuracy of genome-enabled prediction models using individual single nucleotide polymorphisms (SNP) or haplotype blocks as covariates when using either a single breed or a combined population of Nordic Red cattle. The main objective was to compare predictions of breeding values of complex traits using a combined training population with haplotype blocks, with predictions using a single breed as training population and individual SNP as predictors. To compare the prediction reliabilities, bootstrap samples were taken from the test data set. With the bootstrapped samples of prediction reliabilities, we built and graphed confidence ellipses to allow comparisons. Finally, measures of statistical distances were used to calculate the gain in predictive ability. Our analyses are innovative in the context of assessment of predictive models, allowing a better understanding of prediction reliabilities and providing a statistical basis to effectively calibrate whether one prediction scenario is indeed more accurate than another. An ANOVA indicated that use of haplotype blocks produced significant gains mainly when Bayesian mixture models were used but not when Bayesian BLUP was fitted to the data. Furthermore, when haplotype blocks were used to train prediction models in a combined Nordic Red cattle population, we obtained up to a statistically significant 5.5% average gain in prediction accuracy, over predictions using individual SNP and training the model with a single breed. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  3. A reduced number of mtSNPs saturates mitochondrial DNA haplotype diversity of worldwide population groups.

    PubMed

    Salas, Antonio; Amigo, Jorge

    2010-05-03

    The high levels of variation characterising the mitochondrial DNA (mtDNA) molecule are due ultimately to its high average mutation rate; moreover, mtDNA variation is deeply structured in different populations and ethnic groups. There is growing interest in selecting a reduced number of mtDNA single nucleotide polymorphisms (mtSNPs) that account for the maximum level of discrimination power in a given population. Applications of the selected mtSNP panel range from anthropologic and medical studies to forensic genetic casework. This study proposes a new simulation-based method that explores the ability of different mtSNP panels to yield the maximum levels of discrimination power. The method explores subsets of mtSNPs of different sizes randomly chosen from a preselected panel of mtSNPs based on frequency. More than 2,000 complete genomes representing three main continental human population groups (Africa, Europe, and Asia) and two admixed populations ("African-Americans" and "Hispanics") were collected from GenBank and the literature, and were used as training sets. Haplotype diversity was measured for each combination of mtSNP and compared with existing mtSNP panels available in the literature. The data indicates that only a reduced number of mtSNPs ranging from six to 22 are needed to account for 95% of the maximum haplotype diversity of a given population sample. However, only a small proportion of the best mtSNPs are shared between populations, indicating that there is not a perfect set of "universal" mtSNPs suitable for all population contexts. The discrimination power provided by these mtSNPs is much higher than the power of the mtSNP panels proposed in the literature to date. Some mtSNP combinations also yield high diversity values in admixed populations. The proposed computational approach for exploring combinations of mtSNPs that optimise the discrimination power of a given set of mtSNPs is more efficient than previous empirical approaches. In contrast to
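
    The simulation idea described above (draw random mtSNP subsets from a candidate panel and score each by haplotype diversity) can be sketched as follows. The abstract does not state which diversity estimator was used; Nei's unbiased estimator is assumed here, and the function names, toy sequences and parameters are all illustrative rather than taken from the paper.

```python
import random
from collections import Counter


def haplotype_diversity(haplotypes):
    """Nei's unbiased haplotype diversity: n/(n-1) * (1 - sum of p_i squared)."""
    n = len(haplotypes)
    if n < 2:
        return 0.0
    counts = Counter(haplotypes)
    return n / (n - 1) * (1.0 - sum((c / n) ** 2 for c in counts.values()))


def best_random_panel(sequences, candidate_sites, panel_size, n_trials=200, seed=0):
    """Sample random site subsets and keep the one with the highest diversity.

    sequences: equal-length strings (or lists) of alleles, one per individual.
    candidate_sites: list of 0-based positions allowed in a panel.
    Returns (diversity, panel), where panel is a sorted tuple of positions.
    """
    rng = random.Random(seed)
    best = (-1.0, ())
    for _ in range(n_trials):
        panel = tuple(sorted(rng.sample(candidate_sites, panel_size)))
        # A haplotype is the tuple of alleles observed at the panel sites.
        haps = [tuple(seq[i] for i in panel) for seq in sequences]
        best = max(best, (haplotype_diversity(haps), panel))
    return best


# Toy data: site 2 is invariant, so the best 2-site panel is sites 0 and 1.
toy = ["AAC", "AGC", "GAC", "GGC"]
diversity, panel = best_random_panel(toy, [0, 1, 2], panel_size=2)
```

    Comparing the best score at each panel size with the diversity obtained from all candidate sites gives the kind of saturation curve the abstract summarises as "six to 22" mtSNPs for 95% of the maximum.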

  4. A Reduced Number of mtSNPs Saturates Mitochondrial DNA Haplotype Diversity of Worldwide Population Groups

    PubMed Central

    Salas, Antonio; Amigo, Jorge

    2010-01-01

    Background The high levels of variation characterising the mitochondrial DNA (mtDNA) molecule are due ultimately to its high average mutation rate; moreover, mtDNA variation is deeply structured in different populations and ethnic groups. There is growing interest in selecting a reduced number of mtDNA single nucleotide polymorphisms (mtSNPs) that account for the maximum level of discrimination power in a given population. Applications of the selected mtSNP panel range from anthropologic and medical studies to forensic genetic casework. Methodology/Principal Findings This study proposes a new simulation-based method that explores the ability of different mtSNP panels to yield the maximum levels of discrimination power. The method explores subsets of mtSNPs of different sizes randomly chosen from a preselected panel of mtSNPs based on frequency. More than 2,000 complete genomes representing three main continental human population groups (Africa, Europe, and Asia) and two admixed populations (“African-Americans” and “Hispanics”) were collected from GenBank and the literature, and were used as training sets. Haplotype diversity was measured for each combination of mtSNP and compared with existing mtSNP panels available in the literature. The data indicates that only a reduced number of mtSNPs ranging from six to 22 are needed to account for 95% of the maximum haplotype diversity of a given population sample. However, only a small proportion of the best mtSNPs are shared between populations, indicating that there is not a perfect set of “universal” mtSNPs suitable for all population contexts. The discrimination power provided by these mtSNPs is much higher than the power of the mtSNP panels proposed in the literature to date. Some mtSNP combinations also yield high diversity values in admixed populations. Conclusions/Significance The proposed computational approach for exploring combinations of mtSNPs that optimise the discrimination power of a given set of

  5. Reconstruction of Haplotype-Blocks Selected during Experimental Evolution.

    PubMed

    Franssen, Susanne U; Barton, Nicholas H; Schlötterer, Christian

    2017-01-01

    The genetic analysis of experimentally evolving populations typically relies on short reads from pooled individuals (Pool-Seq). While this method provides reliable allele frequency estimates, the underlying haplotype structure remains poorly characterized. With small population sizes and adaptive variants that start from low frequencies, the interpretation of selection signatures in most Evolve and Resequencing studies remains challenging. To facilitate the characterization of selection targets, we propose a new approach that reconstructs selected haplotypes from replicated time series, using Pool-Seq data. We identify selected haplotypes through the correlated frequencies of alleles carried by them. Computer simulations indicate that selected haplotype-blocks of several Mb can be reconstructed with high confidence and low error rates, even when allele frequencies change only by 20% across three replicates. Applying this method to real data from D. melanogaster populations adapting to a hot environment, we identify a selected haplotype-block of 6.93 Mb. We confirm the presence of this haplotype-block in evolved populations by experimental haplotyping, demonstrating the power and accuracy of our haplotype reconstruction from Pool-Seq data. We propose that the combination of allele frequency estimates with haplotype information will provide the key to understanding the dynamics of adaptive alleles. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
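
    The core idea, that alleles on the same selected haplotype show correlated frequency trajectories across time points and replicates, can be illustrated with a deliberately simplified sketch. It groups SNPs whose trajectories have a Pearson correlation above a threshold using single-linkage grouping; the threshold, function names and toy trajectories are assumptions for this example, and the published method is considerably more elaborate.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # a flat trajectory carries no correlation signal
    return sxy / sqrt(sxx * syy)


def group_correlated_snps(trajectories, threshold=0.9):
    """Group SNP indices whose frequency trajectories are strongly correlated.

    trajectories[i] holds the allele frequencies of SNP i across all
    time points and replicates. Grouping is single-linkage (union-find).
    """
    parent = list(range(len(trajectories)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    for i in range(len(trajectories)):
        for j in range(i + 1, len(trajectories)):
            if pearson(trajectories[i], trajectories[j]) >= threshold:
                parent[find(j)] = find(i)

    groups = {}
    for i in range(len(trajectories)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


# Toy frequencies: SNPs 0 and 1 rise together, SNP 2 fluctuates independently.
toy = [[0.1, 0.2, 0.3, 0.4], [0.2, 0.4, 0.6, 0.8], [0.5, 0.4, 0.6, 0.5]]
blocks = group_correlated_snps(toy)
```

    In real Pool-Seq data the trajectories are noisy estimates, so a fixed threshold is a crude stand-in for the statistical treatment a full method would need.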

  6. High-resolution HLA-A, HLA-B, and HLA-DRB1 haplotype frequencies from the French Bone Marrow Donor Registry.

    PubMed

    Gourraud, Pierre-Antoine; Pappas, Derek James; Baouz, Amar; Balère, Marie-Lorraine; Garnier, Federico; Marry, Evelyne

    2015-05-01

    We have estimated human leukocyte antigen (HLA) haplotype frequencies using the maximum likelihood mode, which accommodates typing ambiguities. The results of the frequency distribution of the 7015 haplotypes obtained are presented here. These include a total of 114 HLA-A, 185 HLA-B, and 76 HLA-DRB1 unique alleles at each locus. Across all populations, although the most common individual HLA alleles were HLA-A*02:01 (29.0%), HLA-B*07:02 (11.4%), and HLA-DRB1*07:01 (15.9%), the most frequent haplotype was found to be HLA-A*01:01∼B*08:01∼DRB1*03:01. Copyright © 2015 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.

  7. Haplotypic Background of a Private Allele at High Frequency in the Americas

    PubMed Central

    Schroeder, Kari B.; Jakobsson, Mattias; Crawford, Michael H.; Schurr, Theodore G.; Boca, Simina M.; Conrad, Donald F.; Tito, Raul Y.; Osipova, Ludmilla P.; Tarskaia, Larissa A.; Zhadanov, Sergey I.; Wall, Jeffrey D.; Pritchard, Jonathan K.; Malhi, Ripan S.; Smith, David G.; Rosenberg, Noah A.

    2009-01-01

    Recently, the observation of a high-frequency private allele, the 9-repeat allele at microsatellite D9S1120, in all sampled Native American and Western Beringian populations has been interpreted as evidence that all modern Native Americans descend primarily from a single founding population. However, this inference assumed that all copies of the 9-repeat allele were identical by descent and that the geographic distribution of this allele had not been influenced by natural selection. To investigate whether these assumptions are satisfied, we genotyped 34 single nucleotide polymorphisms across ∼500 kilobases (kb) around D9S1120 in 21 Native American and Western Beringian populations and 54 other worldwide populations. All chromosomes with the 9-repeat allele share the same haplotypic background in the vicinity of D9S1120, suggesting that all sampled copies of the 9-repeat allele are identical by descent. Ninety-one percent of these chromosomes share the same 76.26 kb haplotype, which we call the “American Modal Haplotype” (AMH). Three observations lead us to conclude that the high frequency and widespread distribution of the 9-repeat allele are unlikely to be the result of positive selection: 1) aside from its association with the 9-repeat allele, the AMH does not have a high frequency in the Americas, 2) the AMH is not unusually long for its frequency compared with other haplotypes in the Americas, and 3) in Latin American mestizo populations, the proportion of Native American ancestry at D9S1120 is not unusual compared with that observed at other genomewide microsatellites. Using a new method for estimating the time to the most recent common ancestor (MRCA) of all sampled copies of an allele on the basis of an estimate of the length of the genealogy descended from the MRCA, we calculate the mean time to the MRCA of the 9-repeat allele to be between 7,325 and 39,900 years, depending on the demographic model used. The results support the hypothesis that all

  8. Mining of haplotype-based expressed sequence tag single nucleotide polymorphisms in citrus

    PubMed Central

    2013-01-01

    Background Single nucleotide polymorphisms (SNPs), the most abundant variations in a genome, have been widely used in various studies. Detection and characterization of citrus haplotype-based expressed sequence tag (EST) SNPs will greatly facilitate further utilization of these gene-based resources. Results In this paper, haplotype-based SNPs were mined out of publicly available citrus expressed sequence tags (ESTs) from different citrus cultivars (genotypes) individually and collectively for comparison. There were a total of 567,297 ESTs belonging to 27 cultivars in varying numbers and consequentially yielding different numbers of haplotype-based quality SNPs. Sweet orange (SO) had the most (213,830) ESTs, generating 11,182 quality SNPs in 3,327 out of 4,228 usable contigs. Summed from all the individually mining results, a total of 25,417 quality SNPs were discovered – 15,010 (59.1%) were transitions (AG and CT), 9,114 (35.9%) were transversions (AC, GT, CG, and AT), and 1,293 (5.0%) were insertion/deletions (indels). A vast majority of SNP-containing contigs consisted of only 2 haplotypes, as expected, but the percentages of 2 haplotype contigs varied widely in these citrus cultivars. BLAST of the 25,417 25-mer SNP oligos to the Clementine reference genome scaffolds revealed 2,947 SNPs had “no hits found”, 19,943 had 1 unique hit / alignment, 1,571 had one hit and 2+ alignments per hit, and 956 had 2+ hits and 1+ alignment per hit. Of the total 24,293 scaffold hits, 23,955 (98.6%) were on the main scaffolds 1 to 9, and only 338 were on 87 minor scaffolds. Most alignments had 100% (25/25) or 96% (24/25) nucleotide identities, accounting for 93% of all the alignments. Considering almost all the nucleotide discrepancies in the 24/25 alignments were at the SNP sites, it served well as in silico validation of these SNPs, in addition to and consistent with the rate (81%) validated by sequencing and SNaPshot assay. Conclusions High-quality EST-SNPs from different
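
    The three variant classes tallied in the abstract follow the standard definitions: transitions are purine-purine or pyrimidine-pyrimidine changes (A/G, C/T), transversions are the other four base pairs, and anything involving a gap or a length change is an indel. The helper below is a minimal sketch of that classification; the function name and the use of "-" for a gap are assumptions, not the authors' pipeline.

```python
# Unordered allele pairs that count as transitions.
TRANSITIONS = {frozenset("AG"), frozenset("CT")}


def classify_variant(allele_a, allele_b):
    """Classify a two-allele variant as 'transition', 'transversion' or 'indel'.

    Alleles are single bases (A, C, G, T); a gap character '-' or any allele
    that is not exactly one base long is treated as an insertion/deletion.
    """
    a, b = allele_a.upper(), allele_b.upper()
    if len(a) != 1 or len(b) != 1 or "-" in (a, b):
        return "indel"
    if a == b or a not in "ACGT" or b not in "ACGT":
        raise ValueError(f"not a valid biallelic SNP: {allele_a}/{allele_b}")
    return "transition" if frozenset((a, b)) in TRANSITIONS else "transversion"


# Tally a small, made-up list of variants by class.
tally = {}
for pair in [("A", "G"), ("C", "T"), ("A", "C"), ("G", "-")]:
    kind = classify_variant(*pair)
    tally[kind] = tally.get(kind, 0) + 1
```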

  9. [Gene and haplotype frequencies for the loci HLA-A, B and DRB1 in 11755 north Chinese Han bone marrow registry donors].

    PubMed

    Wu, Qiang-Ju; Liu, Meng-Li; Qi, Jun; Liu, Sheng; Zhang, Yan; Wei, Xiao-Qian

    2007-04-01

    This study aimed to investigate the human leukocyte antigen (HLA)-A, B, DRB1 allele and haplotype frequencies and the characteristics of linkage disequilibrium in north Chinese Han bone marrow donors. HLA phenotype data of 11755 north Chinese Han bone marrow donors were identified by PCR-SSP and PCR-SSO. HLA-A, B, DRB1 allele and haplotype frequencies were calculated with the Arlequin software, which is based on Expectation-Maximization (EM) algorithms. The population of 11755 unrelated donors was tested for Hardy-Weinberg equilibrium, and 18, 42 and 15 specificities of HLA alleles were identified at the HLA-A, B and DRB1 loci respectively, including HLA-A25, B42, B53, B73 and DR3, which were rarely reported in the Han population. HLA-A36, A43, A80, B78, B82 and DR18 were not detected in this study. The most frequent alleles, with a frequency of over 0.05, were HLA-A*02, A*11, A*24, A*33, A*30, A*01, A*03, A*13, B62, B*51, B*46, B60, B61, B*35, B*44, DRB1*15, DRB1*09, DRB1*04, DRB1*07, DRB1*12, DRB1*11, DRB1*14, DRB1*08, DRB1*13. A total of 2026 kinds of HLA-A-B-DR haplotypes (with a frequency of over 10(-6)) were obtained. Each of 26 kinds of three-locus haplotypes, including HLA-A30-B13-DR7, A2-B46-DR9, A33-B58-DR17 etc, had a frequency higher than 0.005. A30-B13-DR7 was the most frequent haplotype in the north Chinese Han population. A total of 538 kinds of haplotypes for HLA-A-B, 227 kinds for A-DR and 522 kinds for B-DR were obtained, and there were 409, 195 and 423 kinds of haplotypes respectively with a frequency higher than 10(-6). There were 28 kinds of HLA-A-B haplotypes including A30-B13, A2-B46, A33-B58 etc, 26 kinds of HLA-A-DR haplotypes including A2-DR9, A2-DR15, A30-DR7 etc, and 24 kinds of HLA-B-DR haplotypes including B13-DR7, B46-DR9, B13-DR12 etc with a frequency higher than 0.01. 296 (72%) kinds of HLA-A-B, 130 (67%) kinds of A-DR and 308 (73%) kinds of B-DR haplotypes were statistical linkage

  10. The effect of using genealogy-based haplotypes for genomic prediction

    PubMed Central

    2013-01-01

    Background Genomic prediction uses two sources of information: linkage disequilibrium between markers and quantitative trait loci, and additive genetic relationships between individuals. One way to increase the accuracy of genomic prediction is to capture more linkage disequilibrium by regression on haplotypes instead of regression on individual markers. The aim of this study was to investigate the accuracy of genomic prediction using haplotypes based on local genealogy information. Methods A total of 4429 Danish Holstein bulls were genotyped with the 50K SNP chip. Haplotypes were constructed using local genealogical trees. Effects of haplotype covariates were estimated with two types of prediction models: (1) assuming that effects had the same distribution for all haplotype covariates, i.e. the GBLUP method and (2) assuming that a large proportion (π) of the haplotype covariates had zero effect, i.e. a Bayesian mixture method. Results About 7.5 times more covariate effects were estimated when fitting haplotypes based on local genealogical trees compared to fitting individual markers. Genealogy-based haplotype clustering slightly increased the accuracy of genomic prediction and, in some cases, decreased the bias of prediction. With the Bayesian method, accuracy of prediction was less sensitive to parameter π when fitting haplotypes compared to fitting markers. Conclusions Use of haplotypes based on genealogy can slightly increase the accuracy of genomic prediction. Improved methods to cluster the haplotypes constructed from local genealogy could lead to additional gains in accuracy. PMID:23496971

  11. Identification of SNP Haplotypes and Prospects of Association Mapping in Watermelon

    USDA-ARS's Scientific Manuscript database

    Watermelon is the fifth most economically important vegetable crop cultivated world-wide. Implementing Single Nucleotide Polymorphism (SNP) marker technology in watermelon breeding and germplasm evaluation programs holds a key to improve horticulturally important traits. Next-generation sequencing...

  12. TUMOR HAPLOTYPE ASSEMBLY ALGORITHMS FOR CANCER GENOMICS

    PubMed Central

    AGUIAR, DEREK; WONG, WENDY S.W.; ISTRAIL, SORIN

    2014-01-01

    The growing availability of inexpensive high-throughput sequence data is enabling researchers to sequence tumor populations within a single individual at high coverage. But cancer genome sequence evolution and mutational phenomena like driver mutations and gene fusions are difficult to investigate without first reconstructing tumor haplotype sequences. Haplotype assembly of single individual tumor populations is an exceedingly difficult task complicated by tumor haplotype heterogeneity, tumor or normal cell sequence contamination, polyploidy, and complex patterns of variation. While computational and experimental haplotype phasing of diploid genomes has seen much progress in recent years, haplotype assembly in cancer genomes remains uncharted territory. In this work, we describe HapCompass-Tumor, a computational modeling and algorithmic framework for haplotype assembly of copy number variable cancer genomes containing haplotypes at different frequencies and complex variation. We extend our polyploid haplotype assembly model and present novel algorithms for (1) complex variations, including copy number changes, as varying numbers of disjoint paths in an associated graph, (2) variable haplotype frequencies and contamination, and (3) computation of tumor haplotypes using simple cycles of the compass graph which constrain the space of haplotype assembly solutions. The model and algorithm are implemented in the software package HapCompass-Tumor, which is available for download from http://www.brown.edu/Research/Istrail_Lab/. PMID:24297529

  13. Asian population frequencies and haplotype distribution of killer cell immunoglobulin-like receptor (KIR) genes among Chinese, Malay, and Indian in Singapore.

    PubMed

    Lee, Yi Chuan; Chan, Soh Ha; Ren, Ee Chee

    2008-11-01

    Killer cell immunoglobulin-like receptors (KIR) gene frequencies have been shown to be distinctly different between populations and contribute to functional variation in the immune response. We have investigated KIR gene frequencies in 370 individuals representing three Asian populations in Singapore and report here the distribution of 14 KIR genes (2DL1, 2DL2, 2DL3, 2DL4, 2DL5, 2DS1, 2DS2, 2DS3, 2DS4, 2DS5, 3DL1, 3DL2, 3DL3, 3DS1) with two pseudogenes (2DP1, 3DP1) among Singapore Chinese (n = 210); Singapore Malay (n = 80), and Singapore Indian (n = 80). Four framework genes (KIR3DL3, 3DP1, 2DL4, 3DL2) and a nonframework pseudogene 2DP1 were detected in all samples while KIR2DS2, 2DL2, 2DL5, and 2DS5 had the greatest significant variation across the three populations. Fifteen significant linkage patterns, consistent with associations between genes of A and B haplotypes, were observed. Eighty-four distinct KIR profiles were determined in our populations, 38 of which had not been described in other populations. KIR haplotype studies were performed using nine Singapore Chinese families comprising 34 individuals. All genotypes could be resolved into corresponding pairs of existing haplotypes with eight distinct KIR genotypes and eight different haplotypes. The haplotype A2 with frequency of 63.9% was dominant in Singapore Chinese, comparable to that reported in Korean and Chinese Han. The A haplotypes predominate in Singapore Chinese, with ratio of A to B haplotypes of approximately 3:1. Comparison with KIR frequencies in other populations showed that Singapore Chinese shared similar distributions with Chinese Han, Japanese, and Korean; Singapore Indian was found to be comparable with North Indian Hindus while Singapore Malay resembled the Thai.

  14. Haplotype analysis of the apolipoprotein A5 gene in obese pediatric patients.

    PubMed

    Horvatovich, Katalin; Bokor, Szilvia; Baráth, Akos; Maász, Anita; Kisfali, Péter; Járomi, Luca; Polgár, Noémi; Tóth, Dénes; Répásy, Judit; Endreffy, Emoke; Molnár, Dénes; Melegh, Béla

    2011-06-01

    Apolipoprotein A5 (APOA5) gene variants have been shown to be associated with elevated TG levels; the T-1131C (rs662799) variant has been reported to confer risk for the metabolic syndrome in adult populations. Little is known about APOA5 variants in the pediatric population, and no such information is available for pediatric obesity. Here we examined four haplotype-tagging polymorphisms (T-1131C, IVS3 + G476A [rs2072560], T1259C [rs2266788] and C56G [rs3135506]) and also studied the frequency of major naturally occurring haplotypes of APOA5 in obese children. The polymorphisms were analyzed in 232 obese children, and in 137 healthy, normal weight controls, using PCR-RFLP methods. In the pediatric patients we could confirm the association, already known from adult subjects, of the -1131C, IVS3 + 476A and 1259C variants with elevated triglyceride concentrations, both in obese patients and in the controls. The prevalence of the APOA5*2 haplotype (containing the minor allele of T-1131C, IVS3 + G476A and T1259C SNPs together) was 15.5% in obese children, and 5.80% in the controls (p<0.001); multiple logistic regression analysis revealed that this haplotype confers susceptibility for development of obesity (OR=2.87; 95% CI: 1.29-6.37; p≤0.01). By contrast, the APOA5*4 haplotype (with -1131C alone) did not show similar associations. Our findings also suggest that the APOA5*5 haplotype (1259C alone) can be protective against obesity (OR=0.25; 95% CI: 0.07-0.80; p<0.05). While previous studies in adults demonstrated that the APOA5 -1131C minor allele confers risk for adult metabolic syndrome, here we show that the susceptibility conferred by this SNP is restricted to the APOA5*2 haplotype in pediatric obese subjects.
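
    The odds ratios and confidence intervals quoted in this record come from multiple logistic regression. As a rough illustration of how such figures are read, a crude (unadjusted) odds ratio with a Woolf-type 95% interval can be computed from a 2×2 carrier table. The sketch below uses hypothetical counts that are not taken from the study, and the function name `odds_ratio_ci` is an illustrative choice.

```python
import math

def odds_ratio_ci(a, b, c, d, z=1.96):
    """Crude odds ratio and Woolf (log-based) confidence interval.

    a: haplotype carriers among cases      b: non-carriers among cases
    c: haplotype carriers among controls   d: non-carriers among controls
    z: normal quantile (1.96 gives an approximate 95% interval)
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cells must be positive for the Woolf interval")
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) under the usual large-sample approximation
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se)
    upper = math.exp(math.log(odds_ratio) + z * se)
    return odds_ratio, lower, upper

# Hypothetical counts for illustration only (not the study's data)
or_, lo, hi = odds_ratio_ci(30, 70, 10, 90)
print(f"OR = {or_:.2f}, 95% CI {lo:.2f}-{hi:.2f}")
```

    An interval that excludes 1, as in this toy table, is what a report of a significant susceptibility (OR > 1) or protective (OR < 1) effect corresponds to. An adjusted estimate from logistic regression will generally differ from the crude value.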

  15. A whole genome SNP genotyping by DNA microarray and candidate gene association study for kidney stone disease

    PubMed Central

    2014-01-01

    Background Kidney stone disease (KSD) is a complex disorder with unknown etiology in the majority of patients. Genetic and environmental factors may cause the disease. In the present study, we used DNA microarray to genotype single nucleotide polymorphisms (SNP) and performed candidate gene association analysis to determine genetic variations associated with the disease. Methods A whole genome SNP genotyping by DNA microarray was initially conducted in 101 patients and 105 control subjects. A set of 104 candidate genes reported to be involved in KSD, gathered from public databases and candidate gene association study databases, were evaluated for their variations associated with KSD. Results Altogether 82 SNPs distributed within 22 candidate gene regions showed significant differences in SNP allele frequencies between the patient and control groups (P < 0.05). Of these, 4 genes including BGLAP, AHSG, CD44, and HAO1, encoding osteocalcin, fetuin-A, CD44-molecule and glycolate oxidase 1, respectively, were further assessed for their associations with the disease because they carried a high proportion of SNPs with statistical differences of allele frequencies between the patient and control groups within the gene. A total of 26 SNPs showed significant differences of allele frequencies between the patient and control groups and haplotypes associated with disease risk were identified. The SNP rs759330, located 144 bp downstream of BGLAP where it is a predicted microRNA binding site at the 3′UTR of PAQR6, a gene encoding progestin and adipoQ receptor family member VI, was genotyped in 216 patients and 216 control subjects and found to have significant differences in its genotype and allele frequencies (P = 0.0007, OR 2.02 and P = 0.0001, OR 2.02, respectively). Conclusions Our results suggest that these candidate genes are associated with KSD and PAQR6 comes into our view as the most potent candidate since associated SNP rs759330 is located in the mi

  16. Linear reduction method for predictive and informative tag SNP selection.

    PubMed

    He, Jingwu; Westbrooks, Kelly; Zelikovsky, Alexander

    2005-01-01

    Constructing a complete human haplotype map is helpful when associating complex diseases with their related SNPs. Unfortunately, the number of SNPs is very large and it is costly to sequence many individuals. Therefore, it is desirable to reduce the number of SNPs that should be sequenced to a small number of informative representatives called tag SNPs. In this paper, we propose a new linear algebra-based method for selecting and using tag SNPs. We measure the quality of our tag SNP selection algorithm by comparing actual SNPs with SNPs predicted from selected linearly independent tag SNPs. Our experiments show that for sufficiently long haplotypes, knowing only 0.4% of all SNPs the proposed linear reduction method predicts an unknown haplotype with the error rate below 2% based on 10% of the population.
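
    The idea of choosing linearly independent SNPs as tags and predicting the remaining SNPs from them can be sketched in a few lines. The following is a simplified illustration and not the authors' algorithm: it assumes haplotypes coded as 0/1 rows, picks tag columns greedily from left to right using exact rational arithmetic, and rounds each linear prediction at 0.5. All function names and the toy data are hypothetical.

```python
from fractions import Fraction

def select_tag_snps(haplotypes):
    """Indices of a maximal set of linearly independent SNP columns."""
    basis, tags = [], []          # basis holds (pivot_row, reduced_column)
    for j in range(len(haplotypes[0])):
        col = [Fraction(row[j]) for row in haplotypes]
        for pivot, vec in basis:  # eliminate components along earlier tags
            if col[pivot] != 0:
                f = col[pivot] / vec[pivot]
                col = [c - f * v for c, v in zip(col, vec)]
        pivot = next((i for i, c in enumerate(col) if c != 0), None)
        if pivot is not None:     # column adds a new direction: keep as tag
            basis.append((pivot, col))
            tags.append(j)
    return tags

def solve_weights(haplotypes, tags, target):
    """Weights w with sum_k w[k] * column(tags[k]) == column(target)."""
    rows = [[Fraction(h[t]) for t in tags] + [Fraction(h[target])]
            for h in haplotypes]
    k, r = len(tags), 0
    for c in range(k):            # Gauss-Jordan elimination
        p = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if p is None:
            raise ValueError("tag columns are not linearly independent")
        rows[r], rows[p] = rows[p], rows[r]
        rows[r] = [x / rows[r][c] for x in rows[r]]
        for i in range(len(rows)):
            if i != r and rows[i][c] != 0:
                f = rows[i][c]
                rows[i] = [x - f * y for x, y in zip(rows[i], rows[r])]
        r += 1
    return [rows[i][k] for i in range(k)]

def predict_haplotype(haplotypes, tags, tag_values):
    """Fill in non-tag SNPs of a new haplotype from its tag SNP values."""
    predicted = [None] * len(haplotypes[0])
    for t, v in zip(tags, tag_values):
        predicted[t] = v
    for j in range(len(predicted)):
        if predicted[j] is None:
            w = solve_weights(haplotypes, tags, j)
            estimate = sum(wk * v for wk, v in zip(w, tag_values))
            predicted[j] = 1 if estimate >= Fraction(1, 2) else 0
    return predicted

# Toy training haplotypes (hypothetical): SNP2 copies SNP0, SNP3 = SNP0 + SNP1
train = [[1, 0, 1, 1], [0, 1, 0, 1], [0, 0, 0, 0], [1, 0, 1, 1]]
tags = select_tag_snps(train)
print(tags, predict_haplotype(train, tags, [0, 1]))
```

    In this toy data only two of the four SNPs need to be typed. Real haplotype panels are noisy, so a practical method has to tolerate approximate rather than exact linear dependence.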

  17. Better ILP models for haplotype assembly.

    PubMed

    Etemadi, Maryam; Bagherian, Mehri; Chen, Zhi-Zhong; Wang, Lusheng

    2018-02-19

    The haplotype assembly problem for diploid is to find a pair of haplotypes from a given set of aligned Single Nucleotide Polymorphism (SNP) fragments (reads). It has many applications in association studies, drug design, and genetic research. Since this problem is computationally hard, both heuristic and exact algorithms have been designed for it. Although exact algorithms are much slower, they are still of great interest because they usually output significantly better solutions than heuristic algorithms in terms of popular measures such as the Minimum Error Correction (MEC) score, the number of switch errors, and the QAN50 score. Exact algorithms are also valuable because they can be used to witness how good a heuristic algorithm is. The best known exact algorithm is based on integer linear programming (ILP) and it is known that ILP can also be used to improve the output quality of every heuristic algorithm with a little decline in speed. Therefore, faster ILP models for the problem are highly demanded. As in previous studies, we consider not only the general case of the problem but also its all-heterozygous case where we assume that if a column of the input read matrix contains at least one 0 and one 1, then it corresponds to a heterozygous SNP site. For both cases, we design new ILP models for the haplotype assembly problem which aim at minimizing the MEC score. The new models are theoretically better because they contain significantly fewer constraints. More importantly, our experimental results show that for both simulated and real datasets, the new model for the all-heterozygous (respectively, general) case can usually be solved via CPLEX (an ILP solver) at least 5 times (respectively, twice) faster than the previous bests. Indeed, the running time can sometimes be 41 times better. This paper proposes a new ILP model for the haplotype assembly problem and its all-heterozygous case, respectively. 
Experiments with both real and simulated datasets show that the
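
    The Minimum Error Correction (MEC) score that these ILP models minimize is straightforward to compute for a given haplotype pair. The sketch below assumes reads encoded as strings over 0, 1 and '-' (site not covered); the encoding and function names are illustrative assumptions. For the all-heterozygous case it also includes an exhaustive search over complementary haplotype pairs, which stands in for an exact solver on tiny inputs only and is not the ILP formulation described in the record.

```python
from itertools import product

def mec_score(reads, h1, h2):
    """MEC score: each read is charged the mismatches to its closer haplotype.

    reads: strings over '0', '1', '-' ('-' means the SNP site is not covered)
    h1, h2: candidate haplotypes as strings over '0', '1'
    """
    def mismatches(read, hap):
        return sum(1 for r, h in zip(read, hap) if r != '-' and r != h)
    return sum(min(mismatches(r, h1), mismatches(r, h2)) for r in reads)

def complement(hap):
    """Flip every allele; in the all-heterozygous case h2 = complement(h1)."""
    return ''.join('1' if x == '0' else '0' for x in hap)

def best_all_heterozygous(reads):
    """Exhaustive exact search (exponential in the number of SNP sites)."""
    best = None
    for bits in product('01', repeat=len(reads[0])):
        h1 = ''.join(bits)
        score = mec_score(reads, h1, complement(h1))
        if best is None or score < best[0]:
            best = (score, h1)
    return best

# Toy read matrix (hypothetical): the last read conflicts with the first
reads = ["010-", "01-1", "101-", "-010", "0111"]
score, h1 = best_all_heterozygous(reads)
print(score, h1, complement(h1))
```

    The exhaustive search doubles in cost with every additional SNP site, which is why exact methods for realistic inputs rely on formulations such as ILP.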

  18. Polymorphism Located between CPT1B and CHKB, and HLA-DRB1*1501-DQB1*0602 Haplotype Confer Susceptibility to CNS Hypersomnias (Essential Hypersomnia)

    PubMed Central

    Miyagawa, Taku; Honda, Makoto; Kawashima, Minae; Shimada, Mihoko; Tanaka, Susumu; Honda, Yutaka; Tokunaga, Katsushi

    2009-01-01

    Background SNP rs5770917 located between CPT1B and CHKB, and HLA-DRB1*1501-DQB1*0602 haplotype were previously identified as susceptibility loci for narcolepsy with cataplexy. This study was conducted in order to investigate whether these genetic markers are associated with Japanese CNS hypersomnias (essential hypersomnia: EHS) other than narcolepsy with cataplexy. Principal Findings EHS was significantly associated with SNP rs5770917 (Pallele = 3.6×10−3; OR = 1.56; 95% c.i.: 1.12–2.15) and HLA-DRB1*1501-DQB1*0602 haplotype (P positivity = 9.2×10−11; OR = 3.97; 95% c.i.: 2.55–6.19). No interaction between the two markers (SNP rs5770917 and HLA-DRB1*1501-DQB1*0602 haplotype) was observed in EHS. Conclusion CPT1B, CHKB and HLA are candidates for susceptibility to CNS hypersomnias (EHS), as well as narcolepsy with cataplexy. PMID:19404393

  19. Genome Patterns of Selection and Introgression of Haplotypes in Natural Populations of the House Mouse (Mus musculus)

    PubMed Central

    Staubach, Fabian; Lorenc, Anna; Messer, Philipp W.; Tang, Kun; Petrov, Dmitri A.; Tautz, Diethard

    2012-01-01

    General parameters of selection, such as the frequency and strength of positive selection in natural populations or the role of introgression, are still insufficiently understood. The house mouse (Mus musculus) is a particularly well-suited model system to approach such questions, since it has a defined history of splits into subspecies and populations and since extensive genome information is available. We have used high-density single-nucleotide polymorphism (SNP) typing arrays to assess genomic patterns of positive selection and introgression of alleles in two natural populations of each of the subspecies M. m. domesticus and M. m. musculus. Applying different statistical procedures, we find a large number of regions subject to apparent selective sweeps, indicating frequent positive selection on rare alleles or novel mutations. Genes in the regions include well-studied imprinted loci (e.g. Plagl1/Zac1), homologues of human genes involved in adaptations (e.g. alpha-amylase genes) or in genetic diseases (e.g. Huntingtin and Parkin). Haplotype matching between the two subspecies reveals a large number of haplotypes that show patterns of introgression from specific populations of the respective other subspecies, with at least 10% of the genome being affected by partial or full introgression. Using neutral simulations for comparison, we find that the size and the fraction of introgressed haplotypes are not compatible with a pure migration or incomplete lineage sorting model. Hence, it appears that introgressed haplotypes can rise in frequency due to positive selection and thus can contribute to the adaptive genomic landscape of natural populations. Our data support the notion that natural genomes are subject to complex adaptive processes, including the introgression of haplotypes from other differentiated populations or species at a larger scale than previously assumed for animals. 
This implies that some of the admixture found in inbred strains of mice may also have

  20. Association between endothelin type A receptor haplotypes and mortality in coronary heart disease.

    PubMed

    Ellis, Katrina L; Pilbrow, Anna P; Potter, Howard C; Frampton, Chris M; Doughty, Rob N; Whalley, Gillian A; Ellis, Chris J; Palmer, Barry R; Skelton, Lorraine; Yandle, Tim G; Troughton, Richard W; Richards, A Mark; Cameron, Vicky A

    2012-05-01

    The endothelin type A receptor, encoded by EDNRA, mediates the effects of endothelin-1 to promote vasoconstriction, vascular cell growth, adhesion, fibrosis and thrombosis. We investigated the association between EDNRA haplotype and cardiovascular outcomes in patients with coronary artery disease. Coronary disease patients (n = 1007) were genotyped for the His323His (rs5333) variant and one tag SNP from each of the major EDNRA haplotype blocks (rs6537484, rs1568136, rs5335 and rs10003447). EDNRA haplotype associations with clinical history, natriuretic peptides, cardiac function and cardiovascular outcomes were tested over a median 3.8 years. Univariate analysis identified a 'low-risk' EDNRA haplotype associated with later age of Type 2 diabetes onset (p = 0.004), smaller BMI (p = 0.021), and reduced mortality (log rank p = 0.001). Cox proportional hazards analysis including established cardiovascular risk factors revealed an independent association between haplotype and mortality (p < 0.0001). These data highlight the potential importance of the endothelin system, and in particular EDNRA, in coronary disease.

  1. Bovine exome sequence analysis and targeted SNP genotyping of recessive fertility defects BH1, HH2, and HH3 reveal a putative causative mutation in SMC2 for HH3.

    PubMed

    McClure, Matthew C; Bickhart, Derek; Null, Dan; Vanraden, Paul; Xu, Lingyang; Wiggans, George; Liu, George; Schroeder, Steve; Glasscock, Jarret; Armstrong, Jon; Cole, John B; Van Tassell, Curtis P; Sonstegard, Tad S

    2014-01-01

    The recent discovery of bovine haplotypes with negative effects on fertility in the Brown Swiss, Holstein, and Jersey breeds has allowed producers to identify carrier animals using commercial single nucleotide polymorphism (SNP) genotyping assays. This study was devised to identify the causative mutations underlying defective bovine embryo development contained within three of these haplotypes (Brown Swiss haplotype 1 and Holstein haplotypes 2 and 3) by combining exome capture with next generation sequencing. Of the 68,476,640 sequence variations (SV) identified, only 1,311 genome-wide SNP were concordant with the haplotype status of 21 sequenced carriers. Validation genotyping of 36 candidate SNP identified only 1 variant that was concordant to Holstein haplotype 3 (HH3), while no variants located within the refined intervals for HH2 or BH1 were concordant. The variant strictly associated with HH3 is a non-synonymous SNP (T/C) within exon 24 of the Structural Maintenance of Chromosomes 2 (SMC2) on Chromosome 8 at position 95,410,507 (UMD3.1). This polymorphism changes amino acid 1135 from phenylalanine to serine and causes a non-neutral, non-tolerated, and evolutionarily unlikely substitution within the NTPase domain of the encoded protein. Because only exome capture sequencing was used, we could not rule out the possibility that the true causative mutation for HH3 might lie in a non-exonic genomic location. Given the essential role of SMC2 in DNA repair, chromosome condensation and segregation during cell division, our findings strongly support the non-synonymous SNP (T/C) in SMC2 as the likely causative mutation. The absence of concordant variations for HH2 or BH1 suggests either the underlying causative mutations lie within a non-exomic region or in exome regions not covered by the capture array.

  2. Bovine Exome Sequence Analysis and Targeted SNP Genotyping of Recessive Fertility Defects BH1, HH2, and HH3 Reveal a Putative Causative Mutation in SMC2 for HH3

    PubMed Central

    McClure, Matthew C.; Bickhart, Derek; Null, Dan; VanRaden, Paul; Xu, Lingyang; Wiggans, George; Liu, George; Schroeder, Steve; Glasscock, Jarret; Armstrong, Jon; Cole, John B.; Van Tassell, Curtis P.; Sonstegard, Tad S.

    2014-01-01

    The recent discovery of bovine haplotypes with negative effects on fertility in the Brown Swiss, Holstein, and Jersey breeds has allowed producers to identify carrier animals using commercial single nucleotide polymorphism (SNP) genotyping assays. This study was devised to identify the causative mutations underlying defective bovine embryo development contained within three of these haplotypes (Brown Swiss haplotype 1 and Holstein haplotypes 2 and 3) by combining exome capture with next generation sequencing. Of the 68,476,640 sequence variations (SV) identified, only 1,311 genome-wide SNP were concordant with the haplotype status of 21 sequenced carriers. Validation genotyping of 36 candidate SNP identified only 1 variant that was concordant to Holstein haplotype 3 (HH3), while no variants located within the refined intervals for HH2 or BH1 were concordant. The variant strictly associated with HH3 is a non-synonymous SNP (T/C) within exon 24 of the Structural Maintenance of Chromosomes 2 (SMC2) on Chromosome 8 at position 95,410,507 (UMD3.1). This polymorphism changes amino acid 1135 from phenylalanine to serine and causes a non-neutral, non-tolerated, and evolutionarily unlikely substitution within the NTPase domain of the encoded protein. Because only exome capture sequencing was used, we could not rule out the possibility that the true causative mutation for HH3 might lie in a non-exonic genomic location. Given the essential role of SMC2 in DNA repair, chromosome condensation and segregation during cell division, our findings strongly support the non-synonymous SNP (T/C) in SMC2 as the likely causative mutation. The absence of concordant variations for HH2 or BH1 suggests either the underlying causative mutations lie within a non-exomic region or in exome regions not covered by the capture array. PMID:24667746

  3. Frequency distribution of interleukin-10 haplotypes (-1082 A>G, -819 C>T, and -592 C>A) in a Mexican population.

    PubMed

    Vázquez-Villamar, M; Palafox-Sánchez, C A; Hernández-Bello, J; Muñoz-Valle, J F; Valle, Y; Cruz, A; Alatorre-Meza, A I; Oregon-Romero, E

    2016-11-03

    Interleukin 10 (IL-10) is an immunoregulatory cytokine with multiple roles in the immune system. Three single nucleotide polymorphisms at positions -1082 (A>G), -819 (C>T), and -592 (C>A) in the promoter region of the IL10 gene are believed to be associated with different inflammatory, infectious, and autoimmune diseases. These polymorphisms exhibit a strong linkage disequilibrium (LD) and form three principal haplotypes (GCC, ACC, and ATA). The GCC and ATA haplotypes have been associated with high and low levels of IL-10 production, respectively. The aim of this study was to establish the allele and haplotype frequencies of the IL10 polymorphisms in Mestizos from western Mexico. SNPs were analyzed in 340 healthy unrelated Mestizos from western Mexico by polymerase chain reaction-restriction fragment length polymorphism. The studied population presented significant differences, in the distribution of IL10 polymorphisms, from the Asian, African, and European populations. We also observed a strong LD within -1082 A>G, -819 C>T, and -592 C>A (100%; pc = 7.735 × 10−18). The haplotypes ACC (45.4%), ATA (22.0%), GTA (14.9%), and GCC (13.9%) were most frequently observed in this population. The haplotype frequencies, however, differed from those reported previously in Mestizos from central Mexico, Asians, Africans, and European Caucasians, suggesting a differential gene flow in the Mexican Mestizo population. This could account for the genetic variability between Mexicans and populations of other ethnicities. The study of these polymorphisms and their haplotypes could help in expanding our knowledge to design future disease-risk studies on the western Mexican population.
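
    Linkage disequilibrium between two biallelic SNPs, such as a pair of the promoter sites above, is commonly summarized by D, D′ and r². A minimal sketch of those standard definitions follows, computed from phased two-locus haplotype counts. The counts are hypothetical and are not derived from the frequencies reported in this record; the function name is an illustrative choice.

```python
def ld_from_counts(n_AB, n_Ab, n_aB, n_ab):
    """D, D' and r^2 for two biallelic loci from phased haplotype counts.

    n_AB: haplotypes carrying allele A at locus 1 and allele B at locus 2,
    and likewise for the other three allele combinations.
    """
    n = n_AB + n_Ab + n_aB + n_ab
    p_A = (n_AB + n_Ab) / n
    p_B = (n_AB + n_aB) / n
    if not (0 < p_A < 1 and 0 < p_B < 1):
        raise ValueError("both loci must be polymorphic")
    # D: excess of the AB haplotype over its expectation under independence
    d = n_AB / n - p_A * p_B
    # D' rescales D by the largest value it could take given allele frequencies
    if d >= 0:
        d_max = min(p_A * (1 - p_B), (1 - p_A) * p_B)
    else:
        d_max = min(p_A * p_B, (1 - p_A) * (1 - p_B))
    d_prime = abs(d) / d_max
    r2 = d * d / (p_A * (1 - p_A) * p_B * (1 - p_B))
    return d, d_prime, r2

# Hypothetical counts: one of the four haplotypes is absent, so D' is 1
d, d_prime, r2 = ld_from_counts(50, 0, 10, 40)
print(round(d, 3), round(d_prime, 3), round(r2, 3))
```

    Complete LD in the D′ sense (one haplotype missing) does not imply r² = 1. The toy counts give D′ = 1 but r² of about 0.67, which is one reason haplotype frequencies are reported alongside LD statistics.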

  4. MHC Class II haplotypes of Colombian Amerindian tribes

    PubMed Central

    Yunis, Juan J.; Yunis, Edmond J.; Yunis, Emilio

    2013-01-01

    We analyzed MHC class II haplotypes (HLA-DRB1, DQA1, DQB1) in 1041 individuals belonging to 17 Amerindian tribes of Colombia: Chimila, Bari and Tunebo (Chibcha linguistic family); Embera and Waunana (Choco linguistic family); Puinave and Nukak (Maku-Puinave linguistic families); Cubeo, Guanano, Tucano, Desano and Piratapuyo (Tukano linguistic family); Guahibo and Guayabero (Guayabero linguistic family); Curripaco and Piapoco (Arawak linguistic family); and Yucpa (Karib linguistic family). Approximately 90% of the MHC class II haplotypes found among these tribes are haplotypes frequently encountered in other Amerindian tribes. Nonetheless, striking differences were observed among Chibcha and non-Chibcha speaking tribes. The DRB1*04:04, DRB1*04:11, DRB1*09:01 carrying haplotypes were frequently found among non-Chibcha speaking tribes, while the DRB1*04:07 haplotype showed significant frequencies among Chibcha speaking tribes, and only marginal frequencies among non-Chibcha speaking tribes. Our results suggest that the differences in MHC class II haplotype frequency found among Chibcha and non-Chibcha speaking tribes could be due to genetic differentiation in Mesoamerica of the ancestral Amerindian population into Chibcha and non-Chibcha speaking populations before they entered into South America. PMID:23885196

  5. High-resolution HLA allele and haplotype frequencies in majority and minority populations of Costa Rica and Nicaragua: Differential admixture proportions in neighboring countries.

    PubMed

    Arrieta-Bolaños, E; Madrigal-Sánchez, J J; Stein, J E; Órlich-Pérez, P; Moreira-Espinoza, M J; Paredes-Carias, E; Vanegas-Padilla, Y; Salazar-Sánchez, L; Madrigal, J A; Marsh, S G E; Shaw, B E

    2018-06-01

    The HLA system shows the most extensive polymorphism in the human genome. Allelic and haplotypic frequencies of HLA genes vary dramatically across human populations. Due to a complex history of migration, populations in Latin America show a broad variety of admixture proportions, usually varying not only between countries, but also within countries. Knowledge of HLA allele and haplotype frequencies is essential for medical fields such as transplantation, but also serves as a means to assess genetic diversity and ancestry in human populations. Here, we have determined high-resolution HLA-A, -B, -C, and -DRB1 allele and haplotype frequencies in a sample of 713 healthy subjects from three Mestizo populations, one population of African descent, and Amerindians of five different groups from Costa Rica and Nicaragua and compared their profiles to a large set of indigenous populations from Iberia, Sub-Saharan Africa, and the Americas. Our results show a great degree of allelic and haplotypic diversity within and across these populations, with most extended haplotypes being private. Mestizo populations show alleles and haplotypes of putative European, Amerindian, and Sub-Saharan African origin, albeit with differential proportions. Despite some degree of gene flow, Amerindians and Afro-descendants show great similarity to other Amerindian and West African populations, respectively. This is the first comprehensive study reporting high-resolution HLA diversity in Central America, and its results will shed light into the genetic history of this region while also supporting the development of medical programs for organ and stem cell transplantation. © 2018 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  6. HLA-A, HLA-B and HLA-DRB1 allele and haplotype frequencies of 10 918 Koreans from bone marrow donor registry in Korea.

    PubMed

    Park, H; Lee, Y-J; Song, E Y; Park, M H

    2016-10-01

    The human leucocyte antigen (HLA) system is the most polymorphic genetic system in humans, and HLA matching is crucial in organ transplantation, especially in hematopoietic stem cell transplantation. We investigated HLA-A, HLA-B and HLA-DRB1 allele and haplotype frequencies at allelic level in 10 918 Koreans from the bone marrow donor registry in Korea. Intermediate resolution HLA typing was performed using Luminex technology (Wakunaga, Japan), and additional allelic level typing was performed using PCR-single-strand conformation polymorphism method and/or sequence-based typing (Abbott Molecular, USA). Allele and haplotype frequencies were calculated by direct counting and maximum likelihood methods, respectively. A total of 39 HLA-A, 66 HLA-B and 47 HLA-DRB1 alleles were identified. High-frequency alleles found at a frequency of ≥5% were 6 HLA-A (A*02:01, *02:06, *11:01, *24:02, *31:01 and *33:03), 6 HLA-B (B*15:01, *35:01, *44:03, *51:01, *54:01 and *58:01) and 8 HLA-DRB1 (DRB1*01:01, *04:05, *04:06, *07:01, *08:03, *09:01, *13:02 and *15:01) alleles. At each locus, A*02, B*15 and DRB1*14 generic groups were most diverse at allelic level, consisting of 9, 12 and 11 different alleles, respectively. A total of 366, 197 and 21 different HLA-A-B-DRB1 haplotypes were estimated with frequencies of ≥0.05%, ≥0.1% and ≥0.5%, respectively. The five most common haplotypes with frequencies of ≥2.0% were A*33:03-B*44:03-DRB1*13:02 (4.97%), A*33:03-B*58:01-DRB1*13:02, A*33:03-B*44:03-DRB1*07:01, A*24:02-B*07:02-DRB1*01:01 and A*24:02-B*52:01-DRB1*15:02. Among 34 serologic HLA-A-B-DR haplotypes with frequencies of ≥0.5%, 17 haplotypes revealed allele-level diversity and the majority of the allelic variation arose from A2, A26, B61, B62, DR4 and DR14 specificities. Haplotype diversity obtained in this study is the most comprehensive data thus far reported in Koreans, and the information will be useful for unrelated stem cell transplantation as well as for disease
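
    The two estimation steps named in this record, direct counting for allele frequencies and maximum likelihood for haplotype frequencies, can be illustrated on a toy two-locus example. The sketch below uses an expectation-maximization (EM) loop, a common route to maximum likelihood haplotype frequencies from unphased genotypes. Whether the study used EM is not stated in the record, so that is an assumption, and the allele labels and genotypes are hypothetical.

```python
from collections import Counter

def allele_frequencies(genotypes):
    """Direct counting: every individual contributes two alleles per locus."""
    counts = Counter(allele for g in genotypes for allele in g)
    total = sum(counts.values())
    return {allele: c / total for allele, c in counts.items()}

def em_haplotype_frequencies(samples, n_iter=50):
    """EM estimate of two-locus haplotype frequencies from unphased data.

    samples: list of ((a1, a2), (b1, b2)) genotypes at locus A and locus B.
    """
    def resolutions(sample):
        # The (at most two) distinct ways to phase a two-locus genotype
        (a1, a2), (b1, b2) = sample
        return sorted({tuple(sorted([(a1, b1), (a2, b2)])),
                       tuple(sorted([(a1, b2), (a2, b1)]))})

    haps = sorted({h for s in samples for pair in resolutions(s) for h in pair})
    freq = {h: 1 / len(haps) for h in haps}      # uniform starting point
    for _ in range(n_iter):
        expected = Counter()
        for s in samples:
            pairs = resolutions(s)
            # E-step: weight each phasing by the product of current frequencies
            weights = [freq[h1] * freq[h2] for h1, h2 in pairs]
            total = sum(weights)
            for (h1, h2), w in zip(pairs, weights):
                expected[h1] += w / total
                expected[h2] += w / total
        # M-step: expected haplotype counts divided by 2N chromosomes
        freq = {h: expected[h] / (2 * len(samples)) for h in haps}
    return freq

# Hypothetical genotypes: five double homozygotes and one double heterozygote
samples = ([(("A1", "A1"), ("B1", "B1"))] * 3
           + [(("A2", "A2"), ("B2", "B2"))] * 2
           + [(("A1", "A2"), ("B1", "B2"))])
allele_freq_a = allele_frequencies([s[0] for s in samples])
hap_freq = em_haplotype_frequencies(samples)
print(allele_freq_a, hap_freq)
```

    In this toy data the single ambiguous individual is resolved almost entirely towards the haplotypes already seen in the homozygotes. That is the behaviour expected of a likelihood-based estimate when some haplotypes are much more common than others.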

  7. Factor IX gene haplotypes in Amerindians.

    PubMed

    Franco, R F; Araújo, A G; Zago, M A; Guerreiro, J F; Figueiredo, M S

    1997-02-01

    We have determined the haplotypes of the factor IX gene for 95 Indians from 5 Brazilian Amazon tribes: Wayampí, Wayana-Apalaí, Kayapó, Arára, and Yanomámi. Eight polymorphisms linked to the factor IX gene were investigated: MseI (at 5', nt -698), BamHI (at 5', nt -561), DdeI (intron 1), BamHI (intron 2), XmnI (intron 3), TaqI (intron 4), MspI (intron 4), and HhaI (at 3', approximately 8 kb). The results of the haplotype distribution and the allele frequencies for each of the factor IX gene polymorphisms in Amerindians were similar to the results reported for Asian populations but differed from results for other ethnic groups. Only five haplotypes were identified within the entire Amerindian study population, and the haplotype distribution was significantly different among the five tribes, with one (Arára) to four (Wayampí) haplotypes being found per tribe. These findings indicate a significant heterogeneity among the Indian tribes and contrast with the homogeneous distribution of the beta-globin gene cluster haplotypes but agree with our recent findings on the distribution of alpha-globin gene cluster haplotypes and the allele frequencies for six VNTRs in the same Amerindian tribes. Our data represent the first study of factor IX-associated polymorphisms in Amerindian populations and emphasize the applicability of these genetic markers for population and human evolution studies.

  8. Multi-allelic haplotype model based on genetic partition for genomic prediction and variance component estimation using SNP markers.

    PubMed

    Da, Yang

    2015-12-18

    The amount of functional genomic information has been growing rapidly but remains largely unused in genomic selection. Genomic prediction and estimation using haplotypes in genome regions with functional elements such as all genes of the genome can be an approach to integrate functional and structural genomic information for genomic selection. Towards this goal, this article develops a new haplotype approach for genomic prediction and estimation. A multi-allelic haplotype model treating each haplotype as an 'allele' was developed for genomic prediction and estimation based on the partition of a multi-allelic genotypic value into additive and dominance values. Each additive value is expressed as a function of h - 1 additive effects, where h = number of alleles or haplotypes, and each dominance value is expressed as a function of h(h - 1)/2 dominance effects. For a sample of q individuals, the limit number of effects is 2q - 1 for additive effects and is the number of heterozygous genotypes for dominance effects. Additive values are factorized as a product between the additive model matrix and the h - 1 additive effects, and dominance values are factorized as a product between the dominance model matrix and the h(h - 1)/2 dominance effects. Genomic additive relationship matrix is defined as a function of the haplotype model matrix for additive effects, and genomic dominance relationship matrix is defined as a function of the haplotype model matrix for dominance effects. Based on these results, a mixed model implementation for genomic prediction and variance component estimation that jointly use haplotypes and single markers is established, including two computing strategies for genomic prediction and variance component estimation with identical results. The multi-allelic genetic partition fills a theoretical gap in genetic partition by providing general formulations for partitioning multi-allelic genotypic values and provides a haplotype

  9. Sequential sentinel SNP Regional Association Plots (SSS-RAP): an approach for testing independence of SNP association signals using meta-analysis data.

    PubMed

    Zheng, Jie; Gaunt, Tom R; Day, Ian N M

    2013-01-01

    Genome-Wide Association Studies (GWAS) frequently incorporate meta-analysis within their framework. However, conditional analysis of individual-level data, which is an established approach for fine mapping of causal sites, is often precluded where only group-level summary data are available for analysis. Here, we present a numerical and graphical approach, "sequential sentinel SNP regional association plot" (SSS-RAP), which estimates regression coefficients (beta) with their standard errors using the meta-analysis summary results directly. Under an additive model, typical for genes with small effect, the effect for a sentinel SNP can be transformed to the predicted effect for a possibly dependent SNP through a 2×2 2-SNP haplotypes table. The approach assumes Hardy-Weinberg equilibrium for test SNPs. SSS-RAP is available as a Web-tool (http://apps.biocompute.org.uk/sssrap/sssrap.cgi). To develop and illustrate SSS-RAP we analyzed lipid and ECG traits data from the British Women's Heart and Health Study (BWHHS), evaluated a meta-analysis for ECG trait and presented several simulations. We compared results with existing approaches such as model selection methods and conditional analysis. Generally findings were consistent. SSS-RAP represents a tool for testing independence of SNP association signals using meta-analysis data, and is also a convenient approach based on biological principles for fine mapping in group level summary data. © 2012 Blackwell Publishing Ltd/University College London.
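
    Because the approach assumes Hardy-Weinberg equilibrium for the test SNPs, a quick check of that assumption is a natural companion. The following is a generic sketch of the textbook chi-square goodness-of-fit test for a biallelic SNP, not part of SSS-RAP itself. The genotype counts are hypothetical and the function name is an illustrative choice.

```python
def hwe_chi_square(n_AA, n_Aa, n_aa):
    """Chi-square goodness-of-fit statistic for Hardy-Weinberg proportions.

    Returns the allele A frequency, the expected genotype counts
    (AA, Aa, aa) and the statistic, which has 1 degree of freedom.
    """
    n = n_AA + n_Aa + n_aa
    p = (2 * n_AA + n_Aa) / (2 * n)   # allele frequency by direct counting
    q = 1 - p
    if p == 0 or q == 0:
        raise ValueError("SNP is monomorphic; the test is undefined")
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_AA, n_Aa, n_aa)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    return p, expected, chi2

# Hypothetical genotype counts that match HWE proportions exactly
p, expected, chi2 = hwe_chi_square(36, 48, 16)
# Hypothetical counts with no heterozygotes at all, a gross departure
_, _, chi2_extreme = hwe_chi_square(50, 0, 50)
print(p, expected, chi2, chi2_extreme)
```

    A statistic above 3.84 rejects equilibrium at the 5% level for 1 degree of freedom. For small samples or rare alleles an exact test is usually preferred over this approximation.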

  10. Application of a posteriori granddaughter and modified granddaughter designs to determine Holstein haplotype effects

    USDA-ARS's Scientific Manuscript database

    A posteriori and modified granddaughter designs were applied to determine haplotype effects for Holstein bulls and cows with BovineSNP50 genotypes. The a posteriori granddaughter design was applied to 52 sire families, each with >100 genotyped sons with genetic evaluations based on progeny tests. Fo...

  11. Application of a posteriori granddaughter and modified granddaughter designs to determine Holstein haplotype effects

    USDA-ARS's Scientific Manuscript database

    A posteriori and modified granddaughter designs were applied to determine haplotype effects for Holstein bulls and cows with BovineSNP50 genotypes. The a posteriori granddaughter design was applied to 52 sire families, each with >100 genotyped sons with genetic evaluations based on progeny tests. Fo...

  12. Haplotypes in SLC24A5 Gene as Ancestry Informative Markers in Different Populations

    PubMed Central

    Giardina, Emiliano; Pietrangeli, Ilenia; Martínez-Labarga, Cristina; Martone, Claudia; de Angelis, Flavio; Spinella, Aldo; De Stefano, Gianfranco; Rickards, Olga; Novelli, Giuseppe

    2008-01-01

    Ancestry informative markers (AIMs) are human polymorphisms that exhibit substantial allele frequency differences among populations. These markers can provide information about the ancestry of samples, which may be useful in predicting a perpetrator’s ethnic origin to aid criminal investigations. Variations in human pigmentation are the most obvious phenotypes to distinguish individuals. It has been recently shown that the variation of a G in an A allele of the coding single-nucleotide polymorphism (SNP) rs1426654 within SLC24A5 gene varies in frequency among several population samples according to skin pigmentation. Because of these observations, the SLC24A5 locus has been evaluated as Ancestry Informative Region (AIR) by typing rs1426654 together with two additional intragenic markers (rs2555364 and rs16960620) in 471 unrelated individuals originating from three different continents (Africa, Asia and Europe). This study further supports the role of human SLC24A5 gene in skin pigmentation suggesting that variations in SLC24A5 haplotypes can correlate with human migration and ancestry. Furthermore, our data do reveal the utility of haplotype and combined unphased genotype analysis of SLC24A5 in predicting ancestry and provide a good example of the usefulness of genetic characterization of larger regions, in addition to single polymorphisms, as candidates for population-specific sweeps in the ancestral population. PMID:19440451

  13. Lack of Association of Bone Morphogenetic Protein 2 Gene Haplotypes with Bone Mineral Density, Bone Loss, or Risk of Fractures in Men

    PubMed Central

    Varanasi, Satya S.; Tuck, Stephen P.; Mastana, Sarabjit S.; Dennison, Elaine; Cooper, Cyrus; Vila, Josephine; Francis, Roger M.; Datta, Harish K.

    2011-01-01

    Introduction. The association of bone morphogenetic protein 2 (BMP2) with BMD and risk of fracture was suggested by a recent linkage study, but subsequent studies have been contradictory. We report the results of a study of the relationship between BMP2 genotypes and BMD, annual change in BMD, and risk of fracture in male subjects. Materials and Methods. We tested three single-nucleotide polymorphisms (SNPs) across the BMP2 gene, including Ser37Ala SNP, in 342 Caucasian Englishmen, comprising 224 control and 118 osteoporotic subjects. Results. BMP2 SNP1 (Ser37Ala) genotypes were found to have similar low frequency in control subjects and men with osteoporosis. The major informative polymorphism, BMP2 SNP3 (Arg190Ser), showed no statistically significant association with weight, height, BMD, change in BMD at hip or lumbar spine, and risk of fracture. Conclusion. There were no genotypic or haplotypic effects of the BMP2 candidate gene on BMD, change in BMD, or fracture risk identified in this cohort. PMID:22013543

  14. Novel quantitative real-time LCR for the sensitive detection of SNP frequencies in pooled DNA: method development, evaluation and application.

    PubMed

    Psifidi, Androniki; Dovas, Chrysostomos; Banos, Georgios

    2011-01-19

    Single nucleotide polymorphisms (SNP) have proven to be powerful genetic markers for genetic applications in medicine, life science and agriculture. A variety of methods exist for SNP detection but few can quantify SNP frequencies when the mutated DNA molecules correspond to a small fraction of the wild-type DNA. Furthermore, there is no generally accepted gold standard for SNP quantification, and, in general, currently applied methods give inconsistent results in selected cohorts. In the present study we sought to develop a novel method for accurate detection and quantification of SNP in DNA pooled samples. The development and evaluation of a novel Ligase Chain Reaction (LCR) protocol that uses a DNA-specific fluorescent dye to allow quantitative real-time analysis is described. Different reaction components and thermocycling parameters affecting the efficiency and specificity of LCR were examined. Several protocols, including gap-LCR modifications, were evaluated using plasmid standard and genomic DNA pools. A protocol of choice was identified and applied for the quantification of a polymorphism at codon 136 of the ovine PRNP gene that is associated with susceptibility to a transmissible spongiform encephalopathy in sheep. The real-time LCR protocol developed in the present study showed high sensitivity, accuracy, reproducibility and a wide dynamic range of SNP quantification in different DNA pools. The limits of detection and quantification of SNP frequencies were 0.085% and 0.35%, respectively. The proposed real-time LCR protocol is applicable when sensitive detection and accurate quantification of low copy number mutations in DNA pools is needed. Examples include oncogenes and tumour suppressor genes, infectious diseases, pathogenic bacteria, fungal species, viral mutants, drug resistance resulting from point mutations, and genetically modified organisms in food.

  15. Absence of the tag polymorphism for the risk haplotype HLA-DR2 for multiple sclerosis in Wixárika subjects from Mexico.

    PubMed

    González-Enríquez, G V; Torres-Mendoza, B M; Márquez-Pedroza, J; Macías-Islas, M A; Ortiz, G G; Cruz-Ramos, J A

    2018-02-03

    The HLA-DRB1*15:01 allele has a demonstrated risk for the development of multiple sclerosis (MS) in most populations around the world. The single nucleotide polymorphism (SNP) rs3129934 is found in linkage disequilibrium with the risk haplotype formed by the HLA-DRB1*15:01 and HLA-DQB1*06:02 alleles, and it is considered a reliable marker of the presence of this haplotype. Native Americans have a null or low prevalence of MS. In this study, we sought to identify the frequency of rs3129934 in the Wixárika ethnic group as well as in Mestizo (mixed race) patients with MS and in controls from western Mexico. Through real-time polymerase chain reaction (PCR) using TaqMan probes, we analyzed the allele and genotype frequencies of rs3129934 in Mestizo individuals with and without MS and in 73 Wixárika subjects from the state of Jalisco, Mexico. The Wixárika subjects were homozygous for the C allele of rs3129934. The allele and genotype frequency in Mestizos with MS was similar to that of other MS populations with Caucasian ancestry. The absence of the T risk allele of rs3129934 (associated with the haplotype HLA-DRB1*15:01, HLA-DQB1*06:02) in this sample of Wixárika subjects is consistent with the unreported MS in this Amerindian group, related to the absence of such a paramount genetic risk factor.

  16. A Genome-Wide Scan for Breast Cancer Risk Haplotypes among African American Women

    PubMed Central

    Song, Chi; Chen, Gary K.; Millikan, Robert C.; Ambrosone, Christine B.; John, Esther M.; Bernstein, Leslie; Zheng, Wei; Hu, Jennifer J.; Ziegler, Regina G.; Nyante, Sarah; Bandera, Elisa V.; Ingles, Sue A.; Press, Michael F.; Deming, Sandra L.; Rodriguez-Gil, Jorge L.; Chanock, Stephen J.; Wan, Peggy; Sheng, Xin; Pooler, Loreall C.; Van Den Berg, David J.; Le Marchand, Loic; Kolonel, Laurence N.; Henderson, Brian E.; Haiman, Chris A.; Stram, Daniel O.

    2013-01-01

    Genome-wide association studies (GWAS) simultaneously investigating hundreds of thousands of single nucleotide polymorphisms (SNP) have become a powerful tool in the investigation of new disease susceptibility loci. Haplotypes are sometimes thought to be superior to SNPs and are promising in genetic association analyses. The application of genome-wide haplotype analysis, however, is hindered by the complexity of haplotypes themselves and sophistication in computation. We systematically analyzed the haplotype effects for breast cancer risk among 5,761 African American women (3,016 cases and 2,745 controls) using a sliding window approach on the genome-wide scale. Three regions on chromosomes 1, 4 and 18 exhibited moderate haplotype effects. Furthermore, among 21 breast cancer susceptibility loci previously established in European populations, 10p15 and 14q24 are likely to harbor novel haplotype effects. We also proposed a heuristic of determining the significance level and the effective number of independent tests by the permutation analysis on chromosome 22 data. It suggests that the effective number was approximately half of the total (7,794 out of 15,645), thus the half number could serve as a quick reference to evaluating genome-wide significance if a similar sliding window approach of haplotype analysis is adopted in similar populations using similar genotype density. PMID:23468962
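
    The sliding-window scan and the "effective number of independent tests" heuristic described above can be sketched roughly as follows. This is a minimal illustration, not the authors' pipeline: the window size, the helper names, and the use of a Bonferroni-style threshold (alpha divided by the number of tests) are assumptions; only the two test counts are taken from the abstract.

```python
from typing import Iterator, Tuple


def sliding_windows(n_snps: int, window: int, step: int = 1) -> Iterator[Tuple[int, int]]:
    """Yield (start, end) index pairs of consecutive-SNP windows (end exclusive)."""
    for start in range(0, n_snps - window + 1, step):
        yield start, start + window


def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance level for a family-wise error rate alpha (assumed rule)."""
    return alpha / n_tests


# Counts quoted in the abstract for the chromosome 22 permutation analysis.
total_tests = 15_645
effective_tests = 7_794

ratio = effective_tests / total_tests
naive = bonferroni_threshold(0.05, total_tests)
adjusted = bonferroni_threshold(0.05, effective_tests)

# Hypothetical toy region: 10 SNPs scanned with 3-SNP windows.
windows = list(sliding_windows(n_snps=10, window=3))

print(f"effective/total tests = {ratio:.3f}")
print(f"threshold using all tests = {naive:.2e}")
print(f"threshold using effective tests = {adjusted:.2e}")
print(f"first windows: {windows[:3]}")
```

    Using the effective rather than the total number of tests gives a less stringent per-test threshold, which is the practical point of the heuristic.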

  17. Y-SNPs haplotype diversity in four Chinese cattle breeds.

    PubMed

    Zhang, Runfeng; Cheng, Ming; Li, Xiaofeng; Chen, Fuying; Zheng, Jing; Wang, Xiaofei; Meng, Quanke

    2013-01-01

    To investigate the genetic diversity of Chinese cattle, 96 male samples of 4 Chinese native cattle breeds were investigated using 5 single nucleotide polymorphisms specific to the bovine Y chromosome. Two previously described haplotypes (taurine Y2 and indicine Y3) were detected in 74 and 22 animals, respectively. The haplotype frequencies varied amongst the four native breeds. The taurine Y2 haplotype dominated in the Qinchuan, Dabieshan, and Yunba breeds. However, the indicine Y3 haplotype occurred in high frequency in the Enshi breed. Among the four native breeds, Yunba had the highest haplotype diversity (0.4330 ± 0.0750), followed by Qinchuan (0.2899 ± 0.1028) and Enshi (0.2222 ± 0.1662); Dabieshan was the least diverse (0.1079 ± 0.0680). Compared with some foreign cattle breeds, a low level of haplotype diversity was detected in our breeds (0.2633 ± 0.1030).
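
    As an illustration of how a haplotype diversity value is obtained from haplotype counts, the sketch below applies Nei's unbiased estimator, h = n/(n-1) * (1 - sum of squared haplotype frequencies). That the study used this particular estimator is an assumption. The counts are the pooled Y2 and Y3 totals from the abstract; per-breed counts are not given, so the pooled result is not expected to reproduce any of the per-breed or summary values reported above.

```python
from typing import Sequence


def haplotype_diversity(counts: Sequence[int]) -> float:
    """Nei's unbiased haplotype (gene) diversity from haplotype counts."""
    n = sum(counts)
    if n < 2:
        raise ValueError("need at least two sampled chromosomes")
    # Sum of squared haplotype frequencies (expected homozygosity).
    sum_sq = sum((c / n) ** 2 for c in counts)
    return n / (n - 1) * (1.0 - sum_sq)


# Pooled counts from the abstract: Y2 in 74 animals, Y3 in 22 animals.
pooled_counts = [74, 22]
h_pooled = haplotype_diversity(pooled_counts)
print(f"pooled haplotype diversity = {h_pooled:.4f}")
```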

  18. Novel Quantitative Real-Time LCR for the Sensitive Detection of SNP Frequencies in Pooled DNA: Method Development, Evaluation and Application

    PubMed Central

    Psifidi, Androniki; Dovas, Chrysostomos; Banos, Georgios

    2011-01-01

    Background Single nucleotide polymorphisms (SNP) have proven to be powerful genetic markers for genetic applications in medicine, life science and agriculture. A variety of methods exist for SNP detection but few can quantify SNP frequencies when the mutated DNA molecules correspond to a small fraction of the wild-type DNA. Furthermore, there is no generally accepted gold standard for SNP quantification, and, in general, currently applied methods give inconsistent results in selected cohorts. In the present study we sought to develop a novel method for accurate detection and quantification of SNP in DNA pooled samples. Methods The development and evaluation of a novel Ligase Chain Reaction (LCR) protocol that uses a DNA-specific fluorescent dye to allow quantitative real-time analysis is described. Different reaction components and thermocycling parameters affecting the efficiency and specificity of LCR were examined. Several protocols, including gap-LCR modifications, were evaluated using plasmid standard and genomic DNA pools. A protocol of choice was identified and applied for the quantification of a polymorphism at codon 136 of the ovine PRNP gene that is associated with susceptibility to a transmissible spongiform encephalopathy in sheep. Conclusions The real-time LCR protocol developed in the present study showed high sensitivity, accuracy, reproducibility and a wide dynamic range of SNP quantification in different DNA pools. The limits of detection and quantification of SNP frequencies were 0.085% and 0.35%, respectively. Significance The proposed real-time LCR protocol is applicable when sensitive detection and accurate quantification of low copy number mutations in DNA pools is needed. Examples include oncogenes and tumour suppressor genes, infectious diseases, pathogenic bacteria, fungal species, viral mutants, drug resistance resulting from point mutations, and genetically modified organisms in food. PMID:21283808

  19. Common CYP2D6 polymorphisms affecting alternative splicing and transcription: long-range haplotypes with two regulatory variants modulate CYP2D6 activity

    PubMed Central

    Wang, Danxin; Poi, Ming J.; Sun, Xiaochun; Gaedigk, Andrea; Leeder, J. Steven; Sadee, Wolfgang

    2014-01-01

    Cytochrome P450 2D6 (CYP2D6) is involved in the metabolism of 25% of clinically used drugs. Genetic polymorphisms cause substantial variation in CYP2D6 activity and serve as biomarkers guiding drug therapy. However, genotype–phenotype relationships remain ambiguous except for poor metabolizers carrying null alleles, suggesting the presence of yet unknown genetic variants. Searching for regulatory CYP2D6 polymorphisms, we find that a SNP defining the CYP2D6*2 allele, rs16947 [R296C, 17–60% minor allele frequency (MAF)], previously thought to convey normal activity, alters exon 6 splicing, thereby reducing CYP2D6 expression at least 2-fold. In addition, two completely linked SNPs (rs5758550/rs133333, MAF 13–42%) increase CYP2D6 transcription more than 2-fold, located in a distant downstream enhancer region (>100 kb) that interacts with the CYP2D6 promoter. In high linkage disequilibrium (LD) with each other, rs16947 and the enhancer SNPs form haplotypes that affect CYP2D6 enzyme activity in vivo. In a pediatric cohort of 164 individuals, rs16947 alone (minor haplotype frequency 28%) was associated with reduced CYP2D6 metabolic activity (measured as dextromethorphan/metabolite ratios), whereas rs5758550/rs133333 alone (frequency 3%) resulted in increased CYP2D6 activity, while haplotypes containing both rs16947 and rs5758550/rs133333 were similar to the wild-type. Other alleles used in biomarker panels carrying these variants such as CYP2D6*41 require re-evaluation of independent effects on CYP2D6 activity. The occurrence of two regulatory variants of high frequency and in high LD, residing on a long haplotype, highlights the importance of gene architecture, likely shaped by evolutionary selection pressures, in determining activity of encoded proteins. PMID:23985325

  20. Single-nucleotide polymorphisms and haplotypes of non-coding area in the CP gene are correlated with Parkinson's disease.

    PubMed

    Zhao, Na; Xiao, Jianqiu; Zheng, Zhiyong; Fei, Guoqiang; Zhang, Feng; Jin, Lirong; Zhong, Chunjiu

    2015-04-01

    Our previous studies have demonstrated that ceruloplasmin (CP) dysmetabolism is correlated with Parkinson's disease (PD). However, the causes of decreased serum CP levels in PD patients remain to be clarified. This study aimed to explore the potential association between genetic variants of the CP gene and PD. Clinical features, serum CP levels, and the CP gene (both promoter and coding regions) were analyzed in 60 PD patients and 50 controls. A luciferase reporter system was used to investigate the function of promoter single-nucleotide polymorphisms (SNPs). High-density comparative genomic hybridization microarrays were also used to detect large-scale copy-number variations in CP and an additional 47 genes involved in PD and/or copper/iron metabolism. The frequencies of eight SNPs (one intronic SNP and seven promoter SNPs of the CP gene) and their haplotypes were significantly different between PD patients, especially those with lowered serum CP levels, and controls. However, the luciferase reporter system revealed no significant effect of the risk haplotype on promoter activity of the CP gene. Neither these SNPs nor their haplotypes were correlated with the Hoehn and Yahr staging of PD. The results of this study suggest that common genetic variants of CP are associated with PD and further investigation is needed to explore their functions in PD.

  1. Haplotype Phasing and Inheritance of Copy Number Variants in Nuclear Families

    PubMed Central

    Palta, Priit; Kaplinski, Lauris; Nagirnaja, Liina; Veidenberg, Andres; Möls, Märt; Nelis, Mari; Esko, Tõnu; Metspalu, Andres; Laan, Maris; Remm, Maido

    2015-01-01

    DNA copy number variants (CNVs) that alter the copy number of a particular DNA segment in the genome play an important role in human phenotypic variability and disease susceptibility. A number of CNVs overlapping with genes have been shown to confer risk to a variety of human diseases, thus highlighting the relevance of addressing the variability of CNVs at a higher resolution. So far, it has not been possible to deterministically infer the allelic composition of different haplotypes present within the CNV regions. We have developed a novel computational method, called PiCNV, which resolves the haplotype sequence composition within CNV regions in nuclear families based on SNP genotyping microarray data. The algorithm makes it possible to i) phase normal and CNV-carrying haplotypes in the copy number variable regions, ii) resolve the allelic copies of rearranged DNA sequence within the haplotypes and iii) infer the heritability of identified haplotypes in trios or larger nuclear families. To our knowledge this is the first program available that can deterministically phase null, mono-, di-, tri- and tetraploid genotypes in CNV loci. We applied our method to study the composition and inheritance of haplotypes in CNV regions of 30 HapMap Yoruban trios and 34 Estonian families. For 93.6% of the CNV loci, PiCNV made it possible to unambiguously phase normal and CNV-carrying haplotypes and follow their transmission in the corresponding families. Furthermore, allelic composition analysis identified the co-occurrence of alternative allelic copies within 66.7% of haplotypes carrying copy number gains. We also observed less frequent transmission of CNV-carrying haplotypes from parents to children compared to normal haplotypes and identified an emergence of several de novo deletions and duplications in the offspring. PMID:25853576
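
    To make the idea of deterministic phasing in a trio concrete, here is a minimal sketch for the ordinary diploid case at a single SNP. It is not PiCNV and does not handle copy number variation; the function name and the two-character genotype encoding are hypothetical choices for illustration.

```python
from typing import Optional, Tuple


def phase_child(father: str, mother: str, child: str) -> Optional[Tuple[str, str]]:
    """Return (paternal_allele, maternal_allele) for the child.

    Genotypes are unordered two-character strings such as "AG".
    Returns None when the phase is ambiguous or the trio is
    inconsistent with Mendelian inheritance.
    """
    a, b = child
    options = set()
    # Try both assignments of the child's alleles to the two parents.
    for pat, mat in ((a, b), (b, a)):
        if pat in father and mat in mother:
            options.add((pat, mat))
    if len(options) == 1:
        return options.pop()
    return None


print(phase_child("AA", "AG", "AG"))  # father can only have transmitted A
print(phase_child("AG", "AG", "AG"))  # all heterozygous: phase is ambiguous
```

    Sites where all three family members are heterozygous cannot be phased from the trio alone, which is one reason family-based methods report the fraction of loci they could resolve.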

  2. Haplotype phasing and inheritance of copy number variants in nuclear families.

    PubMed

    Palta, Priit; Kaplinski, Lauris; Nagirnaja, Liina; Veidenberg, Andres; Möls, Märt; Nelis, Mari; Esko, Tõnu; Metspalu, Andres; Laan, Maris; Remm, Maido

    2015-01-01

    DNA copy number variants (CNVs) that alter the copy number of a particular DNA segment in the genome play an important role in human phenotypic variability and disease susceptibility. A number of CNVs overlapping with genes have been shown to confer risk to a variety of human diseases, thus highlighting the relevance of addressing the variability of CNVs at a higher resolution. So far, it has not been possible to deterministically infer the allelic composition of different haplotypes present within the CNV regions. We have developed a novel computational method, called PiCNV, which resolves the haplotype sequence composition within CNV regions in nuclear families based on SNP genotyping microarray data. The algorithm makes it possible to i) phase normal and CNV-carrying haplotypes in the copy number variable regions, ii) resolve the allelic copies of rearranged DNA sequence within the haplotypes and iii) infer the heritability of identified haplotypes in trios or larger nuclear families. To our knowledge this is the first program available that can deterministically phase null, mono-, di-, tri- and tetraploid genotypes in CNV loci. We applied our method to study the composition and inheritance of haplotypes in CNV regions of 30 HapMap Yoruban trios and 34 Estonian families. For 93.6% of the CNV loci, PiCNV made it possible to unambiguously phase normal and CNV-carrying haplotypes and follow their transmission in the corresponding families. Furthermore, allelic composition analysis identified the co-occurrence of alternative allelic copies within 66.7% of haplotypes carrying copy number gains. We also observed less frequent transmission of CNV-carrying haplotypes from parents to children compared to normal haplotypes and identified an emergence of several de novo deletions and duplications in the offspring.

  3. Acute chest syndrome is associated with single nucleotide polymorphism-defined beta globin cluster haplotype in children with sickle cell anaemia

    PubMed Central

    Bean, Christopher J.; Boulet, Sheree L.; Yang, Genyan; Payne, Amanda B.; Ghaji, Nafisa; Pyle, Meredith E.; Hooper, W. Craig; Bhatnagar, Pallav; Keefer, Jeffrey; Barron-Casella, Emily A.; Casella, James F.; DeBaun, Michael R.

    2013-01-01

    Summary Genetic diversity at the human β-globin locus has been implicated as a modifier of sickle cell anaemia (SCA) severity. However, haplotypes defined by restriction fragment length polymorphism sites across the β-globin locus have not been consistently associated with clinical phenotypes. To define the genetic structure at the β-globin locus more thoroughly, we performed high-density single nucleotide polymorphism (SNP) mapping in 820 children who were homozygous for the sickle cell mutation (HbSS). Genotyping results revealed very high linkage disequilibrium across a large region spanning the locus control region and the HBB (β-globin gene) cluster. We identified three predominant haplotypes accounting for 96% of the βS-carrying chromosomes in this population that could be distinguished using a minimal set of common SNPs. Consistent with previous studies, fetal haemoglobin level was significantly associated with βS-haplotypes. After controlling for covariates, an association was detected between haplotype and rate of hospitalization for acute chest syndrome (ACS) (incidence rate ratio 0.51, 95% confidence interval 0.29–0.89) but not incidence rate of vaso-occlusive pain or presence of silent cerebral infarct (SCI). Our results suggest that these SNP-defined βS-haplotypes may be associated with ACS, but not pain or SCI in a study population of children with SCA. PMID:23952145

  4. Haplotypes of heparin-binding epidermal-growth-factor-like growth factor gene are associated with pre-eclampsia.

    PubMed

    Harendra, Galhenagey Gayani; Jayasekara, Rohan W; Dissanayake, Vajira H W

    2012-01-01

    Heparin-binding epidermal-growth-factor-like growth factor (HBEGF) plays an important role in placentation, including impaired placentation, the primary defect seen in pre-eclampsia. We carried out a case-control disease-association study to examine the association of single nucleotide polymorphisms (SNP) in the HBEGF gene and haplotypes defined by them with pre-eclampsia in a Sinhalese population in Sri Lanka. A total of 175 women with pre-eclampsia and 171 matched normotensive controls were genotyped for six SNP selected in silico as having putative functional effects using mass array Sequenom iplex methodology and a newly designed polymerase chain reaction-restriction fragment length polymorphism assay. The individual SNP were not associated with pre-eclampsia. The haplotypes defined by them, however, showed both predisposing (rs13385T, rs2074613G, rs2237076G, rs2074611C, rs4150196A, rs1862176A; odds ratio, 1.65; 95% confidence interval, 1.04-2.60; P=0.032) and protective (rs13385C, rs2074613G, rs2237076A, rs2074611C, rs4150196A, rs1862176A; odds ratio, 0.20; 95% confidence interval, 0.04-0.89; P=0.034) effects. These results confirm that polymorphisms in the HBEGF gene are associated with pre-eclampsia. The haplotypes are likely to exert their effects through the numerous transcription regulation factors binding to the polymorphic sites, namely GATA-1, GATA-3, MZF-1 and AML-1a. © 2011 The Authors. Journal of Obstetrics and Gynaecology Research © 2011 Japan Society of Obstetrics and Gynecology.

  5. Novel Harmful Recessive Haplotypes Identified for Fertility Traits in Nordic Holstein Cattle

    PubMed Central

    Sahana, Goutam; Nielsen, Ulrik Sander; Aamand, Gert Pedersen; Lund, Mogens Sandø; Guldbrandtsen, Bernt

    2013-01-01

    Using genomic data, lethal recessives may be discovered from haplotypes that are common in the population but never occur in the homozygous state in live animals. This approach only requires genotype data from phenotypically normal (i.e. live) individuals and not from the affected embryos that die. A total of 7,937 Nordic Holstein animals were genotyped with the BovineSNP50 BeadChip, and haplotypes including 25 consecutive markers were constructed and tested for absence of homozygous states. We have identified 17 homozygote-deficient haplotypes which could be loosely clustered into eight genomic regions harboring possible recessive lethal alleles. Effects of the identified haplotypes were estimated on two fertility traits: non-return rates and calving interval. Out of the eight identified genomic regions, six regions were confirmed as having an effect on fertility. The information can be used to avoid carrier-by-carrier matings in practical animal breeding. Further, identification of causative genes/polymorphisms responsible for lethal effects will lead to accurate testing of the individuals carrying a lethal allele. PMID:24376603
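
    The logic of the homozygote-deficiency test can be illustrated with a simple calculation. The sketch below assumes random mating, so that a haplotype at frequency f is expected in the homozygous state in a fraction f squared of animals; the study's actual test may use mating or pedigree information instead. The sample size is from the abstract, while the haplotype frequency of 0.03 is a hypothetical value chosen for illustration.

```python
def expected_homozygotes(n_animals: int, hap_freq: float) -> float:
    """Expected number of homozygous animals under random mating."""
    return n_animals * hap_freq ** 2


def prob_zero_homozygotes(n_animals: int, hap_freq: float) -> float:
    """Probability of observing no homozygotes among n independent animals."""
    return (1.0 - hap_freq ** 2) ** n_animals


n_animals = 7_937   # genotyped animals, from the abstract
hap_freq = 0.03     # hypothetical haplotype frequency

expected = expected_homozygotes(n_animals, hap_freq)
p_zero = prob_zero_homozygotes(n_animals, hap_freq)
print(f"expected homozygotes = {expected:.2f}")
print(f"P(no homozygotes observed) = {p_zero:.2e}")
```

    A haplotype with several expected homozygotes but none observed is a candidate carrier of a recessive lethal allele.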

  6. Kullback-Leibler divergence for detection of rare haplotype common disease association.

    PubMed

    Lin, Shili

    2015-11-01

    Rare haplotypes may tag rare causal variants of common diseases; hence, detection of such rare haplotypes may also contribute to our understanding of complex disease etiology. Because rare haplotypes frequently result from common single-nucleotide polymorphisms (SNPs), focusing on rare haplotypes is much more economical compared with using rare single-nucleotide variants (SNVs) from sequencing, as SNPs are available and 'free' from already amassed genome-wide studies. Further, associated haplotypes may shed light on the underlying disease causal mechanism, a feat unmatched by SNV-based collapsing methods. In recent years, data mining approaches have been adapted to detect rare haplotype association. However, as they rely on an assumed underlying disease model and require the specification of a null haplotype, results can be erroneous if such assumptions are violated. In this paper, we present a haplotype association method based on Kullback-Leibler divergence (hapKL) for case-control samples. The idea is to compare haplotype frequencies for the cases versus the controls by computing symmetrical divergence measures. An important property of such measures is that both the frequencies and logarithms of the frequencies contribute in parallel, thus balancing the contributions from rare and common, and accommodating both deleterious and protective, haplotypes. A simulation study under various scenarios shows that hapKL has well-controlled type I error rates and good power compared with existing data mining methods. Application of hapKL to age-related macular degeneration (AMD) shows a strong association of the complement factor H (CFH) gene with AMD, identifying several individual rare haplotypes with strong signals.
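
    The comparison of case and control haplotype frequencies by a symmetrical divergence can be sketched as below. This uses the standard symmetrised Kullback-Leibler (Jeffreys) divergence, the sum over haplotypes of (p - q) * log(p / q); whether hapKL uses exactly this form is an assumption, and the frequencies are hypothetical.

```python
import math
from typing import List, Sequence


def symmetric_kl_terms(p: Sequence[float], q: Sequence[float], eps: float = 1e-12) -> List[float]:
    """Per-haplotype terms of KL(p||q) + KL(q||p) = sum (p_i - q_i) * log(p_i / q_i)."""
    if len(p) != len(q):
        raise ValueError("frequency vectors must have the same length")
    terms = []
    for pi, qi in zip(p, q):
        # Floor zero frequencies so the logarithm stays defined.
        pi, qi = max(pi, eps), max(qi, eps)
        terms.append((pi - qi) * math.log(pi / qi))
    return terms


def symmetric_kl(p: Sequence[float], q: Sequence[float]) -> float:
    """Symmetrised Kullback-Leibler divergence between two frequency vectors."""
    return sum(symmetric_kl_terms(p, q))


# Hypothetical haplotype frequencies (two common, two rare haplotypes).
cases = [0.60, 0.30, 0.08, 0.02]
controls = [0.62, 0.32, 0.05, 0.01]

terms = symmetric_kl_terms(cases, controls)
print("per-haplotype contributions:", [f"{t:.5f}" for t in terms])
print(f"symmetric KL divergence = {symmetric_kl(cases, controls):.5f}")
```

    Because each term multiplies the frequency difference by the log ratio, a rare haplotype whose frequency doubles can contribute more than a common haplotype that shifts by a larger absolute amount.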

  7. HLA Haplotype Frequency Estimation from Real-Life Data with the Hapl-o-Mat Software.

    PubMed

    Sauter, Jürgen; Schäfer, Christian; Schmidt, Alexander H

    2018-01-01

    HLA haplotype frequencies are of use in a variety of settings. Such data is typically derived either from family pedigree data by targeted typing or statistical analysis of large population-specific genotype samples. As established tools for the latter approach lacked ability to treat the amount, ambiguity, and inhomogeneity found in genotype data in hematopoietic stem cell donor registries, we developed Hapl-o-Mat to alleviate these specific shortcomings.
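
    Estimating haplotype frequencies from unphased population genotypes is commonly done with an expectation-maximisation (EM) procedure. The sketch below shows the idea for two biallelic loci, where only double heterozygotes have ambiguous phase. It is a toy illustration under that assumption and does not reproduce Hapl-o-Mat, which has to deal with multi-allelic HLA loci and ambiguous typings; all names and the sample data are hypothetical.

```python
from typing import Dict, List, Sequence, Tuple

Hap = Tuple[int, int]
HAPS: List[Hap] = [(0, 0), (0, 1), (1, 0), (1, 1)]


def alleles(g: int) -> Tuple[int, int]:
    """Map a genotype coded as the count of allele 1 (0, 1, 2) to its two alleles."""
    return {0: (0, 0), 1: (0, 1), 2: (1, 1)}[g]


def em_haplotype_freqs(genotypes: Sequence[Tuple[int, int]], n_iter: int = 100) -> Dict[Hap, float]:
    """Estimate two-locus haplotype frequencies from unphased genotypes by EM."""
    freqs = {h: 0.25 for h in HAPS}
    n_haps = 2 * len(genotypes)
    for _ in range(n_iter):
        counts = {h: 0.0 for h in HAPS}
        for g1, g2 in genotypes:
            if g1 == 1 and g2 == 1:
                # E-step: split the double heterozygote between the two phases.
                cis = freqs[(0, 0)] * freqs[(1, 1)]
                trans = freqs[(0, 1)] * freqs[(1, 0)]
                w = cis / (cis + trans) if cis + trans > 0 else 0.5
                counts[(0, 0)] += w
                counts[(1, 1)] += w
                counts[(0, 1)] += 1 - w
                counts[(1, 0)] += 1 - w
            else:
                # Phase is unambiguous when at least one locus is homozygous.
                a1, a2 = alleles(g1)
                b1, b2 = alleles(g2)
                counts[(a1, b1)] += 1
                counts[(a2, b2)] += 1
        # M-step: renormalise the expected counts.
        freqs = {h: c / n_haps for h, c in counts.items()}
    return freqs


# Hypothetical sample: 4 + 4 double homozygotes and 2 double heterozygotes.
sample = [(0, 0)] * 4 + [(2, 2)] * 4 + [(1, 1)] * 2
estimates = em_haplotype_freqs(sample)
for hap, f in estimates.items():
    print(hap, f"{f:.4f}")
```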

  8. The Association of a Novel Haplotype in the Dopamine Transporter with Preschool Age Posttraumatic Stress Disorder

    PubMed Central

    Brett, Zoë H.; Henry, Caitlin; Scheeringa, Michael

    2013-01-01

    Abstract Objective Significant evidence supports a genetic contribution to the development of posttraumatic stress disorder (PTSD). Three previous studies have demonstrated an association between PTSD and the nine repeat allele of the 3′ untranslated region (3′UTR) variable number tandem repeat (VNTR) in the dopamine transporter (DAT, rs28363170). Recently a novel, functionally significant C/T single-nucleotide polymorphism (SNP) in the 3′UTR (rs27072) with putative interactions with the 3′VNTR, has been identified. To provide enhanced support for the role of DAT and striatal dopamine regulation in the development of PTSD, this study examined the impact of a haplotype defined by the C allele of rs27072 and the nine repeat allele of the 3′VNTR on PTSD diagnosis in young trauma-exposed children. Methods DAT haplotypes were determined in 150 trauma-exposed 3–6 year-old children. PTSD was assessed with a semistructured interview. After excluding double heterozygotes, analysis was performed on 143 total subjects. Haplotype was examined in relation to categorical and continuous measures of PTSD, controlling for trauma type and race. Additional analysis within the two largest race categories was performed, as other means of controlling for ethnic stratification were not available. Results The number of haplotypes (0, 1, or 2) defined by the presence of the nine repeat allele of rs28363170 (VNTR in the 3′UTR) and the C allele of rs27072 (SNP in the 3′UTR) was significantly associated with both the diagnosis of PTSD and total PTSD symptoms. Specifically, children with one or two copies of the haplotype had significantly more PTSD symptoms and were more likely to be diagnosed with PTSD than were children without this haplotype. Conclusions These findings extend previous findings associating genetic variation in the DAT with PTSD. The association of a haplotype in DAT with PTSD provides incremental traction for a model of genetic vulnerability to PTSD, a

  9. Intrahaplotypic Variants Differentiate Complex Linkage Disequilibrium within Human MHC Haplotypes

    PubMed Central

    Lam, Tze Hau; Tay, Matthew Zirui; Wang, Bei; Xiao, Ziwei; Ren, Ee Chee

    2015-01-01

    Distinct regions of long-range genetic fixation in the human MHC region, known as conserved extended haplotypes (CEHs), possess unique genomic characteristics and are strongly associated with numerous diseases. While CEHs appear to be homogeneous by SNP analysis, the nature of fine variations within their genomic structure is unknown. Using multiple, MHC-homozygous cell lines, we demonstrate extensive sequence conservation in two common Asian MHC haplotypes: A33-B58-DR3 and A2-B46-DR9. However, characterization of phase-resolved MHC haplotypes revealed unique intra-CEH patterns of variation and uncovered 127 single nucleotide variants (SNVs) which are missing from public databases. We further show that the strong linkage disequilibrium structure within the human MHC that typically confounds precise identification of genetic features can be resolved using intra-CEH variants, as evidenced by rs3129063 and rs448489, which affect expression of ZFP57, a gene important in methylation and epigenetic regulation. This study demonstrates an improved strategy that can be used towards genetic dissection of diseases. PMID:26593880

  10. Haplotype diversity of the myostatin gene among beef cattle breeds

    PubMed Central

    Dunner, Susana; Miranda, M Eugenia; Amigues, Yves; Cañón, Javier; Georges, Michel; Hanset, Roger; Williams, John; Ménissier, François

    2003-01-01

    A total of 678 individuals from 28 European bovine breeds were both phenotyped and analysed at the myostatin locus by the Single Strand Conformation Polymorphism (SSCP) method. Seven new mutations were identified which contribute to the high polymorphism (1 SNP every 100 bp) present in this small gene; twenty haplotypes were described and a genotyping method was set up using the Oligonucleotide Ligation Assay (OLA) method. Some haplotypes appeared to be exclusive to a particular breed; this was the case for 5 in the Charolaise (involving mutation Q204X) and 7 in the Maine-Anjou (involving mutation E226X). The relationships between the different haplotypes were studied, thus allowing the earlier hypothesis on the origin of muscular hypertrophy in Europe to be tested: muscular hypertrophy (namely nt821(del11)) was mainly spread in different waves from northern Europe milk purpose populations in most breeds; however, other mutations (mostly disruptive) arose in a single breed, were highly selected and have since scarcely evolved to other populations. PMID:12605853

  11. Recovery of Native Genetic Background in Admixed Populations Using Haplotypes, Phenotypes, and Pedigree Information – Using Cika Cattle as a Case Breed

    PubMed Central

    Simčič, Mojca; Smetko, Anamarija; Sölkner, Johann; Seichter, Doris; Gorjanc, Gregor; Kompan, Dragomir; Medugorac, Ivica

    2015-01-01

    The aim of this study was to obtain unbiased estimates of the diversity parameters, the population history, and the degree of admixture in Cika cattle which represents the local admixed breeds at risk of extinction undergoing challenging conservation programs. Genetic analyses were performed on the genome-wide Single Nucleotide Polymorphism (SNP) Illumina Bovine SNP50 array data of 76 Cika animals and 531 animals from 14 reference populations. To obtain unbiased estimates we used short haplotypes spanning four markers instead of single SNPs to avoid an ascertainment bias of the BovineSNP50 array. Genome-wide haplotypes combined with partial pedigree and type trait classification show the potential to improve identification of purebred animals with a low degree of admixture. Phylogenetic analyses demonstrated unique genetic identity of Cika animals. Genetic distance matrix presented by rooted Neighbour-Net suggested long and broad phylogenetic connection between Cika and Pinzgauer. Unsupervised clustering performed by the admixture analysis and two-dimensional presentation of the genetic distances between individuals also suggest Cika is a distinct breed despite being similar in appearance to Pinzgauer. Animals identified as the most purebred could be used as a nucleus for a recovery of the native genetic background in the current admixed population. The results show that local well-adapted strains, which have never been intensively managed and differentiated into specific breeds, exhibit large haplotype diversity. They suggest a conservation and recovery approach that does not rely exclusively on the search for the original native genetic background but rather on the identification and removal of common introgressed haplotypes would be more powerful. Successful implementation of such an approach should be based on combining phenotype, pedigree, and genome-wide haplotype data of the breed of interest and a spectrum of reference breeds which potentially have had

  12. Dimensional Anxiety Mediates Linkage of GABRA2 Haplotypes With Alcoholism

    PubMed Central

    Enoch, Mary-Anne; Schwartz, Lori; Albaugh, Bernard; Virkkunen, Matti; Goldman, David

    2015-01-01

    The GABAAα2 receptor gene (GABRA2) modulates anxiety and stress response. Three recent association studies implicate GABRA2 in alcoholism; however, in these papers both common, opposite-configuration haplotypes in the region distal to intron 3 predict risk. We have now replicated the GABRA2 association with alcoholism in 331 Plains Indian men and women and 461 Finnish Caucasian men. Using a dimensional measure of anxiety, harm avoidance (HA), we also found that the association with alcoholism is mediated, or moderated, by anxiety. Nine SNPs were genotyped, revealing two haplotype blocks. Within the previously implicated block 2 region, we identified the two common, opposite-configuration risk haplotypes, A and B. Their frequencies differed markedly in Finns and Plains Indians. In both populations, most block 2 SNPs were significantly associated with alcoholism. The associations were due to increased frequencies of both homozygotes in alcoholics, indicating the possibility of alcoholic subtypes with opposite genotypes. Congruently, there was no significant haplotype association. Using HA as an indicator variable for anxiety, we found haplotype linkage to alcoholism with high and low dimensional anxiety, and to HA itself, in both populations. High HA alcoholics had the highest frequency of the more abundant haplotype (A in Finns, B in Plains Indians); low HA alcoholics had the highest frequency of the less abundant haplotype (B in Finns, A in Plains Indians) (Finns: P = 0.007, OR = 2.1; Plains Indians: P = 0.040, OR = 1.9). Non-alcoholics had intermediate frequencies. Our results suggest that within the distal GABRA2 region is a functional locus or loci that may differ between populations but that alters risk for alcoholism via the mediating action of anxiety. PMID:16874763

  13. How Have Self-Incompatibility Haplotypes Diversified? Generation of New Haplotypes during the Evolution of Self-Incompatibility from Self-Compatibility.

    PubMed

    Sakai, Satoki

    2016-08-01

    I developed a gametophytic self-incompatibility (SI) model to study the conditions leading to diversification in SI haplotypes. In the model, the SI system is assumed to be incomplete, and the pollen expressing a given specificity is not fully rejected by the pistils expressing the same specificity. I also assumed that mutations can occur that enhance the rejection of pollen by pistils with the same haplotype variant and reduce rejection by pistils with other variants in the same haplotype. I found that if such mutations occur, the new haplotypes (mutant variants) can stably coexist with the ancestral haplotype in which the mutant arose. This is because pollen bearing the new haplotype is most strongly rejected by pistils bearing the same new haplotype among the pistils in the population; hence, negative frequency-dependent selection prevents their fixation. I also performed simulations and found that the nearly complete SI system evolves from completely self-compatible populations and that SI haplotypes can increase to about 40-50 within a few thousand generations. On the basis of my findings, I propose that diversification of SI haplotypes occurred during the evolution of SI from self-compatibility.

  14. Polymorphisms and haplotypes in the bovine neuropeptide Y, growth hormone receptor, ghrelin, insulin-like growth factor 2, and uncoupling proteins 2 and 3 genes and their associations with measures of growth, performance, feed efficiency, and carcass merit in beef cattle.

    PubMed

    Sherman, E L; Nkrumah, J D; Murdoch, B M; Li, C; Wang, Z; Fu, A; Moore, S S

    2008-01-01

    Genes that regulate metabolism and energy partitioning have the potential to influence economically important traits in farm animals, as do polymorphisms within these genes. In the current study, SNP in the bovine neuropeptide Y (NPY), growth hormone receptor (GHR), ghrelin (GHRL), uncoupling proteins 2 and 3 (UCP2 and UCP3), IGF2, corticotrophin-releasing hormone (CRH), cocaine and amphetamine regulated transcript (CART), melanocortin-4 receptor (MC4R), proopiomelanocortin (POMC), and GH genes were evaluated for associations with growth, feed efficiency, and carcass merit in beef steers. In total, 24 SNP were evaluated for associations with these traits and haplotypes were constructed within each gene when 2 or more SNP showed significant associations. An A/G SNP located in intron 4 of the GHR gene had the largest effects on BW of the animals (dominance effect P < 0.01) and feed efficiency (allele substitution effect P < 0.05). Another A/G SNP located in the promoter region of GHR had similar effects but the haplotypes of these 2 SNP reduced the effects of the SNP located in intron 4. Three SNP in the NPY gene showed associations to marbling (P < 0.001) as well as with ADG, BW, and feed conversion ratio (FCR; P < 0.05). The combination of these 3 SNP into haplotypes generally improved the association or had a similar scale of association as each single SNP. Only 1 SNP in UCP3, an A/G SNP in intron 3, was associated with ADG (P = 0.025), partial efficiency of growth, and FCR (P < 0.01). Three SNP in UCP2 gene were in almost complete linkage disequilibrium and showed associations with lean meat yield, yield grade, DMI, and BW (P < 0.05). Haplotypes between the SNP in UCP3 and UCP2 generally reduced the associations seen individually in each SNP. An A/G SNP in the GHRL gene tended to show effects on residual feed intake, FCR, and partial efficiency of growth (P < 0.10). The IGF2 SNP most strongly affected LM area (P < 0.01), back fat, ADG, and FCR (P < 0.05). The

  15. JAK2 46/1 haplotype is associated with JAK2 V617F--positive myeloproliferative neoplasms in Brazilian patients.

    PubMed

    Macedo, L C; Santos, B C; Pagliarini-e-Silva, S; Pagnano, K B B; Rodrigues, C; Quintero, F C; Ferreira, M E; Baraldi, E C; Ambrosio-Albuquerque, E P; Sell, A M; Visentainer, J E L

    2015-10-01

    This study aimed to verify the association between the JAK2 46/1 haplotype (V617F positive) and some hematological parameters in BCR-ABL-negative chronic myeloproliferative neoplasms (cMPNs) in our population. The blood samples obtained from the patients with cMPN were genotyped for the JAK2 V617F mutation and screened for the JAK2 rs10974944 SNP using a PCR-RFLP assay. The JAK2 V617F mutation was detected in 80.15% of patients. The G variant of rs10974944 was more frequent in all MPNs, especially those that were JAK2 V617F positive, than in the control population. We also compared the 46/1 haplotype status in each MPN disease entity, polycythemia vera (PV), essential thrombocythemia (ET), primary myelofibrosis (PMF), and MPNu, with controls. The G allele frequency relative to controls was significantly enriched in patients with PV and ET, but not in those with PMF and MPNu. PV and ET patients especially, all of whom had the JAK2 V617F mutation, showed significant excess of the G allele. The frequency of the JAK2 V617F mutation was associated with elevated hematological parameters, but when we analyzed the occurrence of the mutation and the presence of the G allele, only high hemoglobin was significant. In agreement with previous reports, the JAK2 46/1 haplotype was associated with JAK2 V617F-positive cMPN in Brazilian patients. © 2015 John Wiley & Sons Ltd.

  16. [Association analysis of SNP-63 and indel-19 variant in the calpain-10 gene with polycystic ovary syndrome in women of reproductive age].

    PubMed

    Flores-Martínez, Silvia Esperanza; Castro-Martínez, Anna Gabriela; López-Quintero, Andrés; García-Zapién, Alejandra Guadalupe; Torres-Rodríguez, Ruth Noemí; Sánchez-Corona, José

    2015-01-01

    Polycystic ovary syndrome is a complex and heterogeneous disease involving both reproductive and metabolic problems. A genetic predisposition has been suggested in the etiology of this syndrome. The identification of the calpain-10 gene (CAPN10) as the first candidate gene for type 2 diabetes mellitus has focused interest on investigating its possible relation with polycystic ovary syndrome, because this syndrome is associated with hyperinsulinemia and insulin resistance, two metabolic abnormalities associated with type 2 diabetes mellitus. The objective was to investigate whether there is an association between SNP-63 and the indel-19 variant of the CAPN10 gene and polycystic ovary syndrome in women of reproductive age. This study included 101 women (55 with polycystic ovary syndrome and 46 without polycystic ovary syndrome). The genetic variant indel-19 was identified by electrophoresis of the fragments amplified by PCR, and SNP-63 by PCR-RFLP. The allele and genotype frequencies of the two variants did not differ significantly between women with polycystic ovary syndrome and the control group. The haplotype 21 (defined by the insertion allele of the indel-19 variant and the C allele of SNP-63) was the most frequent in both study groups and was more frequent in the polycystic ovary syndrome group; however, this difference was not statistically significant (p = 0.8353). The results suggest that SNP-63 and the indel-19 variant of the CAPN10 gene do not represent a risk factor for polycystic ovary syndrome in our patient group. Copyright © 2015. Published by Masson Doyma México S.A.

  17. SNP and haplotype analysis of paired box 3 (PAX3) gene provide evidence for association with growth traits in Chinese cattle.

    PubMed

    Xu, Yao; Cai, Hanfang; Zhou, Yang; Shi, Tao; Lan, Xianyong; Zhang, Chunlei; Lei, Chuzhao; Jia, Yutang; Chen, Hong

    2014-07-01

    Paired box 3 (PAX3) belongs to the PAX superfamily of transcription factors and plays essential roles in the embryogenesis and postnatal formation of limb musculature through affecting the survival of muscle progenitor cells. By genetic mapping, PAX3 gene is assigned in the interval of quantitative trait loci for body weight on bovine BTA2. The objectives of this study were to detect polymorphisms of PAX3 gene in 1,241 cattle from five breeds and to investigate their effects on growth traits. Initially, three novel single nucleotide polymorphisms (SNPs) were identified by DNA pool sequencing and aCRS-RFLP methods (AC_000159: g.T-580G, g.A4617C and g.79018Ins/del G), which were located at 5'-UTR, exon 4 and intron 6, respectively. A total of eight haplotypes were constructed and the frequency of the three main haplotypes H1 (TAG), H2 (GCG) and H3 (GAG) accounted for over 81.7 % of the total individuals. Statistical analysis revealed that the three SNPs were associated with body height and body length of Nanyang and Chinese Caoyuan cattle at the age of 6 and/or 12 months old (P < 0.05), and consistently significant effects were also found in the haplotype combination analysis on these traits (P < 0.05). This study presented a complete scan of variations within bovine PAX3 gene, which could provide evidence for improving the economic traits of cattle by using these variations as potentially genetic markers in early marker-assisted selection programs.

  18. A powerful approach reveals numerous expression quantitative trait haplotypes in multiple tissues.

    PubMed

    Ying, Dingge; Li, Mulin Jun; Sham, Pak Chung; Li, Miaoxin

    2018-04-26

    Recently, many studies have shown that single nucleotide polymorphisms (SNPs) affect gene expression and contribute to the development of complex traits/diseases in a tissue context-dependent manner. However, little is known about the influence of haplotypes, which reflect the interaction effect between SNPs, on gene expression and complex traits. In the present study, we first proposed a regulatory region guided eQTL haplotype association analysis approach, and then systematically investigated the expression quantitative trait loci (eQTL) haplotypes in 20 different tissues using this approach. The approach has a powerful design that reduces computational burden through the use of regulatory predictions for candidate SNP selection and multiple testing corrections on non-independent haplotypes. The application results in multiple tissues showed that haplotype-based eQTLs not only increased the number of eQTL genes in a tissue-specific manner, but were also enriched in loci associated with complex traits in a tissue-matched manner. In addition, we found that tag SNPs of eQTL haplotypes from whole blood were selectively enriched in certain combinations of regulatory elements (e.g. promoters and enhancers) according to predicted chromatin states. In summary, this eQTL haplotype detection approach, together with the application results, sheds light on the synergistic effects of sequence variants on gene expression and their susceptibility to complex diseases. The executable application "eHaplo" is implemented in Java and is publicly available at http://grass.cgs.hku.hk/limx/ehaplo/. Contact: jonsonfox@gmail.com, limiaoxin@mail.sysu.edu.cn. Supplementary data are available at Bioinformatics online.

  19. The molecular epidemiology of Huntington disease is related to intermediate allele frequency and haplotype in the general population.

    PubMed

    Kay, Chris; Collins, Jennifer A; Wright, Galen E B; Baine, Fiona; Miedzybrodzka, Zosia; Aminkeng, Folefac; Semaka, Alicia J; McDonald, Cassandra; Davidson, Mark; Madore, Steven J; Gordon, Erynn S; Gerry, Norman P; Cornejo-Olivas, Mario; Squitieri, Ferdinando; Tishkoff, Sarah; Greenberg, Jacquie L; Krause, Amanda; Hayden, Michael R

    2018-04-01

    Huntington disease (HD) is the most common monogenic neurodegenerative disorder in populations of European ancestry, but occurs at lower prevalence in populations of East Asian or black African descent. New mutations for HD result from CAG repeat expansions of intermediate alleles (IAs), usually of paternal origin. The differing prevalence of HD may be related to the rate of new mutations in a population, but no comparative estimates of IA frequency or the HD new mutation rate are available. In this study, we characterize IA frequency and the CAG repeat distribution in fifteen populations of diverse ethnic origin. We estimate the HD new mutation rate in a series of populations using molecular IA expansion rates. The frequency of IAs was highest in Hispanic Americans and Northern Europeans, and lowest in black Africans and East Asians. The prevalence of HD correlated with the frequency of IAs by population and with the proportion of IAs found on the HD-associated A1 haplotype. The HD new mutation rate was estimated to be highest in populations with the highest frequency of IAs. In European ancestry populations, one in 5,372 individuals from the general population and 7.1% of individuals with an expanded CAG repeat in the HD range are estimated to have a molecular new mutation. Our data suggest that the new mutation rate for HD varies substantially between populations, and that IA frequency and haplotype are closely linked to observed epidemiological differences in the prevalence of HD across major ancestry groups in different countries. © 2018 Wiley Periodicals, Inc.

  20. Short communication: casein haplotype variability in Sicilian dairy goat breeds.

    PubMed

    Gigli, I; Maizon, D O; Riggio, V; Sardina, M T; Portolano, B

    2008-09-01

    In the Mediterranean region, goat milk production is an important economic activity. In the present study, 4 casein genes were genotyped in 5 Sicilian goat breeds to 1) identify casein haplotypes present in the Argentata dell'Etna, Girgentana, Messinese, Derivata di Siria, and Maltese goat breeds; and 2) describe the structure of the Sicilian goat breeds based on casein haplotypes and allele frequencies. In a sample of 540 dairy goats, 67 different haplotypes with frequency ≥0.01 and 27 with frequency ≥0.03 were observed. The most common CSN1S1-CSN2-CSN1S2-CSN3 haplotype for Derivata di Siria and Maltese was FCFB (0.17 and 0.22, respectively), whereas for Argentata dell'Etna, Girgentana, and Messinese it was ACAB (0.06, 0.23, and 0.10, respectively). According to the haplotype reconstruction, Argentata dell'Etna, Girgentana, and Messinese breeds presented the most favorable haplotype for cheese production, because the casein concentration in milk of these breeds might be greater than that in Derivata di Siria and Maltese breeds. Based on a cluster analysis, the breeds formed 2 main groups: Derivata di Siria and Maltese in one group, and Argentata dell'Etna and Messinese in the other; the Girgentana breed was between these groups but closer to the latter.

  1. High-Resolution Analyses of Human Leukocyte Antigens Allele and Haplotype Frequencies Based on 169,995 Volunteers from the China Bone Marrow Donor Registry Program

    PubMed Central

    Zhou, Xiao-Yang; Zhu, Fa-Ming; Li, Jian-Ping; Mao, Wei; Zhang, De-Mei; Liu, Meng-Li; Hei, Ai-Lian; Dai, Da-Peng; Jiang, Ping; Shan, Xiao-Yan; Zhang, Bo-Wei; Zhu, Chuan-Fu; Shen, Jie; Deng, Zhi-Hui; Wang, Zheng-Lei; Yu, Wei-Jian; Chen, Qiang; Qiao, Yan-Hui; Zhu, Xiang-Ming; Lv, Rong; Li, Guo-Ying; Li, Guo-Liang; Li, Heng-Cong; Zhang, Xu; Pei, Bin; Jiao, Li-Xin; Shen, Gang; Liu, Ying; Feng, Zhi-Hui; Su, Yu-Ping; Xu, Zhao-Xia; Di, Wen-Ying; Jiang, Yao-Qin; Fu, Hong-Lei; Liu, Xiang-Jun; Liu, Xiang; Zhou, Mei-Zhen; Du, Dan; Liu, Qi; Han, Ying; Zhang, Zhi-Xin; Cai, Jian-Ping

    2015-01-01

    Allogeneic hematopoietic stem cell transplantation is a widely used and effective therapy for hematopoietic malignant diseases and numerous other disorders. High-resolution human leukocyte antigen (HLA) haplotype frequency distributions not only facilitate individual donor searches but also determine the probability with which a particular patient can find HLA-matched donors in a registry. The frequencies of the HLA-A, -B, -C, -DRB1, and -DQB1 alleles and haplotypes were estimated among 169,995 Chinese volunteers using the sequencing-based typing (SBT) method. Totals of 191 HLA-A, 244 HLA-B, 146 HLA-C, 143 HLA-DRB1 and 47 HLA-DQB1 alleles were observed, which accounted for 6.98%, 7.06%, 6.46%, 9.11% and 7.91%, respectively, of the alleles in each locus in the world (IMGT 3.16 Release, Apr. 2014). Among the 100 most common haplotypes from the 169,995 individuals, nine distinct haplotypes displayed significant regionally specific distributions. Among these, three were predominant in the South China region (i.e., the 20th, 31st, and 81st haplotypes), another three were predominant in the Southwest China region (i.e., the 68th, 79th, and 95th haplotypes), one was predominant in the South and Southwest China regions (the 18th haplotype), one was relatively common in the Northeast and North China regions (the 94th haplotype), and one was common in the Northeast, North and Northwest China regions (the 40th haplotype). In conclusion, this is the first study to analyze high-resolution HLA diversities across the entire country of China, based on a detailed and complete data set that covered 31 provinces, autonomous regions, and municipalities. Specifically, we also evaluated the HLA matching probabilities within and between geographic regions and analyzed the regional differences in the HLA diversities in China. We believe that the data presented in this study might be useful for unrelated HLA-matched donor searches, donor registry planning, population genetic studies, and anthropogenesis

  2. Single Marker and Haplotype-Based Association Analysis of Semolina and Pasta Colour in Elite Durum Wheat Breeding Lines Using a High-Density Consensus Map.

    PubMed

    N'Diaye, Amidou; Haile, Jemanesh K; Cory, Aron T; Clarke, Fran R; Clarke, John M; Knox, Ron E; Pozniak, Curtis J

    2017-01-01

    Association mapping is usually performed by testing the correlation between a single marker and phenotypes. However, because patterns of variation within genomes are inherited as blocks, clustering markers into haplotypes for genome-wide scans could be a worthwhile approach to improve statistical power to detect associations. The availability of high-density molecular data allows the possibility to assess the potential of both approaches to identify marker-trait associations in durum wheat. In the present study, we used single marker- and haplotype-based approaches to identify loci associated with semolina and pasta colour in durum wheat, the main objective being to evaluate the potential benefits of haplotype-based analysis for identifying quantitative trait loci. One hundred sixty-nine durum lines were genotyped using the Illumina 90K Infinium iSelect assay, and 12,234 polymorphic single nucleotide polymorphism (SNP) markers were generated and used to assess the population structure and the linkage disequilibrium (LD) patterns. A total of 8,581 SNPs previously localized to a high-density consensus map were clustered into 406 haplotype blocks based on the average LD distance of 5.3 cM. Combining multiple SNPs into haplotype blocks increased the average polymorphism information content (PIC) from 0.27 per SNP to 0.50 per haplotype. The haplotype-based analysis identified 12 loci associated with grain pigment colour traits, including the five loci identified by the single marker-based analysis. Furthermore, the haplotype-based analysis resulted in an increase of the phenotypic variance explained (50.4% on average) and the allelic effect (33.7% on average) when compared to single marker analysis. The presence of multiple allelic combinations within each haplotype locus offers potential for screening the most favorable haplotype series and may facilitate marker-assisted selection of grain pigment colour in durum wheat. These results suggest a benefit of haplotype
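    The rise in polymorphism information content (PIC) reported above follows from how PIC is defined: a biallelic SNP cannot exceed 0.375, whereas a multi-allelic haplotype locus can. The sketch below assumes the widely used Botstein-style definition, PIC = 1 - sum(p_i^2) - sum over i<j of 2 p_i^2 p_j^2. The abstract does not state which formula the authors used, and the frequencies here are illustrative, not data from the study.

```python
from itertools import combinations


def pic(freqs):
    """Polymorphism information content for one locus.

    Assumes the Botstein-style definition; `freqs` are allele (or
    haplotype) frequencies at the locus and must sum to 1.
    """
    if abs(sum(freqs) - 1.0) > 1e-9:
        raise ValueError("frequencies must sum to 1")
    homozygosity = sum(p * p for p in freqs)
    # Correction term for matings between identical heterozygotes
    cross = sum(2 * (p * p) * (q * q) for p, q in combinations(freqs, 2))
    return 1.0 - homozygosity - cross


# Hypothetical frequencies, chosen only to show the ceiling effect
snp_pic = pic([0.5, 0.5])    # a biallelic SNP at its maximum
hap_pic = pic([0.25] * 4)    # a four-haplotype block, equal frequencies
print(f"SNP PIC = {snp_pic:.3f}, haplotype PIC = {hap_pic:.3f}")
```

    Under these assumptions the four-haplotype block carries more information per locus than any single biallelic SNP can, which is in line with the direction of the change the abstract describes.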

  3. Single Marker and Haplotype-Based Association Analysis of Semolina and Pasta Colour in Elite Durum Wheat Breeding Lines Using a High-Density Consensus Map

    PubMed Central

    Haile, Jemanesh K.; Cory, Aron T.; Clarke, Fran R.; Clarke, John M.; Knox, Ron E.; Pozniak, Curtis J.

    2017-01-01

    Association mapping is usually performed by testing the correlation between a single marker and phenotypes. However, because patterns of variation within genomes are inherited as blocks, clustering markers into haplotypes for genome-wide scans could be a worthwhile approach to improve statistical power to detect associations. The availability of high-density molecular data allows the possibility to assess the potential of both approaches to identify marker-trait associations in durum wheat. In the present study, we used single marker- and haplotype-based approaches to identify loci associated with semolina and pasta colour in durum wheat, the main objective being to evaluate the potential benefits of haplotype-based analysis for identifying quantitative trait loci. One hundred sixty-nine durum lines were genotyped using the Illumina 90K Infinium iSelect assay, and 12,234 polymorphic single nucleotide polymorphism (SNP) markers were generated and used to assess the population structure and the linkage disequilibrium (LD) patterns. A total of 8,581 SNPs previously localized to a high-density consensus map were clustered into 406 haplotype blocks based on the average LD distance of 5.3 cM. Combining multiple SNPs into haplotype blocks increased the average polymorphism information content (PIC) from 0.27 per SNP to 0.50 per haplotype. The haplotype-based analysis identified 12 loci associated with grain pigment colour traits, including the five loci identified by the single marker-based analysis. Furthermore, the haplotype-based analysis resulted in an increase of the phenotypic variance explained (50.4% on average) and the allelic effect (33.7% on average) when compared to single marker analysis. The presence of multiple allelic combinations within each haplotype locus offers potential for screening the most favorable haplotype series and may facilitate marker-assisted selection of grain pigment colour in durum wheat. These results suggest a benefit of haplotype

  4. Hb S [β6(A3)Glu→Val, GAG>GTG] in Mexican Mestizos: frequency and analysis of the 5' β-globin haplotype.

    PubMed

    Guzmán, Luis F; Perea, Francisco J; Magaña, María T; Morales-González, Karina R; Chávez-Velazco, M Luz; Ibarra, Bertha

    2010-01-01

    Between 1978 and 2009, we studied 1,863 Mexican Mestizo patients with clinical data compatible with a hemoglobinopathy. Of these patients, 382 had some hemoglobin (Hb) abnormality (20.5%), and 128 had a sickle cell hemoglobinopathy, representing a general frequency of 6.9%, which is similar to the percentage observed in previous studies on Mexican Mestizos. We analyzed the 5' β-globin haplotype (5'Hp) in 79 unrelated β(S) chromosomes (26 β(S)/β(S), 14 β(S)/β(Thal), nine β(S)/β(A) and four β(S)/β(D)), and four haplotypes were observed: 72.2% CAR, 24.1% Benin, 2.5% Senegal and 1.2% Cameroon; the last two are reported for the first time in Mexico. In some Latin American populations such as Brazil, the Bantu haplotype predominates, while in others such as Jamaica, the Benin haplotype is the most frequent, showing heterogeneity of African genes as a consequence of different regions involved in the slave trade.
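    The counts in this abstract can be cross-checked with a few lines of arithmetic. The sketch below recomputes the two reported percentages and shows how 79 β(S) chromosomes follow from the genotype counts if each β(S)/β(S) individual contributes two β(S) chromosomes and every other genotype contributes one. That counting rule is an assumption made here for illustration; the abstract gives only the totals.

```python
# Figures taken from the abstract
patients = 1863
with_hb_abnormality = 382
with_sickle_hemoglobinopathy = 128

pct_abnormal = round(100 * with_hb_abnormality / patients, 1)
pct_sickle = round(100 * with_sickle_hemoglobinopathy / patients, 1)

# Genotype counts from the abstract, paired with the number of beta(S)
# chromosomes each genotype is assumed to carry
genotypes = {
    "S/S": (26, 2),
    "S/Thal": (14, 1),
    "S/A": (9, 1),
    "S/D": (4, 1),
}
beta_s_chromosomes = sum(n * copies for n, copies in genotypes.values())

print(pct_abnormal, pct_sickle, beta_s_chromosomes)
```

    Under that counting rule the genotype breakdown adds up to the 79 chromosomes stated in the abstract.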

  5. Selection Signature Analysis Implicates the PC1/PCSK1 Region for Chicken Abdominal Fat Content

    PubMed Central

    Wang, Zhipeng; Zhang, Yuandan; Wang, Shouzhi; Wang, Ning; Ma, Li; Leng, Li; Wang, Shengwen; Wang, Qigui; Wang, Yuxiang; Tang, Zhiquan; Li, Ning; Da, Yang; Li, Hui

    2012-01-01

    We conducted a selection signature analysis using the chicken 60k SNP chip in two chicken lines that had been divergently selected for abdominal fat content (AFC) for 11 generations. The selection signature analysis used multiple signals of selection, including long-range allele frequency differences between the lean and fat lines, long-range heterozygosity changes, linkage disequilibrium, haplotype frequencies, and extended haplotype homozygosity. Multiple signals of selection identified ten signatures on chromosomes 1, 2, 4, 5, 11, 15, 20, 26 and Z. The 0.73 Mb PC1/PCSK1 region of the Z chromosome at 55.43-56.16 Mb was the most heavily selected region. This region had 26 SNP markers and seven genes, Mar-03, SLC12A2, FBN2, ERAP1, CAST, PC1/PCSK1 and ELL2, where PC1/PCSK1 are the chicken/human names for the same gene. The lean and fat lines had two main haplotypes with completely opposite SNP alleles for the 26 SNP markers and were virtually line-specific, and had a recombinant haplotype with nearly equal frequency (0.193 and 0.196) in both lines. Other haplotypes in this region had negligible frequencies. Nine other regions with selection signatures were PAH-IGF1, TRPC4, GJD4-CCNY, NDST4, NOVA1, GALNT9, the ESRP2-GALR1 region with five genes, the SYCP2-CADH4 with six genes, and the TULP1-KIF21B with 14 genes. Genome-wide association analysis showed that nearly all regions with evidence of selection signature had SNP effects with genome-wide significance (P<10(-6)) on abdominal fat weight and percentage. The results of this study provide specific gene targets for the control of chicken AFC and a potential model of AFC in human obesity. PMID:22792402

  6. Selection signature analysis implicates the PC1/PCSK1 region for chicken abdominal fat content.

    PubMed

    Zhang, Hui; Hu, Xiaoxiang; Wang, Zhipeng; Zhang, Yuandan; Wang, Shouzhi; Wang, Ning; Ma, Li; Leng, Li; Wang, Shengwen; Wang, Qigui; Wang, Yuxiang; Tang, Zhiquan; Li, Ning; Da, Yang; Li, Hui

    2012-01-01

    We conducted a selection signature analysis using the chicken 60k SNP chip in two chicken lines that had been divergently selected for abdominal fat content (AFC) for 11 generations. The selection signature analysis used multiple signals of selection, including long-range allele frequency differences between the lean and fat lines, long-range heterozygosity changes, linkage disequilibrium, haplotype frequencies, and extended haplotype homozygosity. Multiple signals of selection identified ten signatures on chromosomes 1, 2, 4, 5, 11, 15, 20, 26 and Z. The 0.73 Mb PC1/PCSK1 region of the Z chromosome at 55.43-56.16 Mb was the most heavily selected region. This region had 26 SNP markers and seven genes, Mar-03, SLC12A2, FBN2, ERAP1, CAST, PC1/PCSK1 and ELL2, where PC1/PCSK1 are the chicken/human names for the same gene. The lean and fat lines had two main haplotypes with completely opposite SNP alleles for the 26 SNP markers and were virtually line-specific, and had a recombinant haplotype with nearly equal frequency (0.193 and 0.196) in both lines. Other haplotypes in this region had negligible frequencies. Nine other regions with selection signatures were PAH-IGF1, TRPC4, GJD4-CCNY, NDST4, NOVA1, GALNT9, the ESRP2-GALR1 region with five genes, the SYCP2-CADH4 with six genes, and the TULP1-KIF21B with 14 genes. Genome-wide association analysis showed that nearly all regions with evidence of selection signature had SNP effects with genome-wide significance (P<10(-6)) on abdominal fat weight and percentage. The results of this study provide specific gene targets for the control of chicken AFC and a potential model of AFC in human obesity.

  7. High-resolution HLA haplotype frequencies of stem cell donors in Germany with foreign parentage: how can they be used to improve unrelated donor searches?

    PubMed

    Pingel, Julia; Solloch, Ute V; Hofmann, Jan A; Lange, Vinzenz; Ehninger, Gerhard; Schmidt, Alexander H

    2013-03-01

    In hematopoietic stem cell transplantation, human leukocyte antigens (HLA), usually HLA loci A, B, C, DRB1 and DQB1, are required to check histocompatibility between a potential donor and the recipient suffering from a malignant or non-malignant blood disease. As databases of potential unrelated donors are very heterogeneous with respect to typing resolution and number of typed loci, donor registries make use of haplotype frequency-based algorithms to provide matching probabilities for each potentially matching recipient/donor pair. However, it is well known that HLA allele and haplotype frequencies differ significantly between populations. We estimated high-resolution HLA-A, -B, -C, -DRB1 haplotype and allele frequencies of donors within DKMS German Bone Marrow Donor Center with parentage from 17 different countries: Turkey, Poland, Italy, Russian Federation, Croatia, Greece, Austria, Kazakhstan, France, The Netherlands, Republic of China, Romania, Portugal, USA, Spain, United Kingdom and Bosnia and Herzegovina. 5-locus haplotypes including HLA-DQB1 are presented for Turkey, Poland, Italy and Russian Federation. We calculated linkage disequilibria for each sample. Genetic distances between included countries could be shown to reflect geography. We further demonstrate how genetic differences between populations are reflected in matching probabilities of recipient/donor pairs and how they influence the search for unrelated donors as well as strategic donor center typings. Copyright © 2012 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.
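    The abstract reports that linkage disequilibria were calculated but does not name the statistic or software used. As a rough illustration only, the sketch below computes the standard pairwise measures D, D' and r^2 for one allele at each of two loci, treating each allele as "present vs. any other allele". The function name `ld_stats` and the input frequencies are hypothetical and are not taken from the study.

```python
def ld_stats(p_ab, p_a, p_b):
    """Pairwise linkage disequilibrium for allele a (locus 1) and b (locus 2).

    p_ab: frequency of haplotypes carrying both a and b
    p_a, p_b: marginal frequencies of a and b
    Returns (D, D', r^2).
    """
    if not (0 < p_a < 1 and 0 < p_b < 1):
        raise ValueError("both alleles must be polymorphic")
    d = p_ab - p_a * p_b
    # Largest |D| attainable given the marginal frequencies
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = d / d_max if d_max > 0 else 0.0
    r2 = d * d / (p_a * (1 - p_a) * p_b * (1 - p_b))
    return d, d_prime, r2


# Hypothetical frequencies for one allele pair
d, d_prime, r2 = ld_stats(p_ab=0.30, p_a=0.40, p_b=0.50)
print(f"D = {d:.3f}, D' = {d_prime:.3f}, r^2 = {r2:.3f}")
```

    For multi-allelic loci such as HLA, a per-allele-pair calculation like this is only one of several possible summaries; it is shown here because it needs nothing beyond haplotype and allele frequencies.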

  8. Homozygosity of single nucleotide polymorphisms in the 3' region of the canine estrogen receptor 1 gene is greater in Toy Poodles than in Miniature Dachshunds and Chihuahuas.

    PubMed

    Pathirana, Indunil N; Tanaka, Kakeru; Kawate, Noritoshi; Tsuji, Makoto; Hatoya, Shingo; Inaba, Toshio; Tamada, Hiromichi

    2011-06-01

    Differences in the distribution of single nucleotide polymorphisms (SNPs) and haplotypes in the estrogen receptor α gene (ESR1) were examined in Miniature Dachshunds (n = 48), Chihuahuas (n = 20) and Toy Poodles (n = 18). Five DNA fragments located in the 40-kb region at the 3' end of ESR1 were amplified by polymerase chain reaction and were directly sequenced. We compared allele, genotype and estimated haplotype frequencies at each SNP in the 3' end of ESR1 for these three breeds of small dog. The frequency of the major allele and the genotype frequency of the major allele homozygotes were significantly higher in Toy Poodles than in Miniature Dachshunds for five SNPs (SNPs #5 and #14-17), and significantly higher in Toy Poodles than in Chihuahuas for three SNPs (SNPs #15-17). A common haplotype block was identified in an approximately 20-kb region encompassing four SNPs (SNPs #14-17). The frequencies of the most abundant estimated haplotype (GTTG) and GTTG homozygotes were significantly higher in Toy Poodles than in the other two breeds. These results imply that homozygosity for the allele, genotype and haplotype distribution within the block at the 3' end of ESR1 is greater in Toy Poodles than in Miniature Dachshunds and Chihuahuas. © 2011 The Authors; Animal Science Journal © 2011 Japanese Society of Animal Science.

  9. β-globin haplotypes in normal and hemoglobinopathic individuals from Reconcavo Baiano, State of Bahia, Brazil.

    PubMed

    Dos Santos Silva, Wellington; de Nazaré Klautau-Guimarães, Maria; Grisolia, Cesar Koppe

    2010-07-01

    Five restriction site polymorphisms in the β-globin gene cluster (HincII-5' ε, HindIII-(G) γ, HindIII-(A) γ, HincII- ψβ1 and HincII-3' ψβ1) were analyzed in three populations (n = 114) from Reconcavo Baiano, State of Bahia, Brazil. The groups included two urban populations from the towns of Cachoeira and Maragojipe and one rural Afro-descendant population, known as the "quilombo community", from Cachoeira municipality. The number of haplotypes found in the populations ranged from 10 to 13, which indicated higher diversity than in the parental populations. The haplotypes 2 (+ - - - -), 3 (- - - - +), 4 (- + - - +) and 6 (- + + - +) on the β(A) chromosomes were the most common, and two haplotypes, 9 (- + + + +) and 14 (+ + - - +), were found exclusively in the Maragojipe population. The other haplotypes (1, 5, 9, 11, 12, 13, 14 and 16) had lower frequencies. Restriction site analysis and the derived haplotypes indicated homogeneity among the populations. Thirty-two individuals with hemoglobinopathies (17 sickle cell disease, 12 HbSC disease and 3 HbCC disease) were also analyzed. The haplotype frequencies of these patients differed significantly from those of the general population. In the sickle cell disease subgroup, the predominant haplotypes were BEN (Benin) and CAR (Central African Republic), with frequencies of 52.9% and 32.4%, respectively. The high frequency of the BEN haplotype agreed with the historical origin of the afro-descendant population in the state of Bahia. However, this frequency differed from that of Salvador, the state capital, where the CAR and BEN haplotypes have similar frequencies, probably as a consequence of domestic slave trade and subsequent internal migrations to other regions of Brazil.
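
    The haplotype percentages reported for the sickle cell disease subgroup can be checked with a short calculation. This is a sketch under stated assumptions: the abstract gives only percentages, so the counts of 18 BEN and 11 CAR chromosomes are back-calculated here on the assumption that all 2 × 17 = 34 chromosomes of the 17 patients were typed; the function name is hypothetical.

```python
def haplotype_percentages(counts: dict[str, int]) -> dict[str, float]:
    """Convert haplotype counts into percentages rounded to one decimal place."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no chromosomes counted")
    return {name: round(100 * n / total, 1) for name, n in counts.items()}


n_patients = 17                  # sickle cell disease patients (from the abstract)
n_chromosomes = 2 * n_patients   # assumption: two typed chromosomes per patient

# Assumed counts, chosen because they reproduce the reported percentages.
counts = {"BEN": 18, "CAR": 11, "other": n_chromosomes - 18 - 11}

print(haplotype_percentages(counts))
```

    Under these assumptions the BEN and CAR shares come out at 52.9% and 32.4%, matching the figures in the abstract.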

  10. β-globin haplotypes in normal and hemoglobinopathic individuals from Reconcavo Baiano, State of Bahia, Brazil

    PubMed Central

    2010-01-01

    Five restriction site polymorphisms in the β-globin gene cluster (HincII-5' ε, HindIII-G γ, HindIII-A γ, HincII- ψβ1 and HincII-3' ψβ1) were analyzed in three populations (n = 114) from Reconcavo Baiano, State of Bahia, Brazil. The groups included two urban populations from the towns of Cachoeira and Maragojipe and one rural Afro-descendant population, known as the “quilombo community”, from Cachoeira municipality. The number of haplotypes found in the populations ranged from 10 to 13, which indicated higher diversity than in the parental populations. The haplotypes 2 (+ - - - -), 3 (- - - - +), 4 (- + - - +) and 6 (- + + - +) on the βA chromosomes were the most common, and two haplotypes, 9 (- + + + +) and 14 (+ + - - +), were found exclusively in the Maragojipe population. The other haplotypes (1, 5, 9, 11, 12, 13, 14 and 16) had lower frequencies. Restriction site analysis and the derived haplotypes indicated homogeneity among the populations. Thirty-two individuals with hemoglobinopathies (17 sickle cell disease, 12 HbSC disease and 3 HbCC disease) were also analyzed. The haplotype frequencies of these patients differed significantly from those of the general population. In the sickle cell disease subgroup, the predominant haplotypes were BEN (Benin) and CAR (Central African Republic), with frequencies of 52.9% and 32.4%, respectively. The high frequency of the BEN haplotype agreed with the historical origin of the afro-descendant population in the state of Bahia. However, this frequency differed from that of Salvador, the state capital, where the CAR and BEN haplotypes have similar frequencies, probably as a consequence of domestic slave trade and subsequent internal migrations to other regions of Brazil. PMID:21637405

  11. Haplotype-Based Genome-Wide Prediction Models Exploit Local Epistatic Interactions Among Markers

    PubMed Central

    Jiang, Yong; Schmidt, Renate H.; Reif, Jochen C.

    2018-01-01

    Genome-wide prediction approaches represent versatile tools for the analysis and prediction of complex traits. Mostly they rely on marker-based information, but scenarios have been reported in which models capitalizing on closely-linked markers that were combined into haplotypes outperformed marker-based models. Detailed comparisons were undertaken to reveal under which circumstances haplotype-based genome-wide prediction models are superior to marker-based models. Specifically, it was of interest to analyze whether and how haplotype-based models may take local epistatic effects between markers into account. Assuming that populations consisted of fully homozygous individuals, a marker-based model in which local epistatic effects inside haplotype blocks were exploited (LEGBLUP) was linearly transformable into a haplotype-based model (HGBLUP). This theoretical derivation formally revealed that haplotype-based genome-wide prediction models capitalize on local epistatic effects among markers. Simulation studies corroborated this finding. Due to its computational efficiency, the HGBLUP model promises to be an interesting tool for studies in which ultra-high-density SNP data sets are analyzed. Applying the HGBLUP model to empirical data sets revealed higher prediction accuracies than for marker-based models for both traits studied using a mouse panel. In contrast, only a small subset of the traits analyzed in crop populations showed such a benefit. Cases in which higher prediction accuracies are observed for HGBLUP than for marker-based models are expected to be of immediate relevance for breeders because, owing to the tight linkage, a beneficial haplotype will be preserved for many generations. In this respect, the inheritance of local epistatic effects closely resembles that of additive effects. PMID:29549092

  12. Haplotype-Based Genome-Wide Prediction Models Exploit Local Epistatic Interactions Among Markers.

    PubMed

    Jiang, Yong; Schmidt, Renate H; Reif, Jochen C

    2018-05-04

    Genome-wide prediction approaches represent versatile tools for the analysis and prediction of complex traits. Mostly they rely on marker-based information, but scenarios have been reported in which models capitalizing on closely-linked markers that were combined into haplotypes outperformed marker-based models. Detailed comparisons were undertaken to reveal under which circumstances haplotype-based genome-wide prediction models are superior to marker-based models. Specifically, it was of interest to analyze whether and how haplotype-based models may take local epistatic effects between markers into account. Assuming that populations consisted of fully homozygous individuals, a marker-based model in which local epistatic effects inside haplotype blocks were exploited (LEGBLUP) was linearly transformable into a haplotype-based model (HGBLUP). This theoretical derivation formally revealed that haplotype-based genome-wide prediction models capitalize on local epistatic effects among markers. Simulation studies corroborated this finding. Due to its computational efficiency, the HGBLUP model promises to be an interesting tool for studies in which ultra-high-density SNP data sets are analyzed. Applying the HGBLUP model to empirical data sets revealed higher prediction accuracies than for marker-based models for both traits studied using a mouse panel. In contrast, only a small subset of the traits analyzed in crop populations showed such a benefit. Cases in which higher prediction accuracies are observed for HGBLUP than for marker-based models are expected to be of immediate relevance for breeders because, owing to the tight linkage, a beneficial haplotype will be preserved for many generations. In this respect, the inheritance of local epistatic effects closely resembles that of additive effects. Copyright © 2018 Jiang et al.

  13. Genomic dissection of a ‘Fuji’ apple cultivar: re-sequencing, SNP marker development, definition of haplotypes, and QTL detection

    PubMed Central

    Kunihisa, Miyuki; Moriya, Shigeki; Abe, Kazuyuki; Okada, Kazuma; Haji, Takashi; Hayashi, Takeshi; Kawahara, Yoshihiro; Itoh, Ryutaro; Itoh, Takeshi; Katayose, Yuichi; Kanamori, Hiroyuki; Matsumoto, Toshimi; Mori, Satomi; Sasaki, Harumi; Matsumoto, Takashi; Nishitani, Chikako; Terakami, Shingo; Yamamoto, Toshiya

    2016-01-01

    ‘Fuji’ is one of the most popular and highly-produced apple cultivars worldwide, and has been frequently used in breeding programs. The development of genotypic markers for the preferable phenotypes of ‘Fuji’ is required. Here, we aimed to define the haplotypes of ‘Fuji’ and find associations between haplotypes and phenotypes of five traits (harvest day, fruit weight, acidity, degree of watercore, and flesh mealiness) by using 115 accessions related to ‘Fuji’. Through re-sequencing of the ‘Fuji’ genome, a total of 2,820,759 variants, including single nucleotide polymorphisms (SNPs) and insertions or deletions (indels), were detected between ‘Fuji’ and the ‘Golden Delicious’ reference genome. We selected 1,014 mapping-validated SNPs, most of which were heterozygous in ‘Fuji’ and capable of distinguishing alleles inherited from the parents of ‘Fuji’ (i.e., ‘Ralls Janet’ and ‘Delicious’). We used these SNPs to define the haplotypes of ‘Fuji’ and trace their inheritance in relatives, which were shown to carry an average of 27% of the ‘Fuji’ genome. Analysis of variance (ANOVA) based on ‘Fuji’ haplotypes identified one quantitative trait locus (QTL) each for harvest time, acidity, degree of watercore, and mealiness. A haplotype from ‘Delicious’ chr14 was considered to dominantly cause watercore, and one from ‘Ralls Janet’ chr1 was related to low mealiness. PMID:27795675

  14. Performance of Single Nucleotide Polymorphisms versus Haplotypes for Genome-Wide Association Analysis in Barley

    PubMed Central

    Jannink, Jean-Luc

    2010-01-01

    Genome-wide association studies (GWAS) may benefit from utilizing haplotype information for making marker-phenotype associations. Several rationales for grouping single nucleotide polymorphisms (SNPs) into haplotype blocks exist, but any advantage may depend on such factors as genetic architecture of traits, patterns of linkage disequilibrium in the study population, and marker density. The objective of this study was to explore the utility of haplotypes for GWAS in barley (Hordeum vulgare) to offer a first detailed look at this approach for identifying agronomically important genes in crops. To accomplish this, we used genotype and phenotype data from the Barley Coordinated Agricultural Project and constructed haplotypes using three different methods. Marker-trait associations were tested by the efficient mixed-model association algorithm (EMMA). When QTL were simulated using single SNPs dropped from the marker dataset, a simple sliding window performed as well or better than single SNPs or the more sophisticated methods of blocking SNPs into haplotypes. Moreover, the haplotype analyses performed better 1) when QTL were simulated as polymorphisms that arose subsequent to marker variants, and 2) in analysis of empirical heading date data. These results demonstrate that the information content of haplotypes is dependent on the particular mutational and recombinational history of the QTL and nearby markers. Analysis of the empirical data also confirmed our intuition that the distribution of QTL alleles in nature is often unlike the distribution of marker variants, and hence utilizing haplotype information could capture associations that would elude single SNPs. We recommend routine use of both single SNP and haplotype markers for GWAS to take advantage of the full information content of the genotype data. PMID:21124933
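
    The "simple sliding window" approach to forming haplotypes can be sketched as follows. This is only an illustration of the general idea, assuming phased haplotypes given as allele strings and a fixed window of adjacent SNPs; the function name, window size and toy data are hypothetical and do not reproduce the study's actual procedure or the EMMA association test.

```python
from collections import Counter


def sliding_window_haplotypes(phased: list[str], window: int = 3) -> list[list[str]]:
    """Split phased haplotypes into overlapping windows of adjacent SNPs.

    phased: one string of alleles per chromosome, all of equal length,
            with SNPs ordered along the chromosome.
    Returns one list per window start; each list holds the local haplotype
    carried by every chromosome in that window.
    """
    if not phased:
        raise ValueError("no haplotypes supplied")
    n_snps = len(phased[0])
    if any(len(h) != n_snps for h in phased):
        raise ValueError("all haplotypes must cover the same SNPs")
    if not 1 <= window <= n_snps:
        raise ValueError("window must be between 1 and the number of SNPs")
    return [
        [h[start:start + window] for h in phased]
        for start in range(n_snps - window + 1)
    ]


# Toy data: three chromosomes typed at four SNPs.
toy = ["AACG", "AATG", "GACG"]
for start, local in enumerate(sliding_window_haplotypes(toy, window=2)):
    # Each window yields multi-allelic haplotype "markers" that can be tested
    # for association in place of, or alongside, the single SNPs.
    print(start, dict(Counter(local)))
```

    Each window's haplotype counts could then serve as a multi-allelic marker in whichever association model is used.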

  15. HERC1 polymorphisms: population-specific variations in haplotype composition.

    PubMed

    Yuasa, Isao; Umetsu, Kazuo; Nishimukai, Hiroaki; Fukumori, Yasuo; Harihara, Shinji; Saitou, Naruya; Jin, Feng; Chattopadhyay, Prasanta K; Henke, Lotte; Henke, Jürgen

    2009-08-01

    Human HERC1 is one of six HERC proteins and may play an important role in intracellular membrane trafficking. The human HERC1 gene is suggested to have been affected by local positive selection. To assess the global frequency distributions of coding and non-coding single nucleotide polymorphisms (SNPs) in the HERC1 gene, we developed a new simultaneous genotyping method for four SNPs, and applied this method to investigate 1213 individuals from 12 global populations. The results confirmed marked differences in the allele and haplotype frequencies between East Asian and non-East Asian populations. One of the three common haplotypes observed was found to be characteristic of East Asians, who showed a relatively uniform distribution of haplotypes. Information on haplotypes would be useful for testing the function of polymorphisms in the HERC1 gene. This is the first study to investigate the distribution of HERC1 polymorphisms in various populations. (c) 2009 John Wiley & Sons, Ltd.

  16. Detecting local haplotype sharing and haplotype association

    USDA-ARS's Scientific Manuscript database

    A novel haplotype association method is presented, and its power is demonstrated. Relying on a statistical model for linkage disequilibrium (LD), the method first infers ancestral haplotypes and their loadings at each marker for each individual. The loadings are then used to quantify local haplotype...

  17. The Role of Osteopontin (OPN/SPP1) Haplotypes in the Susceptibility to Crohn's Disease

    PubMed Central

    Bayrle, Corinna; Wetzke, Martin; Fries, Christoph; Tillack, Cornelia; Olszak, Torsten; Beigel, Florian; Steib, Christian; Friedrich, Matthias; Diegelmann, Julia; Czamara, Darina; Brand, Stephan

    2011-01-01

    Background Osteopontin represents a multifunctional molecule playing a pivotal role in chronic inflammatory and autoimmune diseases. Its expression is increased in inflammatory bowel disease (IBD). The aim of our study was to analyze the association of osteopontin (OPN/SPP1) gene variants in a large cohort of IBD patients. Methodology/Principal Findings Genomic DNA from 2819 Caucasian individuals (n = 841 patients with Crohn's disease (CD), n = 473 patients with ulcerative colitis (UC), and n = 1505 healthy unrelated controls) was analyzed for nine OPN SNPs (rs2728127, rs2853744, rs11730582, rs11739060, rs28357094, rs4754 = p.Asp80Asp, rs1126616 = p.Ala236Ala, rs1126772 and rs9138). Considering the important role of osteopontin in Th17-mediated diseases, we performed analysis for epistasis with IBD-associated IL23R variants and analyzed serum levels of the Th17 cytokine IL-22. For four OPN SNPs (rs4754, rs1126616, rs1126772 and rs9138), we observed significantly different distributions between male and female CD patients. rs4754 was protective in male CD patients (p = 0.0004, OR = 0.69). None of the other investigated OPN SNPs was associated with CD or UC susceptibility. However, several OPN haplotypes showed significant associations with CD susceptibility. The strongest association was found for a haplotype consisting of the 8 OPN SNPs rs2728127-rs2853744-rs11730582-rs11439060-rs28357094-rs112661-rs1126772-rs9138 (omnibus p-value = 2.07×10^-8). Overall, the mean IL-22 secretion in the combined group of OPN minor allele carriers with CD was significantly lower than that of CD patients with OPN wildtype alleles (p = 3.66×10^-5). There was evidence for weak epistasis between the OPN SNP rs28357094 and the IL23R SNP rs10489629 (p = 4.18×10^-2) and between OPN SNP rs1126616 and IL23R SNP rs2201841 (p = 4.18×10^-2) but none of these associations remained significant after Bonferroni correction. Conclusions

  18. Genetic dissection of powdery mildew resistance in interspecific half-sib grapevine families using SNP-based maps.

    PubMed

    Teh, Soon Li; Fresnedo-Ramírez, Jonathan; Clark, Matthew D; Gadoury, David M; Sun, Qi; Cadle-Davidson, Lance; Luby, James J

    2017-01-01

    Quantitative trait locus (QTL) identification in perennial fruit crops is impeded largely by their lengthy generation time, resulting in costly and labor-intensive maintenance of breeding programs. In a grapevine (genus Vitis) breeding program, although experimental families are typically unreplicated, the genetic backgrounds may contain similar progenitors previously selected due to their contribution of favorable alleles. In this study, we investigated the utility of joint QTL identification provided by analyzing half-sib families. The genetic control of powdery mildew was studied using two half-sib F1 families, namely GE0711/1009 (MN1264 × MN1214; N = 147) and GE1025 (MN1264 × MN1246; N = 125) with multiple species in their ancestry. Maternal genetic maps consisting of 1077 and 1641 single nucleotide polymorphism (SNP) markers, respectively, were constructed using a pseudo-testcross strategy. Ratings of field resistance to powdery mildew were obtained based on whole-plant evaluation of disease severity. This 2-year analysis uncovered two QTLs that were validated on a consensus map in these half-sib families with improved precision relative to the parental maps. Examination of haplotype combinations based on the two QTL regions identified strong association of haplotypes inherited from 'Seyval blanc', through MN1264, with powdery mildew resistance. This investigation also encompassed the use of microsatellite markers to establish a correlation between 206-bp (UDV-015b) and 357-bp (VViv67) fragment sizes with resistance-carrying haplotypes. Our work is one of the first reports in grapevine demonstrating the use of SNP-based maps and haplotypes for QTL identification and tagging of powdery mildew resistance in half-sib families.

  19. LS-SNP: large-scale annotation of coding non-synonymous SNPs based on multiple information sources.

    PubMed

    Karchin, Rachel; Diekhans, Mark; Kelly, Libusha; Thomas, Daryl J; Pieper, Ursula; Eswar, Narayanan; Haussler, David; Sali, Andrej

    2005-06-15

    The NCBI dbSNP database lists over 9 million single nucleotide polymorphisms (SNPs) in the human genome, but currently contains limited annotation information. SNPs that result in amino acid residue changes (nsSNPs) are of critical importance in variation between individuals, including disease and drug sensitivity. We have developed LS-SNP, a genomic scale software pipeline to annotate nsSNPs. LS-SNP comprehensively maps nsSNPs onto protein sequences, functional pathways and comparative protein structure models, and predicts positions where nsSNPs destabilize proteins, interfere with the formation of domain-domain interfaces, have an effect on protein-ligand binding or severely impact human health. It currently annotates 28,043 validated SNPs that produce amino acid residue substitutions in human proteins from the SwissProt/TrEMBL database. Annotations can be viewed via a web interface either in the context of a genomic region or by selecting sets of SNPs, genes, proteins or pathways. These results are useful for identifying candidate functional SNPs within a gene, haplotype or pathway and in probing molecular mechanisms responsible for functional impacts of nsSNPs. Availability: http://www.salilab.org/LS-SNP. Contact: rachelk@salilab.org. Supplementary information: http://salilab.org/LS-SNP/supp-info.pdf.

  20. SNP ID-info: SNP ID searching and visualization platform.

    PubMed

    Yang, Cheng-Hong; Chuang, Li-Yeh; Cheng, Yu-Huei; Wen, Cheng-Hao; Chang, Phei-Lang; Chang, Hsueh-Wei

    2008-09-01

    Many association studies report relationships between single nucleotide polymorphisms (SNPs) and diseases or cancers without giving an SNP ID. Here, we developed the SNP ID-info freeware to retrieve SNP IDs from input genetic and physical information of genomes. The program provides an "SNP-ePCR" function to generate the full sequence from primer and template inputs. In "SNPosition," a sequence from SNP-ePCR or direct input is matched against SNP fasta sequences to find the SNP IDs. The "SNP search" and "SNP fasta" functions accept cytogenetic band, contig position, and keyword inputs. Finally, the neighboring environment of the SNP IDs for the inputs is fully visualized in order of contig position and marked with SNP and flanking hits. The SNP identification problems inherent in NCBI SNP BLAST are also avoided. In conclusion, SNP ID-info provides a visualized SNP ID environment for multiple inputs and assists systematic SNP association studies. The server and user manual are available at http://bio.kuas.edu.tw/snpid-info.

  1. RTEL1 tagging SNPs and haplotypes were associated with glioma development.

    PubMed

    Li, Gang; Jin, Tianbo; Liang, Hongjuan; Zhang, Zhiguo; He, Shiming; Tu, Yanyang; Yang, Haixia; Geng, Tingting; Cui, Guangbin; Chen, Chao; Gao, Guodong

    2013-05-17

    Glioma is the most prevalent primary solid tumor of the central nervous system, and certain single-nucleotide polymorphisms (SNPs) may be related to increased glioma risk and have implications in carcinogenesis. The present case-control study was carried out to elucidate how common variants contribute to glioma susceptibility. Ten candidate tagging SNPs (tSNPs) with a minor allele frequency (MAF) > 5% in the HapMap Asian population were selected from seven genes whose polymorphisms have been reported in the literature and in reliable databases to be related to gliomas. The selected tSNPs were genotyped in 629 glioma patients and 645 controls from a Han Chinese population using a calibrated multiplexed SNP MassEXTEND assay. Two tSNPs in the RTEL1 gene were significantly associated with glioma risk by χ2 test (rs6010620, P=0.0016, OR: 1.32, 95% CI: 1.11-1.56; rs2297440, P=0.001, OR: 1.33, 95% CI: 1.12-1.58). The "GG" genotype of rs6010620 was identified as a protective genotype for glioma (OR, 0.46; 95% CI, 0.31-0.7; P=0.0002), as was the "CC" genotype of rs2297440 (OR, 0.47; 95% CI, 0.31-0.71; P=0.0003). Furthermore, haplotype "GCT" in the RTEL1 gene was associated with glioma risk (OR, 0.7; 95% CI, 0.57-0.86; Fisher's P=0.0005; Pearson's P=0.0005), as was haplotype "ATT" (OR, 1.32; 95% CI, 1.12-1.57; Fisher's P=0.0013; Pearson's P=0.0013). Two single variants in the RTEL1 gene (the "GG" genotype of rs6010620 and the "CC" genotype of rs2297440), together with the two haplotypes GCT and ATT, were thus identified as associated with glioma development. Screening for these RTEL1 tagging SNPs and haplotypes might therefore be used to evaluate the risk of glioma development. The virtual slides for this article can be found here: http://www.diagnosticpathology.diagnomx.eu/vs/1993021136961998.
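
    Odds ratios with 95% confidence intervals, as reported throughout this abstract, can be computed from a 2 × 2 case-control table. The sketch below uses the standard log-odds (Woolf) approximation; the abstract does not state which interval method the authors used, and the counts are invented for illustration because the underlying genotype tables are not given.

```python
import math


def odds_ratio_ci(a: int, b: int, c: int, d: int, z: float = 1.96) -> tuple[float, float, float]:
    """Odds ratio and Wald-type confidence interval from a 2 x 2 table.

    a: cases carrying the variant      b: cases not carrying it
    c: controls carrying the variant   d: controls not carrying it
    z: normal quantile (1.96 for a 95% interval)
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cell counts must be positive")
    odds_ratio = (a * d) / (b * c)
    # Standard error of ln(OR) under the Woolf approximation.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Invented counts, not taken from the study.
print(odds_ratio_ci(20, 80, 10, 90))
```

    An interval that excludes 1, as for the RTEL1 variants above, corresponds to a nominally significant association at the chosen level.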

  2. Haplotype defined by the MLH1-93G/A polymorphism is associated with MLH1 promoter hypermethylation in sporadic colorectal cancers.

    PubMed

    Miyakura, Yasuyuki; Tahara, Makiko; Lefor, Alan T; Yasuda, Yoshikazu; Sugano, Kokichi

    2014-11-24

    Methylation of the MLH1 promoter region has been suggested to be a major mechanism of gene inactivation in sporadic microsatellite instability-positive (MSI-H) colorectal cancers (CRCs). Recently, single-nucleotide polymorphism (SNP) in the MLH1 promoter region (MLH1-93G/A; rs1800734) has been proposed to be associated with MLH1 promoter methylation, loss of MLH1 protein expression and MSI-H tumors. We examined the association of MLH1-93G/A and six other SNPs surrounding MLH1-93G/A with the methylation status in 210 consecutive sporadic CRCs in Japanese patients. Methylation of the MLH1 promoter region was evaluated by Na-bisulfite polymerase chain reaction (PCR)/single-strand conformation polymorphism (SSCP) analysis. The genotype frequencies of SNPs located in the 54-kb region surrounding the MLH1-93G/A SNP were examined by SSCP analysis. Methylation of the MLH1 promoter region was observed in 28.6% (60/210) of sporadic CRCs. The proportions of MLH1-93G/A genotypes A/A, A/G and G/G were 26% (n=54), 51% (n=108) and 23% (n=48), respectively, and they were significantly associated with the methylation status (p=0.01). There were no significant associations between genotype frequency of the six other SNPs and methylation status. The A-allele of MLH1-93G/A was more common in cases with methylation than the G-allele (p=0.0094), especially in females (p=0.0067). In logistic regression, the A/A genotype of the MLH1-93G/A SNP was shown to be the most significant risk factor for methylation of the MLH1 promoter region (odds ratio 2.82, p=0.003). Furthermore, a haplotype of the A-allele of rs2276807 located -47 kb upstream from the MLH1-93G/A SNP and the A-allele of MLH1-93G/A SNP was significantly associated with MLH1 promoter methylation. These results indicate that individuals, and particularly females, carrying the A-allele at the MLH1-93G/A SNP, especially in association with the A-allele of rs2276807, may harbor an increased risk of methylation of the MLH1 promoter
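
    The genotype proportions quoted for MLH1-93G/A can be reproduced from the counts in the abstract, and the A-allele frequency follows by allele counting. This is a minimal sketch: the counts (54, 108, 48) come from the abstract, while the allele frequency is derived here and is not reported in the source.

```python
# Genotype counts for MLH1-93G/A among 210 sporadic CRCs (from the abstract).
genotype_counts = {"A/A": 54, "A/G": 108, "G/G": 48}

n_individuals = sum(genotype_counts.values())

# Genotype proportions as whole percentages, as quoted in the abstract.
genotype_pct = {g: round(100 * n / n_individuals) for g, n in genotype_counts.items()}

# Allele counting: each A/A carries two A alleles, each A/G carries one.
a_alleles = 2 * genotype_counts["A/A"] + genotype_counts["A/G"]
freq_a = a_alleles / (2 * n_individuals)

print(n_individuals, genotype_pct, round(freq_a, 3))
```

    The derived value is the pooled A-allele frequency across all 210 tumors, independent of methylation status.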

  3. HLA-A, HLA-B, and HLA-DRB1 Allele and Haplotype Frequencies in Renal Transplant Candidates in a Population in Southern Brazil.

    PubMed

    Saito, Patrícia Keiko; Yamakawa, Roger Haruki; Noguti, Erika Noda; Bedendo, Gustavo Borelli; Júnior, Waldir Veríssimo da Silva; Yamada, Sérgio Seiji; Borelli, Sueli Donizete

    2016-05-01

    Very few studies have examined the diversity of human leukocyte antigens (HLA) in the Brazilian renal transplant candidates. The frequencies of the HLA-A, HLA-B, and HLA-DRB1 alleles, haplotypes and phenotypes were studied in 522 patients with chronic renal failure, renal transplant candidates, registered at the Transplant Centers in north/northwestern Paraná State, southern Brazil. Patients were classified according to the ethnic group (319 whites [Caucasians], 134 mestizos [mixed race descendants of Europeans, Africans, and Amerindians; browns or "pardos"] and 69 blacks). The HLA typing was performed by the polymerase chain reaction sequence-specific oligonucleotide method (PCR-SSO), combined with Luminex technology. In the analysis of the total samples, 20 HLA-A, 32 HLA-B, and 13 HLA-DRB1 allele groups were identified. The most frequent allele groups for each HLA locus were HLA-A*02 (25.4%), HLA-B*44 (10.9%), and HLA-DRB1*13 (13.9%). The most frequent haplotypes were HLA-A*01-B*08-DRB1*03 (2.3%), A*02-B*44-DRB1*07 (1.2%), and A*03-B*07-DRB1*11 (1.0%). Significant differences (P < 0.05) were observed in the HLA-A*68, B*08, and B*58 allele frequencies among ethnic groups. This study provides the first data on the HLA-A, HLA-B, and HLA-DRB1 allele, phenotype and haplotype frequencies of renal transplant candidates in a population in southern Brazil. © 2015 Wiley Periodicals, Inc.

  4. Genomic association for sexual precocity in beef heifers using pre-selection of genes and haplotype reconstruction

    PubMed Central

    Barbero, Marina M. D.; Oliveira, Henrique N.; de Camargo, Gregório M. F.; Fernandes Júnior, Gerardo A.; Aspilcueta-Borquis, Rusbel R.; Souza, Fabio R. P.; Boligon, Arione A.; Melo, Thaise P.; Regatieri, Inaê C.; Feitosa, Fabieli L. B.; Fonseca, Larissa F. S.; Magalhães, Ana F. B.; Costa, Raphael B.; Albuquerque, Lucia G.

    2018-01-01

    Reproductive traits are of the utmost importance for any livestock farming, but are difficult to measure and to interpret since they are influenced by various factors. The objective of this study was to detect associations between known polymorphisms in candidate genes related to sexual precocity in Nellore heifers, which could be used in breeding programs. Records of 1,689 precocious and non-precocious heifers from farms participating in the Conexão Delta G breeding program were analyzed. A subset of single nucleotide polymorphisms (SNPs) located in the region of the candidate genes, at a distance of up to 5 kb from the boundaries of each gene, was selected from the panel of 777,000 SNPs of the High-Density Bovine SNP BeadChip. Linear mixed models were used for statistical analysis of early heifer pregnancy, relating the trait with isolated SNPs or with haplotype groups. The model included the contemporary group (year and month of birth) as fixed effect and parent of the animal (sire effect) as random effect. The fastPHASE® and GenomeStudio® programs were used for reconstruction of the haplotypes and for analysis of linkage disequilibrium based on the r² statistic. A total of 125 candidate genes and 2,024 SNPs forming haplotypes were analyzed. Statistical analysis after Bonferroni correction showed that nine haplotypes exerted a significant effect (p<0.05) on sexual precocity. Four of these haplotypes were located in the Pregnancy-associated plasma protein-A2 gene (PAPP-A2), two in the Estrogen-related receptor gamma gene (ESRRG), and one each in the Pregnancy-associated plasma protein-A (PAPP-A), Kell blood group complex subunit-related family (XKR4) and mannose-binding lectin (MBL-1) genes. Although the present results indicate that the PAPP-A2, PAPP-A, XKR4, MBL-1 and ESRRG genes influence sexual precocity in Nellore heifers, further studies are needed to evaluate their possible use in breeding programs. PMID:29293544
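
    The Bonferroni correction applied to the haplotype tests can be sketched as below. The abstract does not give the raw p-values or the exact number of tests, so the inputs here are invented and the function name is hypothetical.

```python
def bonferroni(p_values: list[float], alpha: float = 0.05) -> list[tuple[float, bool]]:
    """Bonferroni-adjust p-values and flag those significant at level alpha.

    Each p-value is multiplied by the number of tests and capped at 1.
    """
    m = len(p_values)
    adjusted = [min(1.0, p * m) for p in p_values]
    return [(p_adj, p_adj < alpha) for p_adj in adjusted]


# Invented raw p-values for three haplotype tests.
for p_adj, significant in bonferroni([0.001, 0.02, 0.5]):
    print(round(p_adj, 3), significant)
```

    With many haplotypes tested, the multiplier grows accordingly, which is why only a handful of haplotypes typically remain significant after correction.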

  5. Genomic association for sexual precocity in beef heifers using pre-selection of genes and haplotype reconstruction.

    PubMed

    Takada, Luciana; Barbero, Marina M D; Oliveira, Henrique N; de Camargo, Gregório M F; Fernandes Júnior, Gerardo A; Aspilcueta-Borquis, Rusbel R; Souza, Fabio R P; Boligon, Arione A; Melo, Thaise P; Regatieri, Inaê C; Feitosa, Fabieli L B; Fonseca, Larissa F S; Magalhães, Ana F B; Costa, Raphael B; Albuquerque, Lucia G

    2018-01-01

    Reproductive traits are of the utmost importance for any livestock farming, but are difficult to measure and to interpret since they are influenced by various factors. The objective of this study was to detect associations between known polymorphisms in candidate genes related to sexual precocity in Nellore heifers, which could be used in breeding programs. Records of 1,689 precocious and non-precocious heifers from farms participating in the Conexão Delta G breeding program were analyzed. A subset of single nucleotide polymorphisms (SNPs) located in the region of the candidate genes, at a distance of up to 5 kb from the boundaries of each gene, was selected from the panel of 777,000 SNPs of the High-Density Bovine SNP BeadChip. Linear mixed models were used for statistical analysis of early heifer pregnancy, relating the trait with isolated SNPs or with haplotype groups. The model included the contemporary group (year and month of birth) as fixed effect and parent of the animal (sire effect) as random effect. The fastPHASE® and GenomeStudio® programs were used for reconstruction of the haplotypes and for analysis of linkage disequilibrium based on the r² statistic. A total of 125 candidate genes and 2,024 SNPs forming haplotypes were analyzed. Statistical analysis after Bonferroni correction showed that nine haplotypes exerted a significant effect (p<0.05) on sexual precocity. Four of these haplotypes were located in the Pregnancy-associated plasma protein-A2 gene (PAPP-A2), two in the Estrogen-related receptor gamma gene (ESRRG), and one each in the Pregnancy-associated plasma protein-A (PAPP-A), Kell blood group complex subunit-related family (XKR4) and mannose-binding lectin (MBL-1) genes. Although the present results indicate that the PAPP-A2, PAPP-A, XKR4, MBL-1 and ESRRG genes influence sexual precocity in Nellore heifers, further studies are needed to evaluate their possible use in breeding programs.

  6. A genome-wide association study of production traits in a commercial population of Large White pigs: evidence of haplotypes affecting meat quality

    PubMed Central

    2014-01-01

    Background Numerous quantitative trait loci (QTL) have been detected in pigs over the past 20 years using microsatellite markers. However, due to the low density of these markers, the accuracy of QTL location has generally been poor. Since 2009, the dense genome coverage provided by the Illumina PorcineSNP60 BeadChip has made it possible to more accurately map QTL using genome-wide association studies (GWAS). Our objective was to perform high-density GWAS in order to identify genomic regions and corresponding haplotypes associated with production traits in a French Large White population of pigs. Methods Animals (385 Large White pigs from 106 sires) were genotyped using the PorcineSNP60 BeadChip and evaluated for 19 traits related to feed intake, growth, carcass composition and meat quality. Of the 64 432 SNPs on the chip, 44 412 were used for GWAS with an animal mixed model that included a regression coefficient for the tested SNPs and a genomic kinship matrix. SNP haplotype effects in QTL regions were then tested for association with phenotypes following phase reconstruction based on the Sscrofa10.2 pig genome assembly. Results Twenty-three QTL regions were identified on autosomes and their effects ranged from 0.25 to 0.75 phenotypic standard deviation units for feed intake and feed efficiency (four QTL), carcass (12 QTL) and meat quality traits (seven QTL). The 10 most significant QTL regions had effects on carcass (chromosomes 7, 10, 16, 17 and 18) and meat quality traits (two regions on chromosome 1 and one region on chromosomes 8, 9 and 13). Thirteen of the 23 QTL regions had not been previously described. A haplotype block of 183 kb on chromosome 1 (six SNPs) was identified and displayed three distinct haplotypes with significant (0.0001 < P < 0.03) associations with all evaluated meat quality traits. Conclusions GWAS analyses with the PorcineSNP60 BeadChip enabled the detection of 23 QTL regions that affect feed consumption, carcass and meat

  7. SNP Data Quality Control in a National Beef and Dairy Cattle System and Highly Accurate SNP Based Parentage Verification and Identification

    PubMed Central

    McClure, Matthew C.; McCarthy, John; Flynn, Paul; McClure, Jennifer C.; Dair, Emma; O'Connell, D. K.; Kearney, John F.

    2018-01-01

    A major use of genetic data is parentage verification and identification as inaccurate pedigrees negatively affect genetic gain. Since 2012 the international standard for single nucleotide polymorphism (SNP) verification in Bos taurus cattle has been the ISAG SNP panels. While these ISAG panels provide an increased level of parentage accuracy over microsatellite markers (MS), they can validate the wrong parent at ≤1% misconcordance rate levels, indicating that more SNP are needed if a more accurate pedigree is required. With rapidly increasing numbers of cattle being genotyped in Ireland that represent 61 B. taurus breeds from a wide range of farm types: beef/dairy, AI/pedigree/commercial, purebred/crossbred, and large to small herd size the Irish Cattle Breeding Federation (ICBF) analyzed different SNP densities to determine that at a minimum ≥500 SNP are needed to consistently predict only one set of parents at a ≤1% misconcordance rate. For parentage validation and prediction ICBF uses 800 SNP (ICBF800) selected based on SNP clustering quality, ISAG200 inclusion, call rate (CR), and minor allele frequency (MAF) in the Irish cattle population. Large datasets require sample and SNP quality control (QC). Most publications only deal with SNP QC via CR, MAF, parent-progeny conflicts, and Hardy-Weinberg deviation, but not sample QC. We report here parentage, SNP QC, and a genomic sample QC pipelines to deal with the unique challenges of >1 million genotypes from a national herd such as SNP genotype errors from mis-tagging of animals, lab errors, farm errors, and multiple other issues that can arise. We divide the pipeline into two parts: a Genotype QC and an Animal QC pipeline. The Genotype QC identifies samples with low call rate, missing or mixed genotype classes (no BB genotype or ABTG alleles present), and low genotype frequencies. The Animal QC handles situations where the genotype might not belong to the listed individual by identifying: >1 non

  8. Association Between Chloroplast DNA and Mitochondrial DNA Haplotypes in Prunus spinosa L. (Rosaceae) Populations across Europe

    PubMed Central

    MOHANTY, APARAJITA; MARTÍN, JUAN PEDRO; GONZÁLEZ, LUIS MIGUEL; AGUINAGALDE, ITZIAR

    2003-01-01

    Chloroplast DNA (cpDNA) and mitochondrial DNA (mtDNA) were studied in 24 populations of Prunus spinosa sampled across Europe. The cpDNA and mtDNA fragments were amplified using universal primers and subsequently digested with restriction enzymes to obtain the polymorphisms. Combinations of all the polymorphisms resulted in 33 cpDNA haplotypes and two mtDNA haplotypes. Strict association between the cpDNA haplotypes and the mtDNA haplotypes was detected in most cases, indicating conjoint inheritance of the two genomes. The most frequent and abundant cpDNA haplotype (C20; frequency, 51 %) is always associated with the more frequent and abundant mtDNA haplotype (M1; frequency, 84 %). All but two of the cpDNA haplotypes associated with the less frequent mtDNA haplotype (M2) are private haplotypes. These private haplotypes are phylogenetically related but geographically unrelated. They form a separate cluster on the minimum‐length spanning tree. PMID:14534199

  9. Forensic SNP Genotyping with SNaPshot: Development of a Novel In-house SBE Multiplex SNP Assay.

    PubMed

    Zar, Mian Sahib; Shahid, Ahmad Ali; Shahzad, Muhammad Saqib; Shin, Kyoung-Jin; Lee, Hwan Young; Lee, Sang-Seob; Israr, Muhammad; Wiegand, Peter; Kulstein, Galina

    2018-04-10

    This study introduces a newly developed in-house SNaPshot single-base extension (SBE) multiplex assay for forensic single nucleotide polymorphism (SNP) genotyping of fresh and degraded samples. The assay was validated with fresh blood samples from four different populations. In addition, 24 samples from skeletal remains were analyzed with the multiplex. Full SNP profiles could be obtained from 14 specimens, while ten remains showed partial SNP profiles. Minor allele frequencies (MAF) of bone samples and different populations were compared and used for association of skeletal remains with a certain population. The results reveal that the SNPs of the bone samples are genetically close to the Pathan population. The findings show that the new multiplex system can be utilized for SNP genotyping of degraded and forensically relevant skeletal material, providing additional investigative leads in criminal cases. © 2018 American Academy of Forensic Sciences.
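
    As a small illustration of the minor allele frequency (MAF) comparison described above, the following sketch computes MAF for one biallelic SNP from genotype counts. The function name and the counts are hypothetical; the abstract does not report per-SNP genotype counts.

```python
def minor_allele_frequency(n_aa, n_ab, n_bb):
    """Return the MAF of a biallelic SNP from counts of AA, AB and BB genotypes."""
    total_alleles = 2 * (n_aa + n_ab + n_bb)
    if total_alleles == 0:
        raise ValueError("no genotypes supplied")
    # Each AA genotype carries two A alleles, each AB genotype carries one.
    freq_a = (2 * n_aa + n_ab) / total_alleles
    # The minor allele is whichever of A or B is rarer.
    return min(freq_a, 1 - freq_a)


# Hypothetical genotype counts for a single SNP in 100 individuals.
print(minor_allele_frequency(10, 20, 70))
```

    Partial profiles, as obtained for ten of the skeletal remains, would simply contribute fewer genotypes to the counts at the affected SNPs.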

  10. ABO, Rhesus, and Kell Antigens, Alleles, and Haplotypes in West Bengal, India

    PubMed Central

    Basu, Debapriya; Datta, Suvro Sankha; Montemayor, Celina; Bhattacharya, Prasun; Mukherjee, Krishnendu; Flegel, Willy A.

    2018-01-01

    Background Few studies have documented the blood group antigens in the population of eastern India. Frequencies of some common alleles and haplotypes were unknown. We describe phenotype, allele, and haplotype frequencies in the state of West Bengal, India. Methods We tested 1,528 blood donors at the Medical College Hospital, Kolkata. The common antigens of the ABO, Rhesus, and Kell blood group systems were determined by standard serologic methods in tubes. Allele and haplotype frequencies were calculated with an iterative method that yielded maximum-likelihood estimates under the assumption of a Hardy-Weinberg equilibrium. Results The prevalence of ABO antigens were B (34%), O (32%), A (25%), and AB (9%) with ABO allele frequencies for O = 0.567, A = 0.189, and B = 0.244. The D antigen (RH1) was observed in 96.6% of the blood donors with RH haplotype frequencies, such as for CDe = 0.688809, cde = 0.16983 and CdE = 0.000654. The K antigen (K1) was observed in 12 donors (0.79%) with KEL allele frequencies for K = 0.004 and k = 0.996. Conclusions: For the Bengali population living in the south of West Bengal, we established the frequencies of the major clinically relevant antigens in the ABO, Rhesus, and Kell blood group systems and derived estimates for the underlying ABO and KEL alleles and RH haplotypes. Such blood donor screening will improve the availability of compatible red cell units for transfusion. Our approach using widely available routine methods can readily be applied in other regions, where the sufficient supply of blood typed for the Rh and K antigens is lacking. PMID:29593462
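
    The iterative maximum-likelihood estimation under Hardy-Weinberg equilibrium mentioned above can be sketched with the classic gene-counting (EM) scheme for the ABO system. This is an illustrative sketch, not necessarily the authors' procedure; the function name `abo_em` is hypothetical, and the input uses the rounded phenotype percentages from the abstract rather than the raw donor counts.

```python
def abo_em(n_a, n_b, n_ab, n_o, tol=1e-10, max_iter=1000):
    """Estimate ABO allele frequencies (A, B, O) from phenotype counts by EM."""
    n = n_a + n_b + n_ab + n_o
    p = q = r = 1.0 / 3.0  # starting values for A, B and O
    for _ in range(max_iter):
        # E-step: split phenotypes A and B into homozygotes and O-carriers.
        n_aa = n_a * (p * p) / (p * p + 2 * p * r)
        n_ao = n_a - n_aa
        n_bb = n_b * (q * q) / (q * q + 2 * q * r)
        n_bo = n_b - n_bb
        # M-step: count alleles among the 2n chromosomes.
        p_new = (2 * n_aa + n_ao + n_ab) / (2 * n)
        q_new = (2 * n_bb + n_bo + n_ab) / (2 * n)
        r_new = (n_ao + n_bo + 2 * n_o) / (2 * n)
        converged = max(abs(p_new - p), abs(q_new - q), abs(r_new - r)) < tol
        p, q, r = p_new, q_new, r_new
        if converged:
            break
    return p, q, r


# Rounded phenotype proportions from the abstract: A 25%, B 34%, AB 9%, O 32%.
p_a, p_b, p_o = abo_em(0.25, 0.34, 0.09, 0.32)
print(round(p_a, 3), round(p_b, 3), round(p_o, 3))
```

    Because the inputs are rounded percentages, the estimates should land close to, but not necessarily exactly on, the allele frequencies reported in the abstract.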

  11. Software for optimization of SNP and PCR-RFLP genotyping to discriminate many genomes with the fewest assays

    PubMed Central

    Gardner, Shea N; Wagner, Mark C

    2005-01-01

    Background Microbial forensics is important in tracking the source of a pathogen, whether the disease is a naturally occurring outbreak or part of a criminal investigation. Results A method and SPR Opt (SNP and PCR-RFLP Optimization) software to perform a comprehensive, whole-genome analysis to forensically discriminate multiple sequences is presented. Tools for the optimization of forensic typing using Single Nucleotide Polymorphism (SNP) and PCR-Restriction Fragment Length Polymorphism (PCR-RFLP) analyses across multiple isolate sequences of a species are described. The PCR-RFLP analysis includes prediction and selection of optimal primers and restriction enzymes to enable maximum isolate discrimination based on sequence information. SPR Opt calculates all SNP or PCR-RFLP variations present in the sequences, groups them into haplotypes according to their co-segregation across those sequences, and performs combinatoric analyses to determine which sets of haplotypes provide maximal discrimination among all the input sequences. Those set combinations requiring that membership in the fewest haplotypes be queried (i.e. the fewest assays be performed) are found. These analyses highlight variable regions based on existing sequence data. These markers may be heterogeneous among unsequenced isolates as well, and thus may be useful for characterizing the relationships among unsequenced as well as sequenced isolates. The predictions are multi-locus. Analyses of mumps and SARS viruses are summarized. Phylogenetic trees created based on SNPs, PCR-RFLPs, and full genomes are compared for SARS virus, illustrating that purported phylogenies based only on SNP or PCR-RFLP variations do not match those based on multiple sequence alignment of the full genomes. Conclusion This is the first software to optimize the selection of forensic markers to maximize information gained from the fewest assays, accepting whole or partial genome sequence data as input. As more sequence data becomes

  12. Haplotypes in the CRP Gene Associated with Increased BMI and Levels of CRP in Subjects with Type 2 Diabetes or Obesity from Southwestern Mexico

    PubMed Central

    Martínez-Calleja, América; Quiróz-Vargas, Irma; Parra-Rojas, Isela; Muñoz-Valle, José Francisco; Leyva-Vázquez, Marco A.; Fernández-Tilapa, Gloria; Vences-Velázquez, Amalia; Cruz, Miguel; Salazar-Martínez, Eduardo; Flores-Alfaro, Eugenia

    2012-01-01

    Objective. We evaluated the association between four polymorphisms in the CRP gene with circulating levels of C-reactive protein (CRP), type 2 diabetes (T2D), obesity, and risk score of coronary heart disease. Methods. We studied 402 individuals and classified them into four groups: healthy, obese, T2D obese, and T2D without obesity, from Guerrero, Southwestern Mexico. Blood levels of CRP, glucose, cholesterol, triglycerides, and leukocytes were measured. Genotyping was performed by PCR/RFLP, and the risk score for coronary heart disease was determined by the Framingham's methodology. Results. The TT genotype of SNP rs1130864 was associated with increased body mass index and T2D patients with obesity. We found that the haplotype 2 (TGAG) was associated with increased levels of CRP (β = 0.3; 95%CI: 0.1, 0.5; P = 0.005) and haplotype 7 (TGGG) with higher body mass index (BMI) (β = 0.2; 95%CI: 0.1, 0.3; P < 0.001). The risk score for coronary heart disease was associated with increased levels of CRP, but not with any polymorphism or haplotype. Conclusions. The association between the TT genotype of SNP rs1130864 with obesity and the haplotype 7 with BMI may explain how obesity and genetic predisposition increase the risk of diseases such as T2D in the population of Southwestern Mexico. PMID:23049543

  13. Haplotypes in the CRP gene associated with increased BMI and levels of CRP in subjects with type 2 diabetes or obesity from Southwestern Mexico.

    PubMed

    Martínez-Calleja, América; Quiróz-Vargas, Irma; Parra-Rojas, Isela; Muñoz-Valle, José Francisco; Leyva-Vázquez, Marco A; Fernández-Tilapa, Gloria; Vences-Velázquez, Amalia; Cruz, Miguel; Salazar-Martínez, Eduardo; Flores-Alfaro, Eugenia

    2012-01-01

    We evaluated the association between four polymorphisms in the CRP gene with circulating levels of C-reactive protein (CRP), type 2 diabetes (T2D), obesity, and risk score of coronary heart disease. We studied 402 individuals and classified them into four groups: healthy, obese, T2D obese, and T2D without obesity, from Guerrero, Southwestern Mexico. Blood levels of CRP, glucose, cholesterol, triglycerides, and leukocytes were measured. Genotyping was performed by PCR/RFLP, and the risk score for coronary heart disease was determined by the Framingham's methodology. The TT genotype of SNP rs1130864 was associated with increased body mass index and T2D patients with obesity. We found that the haplotype 2 (TGAG) was associated with increased levels of CRP (β = 0.3; 95%CI: 0.1, 0.5; P = 0.005) and haplotype 7 (TGGG) with higher body mass index (BMI) (β = 0.2; 95%CI: 0.1, 0.3; P < 0.001). The risk score for coronary heart disease was associated with increased levels of CRP, but not with any polymorphism or haplotype. The association between the TT genotype of SNP rs1130864 with obesity and the haplotype 7 with BMI may explain how obesity and genetic predisposition increase the risk of diseases such as T2D in the population of Southwestern Mexico.

  14. Divergence at the casein haplotypes in dairy and meat goat breeds.

    PubMed

    Küpper, Julia; Chessa, Stefania; Rignanese, Daniela; Caroli, Anna; Erhardt, Georg

    2010-02-01

    Casein genes have been proved to have an influence on milk properties, and are in addition appropriate for phylogeny studies. A large number of casein polymorphisms exist in goats, making their analysis quite complex. The four casein loci were analyzed by molecular techniques for genetic polymorphism detection in the two dairy goat breeds Bunte Deutsche Edelziege (BDE; n=96), Weisse Deutsche Edelziege (WDE; n=91), and the meat goat breed Buren (n=75). Of the 35 analyzed alleles, 18 were found in BDE, and 17 in Buren goats and WDE. In addition, a new allele was identified at the CSN1S1 locus in the BDE, showing a frequency of 0.05. This variant, named CSN1S1*A', is characterized by a t-->c transversion in intron 9. Linkage disequilibrium was found at the casein haplotype in all three breeds. A total of 30 haplotypes showed frequencies higher than 0.01. In the Buren breed only one haplotype showed a frequency higher than 0.1. The ancestral haplotype B-A-A-B (in the order: CSN1S1-CSN2-CSN1S2-CSN3) occurred in all three breeds, showing a very high frequency (>0.8) in the Buren.

  15. HLA-G, -A haplotypes in Amerindians (Ecuador): HLA-G*01:05N World distribution.

    PubMed

    Arnaiz-Villena, Antonio; Palacio-Gruber, Jose; Enriquez de Salamanca, Mercedes; Juárez, Ignacio; Campos, Cristina; Nieto, Jorge; Muñiz, Ester; Martin-Villa, Jose Manuel

    2018-02-01

    HLA-G and HLA-A frequencies have been analysed in Amerindians from Ecuador. HLA-G allele frequencies are found to be closer to those of other Amerindians (Mayas from Guatemala and Uros from Peru) and closer to European frequencies than to those of Far East Asian groups, particularly with regard to the HLA-G*01:04 allele. HLA-G/-A haplotypes have been calculated for the first time in Amerindians. It is remarkable that the HLA-G*01:05N "null" allele is found at a very low frequency (as in Amerindian Mayas and Uros) and is also found in haplotypes belonging to the HLA-A19 group of alleles (HLA-A*30, -A*31, -A*33). It was previously postulated that HLA-G*01:05N appeared in HLA-A*30/-B*13 haplotypes in Middle East Mediterraneans. It may be hypothesized that, in evolution, HLA-G*01:05N existed primarily in one of the extant or extinct HLA-A19 haplotypes, whether this haplotype was located in the Middle East or in other world areas, including America. However, the highest present-day HLA-G*01:05N frequencies are found in Middle East Mediterraneans. Copyright © 2017. Published by Elsevier Inc.

  16. A High Density SNP Array for the Domestic Horse and Extant Perissodactyla: Utility for Association Mapping, Genetic Diversity, and Phylogeny Studies

    PubMed Central

    McCue, Molly E.; Bannasch, Danika L.; Petersen, Jessica L.; Gurr, Jessica; Bailey, Ernie; Binns, Matthew M.; Distl, Ottmar; Guérin, Gérard; Hasegawa, Telhisa; Hill, Emmeline W.; Leeb, Tosso; Lindgren, Gabriella; Penedo, M. Cecilia T.; Røed, Knut H.; Ryder, Oliver A.; Swinburne, June E.; Tozaki, Teruaki; Valberg, Stephanie J.; Vaudin, Mark; Lindblad-Toh, Kerstin

    2012-01-01

    An equine SNP genotyping array was developed and evaluated on a panel of samples representing 14 domestic horse breeds and 18 evolutionarily related species. More than 54,000 polymorphic SNPs provided an average inter-SNP spacing of ∼43 kb. The mean minor allele frequency across domestic horse breeds was 0.23, and the number of polymorphic SNPs within breeds ranged from 43,287 to 52,085. Genome-wide linkage disequilibrium (LD) in most breeds declined rapidly over the first 50–100 kb and reached background levels within 1–2 Mb. The extent of LD and the level of inbreeding were highest in the Thoroughbred and lowest in the Mongolian and Quarter Horse. Multidimensional scaling (MDS) analyses demonstrated the tight grouping of individuals within most breeds, close proximity of related breeds, and less tight grouping in admixed breeds. The close relationship between the Przewalski's Horse and the domestic horse was demonstrated by pair-wise genetic distance and MDS. Genotyping of other Perissodactyla (zebras, asses, tapirs, and rhinoceros) was variably successful, with call rates and the number of polymorphic loci varying across taxa. Parsimony analysis placed the modern horse as sister taxa to Equus przewalski. The utility of the SNP array in genome-wide association was confirmed by mapping the known recessive chestnut coat color locus (MC1R) and defining a conserved haplotype of ∼750 kb across all breeds. These results demonstrate the high quality of this SNP genotyping resource, its usefulness in diverse genome analyses of the horse, and potential use in related species. PMID:22253606

  17. Association of KIR genotypes and haplotypes with susceptibility to chronic hepatitis B virus infection in Chinese Han population.

    PubMed

    Lu, Zhiming; Zhang, Bingchang; Chen, Shijun; Gai, Zhongtao; Feng, Zhaolei; Liu, Xiangdong; Liu, Yiqing; Wen, Xin; Li, Li; Jiao, Yulian; Ma, Chunyan; Shao, Song; Cui, Xiangfa; Chen, Guojian; Li, Jianfeng; Zhao, Yueran

    2008-12-01

    Killer immunoglobulin-like receptor (KIR) genes can regulate the activation of NK and T cells upon interaction with HLA class I molecules. Hepatitis B virus (HBV) infection has been regarded as a multi-factorial disease. Previous studies revealed that KIRs were involved in HCV and HIV infection or clearance. The aim of this study was to explore whether inherited KIR genotypes and haplotypes are candidates for susceptibility to persistent HBV infection or for HBV clearance. Sequence-specific primer polymerase chain reaction (SSP-PCR) was employed to identify the KIR genes and pseudogenes in 150 chronic hepatitis B (CHB) patients, 251 spontaneously recovered (SR) controls, and 412 healthy controls. The frequencies of genotypes G, M, and FZ1 were increased in CHB patients compared with healthy control subjects. The frequency of genotype AH was higher in SR controls than in both CHB patients and healthy controls. The carriage frequencies of genotypes G and AH were higher, while the frequencies of AF and AJ were lower, in SR controls than in healthy control subjects. The frequency of the A haplotype was lower, whereas the frequency of the B haplotype was higher, in CHB patients and SR controls than in healthy controls. In healthy controls, the frequency of haplotype 4 was lower than in CHB patients and SR controls, and the frequency of haplotype 5 was higher in SR controls than in the other two groups. Based on these findings, it seems that genotypes M and FZ1 are HBV susceptibility genotypes; AH, on the other hand, may be a protective genotype that facilitates the clearance of HBV. It appears that haplotype 4 is an HBV susceptibility haplotype, whereas haplotype 5 may be a protective haplotype that facilitates the clearance of HBV.

  18. THE REAL McCOIL: A method for the concurrent estimation of the complexity of infection and SNP allele frequency for malaria parasites

    PubMed Central

    Chang, Hsiao-Han; Worby, Colin J.; Yeka, Adoke; Nankabirwa, Joaniter; Kamya, Moses R.; Staedke, Sarah G.; Hubbart, Christina; Amato, Roberto; Kwiatkowski, Dominic P.

    2017-01-01

    As many malaria-endemic countries move towards elimination of Plasmodium falciparum, the most virulent human malaria parasite, effective tools for monitoring malaria epidemiology are urgent priorities. P. falciparum population genetic approaches offer promising tools for understanding transmission and spread of the disease, but a high prevalence of multi-clone or polygenomic infections can render estimation of even the most basic parameters, such as allele frequencies, challenging. A previous method, COIL, was developed to estimate complexity of infection (COI) from single nucleotide polymorphism (SNP) data, but relies on monogenomic infections to estimate allele frequencies or requires external allele frequency data which may not be available. Estimates limited to monogenomic infections may not be representative, however, and when the average COI is high, they can be difficult or impossible to obtain. Therefore, we developed THE REAL McCOIL, Turning HEterozygous SNP data into Robust Estimates of ALlele frequency, via Markov chain Monte Carlo, and Complexity Of Infection using Likelihood, to incorporate polygenomic samples and simultaneously estimate allele frequency and COI. This approach was tested via simulations then applied to SNP data from cross-sectional surveys performed in three Ugandan sites with varying malaria transmission. We show that THE REAL McCOIL consistently outperforms COIL on simulated data, particularly when most infections are polygenomic. Using field data we show that, unlike with COIL, we can distinguish epidemiologically relevant differences in COI between and within these sites. Surprisingly, for example, we estimated high average COI in a peri-urban subregion with lower transmission intensity, suggesting that many of these cases were imported from surrounding regions with higher transmission intensity. THE REAL McCOIL therefore provides a robust tool for understanding the molecular epidemiology of malaria across transmission settings.

  19. HLA-A, -B, -C, -DQB1, and -DRB1,3,4,5 allele and haplotype frequencies in the Costa Rica Central Valley Population and its relationship to worldwide populations.

    PubMed

    Arrieta-Bolaños, Esteban; Maldonado-Torres, Hazael; Dimitriu, Oana; Hoddinott, Michael A; Fowles, Finnuala; Shah, Anila; Orlich-Pérez, Priscilla; McWhinnie, Alasdair J; Alfaro-Bourrouet, Wilbert; Buján-Boza, Willem; Little, Ann-Margaret; Salazar-Sánchez, Lizbeth; Madrigal, J Alejandro

    2011-01-01

    The human leukocyte antigen (HLA) system is the most polymorphic in humans. Its allele, genotype, and haplotype frequencies vary significantly among different populations. Molecular typing data on HLA are necessary for the development of stem cell donor registries, cord blood banks, HLA-disease association studies, and anthropology studies. The Costa Rica Central Valley Population (CCVP) is the major population in this country. No previous study has characterized HLA frequencies in this population. Allele group and haplotype frequencies of HLA genes in the CCVP were determined by means of molecular typing in a sample of 130 unrelated blood donors from one of the country's major hospitals. A comparison between these frequencies and those of 126 populations worldwide was also carried out. A minimum variance dendrogram based on squared Euclidean distances was constructed to assess the relationship between the CCVP sample and populations from all over the world. Allele group and haplotype frequencies observed in this study are consistent with a profile of a dynamic and diverse population, with a hybrid ethnic origin, predominantly Caucasian-Amerindian. Results showed that populations genetically closest to the CCVP are a Mestizo urban population from Venezuela, and another one from Guadalajara, Mexico. Copyright © 2011 American Society for Histocompatibility and Immunogenetics. All rights reserved.
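
    The squared Euclidean distance underlying the dendrogram described above can be sketched as follows. The function name and the allele-group frequencies are hypothetical placeholders, not values from the study; alleles absent from one population are treated as having frequency zero.

```python
def squared_euclidean(freqs_a, freqs_b):
    """Squared Euclidean distance between two allele-frequency profiles.

    Each profile is a dict mapping an allele (group) name to its frequency.
    """
    alleles = set(freqs_a) | set(freqs_b)
    return sum(
        (freqs_a.get(allele, 0.0) - freqs_b.get(allele, 0.0)) ** 2
        for allele in alleles
    )


# Hypothetical allele-group frequencies for two populations.
population_1 = {"A*02": 0.3, "A*24": 0.2}
population_2 = {"A*02": 0.1, "A*03": 0.1}
print(squared_euclidean(population_1, population_2))
```

    A matrix of such pairwise distances between populations is the usual input to a minimum variance (Ward-type) clustering step.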

  20. A Haplotype Information Theory Method Reveals Genes of Evolutionary Interest in European vs. Asian Pigs.

    PubMed

    Hudson, Nicholas J; Naval-Sánchez, Marina; Porto-Neto, Laercio; Pérez-Enciso, Miguel; Reverter, Antonio

    2018-06-05

    Asian and European wild boars were independently domesticated ca. 10,000 years ago. Since the 17th century, Chinese breeds have been imported to Europe to improve the genetics of European animals by introgression of favourable alleles, resulting in a complex mosaic of haplotypes. To interrogate the structure of these haplotypes further, we have run a new haplotype segregation analysis based on information theory, namely compression efficiency (CE). We applied the approach to sequence data from individuals from each phylogeographic region (n = 23 from Asia and Europe) including a number of major pig breeds. Our genome-wide CE is able to discriminate the breeds in a manner reflecting phylogeography. Furthermore, 24,956 non-overlapping sliding windows (each comprising 1,000 consecutive SNPs) were quantified for extent of haplotype sharing within and between Asia and Europe. The genome-wide distribution of extent of haplotype sharing was quite different between groups. Unlike in European pigs, haplotype sharing in Asian pigs approximates a normal distribution. In line with this, we found the European breeds possessed a number of genomic windows of dramatically higher haplotype sharing than the Asian breeds. Our CE analysis of sliding windows captures some of the genomic regions reported to contain signatures of selection in domestic pigs. Prominent among these regions, we highlight the role of a gene encoding the mitochondrial enzyme LACTB, which has been associated with obesity, and the gene encoding MYOG, a fundamental transcriptional regulator of myogenesis. The origin of these regions likely reflects either a population bottleneck in European animals, or selective targets on commercial phenotypes reducing allelic diversity in particular genes and/or regulatory regions.
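
    The idea behind compression efficiency can be sketched under explicit assumptions: the abstract does not define CE precisely, so the sketch below takes CE to be one minus the ratio of compressed to raw size, uses the general-purpose zlib compressor as a stand-in, and encodes genotypes as characters. The function name and the simulated windows are hypothetical.

```python
import random
import zlib


def compression_efficiency(genotype_string):
    """Return 1 - compressed_size / raw_size for an ASCII genotype string.

    Assumed definition for illustration only; the published method may differ.
    """
    raw = genotype_string.encode("ascii")
    if not raw:
        raise ValueError("empty genotype string")
    compressed = zlib.compress(raw, 9)
    return 1 - len(compressed) / len(raw)


# Simulated windows of 1,000 SNPs: one highly repetitive (shared haplotypes),
# one with alleles drawn independently at random (little sharing).
rng = random.Random(0)
shared_window = "0110" * 250
diverse_window = "".join(rng.choice("01") for _ in range(1000))
print(compression_efficiency(shared_window) > compression_efficiency(diverse_window))
```

    Windows in which the same haplotype patterns recur compress better, which is why a higher CE can serve as a proxy for more extensive haplotype sharing.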

  1. β-globin gene cluster haplotypes in ethnic minority populations of southwest China

    PubMed Central

    Sun, Hao; Liu, Hongxian; Huang, Kai; Lin, Keqin; Huang, Xiaoqin; Chu, Jiayou; Ma, Shaohui; Yang, Zhaoqing

    2017-01-01

    The genetic diversity and relationships among ethnic minority populations of southwest China were investigated using seven polymorphic restriction enzyme sites in the β-globin gene cluster. The haplotypes of 1392 chromosomes from ten ethnic populations living in southwest China were determined. Linkage equilibrium and a recombination hotspot were found between the 5′ sites and 3′ sites of the β-globin gene cluster. 5′ haplotypes 2 (+−−−), 6 (−++−+), 9 (−++++) and 3′ haplotype FW3 (−+) were the predominant haplotypes. Notably, the frequency of haplotype 9 was significantly high in the southwest populations, indicating that they differ from other Chinese populations. The interpopulation differentiation of southwest Chinese minority populations is less than that of populations of northern China and other continents. Phylogenetic analysis shows that populations sharing the same ethnic origin or language clustered together, indicating that current β-globin cluster diversity in the Chinese populations reflects their ethnic origin and linguistic affiliations to a great extent. This study characterizes β-globin gene cluster haplotypes in southwest Chinese minorities for the first time, and reveals the genetic variability and affinity of these populations using β-globin cluster haplotype frequencies. The results suggest that ethnic origin plays an important role in shaping variations of the β-globin gene cluster in the southwestern ethnic populations of China. PMID:28205625

  2. Temperature gradient affects differentiation of gene expression and SNP allele frequencies in the dominant Lake Baikal zooplankton species.

    PubMed

    Bowman, Larry L; Kondrateva, Elizaveta S; Timofeyev, Maxim A; Yampolsky, Lev Y

    2018-06-01

    Local adaptation and phenotypic plasticity are main mechanisms of organisms' resilience in changing environments. Both are affected by gene flow and are expected to be weak in zooplankton populations inhabiting large continuous water bodies and strongly affected by currents. Lake Baikal, the deepest and one of the coldest lakes on Earth, experienced epilimnion temperature increase during the last 100 years, exposing Baikal's zooplankton to novel selective pressures. We obtained a partial transcriptome of Epischura baikalensis (Copepoda: Calanoida), the dominant component of Baikal's zooplankton, and estimated SNP allele frequencies and transcript abundances in samples from regions of Baikal that differ in multiyear average surface temperatures. The strongest signal in both SNP and transcript abundance differentiation is the SW-NE gradient along the 600+ km long axis of the lake, suggesting isolation by distance. SNP differentiation is stronger for nonsynonymous than synonymous SNPs and is paralleled by differential survival during a laboratory exposure to increased temperature, indicating directional selection operating on the temperature gradient. Transcript abundance, generally collinear with the SNP differentiation, shows samples from the warmest, less deep location clustering together with the southernmost samples. Differential expression is more frequent among transcripts orthologous to candidate thermal response genes previously identified in model arthropods, including genes encoding cytoskeleton proteins, heat-shock proteins, proteases, enzymes of central energy metabolism, lipid and antioxidant pathways. We conclude that the pivotal endemic zooplankton species in Lake Baikal exists under temperature-mediated selection and possesses both genetic variation and plasticity to respond to novel temperature-related environmental pressures. © 2018 John Wiley & Sons Ltd.

  3. Human Leukocyte Antigen-A, B, C, DRB1, and DQB1 Allele and Haplotype Frequencies in a Subset of 237 Donors in the South African Bone Marrow Registry

    PubMed Central

    Ingram, Charlotte; Schlaphoff, Terry; Borrill, Veronica; Christoffels, Alan

    2018-01-01

    Human leukocyte antigen- (HLA-) A, HLA-B, HLA-C, HLA-DRB1, and HLA-DQB1 allele and haplotype frequencies were studied in a subset of 237 volunteer bone marrow donors registered at the South African Bone Marrow Registry (SABMR). Hapl-o-Mat software was used to compute allele and haplotype frequencies from individuals typed at various resolutions, with some alleles in multiple allele code (MAC) format. Four hundred and thirty-eight HLA-A, 235 HLA-B, 234 HLA-DRB1, 41 HLA-DQB1, and 29 HLA-C alleles are reported. The most frequent alleles were A*02:02g (0.096), B*07:02g (0.082), C*07:02g (0.180), DQB1*06:02 (0.157), and DRB1*15:01 (0.072). The most common haplotype was A*03:01g~B*07:02g~C*07:02g~DQB1*06:02~DRB1*15:01 (0.067), which has also been reported in other populations. Deviations from Hardy-Weinberg equilibrium were observed in A, B, and DRB1 loci, with C~DQB1 being the only locus pair in linkage disequilibrium. This study describes allele and haplotype frequencies from a subset of donors registered at SABMR, the only active bone marrow donor registry in Africa. Although the sample size was small, our results form a key resource for future population studies, disease association studies, and donor recruitment strategies. PMID:29850621

  4. Haplotype analysis indicates an association between the DOPA decarboxylase (DDC) gene and nicotine dependence.

    PubMed

    Ma, Jennie Z; Beuten, Joke; Payne, Thomas J; Dupont, Randolph T; Elston, Robert C; Li, Ming D

    2005-06-15

    DOPA decarboxylase (DDC; also known as L-amino acid decarboxylase; AADC) is involved in the synthesis of dopamine, norepinephrine and serotonin. Because the mesolimbic dopaminergic system is implicated in the reinforcing effects of many drugs, including nicotine, the DDC gene is considered a plausible candidate for involvement in the development of vulnerability to nicotine dependence (ND). Further, this gene is located within the 7p11 region that showed a 'suggestive linkage' to ND in our previous genome-wide scan in the Framingham Heart Study population. In the present study, we tested eight single nucleotide polymorphisms (SNPs) within DDC for association with ND, which was assessed by smoking quantity (SQ), the heaviness of smoking index (HSI) and the Fagerstrom test for ND (FTND) score, in a total of 2037 smokers and non-smokers from 602 nuclear families of African- or European-American (AA or EA, respectively) ancestry. Association analysis for individual SNPs using the PBAT-GEE program indicated that SNP rs921451 was significantly associated with two of the three adjusted ND measures in the EA sample (P=0.01-0.04). Haplotype-based association analysis revealed a protective T-G-T-G haplotype for rs921451-rs3735273-rs1451371-rs2060762 in the AA sample, which was significantly associated with all three adjusted ND measures after correction for multiple testing (min Z=-2.78, P=0.006 for HSI). In contrast, we found a high-risk T-G-T-G haplotype for a different SNP combination in the EA sample, rs921451-rs3735273-rs1451371-rs3757472, which showed a significant association after Bonferroni correction with the SQ and FTND score (max Z=2.73, P=0.005 for FTND). In summary, our findings provide the first evidence for the involvement of DDC in the susceptibility to ND and, further, reveal the racial specificity of its impact.

  5. RTEL1 tagging SNPs and haplotypes were associated with glioma development

    PubMed Central

    2013-01-01

    As glioma ranks as the first most prevalent solid tumors in primary central nervous system, certain single-nucleotide polymorphisms (SNPs) may be related to increased glioma risk, and have implications in carcinogenesis. The present case–control study was carried out to elucidate how common variants contribute to glioma susceptibility. Ten candidate tagging SNPs (tSNPs) were selected from seven genes whose polymorphisms have been proven by classical literatures and reliable databases to be tended to relate with gliomas, and with the minor allele frequency (MAF) > 5% in the HapMap Asian population. The selected tSNPs were genotyped in 629 glioma patients and 645 controls from a Han Chinese population using the multiplexed SNP MassEXTEND assay calibrated. Two significant tSNPs in RTEL1 gene were observed to be associated with glioma risk (rs6010620, P = 0.0016, OR: 1.32, 95% CI: 1.11-1.56; rs2297440, P = 0.001, OR: 1.33, 95% CI: 1.12-1.58) by χ2 test. It was identified the genotype “GG” of rs6010620 acted as the protective genotype for glioma (OR, 0.46; 95% CI, 0.31-0.7; P = 0.0002), while the genotype “CC” of rs2297440 as the protective genotype in glioma (OR, 0.47; 95% CI, 0.31-0.71; P = 0.0003). Furthermore, haplotype “GCT” in RTEL1 gene was found to be associated with risk of glioma (OR, 0.7; 95% CI, 0.57-0.86; Fisher’s P = 0.0005; Pearson’s P = 0.0005), and haplotype “ATT” was detected to be associated with risk of glioma (OR, 1.32; 95% CI, 1.12-1.57; Fisher’s P = 0.0013; Pearson’s P = 0.0013). Two single variants, the genotypes of “GG” of rs6010620 and “CC” of rs2297440 (rs6010620 and rs2297440) in the RTEL1 gene, together with two haplotypes of GCT and ATT, were identified to be associated with glioma development. And it might be used to evaluate the glioma development risks to screen the above RTEL1 tagging SNPs and haplotypes.

  6. Frequency and origin of haplotypes associated with the beta-globin gene cluster in individuals with trait and sickle cell anemia in the Atlantic and Pacific coastal regions of Colombia

    PubMed Central

    Fong, Cristian; Lizarralde-Iragorri, María Alejandra; Rojas-Gallardo, Diana; Barreto, Guillermo

    2013-01-01

    Sickle cell anemia is a genetic disease with high prevalence in people of African descent. There are five typical haplotypes associated with this disease and the haplotypes associated with the beta-globin gene cluster have been used to establish the origin of African-descendant people in America. In this work, we determined the frequency and the origin of haplotypes associated with hemoglobin S in a sample of individuals with sickle cell anemia (HbSS) and sickle cell hemoglobin trait (HbAS) in coastal regions of Colombia. Blood samples from 71 HbAS and 79 HbSS individuals were obtained. Haplotypes were determined based on the presence of variable restriction sites within the β-globin gene cluster. On the Pacific coast of Colombia the most frequent haplotype was Benin, while on the Atlantic coast Bantu was marginally higher than Benin. Eight atypical haplotypes were observed on both coasts, being more diverse in the Atlantic than in the Pacific region. These results suggest a differential settlement of the coasts, dependent on where slaves were brought from, either from the Gulf of Guinea or from Angola, where the haplotype distributions are similar. Atypical haplotypes probably originated from point mutations that lost or gained a restriction site and/or by recombination events. PMID:24385850

  7. Fitchi: haplotype genealogy graphs based on the Fitch algorithm.

    PubMed

    Matschiner, Michael

    2016-04-15

    In population genetics and phylogeography, haplotype genealogy graphs are important tools for the visualization of population structure based on sequence data. In this type of graph, node sizes are often drawn in proportion to haplotype frequencies and edge lengths represent the minimum number of mutations separating adjacent nodes. I here present Fitchi, a new program that produces publication-ready haplotype genealogy graphs based on the Fitch algorithm. Available at http://www.evoinformatics.eu/fitchi.htm. Contact: michaelmatschiner@mac.com. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  8. MDR1 haplotypes derived from exons 21 and 26 do not affect the steady-state pharmacokinetics of tacrolimus in renal transplant patients.

    PubMed

    Mai, Ingrid; Perloff, Elke S; Bauer, Steffen; Goldammer, Mark; Johne, Andreas; Filler, Guido; Budde, Klemens; Roots, Ivar

    2004-11-01

    This retrospective study investigated the influence of MDR1 haplotypes derived from the polymorphisms 2677G > T (exon 21) and 3435C > T (exon 26) on the pharmacokinetics of the immunosuppressant drug tacrolimus in 73 renal transplant patients. Based on both variants of SNPs 2677 and 3435, four different haplotypes and eight different genotypes were identified in the study sample. Tacrolimus trough concentrations (C(0)) were compared between different SNP variants and genotypes, as well as between carriers and noncarriers of each haplotype. Additionally, CYP3A5 genotype (6956G > A) was determined. No significant differences were observed between groups. Differences in mean tacrolimus C(0) values between carriers and noncarriers of each haplotype ranged from -0.04 microg/litre (95% confidence interval: -0.53 to 0.60) to 0.23 microg/litre (-1.07 to 1.53). No association was found between CYP3A5*1/*3 genotype and tacrolimus C(0) concentrations. MDR1 haplotypes derived from the SNPs 2677G > T (exon 21) and 3435C > T (exon 26) do not influence the pharmacokinetics of tacrolimus in renal transplant patients.

  9. Global variation in CYP2C8–CYP2C9 functional haplotypes

    PubMed Central

    Speed, William C; Kang, Soonmo Peter; Tuck, David P; Harris, Lyndsay N; Kidd, Kenneth K

    2009-01-01

    We have studied the global frequency distributions of 10 single nucleotide polymorphisms (SNPs) across 132 kb of CYP2C8 and CYP2C9 in ∼2500 individuals representing 45 populations. Five of the SNPs were in noncoding sequences; the other five involved the more common missense variants (four in CYP2C8, one in CYP2C9) that change amino acids in the gene products. One haplotype containing two CYP2C8 coding variants and one CYP2C9 coding variant reaches an average frequency of 10% in Europe; a set of haplotypes with a different CYP2C8 coding variant reaches 17% in Africa. In both cases these haplotypes are found in other regions of the world at <1%. This considerable geographic variation in haplotype frequencies impacts the interpretation of CYP2C8/CYP2C9 association studies, and has pharmacogenomic implications for drug interactions. PMID:19381162

  10. C-reactive protein haplotype is associated with high PSA as a marker of metastatic prostate cancer but not with overall cancer risk

    PubMed Central

    Eklund, C M; Tammela, T L J; Schleutker, J; Hurme, M

    2009-01-01

    Growing evidence points to a role for inflammation in prostate carcinogenesis. The significance of C-reactive protein (CRP), an inflammatory and innate immunity molecule, has not been evaluated thoroughly in prostate cancer (PC). In this study of 739 Finnish patients with PC and 760 healthy men, we evaluated the associations of CRP genotypes and haplotypes with total PC risk and PC progression, using prostate-specific antigen (PSA) as a marker of metastatic disease. Although the haplotype frequencies were similar in patients and controls, an association between haplotype ACCCA and patients' PSA levels was found. The carriers more often had a high PSA than non-carriers (P=0.0002) and the SNP rs2794521 A-allele and rs1800947 C-allele carriers had a higher PSA than non-carriers (P=0.009 and P=0.0004, respectively). A trend for a younger age at diagnosis was found among the carriers of ACCCA (P=0.07) and the rs1800947 C-allele (P=0.06), as well as a trend for the latter to have more likely metastases (P=0.06), but not after Bonferroni correction (α=0.00208). This is the first study to suggest association between PSA and CRP variants in PC and, therefore, further studies are warranted. CRP alleles previously found to protect against increased CRP levels are now suggested to be associated with metastatic PC, indicated by elevated PSA. PMID:19436291
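    The Bonferroni threshold quoted in the abstract above (α=0.00208) can be reproduced with a short Python sketch. The abstract does not state how many comparisons were made; the value 24 below is an assumption chosen because 0.05/24 rounds to 0.00208, and the function names are hypothetical.

```python
def bonferroni_alpha(family_alpha, n_tests):
    """Per-test significance threshold under Bonferroni correction."""
    return family_alpha / n_tests


def passes_bonferroni(p_values, family_alpha=0.05, n_tests=None):
    """Flag each P value that falls below the corrected threshold."""
    if n_tests is None:
        n_tests = len(p_values)
    threshold = bonferroni_alpha(family_alpha, n_tests)
    return [p < threshold for p in p_values]


# P values quoted in the abstract; n_tests=24 is an assumption, not a reported figure.
reported_p = [0.0002, 0.009, 0.0004, 0.07, 0.06]
threshold = bonferroni_alpha(0.05, 24)
flags = passes_bonferroni(reported_p, n_tests=24)
```

    Under that assumption only the two smallest P values fall below the corrected threshold, which is consistent with the trends at P=0.06 to 0.07 not surviving correction.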

  11. Common FABP4 genetic variants and plasma levels of fatty acid binding protein 4 in older adults.

    PubMed

    Mukamal, Kenneth J; Wilk, Jemma B; Biggs, Mary L; Jensen, Majken K; Ix, Joachim H; Kizer, Jorge R; Tracy, Russell P; Zieman, Susan J; Mozaffarian, Dariush; Psaty, Bruce M; Siscovick, David S; Djoussé, Luc

    2013-11-01

    We examined common variants in the fatty acid binding protein 4 gene (FABP4) and plasma levels of FABP4 in adults aged 65 and older from the Cardiovascular Health Study. We genotyped rs16909187, rs1054135, rs16909192, rs10808846, rs7018409, rs2290201, and rs6992708 and measured circulating FABP4 levels among 3190 European Americans and 660 African Americans. Among European Americans, the minor alleles of six single nucleotide polymorphisms (SNPs) were associated with lower FABP4 levels (all p ≤ 0.01). Among African Americans, the SNP with the lowest minor allele frequency was associated with lower FABP4 levels (p = 0.015). The C-A haplotype of rs16909192 and rs2290201 was associated with lower FABP4 levels in both European Americans (frequency = 16%; p = 0.001) and African Americans (frequency = 8%; p = 0.04). The haplotype combined a SNP in the first intron with one in the 3' untranslated region. However, the alleles associated with lower FABP4 levels were associated with higher fasting glucose in meta-analyses from the MAGIC consortium. These results demonstrate associations of common SNPs and haplotypes in the FABP4 gene with lower plasma FABP4 but higher fasting glucose levels.

  12. HLA-A, HLA-B, HLA-DRB1 allele and haplotype frequencies in 6384 umbilical cord blood units and transplantation matching and engraftment statistics in the Zhejiang cord blood bank of China.

    PubMed

    Wang, F; He, J; Chen, S; Qin, F; Dai, B; Zhang, W; Zhu, F M; Lv, H J

    2014-02-01

    Umbilical cord blood (UCB) is a widely accepted source of progenitor cells, and many cord blood banks have now been established. Here, we analysed the HLA-A, HLA-B and HLA-DRB1 allele and haplotype frequencies, HLA matching possibilities for searching potential donors and outcome of UCB transplantations in Zhejiang cord blood bank of China. A total of 6384 UCB units were characterized for 17 HLA-A, 30 HLA-B and 13 HLA-DRB1 alleles at the first field resolution level. Additionally, B*14, B*15 and B*40 were typed to the second field level. A total of 1372 distinct A-B-DRB1 haplotypes were identified. The frequencies of 7 haplotypes were more than 1%, and 439 haplotypes were <0.01%. A*02-B*46-DRB1*09, A*33-B*58-DRB1*03 and A*30-B*13-DRB1*07 were the most common haplotypes, with frequencies of 4.4%, 3.3%, and 2.9%, respectively. Linkage disequilibrium (LD) analysis showed that there were 83 A-B, 106 B-DRB1, 54 A-DRB1 haplotypes with positive LD, in which 51 A-B, 60 B-DRB1, 32 A-DRB1 haplotypes exhibited a significant LD (P < 0.05). In 682 search requests, 12.9%, 40.0% and 42.7% of patients were found to have 6 of 6, 5 of 6 and 4 of 6 HLA-A, HLA-B and HLA-DRB1 matching donors, respectively. A total of 30 UCB units were transplanted to 24 patients (3 patients not evaluated due to early death); 14 of 21 patients (66.7%) engrafted. This study reveals the HLA distribution and its transplantation application in the cord blood bank of Zhejiang province. These data can help to select potential UCB donors for transplantation and can be used to assess the scale of new cord blood banking endeavours. © 2013 John Wiley & Sons Ltd.
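    "Positive LD" in the abstract above refers to a two-locus haplotype occurring more often than the product of its allele frequencies would predict. A minimal Python sketch of the standard coefficients D and D' follows; the function name and the input frequencies are hypothetical, since the abstract reports no per-allele frequencies, and this is not the authors' analysis code.

```python
def linkage_disequilibrium(hap_freq, p_a, p_b):
    """Return (D, D') for one two-locus haplotype.

    hap_freq: frequency of the haplotype carrying allele A and allele B.
    p_a, p_b: frequencies of allele A and allele B at their respective loci.
    """
    d = hap_freq - p_a * p_b  # D > 0 means positive LD
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = d / d_max if d_max > 0 else 0.0
    return d, d_prime


# Hypothetical frequencies, for illustration only.
d, d_prime = linkage_disequilibrium(hap_freq=0.05, p_a=0.2, p_b=0.1)
```

    Whether a positive D is statistically significant is a separate question, which is why the abstract reports fewer significant haplotypes than haplotypes with positive LD.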

  13. A Candidate Trans-acting Modulator of Fetal Hemoglobin Gene Expression in the Arab-Indian Haplotype of Sickle Cell Anemia

    PubMed Central

    Vathipadiekal, Vinod; Farrell, John J.; Wang, Shuai; Edward, Heather L.; Shappell, Heather; Al-Rubaish, A.M.; Al-Muhanna, Fahad; Naserullah, Z.; Alsuliman, A.; Qutub, Hatem Othman; Simkin, Irene; Farrer, Lindsay A.; Jiang, Zhihua; Luo, Hong-Yuan; Huang, Shengwen; Mostoslavsky, Gustavo; Murphy, George J.; Patra, Pradeep K.; Chui, David H.K.; Alsultan, Abdulrahman; Al-Ali, Amein K.; Sebastiani, Paola; Steinberg, Martin H.

    2016-01-01

    Fetal hemoglobin (HbF) levels are higher in the Arab-Indian (AI) β-globin gene haplotype of sickle cell anemia compared with African-origin haplotypes. To study genetic elements that affect HbF expression in the AI haplotype we completed whole genome sequencing in 14 Saudi AI haplotype sickle hemoglobin homozygotes—seven selected for low HbF (8.2±1.3%) and seven selected for high HbF (23.5±2.6%). An intronic single nucleotide polymorphism (SNP) in ANTXR1, an anthrax toxin receptor (chromosome 2p13), was associated with HbF. These results were replicated in two independent Saudi AI haplotype cohorts of 120 and 139 patients, but not in 76 Saudi Benin haplotype, 894 African origin haplotype and 44 Arab Indian haplotype patients of Indian descent, suggesting that this association is effective only in the Saudi AI haplotype background. ANTXR1 variants explained 10% of the HbF variability compared with 8% for BCL11A. These two genes had independent, additive effects on HbF and together explained about 15% of HbF variability in Saudi AI sickle cell anemia patients. ANTXR1 was expressed at mRNA and protein levels in erythroid progenitors derived from induced pluripotent stem cells (iPSCs) and CD34+ cells. As CD34+ cells matured and their HbF decreased ANTXR1 expression increased; as iPSCs differentiated and their HbF increased, ANTXR1 expression decreased. Along with elements in cis to the HbF genes, ANTXR1 contributes to the variation in HbF in Saudi AI haplotype sickle cell anemia and is the first gene in trans to HBB that is associated with HbF only in carriers of the Saudi AI haplotype. PMID:27501013

  14. Beta-globin gene cluster haplotypes of Amerindian populations from the Brazilian Amazon region.

    PubMed

    Guerreiro, J F; Figueiredo, M S; Zago, M A

    1994-01-01

    We have determined the beta-globin cluster haplotypes for 80 Indians from four Brazilian Amazon tribes: Kayapó, Wayampí, Wayana-Apalaí, and Arára. The results are analyzed together with 20 Yanomámi previously studied. From 2 to 4 different haplotypes were identified for each tribe, and 7 of the possible 32 haplotypes were found in a sample of 172 chromosomes for which the beta haplotypes were directly determined or derived from family studies. The haplotype distribution does not differ significantly among the five populations. The two most common haplotypes in all tribes were haplotypes 2 and 6, with average frequencies of 0.843 and 0.122, respectively. The genetic affinities between Brazilian Indians and other human populations were evaluated by estimates of genetic distance based on haplotype data. The lowest values were observed in relation to Asians, especially Chinese, Polynesians, and Micronesians.

  15. A TNF region haplotype offers protection from typhoid fever in Vietnamese patients

    PubMed Central

    2009-01-01

    The genomic region surrounding the TNF locus on human chromosome 6 has previously been associated with typhoid fever in Vietnam. We used a haplotypic approach to understand this association further. Eighty single nucleotide polymorphisms (SNPs) spanning a 150 kb region were genotyped in 95 Vietnamese individuals (typhoid case/mother/father trios). A subset of data from 33 SNPs with a minor allele frequency of >4.3% was used to construct haplotypes. Fifteen SNPs, which tagged the 42 constructed haplotypes were selected. The haplotype tagging SNPs (T1-T15) were genotyped in 380 confirmed typhoid cases and 380 Vietnamese ethnically matched controls. Allelic frequencies of seven SNPs (T1, T2, T3, T5, T6, T7, T8) were significantly different between typhoid cases and controls. Logistic regression results support the hypothesis that there is just one signal associated with disease at this locus. Haplotype-based analysis of the tag SNPs provided positive evidence of association with typhoid (posterior probability 0.821). The analysis highlighted a low-risk cluster of haplotypes that each carry the minor allele of T1 or T7, but not both, and otherwise carry the combination of alleles *12122*1111 at T1-T11, further supporting the one associated signal hypothesis. Finally, individuals that carry the typhoid fever protective haplotype *12122*1111 also produce a relatively low TNF-α response to LPS. PMID:17503085

  16. Optimal Design of Low-Density SNP Arrays for Genomic Prediction: Algorithm and Applications

    PubMed Central

    Wu, Xiao-Lin; Xu, Jiaqi; Feng, Guofei; Wiggans, George R.; Taylor, Jeremy F.; He, Jun; Qian, Changsong; Qiu, Jiansheng; Simpson, Barry; Walker, Jeremy; Bauck, Stewart

    2016-01-01

    Low-density (LD) single nucleotide polymorphism (SNP) arrays provide a cost-effective solution for genomic prediction and selection, but algorithms and computational tools are needed for the optimal design of LD SNP chips. A multiple-objective, local optimization (MOLO) algorithm was developed for design of optimal LD SNP chips that can be imputed accurately to medium-density (MD) or high-density (HD) SNP genotypes for genomic prediction. The objective function facilitates maximization of non-gap map length and system information for the SNP chip, and the latter is computed either as locus-averaged (LASE) or haplotype-averaged Shannon entropy (HASE) and adjusted for uniformity of the SNP distribution. HASE performed better than LASE with ≤1,000 SNPs, but required considerably more computing time. Nevertheless, the differences diminished when >5,000 SNPs were selected. Optimization was accomplished conditionally on the presence of SNPs that were obligated to each chromosome. The frame location of SNPs on a chip can be either uniform (evenly spaced) or non-uniform. For the latter design, a tunable empirical Beta distribution was used to guide location distribution of frame SNPs such that both ends of each chromosome were enriched with SNPs. The SNP distribution on each chromosome was finalized through the objective function that was locally and empirically maximized. This MOLO algorithm was capable of selecting a set of approximately evenly-spaced and highly-informative SNPs, which in turn led to increased imputation accuracy compared with selection solely of evenly-spaced SNPs. Imputation accuracy increased with LD chip size, and imputation error rate was extremely low for chips with ≥3,000 SNPs. Assuming that genotyping or imputation error occurs at random, imputation error rate can be viewed as the upper limit for genomic prediction error. Our results show that about 25% of imputation error rate was propagated to genomic prediction in an Angus population. The

  17. Optimal Design of Low-Density SNP Arrays for Genomic Prediction: Algorithm and Applications.

    PubMed

    Wu, Xiao-Lin; Xu, Jiaqi; Feng, Guofei; Wiggans, George R; Taylor, Jeremy F; He, Jun; Qian, Changsong; Qiu, Jiansheng; Simpson, Barry; Walker, Jeremy; Bauck, Stewart

    2016-01-01

    Low-density (LD) single nucleotide polymorphism (SNP) arrays provide a cost-effective solution for genomic prediction and selection, but algorithms and computational tools are needed for the optimal design of LD SNP chips. A multiple-objective, local optimization (MOLO) algorithm was developed for design of optimal LD SNP chips that can be imputed accurately to medium-density (MD) or high-density (HD) SNP genotypes for genomic prediction. The objective function facilitates maximization of non-gap map length and system information for the SNP chip, and the latter is computed either as locus-averaged (LASE) or haplotype-averaged Shannon entropy (HASE) and adjusted for uniformity of the SNP distribution. HASE performed better than LASE with ≤1,000 SNPs, but required considerably more computing time. Nevertheless, the differences diminished when >5,000 SNPs were selected. Optimization was accomplished conditionally on the presence of SNPs that were obligated to each chromosome. The frame location of SNPs on a chip can be either uniform (evenly spaced) or non-uniform. For the latter design, a tunable empirical Beta distribution was used to guide location distribution of frame SNPs such that both ends of each chromosome were enriched with SNPs. The SNP distribution on each chromosome was finalized through the objective function that was locally and empirically maximized. This MOLO algorithm was capable of selecting a set of approximately evenly-spaced and highly-informative SNPs, which in turn led to increased imputation accuracy compared with selection solely of evenly-spaced SNPs. Imputation accuracy increased with LD chip size, and imputation error rate was extremely low for chips with ≥3,000 SNPs. Assuming that genotyping or imputation error occurs at random, imputation error rate can be viewed as the upper limit for genomic prediction error. Our results show that about 25% of imputation error rate was propagated to genomic prediction in an Angus population. The

  18. IL1B-CGTC haplotype is associated with colorectal cancer in admixed individuals with increased African ancestry

    PubMed Central

    Sanabria-Salas, María Carolina; Hernández-Suárez, Gustavo; Umaña-Pérez, Adriana; Rawlik, Konrad; Tenesa, Albert; Serrano-López, Martha Lucía; Sánchez de Gómez, Myriam; Rojas, Martha Patricia; Bravo, Luis Eduardo; Albis, Rosario; Plata, José Luis; Green, Heather; Borgovan, Theodor; Li, Li; Majumdar, Sumana; Garai, Jone; Lee, Edward; Ashktorab, Hassan; Brim, Hassan; Li, Li; Margolin, David; Fejerman, Laura; Zabaleta, Jovanny

    2017-01-01

    Single-nucleotide polymorphisms (SNPs) in cytokine genes can affect gene expression and thereby modulate inflammation and carcinogenesis. However, the data on the association between SNPs in the interleukin 1 beta gene (IL1B) and colorectal cancer (CRC) are conflicting. We found an association between a 4-SNP haplotype block of the IL1B (-3737C/-1464G/-511T/-31C) and CRC risk, and this association was exclusively observed in individuals with a higher proportion of African ancestry, such as individuals from the Coastal Colombian region (odds ratio, OR 2.06; 95% CI 1.31–3.25; p < 0.01). Moreover, a significant interaction between this CRC risk haplotype and local African ancestry dosage was identified in locus 2q14 (p = 0.03). We conclude that Colombian individuals with high African ancestry proportions at locus 2q14 harbour more IL1B-CGTC copies and are consequently at an increased risk of CRC. This haplotype has been previously found to increase the IL1B promoter activity and is the most frequent haplotype in African Americans. Despite of limitations in the number of samples and the lack of functional analysis to examine the effect of these haplotypes on CRC cell lines, our results suggest that inflammation and ethnicity play a major role in the modulation of CRC risk. PMID:28157220
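    Odds ratios with 95% confidence intervals, such as the OR 2.06 (95% CI 1.31–3.25) quoted above, are often computed from a 2×2 table with the log (Woolf) method. The Python sketch below shows that calculation under stated assumptions: the counts are hypothetical, the function name is illustrative, and the study itself may have used an adjusted model rather than a raw 2×2 table.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio and Wald confidence interval from a 2x2 table.

    a: carriers among cases      b: non-carriers among cases
    c: carriers among controls   d: non-carriers among controls
    z: normal quantile (1.96 gives an approximate 95% interval)
    """
    odds_ratio = (a * d) / (b * c)
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts, for illustration only (not data from the study).
or_value, ci_low, ci_high = odds_ratio_ci(20, 10, 10, 20)
```

    An interval that excludes 1, as the reported one does, corresponds to a nominally significant association at the 5% level.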

  19. Epistatic interaction between haplotypes of the ghrelin ligand and receptor genes influence susceptibility to myocardial infarction and coronary artery disease.

    PubMed

    Baessler, Andrea; Fischer, Marcus; Mayer, Bjoern; Koehler, Martina; Wiedmann, Silke; Stark, Klaus; Doering, Angela; Erdmann, Jeanette; Riegger, Guenter; Schunkert, Heribert; Kwitek, Anne E; Hengstenberg, Christian

    2007-04-15

    Data from both experimental models and humans provide evidence that ghrelin and its receptor, the growth hormone secretagogue receptor (ghrelin receptor, GHSR), possess a variety of cardiovascular effects. Thus, we hypothesized that genetic variants within the ghrelin system (ligand ghrelin and its receptor GHSR) are associated with susceptibility to myocardial infarction (MI) and coronary artery disease (CAD). Seven single nucleotide polymorphisms (SNPs) covering the GHSR region as well as eight SNPs across the ghrelin gene (GHRL) region were genotyped in index MI patients (864 Caucasians, 'index MI cases') from the German MI family study and in matched controls without evidence of CAD (864 Caucasians, 'controls', MONICA Augsburg). In addition, siblings of these MI patients with documented severe CAD (826 'affected sibs') were matched likewise with controls (n = 826 Caucasian 'controls') and used for verification. The effect of interactions between genetic variants of both genes of the ghrelin system was explored by conditional classification tree models. We found association of several GHSR SNPs with MI [best SNP odds ratio (OR) 1.7 (1.2-2.5); P = 0.002] using a recessive model. Moreover, we identified a common GHSR haplotype which significantly increases the risk for MI [multivariate adjusted OR for homozygous carriers 1.6 (1.1-2.5) and CAD OR 1.6 (1.1-2.5)]. In contrast, no relationship between genetic variants and the disease could be revealed for GHRL. However, the increase in MI/CAD frequency related to the susceptible GHSR haplotype was abolished when it coincided with a common GHRL haplotype. Multivariate adjustments as well as permutation-based methods conveyed the same results. 
    These data are the first to demonstrate an association between SNPs and haplotypes within important genes of the ghrelin system and susceptibility to MI. Whereas an association with MI/CAD could be identified for genetic variants across GHSR, no relationship could be revealed for GHRL.

  20. Association of ORAI1 Haplotypes with the Risk of HLA-B27 Positive Ankylosing Spondylitis

    PubMed Central

    Wei, James Cheng-Chung; Yen, Jeng-Hsien; Juo, Suh-Hang Hank; Chen, Wei-Chiao; Wang, Yu-Shiuan; Chiu, Yi-Ching; Hsieh, Tusty-Jiuan; Guo, Yuh-Cherng; Huang, Chun-Huang; Wong, Ruey-Hong; Wang, Hui-Po; Tsai, Ke-Li; Wu, Yang-Chang; Chang, Hsueh-Wei; Hsi, Edward; Chang, Wei-Pin; Chang, Wei-Chiao

    2011-01-01

    Ankylosing spondylitis (AS) is a chronic inflammation of the sacroiliac joints, spine and peripheral joints. The aetiology of ankylosing spondylitis is still unclear. Previous studies have indicated that genetic factors such as human leukocyte antigen HLA-B27 are associated with AS susceptibility. We carried out a case-control study to determine whether genetic polymorphisms of the ORAI1 gene, a major component of store-operated calcium channels that is involved in the regulation of the immune system, are a susceptibility factor for AS in a Taiwanese population. We enrolled 361 AS patients who fulfilled the modified New York criteria and 379 controls from the community. Five tagging single nucleotide polymorphisms (tSNPs) at ORAI1 were selected from the data of the Han Chinese population in the HapMap project. Clinical status of AS was assessed by the Bath Ankylosing Spondylitis Disease Activity Index (BASDAI), Bath Ankylosing Spondylitis Functional Index (BASFI), and Bath Ankylosing Spondylitis Global Index (BAS-G). Our results indicated that subjects carrying the minor allele homozygote (CC) of the promoter SNP rs12313273 or the TT homozygote of the SNP rs7135617 had an increased risk of HLA-B27 positive AS. The minor allele C of the 3′UTR SNP rs712853 exerted a protective effect against HLA-B27 positive AS. Furthermore, the rs12313273/rs7135617 pairwise allele analysis found that the C-G (OR 1.69, 95% CI 1.27, 2.25; p = 0.0003) and T-T (OR 1.75, 95% CI 1.36, 2.27; p<0.0001) haplotypes had a significant association with the risk of HLA-B27-positive AS in comparison with the T-G carriers. This is the first study to indicate that haplotypes of ORAI1 (rs12313273 and rs7135617) are associated with the risk of HLA-B27 positive AS. PMID:21674042

  1. HLA class I antigen and HLA-A, -B, and -C haplotype frequencies in Uruguayans.

    PubMed

    Alvarez, Ines; Bengochea, Milka; Toledo, Roberto; Carretto, Elena; Hidalgo, Pedro C

    2006-08-01

    HLA class I antigens were determined for 959 unrelated Uruguayans. The predominant HLA alleles were A2, Cw4, and B35, and the most frequently observed two-loci haplotypes were A2-B44 and B35-Cw4. The most frequent three-loci HLA haplotype was A2-Cw5-B44. We compared the Uruguayan sample with similar data from other populations.

  2. HLA-DR2-associated DRB1 and DRB5 alleles and haplotypes in Koreans.

    PubMed

    Song, E Y; Kang, S J; Lee, Y J; Park, M H

    2000-09-01

    There are considerable racial differences in the distribution of HLA-DR2-associated DRB1 and DRB5 alleles and the characteristics of linkage disequilibrium between these alleles. In this study, the frequencies of DR2-associated DRB1 and DRB5 alleles and related haplotypes were analyzed in 186 DR2-positive individuals out of 800 normal Koreans registered as unrelated bone marrow donors. HLA class I antigen typing was performed by the serological method and DRB1 and DRB5 genotyping by the PCR-single strand conformational polymorphism method. Only 3 alleles were detected for DR2-associated DRB1 and DRB5 genes, respectively: DRB1*1501 (gene frequency 8.0%), *1502 (3.2%), *1602 (0.9%); DRB5*0101 (8.0%), *0102 (3.2%), and *0202 (0.9%). DRB1-DRB5 haplotype analysis showed an exclusive association between these alleles: DRB1*1501-DRB5*0101 (haplotype frequency 8.0%), DRB1*1502-DRB5*0102 (3.2%), and DRB1*1602-DRB5*0202 (0.9%). The 5 most common DR2-associated A-B-DRB1 haplotypes occurring at frequencies of ≥0.5% were A24-B52-DRB1*1502 (1.8%), A2-B62-DRB1*1501, A2-B54-DRB1*1501, A26-B61-DRB1*1501, and A24-B51-DRB1*1501. The remarkable homogeneity in the haplotypic associations between DR2-associated DRB1 and DRB5 alleles in Koreans would be advantageous for organ transplantation compared with other ethnic groups showing considerable heterogeneity in the distribution of DRB1-DRB5 haplotypes.

  3. Alpha-globin gene haplotypes in South American Indians.

    PubMed

    Zago, M A; Melo Santos, E J; Clegg, J B; Guerreiro, J F; Martinson, J J; Norwich, J; Figueiredo, M S

    1995-08-01

    The haplotypes of the alpha-globin gene cluster were determined for 99 Indians from the Brazilian Amazon region who belong to 5 tribes: Wayampí, Wayana-Apalaí, Kayapó, Arára, and Yanomámi. Three predominant haplotypes were identified: Ia (present in 38.9% of chromosomes), IIIa (25.8%), and IIe (22.1%). The only alpha-globin gene rearrangement detected was alpha alpha alpha 3.7 I gene triplication associated with haplotype IIIa, found in high frequencies (5.6% and 10.6%) in two tribes and absent in the others. alpha-Globin gene deletions that cause alpha-thalassemia were not seen, supporting the argument that malaria was absent in these populations until recently. The heterogeneous distribution of alpha-globin gene haplotypes and rearrangements among the different tribes differs markedly from the homogeneous distribution of beta-globin gene cluster haplotypes and reflects the action of various genetic mechanisms (genetic drift, founder effect, consanguinity) on small isolated population groups with a complicated history of divergence-fusion events. The alpha-globin gene haplotype distribution has some similarities to distributions observed in Southeast Asian and Pacific Island populations, indicating that these populations have considerable genetic affinities. However, the absence of several features of the alpha-globin gene cluster that are consistently present among the Pacific Islanders suggests that the similarity of haplotypes between Brazilian Indians and people from Polynesia, Micronesia, and Melanesia is more likely to be the result of ancient common ancestry than the consequence of recent direct genetic contribution through immigration.

  4. Y-STR haplotypes of Native American populations from the Brazilian Amazon region.

    PubMed

    Palha, Teresinha Jesus Brabo Ferreira; Rodrigues, Elzemar Martins Ribeiro; dos Santos, Sidney Emanuel Batista

    2010-10-01

    The allele and haplotype frequencies of nine Y-STRs (DYS19, DYS389 I, DYS389 II, DYS390, DYS391, DYS392, DYS393, DYS385 I/II) were determined in a sample of six native tribes from the Brazilian Amazon (Tiriyó, Awa-Guajá, Waiãpi, Urubu-Kaapor, Zoé and Parakanã). Forty-eight different haplotypes were identified, 28 of which were unique. Five haplotypes were very frequent and were shared by over 10 individuals. The estimated haplotype diversity (0.9114) was very low compared to other geographic groups, including Africans, Europeans and Asians. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.
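
    The haplotype diversity figure quoted above is commonly computed with Nei's unbiased estimator, n/(n-1) * (1 - sum of squared haplotype frequencies). The abstract does not state which estimator was used, so the sketch below is only an assumption about the usual approach; the function name and the toy sample are hypothetical and are not study data.

```python
from collections import Counter


def haplotype_diversity(haplotypes):
    """Nei's unbiased haplotype diversity: n/(n-1) * (1 - sum(p_i^2))."""
    n = len(haplotypes)
    if n < 2:
        raise ValueError("need at least two sampled haplotypes")
    counts = Counter(haplotypes)
    # Sum of squared relative frequencies of each distinct haplotype.
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1.0 - sum_sq)


# Hypothetical toy sample of six Y chromosomes (placeholder labels).
sample = ["H1", "H1", "H1", "H2", "H2", "H3"]
print(round(haplotype_diversity(sample), 4))
```

    A sample in which every chromosome carries a different haplotype gives a diversity of 1, and a sample dominated by a few shared haplotypes gives a lower value.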

  5. Distribution of HLA class I alleles and haplotypes in Korean.

    PubMed Central

    Kim, T. G.; Han, H.; Lim, B. U.; Kim, W.; Kim, S. M.

    1993-01-01

    The antigen (phenotype), gene (allele) and haplotype frequencies of HLA class I were analysed in 4,622 Koreans. With allele frequencies of over 0.05, the most frequent HLA-A, -B and -C antigens were A2, A24, A33, A11, A26, A31; B62, B51, B44, B54, B61, B35, B58, B60; Cw3, Cw1, Cw4, Cw7. Of these, A2, A24, Cw1 and Cw3 were present in very high frequencies (0.3211, 0.2200, 0.2204, and 0.3737, respectively). The most common haplotypes with frequencies larger than 0.02 were A2-Blank, A33-B44, A33-B58, A11-B62, A24-B51, A24-B54, A2-B27, B54-Cw1, B58-Cw3, B51-Blank, B61-Cw3, B62-Cw4, B35-Cw3, B44-Blank, B60-Cw3, B27-Cw1, A2-Cw3, A2-Cw1, A24-Cw1, A33-Cw3, A26-Cw3, and A11-Cw4. A significant negative linkage disequilibrium was found for the haplotypes A2-B7, A2-B44, A2-B58, A24-B13, A24-B27, A33-B54 and A33-B62, whose frequencies were larger than 0.003. The B-C and A-C haplotypes that showed significant negative linkage disequilibrium were B44-Cw1, B51-Cw1, B44-Cw3, B62-Blank, A2-Cw4, A2-Blank, A11-Cw3, A11-Blank and A33-Cw1, and they had frequencies higher than 0.01. The findings presented here could be used per se to estimate the populational relationships or as the control data for HLA-disease investigation. Furthermore they could provide the scope for the definition of new antigens. PMID:8240747
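
    Negative linkage disequilibrium, as reported above, means that a two-locus haplotype is rarer than expected from the product of its allele frequencies. A minimal sketch of the standard coefficient D = h_AB - p_A * p_B follows; the function name and all three frequencies are hypothetical placeholders chosen for illustration, not values taken from the study.

```python
def ld_coefficient(hap_freq, p_a, p_b):
    """Linkage disequilibrium coefficient D = h_AB - p_A * p_B."""
    return hap_freq - p_a * p_b


# Hypothetical frequencies: allele A at 0.30, allele B at 0.40,
# and the A-B haplotype observed at 0.08 (expected 0.12 under independence).
d = ld_coefficient(0.08, 0.30, 0.40)
print("negative LD" if d < 0 else "non-negative LD")
```

    Whether a negative D is statistically significant depends on the sample size and the test used, which this sketch does not cover.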

  6. The recombination landscape around forensic STRs: Accurate measurement of genetic distances between syntenic STR pairs using HapMap high density SNP data.

    PubMed

    Phillips, C; Ballard, D; Gill, P; Court, D Syndercombe; Carracedo, A; Lareu, M V

    2012-05-01

    Family studies can be used to measure the genetic distance between same-chromosome (syntenic) STRs in order to detect physical linkage or linkage disequilibrium. However, family studies are expensive and time consuming, in many cases uninformative, and lack a reliable means to infer the phase of the diplotypes obtained. HapMap provides a more comprehensive and fine-scale estimation of recombination rates using high density multi-point SNP data (average inter-SNP distance: 900 nucleotides). Data at this fine scale detect sub-kilobase genetic distances across the whole recombining human genome. We have used the most recent HapMap SNP data release 22 to measure and compare genetic distances, and by inference fine-scale recombination rates, between 29 syntenic STR pairs identified from 39 validated STRs currently available for forensic use. The 39 STRs comprise 23 core loci: SE33, Penta D & E, 13 CODIS and 7 non-CODIS European Standard Set STRs, plus supplementary STRs in the recently released Promega CS-7™ and Qiagen Investigator HDplex™ kits. Also included were D9S1120, a marker we developed for forensic use unique to chromosome 9, and the novel D6S1043 component STR of SinoFiler™ (Applied Biosystems). The data collated provide reliable estimates of recombination rates between each STR pair that can then be placed into haplotype frequency calculators for short pedigrees with multiple meiotic inputs and which require only the addition of allele frequencies. This allows all current STR sets or their combinations to be used in supplemented paternity analyses without the need for further adjustment for physical linkage. The detailed analysis of recombination rates made for autosomal forensic STRs was extended to the more than 50 X chromosome STRs established or in development for complex kinship analyses. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.

  7. µ-Calpain, calpastatin, and growth hormone receptor genetic effects on preweaning performance, carcass quality traits, and residual variance of tenderness in Angus cattle selected to increase minor haplotype ... frequencies

    USDA-ARS's Scientific Manuscript database

    Genetic marker effects and interactions are estimated with poor precision when minor marker allele frequencies are low. An Angus population was subjected to marker assisted selection for multiple years to increase divergent haplotype and minor marker allele frequencies to 1) estimate effect size an...

  8. Lower frequency of the HLA-G UTR-4 haplotype in women with unexplained recurrent miscarriage.

    PubMed

    Meuleman, T; Drabbels, J; van Lith, J M M; Dekkers, O M; Rozemuller, E; Cretu-Stancu, M; Claas, F H J; Bloemenkamp, K W M; Eikmans, M

    2018-04-01

    HLA-G expressed by trophoblasts at the fetal-maternal interface and its soluble form have immunomodulatory effects. HLA-G expression depends on the combination of DNA polymorphisms. We hypothesized that combinations of specific single nucleotide polymorphisms (SNPs) in the 3'untranslated region (3'UTR) of HLA-G play a role in unexplained recurrent miscarriage. In a case control design, 100 cases with at least three unexplained consecutive miscarriages prior to the 20th week of gestation were included. Cases were at time of the third miscarriage younger than 36 years, and they conceived all their pregnancies from the same partner. The control group included 89 women with an uneventful pregnancy. The association of HLA-G 3'UTR SNPs and specific HLA-G haplotype with recurrent miscarriage was studied with logistic regression. Odds ratios (OR) and 95% confidence intervals (95% CI) were reported. Individual SNPs were not significantly associated with recurrent miscarriage after correction for multiple comparisons. However, the presence of the UTR-4 haplotype, which included +3003C, was significantly lower in women with recurrent miscarriage (OR 0.4, 95% CI 0.2-0.8, p = 0.015). In conclusion, this is the first study to perform a comprehensive analysis of HLA-G SNPs and HLA-G haplotypes in a well-defined group of women with recurrent miscarriage and women with uneventful pregnancy. The UTR-4 haplotype was less frequently observed in women with recurrent miscarriage, suggesting an immunoregulatory role of this haplotype for continuation of the pregnancy without complications. Thus, association of HLA-G with recurrent miscarriage is not related to single polymorphisms in the 3'UTR, but is rather dependent on haplotypes. Copyright © 2018 Elsevier B.V. All rights reserved.

  9. Practical interpretation of CYP2D6 haplotypes: Comparison and integration of automated and expert calling.

    PubMed

    Ruaño, Gualberto; Kocherla, Mohan; Graydon, James S; Holford, Theodore R; Makowski, Gregory S; Goethe, John W

    2016-05-01

    We describe a population genetic approach to compare samples interpreted with expert calling (EC) versus automated calling (AC) for CYP2D6 haplotyping. The analysis represents 4812 haplotype calls based on signal data generated by the Luminex xMap analyzers from 2406 patients referred to a high-complexity molecular diagnostics laboratory for CYP450 testing. DNA was extracted from buccal swabs. We compared the results of expert calls (EC) and automated calls (AC) with regard to haplotype number and frequency. The ratio of EC to AC was 1:3. Haplotype frequencies from EC and AC samples were convergent across haplotypes, and their distribution was not statistically different between the groups. Most duplications required EC, as only expansions with homozygous or hemizygous haplotypes could be automatedly called. High-complexity laboratories can offer equivalent interpretation to automated calling for non-expanded CYP2D6 loci, and superior interpretation for duplications. We have validated scientific expert calling specified by scoring rules as standard operating procedure integrated with an automated calling algorithm. The integration of EC with AC is a practical strategy for CYP2D6 clinical haplotyping. Copyright © 2016 Elsevier B.V. All rights reserved.

  10. Detecting disease-predisposing variants: the haplotype method.

    PubMed Central

    Valdes, A M; Thomson, G

    1997-01-01

    For many HLA-associated diseases, multiple alleles-- and, in some cases, multiple loci--have been suggested as the causative agents. The haplotype method for identifying disease-predisposing amino acids in a genetic region is a stratification analysis. We show that, for each haplotype combination containing all the amino acid sites involved in the disease process, the relative frequencies of amino acid variants at sites not involved in disease but in linkage disequilibrium with the disease-predisposing sites are expected to be the same in patients and controls. The haplotype method is robust to mode of inheritance and penetrance of the disease and can be used to determine unequivocally whether all amino acid sites involved in the disease have not been identified. Using a resampling technique, we developed a statistical test that takes account of the nonindependence of the sites sampled. Further, when multiple sites in the genetic region are involved in disease, the test statistic gives a closer fit to the null expectation when some--compared with none--of the true predisposing factors are included in the haplotype analysis. Although the haplotype method cannot distinguish between very highly correlated sites in one population, ethnic comparisons may help identify the true predisposing factors. The haplotype method was applied to insulin-dependent diabetes mellitus (IDDM) HLA class II DQA1-DQB1 data from Caucasian, African, and Japanese populations. Our results indicate that the combination DQA1#52 (Arg predisposing) DQB1#57 (Asp protective), which has been proposed as an important IDDM agent, does not include all the predisposing elements. With rheumatoid arthritis HLA class II DRB1 data, the results were consistent with the shared-epitope hypothesis. PMID:9042931

  11. The "Sardinian" HLA-A30,B18,DR3,DQw2 haplotype constantly lacks the 21-OHA and C4B genes. Is it an ancestral haplotype without duplication?

    PubMed

    Contu, L; Carcassi, C; Dausset, J

    1989-01-01

    The C4 and 21-OH loci of the class III HLA have been studied by specific DNA probes and the restriction enzyme Taq 1 in 24 unrelated Sardinian individuals selected from completely HLA-typed families. All 24 individuals had the HLA extended haplotype A30,Cw5,B18, BfF1,DR3,DRw52,DQw2, named "Sardinian" in the present paper because of its frequency of 15% in the Sardinian population. Eighteen of these were homozygous for the entire haplotype, and six were heterozygous at the A locus and blank (or homozygous) at all the other loci. In all completely homozygous cells and in four heterozygous cells at the A locus, the restriction fragments of the 21-OHA (3.2 kb) and C4B (5.8 kb or 5.4 kb) genes were absent, and the fragments of the C4A (7.0 kb) and 21-OHB (3.7 kb) genes were present. It is suggested that the "Sardinian" haplotype is an ancestral haplotype without duplication of the C4 and 21-OH genes, practically always identical in its structure, also in unrelated individuals. The diversity of this haplotype in the class III region (about 30 kb less) may be at least partially responsible for its misalignment with most haplotypes, which have duplicated C4 and 21-OH genes, and therefore also for its decreased probability to recombine. This can help explain its high stability and frequency in the Sardinian population. The same conclusion can be suggested for the Caucasian extended haplotype A1,B8,DR3 that always seems to lack the C4A and 21-OHA genes.

  12. Fine definition of the pedigree haplotypes of closely related rice cultivars by means of genome-wide discovery of single-nucleotide polymorphisms.

    PubMed

    Yamamoto, Toshio; Nagasaki, Hideki; Yonemaru, Jun-ichi; Ebana, Kaworu; Nakajima, Maiko; Shibaya, Taeko; Yano, Masahiro

    2010-04-27

    To create useful gene combinations in crop breeding, it is necessary to clarify the dynamics of the genome composition created by breeding practices. A large quantity of single-nucleotide polymorphism (SNP) data is required to permit discrimination of chromosome segments among modern cultivars, which are genetically related. Here, we used a high-throughput sequencer to conduct whole-genome sequencing of an elite Japanese rice cultivar, Koshihikari, which is closely related to Nipponbare, whose genome sequencing has been completed. Then we designed a high-throughput typing array based on the SNP information by comparison of the two sequences. Finally, we applied this array to analyze historical representative rice cultivars to understand the dynamics of their genome composition. The total 5.89-Gb sequence for Koshihikari, equivalent to 15.7 x the entire rice genome, was mapped using the Pseudomolecules 4.0 database for Nipponbare. The resultant Koshihikari genome sequence corresponded to 80.1% of the Nipponbare sequence and led to the identification of 67,051 SNPs. A high-throughput typing array consisting of 1917 SNP sites distributed throughout the genome was designed to genotype 151 representative Japanese cultivars that have been grown during the past 150 years. We could identify the ancestral origin of the pedigree haplotypes in 60.9% of the Koshihikari genome and 18 consensus haplotype blocks which are inherited from traditional landraces to current improved varieties. Moreover, it was predicted that modern breeding practices have generally decreased genetic diversity. Detection of genome-wide SNPs by both high-throughput sequencer and typing array made it possible to evaluate genomic composition of genetically related rice varieties. With the aid of their pedigree information, we clarified the dynamics of chromosome recombination during the historical rice breeding process. We also found several genomic regions decreasing genetic diversity which might be

  13. Calpain-10 gene polymorphisms and risk of type 2 diabetes mellitus in Mexican mestizos.

    PubMed

    Picos-Cárdenas, V J; Sáinz-González, E; Miliar-García, A; Romero-Zazueta, A; Quintero-Osuna, R; Leal-Ugarte, E; Peralta-Leal, V; Meza-Espinoza, J P

    2015-03-27

    The calpain-10 gene is expressed primarily in tissues important in glucose metabolism; thus, some of its polymorphisms have been associated with type 2 diabetes. In this study, we examined the association between the calpain-10 single-nucleotide polymorphism (SNP)-43, SNP-19, and SNP-63 and type 2 diabetes in Mexican mestizos. We included 211 patients and 152 non-diabetic subjects. Polymerase chain reaction was used to identify alleles. We compared allele, genotype, haplotype, and diplotype frequencies between both groups and used the chi-square test to calculate the risk. The allele frequency of SNP-43 allele 1 was 70% in controls and 72% in patients; the GG, GA, and AA genotype frequencies were 48.7, 42.8, and 8.5% in controls and 51.2, 41.7, and 7.1% in patients, respectively. For SNP- 19, the prevalence of allele 1 (2R) was 32% in controls and 39% in patients. In controls, homozygosity (2R/2R) was 10.5%, heterozygosity was 42.8%, and 3R/3R was 46.7%; in cases, these values were 13.3, 50.7, and 36.0%, respectively. For SNP-63, the frequency of allele 1 was 87% in controls and 83% in patients; genotype frequencies in controls were 75.7% (CC), 23% (CT), and 1.3% (TT), and were 69.7, 27.5, and 2.8%, respectively for the cases. Genotype distributions were consistent with Hardy-Weinberg equilibrium. No significant intergroup differences for allele, genotype, haplotype, or diplotype frequencies were observed. We found no association between these polymorphisms and diabetes. However, our sample size was small, so the role of calpain-10 risk alleles should be further examined.
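
    The allele frequency and Hardy-Weinberg statements above can be checked by hand. The sketch below assumes control genotype counts of 74 GG, 65 GA and 13 AA, which are reconstructed here from the reported percentages (48.7, 42.8 and 8.5% of 152 controls) and are not stated in the abstract; it also assumes that "allele 1" of SNP-43 is G, and the function name is hypothetical.

```python
def hwe_chi_square(n_aa, n_ab, n_bb):
    """Return (frequency of allele A, chi-square statistic) for a biallelic HWE test."""
    n = n_aa + n_ab + n_bb
    # Each AA homozygote carries two A alleles, each heterozygote one.
    p = (2 * n_aa + n_ab) / (2 * n)
    q = 1.0 - p
    expected = (p * p * n, 2 * p * q * n, q * q * n)
    observed = (n_aa, n_ab, n_bb)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    return p, chi2


# Counts reconstructed from the reported control percentages (an assumption).
p, chi2 = hwe_chi_square(74, 65, 13)
print(f"allele 1 frequency = {p:.3f}, chi-square = {chi2:.3f}")
```

    With these reconstructed counts the allele frequency rounds to the reported 70%, and the statistic falls well below 3.84, the 5% critical value for one degree of freedom, in line with the abstract's statement that genotype distributions were consistent with Hardy-Weinberg equilibrium.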

  14. Haplotype structure in Ashkenazi Jewish BRCA1 and BRCA2 mutation carriers

    PubMed Central

    Im, Kate M.; Kirchhoff, Tomas; Wang, Xianshu; Green, Todd; Chow, Clement Y.; Vijai, Joseph; Korn, Joshua; Gaudet, Mia M.; Fredericksen, Zachary; Pankratz, V. Shane; Guiducci, Candace; Crenshaw, Andrew; McGuffog, Lesley; Kartsonaki, Christiana; Morrison, Jonathan; Healey, Sue; Sinilnikova, Olga M.; Mai, Phuong L.; Greene, Mark H.; Piedmonte, Marion; Rubinstein, Wendy S.; Hogervorst, Frans B.; Rookus, Matti A.; Collée, J. Margriet; Hoogerbrugge, Nicoline; van Asperen, Christi J.; Meijers-Heijboer, Hanne E. J.; Van Roozendaal, Cees E.; Caldes, Trinidad; Perez-Segura, Pedro; Jakubowska, Anna; Lubinski, Jan; Huzarski, Tomasz; Blecharz, Paweł; Nevanlinna, Heli; Aittomäki, Kristiina; Lazaro, Conxi; Blanco, Ignacio; Barkardottir, Rosa B.; Montagna, Marco; D'Andrea, Emma; Devilee, Peter; Olopade, Olufunmilayo I.; Neuhausen, Susan L.; Peissel, Bernard; Bonanni, Bernardo; Peterlongo, Paolo; Singer, Christian F.; Rennert, Gad; Lejbkowicz, Flavio; Andrulis, Irene L.; Glendon, Gord; Ozcelik, Hilmi; Toland, Amanda Ewart; Caligo, Maria Adelaide; Beattie, Mary S.; Chan, Salina; Domchek, Susan M.; Nathanson, Katherine L.; Rebbeck, Timothy R.; Phelan, Catherine; Narod, Steven; John, Esther M.; Hopper, John L.; Buys, Saundra S.; Daly, Mary B.; Southey, Melissa C.; Terry, Mary-Beth; Tung, Nadine; Hansen, Thomas v. O.; Osorio, Ana; Benitez, Javier; Durán, Mercedes; Weitzel, Jeffrey N.; Garber, Judy; Hamann, Ute; Peock, Susan; Cook, Margaret; Oliver, Clare T.; Frost, Debra; Platte, Radka; Evans, D. Gareth; Eeles, Ros; Izatt, Louise; Paterson, Joan; Brewer, Carole; Hodgson, Shirley; Morrison, Patrick J.; Porteous, Mary; Walker, Lisa; Rogers, Mark T.; Side, Lucy E.; Godwin, Andrew K.; Schmutzler, Rita K.; Wappenschmidt, Barbara; Laitman, Yael; Meindl, Alfons; Deissler, Helmut; Varon-Mateeva, Raymonda; Preisler-Adams, Sabine; Kast, Karin; Venat-Bouvet, Laurence; Stoppa-Lyonnet, Dominique; Chenevix-Trench, Georgia; Easton, Douglas F.; Klein, Robert J.; Daly, Mark J.; Friedman, Eitan; Dean, Michael; Clark, Andrew G.; Altshuler, David M.; Antoniou, Antonis C.; Couch, Fergus J.; Offit, Kenneth; Gold, Bert

    2011-01-01

    Three founder mutations in BRCA1 and BRCA2 contribute to the risk of hereditary breast and ovarian cancer in Ashkenazi Jews (AJ). They are observed at increased frequency in the AJ compared to other BRCA mutations in Caucasian non-Jews (CNJ). Several authors have proposed that elevated allele frequencies in the surrounding genomic regions reflect adaptive or balancing selection. Such proposals predict long-range linkage disequilibrium (LD) resulting from a selective sweep, although genetic drift in a founder population may also act to create long-distance LD. To date, few studies have used the tools of statistical genomics to examine the likelihood of long-range LD at a deleterious locus in a population that faced a genetic bottleneck. We studied the genotypes of hundreds of women from a large international consortium of BRCA1 and BRCA2 mutation carriers and found that AJ women exhibited long-range haplotypes compared to CNJ women. More than 50% of the AJ chromosomes with the BRCA1 185delAG mutation share an identical 2.1 Mb haplotype and nearly 16% of AJ chromosomes carrying the BRCA2 6174delT mutation share a 1.4 Mb haplotype. Simulations based on the best inference of Ashkenazi population demography indicate that long-range haplotypes are expected in the context of a genome-wide survey. Our results are consistent with the hypothesis that a local bottleneck effect from population size constriction events could by chance have resulted in the large haplotype blocks observed at high frequency in the BRCA1 and BRCA2 regions of Ashkenazi Jews. PMID:21597964

  15. African gene flow to north Brazil as revealed by HBB*S gene haplotype analysis.

    PubMed

    Lemos Cardoso, Greice; Farias Guerreiro, João

    2006-01-01

    Haplotypes linked to the HBB*S gene were analyzed in a sample of 260 chromosomes of Brazilian sickle cell anemia patients from the population of Belém, state of Pará, to evaluate if the present-day haplotype frequencies correlate as well as expected with historical information on the geographic origin of African slaves sent directly to Northern Brazil. The HBB*S gene haplotype distribution (66% Bantu, 21.8% Benin, 10.9% Senegal, and 1.3% Cameroon) is in agreement with those observed for other Brazilian populations regarding the highest proportion of the Bantu type, followed by the Benin type, but it differs significantly concerning the Senegal type as this haplotype is rare or absent in samples from other Brazilian regions already studied. In addition, our results are in accordance with historical records that establish that about 90% of the slaves sent to Northern Brazil were from Angola, Congo, and Mozambique, where the Bantu haplotype predominates, in contrast to 10% of slaves from Senegambia, Guine-Bissau, and Cape Verde, where the Senegal haplotype is the most common. On the other hand, the observed frequency of the Benin haplotype in Belém was much higher than that expected by historical data. This fact corroborates the suggestion that the high prevalence of the Benin type in Belém is due to domestic slave trade and later internal migrations, mainly from the Northeast, since there are no historical records of direct slave trade from Central West Africa to North Brazil. Am. J. Hum. Biol. 18:93-98, 2006. (c) 2005 Wiley-Liss, Inc.

  16. Identification of specific angiotensin-converting enzyme variants and haplotypes that confer risk and protection against type 2 diabetic nephropathy.

    PubMed

    Ezzidi, Intissar; Mtiraoui, Nabil; Kacem, Maha; Chaieb, Molka; Mahjoub, Touhami; Almawi, Wassim Y

    2009-11-01

    Cross-sectional and family studies identified angiotensin-converting enzyme (ACE) gene as a risk factor for diabetic nephropathy (DN). The contribution of ACE gene variants to DN development and progression is controversial and varies among different ethnic/racial groups. We investigated the association of three ACE gene variants with DN, rs1799752 insertion/deletion (I/D), rs1800764T/C and rs12449782A/G in 917 Tunisian type 2 diabetic (T2DM) patients: 515 with (DN) and 402 without (DWN) nephropathy. ACE genotyping was done by PCR-based assays; haplotype estimation was performed using H-Plus software (χ²-test based). Genotype frequency distributions of the three studied variants were in Hardy-Weinberg equilibrium. Minor allele frequency of rs1800764 was higher in DN patients than DWN patients or healthy controls, and minor allele frequency of rs1799752 was higher in DN than DWN patients. Higher frequency of rs1799752 and rs1800764 homozygous mutant genotypes was seen in DN compared to DWN patients. Of the three variants, only rs1799752 deletion/deletion (D/D) genotype was associated with a significant increase in albumin-to-creatinine ratio, and D/D carriers had elevated low-density lipoprotein, total cholesterol and urea. Three-locus haplotype [rs1799752(I/D)/rs1800764(T/C)/rs12449782(A/G)] analysis revealed that the frequency of the DCG haplotype was higher, while that of the ITG and ICA haplotypes was lower among unselected type 2 diabetic patients. Taking ITA haplotype as reference, multivariate regression analysis confirmed the negative (ITG), and positive (DCG, DTG, DCA and DTA) association of specific ACE haplotypes with DN, after adjusting for potential nephropathy-linked covariates. Our results support the involvement of specific ACE variants in DN pathogenesis and demonstrate the presence of DN-specific haplotypes at the ACE locus.

  17. New generation pharmacogenomic tools: a SNP linkage disequilibrium Map, validated SNP assay resource, and high-throughput instrumentation system for large-scale genetic studies.

    PubMed

    De La Vega, Francisco M; Dailey, David; Ziegle, Janet; Williams, Julie; Madden, Dawn; Gilbert, Dennis A

    2002-06-01

    Since public and private efforts announced the first draft of the human genome last year, researchers have reported great numbers of single nucleotide polymorphisms (SNPs). We believe that the availability of well-mapped, quality SNP markers constitutes the gateway to a revolution in genetics and personalized medicine that will lead to better diagnosis and treatment of common complex disorders. A new generation of tools and public SNP resources for pharmacogenomic and genetic studies--specifically for candidate-gene, candidate-region, and whole-genome association studies--will form part of the new scientific landscape. This will only be possible through the greater accessibility of SNP resources and superior high-throughput instrumentation-assay systems that enable affordable, highly productive large-scale genetic studies. We are contributing to this effort by developing a high-quality linkage disequilibrium SNP marker map and an accompanying set of ready-to-use, validated SNP assays across every gene in the human genome. This effort incorporates both the public sequence and SNP data sources, and Celera Genomics' human genome assembly and enormous resource of physically mapped SNPs (approximately 4,000,000 unique records). This article discusses our approach and methodology for designing the map, choosing quality SNPs, designing and validating these assays, and obtaining population frequency of the polymorphisms. We also discuss an advanced, high-performance SNP assay chemistry--a new generation of the TaqMan probe-based, 5' nuclease assay--and high-throughput instrumentation-software system for large-scale genotyping. We provide the new SNP map and validation information, validated SNP assays and reagents, and instrumentation systems as a novel resource for genetic discoveries.

  18. Association of pro-ghrelin and GHS-R1A gene polymorphisms and haplotypes with heavy alcohol use and body mass.

    PubMed

    Landgren, Sara; Jerlhag, Elisabet; Zetterberg, Henrik; Gonzalez-Quintela, Arturo; Campos, Joaquin; Olofsson, Ulrica; Nilsson, Staffan; Blennow, Kaj; Engel, Jörgen A

    2008-12-01

    Ghrelin, an orexigenic peptide, acts on growth hormone secretagogue receptors (GHS-R1A), expressed in the hypothalamus as well as in important reward nodes such as the ventral tegmental area. Interestingly, ghrelin has been found to activate an important part of the reward systems, i.e., the cholinergic-dopaminergic reward link. Additionally, the rewarding and neurochemical properties of alcohol are, at least in part, mediated via this reward link. There is comorbidity between alcohol dependence and eating disorders. Thus, plasma levels of ghrelin are altered in patients with addictive behaviors such as alcohol and nicotine dependence and in binge eating disorder. This overlap prompted us to investigate the pro-ghrelin and GHS-R1A genes in a haplotype analysis of heavy alcohol-using individuals. A total of 417 Spanish individuals (abstainers, moderate, and heavy alcohol drinkers) were investigated in a haplotype analysis of the pro-ghrelin and GHS-R1A genes. Tag SNPs were chosen using HapMap data and the Tagger and Haploview software. These SNPs were then genotyped using TaqMan Allelic Discrimination. SNP rs2232165 of the GHS-R1A gene was associated with heavy alcohol consumption and SNP rs2948694 of the same gene as well as haplotypes of both the pro-ghrelin and the GHS-R1A genes were associated with body mass in heavy alcohol consuming individuals. The present findings are the first to disclose an association between the pro-ghrelin and GHS-R1A genes and heavy alcohol use, further strengthening the role of the ghrelin system in addictive behaviors and brain reward.

  19. High-Resolution SNP/CGH Microarrays Reveal the Accumulation of Loss of Heterozygosity in Commonly Used Candida albicans Strains

    PubMed Central

    Abbey, Darren; Hickman, Meleah; Gresham, David; Berman, Judith

    2011-01-01

    Phenotypic diversity can arise rapidly through loss of heterozygosity (LOH) or by the acquisition of copy number variations (CNV) spanning whole chromosomes or shorter contiguous chromosome segments. In Candida albicans, a heterozygous diploid yeast pathogen with no known meiotic cycle, homozygosis and aneuploidy alter clinical characteristics, including drug resistance. Here, we developed a high-resolution microarray that simultaneously detects ∼39,000 single nucleotide polymorphism (SNP) alleles and ∼20,000 copy number variation loci across the C. albicans genome. An important feature of the array analysis is a computational pipeline that determines SNP allele ratios based upon chromosome copy number. Using the array and analysis tools, we constructed a haplotype map (hapmap) of strain SC5314 to assign SNP alleles to specific homologs, and we used it to follow the acquisition of loss of heterozygosity (LOH) and copy number changes in a series of derived laboratory strains. This high-resolution SNP/CGH microarray and the associated hapmap facilitated the phasing of alleles in lab strains and revealed detrimental genome changes that arose frequently during molecular manipulations of laboratory strains. Furthermore, it provided a useful tool for rapid, high-resolution, and cost-effective characterization of changes in allele diversity as well as changes in chromosome copy number in new C. albicans isolates. PMID:22384363

  20. Efficient algorithms for polyploid haplotype phasing.

    PubMed

    He, Dan; Saha, Subrata; Finkers, Richard; Parida, Laxmi

    2018-05-09

    Inference of haplotypes, or the sequence of alleles along the same chromosomes, is a fundamental problem in genetics and is a key component for many analyses including admixture mapping, identifying regions of identity by descent and imputation. Haplotype phasing based on sequencing reads has attracted considerable attention. Diploid haplotype phasing, where the two haplotypes are complementary, has been studied extensively. In this work, we focused on polyploid haplotype phasing, where we aim to phase more than two haplotypes at the same time from sequencing data. The problem is much more complicated as the search space becomes much larger and the haplotypes do not need to be complementary any more. We proposed two algorithms: (1) Poly-Harsh, a Gibbs sampling based algorithm which alternately samples haplotypes and the read assignments to minimize the mismatches between the reads and the phased haplotypes, and (2) an efficient algorithm to concatenate haplotype blocks into contiguous haplotypes. Our experiments showed that our method is able to improve the quality of the phased haplotypes over the state-of-the-art methods. To our knowledge, our algorithm for haplotype blocks concatenation is the first algorithm that leverages the shared information across multiple individuals to construct contiguous haplotypes. Our experiments showed that it is both efficient and effective.
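
    The objective described above, minimizing mismatches between reads and phased haplotypes, can be illustrated with a small scoring function. This is only a sketch of that kind of mismatch objective under assumed conventions (alleles coded 0/1, '-' for sites a read does not cover); it is not the Poly-Harsh sampler itself, and the function names and toy data are hypothetical.

```python
def mismatches(read, haplotype):
    """Count covered sites where a read disagrees with a haplotype ('-' = not covered)."""
    return sum(1 for r, h in zip(read, haplotype) if r != "-" and r != h)


def mismatch_score(reads, haplotypes):
    """Assign each read to its best-matching haplotype and sum the remaining mismatches."""
    return sum(min(mismatches(r, h) for h in haplotypes) for r in reads)


# Hypothetical triploid example: three haplotypes over four biallelic sites.
haplotypes = ["0101", "1100", "0011"]
reads = ["01--", "-10-", "--11", "0111"]
print(mismatch_score(reads, haplotypes))
```

    A phasing method of this kind searches over candidate haplotype sets for one with a low score; note that in the polyploid case no haplotype has to be the complement of another, so each must be represented explicitly.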

  1. snpAD: An ancient DNA genotype caller.

    PubMed

    Prüfer, Kay

    2018-06-21

    The study of ancient genomes can elucidate the evolutionary past. However, analyses are complicated by base-modifications in ancient DNA molecules that result in errors in DNA sequences. These errors are particularly common near the ends of sequences and pose a challenge for genotype calling. I describe an iterative method that estimates genotype frequencies and errors along sequences to allow for accurate genotype calling from ancient sequences. The implementation of this method, called snpAD, performs well on high-coverage ancient data, as shown by simulations and by subsampling the data of a high-coverage Neandertal genome. Although estimates for low-coverage genomes are less accurate, I am able to derive approximate estimates of heterozygosity from several low-coverage Neandertals. These estimates show that low heterozygosity, compared to modern humans, was common among Neandertals. The C++ code of snpAD is freely available at http://bioinf.eva.mpg.de/snpAD/. Supplementary data are available at Bioinformatics online.

  2. Association between GAB2 haplotype and higher glucose metabolism in Alzheimer's disease-affected brain regions in cognitively normal APOEε4 carriers

    PubMed Central

    Liang, Winnie S.; Chen, Kewei; Lee, Wendy; Sidhar, Kunal; Corneveaux, Jason J.; Allen, April N.; Myers, Amanda; Villa, Stephen; Meechoovet, Bessie; Pruzin, Jeremy; Bandy, Daniel; Fleisher, Adam S.; Langbaum, Jessica B.S.; Huentelman, Matthew J.; Jensen, Kendall; Dunckley, Travis; Caselli, Richard J.; Kaib, Susan; Reiman, Eric M.

    2010-01-01

    In a genome-wide association study (GWAS) of late-onset Alzheimer's disease (AD), we found an association between common haplotypes of the GAB2 gene and AD risk in carriers of the apolipoprotein E (APOE) ε4 allele, the major late-onset AD susceptibility gene. We previously proposed the use of fluorodeoxyglucose positron emission tomography (FDG-PET) measurements as a quantitative presymptomatic endophenotype, more closely related to disease risk than the clinical syndrome itself, to help evaluate putative genetic and non-genetic modifiers of AD risk. In this study, we examined the relationship between the presence or absence of the relatively protective GAB2 haplotype and PET measurements of regional-to-whole brain FDG uptake in several AD-affected brain regions in 158 cognitively normal late-middle-aged APOEε4 homozygotes, heterozygotes, and non-carriers. GAB2 haplotypes were characterized using Affymetrix Genome-Wide Human SNP 6.0 Array data from each of these subjects. As predicted, the possibly protective GAB2 haplotype was associated with higher regional-to-whole brain FDG uptake in AD-affected brain regions in APOEε4 carriers. While additional studies are needed, this study supports the association between the possibly protective GAB2 haplotype and the risk of late-onset AD in APOEε4 carriers. It also supports the use of brain-imaging endophenotypes to help assess possible modifiers of AD risk. PMID:20888920

  3. Relative extended haplotype homozygosity signals across breeds reveal dairy and beef specific signatures of selection.

    PubMed

    Bomba, Lorenzo; Nicolazzi, Ezequiel L; Milanesi, Marco; Negrini, Riccardo; Mancini, Giordano; Biscarini, Filippo; Stella, Alessandra; Valentini, Alessio; Ajmone-Marsan, Paolo

    2015-04-02

    A number of methods are available to scan a genome for selection signatures by evaluating patterns of diversity within and between breeds. Among these, "extended haplotype homozygosity" (EHH) is a reliable approach to detect genome regions under recent selective pressure. The objective of this study was to use this approach to identify regions that are under recent positive selection and shared by the most representative Italian dairy and beef cattle breeds. A total of 3220 animals from Italian Holstein (2179), Italian Brown (775), Simmental (493), Marchigiana (485) and Piedmontese (379) breeds were genotyped with the Illumina BovineSNP50 BeadChip v.1. After standard quality control procedures, genotypes were phased and core haplotypes were identified. The decay of linkage disequilibrium (LD) for each core haplotype was assessed by measuring the EHH. Since accurate estimates of local recombination rates were not available, relative EHH (rEHH) was calculated for each core haplotype. Genomic regions that carry frequent core haplotypes and with significant rEHH values were considered as candidates for recent positive selection. Candidate regions were aligned across breeds to identify signals shared by dairy or beef cattle breeds. Overall, 82 and 87 common regions were detected among dairy and beef cattle breeds, respectively. Bioinformatic analysis identified 244 and 232 genes in these common genomic regions. Gene annotation and pathway analysis showed that these genes are involved in molecular functions that are biologically related to milk or meat production. Our results suggest that a multi-breed approach can lead to the identification of genomic signatures in breeds of cattle that are selected for the same production goal and thus to the localisation of genomic regions of interest in dairy and beef production.
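
    The EHH statistic mentioned in this abstract can be sketched as follows, using its usual definition: among chromosomes carrying a given core haplotype, the probability that two chromosomes drawn at random are identical over the whole interval from the core out to a chosen marker. The code is an illustrative sketch, not the pipeline used in the study; the function name, the string encoding of phased haplotypes and the toy data are assumptions, and rEHH is not computed.

```python
from collections import Counter
from math import comb


def ehh(haplotypes, core_start, core_end, core_allele, extend_to):
    """EHH of one core haplotype, extended to marker index `extend_to` (inclusive).

    haplotypes  : list of phased haplotype strings of equal length
    core_start, core_end : slice bounds of the core region (end exclusive)
    core_allele : the core haplotype string to condition on
    """
    carriers = [h for h in haplotypes if h[core_start:core_end] == core_allele]
    n = len(carriers)
    if n < 2:
        return float("nan")  # homozygosity is undefined for fewer than two chromosomes
    lo = min(core_start, extend_to)
    hi = max(core_end, extend_to + 1)
    # Group carriers by their extended haplotype and count identical pairs.
    groups = Counter(h[lo:hi] for h in carriers)
    return sum(comb(c, 2) for c in groups.values()) / comb(n, 2)


# Toy data: six phased chromosomes, six markers, core at indices 2..3.
haps = ["001101", "001101", "001100", "011101", "000011", "001110"]
decay = [ehh(haps, 2, 4, "11", x) for x in (3, 4, 5)]
```

    In this toy set, `decay` falls as the interval is extended away from the core, which is the LD decay the abstract refers to.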

  4. SNP-RFLPing 2: an updated and integrated PCR-RFLP tool for SNP genotyping.

    PubMed

    Chang, Hsueh-Wei; Cheng, Yu-Huei; Chuang, Li-Yeh; Yang, Cheng-Hong

    2010-04-08

    PCR-restriction fragment length polymorphism (RFLP) assay is a cost-effective method for SNP genotyping and mutation detection, but the manual mining for restriction enzyme sites is challenging and cumbersome. Three years after we constructed SNP-RFLPing, a freely accessible database and analysis tool for restriction enzyme mining of SNPs, significant improvements over the 2006 version have been made and incorporated into the latest version, SNP-RFLPing 2. The primary aim of SNP-RFLPing 2 is to provide comprehensive PCR-RFLP information with multiple functionalities for SNPs, such as SNP retrieval for multiple species, different polymorphism types (bi-allelic, tri-allelic, tetra-allelic or indels), gene-centric searching, HapMap tagSNPs, gene ontology-based searching, miRNAs, and SNP500Cancer. The RFLP restriction enzymes and the corresponding PCR primers for the natural and mutagenic types of each SNP are simultaneously analyzed. All the RFLP restriction enzyme prices are also provided to aid selection. Furthermore, the previously encountered updating problems for most SNP related databases are resolved by an on-line retrieval system. The user interfaces for functional SNP analyses have been substantially improved and integrated. SNP-RFLPing 2 offers a new and user-friendly interface for RFLP genotyping that can be used in association studies and is freely available at http://bio.kuas.edu.tw/snp-rflping2.
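
    The manual task this tool automates, finding a restriction enzyme whose recognition site is created or destroyed by a SNP, can be sketched in a few lines. This is an illustration only and does not reflect how SNP-RFLPing 2 is implemented. The four recognition sequences are the standard ones for these enzymes; the function names and the toy amplicon are assumptions. All four sites are palindromic, so scanning one strand is sufficient here.

```python
# Recognition sequences of four common restriction enzymes (all palindromic).
ENZYMES = {
    "EcoRI": "GAATTC",
    "BamHI": "GGATCC",
    "HindIII": "AAGCTT",
    "TaqI": "TCGA",
}


def count_sites(seq, site):
    """Count occurrences (overlapping allowed) of a recognition site in a sequence."""
    return sum(1 for i in range(len(seq) - len(site) + 1) if seq[i:i + len(site)] == site)


def discriminating_enzymes(left, alleles, right):
    """Return {enzyme: {allele: site_count}} for enzymes that cut the alleles differently."""
    result = {}
    for name, site in ENZYMES.items():
        counts = {a: count_sites(left + a + right, site) for a in alleles}
        if len(set(counts.values())) > 1:
            result[name] = counts
    return result


# Toy amplicon: an A/G SNP inside a potential EcoRI site, GA[A/G]TTC.
hits = discriminating_enzymes("ACGTGA", ["A", "G"], "TTCGGTAC")
```

    An enzyme listed in `hits` would give different fragment patterns for the two alleles after digestion of the PCR product, which is the basis of PCR-RFLP genotyping.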

  5. Determination of haplotypes at structurally complex regions using emulsion haplotype fusion PCR.

    PubMed

    Tyson, Jess; Armour, John A L

    2012-12-11

    Genotyping and massively-parallel sequencing projects result in a vast amount of diploid data that is only rarely resolved into its constituent haplotypes. It is nevertheless this phased information that is transmitted from one generation to the next and is most directly associated with biological function and the genetic causes of biological effects. Despite progress made in genome-wide sequencing and phasing algorithms and methods, problems assembling (and reconstructing linear haplotypes in) regions of repetitive DNA and structural variation remain. These dynamic and structurally complex regions are often poorly understood from a sequence point of view. Regions such as these that are highly similar in their sequence tend to be collapsed onto the genome assembly. This in turn means downstream determination of the true sequence haplotype in these regions poses a particular challenge. For structurally complex regions, a more focussed approach to assembling haplotypes may be required. In order to investigate reconstruction of spatial information at structurally complex regions, we have used an emulsion haplotype fusion PCR approach to reproducibly link sequences of up to 1kb in length to allow phasing of multiple variants from neighbouring loci, using allele-specific PCR and sequencing to detect the phase. By using emulsion systems linking flanking regions to amplicons within the CNV, this led to the reconstruction of a 59kb haplotype across the DEFA1A3 CNV in HapMap individuals. This study has demonstrated a novel use for emulsion haplotype fusion PCR in addressing the issue of reconstructing structural haplotypes at multiallelic copy variable regions, using the DEFA1A3 locus as an example.

  6. Determination of haplotypes at structurally complex regions using emulsion haplotype fusion PCR

    PubMed Central

    2012-01-01

    Background: Genotyping and massively-parallel sequencing projects result in a vast amount of diploid data that is only rarely resolved into its constituent haplotypes. It is nevertheless this phased information that is transmitted from one generation to the next and is most directly associated with biological function and the genetic causes of biological effects. Despite progress made in genome-wide sequencing and phasing algorithms and methods, problems assembling (and reconstructing linear haplotypes in) regions of repetitive DNA and structural variation remain. These dynamic and structurally complex regions are often poorly understood from a sequence point of view. Regions such as these that are highly similar in their sequence tend to be collapsed onto the genome assembly. This in turn means downstream determination of the true sequence haplotype in these regions poses a particular challenge. For structurally complex regions, a more focussed approach to assembling haplotypes may be required. Results: In order to investigate reconstruction of spatial information at structurally complex regions, we have used an emulsion haplotype fusion PCR approach to reproducibly link sequences of up to 1kb in length to allow phasing of multiple variants from neighbouring loci, using allele-specific PCR and sequencing to detect the phase. By using emulsion systems linking flanking regions to amplicons within the CNV, this led to the reconstruction of a 59kb haplotype across the DEFA1A3 CNV in HapMap individuals. Conclusion: This study has demonstrated a novel use for emulsion haplotype fusion PCR in addressing the issue of reconstructing structural haplotypes at multiallelic copy variable regions, using the DEFA1A3 locus as an example. PMID:23231411

  7. A risk PRODH haplotype affects sensorimotor gating, memory, schizotypy, and anxiety in healthy male subjects.

    PubMed

    Roussos, Panos; Giakoumaki, Stella G; Bitsios, Panos

    2009-06-15

    Significant associations have been shown for haplotypes comprising three PRODH single nucleotide polymorphisms (SNPs; 1945T/C, 1766A/G, 1852G/A) located in the 3' region of the gene, suggesting a role of these variants in the etiopathogenesis of schizophrenia. We assessed the relationship between these high-risk PRODH polymorphisms and schizophrenia-related endophenotypes in a large and highly homogeneous cohort of healthy males. Participants (n = 217) were tested in prepulse inhibition (PPI), verbal and working memory, trait anxiety and schizotypy. The QTPHASE from the UNPHASED package was used for the association analysis of each SNP or haplotype data. This procedure revealed significant phenotypic impact of the risk CGA haplotype. Subjects were then divided into two groups; levels of PPI, anxiety, and schizotypy, verbal and working memory were compared with analysis of variance. CGA carriers (n = 32) exhibited attenuated PPI (p < .001) and verbal memory (p < .001) and higher anxiety (p < .004) and schizotypy (p < .008) compared with the noncarriers (n = 185). There were no differences in baseline startle, demographics, and working memory. The main significant correlations were schizotypy x PPI [85-dB, 120-msec trials] in the carriers and schizotypy x anxiety in the entire group and the noncarriers but not the carriers group. Our results strongly support PPI as a valid schizophrenia endophenotype and highlight the importance of examining the role of risk haplotypes on multiple endophenotypes and have implications for understanding the continuum from normality to psychosis, transitional states, and the genetics of schizophrenia-related traits.

  8. Geographic distribution of haplotype diversity at the bovine casein locus

    PubMed Central

    Jann, Oliver C; Ibeagha-Awemu, Eveline M; Özbeyaz, Ceyhan; Zaragoza, Pilar; Williams, John L; Ajmone-Marsan, Paolo; Lenstra, Johannes A; Moazami-Goudarzi, Katy; Erhardt, Georg

    2004-01-01

    The genetic diversity of the casein locus in cattle was studied on the basis of haplotype analysis. Consideration of recently described genetic variants of the casein genes which to date have not been the subject of diversity studies, allowed the identification of new haplotypes. Genotyping of 30 cattle breeds from four continents revealed a geographically associated distribution of haplotypes, mainly defined by frequencies of alleles at CSN1S1 and CSN3. The genetic diversity within taurine breeds in Europe was found to decrease significantly from the south to the north and from the east to the west. Such geographic patterns of cattle genetic variation at the casein locus may be a result of the domestication process of modern cattle as well as geographically differentiated natural or artificial selection. The comparison of African Bos taurus and Bos indicus breeds allowed the identification of several Bos indicus specific haplotypes (CSN1S1*C-CSN2*A2-CSN3*AI/CSN3*H) that are not found in pure taurine breeds. The occurrence of such haplotypes in southern European breeds also suggests that an introgression of indicine genes into taurine breeds could have contributed to the distribution of the genetic variation observed. PMID:15040901

  9. Compression and fast retrieval of SNP data

    PubMed Central

    Sambo, Francesco; Di Camillo, Barbara; Toffolo, Gianna; Cobelli, Claudio

    2014-01-01

    Motivation: The increasing interest in rare genetic variants and epistatic genetic effects on complex phenotypic traits is currently pushing genome-wide association study design towards datasets of increasing size, both in the number of studied subjects and in the number of genotyped single nucleotide polymorphisms (SNPs). This, in turn, is leading to a compelling need for new methods for compression and fast retrieval of SNP data. Results: We present a novel algorithm and file format for compressing and retrieving SNP data, specifically designed for large-scale association studies. Our algorithm is based on two main ideas: (i) compress linkage disequilibrium blocks in terms of differences with a reference SNP and (ii) compress reference SNPs exploiting information on their call rate and minor allele frequency. Tested on two SNP datasets and compared with several state-of-the-art software tools, our compression algorithm is shown to be competitive in terms of compression rate and to outperform all tools in terms of time to load compressed data. Availability and implementation: Our compression and decompression algorithms are implemented in a C++ library, are released under the GNU General Public License and are freely downloadable from http://www.dei.unipd.it/~sambofra/snpack.html. Contact: sambofra@dei.unipd.it or cobelli@dei.unipd.it. PMID:25064564
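
    Idea (i) in this abstract, storing the SNPs of a linkage disequilibrium block as differences from a reference SNP, can be illustrated with a short sketch. It is not the authors' file format or algorithm: the 0/1/2 genotype coding, the choice of the first SNP as reference and the function names are all assumptions made for illustration.

```python
def compress_block(block):
    """Store the first SNP of an LD block in full and the others as sparse differences.

    block : list of genotype vectors (one per SNP, one entry per subject, coded 0/1/2)
    Returns (reference, deltas), where each delta is a list of (subject_index, genotype).
    """
    reference = block[0]
    deltas = [
        [(i, g) for i, (g, r) in enumerate(zip(snp, reference)) if g != r]
        for snp in block[1:]
    ]
    return reference, deltas


def decompress_block(reference, deltas):
    """Rebuild the full block from the reference SNP and the per-SNP differences."""
    block = [list(reference)]
    for delta in deltas:
        snp = list(reference)
        for i, g in delta:
            snp[i] = g
        block.append(snp)
    return block


# Toy LD block: three SNPs typed on six subjects, strongly correlated.
block = [
    [0, 1, 2, 0, 1, 0],
    [0, 1, 2, 0, 1, 1],
    [0, 1, 1, 0, 1, 0],
]
reference, deltas = compress_block(block)
restored = decompress_block(reference, deltas)
```

    The stronger the LD within a block, the shorter the difference lists become, which is why grouping SNPs by LD block before encoding is attractive.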

  10. HLA-A, B and DRB1 allele and haplotype frequencies in volunteer bone marrow donors from the north of Parana State.

    PubMed

    Bardi, Marlene Silva; Jarduli, Luciana Ribeiro; Jorge, Adylson Justino; Camargo, Rossana Batista Oliveira Godoy; Carneiro, Fernando Pagotto; Gelinski, Jair Roberto; Silva, Roseclei Assunção Feliciano; Lavado, Edson Lopes

    2012-01-01

    Knowledge of allele and haplotype frequencies of the human leukocyte antigen (HLA) system is important in the search for unrelated bone marrow donors. The Brazilian population is very heterogeneous and the HLA system is highly informative of populations because of the high level of polymorphisms. The aim of this study was to characterize the immunogenetic profile of ethnic groups (Caucasians, Afro-Brazilians and Asians) in the north of Parana State. A study was carried out of 3978 voluntary bone marrow donors registered in the Brazilian National Bone Marrow Donor Registry and typed for the HLA-A, B and DRB1 (low resolution) loci. The alleles were characterized by the polymerase chain reaction sequence-specific oligonucleotides method using the LabType SSO kit (One Lambda, CA, USA). The ARLEQUIN v.3.11 computer program was used to calculate allele and haplotype frequencies. The most common alleles found in Caucasians were HLA-A*02, 24, 01; HLA-B*35, 44, 51; DRB1*11, 13, 07; for Afro-Brazilians they were HLA-A*02, 03, 30; HLA-B*35, 15, 44; DRB1*13, 11, 03; and for Asians they were: HLA-A*24, 02, 26; HLA-B*40, 51, 52; DRB1*04, 15, 09. The most common haplotype combinations were: HLA-A*01, B*08, DRB1*03 and HLA-A*29, B*44, DRB1*07 for Caucasians; HLA-A*29, B*44, DRB1*07 and HLA-A*01, B*08 and DRB1*03 for Afro-Brazilians; and HLA-A*24, B*52, DRB1*15 and HLA-A*24, B*40 and DRB1*09 for Asians. There is a need to target and expand bone marrow donor campaigns in the north of Parana State. The data of this study may be used as a reference by the Instituto Nacional de Cancer/Brazilian National Bone Marrow Donor Registry to evaluate the immunogenetic profile of populations in specific regions and in the selection of bone marrow donors.

  11. HLA-A, B and DRB1 allele and haplotype frequencies in volunteer bone marrow donors from the north of Parana State

    PubMed Central

    Bardi, Marlene Silva; Jarduli, Luciana Ribeiro; Jorge, Adylson Justino; Camargo, Rossana Batista Oliveira Godoy; Carneiro, Fernando Pagotto; Gelinski, Jair Roberto; Silva, Roseclei Assunção Feliciano; Lavado, Edson Lopes

    2012-01-01

    Background: Knowledge of allele and haplotype frequencies of the human leukocyte antigen (HLA) system is important in the search for unrelated bone marrow donors. The Brazilian population is very heterogeneous and the HLA system is highly informative of populations because of the high level of polymorphisms. Aim: The aim of this study was to characterize the immunogenetic profile of ethnic groups (Caucasians, Afro-Brazilians and Asians) in the north of Parana State. Methods: A study was carried out of 3978 voluntary bone marrow donors registered in the Brazilian National Bone Marrow Donor Registry and typed for the HLA-A, B and DRB1 (low resolution) loci. The alleles were characterized by the polymerase chain reaction sequence-specific oligonucleotides method using the LabType SSO kit (One Lambda, CA, USA). The ARLEQUIN v.3.11 computer program was used to calculate allele and haplotype frequencies. Results: The most common alleles found in Caucasians were HLA-A*02, 24, 01; HLA-B*35, 44, 51; DRB1*11, 13, 07; for Afro-Brazilians they were HLA-A*02, 03, 30; HLA-B*35, 15, 44; DRB1*13, 11, 03; and for Asians they were: HLA-A*24, 02, 26; HLA-B*40, 51, 52; DRB1*04, 15, 09. The most common haplotype combinations were: HLA-A*01, B*08, DRB1*03 and HLA-A*29, B*44, DRB1*07 for Caucasians; HLA-A*29, B*44, DRB1*07 and HLA-A*01, B*08 and DRB1*03 for Afro-Brazilians; and HLA-A*24, B*52, DRB1*15 and HLA-A*24, B*40 and DRB1*09 for Asians. Conclusion: There is a need to target and expand bone marrow donor campaigns in the north of Parana State. The data of this study may be used as a reference by the Instituto Nacional de Cancer/Brazilian National Bone Marrow Donor Registry to evaluate the immunogenetic profile of populations in specific regions and in the selection of bone marrow donors. PMID:23049380

  12. Neuropsychiatric systemic lupus erythematosus is associated with imbalance in interleukin 10 promoter haplotypes

    PubMed Central

    Rood, M; Keijsers, V; van der Linden, M W; Tong, T; Borggreve, S; Verweij, C; Breedveld, F; Huizinga, T

    1999-01-01

    OBJECTIVE—To investigate the association of interleukin 10 (IL10) promoter polymorphisms and neuropsychiatric manifestations of systemic lupus erythematosus (SLE).
METHODS—IL10 haplotypes of 11 healthy volunteers were cloned to confirm that in the Dutch population, only the three common haplotypes (-1082/-819/-592) GCC, ACC and ATA exist. The IL10 promoter polymorphisms of 92 SLE patients and 162 healthy controls were determined. The medical records of the SLE patients were screened for the presence of neuropsychiatric involvement.
RESULTS—All cloned haplotypes were either GCC, ACC or ATA. Forty two SLE patients had suffered from neuropsychiatric manifestations (NP-SLE). In NP-SLE patients, the frequency of the ATA haplotype is 30% versus 18% in the controls and 17% in the non-NP-SLE group (odds ratios 1.9, p=0.02, and 2.1, p=0.04, respectively), whereas the GCC haplotype frequency is lower in the NP-SLE group compared with controls and non-NP-SLE patients (40% versus 55% and 61%, odds ratios 0.6, p=0.02 and 0.4 p=0.006). The odds ratio for the presence of NP-SLE is inversely proportional to the number of GCC haplotypes per genotype when the NP-SLE group is compared with non-NP-SLE patients.
CONCLUSIONS—The IL10 locus is associated with neuropsychiatric manifestations in SLE. This suggests that IL10 is implicated in the immunopathogenesis of neuropsychiatric manifestations in SLE.

 Keywords: systemic lupus erythematosus; neuropsychiatric manifestations; genetics; interleukin 10 promoter haplotypes PMID:10343522
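
    The odds ratios reported in this abstract can be checked approximately from the haplotype frequencies it quotes. The sketch below treats each percentage as the probability of carrying the haplotype and forms the ratio of odds; the function name is hypothetical. Because the quoted percentages are rounded, the computed values land close to, but not exactly on, the published figures.

```python
def odds_ratio(freq_cases, freq_controls):
    """Odds ratio from two haplotype frequencies given as proportions in (0, 1)."""
    odds_cases = freq_cases / (1.0 - freq_cases)
    odds_controls = freq_controls / (1.0 - freq_controls)
    return odds_cases / odds_controls


# Frequencies quoted in the abstract (NP-SLE group versus each comparison group).
ata_vs_controls = odds_ratio(0.30, 0.18)    # reported odds ratio: 1.9
ata_vs_non_np = odds_ratio(0.30, 0.17)      # reported odds ratio: 2.1
gcc_vs_controls = odds_ratio(0.40, 0.55)    # reported odds ratio: 0.6
gcc_vs_non_np = odds_ratio(0.40, 0.61)      # reported odds ratio: 0.4
```

    The p-values in the abstract cannot be reproduced this way, since they depend on the underlying haplotype counts rather than on the percentages alone.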

  13. A comprehensive literature review of haplotyping software and methods for use with unrelated individuals.

    PubMed

    Salem, Rany M; Wessel, Jennifer; Schork, Nicholas J

    2005-03-01

    Interest in the assignment and frequency analysis of haplotypes in samples of unrelated individuals has increased immeasurably as a result of the emphasis placed on haplotype analyses by, for example, the International HapMap Project and related initiatives. Although there are many available computer programs for haplotype analysis applicable to samples of unrelated individuals, many of these programs have limitations and/or very specific uses. In this paper, the key features of available haplotype analysis software for use with unrelated individuals, as well as pooled DNA samples from unrelated individuals, are summarised. Programs for haplotype analysis were identified through keyword searches on PUBMED and various internet search engines, a review of citations from retrieved papers and personal communications, up to June 2004. Priority was given to functioning computer programs, rather than theoretical models and methods. The available software was considered in light of a number of factors: the algorithm(s) used, algorithm accuracy, assumptions, the accommodation of genotyping error, implementation of hypothesis testing, handling of missing data, software characteristics and web-based implementations. Review papers comparing specific methods and programs are also summarised. Forty-six haplotyping programs were identified and reviewed. The programs were divided into two groups: those designed for individual genotype data (a total of 43 programs) and those designed for use with pooled DNA samples (a total of three programs). The accuracy of the programs is assessed using various criteria, and the programs are categorised and discussed in light of: algorithm and method, accuracy, assumptions, genotyping error, hypothesis testing, missing data, software characteristics and web implementation.
Many available programs have limitations (eg some cannot accommodate missing data) and/or are designed with specific tasks in mind (eg estimating haplotype frequencies rather than

  14. Ancient mitochondrial haplotypes and evidence for intragenic recombination in a gynodioecious plant.

    PubMed

    Städler, Thomas; Delph, Lynda F

    2002-09-03

    Because of their extremely low nucleotide mutation rates, plant mitochondrial genes are generally not expected to show variation within species. Remarkably, we found nine distinct cytochrome b sequence haplotypes in the gynodioecious alpine plant Silene acaulis, with two or more haplotypes coexisting locally in each of three sampled regions. Moreover, there is evidence for intragenic recombination in the history of the haplotype sample, implying at least transient heteroplasmy of mitochondrial DNA (mtDNA). Heteroplasmy might be achieved by one of two potential mechanisms, either continuous coexistence of subgenomic fragments in low stoichiometry, or occasional paternal leakage of mtDNA. On the basis of levels of synonymous nucleotide substitutions, the average divergence time between haplotypes is estimated to be at least 15 million years. Ancient coalescence of extant haplotypes is further indicated by the paucity of fixed differences in haplotypes obtained from related species, a pattern expected under trans-specific evolution. Our data are consistent with models of frequency-dependent selection on linked cytoplasmic male-sterility factors, the putative molecular basis of females in gynodioecious populations. However, associations between marker loci and the inferred male-sterility genes can be maintained only with very low rates of recombination. Heteroplasmy and recombination between divergent haplotypes imply unexplored consequences for the evolutionary dynamics of gynodioecy, a widespread plant breeding system.

  15. Discovery, evaluation and distribution of haplotypes of the wheat Ppd-D1 gene.

    PubMed

    Guo, Zhiai; Song, Yanxia; Zhou, Ronghua; Ren, Zhenglong; Jia, Jizeng

    2010-02-01

    Ppd-D1 is one of the most potent genes affecting the photoperiod response of wheat (Triticum aestivum). Only two alleles, insensitive Ppd-D1a and sensitive Ppd-D1b, were known previously, and these did not adequately explain the broad adaptation of wheat to photoperiod variation. In this study, five diagnostic molecular markers were employed to identify Ppd-D1 haplotypes in 492 wheat varieties from diverse geographic locations and 55 accessions of Aegilops tauschii, the D genome donor species of wheat. Six Ppd-D1 haplotypes, designated I-VI, were identified. Types II, V and VI were considered to be more ancient and types I, III and IV were considered to be derived from type II. The transcript abundances of the Ppd-D1 haplotypes showed continuous variation, being highest for haplotype I, lowest for haplotype III, and correlating negatively with varietal differences in heading time. These haplotypes also significantly affected other agronomic traits. The distribution frequency of Ppd-D1 haplotypes showed partial correlations with both latitudes and altitudes of wheat cultivation regions. The evolution, expression and distribution of Ppd-D1 haplotypes were consistent evidentially with each other. What was regarded as a pair of alleles in the past can now be considered a series of alleles leading to continuous variation.

  16. BAsE-Seq: a method for obtaining long viral haplotypes from short sequence reads.

    PubMed

    Hong, Lewis Z; Hong, Shuzhen; Wong, Han Teng; Aw, Pauline P K; Cheng, Yan; Wilm, Andreas; de Sessions, Paola F; Lim, Seng Gee; Nagarajan, Niranjan; Hibberd, Martin L; Quake, Stephen R; Burkholder, William F

    2014-01-01

    We present a method for obtaining long haplotypes, of over 3 kb in length, using a short-read sequencer, Barcode-directed Assembly for Extra-long Sequences (BAsE-Seq). BAsE-Seq relies on transposing a template-specific barcode onto random segments of the template molecule and assembling the barcoded short reads into complete haplotypes. We applied BAsE-Seq on mixed clones of hepatitis B virus and accurately identified haplotypes occurring at frequencies greater than or equal to 0.4%, with >99.9% specificity. Applying BAsE-Seq to a clinical sample, we obtained over 9,000 viral haplotypes, which provided an unprecedented view of hepatitis B virus population structure during chronic infection. BAsE-Seq is readily applicable for monitoring quasispecies evolution in viral diseases.

  17. The JAK2 GGCC (46/1) Haplotype in Myeloproliferative Neoplasms: Causal or Random?

    PubMed

    Anelli, Luisa; Zagaria, Antonella; Specchia, Giorgina; Albano, Francesco

    2018-04-11

    The germline JAK2 haplotype known as "GGCC or 46/1 haplotype" (haplotype GGCC_46/1) consists of a combination of single nucleotide polymorphisms (SNPs) mapping in a region of about 250 kb, extending from the JAK2 intron 10 to the Insulin-like 4 (INSL4) gene. Four main SNPs (rs3780367, rs10974944, rs12343867, and rs1159782) generating a "GGCC" combination are more frequently indicated to represent the JAK2 haplotype. These SNPs are inherited together and are frequently associated with the onset of myeloproliferative neoplasms (MPN) positive for both JAK2 V617 and exon 12 mutations. The association between the JAK2 haplotype GGCC_46/1 and mutations in other genes, such as thrombopoietin receptor (MPL) and calreticulin (CALR), or the association with triple negative MPN, is still controversial. This review provides an overview of the frequency and the role of the JAK2 haplotype GGCC_46/1 in the pathogenesis of different myeloid neoplasms and describes the hypothetical mechanisms at the basis of the association with JAK2 gene mutations. Moreover, possible clinical implications are discussed, as different papers reported contrasting data about the correlation between the JAK2 haplotype GGCC_46/1 and blood cell count, survival, or disease progression.

  18. Haplotypes and gene expression implicate the MAPT region for Parkinson disease

    PubMed Central

    Tobin, J.E.; Latourelle, J.C.; Lew, M.F.; Klein, C.; Suchowersky, O.; Shill, H.A.; Golbe, L.I.; Mark, M.H.; Growdon, J.H.; Wooten, G.F.; Racette, B.A.; Perlmutter, J.S.; Watts, R.; Guttman, M.; Baker, K.B.; Goldwurm, S.; Pezzoli, G.; Singer, C.; Saint-Hilaire, M.H.; Hendricks, A.E.; Williamson, S.; Nagle, M.W.; Wilk, J.B.; Massood, T.; Laramie, J.M.; DeStefano, A.L.; Litvan, I.; Nicholson, G.; Corbett, A.; Isaacson, S.; Burn, D.J.; Chinnery, P.F.; Pramstaller, P.P.; Sherman, S.; Al-hinti, J.; Drasby, E.; Nance, M.; Moller, A.T.; Ostergaard, K.; Roxburgh, R.; Snow, B.; Slevin, J.T.; Cambi, F.; Gusella, J.F.; Myers, R.H.

    2009-01-01

    Background: Microtubule-associated protein tau (MAPT) has been associated with several neurodegenerative disorders including forms of parkinsonism and Parkinson disease (PD). We evaluated the association of the MAPT region with PD in a large cohort of familial PD cases recruited by the GenePD Study. In addition, postmortem brain samples from patients with PD and neurologically normal controls were used to evaluate whether the expression of the 3-repeat and 4-repeat isoforms of MAPT, and neighboring genes Saitohin (STH) and KIAA1267, are altered in PD cerebellum. Methods: Twenty-one single-nucleotide polymorphisms (SNPs) in the region of MAPT on chromosome 17q21 were genotyped in the GenePD Study. Single SNPs and haplotypes, including the H1 haplotype, were evaluated for association to PD. Relative quantification of gene expression was performed using real-time RT-PCR. Results: After adjusting for multiple comparisons, SNP rs1800547 was significantly associated with PD affection. While the H1 haplotype was associated with a significantly increased risk for PD, a novel H1 subhaplotype was identified that predicted a greater increased risk for PD. The expression of 4-repeat MAPT, STH, and KIAA1267 was significantly increased in PD brains relative to controls. No difference in expression was observed for 3-repeat MAPT. Conclusions: This study supports a role for MAPT in the pathogenesis of familial and idiopathic Parkinson disease (PD). Interestingly, the results of the gene expression studies suggest that other genes in the vicinity of MAPT, specifically STH and KIAA1267, may also have a role in PD and suggest complex effects for the genes in this region on PD risk. PMID:18509094

  19. A genome-wide SNP scan accelerates trait-regulatory genomic loci identification in chickpea

    PubMed Central

    Kujur, Alice; Bajaj, Deepak; Upadhyaya, Hari D.; Das, Shouvik; Ranjan, Rajeev; Shree, Tanima; Saxena, Maneesha S.; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C.L.L.; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.

    2015-01-01

    We identified 44844 high-quality SNPs by sequencing 92 diverse chickpea accessions belonging to a seed and pod trait-specific association panel using reference genome- and de novo-based GBS (genotyping-by-sequencing) assays. A GWAS (genome-wide association study) in an association panel of 211, including the 92 sequenced accessions, identified 22 major genomic loci showing significant association (explaining 23–47% phenotypic variation) with pod and seed number/plant and 100-seed weight. Eighteen trait-regulatory major genomic loci underlying 13 robust QTLs were validated and mapped on an intra-specific genetic linkage map by QTL mapping. A combinatorial approach of GWAS, QTL mapping and gene haplotype-specific LD mapping and transcript profiling uncovered one superior haplotype and favourable natural allelic variants in the upstream regulatory region of a CesA-type cellulose synthase (Ca_Kabuli_CesA3) gene regulating high pod and seed number/plant (explaining 47% phenotypic variation) in chickpea. The up-regulation of this superior gene haplotype correlated with increased transcript expression of Ca_Kabuli_CesA3 gene in the pollen and pod of high pod/seed number accession, resulting in higher cellulose accumulation for normal pollen and pollen tube growth. A rapid combinatorial genome-wide SNP genotyping-based approach has potential to dissect complex quantitative agronomic traits and delineate trait-regulatory genomic loci (candidate genes) for genetic enhancement in crop plants, including chickpea. PMID:26058368

  20. Sequence polymorphism at the human apolipoprotein AII gene (APOA2): unexpected deficit of variation in an African-American sample.

    PubMed

    Fullerton, Stephanie M; Clark, Andrew G; Weiss, Kenneth M; Taylor, Scott L; Stengård, Jari H; Salomaa, Veikko; Boerwinkle, Eric; Nickerson, Deborah A

    2002-07-01

    A 3.3-kb region, encompassing the APOA2 gene and 2 kb of 5' and 3' flanking DNA, was re-sequenced in a "core" sample of 24 individuals, sampled without regard to health, from each of three populations: African-Americans from Jackson (Miss., USA), Europeans from North Karelia (Finland), and non-Hispanic European-Americans from Rochester (Minn., USA). Fifteen variable sites were identified (14 SNPs and one multi-allelic microsatellite, all silent), and these sites segregated as 18 sequence haplotypes (or nine, if SNPs only are considered). The haplotype distribution in the core African-American sample was unusual, with a deficit of particular haplotypes compared with those found in the other two samples, and a significantly (P<0.05) low level of nucleotide diversity relative to patterns of polymorphism and divergence at other human loci. Six of the 14 SNPs, whose variation captured the haplotype structure of the core data, were then genotyped by oligonucleotide ligation assay in an additional 2183 individuals from the same three populations (n=843, n=452, and n=888, respectively). All six sites varied in each of the larger "epidemiological" samples, and together, they defined 19 SNP haplotypes, seven with relative frequencies greater than 1% in the total sample; all of these common haplotypes had been identified earlier in the core re-sequencing survey. Here also, the African-American sample showed significantly lower SNP heterozygosity and haplotype diversity than the other two samples. The deficit of polymorphism is consistent with a population-specific non-neutral increase in the relative frequency of several haplotypes in Jackson.

  1. Whole genome SNP discovery and analysis of genetic diversity in Turkey (Meleagris gallopavo)

    PubMed Central

    2012-01-01

    Background The turkey (Meleagris gallopavo) is an important agricultural species and the second largest contributor to the world’s poultry meat production. Genetic improvement is attributed largely to selective breeding programs that rely on highly heritable phenotypic traits, such as body size and breast muscle development. Commercial breeding with small effective population sizes and epistasis can result in loss of genetic diversity, which in turn can lead to reduced individual fitness and reduced response to selection. The presence of genomic diversity in domestic livestock species is therefore of great importance and a prerequisite for rapid and accurate genetic improvement of selected breeds in various environments, as well as to facilitate rapid adaptation to potential changes in breeding goals. Genomic selection requires a large number of genetic markers such as single nucleotide polymorphisms (SNPs), the most abundant source of genetic variation within the genome. Results Alignment of next generation sequencing data of 32 individual turkeys from different populations was used for the discovery of 5.49 million SNPs, which subsequently were used for the analysis of genetic diversity among the different populations. All of the commercial lines branched from a single node relative to the heritage varieties and the South Mexican turkey population. Heterozygosity of all individuals from the different turkey populations ranged from 0.17-2.73 SNPs/Kb, while heterozygosity of populations ranged from 0.73-1.64 SNPs/Kb. The average frequency of heterozygous SNPs in individual turkeys was 1.07 SNPs/Kb. Five genomic regions with very low nucleotide variation were identified in domestic turkeys that showed state of fixation towards alleles different than wild alleles. Conclusion The turkey genome is much less diverse with a relatively low frequency of heterozygous SNPs as compared to other livestock species like chicken and pig. The whole genome SNP discovery

  2. Genome-wide SNP and haplotype analyses reveal a rich history underlying dog domestication

    PubMed Central

    vonHoldt, Bridgett M.; Pollinger, John P.; Lohmueller, Kirk E.; Han, Eunjung; Parker, Heidi G.; Quignon, Pascale; Degenhardt, Jeremiah D.; Boyko, Adam R.; Earl, Dent A.; Auton, Adam; Reynolds, Andy; Bryc, Kasia; Brisbin, Abra; Knowles, James C.; Mosher, Dana S.; Spady, Tyrone C.; Elkahloun, Abdel; Geffen, Eli; Pilot, Malgorzata; Jedrzejewski, Wlodzimierz; Greco, Claudia; Randi, Ettore; Bannasch, Danika; Wilton, Alan; Shearman, Jeremy; Musiani, Marco; Cargill, Michelle; Jones, Paul G.; Qian, Zuwei; Huang, Wei; Ding, Zhao-Li; Zhang, Ya-ping; Bustamante, Carlos D.; Ostrander, Elaine A.; Novembre, John; Wayne, Robert K.

    2010-01-01

    Advances in genome technology have facilitated a new understanding of the historical and genetic processes crucial to rapid phenotypic evolution under domestication. To understand the process of dog diversification better, we conducted an extensive genome-wide survey of more than 48,000 single nucleotide polymorphisms in dogs and their wild progenitor, the grey wolf. Here we show that dog breeds share a higher proportion of multi-locus haplotypes unique to grey wolves from the Middle East, indicating that they are a dominant source of genetic diversity for dogs rather than wolves from east Asia, as suggested by mitochondrial DNA sequence data. Furthermore, we find a surprising correspondence between genetic and phenotypic/functional breed groupings but there are exceptions that suggest phenotypic diversification depended in part on the repeated crossing of individuals with novel phenotypes. Our results show that Middle Eastern wolves were a critical source of genome diversity, although interbreeding with local wolf populations clearly occurred elsewhere in the early history of specific lineages. More recently, the evolution of modern dog breeds seems to have been an iterative process that drew on a limited genetic toolkit to create remarkable phenotypic diversity. PMID:20237475

  3. Genome-wide SNP and haplotype analyses reveal a rich history underlying dog domestication.

    PubMed

    Vonholdt, Bridgett M; Pollinger, John P; Lohmueller, Kirk E; Han, Eunjung; Parker, Heidi G; Quignon, Pascale; Degenhardt, Jeremiah D; Boyko, Adam R; Earl, Dent A; Auton, Adam; Reynolds, Andy; Bryc, Kasia; Brisbin, Abra; Knowles, James C; Mosher, Dana S; Spady, Tyrone C; Elkahloun, Abdel; Geffen, Eli; Pilot, Malgorzata; Jedrzejewski, Wlodzimierz; Greco, Claudia; Randi, Ettore; Bannasch, Danika; Wilton, Alan; Shearman, Jeremy; Musiani, Marco; Cargill, Michelle; Jones, Paul G; Qian, Zuwei; Huang, Wei; Ding, Zhao-Li; Zhang, Ya-Ping; Bustamante, Carlos D; Ostrander, Elaine A; Novembre, John; Wayne, Robert K

    2010-04-08

    Advances in genome technology have facilitated a new understanding of the historical and genetic processes crucial to rapid phenotypic evolution under domestication. To understand the process of dog diversification better, we conducted an extensive genome-wide survey of more than 48,000 single nucleotide polymorphisms in dogs and their wild progenitor, the grey wolf. Here we show that dog breeds share a higher proportion of multi-locus haplotypes unique to grey wolves from the Middle East, indicating that they are a dominant source of genetic diversity for dogs rather than wolves from east Asia, as suggested by mitochondrial DNA sequence data. Furthermore, we find a surprising correspondence between genetic and phenotypic/functional breed groupings but there are exceptions that suggest phenotypic diversification depended in part on the repeated crossing of individuals with novel phenotypes. Our results show that Middle Eastern wolves were a critical source of genome diversity, although interbreeding with local wolf populations clearly occurred elsewhere in the early history of specific lineages. More recently, the evolution of modern dog breeds seems to have been an iterative process that drew on a limited genetic toolkit to create remarkable phenotypic diversity.

  4. Compression and fast retrieval of SNP data.

    PubMed

    Sambo, Francesco; Di Camillo, Barbara; Toffolo, Gianna; Cobelli, Claudio

    2014-11-01

    The increasing interest in rare genetic variants and epistatic genetic effects on complex phenotypic traits is currently pushing genome-wide association study design towards datasets of increasing size, both in the number of studied subjects and in the number of genotyped single nucleotide polymorphisms (SNPs). This, in turn, is leading to a compelling need for new methods for compression and fast retrieval of SNP data. We present a novel algorithm and file format for compressing and retrieving SNP data, specifically designed for large-scale association studies. Our algorithm is based on two main ideas: (i) compress linkage disequilibrium blocks in terms of differences with a reference SNP and (ii) compress reference SNPs exploiting information on their call rate and minor allele frequency. Tested on two SNP datasets and compared with several state-of-the-art software tools, our compression algorithm is shown to be competitive in terms of compression rate and to outperform all tools in terms of time to load compressed data. Our compression and decompression algorithms are implemented in a C++ library, are released under the GNU General Public License and are freely downloadable from http://www.dei.unipd.it/~sambofra/snpack.html. © The Author 2014. Published by Oxford University Press. All rights reserved.
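
    Idea (i) above, storing the SNPs of a linkage disequilibrium block as differences from a reference SNP, can be illustrated with a minimal Python sketch. This is not the authors' file format or their C++ library: the 0/1/2 genotype coding and the names encode_block and decode_snp are assumptions made only for illustration.

```python
from typing import Dict, List, Tuple

# Assumed coding: one value per subject, 0, 1 or 2 copies of the minor allele.
Genotypes = List[int]
Diffs = List[Tuple[int, int]]  # (subject index, genotype) where a SNP differs


def encode_block(reference: Genotypes, others: Dict[str, Genotypes]) -> Dict[str, Diffs]:
    """Store each SNP of a block as its differences from the reference SNP."""
    encoded: Dict[str, Diffs] = {}
    for name, genotypes in others.items():
        if len(genotypes) != len(reference):
            raise ValueError(f"{name}: length differs from the reference SNP")
        encoded[name] = [
            (i, g) for i, (g, r) in enumerate(zip(genotypes, reference)) if g != r
        ]
    return encoded


def decode_snp(reference: Genotypes, diffs: Diffs) -> Genotypes:
    """Rebuild a SNP by applying its stored differences to the reference."""
    restored = list(reference)
    for i, g in diffs:
        restored[i] = g
    return restored


if __name__ == "__main__":
    reference = [0, 1, 2, 0, 1]
    block = {"snp_b": [0, 1, 2, 0, 2], "snp_c": [0, 1, 2, 0, 1]}
    encoded = encode_block(reference, block)
    print(encoded)
    print(decode_snp(reference, encoded["snp_b"]))
```

    SNPs in strong linkage disequilibrium with the reference differ at few subjects, so their difference lists stay short; that is the property the sketch relies on.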

  5. RNA-Seq identifies SNP markers for growth traits in rainbow trout.

    PubMed

    Salem, Mohamed; Vallejo, Roger L; Leeds, Timothy D; Palti, Yniv; Liu, Sixin; Sabbagh, Annas; Rexroad, Caird E; Yao, Jianbo

    2012-01-01

    Fast growth is an important and highly desired trait, which affects the profitability of food animal production, with feed costs accounting for the largest proportion of production costs. Traditional phenotype-based selection is typically used to select for growth traits; however, genetic improvement is slow over generations. Single nucleotide polymorphisms (SNPs) explain 90% of the genetic differences between individuals; therefore, they are most suitable for genetic evaluation and strategies that employ molecular genetics for selective breeding. SNPs found within or near a coding sequence are of particular interest because they are more likely to alter the biological function of a protein. We aimed to use SNPs to identify markers and genes associated with genetic variation in growth. RNA-Seq whole-transcriptome analysis of pooled cDNA samples from a population of rainbow trout selected for improved growth versus unselected genetic cohorts (10 fish from 1 full-sib family each) identified SNP markers associated with growth-rate. The allelic imbalances (the ratio between the allele frequencies of the fast growing sample and that of the slow growing sample) were considered at scores >5.0 as an amplification and <0.2 as loss of heterozygosity. A subset of SNPs (n = 54) were validated and evaluated for association with growth traits in 778 individuals of a three-generation parent/offspring panel representing 40 families. Twenty-two SNP markers and one mitochondrial haplotype were significantly associated with growth traits. Polymorphism of 48 of the markers was confirmed in other commercially important aquaculture stocks. Many markers were clustered into genes of metabolic energy production pathways and are suitable candidates for genetic selection. The study demonstrates that RNA-Seq at low sequence coverage of divergent populations is a fast and effective means of identifying SNPs, with allelic imbalances between phenotypes. This technique is suitable for marker
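
    The allelic-imbalance score described above is a ratio of two allele frequencies with cut-offs at 5.0 and 0.2. A minimal Python sketch of that rule follows; the function names, the example frequencies and the "no call" label for intermediate scores are assumptions for illustration, not values or terms from the study.

```python
def allelic_imbalance(freq_fast: float, freq_slow: float) -> float:
    """Ratio of the allele frequency in the fast-growing sample to that in the slow-growing sample."""
    if freq_slow <= 0:
        raise ValueError("allele frequency in the slow-growing sample must be positive")
    return freq_fast / freq_slow


def classify_imbalance(freq_fast: float, freq_slow: float) -> str:
    """Apply the thresholds quoted in the abstract (>5.0 and <0.2)."""
    score = allelic_imbalance(freq_fast, freq_slow)
    if score > 5.0:
        return "amplification"
    if score < 0.2:
        return "loss of heterozygosity"
    return "no call"  # hypothetical label for scores between the two cut-offs


if __name__ == "__main__":
    # Hypothetical allele frequencies, chosen only to exercise each branch.
    for fast, slow in [(0.6, 0.1), (0.05, 0.5), (0.3, 0.3)]:
        print(fast, slow, classify_imbalance(fast, slow))
```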

  6. The distribution of HLA haplotypes in the ethnic groups that make up the Brazilian Bone Marrow Volunteer Donor Registry (REDOME).

    PubMed

    Halagan, Michael; Oliveira, Danielli Cristina; Maiers, Martin; Fabreti-Oliveira, Raquel A; Moraes, Maria Elisa Hue; Visentainer, Jeane Eliete Laguila; Pereira, Noemi Farah; Romero, Matilde; Cardoso, Juliana Fernandes; Porto, Luís Cristóvão

    2018-04-26

    The Registries of Bone Marrow Donors around the world include more than 30 million volunteer donors from 57 different countries, and were responsible for over 17,000 hematopoietic stem cell transplants in 2016. The Brazilian Bone Marrow Volunteer Donor Registry (REDOME) was established in 1993 and is the third largest registry in the world with more than 4.3 million donors. We characterized HLA allele and haplotype frequencies from REDOME, comparing them with the donor self-reported race group classification. Five-locus haplotype frequencies (A~C~B~DRB1~DQB1) were estimated for each of the six race groups, resolving phase and allelic ambiguity using the expectation-maximization (EM) algorithm. The top 100 haplotypes in the race groups were separated into eight clusters of haplotypes, based on haplotype similarity, using CLUTO. We present HLA allele and haplotype frequency data from six race groups from 2,938,259 individuals from REDOME. The most frequent haplotype was the same for all groups: A*01:01g~C*07:01g~B*08:01g~DRB1*03:01g~DQB1*02:01g. Some frequent haplotypes, such as A*02:01g~C*16:01g~B*44:03~DRB1*07:01g~DQB1*02:01g, were not found in the Preta (Sub-Saharan African descent) group. A cluster including Branca (European) and Parda or non-informed (admixed) could be distinguished from both Preta (Sub-Saharan) and Indígena (Amerindian) groups, and from the Amarela (Asian) ones, which clustered with their original population. These results have implications for cross-population matching and can help in donor searches and population-based recruitment strategies.
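
    The expectation-maximization step mentioned above can be illustrated on a much smaller problem. The sketch below, in Python, estimates haplotype frequencies for two biallelic loci, where only double heterozygotes have ambiguous phase. It is a textbook simplification under stated assumptions, not the five-locus multi-allelic procedure used for REDOME; the function name, the genotype coding and the example data are hypothetical.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple

HAPLOTYPES = ("AB", "Ab", "aB", "ab")


def em_haplotype_frequencies(
    genotypes: Iterable[Tuple[int, int]], n_iter: int = 100, tol: float = 1e-10
) -> Dict[str, float]:
    """Estimate two-locus haplotype frequencies from unphased genotypes.

    Each genotype is (copies of allele A at locus 1, copies of allele B at
    locus 2), with each value in {0, 1, 2}.
    """
    counts = Counter(genotypes)
    n_chrom = 2 * sum(counts.values())
    if n_chrom == 0:
        raise ValueError("no genotypes supplied")

    # Haplotype counts that are certain: phase is known unless both loci are heterozygous.
    fixed = dict.fromkeys(HAPLOTYPES, 0.0)
    for (g1, g2), n in counts.items():
        if g1 not in (0, 1, 2) or g2 not in (0, 1, 2):
            raise ValueError(f"invalid genotype {(g1, g2)}")
        if (g1, g2) == (1, 1):
            continue
        locus1 = ["A"] * g1 + ["a"] * (2 - g1)
        locus2 = ["B"] * g2 + ["b"] * (2 - g2)
        for x, y in zip(locus1, locus2):
            fixed[x + y] += n

    n_double_het = counts[(1, 1)]
    freq = dict.fromkeys(HAPLOTYPES, 0.25)
    for _ in range(n_iter):
        # E-step: probability that a double heterozygote is AB/ab rather than Ab/aB.
        cis = freq["AB"] * freq["ab"]
        trans = freq["Ab"] * freq["aB"]
        w = cis / (cis + trans) if cis + trans > 0 else 0.5
        expected = dict(fixed)
        expected["AB"] += n_double_het * w
        expected["ab"] += n_double_het * w
        expected["Ab"] += n_double_het * (1 - w)
        expected["aB"] += n_double_het * (1 - w)
        # M-step: frequencies are expected counts divided by the number of chromosomes.
        new_freq = {h: expected[h] / n_chrom for h in HAPLOTYPES}
        converged = max(abs(new_freq[h] - freq[h]) for h in HAPLOTYPES) < tol
        freq = new_freq
        if converged:
            break
    return freq


if __name__ == "__main__":
    # Hypothetical sample: 4 AB/AB, 4 ab/ab and 2 double heterozygotes.
    sample = [(2, 2)] * 4 + [(0, 0)] * 4 + [(1, 1)] * 2
    print(em_haplotype_frequencies(sample))
```

    With more loci and multi-allelic genes the same two steps apply, but every genotype expands into many candidate haplotype pairs, which is why registry-scale estimation needs dedicated software.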

  7. Nucleotide-binding oligomerization domain containing 1 (NOD1) haplotypes and single nucleotide polymorphisms modify susceptibility to inflammatory bowel diseases in a New Zealand caucasian population: a case-control study

    PubMed Central

    Huebner, Claudia; Ferguson, Lynnette R; Han, Dug Yeo; Philpott, Martin; Barclay, Murray L; Gearry, Richard B; McCulloch, Alan; Demmers, Pieter S; Browning, Brian L

    2009-01-01

    Background The nucleotide-binding oligomerization domain containing 1 (NOD1) gene encodes a pattern recognition receptor that senses pathogens, leading to downstream responses characteristic of innate immunity. We investigated the role of NOD1 single nucleotide polymorphisms (SNPs) on IBD risk in a New Zealand Caucasian population, and studied Nod1 expression in response to bacterial invasion in the Caco2 cell line. Findings DNA samples from 388 Crohn's disease (CD), 405 ulcerative colitis (UC), 27 indeterminate colitis patients and 201 randomly selected controls, from Canterbury, New Zealand were screened for 3 common SNPs in NOD1, using the MassARRAY® iPLEX Gold assay. Transcriptional activation of the protein produced by NOD1 (Nod1) was studied after infection of Caco2 cells with Escherichia coli LF82. Carrying the rs2075818 G allele decreased the risk of CD (OR = 0.66, 95% CI = 0.50–0.88, p < 0.002) but not UC. There was an increased frequency of the three SNP (rs2075818, rs2075822, rs2907748) haplotype, CTG (p = 0.004) and a decreased frequency of the GTG haplotype (p = 0.02) in CD. The rs2075822 CT or TT genotypes were at an increased frequency (genotype p value = 0.02), while the rs2907748 AA or AG genotypes showed decreased frequencies in UC (p = 0.04), but not in CD. Functional assays showed that Nod1 is produced 6 hours after bacterial invasion of the Caco2 cell line. Conclusion The NOD1 gene is important in signalling invasion of colonic cells by pathogenic bacteria, indicative of its key role in innate immunity. Carrying specific SNPs in this gene significantly modifies the risk of CD and/or UC in a New Zealand Caucasian population. PMID:19327158

  8. High frequency of C9orf72 hexanucleotide repeat expansion in amyotrophic lateral sclerosis patients from two founder populations sharing the same risk haplotype.

    PubMed

    Goldstein, Orly; Gana-Weisz, Mali; Nefussy, Beatrice; Vainer, Batel; Nayshool, Omri; Bar-Shira, Anat; Traynor, Bryan J; Drory, Vivian E; Orr-Urtreger, Avi

    2018-04-01

    We characterized the C9orf72 hexanucleotide repeat expansion (RE) mutation, its frequency, and genotype-phenotype correlations in amyotrophic lateral sclerosis (ALS) patients of 2 distinct origins, Ashkenazi and North African Jews (AJ, NAJ). In AJ, 80% of familial ALS (fALS) and 11% of sporadic ALS carried the RE, a total of 12.9% of all AJ-ALS compared to 0.3% in AJ controls (odds ratio [OR] = 44.3, p < 0.0001). In NAJ, 10% of fALS and 9% of sporadic ALS carried the RE, a total of 9.1% of all NAJ-ALS compared to 1% in controls (OR = 9.9, p = 0.0006). We identified a risk haplotype shared among all ALS patients, although associations with age at disease onset, fALS, and dementia were observed only in AJ. Variations were identified downstream of the repeats. The risk haplotype and these polymorphisms were at high frequencies in alleles with 8 repeats or more, suggesting sequence instability. The different genotype-phenotype correlations and OR, together with the large range in age at onset, suggest that other modifiers and risk factors may affect penetrance and phenotype in ALS. Copyright © 2017 Elsevier Inc. All rights reserved.

  9. Cost-effective HLA typing with tagging SNPs predicts celiac disease risk haplotypes in the Finnish, Hungarian, and Italian populations.

    PubMed

    Koskinen, Lotta; Romanos, Jihane; Kaukinen, Katri; Mustalahti, Kirsi; Korponay-Szabo, Ilma; Barisani, Donatella; Bardella, Maria Teresa; Ziberna, Fabiana; Vatta, Serena; Széles, György; Pocsai, Zsuzsa; Karell, Kati; Haimila, Katri; Adány, Róza; Not, Tarcisio; Ventura, Alessandro; Mäki, Markku; Partanen, Jukka; Wijmenga, Cisca; Saavalainen, Päivi

    2009-04-01

    Human leukocyte antigen (HLA) genes, located on chromosome 6p21.3, have a crucial role in susceptibility to various autoimmune and inflammatory diseases, such as celiac disease and type 1 diabetes. Certain HLA heterodimers, namely DQ2 (encoded by the DQA1*05 and DQB1*02 alleles) and DQ8 (DQA1*03 and DQB1*0302), are necessary for the development of celiac disease. Traditional genotyping of HLA genes is laborious, time-consuming, and expensive. A novel HLA-genotyping method, using six HLA-tagging single-nucleotide polymorphisms (SNPs) and suitable for high-throughput approaches, was described recently. Our aim was to validate this method in the Finnish, Hungarian, and Italian populations. The six previously reported HLA-tagging SNPs were genotyped in patients with celiac disease and in healthy individuals from Finland, Hungary, and two distinct regions of Italy. The potential of this method was evaluated in analyzing how well the tag SNP results correlate with the HLA genotypes previously determined using traditional HLA-typing methods. Using the tagging SNP method, it is possible to determine the celiac disease risk haplotypes accurately in Finnish, Hungarian, and Italian populations, with specificity and sensitivity ranging from 95% to 100%. In addition, it predicts homozygosity and heterozygosity for a risk haplotype, allowing studies on genotypic risk effects. The method is transferable between populations and therefore suited for large-scale research studies and screening of celiac disease among high-risk individuals or at the population level.

  10. Development of allele-specific primer PCR for a swine TLR2 SNP and comparison of the frequency among several pig breeds of Japan and the Czech Republic.

    PubMed

    Muneta, Yoshihiro; Minagawa, Yu; Kusumoto, Masahiro; Shinkai, Hiroki; Uenishi, Hirohide; Splichal, Igor

    2012-05-01

    In the present study, we have developed an allele-specific primer-polymerase chain reaction (ASP-PCR) for genotyping a single nucleotide polymorphism (SNP) of swine Toll-like receptor 2 (TLR2) (C406G), which is related to the prevalence of pneumonia caused by Mycoplasma hyopneumoniae. We also compared the allele frequency among several pig breeds of Japan and the Czech Republic. Allele-specific primers were constructed by introducing a 1-base mismatch before the SNP site. The swine TLR2 C406G mutation was successfully determined by the ASP-PCR using genomic DNA samples from Japan previously genotyped by a sequencing method. Using the PCR conditions thus determined, genomic DNA samples from pig blood obtained from 110 pigs from 7 different breeds in the Czech Republic were genotyped by the ASP-PCR. The genotyping results from the ASP-PCR completely matched the results from the sequencing method. The allele frequency of the swine TLR2 C406G mutation was 27.5% in the Czech Republic and 3.6% in Japan. The C406G mutation was only found in the Landrace breed in Japan, and was almost exclusively found in the Landrace breed in the Czech Republic as well. These results indicated the usefulness of ASP-PCR for detecting a specific SNP for swine TLR2.

  11. A SNP uncoupling Mina expression from the TGFβ signaling pathway.

    PubMed

    Lian, Shang L; Mihi, Belgacem; Koyanagi, Madoka; Nakayama, Toshinori; Bix, Mark

    2018-03-01

    Mina is a JmjC family 2-oxoglutarate oxygenase with pleiotropic roles in cell proliferation, cancer, T cell differentiation, pulmonary inflammation, and intestinal parasite expulsion. Although Mina expression varies according to cell-type, developmental stage and activation state, its transcriptional regulation is poorly understood. Across inbred mouse strains, Mina protein level exhibits a bimodal distribution, correlating with inheritance of a biallelic haplotype block comprising 21 promoter/intron 1-region SNPs. We previously showed that heritable differences in Mina protein level are transcriptionally regulated. Accordingly, we decided to test the hypothesis that at least one of the promoter/intron 1-region SNPs perturbs a Mina cis-regulatory element (CRE). Here, we have comprehensively scanned for CREs across a Mina locus-spanning 26-kilobase genomic interval. We discovered 8 potential CREs and functionally validated 4 of these, the strongest of which (E2), residing in intron 1, contained a SNP whose BALB/c-but not C57Bl/6 allele-abolished both Smad3 binding and transforming growth factor beta (TGFβ) responsiveness. Our results demonstrate the TGFβ signaling pathway plays a critical role in regulating Mina expression and SNP rs4191790 controls heritable variation in Mina expression level, raising important questions regarding the evolution of an allele that uncouples Mina expression from the TGFβ signaling pathway. © 2017 The Authors. Immunity, Inflammation and Disease Published by John Wiley & Sons Ltd.

  12. A SNP uncoupling Mina expression from the TGFβ signaling pathway

    PubMed Central

    Lian, Shang L.; Mihi, Belgacem; Koyanagi, Madoka; Nakayama, Toshinori

    2017-01-01

    Abstract Introduction Mina is a JmjC family 2‐oxoglutarate oxygenase with pleiotropic roles in cell proliferation, cancer, T cell differentiation, pulmonary inflammation, and intestinal parasite expulsion. Although Mina expression varies according to cell‐type, developmental stage and activation state, its transcriptional regulation is poorly understood. Across inbred mouse strains, Mina protein level exhibits a bimodal distribution, correlating with inheritance of a biallelic haplotype block comprising 21 promoter/intron 1‐region SNPs. We previously showed that heritable differences in Mina protein level are transcriptionally regulated. Methods Accordingly, we decided to test the hypothesis that at least one of the promoter/intron 1‐region SNPs perturbs a Mina cis‐regulatory element (CRE). Here, we have comprehensively scanned for CREs across a Mina locus‐spanning 26‐kilobase genomic interval. Results We discovered 8 potential CREs and functionally validated 4 of these, the strongest of which (E2), residing in intron 1, contained a SNP whose BALB/c—but not C57Bl/6 allele—abolished both Smad3 binding and transforming growth factor beta (TGFβ) responsiveness. Conclusions Our results demonstrate the TGFβ signaling pathway plays a critical role in regulating Mina expression and SNP rs4191790 controls heritable variation in Mina expression level, raising important questions regarding the evolution of an allele that uncouples Mina expression from the TGFβ signaling pathway. PMID:28967702

  13. Design and characterization of a 52K SNP chip for goats.

    PubMed

    Tosser-Klopp, Gwenola; Bardou, Philippe; Bouchez, Olivier; Cabau, Cédric; Crooijmans, Richard; Dong, Yang; Donnadieu-Tonon, Cécile; Eggen, André; Heuven, Henri C M; Jamli, Saadiah; Jiken, Abdullah Johari; Klopp, Christophe; Lawley, Cynthia T; McEwan, John; Martin, Patrice; Moreno, Carole R; Mulsant, Philippe; Nabihoudine, Ibouniyamine; Pailhoux, Eric; Palhière, Isabelle; Rupp, Rachel; Sarry, Julien; Sayre, Brian L; Tircazes, Aurélie; Jun Wang; Wang, Wen; Zhang, Wenguang

    2014-01-01

    The success of Genome Wide Association Studies in the discovery of sequence variation linked to complex traits in humans has increased interest in high throughput SNP genotyping assays in livestock species. Primary goals are QTL detection and genomic selection. The purpose here was design of a 50-60,000 SNP chip for goats. The success of a moderate density SNP assay depends on reliable bioinformatic SNP detection procedures, the technological success rate of the SNP design, even spacing of SNPs on the genome and selection of Minor Allele Frequencies (MAF) suitable to use in diverse breeds. Through the federation of three SNP discovery projects consolidated as the International Goat Genome Consortium, we have identified approximately twelve million high quality SNP variants in the goat genome stored in a database together with their biological and technical characteristics. These SNPs were identified within and between six breeds (meat, milk and mixed): Alpine, Boer, Creole, Katjang, Saanen and Savanna, comprising a total of 97 animals. Whole genome and Reduced Representation Library sequences were aligned on >10 kb scaffolds of the de novo goat genome assembly. The 60,000 selected SNPs, evenly spaced on the goat genome, were submitted for oligo manufacturing (Illumina, Inc) and published in dbSNP along with flanking sequences and map position on goat assemblies (i.e. scaffolds and pseudo-chromosomes), sheep genome V2 and cattle UMD3.1 assembly. Ten breeds were then used to validate the SNP content and 52,295 loci could be successfully genotyped and used to generate a final cluster file. The combined strategy of using mainly whole genome Next Generation Sequencing and mapping on a contig genome assembly, complemented with Illumina design tools proved to be efficient in producing this GoatSNP50 chip. Advances in use of molecular markers are expected to accelerate goat genomic studies in coming years.

  14. Design and Characterization of a 52K SNP Chip for Goats

    PubMed Central

    Tosser-Klopp, Gwenola; Bardou, Philippe; Bouchez, Olivier; Cabau, Cédric; Crooijmans, Richard; Dong, Yang; Donnadieu-Tonon, Cécile; Eggen, André; Heuven, Henri C. M.; Jamli, Saadiah; Jiken, Abdullah Johari; Klopp, Christophe; Lawley, Cynthia T.; McEwan, John; Martin, Patrice; Moreno, Carole R.; Mulsant, Philippe; Nabihoudine, Ibouniyamine; Pailhoux, Eric; Palhière, Isabelle; Rupp, Rachel; Sarry, Julien; Sayre, Brian L.; Tircazes, Aurélie; Jun Wang; Wang, Wen; Zhang, Wenguang

    2014-01-01

    The success of Genome Wide Association Studies in the discovery of sequence variation linked to complex traits in humans has increased interest in high throughput SNP genotyping assays in livestock species. Primary goals are QTL detection and genomic selection. The purpose here was design of a 50–60,000 SNP chip for goats. The success of a moderate density SNP assay depends on reliable bioinformatic SNP detection procedures, the technological success rate of the SNP design, even spacing of SNPs on the genome and selection of Minor Allele Frequencies (MAF) suitable to use in diverse breeds. Through the federation of three SNP discovery projects consolidated as the International Goat Genome Consortium, we have identified approximately twelve million high quality SNP variants in the goat genome stored in a database together with their biological and technical characteristics. These SNPs were identified within and between six breeds (meat, milk and mixed): Alpine, Boer, Creole, Katjang, Saanen and Savanna, comprising a total of 97 animals. Whole genome and Reduced Representation Library sequences were aligned on >10 kb scaffolds of the de novo goat genome assembly. The 60,000 selected SNPs, evenly spaced on the goat genome, were submitted for oligo manufacturing (Illumina, Inc) and published in dbSNP along with flanking sequences and map position on goat assemblies (i.e. scaffolds and pseudo-chromosomes), sheep genome V2 and cattle UMD3.1 assembly. Ten breeds were then used to validate the SNP content and 52,295 loci could be successfully genotyped and used to generate a final cluster file. The combined strategy of using mainly whole genome Next Generation Sequencing and mapping on a contig genome assembly, complemented with Illumina design tools proved to be efficient in producing this GoatSNP50 chip. Advances in use of molecular markers are expected to accelerate goat genomic studies in coming years. PMID:24465974

  15. [Construction of haplotype and haplotype block based on tag single nucleotide polymorphisms and their applications in association studies].

    PubMed

    Gu, Ming-liang; Chu, Jia-you

    2007-12-01

    The human genome has a haplotype and haplotype-block structure that provides valuable information on human evolutionary history and may lead to more efficient strategies for identifying genetic variants that increase susceptibility to complex diseases. The genome can be divided into discrete blocks of limited haplotype diversity. In each block, a small fraction of "tag SNPs" can be used to distinguish a large fraction of the haplotypes. These tag SNPs are potentially useful for constructing haplotypes and haplotype blocks and for association studies of complex diseases. There are two general classes of methods for constructing haplotypes and haplotype blocks, based respectively on genotypes from large pedigrees and on statistical algorithms. The authors evaluate several construction methods to assess the power of different association tests under a variety of disease models and block-partitioning criteria. The advantages, limitations and applications of each method, and its use in association studies, are discussed. With the completion of the HapMap and the development of statistical algorithms for haplotype reconstruction, approaches to haplotype construction that combine mathematics, physics and computer science will have a profound impact on population genetics, on the mapping and cloning of susceptibility genes for complex diseases, and on related areas of the life sciences.
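
    The notion that a few tag SNPs can distinguish the haplotypes of a block can be made concrete with a small Python sketch. It searches exhaustively for the smallest set of sites that tells all distinct haplotypes apart, which is only practical for short blocks; the function name and the example haplotypes are hypothetical, and the review itself does not prescribe this procedure.

```python
from itertools import combinations
from typing import Sequence, Tuple


def minimal_tag_snps(haplotypes: Sequence[str]) -> Tuple[int, ...]:
    """Return the smallest set of site indices that distinguishes all distinct haplotypes.

    Exhaustive search over site subsets of increasing size.
    """
    if not haplotypes:
        raise ValueError("no haplotypes supplied")
    n_sites = len(haplotypes[0])
    if any(len(h) != n_sites for h in haplotypes):
        raise ValueError("haplotypes must have equal length")
    distinct = set(haplotypes)
    for k in range(1, n_sites + 1):
        for sites in combinations(range(n_sites), k):
            projected = {tuple(h[i] for i in sites) for h in distinct}
            if len(projected) == len(distinct):
                return sites
    return tuple(range(n_sites))


if __name__ == "__main__":
    # Hypothetical block of four haplotypes over four SNP sites.
    block = ["ACGT", "ACGA", "TCGT", "TCCA"]
    print(minimal_tag_snps(block))
```

    In this example no single site separates all four haplotypes, but sites 0 and 3 together do, so two of the four SNPs are enough to tag the block.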

  16. The JAK2 GGCC (46/1) Haplotype in Myeloproliferative Neoplasms: Causal or Random?

    PubMed Central

    Anelli, Luisa; Zagaria, Antonella; Specchia, Giorgina

    2018-01-01

    The germline JAK2 haplotype known as “GGCC or 46/1 haplotype” (haplotype GGCC_46/1) consists of a combination of single nucleotide polymorphisms (SNPs) mapping in a region of about 250 kb, extending from the JAK2 intron 10 to the Insulin-like 4 (INLS4) gene. Four main SNPs (rs3780367, rs10974944, rs12343867, and rs1159782) generating a “GGCC” combination are more frequently indicated to represent the JAK2 haplotype. These SNPs are inherited together and are frequently associated with the onset of myeloproliferative neoplasms (MPN) positive for both JAK2 V617 and exon 12 mutations. The association between the JAK2 haplotype GGCC_46/1 and mutations in other genes, such as thrombopoietin receptor (MPL) and calreticulin (CALR), or the association with triple negative MPN, is still controversial. This review provides an overview of the frequency and the role of the JAK2 haplotype GGCC_46/1 in the pathogenesis of different myeloid neoplasms and describes the hypothetical mechanisms at the basis of the association with JAK2 gene mutations. Moreover, possible clinical implications are discussed, as different papers reported contrasting data about the correlation between the JAK2 haplotype GGCC_46/1 and blood cell count, survival, or disease progression. PMID:29641446

  17. Genetic variation in CXCR1 haplotypes linked to severity of Streptococcus uberis infection in an experimental challenge model.

    PubMed

    Siebert, Lydia; Headrick, Susan; Lewis, Mark; Gillespie, Barbara; Young, Charlie; Wojakiewicz, Leszek; Kerro-Dego, Oudessa; Prado, Maria E; Almeida, Raul; Oliver, Stephen P; Pighetti, Gina M

    2017-08-01

    Mastitis, an inflammation of the mammary gland, costs the dairy industry billions of dollars in lost revenues annually. The prevalence and costs associated with mastitis have made genetic selection methods a target for research. Previous research has identified amino acid changes at positions 122, 207, 245, 327, and 332 in the IL8 receptor, CXCR1, that result in three dominant amino acid haplotypes: VWHKH, VWHRR, and AWQRR. We hypothesize different haplotype combinations influence a cow's resistance, strength, and duration of response to mastitis. To test this, Holstein dairy cows (n=40) were intramammarily challenged with Streptococcus uberis within 3 d post-calving. All cows developed mastitis based on isolation of S. uberis from the challenged quarter at least twice. All cows with the VWHRR x VWHRR (n=5) and AWQRR x VWHRR (n=6) haplotype combinations required antibiotic therapy due to clinical signs of mastitis and tended (P=0.08) to be different from cows with a VWHRR x VWHKH (n=6) haplotype combination where only 33.3% required antibiotic therapy. Cows with a VWHRR homozygous haplotype combination displayed significantly higher responses to challenge indicated by elevated S. uberis counts (4340 ± 5,521.9 CFU/mL; P=0.01), mammary scores (1.1 ± 0.18; P=0.03), milk scores (0.9 ± 0.17; P=0.002), and SCC (1,010,832 ± 489,993 cells/mL; P=0.03). In contrast, AWQRR x VWHRR cows had significantly lower S. uberis counts (15.3 ± 16.46 CFU/mL; P=0.01), mammary scores (0.3 ± 0.16; P=0.03), milk scores (0 ± 0.15; P=0.002), and SCC (239,261 ± 92,264.3 cells/mL; P=0.03). Cows of the VWHKH x VWHRR haplotype combination displayed responses to challenge statistically comparable to other haplotype combinations, but appeared to have an earlier peak in SCC in comparison to all other haplotype combinations. Haplotype combination did not influence milk yield (P=0.6). Our results suggest using combinations of the SNPs within the CXCR1 gene gives a better indication of a cow's ability to combat S

  18. Hereditary tyrosinemia type I: strong association with haplotype 6 in French Canadians permits simple carrier detection and prenatal diagnosis.

    PubMed Central

    Demers, S. I.; Phaneuf, D.; Tanguay, R. M.

    1994-01-01

    Hereditary tyrosinemia type 1 (HT1), a severe inborn error of tyrosine catabolism, is caused by deficiency of the terminal enzyme, fumarylacetoacetate hydrolase (FAH). The highest reported frequency of HT1 is in the French Canadian population, especially in the Saguenay-Lac-St-Jean region. Using human FAH cDNA probes, we have identified 10 haplotypes with TaqI, KpnI, RsaI, BglII, and MspI RFLPs in 118 normal chromosomes from the French Canadian population. Interestingly, in 29 HT1 children, a prevalent haplotype, haplotype 6, was found to be strongly associated with the disease, at a frequency of 90% of alleles, as compared with approximately 18% in 35 control individuals. This increased to 96% in the 24 patients originating from Saguenay-Lac-St-Jean. These results suggest that one or only a few prevailing mutations are responsible for most of the HT1 cases in Saguenay-Lac-St-Jean. Since most patients were found to be homozygous for a specific haplotype in this population, FAH RFLPs have permitted simple carrier detection in nine different informative HT1 families, with a confidence level of 99.9%. Heterozygosity rate values obtained from 52 carriers indicated that approximately 88% of families at risk from Saguenay-Lac-St-Jean are fully or partially informative. Prenatal diagnosis was also achieved in an American family. Analysis of 24 HT1 patients from nine countries gave a frequency of approximately 52% for haplotype 6, suggesting a relatively high association, worldwide, of HT1 with this haplotype. PMID:7913582

  19. TAS2R38 and CA6 genetic polymorphisms, frequency of bitter food intake, and blood biomarkers among elderly woman.

    PubMed

    Mikołajczyk-Stecyna, Joanna; Malinowska, Anna M; Chmurzynska, Agata

    2017-09-01

    Taste sensitivity is one of the most important biological determinants of food choice. Three SNPs of the TAS2R38 gene (rs713598, rs1726866, and rs10246939) give rise to two common haplotypes: PAV and AVI. These haplotypes, as well as an SNP within the CA6 gene (rs2274333) that encodes carbonic anhydrase VI (CA6), correlate with bitterness perception. The extent of consumption of bitter food may influence some health outcomes. The aim of this study is thus to investigate the impact of the TAS2R38 and CA6 genetic polymorphisms on the choice of bitter food, BMI, blood lipoprotein, and glucose concentrations as well as systemic inflammation in elderly women. The associations between the TAS2R38 diplotype, CA6 genotype, and the intake of bitter-tasting foods were studied in a group of 118 Polish women over 60 years of age. The intake of Brassica vegetables, grapefruit, and coffee was assessed using a food frequency questionnaire. Biochemical parameters were measured using the spectrophotometric method. Genotyping was performed using the high resolution melting method. We found a correlation between lipid profile, glucose and CRP levels, and frequency of bitter food intake. The AVI/AVI subjects drank coffee more frequently than did the PAV/PAV homozygotes, as did the A carriers of CA6 in comparison with the GG homozygotes. We also observed that simultaneous carriers of the PAV haplotype and A allele of TAS2R38 and CA6, respectively, chose white cabbage more frequently and had lower plasma levels of CRP and glucose than did AVI/AVI and GG homozygotes. In elderly women, the TAS2R38 and CA6 polymorphisms may affect the frequency of consumption of coffee and white cabbage, but not of other bitter-tasting foods. Copyright © 2017 Elsevier Ltd. All rights reserved.

  20. Interrelationships between Amerindian tribes of lower Amazonia as manifest by HLA haplotype disequilibria.

    PubMed

    Black, F L

    1984-11-01

    HLA B-C haplotypes exhibit common disequilibria in populations drawn from four continents, indicating that they are subject to broadly active selective forces. However, the A-B and A-C associations we have examined show no consistent disequilibrium pattern, leaving open the possibility that these disequilibria are due to descent from common progenitors. By examining HLA haplotype distributions, I have explored the implications that would follow from the hypothesis that biological selection played no role in determining A-C disequilibria in 10 diverse tribes of the lower Amazon Basin. Certain haplotypes are in strong positive disequilibria across a broad geographic area, suggesting that members of diverse tribes descend from common ancestors. On the basis of the extent of diffusion of the components of these haplotypes, one can estimate that the progenitors lived less than 6,000 years ago. One widely encountered lineage entered the area within the last 1,200 years. When haplotype frequencies are used in genetic distance measurements, they give a pattern of relationships very similar to that obtained by conventional chord measurements based on several genetic markers; but more than that, when individual haplotype disequilibria in the several tribes are compared, multiple origins of a single tribe are discernible and relationships are revealed that correlate more closely to geographic and linguistic patterns than do the genetic distance measurements.

  1. Association of HLA haplotype with alopecia areata in Chinese Hans.

    PubMed

    Xiao, F-L; Ye, D-Q; Yang, S; Zhou, F-S; Zhou, S-M; Zhu, Y-G; Liang, Y-H; Ren, Y-Q; Zhang, X-J

    2006-11-01

    Some studies have shown discrepancies in human leucocyte antigen (HLA) associations with alopecia areata (AA) between different ethnic populations. This study investigated whether HLA-I, -DQA1 and -DQB1 alleles and the HLA haplotype are associated with AA, and examined the correlation between the HLA haplotype profile, age of onset and severity of AA in Chinese Hans. The polymerase chain reaction-sequence specific primer (PCR-SSP) method was used to analyse the frequencies of HLA class I, -DQA1 and -DQB1 alleles in 192 patients with AA and 252 controls in Chinese Hans. The linkage disequilibrium was calculated using the 2 x 2 table. The 24 two-locus haplotypes [including A*02-B*18, A*02-B*27, A*02-B*52, A*02-Cw*0704, A*02-DQA1*0104, A*02-DQB1*0604, A*02-DQB1*0606, B*18-Cw*0704, B*18-DQA1*0104, B*18-DQA1*0302, B*18-DQB1*0606, B*27-Cw*0704, B*27-DQA1*0104, B*27-DQA1*0302, B*52-Cw*0704, B*52-DQA1*0104, B*52-DQA1*0302, B*52-DQB1*0606, Cw*0704-DQA1*0104, Cw*0704-DQA1*0302, Cw*0704-DQB1*0606, DQA1*0104-DQB1*0604, DQA1*0104-DQB1*0606, DQA1*0302-DQB1*0606 (P<0.05)] were associated with AA, while eight extended haplotypes (A*02-B*18-DQA1*0104, A*02-B*27-DQA1*0104, A*02-B*52-DQA1*0104, A*02-B*52-DQA1*0302, A*02-B*52-DQB1*0606, B*52-Cw*0704-DQA1*0104, B*52-Cw*0704-DQA1*0302, A*02-B*52-DQA1*0302-DQB1*0606) were found to be related to AA in Chinese Hans. Through stratified analysis, we found that the extended haplotype B*52-Cw*0704-DQA1*0302 was related to early onset of AA, and no haplotype was associated only with severe AA. This is the first detailed report to elucidate HLA haplotypes associated with AA and to demonstrate the significant HLA haplotypes in Chinese Hans with AA. The haplotype B*52-Cw*0704-DQA1*0302 was identified to be related to early onset of AA. Our results provide some information for future research on predisposing genes in HLA regions in Chinese Hans.

  2. Variants of transcription factor 7-like 2 (TCF7L2) gene and incident glucose intolerance in Japanese-Brazilians.

    PubMed

    Franco, L F; Crispim, F; Pereira, A C; Moisés, R S

    2011-03-01

    Common variants of the transcription factor 7-like 2 (TCF7L2) gene have been found to be associated with type 2 diabetes in different ethnic groups. The Japanese-Brazilian population has one of the highest prevalence rates of diabetes. Therefore, the aim of the present study was to assess whether two single-nucleotide polymorphisms (SNPs) of TCF7L2, rs7903146 and rs12255372, could predict the development of glucose intolerance in Japanese-Brazilians. In a population-based 7-year prospective study, we genotyped 222 individuals (72 males and 150 females, aged 56.2 ± 10.5 years) with normal glucose tolerance at baseline. In the study population, we found that the minor allele frequency was 0.05 for SNP rs7903146 and 0.03 for SNP rs12255372. No significant allele or genotype association with glucose intolerance incidence was found for either SNP. Haplotypes were constructed with these two SNPs and three haplotypes were defined: CG (frequency: 0.94), TT (frequency = 0.027) and TG (frequency = 0.026). None of the haplotypes provided evidence for association with the incidence of glucose intolerance. Despite no associations between incidence of glucose intolerance and SNPs of the TCF7L2 gene in Japanese-Brazilians, we found that carriers of the CT genotype for rs7903146 had significantly lower insulin levels 2 h after a 75-g glucose load than carriers of the CC genotype. In conclusion, in Japanese-Brazilians, a population with a high prevalence of type 2 diabetes, common TCF7L2 variants did not make major contributions to the incidence of glucose tolerance abnormalities.
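
    As a consistency check on the figures in the abstract above, the reported haplotype frequencies can be turned back into per-SNP allele frequencies and a linkage disequilibrium estimate. The sketch below is not the authors' analysis. It assumes haplotypes are written in the order rs7903146 then rs12255372, that T is the minor allele at both SNPs, and that the three reported haplotypes (which sum to 0.993) can be renormalised to 1 because the fourth haplotype, CT, is not reported. The function name `two_snp_summary` is hypothetical.

```python
def two_snp_summary(hap_freqs, allele1="T", allele2="T"):
    """Allele frequencies and LD (D, D') for two biallelic SNPs.

    hap_freqs maps two-letter haplotypes (allele at SNP 1, allele at SNP 2)
    to frequencies; the values are renormalised to sum to 1.
    """
    total = sum(hap_freqs.values())
    freqs = {hap: f / total for hap, f in hap_freqs.items()}

    # Marginal frequency of the chosen allele at each SNP.
    p1 = sum(f for hap, f in freqs.items() if hap[0] == allele1)
    p2 = sum(f for hap, f in freqs.items() if hap[1] == allele2)

    # D = observed haplotype frequency minus the product of the marginals.
    d = freqs.get(allele1 + allele2, 0.0) - p1 * p2

    # D' rescales D by its largest possible magnitude given p1 and p2.
    if d >= 0:
        d_max = min(p1 * (1 - p2), (1 - p1) * p2)
    else:
        d_max = min(p1 * p2, (1 - p1) * (1 - p2))
    d_prime = d / d_max if d_max > 0 else 0.0
    return p1, p2, d, d_prime


# Haplotype frequencies as reported in the abstract (CT not reported).
reported = {"CG": 0.94, "TT": 0.027, "TG": 0.026}
p1, p2, d, d_prime = two_snp_summary(reported)
print(f"T at rs7903146: {p1:.3f}, T at rs12255372: {p2:.3f}, D' = {d_prime:.2f}")
```

    Under these assumptions the marginal T frequencies round to the 0.05 and 0.03 minor allele frequencies given in the abstract, and D' comes out at 1 because only three of the four possible haplotypes are present.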

  3. Classical sickle beta-globin haplotypes exhibit a high degree of long-range haplotype similarity in African and Afro-Caribbean populations.

    PubMed

    Hanchard, Neil; Elzein, Abier; Trafford, Clare; Rockett, Kirk; Pinder, Margaret; Jallow, Muminatou; Harding, Rosalind; Kwiatkowski, Dominic; McKenzie, Colin

    2007-08-10

    The sickle (βs) mutation in the beta-globin gene (HBB) occurs on five "classical" βs haplotype backgrounds in ethnic groups of African ancestry. Strong selection in favour of the βs allele - a consequence of protection from severe malarial infection afforded by heterozygotes - has been associated with a high degree of extended haplotype similarity. The relationship between classical βs haplotypes and long-range haplotype similarity may have both anthropological and clinical implications, but to date has not been explored. Here we evaluate the haplotype similarity of classical βs haplotypes over 400 kb in population samples from Jamaica, The Gambia, and among the Yoruba of Nigeria (Hapmap YRI). The most common βs sub-haplotype among Jamaicans and the Yoruba was the Benin haplotype, while in The Gambia the Senegal haplotype was observed most commonly. Both subtypes exhibited a high degree of long-range haplotype similarity extending across approximately 400 kb in all three populations. This long-range similarity was significantly greater than that seen for other haplotypes sampled in these populations (P < 0.001), and was independent of marker choice and marker density. Among the Yoruba, Benin haplotypes were highly conserved, with very strong linkage disequilibrium (LD) extending a megabase across the βs mutation. Two different classical βs haplotypes, sampled from different populations, exhibit comparable and extensive long-range haplotype similarity and strong LD. This LD extends across the adjacent recombination hotspot, and is discernable at distances in excess of 400 kb. Although the multi-centric geographic distribution of βs haplotypes indicates strong subdivision among early Holocene sub-Saharan populations, we find no evidence that selective pressures imposed by falciparum malaria varied in intensity or timing between these subpopulations. Our observations also suggest that cis-acting loci, which may influence outcomes in sickle

  4. PCR/oligonucleotide probe typing of HLA class II alleles in a Filipino population reveals an unusual distribution of HLA haplotypes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bugawan, T.L.; Chang, J.D.; Erlich, H.A.

    1994-02-01

    The authors have analyzed the distribution of HLA class II alleles and haplotypes in a Filipino population by PCR amplification of the DRB1, DQB1, and DPB1 second-exon sequences from buccal swabs obtained from 124 family members and 53 unrelated individuals. The amplified DNA was typed by using nonradioactive sequence-specific oligonucleotide probes. Twenty-two different DRB1 alleles, including the novel Filipino *1105, and 46 different DRB1/DQB1 haplotypes, including the unusual DRB1*0405-DQB1*0503, were identified. An unusually high frequency (f = .383) of DPB1*0101, a rare allele in other Asian populations, was also observed. In addition, an unusual distribution of DRB1 alleles and haplotypes was seen in this population, with DR2 (f = .415) and DRB1*1502-DQB1*0502 (f = .233) present at high frequencies. This distribution of DRB1 alleles differs from the typical HLA population distribution, in which the allele frequencies are more evenly balanced. The distribution of HLA class II alleles and haplotypes in this Filipino population is different from that of other Asian and Pacific groups: of those populations studied to date, the Indonesian population is the most similar. DRB1*1502-DQB1*0502 was in strong linkage disequilibrium (D′ = .41) with DPB1*0101 (f = .126, for the extended haplotype), which is consistent with selection for this DR, DQ, DP haplotype being responsible for the high frequency of these three class II alleles in this population. 30 refs., 2 figs., 6 tabs.

  5. OAS single-nucleotide polymorphisms and haplotypes are associated with variations in immune responses to rubella vaccine

    PubMed Central

    Haralambieva, Iana H.; Dhiman, Neelam; Ovsyannikova, Inna G.; Vierkant, Robert A.; Pankratz, V. Shane; Jacobson, Robert M.; Poland, Gregory A.

    2010-01-01

    Interferon (IFN)-induced antiviral genes are crucial players in innate antiviral defense and potential determinants of immune response heterogeneity. We selected 114 candidate SNPs from 12 antiviral genes using an LD tagSNP selection approach and genotyped them in a cohort of 738 schoolchildren immunized with two doses of rubella vaccine. Associations between SNPs/haplotypes and rubella virus-specific immune measures were assessed using linear regression methodologies. We identified 23 significant associations (p<0.05) between polymorphisms within the 2′-5′-oligoadenylate synthetase (OAS) gene cluster, and rubella virus-specific IL-2, IL-10, IL-6 secretion and antibody levels. The minor allele variants of three OAS1 SNPs (rs3741981/Ser162Gly, rs1051042/Thr361Arg, rs2660), located in a linkage disequilibrium block of functional importance, were significantly associated with an increase in rubella virus-specific IL-2/Th1 response (p≤0.024). Seven OAS1 and OAS3 promoter/regulatory SNPs were similarly associated with IL-2 secretion. Importantly, two SNPs (rs3741981 and rs10774670), independently cross-regulated rubella virus-specific IL-10 secretion levels (p≤0.031). Furthermore, both global tests and individual haplotype analyses revealed significant associations between OAS1 haplotypes and rubella virus-specific cytokine secretion. Our results suggest that innate immunity and OAS genetic variations are likely involved in modulating the magnitude and quality of the adaptive immune responses to live attenuated rubella vaccine. PMID:20079393

  6. Diet and Colorectal Cancer: Analysis of a Candidate Pathway Using SNPS, Haplotypes, and Multi-Gene Assessment

    PubMed Central

    Slattery, Martha L.; Lundgreen, Abbie; Herrick, Jennifer S.; Caan, Bette J.; Potter, John D.; Wolff, Roger K.

    2012-01-01

    There is considerable biologic plausibility to the hypothesis that genetic variability in pathways involved in insulin signaling and energy homeostasis may modulate dietary risk associated with colorectal cancer. We utilized data from 2 population-based case-control studies of colon (n = 1,574 cases, 1,970 controls) and rectal (n = 791 cases, 999 controls) cancer to evaluate genetic variation in candidate SNPs identified from 9 genes in a candidate pathway: PDK1, RP6KA1, RPS6KA2, RPS6KB1, RPS6KB2, PTEN, FRAP1 (mTOR), TSC1, TSC2, Akt1, PIK3CA, and PRKAG2 with dietary intake of total energy, carbohydrates, fat, and fiber. We employed SNP, haplotype, and multiple-gene analysis to evaluate associations. PDK1 interacted with dietary fat for both colon and rectal cancer and with dietary carbohydrates for colon cancer. Statistically significant interaction with dietary carbohydrates and rectal cancer was detected by haplotype analysis of PDK1. Evaluation of dietary interactions with multiple genes in this candidate pathway showed several interactions with pairs of genes: Akt1 and PDK1, PDK1 and PTEN, PDK1 and TSC1, and PRKAG2 and PTEN. Analyses show that genetic variation influences risk of colorectal cancer associated with diet and illustrate the importance of evaluating dietary interactions beyond the level of single SNPs or haplotypes when a biologically relevant candidate pathway is examined. PMID:21999454

  7. RAD tag sequencing as a source of SNP markers in Cynara cardunculus L

    PubMed Central

    2012-01-01

    Background The globe artichoke (Cynara cardunculus L. var. scolymus) genome is relatively poorly explored, especially compared to those of the other major Asteraceae crops sunflower and lettuce. No SNP markers are in the public domain. We have combined the recently developed restriction-site associated DNA (RAD) approach with the Illumina DNA sequencing platform to effect the rapid and mass discovery of SNP markers for C. cardunculus. Results RAD tags were sequenced from the genomic DNA of three C. cardunculus mapping population parents, generating 9.7 million reads, corresponding to ~1 Gbp of sequence. An assembly based on paired ends produced ~6.0 Mbp of genomic sequence, separated into ~19,000 contigs (mean length 312 bp), of which ~21% were fragments of putative coding sequence. The shared sequences allowed for the discovery of ~34,000 SNPs and nearly 800 indels, equivalent to a SNP frequency of 5.6 per 1,000 nt, and an indel frequency of 0.2 per 1,000 nt. A sample of heterozygous SNP loci was mapped by CAPS assays and this exercise provided validation of our mining criteria. The repetitive fraction of the genome had a high representation of retrotransposon sequence, followed by simple repeats, AT-low complexity regions and mobile DNA elements. The genomic k-mer distribution and CpG rate of C. cardunculus, compared with data derived from three whole genome-sequenced dicot species, provided further evidence of the random representation of the C. cardunculus genome generated by RAD sampling. Conclusion The RAD tag sequencing approach is a cost-effective and rapid method to develop SNP markers in a highly heterozygous species. Our approach generated a large and robust SNP dataset through the adoption of optimized filtering criteria. PMID:22214349

  8. SNPMeta: SNP annotation and SNP metadata collection without a reference genome

    USDA-ARS's Scientific Manuscript database

    The increase in availability of resequencing data is greatly accelerating SNP discovery and has facilitated the development of SNP genotyping assays. This, in turn, is increasing interest in annotation of individual SNPs. Currently, these data are only available through curation, or comparison to a ...

  9. Different effects of apolipoprotein A5 SNPs and haplotypes on triglyceride concentration in three ethnic origins.

    PubMed

    Ken-Dror, Gie; Goldbourt, Uri; Dankner, Rachel

    2010-05-01

    Several polymorphisms in the ApoA5 gene emerged as important candidate genes in triglyceride metabolism. The aim of this study was to determine the associations between ApoA5 polymorphisms, plasma triglyceride concentrations and the presence of cardiovascular disease (CVD) in three ethnic origins. Genotypes for 15 single nucleotide polymorphisms (SNPs) were determined in 659 older adults (mean age 71+/-7 years) who immigrated to Israel or whose ancestors originated from East Europe (Ashkenazi), North Africa, Asia (Sephardic) or Yemen (Yemenite). The minor alleles of the four common SNPs (rs662799, rs651821, rs2072560 and rs2266788) are associated with an increase of 27-38% in triglyceride concentration among Ashkenazi and Yemenite Jews compared with the major alleles, but not among those of Sephardic origin. Conversely, among the Sephardic group, the presence of the minor allele in SNP rs3135506 compared with the major allele was associated with an increase of 34% in triglyceride concentration. The four SNPs were in significant linkage disequilibrium (D'=0.96-0.99), resulting in three haplotypes H1, H2 and H3, representing 98-99% of the population. Haplotype H2 was significantly associated with triglyceride concentration among Ashkenazi and Yemenite but not among Sephardic Jews. Conversely, haplotype H3 was associated with triglyceride concentration in Sephardic but not in Ashkenazi and Yemenite Jews. Ashkenazi carriers of H2 haplotype had a CVD odds ratio of 2.19 (95% CI: 1.05-4.58) compared with H1 (the most frequent), after adjustment for all other risk factors. These results suggest that different SNPs in ApoA5 polymorphisms may be associated with triglyceride concentration and CVD in each of these ethnic origins.

  10. The effects of old and recent migration waves in the distribution of HBB*S globin gene haplotypes

    PubMed Central

    Lindenau, Juliana D.; Wagner, Sandrine C.; de Castro, Simone M.; Hutz, Mara H.

    2016-01-01

    Sickle cell hemoglobin is the result of a mutation at the sixth amino acid position of the beta (β) globin chain. The HBB*S gene is in linkage disequilibrium with five main haplotypes in the β-globin-like gene cluster named according to their ethnic and geographic origins: Bantu (CAR), Benin (BEN), Senegal (SEN), Cameroon (CAM) and Arabian-Indian (ARAB). These haplotypes demonstrated that the sickle cell mutation arose independently at least five times in human history. The distribution of βS haplotypes among Brazilian populations showed a predominance of the CAR haplotype. American populations were clustered in two groups defined by CAR or BEN haplotype frequencies. This scenario is compatible with historical records about the slave trade in the Americas. When all world populations where the sickle cell gene occurs were analyzed, three clusters were disclosed based on CAR, BEN or ARAB haplotype predominance. These patterns may change in the next decades due to recent migration waves. Since these haplotypes show different clinical characteristics, these recent migration events raise the necessity to develop optimized public health programs for sickle cell disease screening and management. PMID:27706371

  11. Application of site and haplotype-frequency based approaches for detecting selection signatures in cattle

    PubMed Central

    2011-01-01

    Background 'Selection signatures' delimit regions of the genome that are, or have been, functionally important and have therefore been under either natural or artificial selection. In this study, two different and complementary methods--integrated Haplotype Homozygosity Score (|iHS|) and population differentiation index (FST)--were applied to identify traces of decades of intensive artificial selection for traits of economic importance in modern cattle. Results We scanned the genome of a diverse set of dairy and beef breeds from Germany, Canada and Australia genotyped with a 50 K SNP panel. Across breeds, a total of 109 extreme |iHS| values exceeded the empirical threshold level of 5% with 19, 27, 9, 10 and 17 outliers in Holstein, Brown Swiss, Australian Angus, Hereford and Simmental, respectively. Annotating the regions harboring clustered |iHS| signals revealed a panel of interesting candidate genes like SPATA17, MGAT1, PGRMC2 and ACTC1, COL23A1, MATN2, respectively, in the context of reproduction and muscle formation. In a further step, a new Bayesian FST-based approach was applied with a set of geographically separated populations including Holstein, Brown Swiss, Simmental, North American Angus and Piedmontese for detecting differentiated loci. In total, 127 regions exceeding the 2.5 per cent threshold of the empirical posterior distribution were identified as extremely differentiated. In a substantial number (56 out of 127 cases) the extreme FST values were found to be positioned in poor gene content regions which deviated significantly (p < 0.05) from the expectation assuming a random distribution. However, significant FST values were found in regions of some relevant genes such as SMCP and FGF1. Conclusions Overall, 236 regions putatively subject to recent positive selection in the cattle genome were detected. Both |iHS| and FST suggested selection in the vicinity of the Sialic acid binding Ig-like lectin 5 gene on BTA18. This region was recently reported
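
    The population differentiation index mentioned in the abstract above can be illustrated for a single SNP. The sketch below uses the textbook heterozygosity-based definition, FST = (HT - HS) / HT, with equally weighted populations. It is not the Bayesian FST approach applied in the study, and the function name `wright_fst` and the example frequencies are hypothetical.

```python
def wright_fst(allele_freqs):
    """FST for one biallelic SNP from per-population allele frequencies.

    allele_freqs: frequency of the same allele in each population,
    with all populations given equal weight.
    """
    if not allele_freqs:
        raise ValueError("at least one population is required")
    n = len(allele_freqs)
    p_bar = sum(allele_freqs) / n
    # Expected heterozygosity in the pooled population.
    h_total = 2 * p_bar * (1 - p_bar)
    if h_total == 0:
        return 0.0  # the SNP is fixed everywhere, so there is no differentiation
    # Mean expected heterozygosity within populations.
    h_within = sum(2 * p * (1 - p) for p in allele_freqs) / n
    return (h_total - h_within) / h_total


# Hypothetical allele frequencies for one SNP in two breeds.
print(wright_fst([0.2, 0.8]))
```

    Identical frequencies give FST = 0 and alternately fixed alleles give FST = 1; a genome scan of the kind described would flag SNPs or windows whose values fall in the extreme tail of the genome-wide distribution.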

  12. Development of COS-SNP and HRM markers for high-throughput and reliable haplotype-based detection of Lr14a in durum wheat (Triticum durum Desf.).

    PubMed

    Terracciano, Irma; Maccaferri, Marco; Bassi, Filippo; Mantovani, Paola; Sanguineti, Maria C; Salvi, Silvio; Simková, Hana; Doležel, Jaroslav; Massi, Andrea; Ammar, Karim; Kolmer, James; Tuberosa, Roberto

    2013-04-01

    Leaf rust (Puccinia triticina Eriks. & Henn.) is a major disease affecting durum wheat production. The Lr14a-resistant gene present in the durum wheat cv. Creso and its derivative cv. Colosseo is one of the best characterized leaf-rust resistance sources deployed in durum wheat breeding. Lr14a has been mapped close to the simple sequence repeat markers gwm146, gwm344 and wmc10 in the distal portion of the chromosome arm 7BL, a gene-dense region. The objectives of this study were: (1) to enrich the Lr14a region with single nucleotide polymorphisms (SNPs) and high-resolution melting (HRM)-based markers developed from conserved ortholog set (COS) genes and from sequenced Diversity Array Technology (DArT(®)) markers; (2) to further investigate the gene content and colinearity of this region with the Brachypodium and rice genomes. Ten new COS-SNP and five HRM markers were mapped within an 8.0 cM interval spanning Lr14a. Two HRM markers pinpointed the locus in an interval of <1.0 cM and eight COS-SNPs were mapped 2.1-4.1 cM distal to Lr14a. Each marker was tested for its capacity to predict the state of Lr14a alleles (in particular, Lr14-Creso associated to resistance) in a panel of durum wheat elite germplasm including 164 accessions. Two of the most informative markers were converted into KASPar(®) markers. Single assay markers ubw14 and wPt-4038-HRM designed for agarose gel electrophoresis/KASPar(®) assays and high-resolution melting analysis, respectively, as well as the double-marker combinations ubw14/ubw18, ubw14/ubw35 and wPt-4038-HRM-ubw35 will be useful for germplasm haplotyping and for molecular-assisted breeding.

  13. A 48 SNP set for grapevine cultivar identification

    PubMed Central

    2011-01-01

    Background Rapid and consistent genotyping is an important requirement for cultivar identification in many crop species. Among them grapevine cultivars have been the subject of multiple studies given the large number of synonyms and homonyms generated during many centuries of vegetative multiplication and exchange. Simple sequence repeat (SSR) markers have been preferred until now because of their high level of polymorphism, their codominant nature and their high profile repeatability. However, the rapid application of partial or complete genome sequencing approaches is identifying thousands of single nucleotide polymorphisms (SNP) that can be very useful for such purposes. Although SNP markers are bi-allelic, and therefore not as polymorphic as microsatellites, the high number of loci that can be multiplexed and the possibilities of automation as well as their highly repeatable results under any analytical procedure make them the future markers of choice for any type of genetic identification. Results We analyzed over 300 SNP in the genome of grapevine using a re-sequencing strategy in a selection of 11 genotypes. Among the identified polymorphisms, we selected 48 SNP spread across all grapevine chromosomes with allele frequencies balanced enough as to provide sufficient information content for genetic identification in grapevine allowing for good genotyping success rate. Marker stability was tested in repeated analyses of a selected group of cultivars obtained worldwide to demonstrate their usefulness in genetic identification. Conclusions We have selected a set of 48 stable SNP markers with a high discrimination power and a uniform genome distribution (2-3 markers/chromosome), which is proposed as a standard set for grapevine (Vitis vinifera L.) genotyping. Any previous problems derived from microsatellite allele confusion between labs or the need to run reference cultivars to identify allele sizes disappear using this type of marker. Furthermore, because SNP

  14. Classical sickle beta-globin haplotypes exhibit a high degree of long-range haplotype similarity in African and Afro-Caribbean populations

    PubMed Central

    Hanchard, Neil; Elzein, Abier; Trafford, Clare; Rockett, Kirk; Pinder, Margaret; Jallow, Muminatou; Harding, Rosalind; Kwiatkowski, Dominic; McKenzie, Colin

    2007-01-01

    Background The sickle (βs) mutation in the beta-globin gene (HBB) occurs on five "classical" βs haplotype backgrounds in ethnic groups of African ancestry. Strong selection in favour of the βs allele – a consequence of protection from severe malarial infection afforded by heterozygotes – has been associated with a high degree of extended haplotype similarity. The relationship between classical βs haplotypes and long-range haplotype similarity may have both anthropological and clinical implications, but to date has not been explored. Here we evaluate the haplotype similarity of classical βs haplotypes over 400 kb in population samples from Jamaica, The Gambia, and among the Yoruba of Nigeria (Hapmap YRI). Results The most common βs sub-haplotype among Jamaicans and the Yoruba was the Benin haplotype, while in The Gambia the Senegal haplotype was observed most commonly. Both subtypes exhibited a high degree of long-range haplotype similarity extending across approximately 400 kb in all three populations. This long-range similarity was significantly greater than that seen for other haplotypes sampled in these populations (P < 0.001), and was independent of marker choice and marker density. Among the Yoruba, Benin haplotypes were highly conserved, with very strong linkage disequilibrium (LD) extending a megabase across the βs mutation. Conclusion Two different classical βs haplotypes, sampled from different populations, exhibit comparable and extensive long-range haplotype similarity and strong LD. This LD extends across the adjacent recombination hotspot, and is discernable at distances in excess of 400 kb. Although the multi-centric geographic distribution of βs haplotypes indicates strong subdivision among early Holocene sub-Saharan populations, we find no evidence that selective pressures imposed by falciparum malaria varied in intensity or timing between these subpopulations. Our observations also suggest that cis-acting loci, which may influence

  15. Comparative analysis of the IGF2 and ZBED6 gene variants and haplotypes reveals significant effect of growth traits in cattle.

    PubMed

    Huang, Yong-Zhen; Zhan, Zhao-Yang; Sun, Yu-Jia; Wang, Jing; Li, Ming-Xun; Lan, Xian-Yong; Lei, Chu-Zhao; Zhang, Chun-Lei; Chen, Hong

    2013-06-01

    Muscle growth is a complex phenomenon regulated by many factors, whereby net growth results from the combined action of synthesis and turnover. Insulin-like growth factor 2 (IGF2) is a fetal growth and differentiation factor that plays an important role in muscle growth and in myoblast proliferation and differentiation; Zinc finger, BED-type containing 6 (ZBED6) is a novel transcription factor that was identified and shown to act as a repressor of IGF2 transcription in skeletal muscle. In this study, a total of seven single nucleotide polymorphisms (SNPs) were identified, four SNPs in intron 8 of IGF2 and one promoter SNP and two missense mutations in the coding region of ZBED6, two of which were in complete linkage disequilibrium (LD) in the bovine IGF2. The 58 haplotypes were inferred in 1522 individuals representing four purebred cattle breeds from China. The seven SNPs, 79 and 66 combined diplotypes were revealed for association with body mass in Nanyang and Jiaxian cattle populations at five different ages (P < 0.05 or 0.01). The mutant-type variants and haplotype 58 (likely in LD with the beneficial quantitative trait nucleotide allele) was superior for body mass; the heterozygote diplotype of the most common haplotypes 58 was associated with higher body mass compared to either heterozygote or homozygote. The statistical analyses indicated that the mutant-type variants and haplotypes are significantly associated with body mass in study cattle populations at different ages. These data demonstrate that variants and haplotypes are associated with growth traits, and these results may provide important biological insights into the phenotypic differentiation that is associated with adaptation and specialization of cattle breeds.
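
    The abstract reports two SNPs in complete linkage disequilibrium without giving the underlying frequencies. As a minimal sketch of how LD is usually quantified, the function below computes D, D' and r² for two biallelic loci. The frequencies in the example are hypothetical and are not taken from the study; both loci are assumed to be polymorphic.

```python
def ld_stats(p_ab: float, p_a: float, p_b: float):
    """Return (D, D', r^2) for two biallelic loci.

    p_ab: frequency of the haplotype carrying allele A (locus 1) and allele B (locus 2)
    p_a, p_b: frequencies of allele A and allele B (both assumed strictly between 0 and 1)
    """
    d = p_ab - p_a * p_b
    # D' rescales D by the largest value it could take given the allele frequencies.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = d / d_max if d_max > 0 else 0.0
    r2 = d * d / (p_a * (1 - p_a) * p_b * (1 - p_b))
    return d, d_prime, r2


# Hypothetical example: allele A never occurs without allele B.
d, d_prime, r2 = ld_stats(p_ab=0.3, p_a=0.3, p_b=0.3)
```

    In this hypothetical case both D' and r² come out at 1 (up to floating-point rounding), which is the situation usually described as complete LD.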

  16. Cluster analysis of European Y-chromosomal STR haplotypes using the discrete Laplace method.

    PubMed

    Andersen, Mikkel Meyer; Eriksen, Poul Svante; Morling, Niels

    2014-07-01

    The European Y-chromosomal short tandem repeat (STR) haplotype distribution has previously been analysed in various ways. Here, we introduce a new way of analysing population substructure using a new method based on clustering within the discrete Laplace exponential family that models the probability distribution of the Y-STR haplotypes. Creating a consistent statistical model of the haplotypes enables us to perform a wide range of analyses. Previously, haplotype frequency estimation using the discrete Laplace method has been validated. In this paper we investigate how the discrete Laplace method can be used for cluster analysis to further validate the discrete Laplace method. A very important practical fact is that the calculations can be performed on a normal computer. We identified two sub-clusters of the Eastern and Western European Y-STR haplotypes similar to results of previous studies. We also compared pairwise distances (between geographically separated samples) with those obtained using the AMOVA method and found good agreement. Further analyses that are impossible with AMOVA were made using the discrete Laplace method: analysis of the homogeneity in two different ways and calculating marginal STR distributions. We found that the Y-STR haplotypes from e.g. Finland were relatively homogeneous as opposed to the relatively heterogeneous Y-STR haplotypes from e.g. Lublin, Eastern Poland and Berlin, Germany. We demonstrated that the observed distributions of alleles at each locus were similar to the expected ones. We also compared pairwise distances between geographically separated samples from Africa with those obtained using the AMOVA method and found good agreement. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
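
    The abstract does not restate the model, so the following is only a sketch under stated assumptions: the discrete Laplace distribution is taken in its usual form, P(D = d) = (1 - p)/(1 + p) · p^|d|, and a haplotype's probability within one cluster is taken as the product over loci of the probability of its distance from a central haplotype. The central haplotype and the per-locus parameters below are hypothetical.

```python
from math import prod


def discrete_laplace_pmf(d: int, p: float) -> float:
    """P(D = d) for a discrete Laplace distribution with dispersion 0 < p < 1."""
    return (1 - p) / (1 + p) * p ** abs(d)


def haplotype_probability(haplotype, centre, p_per_locus) -> float:
    """Probability of a Y-STR haplotype within one cluster, assuming independent loci."""
    return prod(
        discrete_laplace_pmf(allele - central, p)
        for allele, central, p in zip(haplotype, centre, p_per_locus)
    )


centre = [14, 13, 30]          # hypothetical central haplotype (repeat counts)
p_per_locus = [0.3, 0.2, 0.4]  # hypothetical per-locus dispersion parameters

prob_centre = haplotype_probability([14, 13, 30], centre, p_per_locus)
prob_one_step = haplotype_probability([15, 13, 30], centre, p_per_locus)
```

    A haplotype one repeat away from the centre at the first locus is less probable than the centre by exactly that locus's parameter (0.3 here). A full model would mix several such clusters with cluster weights, which this sketch omits.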

  17. An Increased Frequency in HLA Class I Alleles and Haplotypes Suggests Genetic Susceptibility to Influenza A (H1N1) 2009 Pandemic: A Case-Control Study

    PubMed Central

    Narayanankutty, Arun; Reséndiz-Hernández, Juan M.; Nava-Quiroz, Karol J.; Bautista-Félix, Nora E.; Castillejos-López, Manuel D. J.

    2018-01-01

    Background The influenza A H1N1/09 pandemic infected a small number of exposed individuals, which suggests the involvement of genetic factors. There are scarce data available on classical HLA class I association with the influenza A H1N1/09 pandemic. Methods We analyzed the frequency of classical HLA class I alleles and haplotypes in A H1N1/09 influenza in a case-control study including 138 influenza patients (INF-P) and 225 asymptomatic healthy contacts (INF-C) simultaneously recruited. HLA class I typing was performed by high-resolution sequence-based typing method. Results Our analysis revealed higher frequency of C*07:02:01, B*39:06:02, C*03:02:01, B*44:03:01, B*51:01:05, and B*73:01 (p < 0.05; OR = 1.84–9.98) and of two haplotypes, A*68:01:02-C*07:02:01 (p = 1.05E-05; OR = 23.99) and B*35:01:01-C*07:02.01 (p = 4.15E-04, OR = 2.15), in A H1N1/09 influenza subjects. A*68:01:01 was present only in the INF-P group (5/138). A decrease in the frequency of C*03:03:01, A*11:01:01, B*39:01:01, A*24:02:01, C*03:04:01, B*51:01:01, and C*07:01:01 (p < 0.05; OR = 0.12–0.52) and of haplotypes A*02:01:01-B*35:01:01-C*04:01:01, A*24:02:01-B*35:01:01, B*39:01:01-C*07:02:01, and B*40:02:01-C*03:04:01 (p < 0.05; OR = 0.08–0.22) was observed in INF-P group. Conclusion Selective classical HLA class I allele and haplotype combinations predispose individuals towards susceptibility or protection against the influenza A H1N1/09 pandemic. This work has significant implications for accessing population transmission risk for A H1N1/09 or a similar strain breakout in the future. PMID:29682588
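
    The odds ratios quoted above come from case-control contingency tables that the abstract does not reproduce. As a rough illustration of the arithmetic, the sketch below computes an odds ratio and a 95% Wald confidence interval from a 2x2 table. Only the group sizes (138 patients, 225 contacts) come from the abstract; the carrier counts are hypothetical, and the study may well have counted alleles rather than individuals.

```python
from math import exp, log, sqrt


def odds_ratio(a: int, b: int, c: int, d: int):
    """Odds ratio with a 95% Wald confidence interval.

    a: cases carrying the allele      b: cases not carrying it
    c: controls carrying the allele   d: controls not carrying it
    All four cells are assumed to be non-zero.
    """
    estimate = (a * d) / (b * c)
    se_log = sqrt(1 / a + 1 / b + 1 / c + 1 / d)  # standard error of log(OR)
    lower = exp(log(estimate) - 1.96 * se_log)
    upper = exp(log(estimate) + 1.96 * se_log)
    return estimate, lower, upper


# Hypothetical carrier counts: 20 of 138 patients versus 15 of 225 contacts.
estimate, lower, upper = odds_ratio(a=20, b=118, c=15, d=210)
```

    An interval that excludes 1 corresponds to an association at roughly the 5% level. Tables with a zero cell, such as an allele seen only in patients, need a continuity correction or an exact test instead.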

  18. SNP by SNP by environment interaction network of alcoholism.

    PubMed

    Zollanvari, Amin; Alterovitz, Gil

    2017-03-14

    Alcoholism has a strong genetic component. Twin studies have demonstrated the heritability of a large proportion of phenotypic variance of alcoholism ranging from 50-80%. The search for genetic variants associated with this complex behavior has epitomized sequence-based studies for nearly a decade. The limited success of genome-wide association studies (GWAS), possibly precipitated by the polygenic nature of complex traits and behaviors, however, has demonstrated the need for novel, multivariate models capable of quantitatively capturing interactions between a host of genetic variants and their association with non-genetic factors. In this regard, capturing the network of SNP by SNP or SNP by environment interactions has recently gained much interest. Here, we assessed 3,776 individuals to construct a network capable of detecting and quantifying the interactions within and between plausible genetic and environmental factors of alcoholism. In this regard, we propose the use of first-order dependence tree of maximum weight as a potential statistical learning technique to delineate the pattern of dependencies underpinning such a complex trait. Using a predictive based analysis, we further rank the genes, demographic factors, biological pathways, and the interactions represented by our SNP × SNP × E network. The proposed framework is quite general and can be potentially applied to the study of other complex traits.

  19. Haplotype analysis of the HFE gene among populations of Northern Eurasia, in patients with metabolic disorders or stomach cancer, and in long-lived people.

    PubMed

    Mikhailova, S V; Babenko, V N; Ivanoshchuk, D E; Gubina, M A; Maksimov, V N; Solovjova, I G; Voevoda, M I

    2016-06-17

    Previously, it was shown that the HFE gene (associated with human hereditary hemochromatosis) has several haplotypes of intronic polymorphisms. Some haplotype frequencies are race specific and hence can be used in phylogenetic analysis. We assumed that analysis of Caucasoid patients, living now in Western Siberia and having diseases associated with dietary habits and metabolic rate, will allow us to understand the processes of possible selection during settling of the northern part of Asia. Haplotype analysis of Northern Eurasian native and recently settled ethnic groups was performed on polymorphisms rs1799945, rs1800730, rs1800562, rs2071303, rs1800708, rs1572982, rs2794719, rs807209, and rs2032451 of this gene. The CCA haplotype of the rs2071303, rs1800708, and rs1572982 was found to be associated with HLA-A2 (39 %) in Asian populations. Haplotype analysis for the rs1799945, rs1800730, rs1800562, rs2071303, rs1800708, and rs1572982 was performed on Russian patients with some metabolic disorders or stomach cancer and among long-lived people. Decreased frequencies of the TTA haplotype (T in rs2071303, T in rs1800708, and A in rs1572982) were observed in the groups of patients with diseases associated with overweight (fatty liver disease, type 2 diabetes mellitus, or metabolic syndrome + arterial hypertension) as compared with the control sample. We detected significant differences in this haplotype's frequency between the patients with type 2 diabetes mellitus and Russian adolescents, elderly citizens, and long-lived people (χ² P value = 0.003, 0.010, and 0.015, respectively). No significant differences in frequencies of the alleles with mutations in coding regions of the HFE gene (C282Y, H63D, and S65C) were detected between the analyzed patients (with stomach cancer, metabolic syndrome, fatty liver disease, or type 2 diabetes mellitus) and the control Caucasoid sample. Monophyletic origin of H63D (rs1799945) was confirmed in Caucasoids and Northern

  20. Apolipoprotein H promoter polymorphisms in relation to lupus and lupus-related phenotypes.

    PubMed

    Suresh, Sangita; Demirci, F Yesim K; Jacobs, Erin; Kao, Amy H; Rhew, Elisa Y; Sanghera, Dharambir K; Selzer, Faith; Sutton-Tyrrell, Kim; McPherson, David; Bontempo, Franklin A; Kammerer, Candace M; Ramsey-Goldman, Rosalind; Manzi, Susan; Kamboh, M Ilyas

    2009-02-01

    Sequence variation in gene promoters is often associated with disease risk. We tested the hypothesis that common promoter variation in the APOH gene (encoding for β2-glycoprotein I) is associated with systemic lupus erythematosus (SLE) risk and SLE-related clinical phenotypes in a Caucasian cohort. We used a case-control design and genotyped 345 women with SLE and 454 healthy control women for 8 APOH promoter single-nucleotide polymorphisms (SNP; -1284C>G, -1219G>A, -1190G>C, -759A>G, -700C>A, -643T>C, -38G>A, and -32C>A). Association analyses were performed on single SNP and haplotypes. Haplotype analyses were performed using EH (Estimate Haplotype-frequencies) and Haploview programs. In vitro reporter gene assay was performed in COS-1 cells. Electrophoretic mobility shift assay (EMSA) was performed using HepG2 nuclear cells. Overall haplotype distribution of the APOH promoter SNP was significantly different between cases and controls (p = 0.009). The -643C allele was found to be protective against carotid plaque formation (adjusted OR 0.37, p = 0.013) among patients with SLE. The -643C allele was associated with a ~2-fold decrease in promoter activity as compared to wild-type -643T allele (mean ± standard deviation: 3.94 ± 0.05 vs 6.99 ± 0.68, p = 0.016). EMSA showed that the -643T>C SNP harbors a binding site for a nuclear factor. The -1219G>A SNP showed a significant association with the risk of lupus nephritis (age-adjusted OR 0.36, p = 0.016). Our data indicate that APOH promoter variants may be involved in the etiology of SLE, especially the risk for autoimmune-mediated cardiovascular disease.

  1. GJB2 Mutations in Mongolia: Complex Alleles, Low Frequency, and Reduced Fitness of the Deaf

    PubMed Central

    Tekin, Mustafa; Xia, Xia-Juan; Erdenetungalag, Radnaabazar; Cengiz, F. Basak; White, Thomas W.; Radnaabazar, Janchiv; Dangaasuren, Begzsuren; Tastan, Hakki; Nance, Walter E.; Pandya, Arti

    2016-01-01

    Summary We screened the GJB2 gene for mutations in 534 (108 multiplex and 426 simplex) probands with non-syndromic sensorineural deafness, who were ascertained through the only residential school for deaf in Mongolia and in 217 hearing controls. Twenty different alleles, including four novel changes, were identified. Biallelic GJB2 mutations were found in 4.5% of the deaf probands (8.3% in multiplex, 3.5% in simplex). The most common mutations were c.IVS1+1G>A (c.-3201G>A) and c.235delC with allele frequencies of 3.5% and 1.5%, respectively. The c.IVS1+1G>A mutation appears to have diverse origins based on its association with multiple haplotypes constructed using nearby SNP markers. The p.V27I and p.E114G variants were frequently detected in both deaf probands and hearing controls. The p.E114G variant was always associated with p.V27I, and haplotype analysis confirmed that it was always in cis with the p.V27I variant. Although in vitro experiments using Xenopus oocytes have suggested that p.[V27I;E114G] disturb the gap junction function of Cx26, the equal distribution of this complex allele in both deaf probands and hearing controls makes it a less likely cause of profound congenital deafness. We found a lower frequency of assortative mating (37.5%) and decreased genetic fitness (62%) of the deaf in Mongolia as compared to the western populations, which provides an explanation for lower frequency of GJB2 deafness in Mongolia. PMID:20201936

  2. Haplotype Analysis Discriminates Genetic Risk for DR3-Associated Endocrine Autoimmunity and Helps Define Extreme Risk for Addison’s Disease

    PubMed Central

    Baker, Peter R.; Baschal, Erin E.; Fain, Pam R.; Triolo, Taylor M.; Nanduri, Priyaanka; Siebert, Janet C.; Armstrong, Taylor K.; Babu, Sunanda R.; Rewers, Marian J.; Gottlieb, Peter A.; Barker, Jennifer M.; Eisenbarth, George S.

    2010-01-01

    Context: Multiple autoimmune disorders (e.g. Addison’s disease, type 1 diabetes, celiac disease) are associated with HLA-DR3, but it is likely that alleles of additional genes in linkage disequilibrium with HLA-DRB1 contribute to disease. Objective: The objective of the study was to characterize major histocompatibility complex (MHC) haplotypes conferring extreme risk for autoimmune Addison’s disease (AD). Design, Setting, and Participants: Eighty-six 21-hydroxylase autoantibody-positive, nonautoimmune polyendocrine syndrome type 1, Caucasian individuals collected from 1992 to 2009 with clinical AD from 68 families (12 multiplex and 56 simplex) were genotyped for HLA-DRB1, HLA-DQB1, MICA, HLA-B, and HLA-A as well as high density MHC single-nucleotide polymorphism (SNP) analysis for 34. Main Outcome Measures: AD and genotype were measured. Result: Ninety-seven percent of the multiplex individuals had both HLA-DR3 and HLA-B8 vs. 60% of simplex AD patients (P = 9.72 × 10^−4) and 13% of general population controls (P = 3.00 × 10^−19). The genotype DR3/DR4 with B8 was present in 85% of AD multiplex patients, 24% of simplex patients, and 1.5% of control individuals (P = 4.92 × 10^−191). The DR3-B8 haplotype of AD patients had HLA-A1 less often (47%) than controls (81%, P = 7.00 × 10^−5) and type 1 diabetes patients (73%, P = 1.93 × 10^−3). Analysis of 1228 SNPs across the MHC for individuals with AD revealed a shorter conserved haplotype (3.8) with the loss of the extended conserved 3.8.1 haplotype approximately halfway between HLA-B and HLA-A. Conclusion: Extreme risk for AD, especially in multiplex families, is associated with haplotypic DR3 variants, in particular a portion (3.8) but not all of the conserved 3.8.1 haplotype. PMID:20631027

  3. Global selection on sucrose synthase haplotypes during a century of wheat breeding.

    PubMed

    Hou, Jian; Jiang, Qiyan; Hao, Chenyang; Wang, Yuquan; Zhang, Hongna; Zhang, Xueyong

    2014-04-01

    Spike number per unit area, number of grains per spike, and thousand kernel weight (TKW) are important yield components. In China, increases in wheat (Triticum aestivum) yields are mainly due to increases in grain number per spike and TKW. TKW mainly depends on starch content, as starch accounts for about 70% of the grain endosperm. Sucrose synthase catalysis is the first step in the conversion of sucrose to starch, that is, the conversion of sucrose to fructose and UDP-glucose by the wheat sucrose synthase genes (TaSus1 and TaSus2) that are located on chromosomes 7A/7B/7D and 2A/2B/2D, respectively. A total of 1,520 wheat accessions were genotyped at the six loci. Two, two, five, and two haplotypes were identified at the TaSus2-2A, TaSus2-2B, TaSus1-7A, and TaSus1-7B loci, respectively. Their main variations were detected within the introns. Significant differences between the haplotypes correlated with TKW differences among 348 modern Chinese cultivars from the core collection. Frequency changes for favored haplotypes showed gradual increases in cultivars released since the beginning of the last century in China, Europe, and North America. Geographic distributions and time changes of favored haplotypes were characterized in six major wheat production regions worldwide. Strong selection bottlenecks to haplotype variations occurred at polyploidization and domestication and during breeding of wheat. Genetic-effect differences between haplotypes at the same locus influence the selection time and intensity. This work shows that the endosperm starch synthesis pathway is a major target of indirect selection in global wheat breeding for higher yield.

  4. Haplotyping for disease association: a combinatorial approach.

    PubMed

    Lancia, Giuseppe; Ravi, R; Rizzi, Romeo

    2008-01-01

    We consider a combinatorial problem derived from haplotyping a population with respect to a genetic disease, either recessive or dominant. Given a set of individuals, partitioned into healthy and diseased, and the corresponding sets of genotypes, we want to infer "bad" and "good" haplotypes to account for these genotypes and for the disease. Assume e.g. the disease is recessive. Then, the resolving haplotypes must consist of bad and good haplotypes, so that (i) each genotype belonging to a diseased individual is explained by a pair of bad haplotypes and (ii) each genotype belonging to a healthy individual is explained by a pair of haplotypes of which at least one is good. We prove that the associated decision problem is NP-complete. However, we also prove that there is a simple solution, provided the data satisfy a very weak requirement.
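
    Conditions (i) and (ii) for the recessive case can be checked mechanically once a candidate resolution is given. The sketch below only verifies such a candidate; it does not solve the NP-complete inference problem and is not the authors' algorithm. The encoding is an assumption borrowed from common haplotyping conventions: haplotypes are tuples of 0/1, and a genotype site is 0 or 1 when homozygous and 2 when heterozygous.

```python
def explains(h1, h2, genotype) -> bool:
    """True if the haplotype pair (h1, h2) is consistent with the genotype at every site."""
    return all(
        (a != b) if g == 2 else (a == b == g)
        for a, b, g in zip(h1, h2, genotype)
    )


def valid_recessive_resolution(individuals, bad_haplotypes) -> bool:
    """Check conditions (i) and (ii) for a recessive disease.

    individuals: iterable of (genotype, is_diseased, h1, h2) tuples
    bad_haplotypes: set of haplotypes labelled bad; all others count as good
    """
    for genotype, is_diseased, h1, h2 in individuals:
        if not explains(h1, h2, genotype):
            return False
        both_bad = h1 in bad_haplotypes and h2 in bad_haplotypes
        # Diseased individuals need two bad haplotypes; healthy ones need at least one good.
        if is_diseased != both_bad:
            return False
    return True


# Hypothetical three-site example.
individuals = [
    ((1, 1, 0), True, (1, 1, 0), (1, 1, 0)),   # diseased, homozygous for a bad haplotype
    ((2, 1, 2), False, (1, 1, 0), (0, 1, 1)),  # healthy carrier with one good haplotype
]
resolution_ok = valid_recessive_resolution(individuals, bad_haplotypes={(1, 1, 0)})
```

    Here `resolution_ok` is True. Labelling (0, 1, 1) as bad too would make the healthy carrier violate condition (ii), so the same resolution would be rejected.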

  5. KinSNP software for homozygosity mapping of disease genes using SNP microarrays

    PubMed Central

    2010-01-01

    Consanguineous families affected with a recessive genetic disease caused by homozygotisation of a mutation offer a unique advantage for positional cloning of rare diseases. Homozygosity mapping of patient genotypes is a powerful technique for the identification of the genomic locus harbouring the causing mutation. This strategy relies on the observation that in these patients a large region spanning the disease locus is also homozygous with high probability. The high marker density in single nucleotide polymorphism (SNP) arrays is extremely advantageous for homozygosity mapping. We present KinSNP, a user-friendly software tool for homozygosity mapping using SNP arrays. The software searches for stretches of SNPs which are homozygous to the same allele in all ascertained sick individuals. User-specified parameters control the number of allowed genotyping 'errors' within homozygous blocks. Candidate disease regions are then reported in a detailed, coloured Excel file, along with genotypes of family members and healthy controls. An interactive genome browser has been included which shows homozygous blocks, individual genotypes, genes and further annotations along the chromosomes, with zooming and scrolling capabilities. The software has been used to identify the location of a mutated gene causing insensitivity to pain in a large Bedouin family. KinSNP is freely available from http://bioinfo.bgu.ac.il/bsu/software/kinSNP. PMID:20846928
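
    The search described above can be approximated in a few lines. This is a simplified greedy sketch, not KinSNP's actual implementation: genotypes are assumed to be two-letter strings such as "AA" or "AB", and the parameter names `min_snps` and `max_errors` are hypothetical stand-ins for the user-specified settings mentioned in the abstract.

```python
def shared_homozygous(column) -> bool:
    """True if all affected individuals carry the same homozygous genotype at one SNP."""
    first = column[0]
    return first[0] == first[1] and all(g == first for g in column)


def homozygous_blocks(genotypes, min_snps=3, max_errors=0):
    """Return inclusive (start, end) SNP index pairs of shared homozygous stretches.

    genotypes: one list of genotype strings per affected individual, in SNP order
    max_errors: non-conforming SNPs tolerated inside a block (greedy, left to right)
    """
    n_snps = len(genotypes[0])
    ok = [shared_homozygous([ind[i] for ind in genotypes]) for i in range(n_snps)]
    blocks, start, last_ok, errors = [], None, None, 0
    for i, flag in enumerate(ok):
        if flag:
            if start is None:
                start, errors = i, 0
            last_ok = i
        elif start is not None:
            errors += 1
            if errors > max_errors:
                # Close the block at the last conforming SNP so trailing errors are trimmed.
                if last_ok - start + 1 >= min_snps:
                    blocks.append((start, last_ok))
                start = None
    if start is not None and last_ok - start + 1 >= min_snps:
        blocks.append((start, last_ok))
    return blocks


# Two hypothetical affected relatives typed at seven SNPs.
affected = [
    ["AA", "AB", "BB", "BB", "AA", "AB", "AA"],
    ["AA", "AA", "BB", "BB", "AA", "BB", "AA"],
]
strict_blocks = homozygous_blocks(affected, min_snps=3, max_errors=0)
tolerant_blocks = homozygous_blocks(affected, min_snps=3, max_errors=1)
```

    With no errors allowed, only SNPs 2 to 4 qualify. Tolerating one error extends the block back to SNP 0. A real tool would also weigh physical distance and missing calls, which this sketch ignores.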

  6. Genome sequence, comparative analysis and haplotype structure of the domestic dog.

    PubMed

    Lindblad-Toh, Kerstin; Wade, Claire M; Mikkelsen, Tarjei S; Karlsson, Elinor K; Jaffe, David B; Kamal, Michael; Clamp, Michele; Chang, Jean L; Kulbokas, Edward J; Zody, Michael C; Mauceli, Evan; Xie, Xiaohui; Breen, Matthew; Wayne, Robert K; Ostrander, Elaine A; Ponting, Chris P; Galibert, Francis; Smith, Douglas R; DeJong, Pieter J; Kirkness, Ewen; Alvarez, Pablo; Biagi, Tara; Brockman, William; Butler, Jonathan; Chin, Chee-Wye; Cook, April; Cuff, James; Daly, Mark J; DeCaprio, David; Gnerre, Sante; Grabherr, Manfred; Kellis, Manolis; Kleber, Michael; Bardeleben, Carolyne; Goodstadt, Leo; Heger, Andreas; Hitte, Christophe; Kim, Lisa; Koepfli, Klaus-Peter; Parker, Heidi G; Pollinger, John P; Searle, Stephen M J; Sutter, Nathan B; Thomas, Rachael; Webber, Caleb; Baldwin, Jennifer; Abebe, Adal; Abouelleil, Amr; Aftuck, Lynne; Ait-Zahra, Mostafa; Aldredge, Tyler; Allen, Nicole; An, Peter; Anderson, Scott; Antoine, Claudel; Arachchi, Harindra; Aslam, Ali; Ayotte, Laura; Bachantsang, Pasang; Barry, Andrew; Bayul, Tashi; Benamara, Mostafa; Berlin, Aaron; Bessette, Daniel; Blitshteyn, Berta; Bloom, Toby; Blye, Jason; Boguslavskiy, Leonid; Bonnet, Claude; Boukhgalter, Boris; Brown, Adam; Cahill, Patrick; Calixte, Nadia; Camarata, Jody; Cheshatsang, Yama; Chu, Jeffrey; Citroen, Mieke; Collymore, Alville; Cooke, Patrick; Dawoe, Tenzin; Daza, Riza; Decktor, Karin; DeGray, Stuart; Dhargay, Norbu; Dooley, Kimberly; Dooley, Kathleen; Dorje, Passang; Dorjee, Kunsang; Dorris, Lester; Duffey, Noah; Dupes, Alan; Egbiremolen, Osebhajajeme; Elong, Richard; Falk, Jill; Farina, Abderrahim; Faro, Susan; Ferguson, Diallo; Ferreira, Patricia; Fisher, Sheila; FitzGerald, Mike; Foley, Karen; Foley, Chelsea; Franke, Alicia; Friedrich, Dennis; Gage, Diane; Garber, Manuel; Gearin, Gary; Giannoukos, Georgia; Goode, Tina; Goyette, Audra; Graham, Joseph; Grandbois, Edward; Gyaltsen, Kunsang; Hafez, Nabil; Hagopian, Daniel; Hagos, Birhane; Hall, Jennifer; Healy, Claire; Hegarty, Ryan; Honan, 
Tracey; Horn, Andrea; Houde, Nathan; Hughes, Leanne; Hunnicutt, Leigh; Husby, M; Jester, Benjamin; Jones, Charlien; Kamat, Asha; Kanga, Ben; Kells, Cristyn; Khazanovich, Dmitry; Kieu, Alix Chinh; Kisner, Peter; Kumar, Mayank; Lance, Krista; Landers, Thomas; Lara, Marcia; Lee, William; Leger, Jean-Pierre; Lennon, Niall; Leuper, Lisa; LeVine, Sarah; Liu, Jinlei; Liu, Xiaohong; Lokyitsang, Yeshi; Lokyitsang, Tashi; Lui, Annie; Macdonald, Jan; Major, John; Marabella, Richard; Maru, Kebede; Matthews, Charles; McDonough, Susan; Mehta, Teena; Meldrim, James; Melnikov, Alexandre; Meneus, Louis; Mihalev, Atanas; Mihova, Tanya; Miller, Karen; Mittelman, Rachel; Mlenga, Valentine; Mulrain, Leonidas; Munson, Glen; Navidi, Adam; Naylor, Jerome; Nguyen, Tuyen; Nguyen, Nga; Nguyen, Cindy; Nguyen, Thu; Nicol, Robert; Norbu, Nyima; Norbu, Choe; Novod, Nathaniel; Nyima, Tenchoe; Olandt, Peter; O'Neill, Barry; O'Neill, Keith; Osman, Sahal; Oyono, Lucien; Patti, Christopher; Perrin, Danielle; Phunkhang, Pema; Pierre, Fritz; Priest, Margaret; Rachupka, Anthony; Raghuraman, Sujaa; Rameau, Rayale; Ray, Verneda; Raymond, Christina; Rege, Filip; Rise, Cecil; Rogers, Julie; Rogov, Peter; Sahalie, Julie; Settipalli, Sampath; Sharpe, Theodore; Shea, Terrance; Sheehan, Mechele; Sherpa, Ngawang; Shi, Jianying; Shih, Diana; Sloan, Jessie; Smith, Cherylyn; Sparrow, Todd; Stalker, John; Stange-Thomann, Nicole; Stavropoulos, Sharon; Stone, Catherine; Stone, Sabrina; Sykes, Sean; Tchuinga, Pierre; Tenzing, Pema; Tesfaye, Senait; Thoulutsang, Dawa; Thoulutsang, Yama; Topham, Kerri; Topping, Ira; Tsamla, Tsamla; Vassiliev, Helen; Venkataraman, Vijay; Vo, Andy; Wangchuk, Tsering; Wangdi, Tsering; Weiand, Michael; Wilkinson, Jane; Wilson, Adam; Yadav, Shailendra; Yang, Shuli; Yang, Xiaoping; Young, Geneva; Yu, Qing; Zainoun, Joanne; Zembek, Lisa; Zimmer, Andrew; Lander, Eric S

    2005-12-08

    Here we report a high-quality draft genome sequence of the domestic dog (Canis familiaris), together with a dense map of single nucleotide polymorphisms (SNPs) across breeds. The dog is of particular interest because it provides important evolutionary information and because existing breeds show great phenotypic diversity for morphological, physiological and behavioural traits. We use sequence comparison with the primate and rodent lineages to shed light on the structure and evolution of genomes and genes. Notably, the majority of the most highly conserved non-coding sequences in mammalian genomes are clustered near a small subset of genes with important roles in development. Analysis of SNPs reveals long-range haplotypes across the entire dog genome, and defines the nature of genetic diversity within and across breeds. The current SNP map now makes it possible for genome-wide association studies to identify genes responsible for diseases and traits, with important consequences for human and companion animal health.

  7. Enhancing the mathematical properties of new haplotype homozygosity statistics for the detection of selective sweeps.

    PubMed

    Garud, Nandita R; Rosenberg, Noah A

    2015-06-01

    Soft selective sweeps represent an important form of adaptation in which multiple haplotypes bearing adaptive alleles rise to high frequency. Most statistical methods for detecting selective sweeps from genetic polymorphism data, however, have focused on identifying hard selective sweeps in which a favored allele appears on a single haplotypic background; these methods might be underpowered to detect soft sweeps. Among exceptions is the set of haplotype homozygosity statistics introduced for the detection of soft sweeps by Garud et al. (2015). These statistics, examining frequencies of multiple haplotypes in relation to each other, include H12, a statistic designed to identify both hard and soft selective sweeps, and H2/H1, a statistic that conditional on high H12 values seeks to distinguish between hard and soft sweeps. A challenge in the use of H2/H1 is that its range depends on the associated value of H12, so that equal H2/H1 values might provide different levels of support for a soft sweep model at different values of H12. Here, we enhance the H12 and H2/H1 haplotype homozygosity statistics for selective sweep detection by deriving the upper bound on H2/H1 as a function of H12, thereby generating a statistic that normalizes H2/H1 to lie between 0 and 1. Through a reanalysis of resequencing data from inbred lines of Drosophila, we show that the enhanced statistic both strengthens interpretations obtained with the unnormalized statistic and leads to empirical insights that are less readily apparent without the normalization. Copyright © 2015 Elsevier Inc. All rights reserved.
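
    The abstract does not restate how the statistics are defined. The sketch below uses the definitions as they are commonly given for this family of statistics, with haplotype frequencies sorted so that p1 ≥ p2 ≥ ...: H1 = Σ p_i², H12 = (p1 + p2)² + Σ_{i>2} p_i², and H2 = H1 − p1². The normalized statistic derived in the paper is not reproduced here, and the sample is hypothetical.

```python
from collections import Counter


def haplotype_homozygosity(haplotypes):
    """Compute H1, H12 and H2/H1 from a list of observed haplotypes."""
    n = len(haplotypes)
    freqs = sorted((c / n for c in Counter(haplotypes).values()), reverse=True)
    p1 = freqs[0]
    p2 = freqs[1] if len(freqs) > 1 else 0.0
    h1 = sum(p * p for p in freqs)
    h12 = h1 + 2 * p1 * p2   # pools the two most frequent haplotypes into one class
    h2 = h1 - p1 * p1        # homozygosity excluding the most frequent haplotype
    return {"H1": h1, "H12": h12, "H2/H1": h2 / h1}


# Hypothetical sample of ten haplotypes with frequencies 0.5, 0.3 and 0.2.
stats = haplotype_homozygosity(["A"] * 5 + ["B"] * 3 + ["C"] * 2)
```

    For this sample H1 is 0.38, H12 is 0.68 and H2/H1 is about 0.34. Because H12 pools the two most common haplotypes, it stays high when a second haplotype is also frequent, which is why it is used for both hard and soft sweeps.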

  8. Association between SLC19A1 Gene Polymorphism and High Dose Methotrexate Toxicity in Childhood Acute Lymphoblastic Leukaemia and Non Hodgkin Malignant Lymphoma: Introducing a Haplotype based Approach

    PubMed Central

    Kotnik, Barbara Faganel; Jazbec, Janez; Grabar, Petra Bohanec; Rodriguez-Antona, Cristina

    2017-01-01

    Abstract Background We investigated the clinical relevance of SLC19A1 genetic variability for high dose methotrexate (HD-MTX) related toxicities in children and adolescents with acute lymphoblastic leukaemia (ALL) and non Hodgkin malignant lymphoma (NHML). Patients and methods Eighty-eight children and adolescents with ALL/NHML were investigated for the influence of SLC19A1 single nucleotide polymorphisms (SNPs) and haplotypes on HD-MTX induced toxicities. Results Patients with rs2838958 TT genotype had higher probability for mucositis development as compared to carriers of at least one rs2838958 C allele (OR 0.226 (0.071–0.725), p < 0.009). Haplotype TGTTCCG (H4) statistically significantly reduced the risk for the occurrence of adverse events during treatment with HD-MTX (OR 0.143 (0.023–0.852), p = 0.030). Conclusions SLC19A1 SNP and haplotype analysis could provide additional information in a personalized HD-MTX therapy for children with ALL/NHML in order to achieve better treatment outcome. However further studies are needed to validate the results. PMID:29333125

  9. Unique haplotypes of cacao trees as revealed by trnH-psbA chloroplast DNA

    PubMed Central

    Gutiérrez-López, Nidia; Ovando-Medina, Isidro; Salvador-Figueroa, Miguel; Molina-Freaner, Francisco; Avendaño-Arrazate, Carlos H.

    2016-01-01

    Cacao trees have been cultivated in Mesoamerica for at least 4,000 years. In this study, we analyzed sequence variation in the chloroplast DNA trnH-psbA intergenic spacer from 28 cacao trees from different farms in the Soconusco region in southern Mexico. Genetic relationships were established by two analysis approaches based on geographic origin (five populations) and genetic origin (based on a previous study). We identified six polymorphic sites, including five insertion/deletion (indels) types and one transversion. The overall nucleotide diversity was low for both approaches (geographic = 0.0032 and genetic = 0.0038). Conversely, we obtained moderate to high haplotype diversity (0.66 and 0.80) with 10 and 12 haplotypes, respectively. The common haplotype (H1) for both networks included cacao trees from all geographic locations (geographic approach) and four genetic groups (genetic approach). This common haplotype (ancient) derived a set of intermediate haplotypes and singletons interconnected by one or two mutational steps, which suggested directional selection and event purification from the expansion of narrow populations. Cacao trees from Soconusco region were grouped into one cluster without any evidence of subclustering based on AMOVA (FST = 0) and SAMOVA (FST = 0.04393) results. One population (Mazatán) showed a high haplotype frequency; thus, this population could be considered an important reservoir of genetic material. The indels located in the trnH-psbA intergenic spacer of cacao trees could be useful as markers for the development of DNA barcoding. PMID:27076998
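
    The abstract reports haplotype diversity without stating the estimator. A common choice, assumed here, is Nei's unbiased estimator h = n/(n − 1) · (1 − Σ p_i²), where p_i is the sample frequency of haplotype i and n is the sample size. The sample below is hypothetical and does not reproduce the cacao data.

```python
from collections import Counter


def haplotype_diversity(haplotypes) -> float:
    """Nei's unbiased haplotype diversity for a sample of at least two haplotypes."""
    n = len(haplotypes)
    if n < 2:
        raise ValueError("need at least two sampled haplotypes")
    sum_sq = sum((count / n) ** 2 for count in Counter(haplotypes).values())
    return n / (n - 1) * (1 - sum_sq)


# Hypothetical sample: ten trees carrying three haplotypes.
h = haplotype_diversity(["H1"] * 6 + ["H2"] * 3 + ["H3"])
```

    For this sample h is 0.6. A sample in which every individual carries the same haplotype gives 0, and a sample of all-distinct haplotypes gives 1.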

  10. Y chromosome haplotype diversity of domestic sheep (Ovis aries) in northern Eurasia.

    PubMed

    Zhang, Min; Peng, Wei-Feng; Yang, Guang-Li; Lv, Feng-Hua; Liu, Ming-Jun; Li, Wen-Rong; Liu, Yong-Gang; Li, Jin-Quan; Wang, Feng; Shen, Zhi-Qiang; Zhao, Sheng-Guo; Hehua, Eer; Marzanov, Nurbiy; Murawski, Maziek; Kantanen, Juha; Li, Meng-Hua

    2014-12-01

    Variation in two SNPs and one microsatellite on the Y chromosome was analyzed in a total of 663 rams representing 59 breeds from a large geographic range in northern Eurasia. SNPA-oY1 showed the highest allele frequency (91.55%) across the breeds, whereas SNPG-oY1 was present in only 56 samples. Combined genotypes established seven haplotypes (H4, H5, H6, H7, H8, H12 and H19). H6 dominated in northern Eurasia, and H8 showed the second-highest frequency. H4, which had been earlier reported to be absent in European breeds, was detected in one European breed (Swiniarka), whereas H7, which had been previously identified to be unique to European breeds, was present in two Chinese breeds (Ninglang Black and Large-tailed Han), one Buryatian (Transbaikal Finewool) and two Russian breeds (North Caucasus Mutton-Wool and Kuibyshev). H12, which had been detected only in Turkish breeds, was also found in Chinese breeds in this work. An overall low level of haplotype diversity (median h = 0.1288) was observed across the breeds with relatively higher median values in breeds from the regions neighboring the Near Eastern domestication center of sheep. H6 is the dominant haplotype in northwestern and eastern China, in which the haplotype distribution could be explained by the historical translocations of the H4 and H8 Y chromosomes to China via the Mongol invasions followed by expansions to northwestern and eastern China. Our findings extend previous results of sheep Y chromosomal genetic variability and indicate probably recent paternal gene flows between sheep breeds from distinct major geographic regions. © 2014 Stichting International Foundation for Animal Genetics.

  11. A single nucleotide polymorphism in osteonectin 3’ untranslated region regulates bone volume and is targeted by miR-433

    PubMed Central

    Dole, Neha S.; Kapinas, Kristina; Kessler, Catherine B.; Yee, Siu-Pok; Adams, Douglas J.; Pereira, Renata C.; Delany, Anne M.

    2014-01-01

    Osteonectin/SPARC is one of the most abundant non-collagenous extracellular matrix proteins in bone, regulating collagen fiber assembly and promoting osteoblast differentiation. Osteonectin-null and –haploinsufficient mice have low turnover osteopenia, indicating that osteonectin contributes to normal bone formation. In male idiopathic osteoporosis patients, osteonectin 3’ UTR single nucleotide polymorphism (SNP) haplotypes that differed only at SNP1599 (rs1054204) were previously associated with bone mass. Haplotype A (containing SNP1599G) was more frequent in severely affected patients, whereas haplotype B (containing SNP1599C) was more frequent in less affected patients and healthy controls. We hypothesized that SNP1599 contributes to variability in bone mass by modulating osteonectin levels. Osteonectin 3’UTR reporter constructs demonstrated that haplotype A has a repressive effect on gene expression compared to B. We found that SNP1599G contributed to a miR-433 binding site and miR-433 inhibitor relieved repression of the haplotype A, but not B, 3’ UTR reporter construct. We tested our hypothesis in vivo, using a knock-in approach to replace the mouse osteonectin 3’ UTR with human haplotype A or B 3’ UTR. Compared to haplotype A mice, bone osteonectin levels were higher in haplotype B mice. B mice displayed higher bone formation rate and gained more trabecular bone with age. When parathyroid hormone was administered intermittently, haplotype B mice gained more cortical bone area than A mice. Cultured marrow stromal cells from B mice deposited more mineralized matrix and had higher osteocalcin mRNA compared with A mice, demonstrating a cell-autonomous effect on differentiation. Altogether, SNP1599 differentially regulates osteonectin expression and contributes to variability in bone mass, by a mechanism that may involve differential targeting by miR-433. This work validates the findings of the previous candidate gene study, and it assigns a

  12. SNP2TFBS - a database of regulatory SNPs affecting predicted transcription factor binding site affinity.

    PubMed

    Kumar, Sunil; Ambrosini, Giovanna; Bucher, Philipp

    2017-01-04

    SNP2TFBS is a computational resource intended to support researchers investigating the molecular mechanisms underlying regulatory variation in the human genome. The database essentially consists of a collection of text files providing specific annotations for human single nucleotide polymorphisms (SNPs), namely whether they are predicted to abolish, create or change the affinity of one or several transcription factor (TF) binding sites. A SNP's effect on TF binding is estimated based on a position weight matrix (PWM) model for the binding specificity of the corresponding factor. These data files are regenerated at regular intervals by an automatic procedure that takes as input a reference genome, a comprehensive SNP catalogue and a collection of PWMs. SNP2TFBS is also accessible over a web interface, enabling users to view the information provided for an individual SNP, to extract SNPs based on various search criteria, to annotate uploaded sets of SNPs or to display statistics about the frequencies of binding sites affected by selected SNPs. Homepage: http://ccg.vital-it.ch/snp2tfbs/. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
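
    The PWM-based estimate described above can be illustrated with a minimal sketch. It is not the SNP2TFBS pipeline: the count matrix, the uniform background, the pseudocount and the flanking sequence are all hypothetical values chosen for illustration. The idea is to score the best-matching window around the SNP for the reference and the alternate allele and to compare the two scores.

```python
import math

# Hypothetical 4-position base-count matrix for an imaginary factor
# (illustrative only; not taken from SNP2TFBS or any PWM collection).
COUNTS = {
    "A": [8, 1, 0, 9],
    "C": [1, 0, 1, 0],
    "G": [0, 9, 0, 1],
    "T": [1, 0, 9, 0],
}
BACKGROUND = 0.25   # assumed uniform base composition
PSEUDOCOUNT = 0.5   # assumed pseudocount to avoid log(0)


def log_odds_matrix(counts):
    """Convert base counts into per-position log2 odds scores."""
    width = len(counts["A"])
    pwm = []
    for i in range(width):
        total = sum(counts[b][i] for b in "ACGT") + 4 * PSEUDOCOUNT
        pwm.append({
            b: math.log2((counts[b][i] + PSEUDOCOUNT) / total / BACKGROUND)
            for b in "ACGT"
        })
    return pwm


def score(pwm, site):
    """Sum the log-odds scores of one site of the same width as the PWM."""
    return sum(column[base] for column, base in zip(pwm, site))


def best_score(pwm, seq):
    """Best score over all windows of PWM width in seq (one strand only)."""
    w = len(pwm)
    return max(score(pwm, seq[i:i + w]) for i in range(len(seq) - w + 1))


def snp_effect(pwm, left_flank, ref, alt, right_flank):
    """Return (reference score, alternate score, score change) for one SNP."""
    ref_score = best_score(pwm, left_flank + ref + right_flank)
    alt_score = best_score(pwm, left_flank + alt + right_flank)
    return ref_score, alt_score, alt_score - ref_score


pwm = log_odds_matrix(COUNTS)
# Hypothetical G>C SNP embedded in the context CCA[G/C]TAC.
ref_score, alt_score, delta = snp_effect(pwm, "CCA", "G", "C", "TAC")
print(f"ref={ref_score:.2f} alt={alt_score:.2f} delta={delta:.2f}")
```

    A negative score change suggests that the alternate allele weakens the predicted site. A real analysis would also scan the reverse strand and apply a score threshold, both omitted here for brevity.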

  13. Association between the KRAS Gene Polymorphisms and Papillary Thyroid Carcinoma in a Chinese Han Population.

    PubMed

    Ning, Lifeng; Rao, Wenwang; Yu, Yaqin; Liu, Xiaoli; Pan, Yuchen; Ma, Yuan; Liu, Rui; Zhang, Shangchao; Sun, Hui; Yu, Qiong

    2016-01-01

    Several studies have reported the association between MAPK signaling pathway gene polymorphisms and papillary thyroid carcinoma (PTC). The KRAS gene, an oncogene from the mammalian RAS gene family, plays an important role in the MAPK pathway. This study aimed to identify the potential association of KRAS gene polymorphisms with susceptibility to PTC in a Han Chinese population. A total of 861 patients with PTC, 562 disease controls with nodular goiter and 897 healthy controls were recruited. Four tagSNPs (rs12427141, rs712, rs7315339 and rs7960917) of the KRAS gene were genotyped by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF-MS). Statistical analyses and haplotype estimations were conducted using the Haploview and Unphased software. Between PTC patients and disease controls, significant differences were observed only in the genotypic frequencies of the rs7315339 polymorphism (χ² = 7.234, df = 2, p = 0.027). Statistically significant differences in allelic and genotypic frequencies for rs712 (Genotype, χ² = 8.258, p = 0.016) and rs12427141 (Allele, χ² = 3.992, p = 0.046; Genotype, χ² = 8.140, p = 0.017) were observed between PTC patients and controls. Haplotype analyses revealed higher frequencies of GA and TA haplotypes (p=0.039 and p=0.003, respectively) from rs712-rs12427141 (two-SNP) or TGA and TTG haplotype containing the alleles from rs7960917, rs712 and rs12427141, as well as the GAT haplotype containing the alleles from rs712, rs12427141 and rs7315339 in PTC patients than in healthy controls (p=0.042, p=0.037, p=0.027, respectively). Inversely, the haplotype TTA from rs7960917, rs712 and rs12427141 or the haplotype TAC from rs712, rs12427141 and rs7315339 was significantly less frequent in the PTC patients than in normal control (p=0.003, p=0.003, respectively). These findings suggest the role of these KRAS gene variants in susceptibility to PTC. Moreover, significant differences of the KRAS gene polymorphisms may

  14. Identification of a functional enhancer variant within the chronic pancreatitis-associated SPINK1 c.101A>G (p.Asn34Ser)-containing haplotype.

    PubMed

    Boulling, Arnaud; Masson, Emmanuelle; Zou, Wen-Bin; Paliwal, Sumit; Wu, Hao; Issarapu, Prachand; Bhaskar, Seema; Génin, Emmanuelle; Cooper, David N; Li, Zhao-Shen; Chandak, Giriraj R; Liao, Zhuan; Chen, Jian-Min; Férec, Claude

    2017-08-01

    The haplotype harboring the SPINK1 c.101A>G (p.Asn34Ser) variant (also known as rs17107315:T>C) represents the most important heritable risk factor for idiopathic chronic pancreatitis identified to date. The causal variant contained within this risk haplotype has however remained stubbornly elusive. Herein, we set out to resolve this enigma by employing a hypothesis-driven approach. First, we searched for variants in strong linkage disequilibrium (LD) with rs17107315:T>C using HaploReg v4.1. Second, we identified two candidate SNPs by visual inspection of sequences spanning all 25 SNPs found to be in LD with rs17107315:T>C, guided by prior knowledge of pancreas-specific transcription factors and their cognate binding sites. Third, employing a novel cis-regulatory module (CRM)-guided approach to further filter the two candidate SNPs yielded a solitary candidate causal variant. Finally, combining data from phylogenetic conservation and chromatin accessibility, cotransfection transactivation experiments, and population genetic studies, we suggest that rs142703147:C>A, which disrupts a PTF1L-binding site within an evolutionarily conserved HNF1A-PTF1L CRM located ∼4 kb upstream of the SPINK1 promoter, contributes to the aforementioned chronic pancreatitis risk haplotype. Further studies are required not only to improve the characterization of this functional SNP but also to identify other functional components that might contribute to this high-risk haplotype. © 2017 Wiley Periodicals, Inc.

  15. Glutamate decarboxylase genes and alcoholism in Han Taiwanese men.

    PubMed

    Loh, El-Wui; Lane, Hsien-Yuan; Chen, Chien-Hsiun; Chang, Pi-Shan; Ku, Li-Wen; Wang, Kathy H T; Cheng, Andrew T A

    2006-11-01

    Glutamate decarboxylase (GAD), the rate-limiting enzyme in the synthesis of gamma-aminobutyric acid (GABA), may be involved in the development of alcoholism. This study examined the possible roles of the genes that code for 2 forms of GAD (GAD1 and GAD2) in the development of alcoholism. An association study was conducted among 140 male alcoholic subjects meeting the DSM-III-R criteria for alcohol dependence and 146 controls recruited from the Han Taiwanese in community and clinical settings. Psychiatric assessment of drinking conditions was conducted using a Chinese version of the Schedules for Clinical Assessment in Neuropsychiatry. The SHEsis and Haploview programs were used in statistical analyses. Nine single-nucleotide polymorphisms (SNPs) at the GAD1 gene were valid for further statistics. Between alcoholic subjects and controls, significant differences were found in genotype distributions of SNP1 (p=0.000), SNP2 (p=0.015), SNP4 (p=0.015), SNP5 (p=0.031), SNP6 (p=0.012), and SNP8 (p=0.004) and in allele distributions of SNP1 (p=0.001), SNP2 (p=0.009), and SNP8 (p=0.009). Permutation tests of SNP1, SNP2, and SNP8 demonstrated significant differences in allele frequencies but not in 2 major haplotype blocks. Three valid SNPs at the GAD2 gene demonstrated no associations with alcoholism. Further permutation tests in the only 1 haplotype block or individual SNPs demonstrated no significant differences. This is the first report indicating a possible significant role of the GAD1 gene in the development of alcohol dependence and/or the course of alcohol withdrawal and outcome of alcoholism.

  16. A graphene-based platform for single nucleotide polymorphism (SNP) genotyping.

    PubMed

    Liu, Meng; Zhao, Huimin; Chen, Shuo; Yu, Hongtao; Zhang, Yaobin; Quan, Xie

    2011-06-15

    A facile, rapid, stable and sensitive approach for fluorescent detection of single nucleotide polymorphism (SNP) is designed based on DNA ligase reaction and π-stacking between the graphene and the nucleotide bases. In the presence of perfectly matched DNA, DNA ligase can catalyze the linkage of fluorescein amidite-labeled single-stranded DNA (ssDNA) and a phosphorylated ssDNA, and thus the formation of a stable duplex in high yield. However, the catalytic reaction cannot proceed effectively with a one-base mismatched DNA target. In this case, we add graphene to the system in order to produce different quenching signals due to its different adsorption affinity for ssDNA and double-stranded DNA. Taking advantage of the unique surface property of graphene and the high discriminability of DNA ligase, the proposed protocol exhibits good performance in SNP genotyping. The results indicate that it is possible to accurately determine SNP with frequency as low as 2.6% within 40 min. Furthermore, the presented flexible strategy facilitates the development of other biosensing applications in the future. Copyright © 2011 Elsevier B.V. All rights reserved.

  17. Detecting structure of haplotypes and local ancestry

    USDA-ARS's Scientific Manuscript database

    We present a two-layer hidden Markov model to detect the structure of haplotypes for unrelated individuals. This allows us to model two scales of linkage disequilibrium (one within a group of haplotypes and one between groups), thereby taking advantage of rich haplotype information to infer local an...

  18. Investigation of extended Y chromosome STR haplotypes in Sardinia.

    PubMed

    Lacerenza, D; Aneli, S; Di Gaetano, C; Critelli, R; Piazza, A; Matullo, G; Culigioni, C; Robledo, R; Robino, C; Calò, C

    2017-03-01

    Y-chromosomal variation of selected single nucleotide polymorphisms (SNPs) and 32 short tandem repeat (STR) loci was evaluated in Sardinia in three open population groups (Northern Sardinia, n=40; Central Sardinia, n=56; Southern Sardinia, n=91) and three isolates (Desulo, n=34; Benetutti, n=45; Carloforte, n=42). The tested Y-STRs consisted of Yfiler® Plus markers and the seven rapidly mutating (RM) loci not included in the Yfiler® Plus kit (DYF399S1, DYF403S1ab, DYF404S1, DYS526ab, DYS547, DYS612, and DYS626). As expected, inclusion of additional Y-STR loci increased haplotype diversity (h), though complete differentiation of male lineages was impossible even by means of RM Y-STRs (h=0.99997). Analysis of molecular variance indicated that the three open populations were fairly homogeneous, whereas signs of genetic heterogeneity could be detected when the three isolates were also included in the analysis. Multidimensional scaling analysis showed that, even for extended haplotypes including RM Y-STR markers, Sardinians were clearly differentiated from populations of the Italian peninsula and Sicily. The only exception was represented by the Carloforte sample that, in accordance with its peculiar population history, clustered with Northern/Central Italian populations. The introduction of extended forensic Y-STR panels, including highly variable RM Y-STR markers, is expected to reduce the impact of population structure on haplotype frequency estimations. However, our results show that the availability of geographically detailed reference databases is still important for the assessment of the evidential value of a Y-haplotype match. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
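
    Haplotype diversity (h) is commonly computed with Nei's unbiased estimator, h = n/(n-1) × (1 - Σ pᵢ²), where n is the number of sampled haplotypes and pᵢ the frequency of the i-th distinct haplotype. Whether the study above used exactly this estimator is an assumption. The sketch below uses hypothetical haplotype labels and shows why h reaches 1 only when every sampled individual carries a different haplotype.

```python
from collections import Counter


def haplotype_diversity(haplotypes):
    """Nei's unbiased haplotype diversity for a list of haplotype labels."""
    n = len(haplotypes)
    if n < 2:
        raise ValueError("need at least two sampled haplotypes")
    counts = Counter(haplotypes)
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1.0 - sum_sq)


# Hypothetical sample of 10 males; some share a haplotype.
sample = ["H1", "H1", "H1", "H2", "H2", "H3", "H4", "H5", "H6", "H7"]
h = haplotype_diversity(sample)

# If all 10 haplotypes were distinct, h would equal 1 (complete differentiation).
all_unique = haplotype_diversity([f"H{i}" for i in range(10)])
print(round(h, 4), round(all_unique, 4))
```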

  19. Association of polymorphisms and haplotypes in the cytochrome P450 1B1 gene with uterine leiomyoma: A case control study

    PubMed Central

    SALIMI, SAEEDEH; KHODAMIAN, MARYAM; NAROOIE-NEJAD, MEHRNAZ; HAJIZADEH, AZAM; FAZELI, KIMIA; NAMAZI, LIDA; YAGHMAEI, MINOO

    2015-01-01

    Uterine leiomyoma (UL) is an estrogen-dependent neoplasm of the uterus and estrogen metabolizing enzymes affect its promotion and progression. The aim of the present study was to evaluate the association between four single-nucleotide polymorphisms (SNPs) of the cytochrome P450 1B1 (CYP1B1) gene and UL risk. Four SNPs of the CYP1B1 gene in 105 UL patients and 112 unrelated healthy controls were genotyped using a direct sequencing method. Haplotype analyses were performed with UNPHASED software and linkage disequilibrium (LD) was assessed by Haploview software. There were no associations between Leu432Val (rs1056836), Asp449Asp (rs1056837) and Asn453Ser (rs1800440) polymorphisms of the CYP1B1 gene and UL. Although the genotypic frequencies of the Arg368His (rs79204362) polymorphism did not differ between the two groups, the frequency of A (His) allele was significantly higher in UL females (P=0.02). In addition, the frequency of GTAA haplotype was significantly higher in the controls and played a protective role in UL susceptibility. A strong LD between the three common SNPs (rs1056836, rs1056837 and rs1800440) in the CYP1B1 gene was observed in the population. In conclusion, a higher frequency of the CYP1B1 368His (A) allele was observed in UL females. The frequency of the GTAA haplotype was significantly higher in healthy females and this haplotype played a protective role in UL susceptibility. PMID:26075073

  20. Extended Islands of Tractability for Parsimony Haplotyping

    NASA Astrophysics Data System (ADS)

    Fleischer, Rudolf; Guo, Jiong; Niedermeier, Rolf; Uhlmann, Johannes; Wang, Yihui; Weller, Mathias; Wu, Xi

    Parsimony haplotyping is the problem of finding a smallest size set of haplotypes that can explain a given set of genotypes. The problem is NP-hard, and many heuristic and approximation algorithms as well as polynomial-time solvable special cases have been discovered. We propose improved fixed-parameter tractability results with respect to the parameter "size of the target haplotype set" k by presenting an O*(k^{4k})-time algorithm. This also applies to the practically important constrained case, where we can only use haplotypes from a given set. Furthermore, we show that the problem becomes polynomial-time solvable if the given set of genotypes is complete, i.e., contains all possible genotypes that can be explained by the set of haplotypes.
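
    To make the problem statement concrete, the following sketch solves tiny instances by exhaustive search. It is not the fixed-parameter algorithm from the paper, and its running time is exponential in the number of sites. The genotype encoding is an assumption for illustration: 0 and 1 mark homozygous sites, and 2 marks a heterozygous site.

```python
from itertools import combinations, combinations_with_replacement, product


def explains(h1, h2, genotype):
    """True if the haplotype pair (h1, h2) is consistent with the genotype."""
    for a, b, g in zip(h1, h2, genotype):
        if g == 2:
            if a == b:                  # heterozygous site needs two different alleles
                return False
        elif not (a == g and b == g):   # homozygous site needs both alleles equal to g
            return False
    return True


def resolved_by(haps, genotype):
    """True if some pair drawn from haps (a haplotype may pair with itself) explains the genotype."""
    return any(explains(h1, h2, genotype)
               for h1, h2 in combinations_with_replacement(haps, 2))


def parsimony_haplotyping(genotypes):
    """Return a smallest haplotype set explaining all genotypes (brute force)."""
    m = len(genotypes[0])
    candidates = list(product((0, 1), repeat=m))
    for k in range(1, len(candidates) + 1):
        for haps in combinations(candidates, k):
            if all(resolved_by(haps, g) for g in genotypes):
                return haps
    return ()


# Hypothetical instance with three genotypes over three sites.
genotypes = [(0, 2, 1), (2, 2, 1), (0, 1, 1)]
solution = parsimony_haplotyping(genotypes)
print(len(solution), solution)
```

    In this instance the homozygous genotype (0, 1, 1) forces haplotype 011 into the solution, (0, 2, 1) then requires 001, and (2, 2, 1) needs one additional haplotype, so three haplotypes are necessary and sufficient.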

  1. BAT2 and BAT3 polymorphisms as novel genetic risk factors for rejection after HLA-related SCT.

    PubMed

    Piras, Ignazio Stefano; Angius, Andrea; Andreani, Marco; Testi, Manuela; Lucarelli, Guido; Floris, Matteo; Marktel, Sarah; Ciceri, Fabio; La Nasa, Giorgio; Fleischhauer, Katharina; Roncarolo, Maria Grazia; Bulfone, Alessandro; Gregori, Silvia; Bacchetta, Rosa

    2014-11-01

    The genetic background of donor and recipient is an important factor determining the outcome of allogeneic hematopoietic SCT (allo-HSCT). We applied whole-genome analysis to investigate genetic variants (other than HLA class I and II) associated with negative outcome after HLA-identical sibling allo-HSCT in a cohort of 110 β-Thalassemic patients. We identified two single-nucleotide polymorphisms (SNPs) in BAT2 (A/G) and BAT3 (T/C) genes, SNP rs11538264 and SNP rs10484558, both located in the HLA class III region, in strong linkage disequilibrium between each other (R² = 0.92). When considered as single SNP, none of them reached a significant association with graft rejection (nominal P<0.00001 for BAT2 SNP rs11538264, and P<0.0001 for BAT3 SNP rs10484558), whereas the BAT2/BAT3 A/C haplotype was present at significantly higher frequency in patients who rejected as compared to those with functional graft (30.0% vs 2.6%, nominal P = 1.15 × 10⁻⁸; and adjusted P=0.0071). The BAT2/BAT3 polymorphisms and specifically the A/C haplotype may represent a novel immunogenetic factor associated with graft rejection in patients undergoing allo-HSCT.

  2. BAT2 and BAT3 polymorphisms as novel genetic risk factors for rejection after HLA-related stem cell transplantation

    PubMed Central

    Piras, Ignazio Stefano; Angius, Andrea; Andreani, Marco; Testi, Manuela; Lucarelli, Guido; Floris, Matteo; Marktel, Sarah; Ciceri, Fabio; La Nasa, Giorgio; Fleischhauer, Katharina; Roncarolo, Maria Grazia; Bulfone, Alessandro

    2014-01-01

    The genetic background of donor and recipient is an important factor determining the outcome of allogeneic hematopoietic stem cell transplantation (allo-HSCT). We applied a whole genome analysis to investigate genetic variants - other than HLA class I and II - associated with negative outcome after HLA-identical sibling allo-HSCT in a cohort of 110 β-Thalassemic patients. We identified two single nucleotide polymorphisms in BAT2 (A/G) and BAT3 (T/C) genes, SNP rs11538264 and SNP rs10484558, both located in the HLA class III region, in strong linkage disequilibrium between each other (R² = 0.92). When considered as single SNP, none of them reached a significant association with graft rejection (nominal P < 0.00001 for BAT2 SNP rs11538264, and P < 0.0001 for BAT3 SNP rs10484558), whereas the BAT2/BAT3 A/C haplotype was present at significantly higher frequency in patients who rejected as compared to those with functional graft (30.0% vs. 2.6%, nominal P = 1.15 × 10⁻⁸; and adjusted P = 0.0071). The BAT2/BAT3 polymorphisms and specifically the A/C haplotype may represent a novel immunogenetic factor associated with graft rejection in patients undergoing allo-HSCT. PMID:25111513

  3. Contribution of HLA-A/B/C/DRB1/DQB1 common haplotypes to donor search outcome in unrelated hematopoietic stem cell transplantation.

    PubMed

    Pédron, Béatrice; Guérin-El Khourouj, Valérie; Dalle, Jean-Hugues; Ouachée-Chardin, Marie; Yakouben, Karima; Corroyez, France; Auvrignon, Anne; Petit, Arnaud; Landman-Parker, Judith; Leverger, Guy; Baruchel, André; Sterkers, Ghislaine

    2011-11-01

    In unrelated hematopoietic stem cell transplantation (HSCT), the prediction of donor search outcome at the time of search initiation is of great value for the physicians to delineate the strategy of patient care. The probability of finding an unrelated donor is high for patients who carry at least 1 of the 10 most common HLA haplotypes in Caucasians. As only 10% to 20% patients respond to this criterion, here we aimed at finding additional common haplotypes to improve the prediction of a successful search. HLA broad HLA-A/B/DRB1 haplotypes that were observed with frequencies ≥0.19% in patient families of European origin and that split into ≤2 predominant 4-digit HLA-A/B/C/DRB1/DQB1 haplotypes were considered as common. Carriage of at least 1 of those in 168 patients of various geographic areas with no family donor was confronted to the chance of finding ≥9/10 HLA-matched unrelated donors. Fifty common 4-digit haplotypes were identified. A higher (P < 5 × 10⁻⁶) chance of finding a suitable donor was found for 55 of 170 (32%) recipients that carried at least 1 of these common haplotypes. Up to now, estimates classified patients into ≥3 groups of probability with ≥1 intermediate group of poor utility for the clinicians. Considering carriage of these common haplotypes together with the frequencies of alleles and of B/C and DRB1/DQB1 associations, which are carried by patient HLA haplotypes, we could classify the patients into 2 groups of probability with a 98% and 26% chance of finding a donor, respectively. Prediction of search outcome could be improved by including the 50 most common HLA haplotypes in the current approaches. Copyright © 2011 American Society for Blood and Marrow Transplantation. Published by Elsevier Inc. All rights reserved.

  4. SNPServer: a real-time SNP discovery tool.

    PubMed

    Savage, David; Batley, Jacqueline; Erwin, Tim; Logan, Erica; Love, Christopher G; Lim, Geraldine A C; Mongin, Emmanuel; Barker, Gary; Spangenberg, German C; Edwards, David

    2005-07-01

    SNPServer is a real-time flexible tool for the discovery of SNPs (single nucleotide polymorphisms) within DNA sequence data. The program uses BLAST, to identify related sequences, and CAP3, to cluster and align these sequences. The alignments are parsed to the SNP discovery software autoSNP, a program that detects SNPs and insertion/deletion polymorphisms (indels). Alternatively, lists of related sequences or pre-assembled sequences may be entered for SNP discovery. SNPServer and autoSNP use redundancy to differentiate between candidate SNPs and sequence errors. For each candidate SNP, two measures of confidence are calculated, the redundancy of the polymorphism at a SNP locus and the co-segregation of the candidate SNP with other SNPs in the alignment. SNPServer is available at http://hornbill.cspp.latrobe.edu.au/snpdiscovery.html.
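
    The redundancy criterion described above can be sketched as follows. This is an illustration of the idea only, not the autoSNP implementation: the alignment, the function name `candidate_snps` and the threshold `min_redundancy` are hypothetical, and the co-segregation measure is not implemented. A column is reported as a candidate SNP when at least two different bases are each supported by a minimum number of aligned sequences, so that a base seen only once is treated as a likely sequencing error.

```python
from collections import Counter


def candidate_snps(alignment, min_redundancy=2):
    """Return (position, {base: count}) for columns with >= 2 well-supported bases."""
    length = len(alignment[0])
    if any(len(row) != length for row in alignment):
        raise ValueError("all aligned sequences must have the same length")
    snps = []
    for pos in range(length):
        # Count only unambiguous bases; gaps ('-') and N are ignored.
        counts = Counter(row[pos] for row in alignment if row[pos] in "ACGT")
        supported = {b: c for b, c in counts.items() if c >= min_redundancy}
        if len(supported) >= 2:
            snps.append((pos, supported))
    return snps


# Hypothetical alignment of five related sequences.
alignment = [
    "ACGTTGCA",
    "ACGTTGCA",
    "ACATTGCA",
    "ACATTGCA",
    "ACGTTGCT",  # the lone T in the last column is not redundant
]
snps = candidate_snps(alignment)
print(snps)
```

    Only the third column (0-based position 2) is reported, because both G and A occur in at least two sequences there.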

  5. Combined genotype and haplotype distributions of MTHFR C677T and A1298C polymorphisms

    PubMed Central

    Fan, Shujun; Yang, Boyi; Zhi, Xueyuan; Wang, Yanxun; Zheng, Quanmei; Sun, Guifan

    2016-01-01

    Methylenetetrahydrofolate reductase (MTHFR) C677T and A1298C polymorphisms are, independently and/or in combination, associated with many disorders. However, data on the combined genotype and haplotype distributions of the 2 polymorphisms in the Chinese population are limited. We recruited 13,473 adult women from 9 Chinese provinces, collected buccal cell samples, and determined genotypes, to estimate the combined genotype and haplotype distributions of the MTHFR C677T and A1298C polymorphisms. In the total sample, the 6 common combined genotypes were CT/AA (29.5%), TT/AA (21.9%), CC/AA (15.4%), CC/AC (14.9%), CT/AC (13.7%), and CC/CC (3.4%); the 3 frequent haplotypes were 677T-1298A (43.6%), 677C-1298A (37.9%), and 677C-1298C (17.6%). Importantly, we observed that there were 51 (0.4%) individuals with the CT/CC genotype, 92 (0.7%) with the TT/AC genotype, 17 (0.1%) with the TT/CC genotype, and that the frequency of the 677T-1298C haplotype was 0.9%. In addition, the prevalence of some combined genotypes and haplotypes varied among populations residing in different areas and even showed apparent geographical gradients. Further linkage disequilibrium analysis showed that the D′ and r² values were 0.883 and 0.143, respectively. In summary, the findings of our study provide further strong evidence that the MTHFR C677T and A1298C polymorphisms are usually in trans and occasionally in cis configurations. The frequencies of mutant genotype combinations were higher in the Chinese population than in other populations, and showed geographical variations. These baseline data would be useful for future related studies and for developing health management programs. PMID:27902594
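
    The linkage disequilibrium measures quoted above follow from the four haplotype frequencies by the standard definitions of D, D′ and r². The sketch below applies those definitions to the rounded frequencies given in the abstract. Because the inputs are rounded, and because the authors' software may estimate the frequencies differently, the result is expected to be close to, but not identical with, the published 0.883 and 0.143. The function name `ld_from_haplotypes` is hypothetical.

```python
def ld_from_haplotypes(f_TA, f_CA, f_CC, f_TC):
    """Return (D, D', r^2) for two biallelic loci from haplotype frequencies.

    Arguments are the frequencies of 677T-1298A, 677C-1298A,
    677C-1298C and 677T-1298C, in that order.
    """
    total = f_TA + f_CA + f_CC + f_TC
    f_TA, f_CA, f_CC, f_TC = (f / total for f in (f_TA, f_CA, f_CC, f_TC))

    p_677T = f_TA + f_TC     # allele frequency of 677T
    p_1298C = f_CC + f_TC    # allele frequency of 1298C

    # D: observed minus expected frequency of the 677T-1298C haplotype.
    d = f_TC - p_677T * p_1298C

    # Normalise by the largest |D| possible given the allele frequencies.
    if d >= 0:
        d_max = min(p_677T * (1 - p_1298C), (1 - p_677T) * p_1298C)
    else:
        d_max = min(p_677T * p_1298C, (1 - p_677T) * (1 - p_1298C))
    d_prime = abs(d) / d_max if d_max > 0 else 0.0

    r2 = d * d / (p_677T * (1 - p_677T) * p_1298C * (1 - p_1298C))
    return d, d_prime, r2


# Rounded haplotype frequencies as reported in the abstract.
d, d_prime, r2 = ld_from_haplotypes(0.436, 0.379, 0.176, 0.009)
print(f"D={d:.4f} D'={d_prime:.3f} r2={r2:.3f}")  # roughly D' = 0.89, r2 = 0.14
```

    The negative D reflects that the two variant alleles (677T and 1298C) occur on the same chromosome far less often than expected under independence, which matches the abstract's conclusion that they are usually in trans.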

  6. X-chromosome as a marker for population history: linkage disequilibrium and haplotype study in Eurasian populations

    PubMed Central

    Laan, Maris; Wiebe, Victor; Khusnutdinova, Elza; Remm, Maido; Pääbo, Svante

    2005-01-01

    Linkage disequilibrium structure is still unpredictable because the interplay of regional recombination rate and demographic history is poorly understood. We have compared the distribution of LD across two genomic regions differing in crossing-over activity – Xq13 (0.166 cM/Mb) and Xp22 (1.3 cM/Mb) – in 15 Eurasian populations. Demographic events predicted to increase the LD level – genetic drift, bottleneck and admixture – had a very strong impact on extent and patterns of regional LD across Xq13 compared to Xp22. The haplotype distribution of the DXS1225-DXS8082 microsatellites from Xq13 exhibiting strong association in all populations was remarkably influenced by population history. European populations shared one common haplotype with a frequency of 25-40%. The Volga-Ural populations studied, living at the geographic borderline of Europe, showed elevated LD as well as harboring a significant fraction of haplotypes originating from East Asia, thus reflecting their past migrations and admixture. In the young Kuusamo isolate from Finland, a bottleneck has led to allelic associations between loci and shifted the haplotype distribution, but has much less affected single microsatellite allele frequencies compared to the main Finnish population. The data show that the footprint of a demographic event is longer preserved in haplotype distribution within a region of low crossing-over rate, than in the information content of a single marker, or between actively recombining markers. As the knowledge of LD patterns is often chosen to assist association mapping of common disease, our conclusions emphasise the importance of understanding the history, structure and variation of a study population. PMID:15657606

  7. Review: can diet influence the selective advantage of mitochondrial DNA haplotypes?

    PubMed

    Ballard, J William O; Youngson, Neil A

    2015-11-05

    This review explores the potential for changes in dietary macronutrients to differentially influence mitochondrial bioenergetics and thereby the frequency of mtDNA haplotypes in natural populations. Such dietary modification may be seasonal or result from biogeographic or demographic shifts. Mechanistically, mtDNA haplotypes may influence the activity of the electron transport system (ETS), retrograde signalling to the nuclear genome and affect epigenetic modifications. Thus, differential provisioning by macronutrients may lead to selection through changes in the levels of ATP production, modulation of metabolites (including AMP, reactive oxygen species (ROS) and the NAD(+)/NADH ratio) and potentially complex epigenetic effects. The exquisite complexity of dietary influence on haplotype frequency is further illustrated by the fact that macronutrients may differentially influence the selective advantage of specific mutations in different life-history stages. In Drosophila, complex I mutations may affect larval growth because dietary nutrients are fed through this complex in immaturity. In contrast, the majority of electrons are provided to complex III in adult flies. We conclude the review with a case study that considers specific interactions between diet and complex I of the ETS. Complex I is the first enzyme of the mitochondrial ETS and co-ordinates in the oxidation of NADH and transfer of electrons to ubiquinone. Although the supposition that mtDNA variants may be selected upon by dietary macronutrients could be intuitively consistent to some and counter intuitive to others, it must face a multitude of scientific hurdles before it can be recognized. © 2015 Authors.

  8. Significant association of full-thickness rotator cuff tears and estrogen-related receptor-β (ESRRB).

    PubMed

    Teerlink, Craig C; Cannon-Albright, Lisa A; Tashjian, Robert Z

    2015-02-01

    The precise etiology of rotator cuff disease is unknown, but prior evidence suggests a role for genetic factors. Variants of estrogen-related receptor-β (ESRRB) have been previously associated with rotator cuff disease. The purpose of the present study was to confirm the association between multiple candidate genes, including ESRRB, and rotator cuff disease in an independent set of patients with rotator cuff tear. The Illumina 5M (Illumina Inc, San Diego, CA, USA) single nucleotide polymorphism (SNP) platform was used to genotype 175 patients with rotator cuff tear. Genotypes were used to select a set of 2595 genetically matched Caucasian controls available from the Illumina iControls database. Tests of association were performed with Genome-wide Efficient Mixed Model Association (GEMMA) software at 69 SNPs that fell within 20 kb of 6 candidate genes (DEFB1, DENND2C, ESRRB, FGF3, FGF10, and FGFR1). Tests of association revealed 1 significantly associated SNP occurring in ESRRB (rs17583842; P = 4.4E-4). Another SNP within ESRRB (rs7157192) had a nominal P value of 7.8E-3. FastPHASE software estimated 2 frequent haplotypes among 54 individuals who carried both risk alleles at these 2 SNPs. The first haplotype had a frequency of 13.9% (n = 15) in risk-allele carriers and only 2.2% in controls (odds ratio, 6.9; 95% confidence interval, 3.9-2.2). The second haplotype had a frequency of 12.9% in risk-allele carriers and only 2.7% in controls (odds ratio, 5.3; 95% confidence interval, 3.0-9.5). The significant association and the presence of high-risk haplotypes identified in the ESRRB gene confirm the association of variants in ESRRB and rotator cuff disease. Copyright © 2015 Journal of Shoulder and Elbow Surgery Board of Trustees. All rights reserved.
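
    An odds ratio and an approximate 95% confidence interval can be computed from a 2×2 table of haplotype counts with the usual Wald (log-scale) method, sketched below. The abstract does not state which method the authors used, so this is an assumption. The carrier counts follow from the text (15 of 108 chromosomes in 54 individuals, i.e. 13.9%). The control counts are not reported and are assumed here to be about 2.2% of 5190 chromosomes (2 × 2595 controls), so the output is illustrative rather than a reproduction of the published estimate.

```python
import math


def odds_ratio_wald(a, b, c, d, z=1.96):
    """Odds ratio and Wald confidence interval for the 2x2 table [[a, b], [c, d]].

    a, b: haplotype present / absent in the first group
    c, d: haplotype present / absent in the second group
    """
    odds_ratio = (a * d) / (b * c)
    se_log = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log)
    upper = math.exp(math.log(odds_ratio) + z * se_log)
    return odds_ratio, lower, upper


# Risk-allele carriers: 15 of 108 chromosomes carry the first haplotype.
carriers_with, carriers_without = 15, 108 - 15
# Controls: ASSUMED counts (about 2.2% of 5190 chromosomes); not given in the abstract.
controls_with, controls_without = 114, 5190 - 114

or_, lo, hi = odds_ratio_wald(carriers_with, carriers_without,
                              controls_with, controls_without)
print(f"OR={or_:.2f}, 95% CI {lo:.2f}-{hi:.2f}")
```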

  9. Sequence variations of the human MPDZ gene and association with alcoholism in subjects with European ancestry.

    PubMed

    Karpyak, Victor M; Kim, Jeong-Hyun; Biernacka, Joanna M; Wieben, Eric D; Mrazek, David A; Black, John L; Choi, Doo-Sup

    2009-04-01

    Mpdz gene variations are known contributors to acute alcohol withdrawal severity and seizures in mice. To investigate the relevance of these findings for human alcoholism, we resequenced 46 exons, exon-intron boundaries, and 2 kilobases in the 5' region of the human MPDZ gene in 61 subjects with a history of alcohol withdrawal seizures (AWS), 59 subjects with a history of alcohol withdrawal without AWS, and 64 Coriell samples from self-reported nonalcoholic subjects [all European American (EA) ancestry] and compared them with the Mpdz sequences of 3 mouse strains with different propensity to AWS. To explore potential associations of the human MPDZ gene with alcoholism and AWS, single SNP and haplotype analyses were performed using 13 common variants. Sixty-seven new, mostly rare variants were discovered in the human MPDZ gene. Sequence comparison revealed that the human gene does not have variations identical to those comprising the Mpdz gene haplotype associated with AWS in mice. We also found no significant association between MPDZ haplotypes and AWS in humans. However, a global test of haplotype association revealed a significant difference in haplotype frequencies between alcohol-dependent subjects without AWS and Coriell controls (p = 0.015), suggesting a potential role of MPDZ in alcoholism and/or related phenotypes other than AWS. Haplotype-specific tests for the most common haplotypes (frequency > 0.05) revealed a specific high-risk haplotype (p = 0.006, maximum statistic p = 0.051) containing the rs13297480G allele, which was also found to be significantly more prevalent in alcoholics without AWS compared with nonalcoholic Coriell subjects (p = 0.019). Sequencing of the MPDZ gene in individuals with EA ancestry revealed no variations in the sites identical to those associated with AWS in mice. Exploratory haplotype and single SNP association analyses suggest a possible association between the MPDZ gene and alcohol dependence but not AWS. Further functional genomic analysis of MPDZ

  10. Insights into HLA-G Genetics Provided by Worldwide Haplotype Diversity

    PubMed Central

    Castelli, Erick C.; Ramalho, Jaqueline; Porto, Iane O. P.; Lima, Thálitta H. A.; Felício, Leandro P.; Sabbagh, Audrey; Donadi, Eduardo A.; Mendes-Junior, Celso T.

    2014-01-01

    Human leukocyte antigen G (HLA-G) belongs to the family of non-classical HLA class I genes, located within the major histocompatibility complex (MHC). HLA-G has been the target of most recent research regarding the function of class I non-classical genes. The main features that distinguish HLA-G from classical class I genes are (a) limited protein variability, (b) alternative splicing generating several membrane bound and soluble isoforms, (c) short cytoplasmic tail, (d) modulation of immune response (immune tolerance), and (e) restricted expression to certain tissues. In the present work, we describe the HLA-G gene structure and address the HLA-G variability and haplotype diversity among several populations around the world, considering each of its major segments [promoter, coding, and 3′ untranslated region (UTR)]. For this purpose, we developed a pipeline to reevaluate the 1000Genomes data and recover miscalled or missing genotypes and haplotypes. It became clear that the overall structure of the HLA-G molecule has been maintained during the evolutionary process and that most of the variation sites found in the HLA-G coding region are either coding synonymous or intronic mutations. In addition, only a few frequent and divergent extended haplotypes are found when the promoter, coding, and 3′UTRs are evaluated together. The divergence is particularly evident for the regulatory regions. The population comparisons confirmed that most of the HLA-G variability has originated before human dispersion from Africa and that the allele and haplotype frequencies have probably been shaped by strong selective pressures. PMID:25339953

  11. HLA-A, -B, -C, -DRB1 and -DQB1 allele and haplotype frequencies in the Serbian population.

    PubMed

    Andric, Zorana; Popadic, Dusan; Jovanovic, Barbara; Jaglicic, Ivana; Bojic, Svetlana; Simonovic, Ruzica

    2014-03-01

    This study provides the first published detailed analysis of polymorphisms at five loci, as well as reports of two-, three- and five-locus haplotype frequencies, in the Serbian population in a sample of 1992 volunteer bone marrow donors recruited from different parts of the country. Typing was performed by the PCR-SSO method combined with PCR-SSP techniques to resolve ambiguities. In total, 16 HLA-A, 28 HLA-B, 14 HLA-C, 13 HLA-DRB1 and 5 HLA-DQB1 allelic groups were identified. The most frequent allele groups are HLA-A*02 (29.5%), HLA-A*01 (14.2%), HLA-B*35 (13.1%), HLA-B*51 (12.8%), HLA-C*07 (24.8%), HLA-DRB1*11 (16.9%), HLA-DRB1*13 (13.2%), HLA-DQB1*03 (33.3%) and DQB1*05 (33.0%). The most frequent three- and five-locus haplotypes were A*01-B*08-DRB1*03 (5.9%) and A*02-B*18-DRB1*11 (1.9%), A*01-B*08-C*07-DRB1*03-DQB1*02 (6.6%) followed by A*02-B*18-C*07-DRB1*11-DQB1*03 (2.5%), then A*33-B*14-C*08-DRB1*01-DQB1*05 and A*02-B*35-C*04-DRB1*16-DQB1*05 (2.2% both), respectively. The results of cluster analysis showed that the Serbian population is closely related to the populations living in central Balkan and neighboring European regions. The level of allelic diversity found in this study is relevant to facilitating searches for unrelated matched donors and provides a healthy control population from our region that should be useful in future disease association studies. Copyright © 2013 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.

  12. Construction and forensic genetic characterization of 11 autosomal haplotypes consisting of 22 tri-allelic indels.

    PubMed

    Zhao, Xiaohong; Chen, Xiaogang; Zhao, Yuancun; Zhang, Shu; Gao, Zehua; Yang, Yiwen; Wang, Yufang; Zhang, Ji

    2018-05-01

    Insertion/deletion polymorphisms (indels), which combine the advantages of both short tandem repeats and single-nucleotide polymorphisms, are suitable for parentage testing. To overcome the limitations of the low polymorphism of di-allelic indels, we constructed a set of haplotypes with physically linked, multi-allelic indels. Candidate haplotypes were selected from the 1000 Genomes Project database, and were subject to the following criteria for inclusion: (i) each marker must have a minor allele frequency (MAF) of ≥0.1 in the Han population of China; (ii) markers must exist in a non-coding region; (iii) the physical distance between a pair of candidate indels must be <500 bp; (iv) the allele length variation of each indel must range from 1 to 20 bp; (v) different haplotypes must be located on different chromosomes or chromosomal arms, or be more than 10 Mb apart if on the same chromosomal arm; and (vi) they must not be located across a recombination hotspot. A multiplex system with 11 haplotype markers, comprising 22 tri-allelic indel loci distributed over 10 chromosomes, was developed. To validate the multiplex panel, we investigated the haplotype distribution in sets of two- and three-generation pedigrees. The results demonstrated that the haplotypes consisting of multi-allelic indel markers exhibited higher polymorphism than a single indel locus, and thus provide supplementary information for forensic kinship identification. Copyright © 2018 Elsevier B.V. All rights reserved.
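
    The per-marker and per-pair inclusion criteria listed in this abstract, (i) to (iv), can be read as a simple filter. The following is a minimal sketch under stated assumptions: the `Indel` record, its field names and the example values are all hypothetical and are not taken from the study or from any 1000 Genomes tooling. Criteria (v) and (vi), which concern spacing between haplotypes and recombination hotspots, are not modeled here.

```python
from dataclasses import dataclass


@dataclass
class Indel:
    """Hypothetical record for one candidate indel (all fields are assumptions)."""
    name: str
    chromosome: str
    position_bp: int          # physical position on the chromosome
    maf_han: float            # allele frequency (MAF) in the Han population
    non_coding: bool          # True if the marker lies in a non-coding region
    length_variation_bp: int  # allele length variation in base pairs


def passes_marker_criteria(indel: Indel) -> bool:
    """Criteria (i), (ii) and (iv): MAF >= 0.1, non-coding, 1-20 bp variation."""
    return (
        indel.maf_han >= 0.1
        and indel.non_coding
        and 1 <= indel.length_variation_bp <= 20
    )


def passes_pair_criteria(first: Indel, second: Indel) -> bool:
    """Criterion (iii): the two indels of a haplotype lie < 500 bp apart.

    Physical linkage is taken here to imply the same chromosome.
    """
    same_chromosome = first.chromosome == second.chromosome
    close_enough = abs(first.position_bp - second.position_bp) < 500
    return (
        same_chromosome
        and close_enough
        and passes_marker_criteria(first)
        and passes_marker_criteria(second)
    )


# Hypothetical candidates, for illustration only.
indel_a = Indel("indel_a", "chr1", 1_000, 0.20, True, 3)
indel_b = Indel("indel_b", "chr1", 1_300, 0.15, True, 5)
print(passes_pair_criteria(indel_a, indel_b))
```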

  13. HLA-A and -B alleles and haplotypes in hemochromatosis probands with HFE C282Y homozygosity in central Alabama.

    PubMed

    Barton, James C; Acton, Ronald T

    2002-10-07

    We wanted to quantify HLA-A and -B allele and haplotype frequencies in Alabama hemochromatosis probands with HFE C282Y homozygosity and controls, and to compare results to those in other populations. Alleles were detected using DNA-based typing (probands) and microlymphocytotoxicity (controls). Alleles were determined in 139 probands (1,321 controls) and haplotypes in 118 probands (605 controls). In probands, A*03 positivity was 0.7482 (0.2739 controls; p ≤ 0.0001; odds ratio (OR) 7.9); positivity for B*07, B*14, and B*56 was also increased. In probands, haplotypes A*03-B*07 and A*03-B*14 were more frequent (p < 0.0001 for each; OR = 12.3 and 11.1, respectively). The haplotypes A*01-B*60, A*02-B*39, A*02-B*62, A*03-B*13, A*03-B*15, A*03-B*27, A*03-B*35, A*03-B*44, A*03-B*47, and A*03-B*57 were also significantly more frequent in probands. 37.3% of probands were HLA-haploidentical with other proband(s). A*03 and A*03-B*07 frequencies are increased in Alabama probands, as in other hemochromatosis cohorts. Increased absolute frequencies of A*03-B*35 have been reported only in the present Alabama probands and in hemochromatosis patients in Italy. Increased absolute frequencies of A*01-B*60, A*02-B*39, A*02-B*62, A*03-B*13, A*03-B*15, A*03-B*27, A*03-B*44, A*03-B*47, and A*03-B*57 in hemochromatosis cohorts have not been reported previously.

  14. Casein SNP in Norwegian goats: additive and dominance effects on milk composition and quality

    PubMed Central

    2011-01-01

    Background The four casein proteins in goat milk are encoded by four closely linked casein loci (CSN1S1, CSN2, CSN1S2 and CSN3) within 250 kb on caprine chromosome 6. A deletion in exon 12 of CSN1S1, so far reported only in Norwegian goats, has been found at high frequency (0.73). Such a high frequency is difficult to explain because the national breeding goal selects against the variant's effect. Methods In this study, 575 goats were genotyped for 38 Single Nucleotide Polymorphisms (SNP) located within the four casein genes. Milk production records of these goats were obtained from the Norwegian Dairy Goat Control. Test-day mixed models with additive and dominance fixed effects of single SNP were fitted in a model including polygenic effects. Results Significant additive effects of single SNP within CSN1S1 and CSN3 were found for fat % and protein %, milk yield and milk taste. The allele with the deletion showed additive and dominance effects on protein % and fat %, and overdominance effects on milk quantity (kg) and lactose %. At its current frequency, the observed dominance (overdominance) effects of the deletion allele reduced its substitution effect (and additive genetic variance available for selection) in the population substantially. Conclusions The selection pressure of conventional breeding on the allele with the deletion is limited due to the observed dominance (overdominance) effects. Inclusion of molecular information in the national breeding scheme will reduce the frequency of this deletion in the population. PMID:21864407

  15. Patterns of haplotypes for 92 cystic fibrosis mutations: Variability, association and recurrence

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Morral, N.; Llevadot, R.; Estivill, X.

    1994-09-01

    Most CFTR mutations are very uncommon among the cystic fibrosis population, with frequencies of less than 1%, and many are found only in specific areas. We have analyzed 92 CF mutations for several markers (4 microsatellites and 3 other polymorphisms) scattered in the CFTR gene. Haplotypes associated with these mutations can be used as a framework in the screening of chromosomes carrying unknown mutations. The association between mutation and haplotype reduces the number of mutations it is necessary to search for to a maximum of 16 for the same haplotype. Only mutations ΔF508, G542X and N1303K are associated with more than one haplotype as a result of slippage at more than one microsatellite locus, suggesting that these three are the most ancient CF mutations. Recurrence has been found for at least 7 mutations: H199Y, R347P, L558S, R553X, 2184insA, 3272-26A→G, 3849+10kbC→T and R1162X. Also, microsatellite analysis of chromosomes of several ethnic origins (Czech, Italian, Russian, Slovak and Spanish) suggested the possibility of three or more independent origins for mutations R334W, R347P, R1162X, and 3849+10kbC→T, which was confirmed by analysis of markers flanking these mutations.

  16. Evaluation of a SNP map of 6q24-27 confirms diabetic nephropathy loci and identifies novel associations in type 2 diabetes patients enriched with nephropathy from an African American population

    PubMed Central

    Leak, Tennille S.; Mychaleckyj, Josyf C.; Smith, Shelly G.; Keene, Keith L.; Gordon, Candace J.; Hicks, Pamela J.; Freedman, Barry I.; Bowden, Donald W.; Sale, Michèle M.

    2009-01-01

    Previously we performed a genome scan for type 2 diabetes (T2DM) using 638 African-American (AA) affected sibling pairs from 247 families; non-parametric linkage analysis suggested evidence of linkage at 6q24-27 (LOD 2.26). To comprehensively evaluate this region we performed a 2-stage association study by first constructing a SNP map of 754 SNPs selected from HapMap on the basis of linkage disequilibrium (LD) in 300 AA T2DM-ESRD subjects, 311 AA controls, 43 European American controls and 45 Yoruba Nigerian samples (Set 1). Replication analyses were conducted in an independent population of 283 AA T2DM-ESRD subjects and 282 AA controls (Set 2). In addition, we adjusted for the impact of admixture on association results by using ancestry informative markers (AIMs). In Stage 1, 137 (18.2%) SNPs showed nominal evidence of association (P<0.05) in one or more tests of association: allelic (n=33), dominant (n=36), additive (n=29), or recessive (n=34) genotypic models, and 2- (n=47) and 3-SNP (n=43) haplotypic analyses. These SNPs were selected for follow-up genotyping. Stage 2 analyses confirmed association with a predicted 2-SNP “risk” haplotype in the PARK2 gene. Also, two intergenic SNPs showed consistent genotypic association with T2DM-ESRD: rs12197043 and rs4897081. Combined analysis of all subjects from both stages revealed nominal associations with 17 SNPs within genes, including suggestive associations in ESR1 and PARK2. This study confirms known diabetic nephropathy loci and identifies potentially novel susceptibility variants located within 6q24-27 in AA. PMID:18560894

  17. Vitis Phylogenomics: Hybridization Intensities from a SNP Array Outperform Genotype Calls

    PubMed Central

    Miller, Allison J.; Matasci, Naim; Schwaninger, Heidi; Aradhya, Mallikarjuna K.; Prins, Bernard; Zhong, Gan-Yuan; Simon, Charles; Buckler, Edward S.; Myles, Sean

    2013-01-01

    Understanding relationships among species is a fundamental goal of evolutionary biology. Single nucleotide polymorphisms (SNPs) identified through next generation sequencing and related technologies enable phylogeny reconstruction by providing unprecedented numbers of characters for analysis. One approach to SNP-based phylogeny reconstruction is to identify SNPs in a subset of individuals, and then to compile SNPs on an array that can be used to genotype additional samples at hundreds or thousands of sites simultaneously. Although powerful and efficient, this method is subject to ascertainment bias because applying variation discovered in a representative subset to a larger sample favors identification of SNPs with high minor allele frequencies and introduces bias against rare alleles. Here, we demonstrate that the use of hybridization intensity data, rather than genotype calls, reduces the effects of ascertainment bias. Whereas traditional SNP calls assess known variants based on diversity housed in the discovery panel, hybridization intensity data survey variation in the broader sample pool, regardless of whether those variants are present in the initial SNP discovery process. We apply SNP genotype and hybridization intensity data derived from the Vitis9kSNP array developed for grape to show the effects of ascertainment bias and to reconstruct evolutionary relationships among Vitis species. We demonstrate that phylogenies constructed using hybridization intensities suffer less from the distorting effects of ascertainment bias, and are thus more accurate than phylogenies based on genotype calls. Moreover, we reconstruct the phylogeny of the genus Vitis using hybridization data, show that North American subgenus Vitis species are monophyletic, and resolve several previously poorly known relationships among North American species. This study builds on earlier work that applied the Vitis9kSNP array to evolutionary questions within Vitis vinifera and has general

  18. Accuracy of direct genomic values in Holstein bulls and cows using subsets of SNP markers

    PubMed Central

    2010-01-01

    Background At the current price, the use of high-density single nucleotide polymorphisms (SNP) genotyping assays in genomic selection of dairy cattle is limited to applications involving elite sires and dams. The objective of this study was to evaluate the use of low-density assays to predict direct genomic value (DGV) on five milk production traits, an overall conformation trait, a survival index, and two profit index traits (APR, ASI). Methods Dense SNP genotypes were available for 42,576 SNP for 2,114 Holstein bulls and 510 cows. A subset of 1,847 bulls born between 1955 and 2004 was used as a training set to fit models with various sets of pre-selected SNP. A group of 297 bulls born between 2001 and 2004 and all cows born between 1992 and 2004 were used to evaluate the accuracy of DGV prediction. Ridge regression (RR) and partial least squares regression (PLSR) were used to derive prediction equations and to rank SNP based on the absolute value of the regression coefficients. Four alternative strategies were applied to select subset of SNP, namely: subsets of the highest ranked SNP for each individual trait, or a single subset of evenly spaced SNP, where SNP were selected based on their rank for ASI, APR or minor allele frequency within intervals of approximately equal length. Results RR and PLSR performed very similarly to predict DGV, with PLSR performing better for low-density assays and RR for higher-density SNP sets. When using all SNP, DGV predictions for production traits, which have a higher heritability, were more accurate (0.52-0.64) than for survival (0.19-0.20), which has a low heritability. The gain in accuracy using subsets that included the highest ranked SNP for each trait was marginal (5-6%) over a common set of evenly spaced SNP when at least 3,000 SNP were used. Subsets containing 3,000 SNP provided more than 90% of the accuracy that could be achieved with a high-density assay for cows, and 80% of the high-density assay for young bulls

  19. Influence of promoter/enhancer region haplotypes on MGMT transcriptional regulation: a potential biomarker for human sensitivity to alkylating agents.

    PubMed

    Xu, Meixiang; Nekhayeva, Ilona; Cross, Courtney E; Rondelli, Catherine M; Wickliffe, Jeffrey K; Abdel-Rahman, Sherif Z

    2014-03-01

    The O6-methylguanine-DNA methyltransferase gene (MGMT) encodes the direct reversal DNA repair protein that removes alkyl adducts from the O6 position of guanine. Several single-nucleotide polymorphisms (SNPs) exist in the MGMT promoter/enhancer (P/E) region. However, the haplotype structure encompassing these SNPs and their functional/biological significance are currently unknown. We hypothesized that MGMT P/E haplotypes, rather than individual SNPs, alter MGMT transcription and can thus alter human sensitivity to alkylating agents. To identify the haplotype structure encompassing the MGMT P/E region SNPs, we sequenced 104 DNA samples from healthy individuals and inferred the haplotypes using the data generated. We identified eight SNPs in this region, namely T7C (rs180989103), T135G (rs1711646), G290A (rs61859810), C485A (rs1625649), C575A (rs113813075), G666A (rs34180180), C777A (rs34138162) and C1099T (rs16906252). Phylogenetics and Sequence Evolution analysis predicted 21 potential haplotypes that encompass these SNPs ranging in frequencies from 0.000048 to 0.39. Of these, 10 were identified in our study population as 20 paired haplotype combinations. To determine the functional significance of these haplotypes, luciferase reporter constructs representing these haplotypes were transfected into glioblastoma cells and their effect on MGMT promoter activity was determined. Compared with the most common (reference) haplotype 1, seven haplotypes significantly upregulated MGMT promoter activity (18-119% increase; P < 0.05), six significantly downregulated MGMT promoter activity (29-97% decrease; P < 0.05) and one haplotype had no effect. Mechanistic studies conducted support the conclusion that MGMT P/E haplotypes, rather than individual SNPs, differentially regulate MGMT transcription and could thus play a significant role in human sensitivity to environmental and therapeutic alkylating agents.

  20. IL-10 -1082 SNP and IL-10 in primary CNS and vitreoretinal lymphomas.

    PubMed

    Ramkumar, Hema L; Shen, De Fen; Tuo, Jingsheng; Braziel, Rita M; Coupland, Sarah E; Smith, Justine R; Chan, Chi-Chao

    2012-10-01

    Most primary central nervous system lymphomas (PCNSLs) and primary vitreoretinal lymphomas (PVRLs) are B-cell lymphomas that produce high levels of interleukin (IL)-10, which is linked to rapid disease progression. The IL-10 (-1082) G → A polymorphism (IL-10 SNP) is associated with improved survival in certain non-CNS lymphoma patients. PDCD4 is a tumor suppressor gene and upstream regulator of IL-10. This study examined the correlation between the IL-10 SNP, PDCD4 mRNA expression, and IL-10 expression (at transcript and protein levels) in these lymphoma cells. Single-nucleotide polymorphism (SNP)-typing at IL-10 (-1082) was performed after microdissecting cytospun PVRL cells from 26 specimens. Vitreal IL-10 and IL-6 levels were measured by ELISA. PCNSL cells from 52 paraffin-embedded sections were microdissected and SNP typed on genomic DNA. RT-PCR was performed to analyze expression of IL-10 and PDCD4 mRNA. IL-10 (-1082) SNP typing was performed on blood samples of 96 healthy controls. We measured IL-10 (-1082) SNP expression in 26 PVRLs and 52 PCNSLs and examined its relationship with IL-10 protein and gene expression, respectively. More PVRL patients expressed one copy of the IL-10 ( -1082 )  G → A SNP with the GA genotype compared to controls. The frequencies of the three genotypes (AA, AG, GG) significantly differed in PVRL versus controls and in PCNSL versus controls. In PVRLs, the vitreal IL-10/IL-6 ratio was higher in IL-10 (-1082) AG and IL-10 (-1082) AA patients, compared to IL-10 (-1082) GG patients. IL-10 mRNA expression was higher in IL-10 (-1082) AG and IL-10 (-1082) AA PCNSLs, compared to IL-10 (-1082) GG PCNSLs. No correlation was found between IL-10 and PDCD4 expression levels in 37 PCNSL samples. PVRL and PCNSL patients had similar IL-10 (-1082) A allele frequencies, but genotype distributions differed from healthy controls. The findings suggest that the IL-10 (-1082) A allele is a risk factor for higher IL-10 levels in PVRLs and

  1. IL-10 -1082 SNP and IL-10 in primary CNS and vitreoretinal lymphomas

    PubMed Central

    Ramkumar, Hema L.; Shen, De Fen; Tuo, Jingsheng; Braziel, Rita M.; Coupland, Sarah E.; Smith, Justine R.

    2012-01-01

    Objectives Most primary central nervous system lymphomas (PCNSLs) and primary vitreoretinal lymphomas (PVRLs) are B-cell lymphomas that produce high levels of interleukin (IL)-10, which is linked to rapid disease progression. The IL-10-1082G→A polymorphism (IL-10 SNP) is associated with improved survival in certain non-CNS lymphoma patients. PDCD4 is a tumor suppressor gene and upstream regulator of IL-10. This study examined the correlation between the IL-10 SNP, PDCD4 mRNA expression, and IL-10 expression (at transcript and protein levels) in these lymphoma cells. Materials and methods Single-nucleotide polymorphism (SNP)-typing at IL-10-1082 was performed after micro-dissecting cytospun PVRL cells from 26 specimens. Vitreal IL-10 and IL-6 levels were measured by ELISA. PCNSL cells from 52 paraffin-embedded sections were microdissected and SNP typed on genomic DNA. RT-PCR was performed to analyze expression of IL-10 and PDCD4 mRNA. IL-10-1082 SNP typing was performed on blood samples of 96 healthy controls. We measured IL-10-1082 SNP expression in 26 PVRLs and 52 PCNSLs and examined its relationship with IL-10 protein and gene expression, respectively. Results More PVRL patients expressed one copy of the IL-10-1082G→A SNP with the GA genotype compared to controls. The frequencies of the three genotypes (AA, AG, GG) significantly differed in PVRL versus controls and in PCNSL versus controls. In PVRLs, the vitreal IL-10/IL-6 ratio was higher in IL-10-1082 AG and IL-10-1082 AA patients, compared to IL-10-1082 GG patients. IL-10 mRNA expression was higher in IL-10-1082 AG and IL-10-1082 AA PCNSLs, compared to IL-10-1082 GG PCNSLs. No correlation was found between IL-10 and PDCD4 expression levels in 37 PCNSL samples. Conclusions PVRL and PCNSL patients had similar IL-10-1082 A allele frequencies, but genotype distributions differed from healthy controls. The findings suggest that the IL-10-1082 A allele is a risk factor for higher IL-10 levels in PVRLs and PCNSLs

  2. Association of methionine synthase gene polymorphisms with wool production and quality traits in Chinese Merino population.

    PubMed

    Rong, E G; Yang, H; Zhang, Z W; Wang, Z P; Yan, X H; Li, H; Wang, N

    2015-10-01

    Methionine synthase (MTR) plays a crucial role in maintaining homeostasis of intracellular methionine, folate, and homocysteine, and its activity correlates with DNA methylation in many mammalian tissues. Our previous genomewide association study identified that 1 SNP located in the MTR gene was associated with several wool production and quality traits in Chinese Merino. To confirm the potential involvement of the MTR gene in sheep wool production and quality traits, we performed sheep tissue expression profiling, SNP detection, and association analysis with sheep wool production and quality traits. The semiquantitative reverse transcription PCR analysis showed that the MTR gene was differentially expressed in skin from Merino and Kazak sheep. The sequencing analysis identified a total of 13 SNP in the MTR gene from Chinese Merino sheep. Comparison of the allele frequencies revealed that these 13 identified SNP were significantly different among the 6 tested Chinese Merino strains (P < 0.001). Linkage disequilibrium analysis showed that SNP 3 to 11 were strongly linked in a single haplotype block in the tested population. Association analysis showed that SNP 2 to 11 were significantly associated with the average wool fiber diameter and the fineness SD and that SNP 4 to 11 were significantly associated with the CV of fiber diameter trait (P < 0.05). Single nucleotide polymorphism 2 and SNP 5 to 12 were weakly associated with wool crimp. Similarly, the haplotypes derived from these 13 identified SNP were also significantly associated with the average wool fiber diameter, fineness SD, and the CV of fiber diameter (P < 0.05). Our results suggest that MTR is a candidate gene for sheep wool production and quality traits, and the identified SNP might be used in sheep breeding.

  3. Addiction Genetics and Pleiotropic Effects of Common Haplotypes that Make Polygenic Contributions to Vulnerability to Substance Dependence

    PubMed Central

    Uhl, George R.; Drgon, Tomas; Johnson, Catherine; Liu, Qing-Rong

    2016-01-01

    Abundant evidence from family, adoption, and twin studies points to large genetic contributions to individual differences in vulnerability to develop dependence on one or more addictive substances. Twin data suggest that most of this genetic vulnerability is shared by individuals who are dependent on a variety of addictive substances. Molecular genetic studies, especially genomewide and candidate gene association studies, have elucidated common haplotypes in dozens of genes that appear to make polygenic contributions to vulnerability to developing dependence. Most genes that harbor currently identified addiction-associated haplotypes are expressed in the brain. Haplotypes in many of the same genes are identified in genomewide association studies that compare allele frequencies in substance dependent vs. control individuals from European, African, and Asian racial/ethnic backgrounds. Many of these addiction-associated haplotypes display pleiotropic influences on a variety of related brain-based phenotypes that display 1) substantial heritability and 2) clinical co-occurrence with substance dependence. PMID:19152208

  4. Significant association between IL10-1082/-819 and TNF-308 haplotypes and the susceptibility to cervical carcinogenesis in women infected by Human papillomavirus.

    PubMed

    Chagas, Bárbara Simas; Lima, Rita de Cássia Pereira de; Paiva Júnior, Sérgio de Sá Leitão; Silva, Ruany Cristyne de Oliveira; Cordeiro, Marcelo Nazário; Silva Neto, Jacinto da Costa; Batista, Marcus Vinicius de Aragão; Silva, Anna Jéssica Duarte; Gurgel, Ana Pavla Almeida Diniz; Freitas, Antonio Carlos de

    2018-06-20

    Human papillomavirus (HPV) is responsible for high-grade cervical lesions and cervical cancer. Inflammation plays a key role in cervical cancer progression. In this context, studies propose an association between TNFα and IL10 SNPs and susceptibility to HPV infection. The present work aimed to investigate the possible association between IL10 and TNFα promoter polymorphisms and HPV infection in the cervical carcinogenesis risk in women from Brazil. A total of 654 samples was evaluated in this study. HPV detection was performed by PCR and HPV genotyping was performed by PCR and sequencing of positive MY09/11 PCR product. Genotyping of IL10 SNPs (rs1800871 and rs1800896) was performed by High Resolution Melt analysis. Genotyping of TNFα SNP (rs1800629) was performed by fluorogenic allele-specific probes. The distribution of TNF-308 (rs1800629) allelic (p = 0.03) and genotype (p = 0.03) frequencies and HPV-58 infection showed a statistically significant difference between case and control groups for the assessed TNFα polymorphism. For the TNFα (rs1800629) allelic and genotypic distribution and HPV-18 and HPV-31 infections, no statistically significant differences between case and control groups were observed for the studied TNFα polymorphism. The allelic and genotypic distribution of IL10-819 (rs1800871) and IL10-1082 (rs1800896) and HPV infection (HPVs 58, 18 and 31) showed no statistically significant differences between case and control groups for the assessed IL10 polymorphisms. Furthermore, it was observed that haplotypes were associated with an increased cervical cancer risk in HPVs 16, 18 and 58-positive women. It was observed that women carrying the GTA and ATG haplotypes had 3.85 and 17.99-fold, respectively, increased cervical cancer susceptibility when infected by HPV-58. In women infected with HPV-16 and HPV-18, statistically significant results in women carrying the GTA and ATA haplotypes were observed. They had a 2.32 and 3

  5. Founder haplotype analysis of Fanconi anemia in the Korean population finds common ancestral haplotypes for a FANCG variant.

    PubMed

    Park, Joonhong; Kim, Myungshin; Jang, Woori; Chae, Hyojin; Kim, Yonggoo; Chung, Nack-Gyun; Lee, Jae-Wook; Cho, Bin; Jeong, Dae-Chul; Park, In Yang; Park, Mi Sun

    2015-05-01

    A common ancestral haplotype is strongly suggested in the Korean and Japanese patients with Fanconi anemia (FA), because common mutations have been frequently found: c.2546delC and c.3720_3724delAAACA of FANCA; c.307+1G>C, c.1066C>T, and c.1589_1591delATA of FANCG. Our aim in this study was to investigate the origin of these common mutations of FANCA and FANCG. We genotyped 13 FA patients consisting of five FA-A patients and eight FA-G patients from the Korean FA population. Microsatellite markers used for haplotype analysis included four CA repeat markers which are closely linked with FANCA and eight CA repeat markers which are contiguous with FANCG. As a result, Korean FA-A patients carrying c.2546delC or c.3720_3724delAAACA did not share the same haplotypes. However, three unique haplotypes carrying c.307+1G>C, c.1066C > T, or c.1589_1591delATA, that consisted of eight polymorphic loci covering a flanking region were strongly associated with Korean FA-G, consistent with founder haplotypes reported previously in the Japanese FA-G population. Our finding confirmed the common ancestral haplotypes on the origins of the East Asian FA-G patients, which will improve our understanding of the molecular population genetics of FA-G. To the best of our knowledge, this is the first report on the association between disease-linked mutations and common ancestral haplotypes in the Korean FA population. © 2015 John Wiley & Sons Ltd/University College London.

  6. Combination Testing Using a Single MSH5 Variant alongside HLA Haplotypes Improves the Sensitivity of Predicting Coeliac Disease Risk in the Polish Population.

    PubMed

    Paziewska, Agnieszka; Cukrowska, Bozena; Dabrowska, Michalina; Goryca, Krzysztof; Piatkowska, Magdalena; Kluska, Anna; Mikula, Michal; Karczmarski, Jakub; Oralewska, Beata; Rybak, Anna; Socha, Jerzy; Balabas, Aneta; Zeber-Lubecka, Natalia; Ambrozkiewicz, Filip; Konopka, Ewa; Trojanowska, Ilona; Zagroba, Malgorzata; Szperl, Malgorzata; Ostrowski, Jerzy

    2015-01-01

    Assessment of non-HLA variants alongside standard HLA testing was previously shown to improve the identification of potential coeliac disease (CD) patients. We intended to identify new genetic variants associated with CD in the Polish population that would improve CD risk prediction when used alongside HLA haplotype analysis. DNA samples of 336 CD patients and 264 unrelated healthy controls were used to create DNA pools for a genome wide association study (GWAS). GWAS findings were validated with individual HLA tag single nucleotide polymorphism (SNP) typing of 473 patients and 714 healthy controls. Association analysis using four HLA-tagging SNPs showed that, as was found in other populations, positive predicting genotypes (HLA-DQ2.5/DQ2.5, HLA-DQ2.5/DQ2.2, and HLA-DQ2.5/DQ8) were found at higher frequencies in CD patients than in healthy control individuals in the Polish population. Both CD-associated SNPs discovered by GWAS were found in the CD susceptibility region, confirming the previously-determined association of the major histocompatibility (MHC) region with CD pathogenesis. The two most significant SNPs from the GWAS were rs9272346 (HLA-dependent; localized within 1 kb of DQA1) and rs3130484 (HLA-independent; mapped to MSH5). Specificity of CD prediction using the four HLA-tagging SNPs reached 92.9%, but sensitivity was only 45.5%. However, when a testing combination of the HLA-tagging SNPs and the MSH5 SNP was used, specificity decreased to 80%, and sensitivity increased to 74%. This study confirmed that improvement of CD risk prediction sensitivity could be achieved by including non-HLA SNPs alongside HLA SNPs in genetic testing.
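
    The trade-off reported above (sensitivity rises and specificity falls when a second marker is added) can be illustrated with a minimal sketch. It assumes the combined test is called positive when either marker is positive; the abstract does not state the actual combination rule, and the toy cohort, variable names and function name below are hypothetical.

```python
def sensitivity_specificity(predicted, actual):
    """Return (sensitivity, specificity) for paired lists of booleans."""
    pairs = list(zip(predicted, actual))
    tp = sum(p and a for p, a in pairs)              # true positives
    fn = sum((not p) and a for p, a in pairs)        # false negatives
    tn = sum((not p) and (not a) for p, a in pairs)  # true negatives
    fp = sum(p and (not a) for p, a in pairs)        # false positives
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical toy cohort: 4 patients followed by 4 controls (not study data).
has_cd = [True, True, True, True, False, False, False, False]
hla_test = [True, True, False, False, False, False, False, False]
msh5_test = [False, True, True, False, True, False, False, False]

# Assumed combination rule: positive if either marker is positive.
combined_test = [h or m for h, m in zip(hla_test, msh5_test)]

hla_only = sensitivity_specificity(hla_test, has_cd)
hla_plus_msh5 = sensitivity_specificity(combined_test, has_cd)
print(hla_only, hla_plus_msh5)
```

    With an either-positive rule, sensitivity can only stay equal or increase and specificity can only stay equal or decrease, which matches the direction of the change the abstract reports.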

  7. Prevalence of genetic thrombophilic polymorphisms in the Sri Lankan population--implications for association study design and clinical genetic testing services.

    PubMed

    Dissanayake, Vajira H W; Weerasekera, Lakshini Y; Gammulla, C Gayani; Jayasekara, Rohan W

    2009-10-01

    We investigated the prevalence of genotypes/alleles of single nucleotide polymorphisms (SNPs), and of the haplotypes defined by them, in three genes in which variations are associated with venous thromboembolism. The study covered 80 Sinhalese, 80 Sri Lankan Tamils and 80 Moors from the Sri Lankan population; we compared the SNP data with those of other populations in Southern India and the haplotype data with those of HapMap populations. The genes and polymorphisms investigated were Methylenetetrahydrofolate reductase (MTHFR) - 677C>T (rs1801133), 1298A>C (rs1801131), 1317T>C, 1793G>A (rs2274976); Factor V (F5) - 1691G>A (rs6025) and 4070A>G (rs1800595); and prothrombin (F2) - 20210G>A (rs1799963). The polymorphisms were genotyped using PCR/RFLP methods. The prevalence of the variant alleles of each polymorphism in the Sinhalese, Tamils, and Moors was: MTHFR 677T: Sinhalese - 13%, Tamils - 9%, Moors - 9%. 1317T>C: Sinhalese - 0%, Tamils - 0%, Moors - 0%. 1793A: Sinhalese - 19%, Tamils - 19%, Moors - 19%. F5 1691A: Sinhalese - 2%, Tamils - 3%, Moors - 2%. 4070G: Sinhalese - 6%, Tamils - 5%, Moors - 8%. F2 20210A: Sinhalese - 0%, Tamils - 0%, Moors - 0%. The frequencies observed were similar to data from other South Indian populations; the haplotype data showed haplotypes unique to the Sri Lankan population when compared to HapMap populations. rs9651118 was identified as a SNP that splits the haplotypes harbouring the functionally significant 677T allele in the MTHFR gene. These data would be useful in planning genetic association studies in the Sri Lankan population and in deciding on which genetic variants should be tested in a clinical genetic testing service.
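
    For readers unfamiliar with how a variant allele percentage is obtained from genotype counts, the following is a minimal sketch of the standard allele-counting calculation for a biallelic SNP. The abstract does not give genotype counts or say exactly how its percentages were computed, so the counts and the function name are hypothetical.

```python
def variant_allele_frequency(n_het, n_hom_variant, n_individuals):
    """Allele-counting estimate for a biallelic SNP in diploid individuals.

    Each heterozygote carries one variant allele, each variant homozygote
    carries two, and the sample holds 2 * n_individuals alleles in total.
    """
    return (n_het + 2 * n_hom_variant) / (2 * n_individuals)


# Hypothetical counts for a group of 80 individuals (not study data).
freq = variant_allele_frequency(n_het=18, n_hom_variant=1, n_individuals=80)
print(f"{freq:.1%}")
```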

  8. Discovery of novel MHC-class I alleles and haplotypes in Filipino cynomolgus macaques (Macaca fascicularis) by pyrosequencing and Sanger sequencing: Mafa-class I polymorphism.

    PubMed

    Shiina, Takashi; Yamada, Yukiho; Aarnink, Alice; Suzuki, Shingo; Masuya, Anri; Ito, Sayaka; Ido, Daisuke; Yamanaka, Hisashi; Iwatani, Chizuru; Tsuchiya, Hideaki; Ishigaki, Hirohito; Itoh, Yasushi; Ogasawara, Kazumasa; Kulski, Jerzy K; Blancher, Antoine

    2015-10-01

    Although the low polymorphism of the major histocompatibility complex (MHC) transplantation genes in the Filipino cynomolgus macaque (Macaca fascicularis) is expected to have important implications in the selection and breeding of animals for medical research, detailed polymorphism information is still lacking for many of the duplicated class I genes. To better elucidate the degree and types of MHC polymorphisms and haplotypes in the Filipino macaque population, we genotyped 127 unrelated animals by the Sanger sequencing method and high-resolution pyrosequencing and identified 112 different alleles: 28 at cynomolgus macaque MHC (Mafa)-A, 54 at Mafa-B, 12 at Mafa-I, 11 at Mafa-E, and seven at Mafa-F, of which 56 were newly described. Of these, the newly discovered Mafa-A8*01:01 lineage allele had low nucleotide similarities (<86%) with primate MHC class I genes, and it was also conserved in the Vietnamese and Indonesian populations. In addition, haplotype estimations revealed 17 Mafa-A, 23 Mafa-B, and 12 Mafa-E haplotypes integrated with 84 Mafa-class I haplotypes and Mafa-F alleles. Of these, the two Mafa-class I haplotypes, F/A/E/B-Hp1 and F/A/E/B-Hp2, had the highest haplotype frequencies at 10.6 and 10.2%, respectively. This suggests that large scale genetic screening of the Filipino macaque population would identify these and other high-frequency Mafa-class I haplotypes that could be used as MHC control animals for the benefit of biomedical research.

  9. Biological Effects of COMT Haplotypes and Psychosis Risk in 22q11.2 Deletion Syndrome

    PubMed Central

    Gothelf, Doron; Law, Amanda J.; Frisch, Amos; Chen, Jingshan; Zarchi, Omer; Michaelovsky, Elena; Ren-Patterson, Renee; Lipska, Barbara K.; Carmel, Miri; Kolachana, Bhaskar; Weizman, Abraham; Weinberger, Daniel R.

    2013-01-01

    Background: 22q11.2 deletion syndrome (22q11.2DS) is the most common genetic syndrome associated with schizophrenia. The catechol-o-methyltransferase (COMT) gene is located in the obligatory deletion region, and possible associations between COMT variants and neuropsychiatric manifestations in 22q11.2DS have been reported. The purpose of the current study was to evaluate the effect of COMT hemizygosity and molecular haplotypes on gene expression and enzyme activity and its association with psychotic symptoms in 22q11.2DS. Methods: Lymphoblast samples were drawn from 53 individuals with 22q11.2DS and 16 typically developing controls. We measured COMT mRNA and protein expression and enzyme activity using standard procedures. The presence of a psychotic disorder and cognitive deficits were also evaluated using structured testing. Results: There was a ~50% reduction in COMT mRNA, protein and enzyme activity levels in 22q11.2DS samples. Haplotype analysis revealed clear phenotypic differences between various Val-containing haplotypes on COMT-3′UTR extended mRNA, S-COMT and MB proteins and enzyme activity. The G variant of rs165599, a 3′UTR SNP, was associated with low levels of COMT expression and with the presence of psychosis and lower performance IQ scores in our 22q11.2DS sample. Finally, we demonstrate that the COMT rs74745580 ‘T’ mutation is associated with absent S-COMT expression and very low COMT activity in two 22q11.2DS individuals. Conclusions: Our findings confirm a robust effect of COMT hemizygosity on COMT activity and show complex interactions of variants within the COMT gene that influence COMT biology and confound conclusions based on associations with the Val158Met genotype alone. PMID:23992923

  10. Single nucleotide polymorphism and haplotype effects associated with somatic cell score in German Holstein cattle

    PubMed Central

    2014-01-01

    Background: To better understand the genetic determination of udder health, we performed a genome-wide association study (GWAS) on a population of 2354 German Holstein bulls for which daughter yield deviations (DYD) for somatic cell score (SCS) were available. For this study, we used genetic information of 44 576 informative single nucleotide polymorphisms (SNPs) and 11 725 inferred haplotype blocks. Results: When accounting for the sub-structure of the analyzed population, 16 SNPs and 10 haplotypes in six genomic regions were significant at the Bonferroni threshold of P ≤ 1.14 × 10⁻⁶. The size of the identified regions ranged from 0.05 to 5.62 Mb. Genomic regions on chromosomes 5, 6, 18 and 19 coincided with known QTL affecting SCS, while additional genomic regions were found on chromosomes 13 and X. Of particular interest is the region on chromosome 6 between 85 and 88 Mb, where QTL for mastitis traits and significant SNPs for SCS in different Holstein populations coincide with our results. In all identified regions, except for the region on chromosome X, significant SNPs were present in significant haplotypes. The minor alleles of identified SNPs on chromosomes 18 and 19, and the major alleles of SNPs on chromosomes 6 and X were favorable for a lower SCS. Differences in somatic cell count (SCC) between alternative SNP alleles reached 14 000 cells/mL. Conclusions: The results support the polygenic nature of the genetic determination of SCS, confirm the importance of previously reported QTL, and provide evidence for the segregation of additional QTL for SCS in Holstein cattle. The small size of the regions identified here will facilitate the search for causal genetic variations that affect gene functions. PMID:24898131

  11. Genetic variation of 'Candidatus Liberibacter solanacearum' haplotype C and identification of a novel haplotype from Trioza urticae and stinging nettle.

    PubMed

    Haapalainen, Minna L; Wang, Jinhui; Latvala, Satu; Lehtonen, Mikko T; Pirhonen, Minna; Nissinen, Anne I

    2018-03-30

    'Candidatus Liberibacter solanacearum' (CLso) haplotype C is associated with disease in carrots and transmitted by the carrot psyllid Trioza apicalis. To identify possible other sources and vectors of this pathogen in Finland, samples were taken of wild plants within and near the carrot fields, the psyllids feeding on these plants, parsnips growing next to carrots, and carrot seeds. For analyzing the genotype of the CLso positive samples, a multi-locus sequence typing (MLST) scheme was developed. CLso haplotype C was detected in 11% of the Trioza anthrisci samples, in 35% of the Anthriscus sylvestris plants with discoloration, and in parsnips showing leaf discoloration. MLST revealed that the CLso in T. anthrisci and most A. sylvestris plants represent different strains than the bacteria found in T. apicalis and the cultivated plants. CLso haplotype D was detected in two of the 34 carrot seed lots tested, but was not detected in the plants grown from these seeds. Phylogenetic analysis by UPGMA clustering suggested that the haplotype D is more closely related to the haplotype A than to C. A novel, sixth haplotype of CLso, most closely related to A and D, was found in the psyllid Trioza urticae and stinging nettle (Urtica dioica, Urticaceae), and named as haplotype U.

  12. Single nucleotide polymorphism (SNP) variation of wolves (Canis lupus) in Southeast Alaska and comparison with wolves, dogs, and coyotes in North America.

    PubMed

    Cronin, Matthew A; Cánovas, Angela; Bannasch, Danika L; Oberbauer, Anita M; Medrano, Juan F

    2015-01-01

    There is considerable interest in the genetics of wolves (Canis lupus) because of their close relationship to domestic dogs (C. familiaris) and the need for informed conservation and management. This includes wolf populations in Southeast Alaska for which we determined genotypes of 305 wolves at 173662 single nucleotide polymorphism (SNP) loci. After removal of invariant and linked SNP, 123801 SNP were used to quantify genetic differentiation of wolves in Southeast Alaska and wolves, coyotes (C. latrans), and dogs from other areas in North America. There is differentiation of SNP allele frequencies between the species (wolves, coyotes, and dogs), although differentiation is relatively low between some wolf and coyote populations. There are varying levels of differentiation among populations of wolves, including low differentiation of wolves in interior Alaska, British Columbia, and the northern US Rocky Mountains. There is considerable differentiation of SNP allele frequencies of wolves in Southeast Alaska from wolves in other areas. However, wolves in Southeast Alaska are not a genetically homogeneous group and there are comparable levels of genetic differentiation among areas within Southeast Alaska and between Southeast Alaska and other geographic areas. SNP variation and other genetic data are discussed regarding taxonomy and management. © The American Genetic Association 2014. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  13. Accuracy of various human NAT2 SNP genotyping panels to infer rapid, intermediate and slow acetylator phenotypes

    PubMed Central

    Hein, David W; Doll, Mark A

    2012-01-01

    Aim: Humans exhibit genetic polymorphism in NAT2 resulting in rapid, intermediate and slow acetylator phenotypes. Over 65 NAT2 variants possessing one or more SNPs in the 870-bp NAT2 coding region have been reported. The seven most frequent SNPs are rs1801279 (191G>A), rs1041983 (282C>T), rs1801280 (341T>C), rs1799929 (481C>T), rs1799930 (590G>A), rs1208 (803A>G) and rs1799931 (857G>A). The majority of studies investigate the NAT2 genotype assay for three SNPs: 481C>T, 590G>A and 857G>A. A tag-SNP (rs1495741) recently identified in a genome-wide association study has also been proposed as a biomarker for the NAT2 phenotype. Materials & methods: Sulfamethazine N-acetyltransferase catalytic activities were measured in cryopreserved human hepatocytes from a convenience sample of individuals in the USA with an ethnic frequency similar to the 2010 US population census. These activities were segregated by the tag-SNP rs1495741 and each of the seven SNPs described above. We assessed the accuracy of the tag-SNP and various two-, three-, four- and seven-SNP genotyping panels for their ability to accurately infer NAT2 phenotype. Results: The accuracies of the various NAT2 SNP genotype panels in inferring NAT2 phenotype were as follows: seven-SNP: 98.4%; tag-SNP: 77.7%; two-SNP: 96.1%; three-SNP: 92.2%; and four-SNP: 98.4%. Conclusion: A NAT2 four-SNP genotype panel of rs1801279 (191G>A), rs1801280 (341T>C), rs1799930 (590G>A) and rs1799931 (857G>A) infers NAT2 acetylator phenotype with high accuracy, and is recommended over the tag-, two-, three- and (for economy of scale) the seven-SNP genotyping panels, particularly in populations of non-European ancestry. PMID:22092036

  14. On the use of haplotype phylogeny to detect disease susceptibility loci

    PubMed Central

    Bardel, Claire; Danjean, Vincent; Hugot, Jean-Pierre; Darlu, Pierre; Génin, Emmanuelle

    2005-01-01

    Background: The cladistic approach proposed by Templeton has been presented as promising for the study of the genetic factors involved in common diseases. This approach allows the joint study of multiple markers within a gene by considering haplotypes and grouping them in nested clades. The idea is to search for clades with an excess of cases as compared to the whole sample and to identify the mutations defining these clades as potential candidate disease susceptibility sites. However, the performance of this approach for the study of the genetic factors involved in complex diseases has never been studied. Results: In this paper, we propose a new method to perform such a cladistic analysis and we estimate its power through simulations. We show that under models where the susceptibility to the disease is caused by a single genetic variant, the cladistic test is neither really more powerful to detect an association nor really more efficient to localize the susceptibility site than individual SNP testing. However, when two interacting sites are responsible for the disease, the cladistic analysis greatly improves the probability of finding the two susceptibility sites. The impacts of linkage disequilibrium and of the tree characteristics on the efficiency of the cladistic analysis are also discussed. An application to a real data set concerning the CARD15 gene and Crohn disease shows that the method can successfully identify the three variant sites that are involved in the disease susceptibility. Conclusion: The use of phylogenies to group haplotypes is especially interesting to pinpoint the sites that are likely to be involved in disease susceptibility among the different markers identified within a gene. PMID:15904492

  15. Accurate HLA type inference using a weighted similarity graph.

    PubMed

    Xie, Minzhu; Li, Jing; Jiang, Tao

    2010-12-14

    The human leukocyte antigen system (HLA) contains many highly variable genes. HLA genes play an important role in the human immune system, and HLA gene matching is crucial for the success of human organ transplantations. Numerous studies have demonstrated that variation in HLA genes is associated with many autoimmune, inflammatory and infectious diseases. However, typing HLA genes by serology or PCR is time consuming and expensive, which limits large-scale studies involving HLA genes. Since it is much easier and cheaper to obtain single nucleotide polymorphism (SNP) genotype data, accurate computational algorithms to infer HLA gene types from SNP genotype data are needed. To infer HLA types from SNP genotypes, the first step is to infer SNP haplotypes from genotypes. However, for the same SNP genotype data set, the haplotype configurations inferred by different methods are usually inconsistent, and it is often difficult to decide which one is true. In this paper, we design an accurate HLA gene type inference algorithm by utilizing SNP genotype data from pedigrees, known HLA gene types of some individuals and the relationship between inferred SNP haplotypes and HLA gene types. Given a set of haplotypes inferred from the genotypes of a population consisting of many pedigrees, the algorithm first constructs a weighted similarity graph based on a new haplotype similarity measure and derives constraint edges from known HLA gene types. Based on the principle that different HLA gene alleles should have different background haplotypes, the algorithm searches for an optimal labeling of all the haplotypes with unknown HLA gene types such that the total weight among the same HLA gene types is maximized. To deal with ambiguous haplotype solutions, we use a genetic algorithm to select haplotype configurations that tend to maximize the same optimization criterion. Our experiments on a previously typed subset of the HapMap data show that the algorithm is highly accurate

  16. Developing a new nonbinary SNP fluorescent multiplex detection system for forensic application in China.

    PubMed

    Liu, Yanfang; Liao, Huidan; Liu, Ying; Guo, Juanjuan; Sun, Yi; Fu, Xiaoliang; Xiao, Ding; Cai, Jifeng; Lan, Lingmei; Xie, Pingli; Zha, Lagabaiyila

    2017-04-01

    Nonbinary single-nucleotide polymorphisms (SNPs) are potential forensic genetic markers because their discrimination power is greater than that of normal binary SNPs and because they can be typed in highly degraded samples. We previously developed a nonbinary SNP multiplex typing assay. In this study, we selected an additional 20 nonbinary SNPs from the NCBI SNP database and verified them through pyrosequencing. These 20 nonbinary SNPs were analyzed using the fluorescent-labeled SNaPshot multiplex SNP typing method. The allele frequencies and genetic parameters of these 20 nonbinary SNPs were determined among 314 unrelated individuals from Han populations in China. The total power of discrimination was 0.9999999999994, and the cumulative probability of exclusion was 0.9986. Moreover, combining this 20 nonbinary SNP assay with the 20 nonbinary SNP assay we previously developed showed that the cumulative probability of exclusion of the 40 nonbinary SNPs was 0.999991 and that no significant linkage disequilibrium was observed among the 40 nonbinary SNPs. Thus, we concluded that this new system consisting of 20 new nonbinary SNPs could provide highly informative polymorphic data which could be further used in forensic applications and would serve as a potentially valuable supplement to forensic DNA analysis. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
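
    Panel-wide figures such as the total power of discrimination and the cumulative probability of exclusion are conventionally obtained by combining per-locus values under the assumption that the loci are independent. The sketch below shows that conventional product formula; the abstract does not state the formula the authors used, and the per-locus values and function name are hypothetical.

```python
from math import prod


def cumulative_probability(per_locus_values):
    """Combine per-locus probabilities as 1 - product(1 - p_i).

    Assumes independent loci (no linkage disequilibrium between markers).
    """
    return 1 - prod(1 - p for p in per_locus_values)


# Hypothetical per-locus powers of discrimination (not study data).
per_locus_pd = [0.5, 0.6, 0.75]
total_pd = cumulative_probability(per_locus_pd)
print(round(total_pd, 4))
```

    Because every added independent locus multiplies the remaining miss probability by a factor below one, the combined value can only grow as markers are added, which is why merging two 20-SNP panels yields a higher cumulative figure than either panel alone.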

  17. High throughput SNP discovery and genotyping in grapevine (Vitis vinifera L.) by combining a re-sequencing approach and SNPlex technology

    PubMed Central

    Lijavetzky, Diego; Cabezas, José Antonio; Ibáñez, Ana; Rodríguez, Virginia; Martínez-Zapater, José M

    2007-01-01

    Background: Single-nucleotide polymorphisms (SNPs) are the most abundant type of DNA sequence polymorphisms. Their higher availability and stability when compared to simple sequence repeats (SSRs) provide enhanced possibilities for genetic and breeding applications such as cultivar identification, construction of genetic maps, the assessment of genetic diversity, the detection of genotype/phenotype associations, or marker-assisted breeding. In addition, the efficiency of these activities can be improved thanks to the ease with which SNP genotyping can be automated. Expressed sequence tags (EST) sequencing projects in grapevine are allowing for the in silico detection of multiple putative sequence polymorphisms within and among a reduced number of cultivars. In parallel, the sequence of the grapevine cultivar Pinot Noir is also providing thousands of polymorphisms present in this highly heterozygous genome. Still, the general application of those SNPs requires further validation since their use could be restricted to those specific genotypes. Results: In order to develop a large SNP set of wide application in grapevine we followed a systematic re-sequencing approach in a group of 11 grape genotypes corresponding to ancient unrelated cultivars as well as wild plants. Using this approach, we have sequenced 230 gene fragments, which represents the analysis of over 1 Mb of grape DNA sequence. This analysis has allowed the discovery of 1573 SNPs with an average of one SNP every 64 bp (one SNP every 47 bp in non-coding regions and every 69 bp in coding regions). Nucleotide diversity in grape (π = 0.0051) was found to be similar to values observed in highly polymorphic plant species such as maize. The average number of haplotypes per gene sequence was estimated as six, with three haplotypes representing over 83% of the analyzed sequences. Short-range linkage disequilibrium (LD) studies within the analyzed sequences indicate the existence of a rapid decay of LD within the

  18. KinSNP software for homozygosity mapping of disease genes using SNP microarrays.

    PubMed

    Amir, El-Ad David; Bartal, Ofer; Morad, Efrat; Nagar, Tal; Sheynin, Jony; Parvari, Ruti; Chalifa-Caspi, Vered

    2010-08-01

    Consanguineous families affected with a recessive genetic disease caused by homozygotisation of a mutation offer a unique advantage for positional cloning of rare diseases. Homozygosity mapping of patient genotypes is a powerful technique for the identification of the genomic locus harbouring the causative mutation. This strategy relies on the observation that in these patients a large region spanning the disease locus is also homozygous with high probability. The high marker density in single nucleotide polymorphism (SNP) arrays is extremely advantageous for homozygosity mapping. We present KinSNP, a user-friendly software tool for homozygosity mapping using SNP arrays. The software searches for stretches of SNPs which are homozygous for the same allele in all ascertained sick individuals. User-specified parameters control the number of allowed genotyping 'errors' within homozygous blocks. Candidate disease regions are then reported in a detailed, coloured Excel file, along with genotypes of family members and healthy controls. An interactive genome browser has been included which shows homozygous blocks, individual genotypes, genes and further annotations along the chromosomes, with zooming and scrolling capabilities. The software has been used to identify the location of a mutated gene causing insensitivity to pain in a large Bedouin family. KinSNP is freely available from.
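
    The core search described above (stretches of consecutive SNPs that are homozygous for the same allele in every affected individual) can be sketched as follows. This is an illustration of the general idea only, not KinSNP's implementation: the function names, the 'AA'/'AB'/'BB' genotype encoding and the min_len parameter are assumptions, and the tolerance for genotyping errors mentioned in the abstract is not modelled.

```python
def is_shared_homozygous(calls):
    """True if every individual has the same homozygous call, e.g. all 'AA'."""
    first = calls[0]
    return first[0] == first[1] and all(call == first for call in calls)


def shared_homozygous_runs(affected_genotypes, min_len=3):
    """Return (start, end) marker index pairs, inclusive, of shared runs.

    affected_genotypes: one list per affected individual; each list holds
    genotype strings such as 'AA', 'AB' or 'BB', ordered along a chromosome.
    """
    n_markers = len(affected_genotypes[0])
    flags = [
        is_shared_homozygous([ind[i] for ind in affected_genotypes])
        for i in range(n_markers)
    ]
    runs, start = [], None
    # A trailing False sentinel closes a run that reaches the last marker.
    for i, ok in enumerate(flags + [False]):
        if ok and start is None:
            start = i
        elif not ok and start is not None:
            if i - start >= min_len:
                runs.append((start, i - 1))
            start = None
    return runs


# Hypothetical genotypes for two affected relatives (not real data).
affected = [
    ["AA", "BB", "BB", "AA", "AB", "AA"],
    ["AA", "BB", "BB", "AA", "AA", "AA"],
]
runs = shared_homozygous_runs(affected, min_len=3)
print(runs)
```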

  19. HLA class II SNP interactions and the association with type 1 diabetes mellitus in Bengali speaking patients of Eastern India.

    PubMed

    Raha, Oindrila; Sarkar, Biswanath; Lakkakula, Bhaskar V K S; Pasumarthy, Veerraju; Godi, Sudhakar; Chowdhury, Subhankar; Raychaudhuri, Pradip; Vadlamudi, Raghavendra Rao

    2013-02-27

    Several studies have demonstrated a fundamental role for HLA in susceptibility to, or protection from, type 1 diabetes mellitus (T1DM). However, this has not been adequately studied in Asian Indian populations. The aim was to assess the frequency of HLA class II (DPA1, DPB1, DQA1, DQB1 and DRB1) alleles associated with susceptibility to or protection from T1DM in a Bengali population of India with diabetes. This was a single nucleotide polymorphism study. HLA genotyping was performed by polymerase chain reaction, followed by determination of HLA-DP, DQ, and DRB1 genotypes and haplotypes by sequencing. The results were analyzed with Plink software. χ2 tests were used for the inferential statistics. To our knowledge, this study is the first of its kind to examine the HLA association with T1DM by SNP analysis. The study recruited 151 patients with T1DM and the same number of ethno-linguistically matched and sex-matched non-diabetic controls. The present study found a significant negative correlation for SNP rs7990 of HLA-DQA1 (p = 0.009), again indicating that risk from HLA is considerably more with T1DM. This study demonstrates that the HLA class-II alleles play a major role in the genetic basis of T1DM.

  20. A founder haplotype of APOE-Sendai mutation associated with lipoprotein glomerulopathy.

    PubMed

    Toyota, Kentaro; Hashimoto, Taeko; Ogino, Daisuke; Matsunaga, Akira; Ito, Minoru; Masakane, Ikuto; Degawa, Noriyuki; Sato, Hiroshi; Shirai, Sayuri; Umetsu, Kazuo; Tamiya, Gen; Saito, Takao; Hayasaka, Kiyoshi

    2013-05-01

    Lipoprotein glomerulopathy (LPG) is a hereditary disease characterized by lipoprotein thrombi in the glomerulus, hyperlipoproteinemia, and a marked increase in serum apolipoprotein E (APOE). More than 12 APOE mutations have been identified as causes of LPG, and APOE-Sendai (Arg145Pro) mutation was frequently detected in patients from the eastern part of Japan including Yamagata prefecture. Recently, effective therapy with intensive lipid-lowering agents was established, and epidemiologic data are required for early diagnosis. We determined the haplotype structure of APOE-Sendai in 13 patients from 9 unrelated families with LPG, and found that the haplotype of all APOE-Sendai mutations was identical, suggesting that APOE-Sendai mutation is common in Japanese patients probably through a founder effect. We also studied the gene frequency of APOE-Sendai in 2023 control subjects and 418 patients receiving hemodialysis in Yamagata prefecture using the TaqMan method, but did not identify any subjects carrying the mutation, indicating that it is very rare in the general population even in the eastern part of Japan. In addition to APOE mutation, other genetic and/or epigenetic factors are considered to be involved in the pathogenesis of LPG because of its low penetrance. The patients did not have a common haplotype of the counterpart APOE allele, and some patients had the same haplotype of the counterpart APOE allele as the asymptomatic carriers. These results suggest that the counterpart APOE allele is not likely associated with the onset of LPG. Further study is required to clarify the pathogenesis of LPG.

  1. The higher frequency of IgA deficiency among Swedish twins is not explained by HLA haplotypes.

    PubMed

    Frankowiack, M; Kovanen, R-M; Repasky, G A; Lim, C K; Song, C; Pedersen, N L; Hammarström, L

    2015-01-01

    Serum immunoglobulin A (IgA) concentrations were determined in 12 600 adult Swedish twins, applying a high-throughput reverse-phase protein microarray technique. The prevalence of IgA deficiency (IgAD) was found to be 1:241 in monozygotic (MZ) twins and 1:198 in dizygotic (DZ) twins. Hence, the prevalence in twins is markedly elevated as compared with the normal Swedish adult population (1:600). The twins did not show a difference in the frequency of HLA haplotypes in comparison with almost 40 000 healthy Swedish controls. As expected, the risk-conveying HLA alleles A*01, B*08 and DRB1*01 were overrepresented among the IgAD twins and were also associated with significantly lower mean serum IgA concentrations in the twin cohort. In contrast, significantly higher mean IgA concentrations were found among individuals carrying the protective HLA alleles B*07 and DRB1*15. Exome sequencing data from two MZ twin pairs discordant for the deficiency showed no differences between the siblings. Model fitting analyses derived a heritability of 35% and indicate that genetic influences are modestly important for IgAD. The probandwise concordance rates for IgAD were found to be 31% for MZ and 13% for DZ twins.
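
    Probandwise concordance, the statistic quoted above for MZ and DZ twins, is commonly computed from the numbers of concordant and discordant affected pairs. The sketch below uses the usual formula under the assumption of complete ascertainment (every affected twin counts as a proband); the abstract does not report pair counts, so the counts and the function name are hypothetical.

```python
def probandwise_concordance(concordant_pairs, discordant_pairs):
    """Probandwise concordance = 2C / (2C + D) under complete ascertainment.

    C: pairs in which both twins are affected (two probands per pair).
    D: pairs in which only one twin is affected (one proband per pair).
    """
    return 2 * concordant_pairs / (2 * concordant_pairs + discordant_pairs)


# Hypothetical pair counts (not study data).
rate = probandwise_concordance(concordant_pairs=5, discordant_pairs=30)
print(f"{rate:.0%}")
```

    A higher probandwise rate in MZ than in DZ pairs, as reported in the abstract, is the pattern expected when genetic factors contribute to a trait.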

  2. Molecular characterization of a long range haplotype affecting protein yield and mastitis susceptibility in Norwegian Red cattle.

    PubMed

    Sodeland, Marte; Grove, Harald; Kent, Matthew; Taylor, Simon; Svendsen, Morten; Hayes, Ben J; Lien, Sigbjørn

    2011-08-11

    Previous fine mapping studies in Norwegian Red cattle (NRC) in the region 86-90.4 Mb on Bos taurus chromosome 6 (BTA6) have revealed a quantitative trait locus (QTL) for protein yield (PY) around 88 Mb and a QTL for clinical mastitis (CM) around 90 Mb. The close proximity of these QTLs may partly explain the unfavorable genetic correlation between these two traits in NRC. A long range haplotype covering this region was introduced into the NRC population through the importation of a Holstein-Friesian bull (1606 Frasse) from Sweden in the 1970s. It has been suggested that this haplotype has a favorable effect on milk protein content but an unfavorable effect on mastitis susceptibility. Selective breeding for milk production traits is likely to have increased the frequency of this haplotype in the NRC population. Association mapping for PY and CM in NRC was performed using genotypes from 556 SNPs throughout the region 86-97 Mb on BTA6 and daughter-yield-deviations (DYDs) from 2601 bulls made available from the Norwegian dairy herd recording system. Highest test scores for PY were found for single-nucleotide polymorphisms (SNPs) within and surrounding the genes CSN2 and CSN1S2, coding for the β-casein and α(S2)-casein proteins. High coverage re-sequencing by high throughput sequencing technology enabled molecular characterization of a long range haplotype from 1606 Frasse encompassing these two genes. Haplotype analysis of a large number of descendants from this bull indicated that the haplotype was not markedly disrupted by recombination in this region. The haplotype was associated with both increased milk protein content and increased susceptibility to mastitis, which might explain parts of the observed genetic correlation between PY and CM in NRC. Plausible causal polymorphisms affecting PY were detected in the promoter region and in the 5'-flanking UTR of CSN1S2. These polymorphisms could affect transcription or translation of CSN1S2 and thereby affect the amount

  3. Impacts of TNF-LTA SNPs/Haplotypes and Lifestyle Factors on Oral Carcinoma in an Indian Population.

    PubMed

    Bandil, Kapil; Singhal, Pallavi; Sharma, Upma; Hussain, Showket; Basu, Surojit; Parashari, Aditya; Singh, Veena; Sehgal, Ashok; Shivam, Animesh; Ahuja, Puneet; Bharadwaj, Mausumi; Banerjee, Basu Dev; Mehrotra, Ravi

    2016-10-01

    To investigate a potential association between single-nucleotide polymorphisms (SNPs) and haplotypes at the TNFA-LTA locus and the development of oral cancer in an Indian population. In this study, 150 oral precancer/cancer samples (50 precancer and 100 cancer), along with an equal number of control samples, were genotyped. Six SNPs at the TNF-LTA locus (i.e., -238G/A, -308G/A, -857C/T, -863C/A, -1031T/C, and +252A/G) were analyzed by use of a polymerase chain reaction-restriction fragment length polymorphism method; the assay was validated by sequencing 10% of samples. The allelic frequencies of TNFA and LTA SNPs were found to be significantly associated with the risk of oral cancer and precancerous lesions in comparison with controls (P < 0.0003). Further haplotypic analysis showed that two haplotypes (ATCTGG and ACACGG) served as risk haplotypes for oral cancer. These haplotypes were also found to be significantly and positively associated with lifestyle habits (tobacco chewing P = 0.04, odds ratio [OR] 3.4) and socioeconomic status (P = 0.01, OR 3.4). We noticed an increased percentage of risk haplotypes correlating with the aggressiveness of oral cancer. The percentages of risk haplotypes were found to be threefold higher in precancer and fourfold higher in advanced stages of oral cancer in comparison with controls. Five SNPs at the TNF-LTA locus (i.e., -308G>A, -857C>T, -863C>A, -1031T>C, and +252A>G) were found to be associated with the development of oral cancer. Two haplotypes (ATCTGG and ACACGG) emerged as major risk haplotypes for oral carcinoma progression and were also found to be associated with lifestyle factors and clinical aggressiveness. These findings make the TNF-LTA locus a suitable candidate for a future biomarker, which may be used either for early detection or for helping to improve treatment efficacy and effectiveness.

  4. [Association Between SNP rs6007897 of CELSR1 and Acute Ischemic Stroke in Western China Han Population: a Case-control Study].

    PubMed

    Qin, Feng-qin; Yu, Li-hua; Hu, Wen-ting; Guo, Jian; Chen, Ning; Guo, Jiang; Fang, Jing-huan; He, Li

    2015-07-01

    To investigate the relationship between single nucleotide polymorphism (SNP) rs6007897 of CELSR1 and acute ischemic stroke in the Western China Han population. All subjects (759 acute ischemic stroke patients and 786 controls) were genotyped using ligation detection reaction (LDR). We compared SNP rs6007897 genotype and allele frequencies between the two groups. Two genotypes (AA, AG) of rs6007897 were found in both the stroke and control groups. There was no statistically significant difference in genotype or allele frequency between the two groups. After adjusting for risk factors, we found no significant association between rs6007897 and ischemic stroke (P = 0.797, odds ratio (OR) = 0.886, 95% confidence interval (CI) = 0.352-2.227). SNP rs6007897 of CELSR1 was not significantly associated with ischemic stroke in the Western China Han population.

  5. iXora: exact haplotype inferencing and trait association.

    PubMed

    Utro, Filippo; Haiminen, Niina; Livingstone, Donald; Cornejo, Omar E; Royaert, Stefan; Schnell, Raymond J; Motamayor, Juan Carlos; Kuhn, David N; Parida, Laxmi

    2013-06-06

    We address the task of extracting accurate haplotypes from genotype data of individuals of large F1 populations for mapping studies. While methods for inferring parental haplotype assignments on large F1 populations exist in theory, these approaches do not work in practice at high levels of accuracy. We have designed iXora (Identifying crossovers and recombining alleles), a robust method for extracting reliable haplotypes of a mapping population, as well as parental haplotypes, that runs in linear time. Each allele in the progeny is assigned not just to a parent, but more precisely to a haplotype inherited from the parent. iXora shows an improvement of at least 15% in accuracy over similar systems in literature. Furthermore, iXora provides an easy-to-use, comprehensive environment for association studies and hypothesis checking in populations of related individuals. iXora provides detailed resolution in parental inheritance, along with the capability of handling very large populations, which allows for accurate haplotype extraction and trait association. iXora is available for non-commercial use from http://researcher.ibm.com/project/3430.

  6. APOBEC3H haplotypes and HIV-1 pro-viral vif DNA sequence diversity in early untreated human immunodeficiency virus-1 infection.

    PubMed

    Gourraud, P A; Karaouni, A; Woo, J M; Schmidt, T; Oksenberg, J R; Hecht, F M; Liegler, T J; Barbour, J D

    2011-03-01

    We examined single nucleotide polymorphisms (SNP) in the APOBEC3 locus on chromosome 22, paired with population sequences of pro-viral human immunodeficiency virus-1 (HIV-1) vif from peripheral blood mononuclear cells, from 96 recently HIV-1-infected treatment-naive adults. We found evidence for the existence of an APOBEC3H linkage disequilibrium (LD) block associated with variation in GA → AA, or APOBEC3F/H signature, sequence changes in pro-viral HIV-1 vif sequence (top 10 significant SNPs with a significant p = 4.8 × 10(-3)). We identified a common five position risk haplotype distal to APOBEC3H (A3Hrh). These markers were in high LD (D' = 1; r(2) = 0.98) to a previously described A3H "RED" haplotype containing a variant (E121) with enhanced susceptibility to HIV-1 Vif. This association was confirmed by a haplotype analysis. Homozygote carriers of the A3Hrh had lower GA->AA (A3F/H) sequence editing upon pro-viral HIV-1 vif sequence (p = 0.01), and lower HIV-1 RNA levels over time during early, untreated HIV-1 infection, (p = 0.015 mixed effects model). This effect may be due to enhanced susceptibility of A3H forms to HIV-1 Vif mediated viral suppression of sequence editing activity, slowing viral diversification and escape from immune responses. Copyright © 2011 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.
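
    The abstract above reports linkage disequilibrium as D' and r(2). As a rough illustration of how these two measures relate, the following sketch computes both for a pair of biallelic SNPs from a haplotype frequency and two allele frequencies. The function name and the input frequencies are hypothetical and are not taken from the study.

```python
def ld_stats(p_ab, p_a, p_b):
    """Return (D, D', r^2) for two biallelic loci.

    p_ab: frequency of the haplotype carrying allele A at locus 1
          and allele B at locus 2
    p_a:  frequency of allele A at locus 1
    p_b:  frequency of allele B at locus 2
    """
    # Raw disequilibrium coefficient.
    d = p_ab - p_a * p_b

    # Largest |D| attainable given the allele frequencies and the sign of D.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = abs(d) / d_max if d_max > 0 else 0.0

    # Squared correlation between the two loci.
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    r2 = d * d / denom if denom > 0 else 0.0
    return d, d_prime, r2


# Hypothetical frequencies: allele A always travels with allele B,
# but B also occurs without A.
d, d_prime, r2 = ld_stats(p_ab=0.3, p_a=0.3, p_b=0.5)
print(f"D = {d:.3f}, D' = {d_prime:.3f}, r^2 = {r2:.3f}")
```

    With these made-up inputs D' reaches 1 while r(2) stays well below 1, which is why the two measures are usually reported together.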

  7. Ancestral Asian source(s) of new world Y-chromosome founder haplotypes.

    PubMed Central

    Karafet, T M; Zegura, S L; Posukh, O; Osipova, L; Bergen, A; Long, J; Goldman, D; Klitz, W; Harihara, S; de Knijff, P; Wiebe, V; Griffiths, R C; Templeton, A R; Hammer, M F

    1999-01-01

    Haplotypes constructed from Y-chromosome markers were used to trace the origins of Native Americans. Our sample consisted of 2,198 males from 60 global populations, including 19 Native American and 15 indigenous North Asian groups. A set of 12 biallelic polymorphisms gave rise to 14 unique Y-chromosome haplotypes that were unevenly distributed among the populations. Combining multiallelic variation at two Y-linked microsatellites (DYS19 and DXYS156Y) with the unique haplotypes results in a total of 95 combination haplotypes. Contra previous findings based on Y-chromosome data, our new results suggest the possibility of more than one Native American paternal founder haplotype. We postulate that, of the nine unique haplotypes found in Native Americans, haplotypes 1C and 1F are the best candidates for major New World founder haplotypes, whereas haplotypes 1B, 1I, and 1U may either be founder haplotypes and/or have arrived in the New World via recent admixture. Two of the other four haplotypes (YAP+ haplotypes 4 and 5) are probably present because of post-Columbian admixture, whereas haplotype 1G may have originated in the New World, and the Old World source of the final New World haplotype (1D) remains unresolved. The contrasting distribution patterns of the two major candidate founder haplotypes in Asia and the New World, as well as the results of a nested cladistic analysis, suggest the possibility of more than one paternal migration from the general region of Lake Baikal to the Americas. PMID:10053017

  8. rSNPBase 3.0: an updated database of SNP-related regulatory elements, element-gene pairs and SNP-based gene regulatory networks

    PubMed Central

    2018-01-01

    Here, we present the updated rSNPBase 3.0 database (http://rsnp3.psych.ac.cn), which provides human SNP-related regulatory elements, element-gene pairs and SNP-based regulatory networks. This database is the updated version of the SNP regulatory annotation database rSNPBase and rVarBase. In comparison to the last two versions, there are both structural and data adjustments in rSNPBase 3.0: (i) The most significant new feature is the expansion of analysis scope from SNP-related regulatory elements to include regulatory element–target gene pairs (E–G pairs), so that it can provide SNP-based gene regulatory networks. (ii) Web function was modified according to data content and a new network search module is provided in the rSNPBase 3.0 in addition to the previous regulatory SNP (rSNP) search module. The two search modules support data query for detailed information (related-elements, element-gene pairs, and other extended annotations) on specific SNPs and SNP-related graphic networks constructed by interacting transcription factors (TFs), miRNAs and genes. (iii) The type of regulatory elements was modified and enriched. To the best of our knowledge, the updated rSNPBase 3.0 is the first data tool that supports SNP functional analysis from a regulatory network perspective; it will provide both a comprehensive understanding and concrete guidance for SNP-related regulatory studies. PMID:29140525

  9. VKORC1 haplotypes are associated with arterial vascular diseases (stroke, coronary heart disease, and aortic dissection).

    PubMed

    Wang, Yibo; Zhang, Weili; Zhang, Yuhui; Yang, Yuejin; Sun, Lizhong; Hu, Shengshou; Chen, Jilin; Zhang, Channa; Zheng, Yi; Zhen, Yisong; Sun, Kai; Fu, Chunyan; Yang, Tao; Wang, Jianwei; Sun, Jing; Wu, Haiying; Glasgow, Wayne C; Hui, Rutai

    2006-03-28

    The haplotypes in the gene vitamin K epoxide reductase complex subunit 1 (VKORC1) have been found to affect warfarin dose response through effects on the formation of reduced-form vitamin K, a cofactor for gamma-carboxylation of vitamin K-dependent proteins, which is involved in the coagulation cascade and has a potential impact on atherosclerosis. We hypothesized that VKORC1-dependent effects on the coagulation cascade and atherosclerosis would contribute to susceptibility for vascular diseases. To test the hypothesis, we studied the association of polymorphisms of VKORC1 with stroke (1811 patients), coronary heart disease (740 patients), and aortic dissection (253 patients) compared with matched controls (n=1811, 740, and 416, respectively). Five common noncoding single-nucleotide polymorphisms of VKORC1 were identified in a natural haplotype block with strong linkage disequilibrium (D'>0.9, r2>0.9), then single-nucleotide polymorphism (SNP) +2255 in the block was selected for the association study. We found that the presence of the C allele of the +2255 locus conferred almost twice the risk of vascular disease (odds ratio [OR] 1.95, 95% confidence interval [CI] 1.58 to 2.41, P<0.001 for stroke; OR 1.72, 95% CI 1.24 to 2.38, P<0.01 for coronary heart disease; and OR 1.90, 95% CI 1.04 to 3.48, P<0.05 for aortic dissection). We also observed that subjects with the CC and CT genotypes had lower levels of undercarboxylated osteocalcin (a regulator for the bone), probably vascular calcification, and lower levels of protein induced in vitamin K absence or antagonism II (PIVKA-II, a des-gamma-carboxy prothrombin) than those with TT genotypes. The haplotype of VKORC1 may serve as a novel genetic marker for the risk of stroke, coronary heart disease, and aortic dissection.
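
    Odds ratios with 95% confidence intervals, as quoted in the abstract above, are commonly derived from a 2x2 table of carrier counts. The sketch below uses the standard log-scale (Woolf) approximation as one way to obtain such an interval. It is an illustration only: the abstract does not state which method the authors used, and the counts in the example are hypothetical.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio and approximate confidence interval from a 2x2 table.

    a: cases carrying the allele      b: cases not carrying it
    c: controls carrying the allele   d: controls not carrying it
    z: normal quantile (1.96 gives an approximate 95% interval)
    """
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) under the Woolf approximation.
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    log_or = math.log(odds_ratio)
    return odds_ratio, math.exp(log_or - z * se), math.exp(log_or + z * se)


# Hypothetical counts, not taken from the study.
or_, lo, hi = odds_ratio_ci(a=20, b=80, c=10, d=90)
print(f"OR = {or_:.2f}, 95% CI {lo:.2f} to {hi:.2f}")
```

    An interval built this way is symmetric around the odds ratio on the log scale, so the product of its two bounds is close to the square of the odds ratio. That property gives a quick plausibility check on a reported interval.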

  10. TIA: algorithms for development of identity-linked SNP islands for analysis by massively parallel DNA sequencing.

    PubMed

    Farris, M Heath; Scott, Andrew R; Texter, Pamela A; Bartlett, Marta; Coleman, Patricia; Masters, David

    2018-04-11

    Single nucleotide polymorphisms (SNPs) located within the human genome have been shown to have utility as markers of identity in the differentiation of DNA from individual contributors. Massively parallel DNA sequencing (MPS) technologies and human genome SNP databases allow for the design of suites of identity-linked target regions, amenable to sequencing in a multiplexed and massively parallel manner. Therefore, tools are needed for leveraging the genotypic information found within SNP databases for the discovery of genomic targets that can be evaluated on MPS platforms. The SNP island target identification algorithm (TIA) was developed as a user-tunable system to leverage SNP information within databases. Using data within the 1000 Genomes Project SNP database, human genome regions were identified that contain globally ubiquitous identity-linked SNPs and that were responsive to targeted resequencing on MPS platforms. Algorithmic filters were used to exclude target regions that did not conform to user-tunable SNP island target characteristics. To validate the accuracy of TIA for discovering these identity-linked SNP islands within the human genome, SNP island target regions were amplified from 70 contributor genomic DNA samples using the polymerase chain reaction. Multiplexed amplicons were sequenced using the Illumina MiSeq platform, and the resulting sequences were analyzed for SNP variations. 166 putative identity-linked SNPs were targeted in the identified genomic regions. Of the 309 SNPs that provided discerning power across individual SNP profiles, 74 previously undefined SNPs were identified during evaluation of targets from individual genomes. Overall, DNA samples of 70 individuals were uniquely identified using a subset of the suite of identity-linked SNP islands. TIA offers a tunable genome search tool for the discovery of targeted genomic regions that are scalable in the population frequency and numbers of SNPs contained within the SNP island regions.

  11. Mitochondrial haplotype variation and phylogeography of Iberian brown trout populations.

    PubMed

    MacHordom, A; Suárez, J; Almodóvar, A; Bautista, J M

    2000-09-01

    The biogeographical distribution of brown trout mitochondrial DNA haplotypes throughout the Iberian Peninsula was established by polymerase chain reaction-restriction fragment polymorphism analysis. The study of 507 specimens from 58 localities representing eight widely separated Atlantic-slope (north and west Iberian coasts) and six Mediterranean drainage systems served to identify five main groups of mitochondrial haplotypes: (i) haplotypes corresponding to non-native, hatchery-reared brown trout that were widely distributed but also found in wild populations of northern Spain (Cantabrian slope); (ii) a widespread Atlantic haplotype group; (iii) a haplotype restricted to the Duero Basin; (iv) a haplotype shown by southern Iberian populations; and (v) a Mediterranean haplotype. The Iberian distribution of these haplotypes reflects both the current fishery management policy of introducing non-native brown trout, and Messinian palaeobiogeography. Our findings complement and extend previous allozyme studies on Iberian brown trout and improve present knowledge of glacial refugia and postglacial movement of brown trout lineages.

  12. Haplotype-Based Association Analysis via Variance-Components Score Test

    PubMed Central

    Tzeng, Jung-Ying; Zhang, Daowen

    2007-01-01

    Haplotypes provide a more informative format of polymorphisms for genetic association analysis than do individual single-nucleotide polymorphisms. However, the practical efficacy of haplotype-based association analysis is challenged by a trade-off between the benefits of modeling abundant variation and the cost of the extra degrees of freedom. To reduce the degrees of freedom, several strategies have been considered in the literature. They include (1) clustering evolutionarily close haplotypes, (2) modeling the level of haplotype sharing, and (3) smoothing haplotype effects by introducing a correlation structure for haplotype effects and studying the variance components (VC) for association. Although the first two strategies enjoy a fair extent of power gain, empirical evidence showed that VC methods may exhibit only similar or less power than the standard haplotype regression method, even in cases of many haplotypes. In this study, we report possible reasons that cause the underpowered phenomenon and show how the power of the VC strategy can be improved. We construct a score test based on the restricted maximum likelihood or the marginal likelihood function of the VC and identify its nontypical limiting distribution. Through simulation, we demonstrate the validity of the test and investigate the power performance of the VC approach and that of the standard haplotype regression approach. With suitable choices for the correlation structure, the proposed method can be directly applied to unphased genotypic data. Our method is applicable to a wide-ranging class of models and is computationally efficient and easy to implement. The broad coverage and the fast and easy implementation of this method make the VC strategy an effective tool for haplotype analysis, even in modern genomewide association studies. PMID:17924336

  13. Development and evaluation of the first high-throughput SNP array for common carp (Cyprinus carpio).

    PubMed

    Xu, Jian; Zhao, Zixia; Zhang, Xiaofeng; Zheng, Xianhu; Li, Jiongtang; Jiang, Yanliang; Kuang, Youyi; Zhang, Yan; Feng, Jianxin; Li, Chuangju; Yu, Juhua; Li, Qiang; Zhu, Yuanyuan; Liu, Yuanyuan; Xu, Peng; Sun, Xiaowen

    2014-04-24

    A large number of single nucleotide polymorphisms (SNPs) have been identified in common carp (Cyprinus carpio) but, as yet, no high-throughput genotyping platform is available for this species. C. carpio is an important aquaculture species that accounts for nearly 14% of freshwater aquaculture production worldwide. We have developed an array for C. carpio with 250,000 SNPs and evaluated its performance using samples from various strains of C. carpio. The SNPs used on the array were selected from two resources: the transcribed sequences from RNA-seq data of four strains of C. carpio, and the genome re-sequencing data of five strains of C. carpio. The 250,000 SNPs on the resulting array are distributed evenly across the reference C. carpio genome with an average spacing of 6.6 kb. To evaluate the SNP array, 1,072 C. carpio samples were collected and tested. Of the 250,000 SNPs on the array, 185,150 (74.06%) were found to be polymorphic sites. Genotyping accuracy was checked using genotyping data from a group of full-siblings and their parents, and over 99.8% of the qualified SNPs were found to be reliable. Analysis of the linkage disequilibrium on all samples and on three domestic C. carpio strains revealed that the latter had the longer haplotype blocks. We also evaluated our SNP array on 80 samples from eight species related to C. carpio, in which between 53,526 and 71,984 SNPs were polymorphic. An identity by state analysis divided all the samples into three clusters; most of the C. carpio strains formed the largest cluster. The Carp SNP array described here is the first high-throughput genotyping platform for C. carpio. Our evaluation of this array indicates that it will be valuable for farmed carp and for genetic and population biology studies in C. carpio and related species.

  14. Viability of in-house datamarting approaches for population genetics analysis of SNP genotypes

    PubMed Central

    Amigo, Jorge; Phillips, Christopher; Salas, Antonio; Carracedo, Ángel

    2009-01-01

    Background: Databases containing very large amounts of SNP (Single Nucleotide Polymorphism) data are now freely available for researchers interested in medical and/or population genetics applications. While many of these SNP repositories have implemented data retrieval tools for general-purpose mining, these alone cannot cover the broad spectrum of needs of most medical and population genetics studies. Results: To address this limitation, we have built in-house customized data marts from the raw data provided by the largest public databases. In particular, for population genetics analysis based on genotypes we have built a set of data processing scripts that deal with raw data coming from the major SNP variation databases (e.g. HapMap, Perlegen), stripping them into single genotypes and then grouping them into populations, then merged with additional complementary descriptive information extracted from dbSNP. This allows not only in-house standardization and normalization of the genotyping data retrieved from different repositories, but also the calculation of statistical indices from simple allele frequency estimates to more elaborate genetic differentiation tests within populations, together with the ability to combine population samples from different databases. Conclusion: The present study demonstrates the viability of implementing scripts for handling extensive datasets of SNP genotypes with low computational costs, dealing with certain complex issues that arise from the divergent nature and configuration of the most popular SNP repositories. The information contained in these databases can also be enriched with additional information obtained from other complementary databases, in order to build a dedicated data mart. Updating the data structure is straightforward, as well as permitting easy implementation of new external data and the computation of supplementary statistical indices of interest. PMID:19344481
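
    The abstract above describes splitting raw data into single genotypes, grouping them into populations, and computing indices such as simple allele frequency estimates. A minimal sketch of that last step, for one SNP, could look like the following. The record layout (population label, two-character genotype string) and the population names are assumptions made for illustration; they do not reflect the authors' scripts or any repository format.

```python
from collections import Counter, defaultdict


def allele_frequencies(records):
    """Estimate per-population allele frequencies for one SNP.

    records: iterable of (population, genotype) pairs, where genotype is a
             two-character string such as "AG" (assumed layout).
    Returns {population: {allele: frequency}}.
    """
    counts = defaultdict(Counter)
    for population, genotype in records:
        # Each character of the genotype string contributes one allele.
        counts[population].update(genotype)

    frequencies = {}
    for population, allele_counts in counts.items():
        total = sum(allele_counts.values())
        frequencies[population] = {
            allele: n / total for allele, n in allele_counts.items()
        }
    return frequencies


# Hypothetical genotypes for a single SNP in two made-up populations.
sample = [("POP1", "AA"), ("POP1", "AG"), ("POP2", "GG")]
print(allele_frequencies(sample))
```

    Grouping by population first keeps the frequency estimates separate, so samples drawn from different sources can be combined simply by appending records that share a population label.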

  15. Viability of in-house datamarting approaches for population genetics analysis of SNP genotypes.

    PubMed

    Amigo, Jorge; Phillips, Christopher; Salas, Antonio; Carracedo, Angel

    2009-03-19

    Databases containing very large amounts of SNP (Single Nucleotide Polymorphism) data are now freely available for researchers interested in medical and/or population genetics applications. While many of these SNP repositories have implemented data retrieval tools for general-purpose mining, these alone cannot cover the broad spectrum of needs of most medical and population genetics studies. To address this limitation, we have built in-house customized data marts from the raw data provided by the largest public databases. In particular, for population genetics analysis based on genotypes we have built a set of data processing scripts that deal with raw data coming from the major SNP variation databases (e.g. HapMap, Perlegen), stripping them into single genotypes and then grouping them into populations, then merged with additional complementary descriptive information extracted from dbSNP. This allows not only in-house standardization and normalization of the genotyping data retrieved from different repositories, but also the calculation of statistical indices from simple allele frequency estimates to more elaborate genetic differentiation tests within populations, together with the ability to combine population samples from different databases. The present study demonstrates the viability of implementing scripts for handling extensive datasets of SNP genotypes with low computational costs, dealing with certain complex issues that arise from the divergent nature and configuration of the most popular SNP repositories. The information contained in these databases can also be enriched with additional information obtained from other complementary databases, in order to build a dedicated data mart. Updating the data structure is straightforward, as well as permitting easy implementation of new external data and the computation of supplementary statistical indices of interest.

  16. A Unified Approach to Genotype Imputation and Haplotype-Phase Inference for Large Data Sets of Trios and Unrelated Individuals

    PubMed Central

    Browning, Brian L.; Browning, Sharon R.

    2009-01-01

    We present methods for imputing data for ungenotyped markers and for inferring haplotype phase in large data sets of unrelated individuals and parent-offspring trios. Our methods make use of known haplotype phase when it is available, and our methods are computationally efficient so that the full information in large reference panels with thousands of individuals is utilized. We demonstrate that substantial gains in imputation accuracy accrue with increasingly large reference panel sizes, particularly when imputing low-frequency variants, and that unphased reference panels can provide highly accurate genotype imputation. We place our methodology in a unified framework that enables the simultaneous use of unphased and phased data from trios and unrelated individuals in a single analysis. For unrelated individuals, our imputation methods produce well-calibrated posterior genotype probabilities and highly accurate allele-frequency estimates. For trios, our haplotype-inference method is four orders of magnitude faster than the gold-standard PHASE program and has excellent accuracy. Our methods enable genotype imputation to be performed with unphased trio or unrelated reference panels, thus accounting for haplotype-phase uncertainty in the reference panel. We present a useful measure of imputation accuracy, allelic R2, and show that this measure can be estimated accurately from posterior genotype probabilities. Our methods are implemented in version 3.0 of the BEAGLE software package. PMID:19200528

  17. selectSNP – An R package for selecting SNPs optimal for genetic evaluation

    USDA-ARS's Scientific Manuscript database

    There has been a huge increase in the number of SNPs in the public repositories. This has made it a challenge to design low and medium density SNP panels, which requires careful selection of available SNPs considering many criteria, such as map position, allelic frequency, possible biological functi...

  18. rSNPBase 3.0: an updated database of SNP-related regulatory elements, element-gene pairs and SNP-based gene regulatory networks.

    PubMed

    Guo, Liyuan; Wang, Jing

    2018-01-04

    Here, we present the updated rSNPBase 3.0 database (http://rsnp3.psych.ac.cn), which provides human SNP-related regulatory elements, element-gene pairs and SNP-based regulatory networks. This database is the updated version of the SNP regulatory annotation database rSNPBase and rVarBase. In comparison to the last two versions, there are both structural and data adjustments in rSNPBase 3.0: (i) The most significant new feature is the expansion of analysis scope from SNP-related regulatory elements to include regulatory element-target gene pairs (E-G pairs), so that it can provide SNP-based gene regulatory networks. (ii) Web function was modified according to data content and a new network search module is provided in the rSNPBase 3.0 in addition to the previous regulatory SNP (rSNP) search module. The two search modules support data query for detailed information (related-elements, element-gene pairs, and other extended annotations) on specific SNPs and SNP-related graphic networks constructed by interacting transcription factors (TFs), miRNAs and genes. (iii) The type of regulatory elements was modified and enriched. To the best of our knowledge, the updated rSNPBase 3.0 is the first data tool that supports SNP functional analysis from a regulatory network perspective; it will provide both a comprehensive understanding and concrete guidance for SNP-related regulatory studies. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  19. Conserved extended haplotypes discriminate HLA-DR3-homozygous Basque patients with type 1 diabetes mellitus and celiac disease.

    PubMed

    Bilbao, J R; Calvo, B; Aransay, A M; Martin-Pagola, A; Perez de Nanclares, G; Aly, T A; Rica, I; Vitoria, J C; Gaztambide, S; Noble, J; Fain, P R; Awdeh, Z L; Alper, C A; Castaño, L

    2006-10-01

    The major susceptibility locus for type 1 diabetes mellitus (T1D) maps to the human lymphocyte antigen (HLA) class II region in the major histocompatibility complex on chromosome 6p21. In southern European populations, like the Basques, the greatest risk to T1D is associated with DR3 homo- and heterozygosity and is comparable to that of DR3/DR4, the highest risk genotype in northern European populations. Celiac disease (CD) is another DR3-associated autoimmune disorder showing certain overlap with T1D that has been explained by the involvement of common genetic determinants, a situation more frequent in DR3-rich populations, like the Basques. As both T1D- and CD-associated HLA alleles are part of conserved extended haplotypes (CEH), we compared DR3-homozygous T1D and CD patients to determine whether CEHs were equally distributed between both disorders or there was a differential contribution of different haplotypes. We observed a very pronounced distribution bias (P<10(-5)) of the two major DR3 CEHs, with DR3-B18 predominating in T1D and DR3-B8 in CD. Additionally, high-density single nucleotide polymorphism (SNP) analysis of the complete CEH [A*30-B*18-MICA*4-F1C30-DRB1*0301-DQB1*0201-DPB1*0202] revealed extraordinary conservation throughout the 4.9 Mbp analyzed supporting the existence of additional diabetogenic variants (other than HLA-DRB1*0301-DQB1*0201), conserved within the DR3-B18 CEH (but not in other DR3 haplotypes) that could explain its enhanced diabetogenicity.

  20. Restricted dog leucocyte antigen (DLA) class II haplotypes and genotypes in Beagles.

    PubMed

    Soutter, Francesca; Kennedy, Lorna J; Ollier, William E R; Solano-Gallego, Laia; Catchpole, Brian

    2015-03-01

    Beagles are commonly used in vaccine trials as part of the regulatory approval process. Genetic restriction within this breed and the impact this might have on vaccine responses are rarely considered. This study was designed to characterise diversity of dog leucocyte antigen (DLA) class II genes in a breeding colony of laboratory Beagles, whose offspring are used in vaccine studies. DLA haplotypes were determined by PCR and sequence-based typing from genomic DNA extracted from blood. Breeding colony Beagles had significantly different DLA haplotype frequencies in comparison with pet Beagles and both groups showed limited DLA diversity. Restricted DLA class II genetic variability within Beagles might result in selective antigen presentation and vaccine responses that are not necessarily representative of those seen in other dog breeds. Copyright © 2014 The Authors. Published by Elsevier Ltd. All rights reserved.

  1. Spatial and temporal distribution of the neutral polymorphisms in the last ZFX intron: analysis of the haplotype structure and genealogy.

    PubMed Central

    Jaruzelska, J; Zietkiewicz, E; Batzer, M; Cole, D E; Moisan, J P; Scozzari, R; Tavaré, S; Labuda, D

    1999-01-01

    With 10 segregating sites (simple nucleotide polymorphisms) in the last intron (1089 bp) of the ZFX gene we have observed 11 haplotypes in 336 chromosomes representing a worldwide array of 15 human populations. Two haplotypes representing 77% of all chromosomes were distributed almost evenly among four continents. Five of the remaining haplotypes were detected in Africa and 4 others were restricted to Eurasia and the Americas. Using the information about the ancestral state of the segregating positions (inferred from human-great ape comparisons), we applied coalescent analysis to estimate the age of the polymorphisms and the resulting haplotypes. The oldest haplotype, with the ancestral alleles at all the sites, was observed at low frequency only in two groups of African origin. Its estimated age of 740 to 1100 kyr corresponded to the time to the most recent common ancestor. The two most frequent worldwide distributed haplotypes were estimated at 550 to 840 and 260 to 400 kyr, respectively, while the age of the continentally restricted polymorphisms was 120 to 180 kyr and smaller. Comparison of spatial and temporal distribution of the ZFX haplotypes suggests that modern humans diverged from the common ancestral stock in the Middle Paleolithic era. Subsequent range expansion prevented substantial gene flow among continents, separating African groups from populations that colonized Eurasia and the New World. PMID:10388827

  2. Spatial and temporal distribution of the neutral polymorphisms in the last ZFX intron: analysis of the haplotype structure and genealogy.

    PubMed

    Jaruzelska, J; Zietkiewicz, E; Batzer, M; Cole, D E; Moisan, J P; Scozzari, R; Tavaré, S; Labuda, D

    1999-07-01

    With 10 segregating sites (simple nucleotide polymorphisms) in the last intron (1089 bp) of the ZFX gene we have observed 11 haplotypes in 336 chromosomes representing a worldwide array of 15 human populations. Two haplotypes representing 77% of all chromosomes were distributed almost evenly among four continents. Five of the remaining haplotypes were detected in Africa and 4 others were restricted to Eurasia and the Americas. Using the information about the ancestral state of the segregating positions (inferred from human-great ape comparisons), we applied coalescent analysis to estimate the age of the polymorphisms and the resulting haplotypes. The oldest haplotype, with the ancestral alleles at all the sites, was observed at low frequency only in two groups of African origin. Its estimated age of 740 to 1100 kyr corresponded to the time to the most recent common ancestor. The two most frequent worldwide distributed haplotypes were estimated at 550 to 840 and 260 to 400 kyr, respectively, while the age of the continentally restricted polymorphisms was 120 to 180 kyr and smaller. Comparison of spatial and temporal distribution of the ZFX haplotypes suggests that modern humans diverged from the common ancestral stock in the Middle Paleolithic era. Subsequent range expansion prevented substantial gene flow among continents, separating African groups from populations that colonized Eurasia and the New World.

  3. Sparse Tensor Decomposition for Haplotype Assembly of Diploids and Polyploids.

    PubMed

    Hashemi, Abolfazl; Zhu, Banghua; Vikalo, Haris

    2018-03-21

    Haplotype assembly is the task of reconstructing haplotypes of an individual from a mixture of sequenced chromosome fragments. Haplotype information enables studies of the effects of genetic variations on an organism's phenotype. Most of the mathematical formulations of haplotype assembly are known to be NP-hard and haplotype assembly becomes even more challenging as the sequencing technology advances and the length of the paired-end reads and inserts increases. Assembly of haplotypes of polyploid organisms is considerably more difficult than in the case of diploids. Hence, scalable and accurate schemes with provable performance are desired for haplotype assembly of both diploid and polyploid organisms. We propose a framework that formulates haplotype assembly from sequencing data as a sparse tensor decomposition. We cast the problem as that of decomposing a tensor having special structural constraints and missing a large fraction of its entries into a product of two factors, U and [Formula: see text]; tensor [Formula: see text] reveals haplotype information while U is a sparse matrix encoding the origin of erroneous sequencing reads. An algorithm, AltHap, which reconstructs haplotypes of either diploid or polyploid organisms by iteratively solving this decomposition problem, is proposed. The performance and convergence properties of AltHap are theoretically analyzed and, in doing so, guarantees on the achievable minimum error correction scores and correct phasing rate are established. The developed framework is applicable to diploid, biallelic and polyallelic polyploid species. The code for AltHap is freely available from https://github.com/realabolfazl/AltHap. AltHap was tested in a number of different scenarios and was shown to compare favorably to state-of-the-art methods in applications to haplotype assembly of diploids, and to significantly outperform existing techniques when applied to haplotype assembly of polyploids.
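
    The minimum error correction (MEC) score mentioned in this abstract is commonly defined as the smallest number of read entries that must be flipped so that every read agrees with one of the candidate haplotypes. The sketch below computes that score for a toy set of reads; it is only an illustration of the metric, not AltHap's implementation, and the read and haplotype strings are invented for the example.

```python
from typing import Sequence


def mismatches(read: str, haplotype: str) -> int:
    """Count disagreements between a read and a haplotype.

    Sites the read does not cover are written as '-' and are ignored.
    """
    return sum(1 for r, h in zip(read, haplotype) if r != "-" and r != h)


def mec_score(reads: Sequence[str], haplotypes: Sequence[str]) -> int:
    """Sum, over all reads, the mismatches to the closest haplotype.

    `haplotypes` may hold two strings (diploid) or more (polyploid).
    """
    return sum(min(mismatches(read, hap) for hap in haplotypes) for read in reads)


# Hypothetical toy data: four SNP sites, alleles coded 0/1, '-' = not covered.
toy_reads = ["01-0", "0100", "10-1", "1-11", "-111"]
toy_haplotypes = ("0100", "1011")

print(mec_score(toy_reads, toy_haplotypes))
```

    In this toy case only the last read disagrees with its closest haplotype, at a single site, so the printed score is 1.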

  4. Association between platelet P2Y12 haplotype and risk of cardiovascular events in chronic coronary disease.

    PubMed

    Schettert, Isolmar T; Pereira, Alexandre C; Lopes, Neuza H; Hueb, Whady A; Krieger, Jose E

    2006-01-01

    A positive association was recently described between P2Y12 platelet receptor H1 and H2 haplotypes and peripheral artery disease. We tested the described P2Y12 receptor haplotypes in a group of patients with coronary artery disease. The P2Y12 platelet receptor H1 and H2 haplotypes were tested in a group of 540 patients enrolled in the Medical, Angioplasty, or Surgery Study II (MASS II), a randomized trial comparing treatments for patients with coronary artery disease (CAD) and preserved left ventricular function. After a 3-year follow-up period, the incidence of the composite end point of cardiac death, myocardial infarction, and refractory angina requiring revascularization was determined in the H1/H1, H1/H2 and H2/H2 haplotype groups. We used Student's t-test and the chi-square test to analyze the differences among groups and the Kaplan-Meier method to calculate survival curves. Risk was assessed with the use of a Cox proportional-hazards model. The frequencies of haplotype groups among the studied patients were 410 (75.9%) H1/H1, 119 (22.0%) H1/H2 and 11 (2.1%) H2/H2. The baseline clinical characteristics, mean clinical follow-up time and received treatment of each genotype group were similar. We did not find any association between haplotype group and the incidence of any of the studied cardiovascular end-points. This is the first report studying the association of P2Y12 platelet receptor H1 and H2 haplotypes and cardiovascular events. Our findings do not provide evidence for a strong association between H1/H1 and H1/H2 haplotypes and an increased risk of cardiovascular events in a population with CAD. Future work should address the role of the H2/H2 haplotype as a genetic marker for cardiovascular events.
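
    The haplotype-group counts reported in this abstract (410 H1/H1, 119 H1/H2, 11 H2/H2) determine the H1 and H2 haplotype frequencies by simple chromosome counting. The sketch below shows that arithmetic; the function names are illustrative, and the Hardy-Weinberg expected counts are an added illustration, not an analysis the abstract reports.

```python
from typing import Tuple


def haplotype_frequencies(n_11: int, n_12: int, n_22: int) -> Tuple[float, float]:
    """Return (freq_H1, freq_H2) from counts of H1/H1, H1/H2 and H2/H2 carriers."""
    chromosomes = 2 * (n_11 + n_12 + n_22)
    freq_h1 = (2 * n_11 + n_12) / chromosomes
    return freq_h1, 1.0 - freq_h1


def hardy_weinberg_expected(n_11: int, n_12: int, n_22: int) -> Tuple[float, float, float]:
    """Expected H1/H1, H1/H2, H2/H2 counts under Hardy-Weinberg proportions."""
    n = n_11 + n_12 + n_22
    p, q = haplotype_frequencies(n_11, n_12, n_22)
    return n * p * p, 2 * n * p * q, n * q * q


# Counts taken from the abstract (540 patients in total).
p_h1, p_h2 = haplotype_frequencies(410, 119, 11)
print(f"H1 = {p_h1:.4f}, H2 = {p_h2:.4f}")
print("expected counts:", [round(x, 1) for x in hardy_weinberg_expected(410, 119, 11)])
```

    With these counts, H1 accounts for 939 of the 1080 chromosomes (about 0.87) and H2 for the remaining 141 (about 0.13).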

  5. A custom correlation coefficient (CCC) approach for fast identification of multi-SNP association patterns in genome-wide SNPs data.

    PubMed

    Climer, Sharlee; Yang, Wei; de las Fuentes, Lisa; Dávila-Román, Victor G; Gu, C Charles

    2014-11-01

    Complex diseases are often associated with sets of multiple interacting genetic factors and possibly with unique sets of the genetic factors in different groups of individuals (genetic heterogeneity). We introduce a novel concept of custom correlation coefficient (CCC) between single nucleotide polymorphisms (SNPs) that addresses genetic heterogeneity by measuring subset correlations autonomously. It is used to develop a 3-step process to identify candidate multi-SNP patterns: (1) pairwise (SNP-SNP) correlations are computed using CCC; (2) clusters of so-correlated SNPs are identified; and (3) frequencies of these clusters in disease cases and controls are compared to identify disease-associated multi-SNP patterns. This method identified 42 candidate multi-SNP associations with hypertensive heart disease (HHD), among which one cluster of 22 SNPs (six genes) included 13 in SLC8A1 (aka NCX1, an essential component of cardiac excitation-contraction coupling) and another of 32 SNPs had 29 from a different segment of SLC8A1. While allele frequencies show little difference between cases and controls, the cluster of 22 associated alleles was found in 20% of controls but no cases and the other in 3% of controls but 20% of cases. These suggest that both protective and risk effects on HHD could be exerted by combinations of variants in different regions of SLC8A1, modified by variants from other genes. The results demonstrate that this new correlation metric identifies disease-associated multi-SNP patterns overlooked by commonly used correlation measures. Furthermore, computation time using CCC is a small fraction of that required by other methods, thereby enabling the analyses of large GWAS datasets. © 2014 WILEY PERIODICALS, INC.

  6. A custom correlation coefficient (CCC) approach for fast identification of multi-SNP association patterns in genome-wide SNPs data

    PubMed Central

    Climer, Sharlee; Yang, Wei; de las Fuentes, Lisa; Dávila-Román, Victor G.; Gu, C. Charles

    2014-01-01

    Complex diseases are often associated with sets of multiple interacting genetic factors and possibly with unique sets of the genetic factors in different groups of individuals (genetic heterogeneity). We introduce a novel concept of Custom Correlation Coefficient (CCC) between single nucleotide polymorphisms (SNPs) that addresses genetic heterogeneity by measuring subset correlations autonomously. It is used to develop a 3-step process to identify candidate multi-SNP patterns: (1) pairwise (SNP-SNP) correlations are computed using CCC; (2) clusters of so-correlated SNPs are identified; and (3) frequencies of these clusters in disease cases and controls are compared to identify disease-associated multi-SNP patterns. This method identified 42 candidate multi-SNP associations with hypertensive heart disease (HHD), among which one cluster of 22 SNPs (6 genes) included 13 in SLC8A1 (aka NCX1, an essential component of cardiac excitation-contraction coupling) and another of 32 SNPs had 29 from a different segment of SLC8A1. While allele frequencies show little difference between cases and controls, the cluster of 22 associated alleles was found in 20% of controls but no cases and the other in 3% of controls but 20% of cases. These suggest that both protective and risk effects on HHD could be exerted by combinations of variants in different regions of SLC8A1, modified by variants from other genes. The results demonstrate that this new correlation metric identifies disease-associated multi-SNP patterns overlooked by commonly used correlation measures. Furthermore, computation time using CCC is a small fraction of that required by other methods, thereby enabling the analyses of large GWAS datasets. PMID:25168954

  7. [Comparative analysis of STR and SNP polymorphism in the populations of sockeye salmon (Oncorhynchus nerka) from Eastern and Western Kamchatka].

    PubMed

    Khrustaleva, A M; Volkov, A A; Stoklitskaia, D S; Miuge, N S; Zelenina, D A

    2010-11-01

    Sockeye salmon samples from the five largest lacustrine-riverine systems of the Kamchatka Peninsula were tested for polymorphism at six microsatellite (STR) and five single nucleotide polymorphism (SNP) loci. Statistically significant genetic differentiation among local populations from the examined part of the species range was demonstrated. The data presented point to pronounced genetic divergence of the populations from two geographical regions, Eastern and Western Kamchatka. For sockeye salmon, the accuracy of the individual identification test was higher for microsatellites than for a similar number of SNP markers. Pooling of the STR and SNP allele frequency data sets provided the highest accuracy of individual fish population assignment.

  8. Genetic Variation in TLR Genes in Ugandan and South African Populations and Comparison with HapMap Data

    PubMed Central

    Randhawa, April Kaur; Horne, David J.; Adams, Mark D.; Shey, Muki; Barnholtz-Sloan, Jill; Mayanja-Kizza, Harriet; Kaplan, Gilla; Hanekom, Willem A.; Boom, W. Henry; Hawn, Thomas R.; Stein, Catherine M.

    2012-01-01

    Genetic epidemiological studies of complex diseases often rely on data from the International HapMap Consortium for identification of single nucleotide polymorphisms (SNPs), particularly those that tag haplotypes. However, little is known about the relevance of the African populations used to collect HapMap data for study populations conducted elsewhere in Africa. Toll-like receptor (TLR) genes play a key role in susceptibility to various infectious diseases, including tuberculosis. We conducted full-exon sequencing in samples obtained from Uganda (n = 48) and South Africa (n = 48), in four genes in the TLR pathway: TLR2, TLR4, TLR6, and TIRAP. We identified one novel TIRAP SNP (with minor allele frequency [MAF] 3.2%) and a novel TLR6 SNP (MAF 8%) in the Ugandan population, and a TLR6 SNP that is unique to the South African population (MAF 14%). These SNPs were also not present in the 1000 Genomes data. Genotype and haplotype frequencies and linkage disequilibrium patterns in Uganda and South Africa were similar to African populations in the HapMap datasets. Multidimensional scaling analysis of polymorphisms in all four genes suggested broad overlap of all of the examined African populations. Based on these data, we propose that there is enough similarity among African populations represented in the HapMap database to justify initial SNP selection for genetic epidemiological studies in Uganda and South Africa. We also discovered three novel polymorphisms that appear to be population-specific and would only be detected by sequencing efforts. PMID:23112821

  9. A DRD1 haplotype is associated with risk for autism spectrum disorders in male-only affected sib-pair families.

    PubMed

    Hettinger, Joe A; Liu, Xudong; Schwartz, Charles E; Michaelis, Ron C; Holden, Jeanette J A

    2008-07-05

    Individuals with autism spectrum disorders (ASDs) have impairments in executive function and social cognition, with males generally being more severely affected in these areas than females. Because the dopamine D1 receptor (encoded by DRD1) is integral to the neural circuitry mediating these processes, we examined the DRD1 gene for its role in susceptibility to ASDs by performing single marker and haplotype case-control comparisons, family-based association tests, and genotype-phenotype assessments (quantitative transmission disequilibrium tests: QTDT) using three DRD1 polymorphisms, rs265981C/T, rs4532A/G, and rs686T/C. Our previous findings suggested that the dopaminergic system may be more integrally involved in families with affected males only than in other families. We therefore restricted our study to families with two or more affected males (N = 112). There was over-transmission of rs265981-C and rs4532-A in these families (P = 0.040, P = 0.038), with haplotype TDT analysis showing over-transmission of the C-A-T haplotype (P = 0.022) from mothers to affected sons (P = 0.013). In addition, haplotype case-control comparisons revealed an increase of this putative risk haplotype in affected individuals relative to a comparison group (P = 0.004). QTDT analyses showed associations of the rs265981-C, rs4532-A, rs686-T alleles, and the C-A-T haplotype with more severe problems in social interaction, greater difficulties with nonverbal communication and increased stereotypies compared to individuals with other haplotypes. Preferential haplotype transmission of markers at the DRD1 locus and an increased frequency of a specific haplotype support the DRD1 gene as a risk gene for core symptoms of ASD in families having only affected males. Copyright 2008 Wiley-Liss, Inc.
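
    The transmission disequilibrium test (TDT) used in this abstract compares how often heterozygous parents transmit a given allele or haplotype versus how often they do not. A minimal sketch of the classic single-allele TDT statistic follows. The transmission counts are hypothetical, since the abstract does not report them, and the haplotype TDT software used in the study may differ in detail.

```python
import math
from typing import Tuple


def tdt(transmitted: int, untransmitted: int) -> Tuple[float, float]:
    """Classic TDT: McNemar-type chi-square with 1 degree of freedom.

    `transmitted` / `untransmitted` count transmissions of the allele of
    interest from heterozygous parents to affected offspring.
    """
    total = transmitted + untransmitted
    if total == 0:
        raise ValueError("no informative transmissions")
    chi2 = (transmitted - untransmitted) ** 2 / total
    # Upper-tail probability of a chi-square(1) variable: erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


# Hypothetical counts, for illustration only.
chi2, p_value = tdt(transmitted=60, untransmitted=40)
print(f"chi2 = {chi2:.2f}, P = {p_value:.4f}")
```

    Under the null hypothesis of no linkage or association, transmitted and untransmitted counts are expected to be equal, so a large imbalance yields a small P value.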

  10. [Association between CETP polymorphisms and haplotypes with dyslipidemia in Xinjiang Uygur and Kazak residents].

    PubMed

    Hu, Y H; Liu, J M; Zhang, M; He, J; Yan, Y Z; Ma, J L; Ma, R L; Guo, H; Rui, D S; Sun, F; Mu, L L; Niu, Q; Ding, Y S; Zhang, J Y; Li, S G; Guo, S X

    2016-08-24

    To explore the relationship between polymorphisms and haplotypes in the CETP gene and dyslipidemia among Xinjiang Kazak and Uygur residents. A population status survey was performed from 2010 to 2011 among Uygur and Kazak residents of Kashgar, Xinjiang. A stratified cluster sampling method was used to select Uygur and Kazak residents with abnormal blood lipid values (n=367 and 345, respectively) as the dyslipidemia groups, and residents with normal lipid values from the same area as the control groups (n=374 and 390, respectively). SNaPshot technology was applied to genotype the CETP loci rs3764261, rs1800775, rs708272 and rs5882 in all selected residents, and linkage disequilibrium analysis and haplotype construction were performed. (1) In Uygur residents, the dyslipidemia risk of the rs708272 CT (OR=0.64, 95%CI 0.46-0.91, P=0.01) and TT genotypes (OR=0.60, 95%CI 0.40-0.91, P=0.02) was significantly lower than that of the CC genotype. Dyslipidemia risk of the rs3764261 GT (OR=0.55, 95%CI 0.40-0.74, P=0.00) and TT genotypes (OR=0.47, 95%CI 0.28-0.78, P<0.01) was significantly lower than that of the GG genotype. Dyslipidemia risk of the rs1800775 CC genotype was higher than that of the AA genotype (OR=1.79, 95%CI 1.17-2.74, P=0.01). There were no statistically significant differences in genotype or allele frequencies at the 4 CETP loci between the dyslipidemia and normal lipid groups in Kazak residents (all P>0.05). (2) In Uygur residents with dyslipidemia, HDL-C level was significantly higher in rs708272 TT genotype carriers than in CC and CT genotype carriers (all P<0.05) and in rs3764261 TT genotype carriers than in GG genotype carriers (P=0.008), while it was significantly lower in rs1800775 CC genotype carriers than in AA genotype carriers (P=0.008). (3) Linkage disequilibrium analysis showed that there was strong linkage disequilibrium between rs3764261 and rs708272 (D'=0.869, r²=0.869), rs1800775 and rs708272 (D'=0.845, r²=0.446) in Uygur residents, and there was strong linkage disequilibrium between rs3764261 and rs

  11. Investigation of the Annexin A5 M2 haplotype in 500 white European couples who have experienced recurrent spontaneous abortion.

    PubMed

    Demetriou, Charalambos; Abu-Amero, Sayeda; White, Shawnelle; Peskett, Emma; Markoff, Arseni; Stanier, Philip; Moore, Gudrun E; Regan, Lesley

    2015-11-01

    Annexin A5 is a placental anti-coagulant protein that contains four nucleotide substitutions (M2 haplotype) in its promoter. This haplotype is a risk factor for recurrent spontaneous abortion (RSA). The influence of the M2 haplotype in the gestational timing of spontaneous abortions, paternal risk and relationships with known risk factors were investigated. European couples (n = 500) who had experienced three or more consecutive spontaneous abortions, and two fertile control groups, were selected for this study. The allele frequency of M2 was significantly higher among patients who had experienced early RSA than among controls (P = 0.002). No difference was found between controls and patients who had undergone late spontaneous abortions. No difference was found between patients who had experienced RSA who had a live birth or no live births, or between patients who were positive or negative for known risk factors. Male and female partners in each group had similar allele frequencies of M2. The M2 haplotype is a risk factor for early spontaneous abortions, before the 12th week of gestation, and confers about the same relative risk to carriers of both sexes. Having one or more M2 allele(s) in combination with other risk factors further increases the RSA risk. Copyright © 2015 Reproductive Healthcare Ltd. Published by Elsevier Ltd. All rights reserved.

  12. Prion gene haplotypes of U.S. cattle

    PubMed Central

    Clawson, Michael L; Heaton, Michael P; Keele, John W; Smith, Timothy PL; Harhay, Gregory P; Laegreid, William W

    2006-01-01

    Background: Bovine spongiform encephalopathy (BSE) is a fatal neurological disorder characterized by abnormal deposits of a protease-resistant isoform of the prion protein. Characterizing linkage disequilibrium (LD) and haplotype networks within the bovine prion gene (PRNP) is important for 1) testing rare or common PRNP variation for an association with BSE and 2) interpreting any association of PRNP alleles with BSE susceptibility. The objective of this study was to identify polymorphisms and haplotypes within PRNP from the promoter region through the 3'UTR in a diverse sample of U.S. cattle genomes. Results: A 25.2-kb genomic region containing PRNP was sequenced from 192 diverse U.S. beef and dairy cattle. Sequence analyses identified 388 total polymorphisms, of which 287 have not previously been reported. The polymorphism alleles define PRNP by regions of high and low LD. High LD is present between alleles in the promoter region through exon 2 (6.7 kb). PRNP alleles within the majority of intron 2, the entire coding sequence and the untranslated region of exon 3 are in low LD (18.0 kb). Two haplotype networks, one representing the region of high LD and the other the region of low LD, yielded nineteen different combinations that represent haplotypes spanning PRNP. The haplotype combinations are tagged by 19 polymorphisms (htSNPs) which characterize variation within and across PRNP. Conclusion: The number of polymorphisms in the prion gene region of U.S. cattle is nearly four times greater than previously described. These polymorphisms define PRNP haplotypes that may influence BSE susceptibility in cattle. PMID:17092337
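
    Linkage disequilibrium between two biallelic sites, as characterized in this abstract, is usually summarized by the coefficient D and its normalized forms D' and r². The sketch below computes those standard quantities from four haplotype frequencies. The frequencies are hypothetical and are not taken from the PRNP data.

```python
from typing import Tuple


def ld_stats(p_AB: float, p_Ab: float, p_aB: float, p_ab: float) -> Tuple[float, float, float]:
    """Return (D, D', r^2) for two biallelic loci from haplotype frequencies."""
    total = p_AB + p_Ab + p_aB + p_ab
    if abs(total - 1.0) > 1e-9:
        raise ValueError("haplotype frequencies must sum to 1")
    p_A = p_AB + p_Ab            # frequency of allele A at locus 1
    p_B = p_AB + p_aB            # frequency of allele B at locus 2
    d = p_AB - p_A * p_B         # raw disequilibrium coefficient
    # D' divides |D| by the largest value it could take given the allele frequencies.
    if d >= 0:
        d_max = min(p_A * (1 - p_B), (1 - p_A) * p_B)
    else:
        d_max = min(p_A * p_B, (1 - p_A) * (1 - p_B))
    d_prime = abs(d) / d_max if d_max > 0 else 0.0
    denom = p_A * (1 - p_A) * p_B * (1 - p_B)
    r2 = d * d / denom if denom > 0 else 0.0
    return d, d_prime, r2


# Hypothetical haplotype frequencies, for illustration only.
d, d_prime, r2 = ld_stats(p_AB=0.4, p_Ab=0.1, p_aB=0.1, p_ab=0.4)
print(f"D = {d:.3f}, D' = {d_prime:.3f}, r2 = {r2:.3f}")
```

    D' reaches 1 whenever at least one of the four haplotypes is absent, whereas r² reaches 1 only when the two sites carry identical information, which is why the two measures can differ substantially for the same pair of sites.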

  13. Allelic and haplotypic diversity of HLA-A, -B, -C, -DRB1, and -DQB1 genes in the Korean population.

    PubMed

    Lee, K W; Oh, D H; Lee, C; Yang, S Y

    2005-05-01

    High-resolution human leukocyte antigen (HLA) typing exposes the unique patterns of HLA allele and haplotype frequencies in each population. In this study, HLA-A, -B, -C, -DRB1, and -DQB1 genotypes were analyzed in 485 apparently unrelated healthy Korean individuals. A total of 20 HLA-A, 43 HLA-B, 21 HLA-C, 31 HLA-DRB1, and 14 HLA-DQB1 alleles were identified. Eleven alleles (A*0201, A*1101, A*2402, A*3303, B*1501, Cw*0102, Cw*0302, Cw*0303, DQB1*0301, DQB1*0302, and DQB1*0303) were found in more than 10% of the population. In each serologic group, a maximum of three alleles were found with several exceptions (A2, B62, DR4, DR14, and DQ6). In each serologic group exhibiting multiple alleles, two major alleles were present at 62-96% (i.e. A*0201 and A*0206 comprise 85% of A2-positive alleles). Multiple-locus haplotypes estimated by the maximum likelihood method revealed 51 A-C, 43 C-B, 52 B-DRB1, 34 DRB1-DQB1, 48 A-C-B, 42 C-B-DRB1, 46 B-DRB1-DQB1, and 30 A-C-B-DRB1-DQB1 haplotypes with frequencies of more than 0.5%. In spite of their high polymorphism in B and DRB1, identification of relatively small numbers of two-locus (B-C and DRB1-DQB1) haplotypes suggested strong associations of those two loci, respectively. Five-locus haplotypes defined by high-resolution DNA typing correlated well with previously identified serology-based haplotypes in the population. The five most frequent haplotypes were: A*3303-Cw*1403-B*4403-DRB1*1302-DQB1*0604 (4.2%), A*3303-Cw*0701/6-B*4403-DRB1*0701-DQB1*0201/2 (3.0%), A*3303-Cw*0302-B*5801-DRB1*1302-DQB1*0609 (3.0%), A*2402-Cw*0702-B*0702-DRB1*0101-DQB1*0501 (2.9%), and A*3001-Cw*0602-B*1302-DRB1*0701-DQB1*0201/2 (2.7%). Several sets of allele level haplotypes that could not be discriminated by routine HLA-A, -B, and -DRB1 low-resolution typing originated from allelic diversity of A2, B61, DR4, and DR8 serologic groups. Information obtained in this study will be useful for medical and forensic applications as well as in anthropology.
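
    Maximum likelihood estimation of haplotype frequencies from unphased genotypes, as mentioned in this abstract, is commonly carried out with an expectation-maximization (EM) algorithm. The abstract does not say which algorithm or software was used, so the sketch below is only an assumed illustration, and it is simplified to two biallelic loci (alleles A/a and B/b), whereas HLA loci are highly multiallelic. The genotype counts are invented.

```python
from typing import Dict, Tuple


def em_haplotype_freqs(genotype_counts: Dict[Tuple[int, int], int],
                       iterations: int = 50) -> Dict[str, float]:
    """Estimate AB, Ab, aB, ab frequencies from unphased two-locus genotypes.

    Keys are (copies of A, copies of B), each 0, 1 or 2; values are numbers
    of individuals. Only the double heterozygote (1, 1) has ambiguous phase.
    """
    n_chrom = 2 * sum(genotype_counts.values())
    if n_chrom == 0:
        raise ValueError("no genotypes supplied")
    fixed = {"AB": 0.0, "Ab": 0.0, "aB": 0.0, "ab": 0.0}
    for (i, j), count in genotype_counts.items():
        if (i, j) == (1, 1):
            continue
        # With at most one heterozygous locus the two haplotypes are determined.
        hap1 = ("A" if i >= 1 else "a") + ("B" if j >= 1 else "b")
        hap2 = ("A" if i == 2 else "a") + ("B" if j == 2 else "b")
        fixed[hap1] += count
        fixed[hap2] += count
    n_double_het = genotype_counts.get((1, 1), 0)
    freqs = {hap: 0.25 for hap in fixed}
    for _ in range(iterations):
        # E-step: probability that a double heterozygote is AB/ab rather than Ab/aB.
        cis = freqs["AB"] * freqs["ab"]
        trans = freqs["Ab"] * freqs["aB"]
        w = cis / (cis + trans) if cis + trans > 0 else 0.5
        # M-step: re-count haplotypes using the expected phase assignments.
        counts = dict(fixed)
        counts["AB"] += n_double_het * w
        counts["ab"] += n_double_het * w
        counts["Ab"] += n_double_het * (1 - w)
        counts["aB"] += n_double_het * (1 - w)
        freqs = {hap: c / n_chrom for hap, c in counts.items()}
    return freqs


# Hypothetical genotype counts, for illustration only.
print(em_haplotype_freqs({(2, 2): 10, (0, 0): 10, (1, 1): 10}))
```

    In this toy input every unambiguous individual carries only AB or ab haplotypes, so the EM iterations assign the double heterozygotes to the AB/ab phase and the estimates converge to 0.5 for AB and 0.5 for ab.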

  14. HLA DPA1, DPB1 alleles and haplotypes contribute to the risk associated with type 1 diabetes: analysis of the type 1 diabetes genetics consortium families.

    PubMed

    Varney, Michael D; Valdes, Ana Maria; Carlson, Joyce A; Noble, Janelle A; Tait, Brian D; Bonella, Persia; Lavant, Eva; Fear, Anna Lisa; Louey, Anthony; Moonsamy, Priscilla; Mychaleckyj, Josyf C; Erlich, Henry

    2010-08-01

    To determine the relative risk associated with DPA1 and DPB1 alleles and haplotypes in type 1 diabetes. The frequency of DPA1 and DPB1 alleles and haplotypes in type 1 diabetic patients was compared to the family based control frequency in 1,771 families directly and conditional on HLA (B)-DRB1-DQA1-DQB1 linkage disequilibrium. A relative predispositional analysis (RPA) was performed in the presence or absence of the primary HLA DR-DQ associations and the contribution of DP haplotype to individual DR-DQ haplotype risks examined. Eight DPA1 and thirty-eight DPB1 alleles forming seventy-four DPA1-DPB1 haplotypes were observed; nineteen DPB1 alleles were associated with multiple DPA1 alleles. Following both analyses, type 1 diabetes susceptibility was significantly associated with DPB1*0301 (DPA1*0103-DPB1*0301) and protection with DPB1*0402 (DPA1*0103-DPB1*0402) and DPA1*0103-DPB1*0101 but not DPA1*0201-DPB1*0101. In addition, DPB1*0202 (DPA1*0103-DPB1*0202) and DPB1*0201 (DPA1*0103-DPB1*0201) were significantly associated with susceptibility in the presence of the high risk and protective DR-DQ haplotypes. Three associations (DPB1*0301, *0402, and *0202) remained statistically significant when only the extended HLA-A1-B8-DR3 haplotype was considered, suggesting that DPB1 alone may delineate the risk associated with this otherwise conserved haplotype. HLA DP allelic and haplotypic diversity contributes significantly to the risk for type 1 diabetes; DPB1*0301 (DPA1*0103-DPB1*0301) is associated with susceptibility and DPB1*0402 (DPA1*0103-DPB1*0402) and DPA1*0103-DPB1*0101 with protection. Additional evidence is presented for the susceptibility association of DPB1*0202 (DPA1*0103-DPB1*0202) and for a contributory role of individual amino acids and DPA1 or a gene in linkage disequilibrium in DR3-DPB1*0101 positive haplotypes.

  15. [A total of 362 HLA different haplotypes and HLA recombination haplotypes based on analysis of their family pedigree in Chinese partial Han populations].

    PubMed

    Gao, Su-Qing; Cheng, Xi; Li, Qian; Li, Yu-Zhu; Deng, Zhi-Hui

    2009-06-01

    This study aimed to discover novel HLA recombinant haplotypes and to investigate the distribution of haplotypes in a Chinese Han population. Based on the HLA-A, B, DRB1 typing results of 179 family members, 791 haplotypes were assigned by the mode of inheritance. A total of 4 novel recombinant haplotypes in the HLA-DRB1 locus region were observed in 4 families, with a paternal to maternal chromosome ratio of 3:1. The recombination ratio between HLA-DRB1 and the HLA-A or B loci was 0.92% (4/433). A total of 362 different HLA-A, -B, -DRB1 haplotypes were confirmed in this partial Chinese Han population. A33-B58-DR17, A2-B46-DR9, A30-B13-DR7, A11-B13-DR15, A11-B75-DR12 and A2-B46-DR14 were the most common haplotypes, consistent with the distribution of HLA alleles in unrelated donors. Haplotypes such as A1-B63-DR12, A29-B46-DR15, A1-B61-DR10, A34-B35-DR9, A29-B54-DR4, A23-B13-DR16 and A34-B62-DR15 were rare haplotypes not yet reported in Chinese populations. It is concluded that HLA-A-B-DRB1 haplotypes can be confirmed by analysis of family pedigrees. The results obtained in this study are basic data for the study of Chinese anthropology, organ transplantation and disease correlation analysis.

  16. Exercise improves adiponectin concentrations irrespective of the adiponectin gene polymorphisms SNP45 and the SNP276 in obese Korean women.

    PubMed

    Lee, Kyoung-Young; Kang, Hyun-Sik; Shin, Yun-A

    2013-03-10

    The effects of exercise on adiponectin levels have been reported to be variable and may be attributable to an interaction between environmental and genetic factors. The single nucleotide polymorphisms (SNPs) SNP45 (T>G) and SNP276 (G>T) of the adiponectin gene are associated with metabolic risk factors including adiponectin levels. We examined whether SNP45 and SNP276 would differentially influence the effect of exercise training in middle-aged women with uncomplicated obesity. We conducted a prospective study in the general community that included 90 Korean women (age 47.0±5.1 years) with uncomplicated obesity. The intervention was aerobic exercise training for 3 months. Body composition, adiponectin levels, and other metabolic risk factors were measured. Prior to exercise training, only body weight differed among the SNP276 genotypes. Exercise training improved body composition, systolic blood pressure, maximal oxygen consumption, high-density lipoprotein cholesterol, and leptin levels. In addition, exercise improved adiponectin levels irrespective of weight gain or loss. However, after adjustments for age, BMI, body fat (%), and waist circumference, no differences were found in obesity-related characteristics (e.g., adiponectin) following exercise training among the SNP45 and SNP276 genotypes. Our findings suggest that aerobic exercise affects adiponectin levels regardless of weight loss and that this effect is not influenced by SNP45 and SNP276 in the adiponectin gene. Crown Copyright © 2012. Published by Elsevier B.V. All rights reserved.

  17. Development of an Italian RM Y-STR haplotype database: Results of the 2013 GEFI collaborative exercise.

    PubMed

    Robino, C; Ralf, A; Pasino, S; De Marchi, M R; Ballantyne, K N; Barbaro, A; Bini, C; Carnevali, E; Casarino, L; Di Gaetano, C; Fabbri, M; Ferri, G; Giardina, E; Gonzalez, A; Matullo, G; Nutini, A L; Onofri, V; Piccinini, A; Piglionica, M; Ponzano, E; Previderè, C; Resta, N; Scarnicci, F; Seidita, G; Sorçaburu-Cigliero, S; Turrina, S; Verzeletti, A; Kayser, M

    2015-03-01

    Recently introduced rapidly mutating Y-chromosomal short tandem repeat (RM Y-STR) loci, displaying a multiple-fold higher mutation rate relative to any other Y-STRs, including those conventionally used in forensic casework, have been demonstrated to improve the resolution of male lineage differentiation and to allow male relative separation usually impossible with standard Y-STRs. However, large and geographically-detailed haplotype frequency databases are required to estimate the statistical weight of RM Y-STR haplotype matches if observed in forensic casework. With this in mind, the Italian Working Group (GEFI) of the International Society for Forensic Genetics launched a collaborative exercise aimed at generating an Italian quality controlled forensic RM Y-STR haplotype database. Overall, 1509 male individuals from 13 regional populations covering northern, central and southern areas of the Italian peninsula plus Sicily were collected, including both "rural" and "urban" samples classified according to population density in the sampling area. A subset of individuals was additionally genotyped for Y-STR loci included in the Yfiler and PowerPlex Y23 (PPY23) systems (75% and 62%, respectively), allowing the comparison of RM and conventional Y-STRs. Considering the whole set of 13 RM Y-STRs, 1501 unique haplotypes were observed among the 1509 sampled Italian men with a haplotype diversity of 0.999996, largely superior to Yfiler and PPY23 with 0.999914 and 0.999950, respectively. AMOVA indicated that 99.996% of the haplotype variation was within populations, confirming that genetic-geographic structure is almost undetected by RM Y-STRs. Haplotype sharing among regional Italian populations was not observed at all with the complete set of 13 RM Y-STRs. Haplotype sharing within Italian populations was very rare (0.27% non-unique haplotypes), and lower in urban (0.22%) than rural (0.29%) areas. Additionally, 422 father-son pairs were investigated, and 20.1% of them could

  18. Haplotype Reconstruction in Large Pedigrees with Many Untyped Individuals

    NASA Astrophysics Data System (ADS)

    Li, Xin; Li, Jing

    Haplotypes, as they specify the linkage patterns between dispersed genetic variations, provide important information for understanding the genetics of human traits. However, haplotypes are not directly available from current genotyping platforms, and hence there are extensive investigations of computational methods to recover such information. Two major computational challenges arising in current family-based disease studies are large family sizes and many ungenotyped family members. Traditional haplotyping methods can handle neither large families nor families with missing members. In this paper, we propose a method which addresses these issues by integrating multiple novel techniques. The method consists of three major components: pairwise identical-by-descent (IBD) inference, global IBD reconstruction and haplotype restoring. By reconstructing the global IBD of a family from pairwise IBD and then restoring the haplotypes based on the inferred IBD, this method can scale to large pedigrees, and more importantly it can handle families with missing members. Compared with existing methods, this method demonstrates much higher power to recover haplotype information, especially in families with many untyped individuals.

  19. Japanese Alzheimer's Disease and Other Complex Disorders Diagnosis Based on Mitochondrial SNP Haplogroups

    PubMed Central

    Takasaki, Shigeru

    2012-01-01

    This paper first explains how the relations between Japanese Alzheimer's disease (AD) patients and their mitochondrial SNP frequencies at individual mtDNA positions were examined using the radial basis function (RBF) network and a method based on RBF network predictions, and reports that Japanese AD patients are associated with the haplogroups G2a and N9b1. It then describes a method for the initial diagnosis of Alzheimer's disease that is based on the mtSNP haplogroups of the AD patients. The method examines the relations between someone's mtDNA mutations and the mtSNPs of AD patients. As the mtSNP haplogroups thus obtained indicate which nucleotides of mtDNA loci are changed in the Alzheimer's patients, a person's probability of becoming an AD patient can be predicted by comparing those mtDNA mutations with that person's mtDNA mutations. The proposed method can also be used to diagnose diseases such as Parkinson's disease and type 2 diabetes and to identify people likely to become centenarians. PMID:22848858

  20. Identification of the ancestral haplotype for apolipoprotein B suggests an African origin of Homo sapiens sapiens and traces their subsequent migration to Europe and the Pacific

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Rapacz, J.; Hasler-Rapacz, J.O.; Chen, L.

    1991-02-15

    The probable ancestral haplotype for human apolipoprotein B (apoB) has been identified through immunological analysis of chimpanzee and gorilla serum and sequence analysis of their DNA. Moreover, the frequency of this ancestral apoB haplotype among different human populations provides strong support for the African origin of Homo sapiens sapiens and their subsequent migration from Africa to Europe and to the Pacific. The approach used here for the identification of the ancestral human apoB haplotype is likely to be applicable to many other genes.

  1. Association study between BDNF gene variants and Mexican patients with obsessive-compulsive disorder.

    PubMed

    Márquez, Lidia; Camarena, Beatriz; Hernández, Sandra; Lóyzaga, Cristina; Vargas, Luis; Nicolini, Humberto

    2013-11-01

    Obsessive-compulsive disorder (OCD) is a psychiatric disorder whose etiology is not yet known. We investigate the role of three variants of the BDNF gene (rs6265, rs1519480 and rs7124442) by single SNP and haplotype analysis in Mexican OCD patients using a case-control and family-based association design. BDNF gene variants were genotyped in 283 control subjects, 232 OCD patients and first degree relatives of 111 OCD subjects. Single SNP analysis in the case-control study showed an association between rs6265 and OCD with a high frequency of Val/Val genotype and Val allele (p=0.0001 and p=0.0001, respectively). Also, genotype and allele analysis of rs1519480 showed significant differences (p=0.0001, p=0.0001; respectively) between OCD and control groups. Haplotype analysis showed a high frequency of A-T (rs6265-rs1519480) in OCD patients compared with the control group (OR=2.06 [1.18-3.59], p=0.0093) and a low frequency of haplotype A-C in the OCD patients (OR=0.04 [0.01-0.16], p=0.000002). The family-based association study showed no significant differences in the transmission of any variant. Our study replicated the association between BDNF Val66Met gene polymorphism and OCD. Also, we found a significant association of rs1519480 in OCD patients compared with a control group, a region that has never been analyzed in OCD. In conclusion, our findings suggest that BDNF gene could be related to the development of OCD. © 2013 Elsevier B.V. and ECNP. All rights reserved.
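
    The haplotype odds ratios quoted above, with their bracketed intervals, are the kind of figure that can be derived from a 2x2 table of haplotype counts. The sketch below uses the standard Woolf (log) method for the 95% confidence interval. The counts are hypothetical and chosen only for illustration, since the abstract does not report the underlying table, and the function name is likewise an assumption.

```python
import math


def odds_ratio_ci(case_with, case_without, ctrl_with, ctrl_without, z=1.96):
    """Odds ratio and Woolf (log-method) confidence interval for a 2x2 table."""
    odds_ratio = (case_with * ctrl_without) / (case_without * ctrl_with)
    # Standard error of log(OR) under the Woolf approximation.
    se_log_or = math.sqrt(
        1 / case_with + 1 / case_without + 1 / ctrl_with + 1 / ctrl_without
    )
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical haplotype counts (NOT taken from the study above).
or_value, ci_low, ci_high = odds_ratio_ci(40, 60, 20, 80)
print(f"OR = {or_value:.2f} [{ci_low:.2f}-{ci_high:.2f}]")
```

    An interval that excludes 1, as in this toy table, is what a significant haplotype association looks like in this notation.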

  2. A parsimonious tree-grow method for haplotype inference.

    PubMed

    Li, Zhenping; Zhou, Wenfeng; Zhang, Xiang-Sun; Chen, Luonan

    2005-09-01

    Haplotype information has become increasingly important in analyzing fine-scale molecular genetics data, such as disease gene mapping and drug design. Parsimony haplotyping is one of the haplotyping problems belonging to the NP-hard class. In this paper, we aim to develop a novel algorithm for the haplotype inference problem with the parsimony criterion, based on a parsimonious tree-grow method (PTG). PTG is a heuristic algorithm that can find the minimum number of distinct haplotypes based on the criterion of keeping all genotypes resolved during the tree-grow process. In addition, a block-partitioning method is also proposed to improve the computational efficiency. We show that the proposed approach is not only effective with a high accuracy, but also very efficient with the computational complexity in the order of O(m^2 n) time for n single nucleotide polymorphism sites in m individual genotypes. The software is available upon request from the authors, or from http://zhangroup.aporc.org/bioinfo/ptg/. Contact: chen@elec.osaka-sandai.ac.jp. Supporting materials are available from http://zhangroup.aporc.org/bioinfo/ptg/bti572supplementary.pdf
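
    To make the parsimony criterion concrete, the sketch below solves a tiny instance by exhaustive search. It is not the PTG heuristic described above. It is a brute-force illustration whose running time grows exponentially, which is the reason heuristics are needed. The genotype coding (0 and 1 for the two homozygous states, 2 for a heterozygous site) and all function names are assumptions made for this example.

```python
from itertools import combinations, product


def resolves(h1, h2, genotype):
    """True if the haplotype pair (h1, h2) explains the genotype."""
    for a, b, g in zip(h1, h2, genotype):
        if g == 2:
            # Heterozygous site: the two haplotypes must differ.
            if a == b:
                return False
        elif not (a == b == g):
            # Homozygous site: both haplotypes must carry allele g.
            return False
    return True


def min_haplotype_set(genotypes):
    """Smallest set of haplotypes that resolves every genotype (brute force)."""
    n_sites = len(genotypes[0])
    candidates = list(product((0, 1), repeat=n_sites))
    for size in range(1, len(candidates) + 1):
        for subset in combinations(candidates, size):
            if all(
                any(resolves(h1, h2, g) for h1 in subset for h2 in subset)
                for g in genotypes
            ):
                return set(subset)
    return set()


# Toy input: three genotypes over three SNP sites.
genotypes = [(2, 1, 0), (0, 1, 0), (2, 1, 2)]
solution = min_haplotype_set(genotypes)
print(len(solution), sorted(solution))
```

    For this toy input three distinct haplotypes are enough, because haplotype (0, 1, 0) is reused in the resolution of more than one genotype.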

  3. In Vivo Characterization of Human APOA5 Haplotypes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ahituv, Nadav; Akiyama, Jennifer; Chapman-Helleboid, Audrey

    2006-10-01

    Increased plasma triglyceride concentrations are an independent risk factor for cardiovascular disease. Numerous studies support a reproducible genetic association between two minor haplotypes in the human apolipoprotein A5 gene (APOA5) and increased plasma triglyceride concentrations. We thus sought to investigate the effect of these minor haplotypes (APOA5*2 and APOA5*3) on ApoAV plasma levels through the precise insertion of single-copy intact APOA5 haplotypes at a targeted location in the mouse genome. While we found no difference in the amount of human plasma ApoAV in mice containing the common APOA5*1 and minor APOA5*2 haplotype, the introduction of the single APOA5*3 defining allele (19W) resulted in 3-fold lower ApoAV plasma levels consistent with existing genetic association studies. These results indicate that the S19W polymorphism is likely to be functional and explain the strong association of this variant with plasma triglycerides, supporting the value of sensitive in vivo assays to define the functional nature of human haplotypes.

  4. Inferring mechanisms of copy number change from haplotype structures at the human DEFA1A3 locus.

    PubMed

    Black, Holly A; Khan, Fayeza F; Tyson, Jess; Al Armour, John

    2014-07-21

    The determination of structural haplotypes at copy number variable regions can indicate the mechanisms responsible for changes in copy number, as well as explain the relationship between gene copy number and expression. However, obtaining spatial information at regions displaying extensive copy number variation, such as the DEFA1A3 locus, is complex, because of the difficulty in the phasing and assembly of these regions. The DEFA1A3 locus is intriguing in that it falls within a region of high linkage disequilibrium, despite its high variability in copy number (n = 3-16); hence, the mechanisms responsible for changes in copy number at this locus are unclear. In this study, a region flanking the DEFA1A3 locus was sequenced across 120 independent haplotypes with European ancestry, identifying five common classes of DEFA1A3 haplotype. Assigning DEFA1A3 class to haplotypes within the 1000 Genomes project highlights a significant difference in DEFA1A3 class frequencies between populations with different ancestry. The features of each DEFA1A3 class, for example, the associated DEFA1A3 copy numbers, were initially assessed in a European cohort (n = 599) and replicated in the 1000 Genomes samples, showing within-class similarity, but between-class and between-population differences in the features of the DEFA1A3 locus. Emulsion haplotype fusion-PCR was used to generate 61 structural haplotypes at the DEFA1A3 locus, showing a high within-class similarity in structure. Structural haplotypes across the DEFA1A3 locus indicate that intra-allelic rearrangement is the predominant mechanism responsible for changes in DEFA1A3 copy number, explaining the conservation of linkage disequilibrium across the locus. The identification of common structural haplotypes at the DEFA1A3 locus could aid studies into how DEFA1A3 copy number influences expression, which is currently unclear.

  5. Identification of the ancestral haplotype for apolipoprotein B suggests an African origin of Homo sapiens sapiens and traces their subsequent migration to Europe and the Pacific.

    PubMed Central

    Rapacz, J; Chen, L; Butler-Brunner, E; Wu, M J; Hasler-Rapacz, J O; Butler, R; Schumaker, V N

    1991-01-01

    The probable ancestral haplotype for human apolipoprotein B (apoB) has been identified through immunological analysis of chimpanzee and gorilla serum and sequence analysis of their DNA. Moreover, the frequency of this ancestral apoB haplotype among different human populations provides strong support for the African origin of Homo sapiens sapiens and their subsequent migration from Africa to Europe and to the Pacific. The approach used here for the identification of the ancestral human apoB haplotype is likely to be applicable to many other genes. PMID:1996341

  6. Haplotype data for 23 Y-chromosome markers in a reference sample from Bosnia and Herzegovina

    PubMed Central

    Kovačević, Lejla; Fatur-Cerić, Vera; Hadžić, Negra; Čakar, Jasmina; Primorac, Dragan; Marjanović, Damir

    2013-01-01

    Aim: To detect polymorphisms of 23 Y-chromosomal short tandem repeat (STR) loci, including 6 new loci, in a reference database of male population of Bosnia and Herzegovina, as well as to assess the importance of increasing the number of Y-STR loci utilized in forensic DNA analysis. Methods: The reference sample consisted of 100 healthy, unrelated men originating from Bosnia and Herzegovina. Sample collection using buccal swabs was performed in all geographical regions of Bosnia and Herzegovina in the period from 2010 to 2011. DNA samples were typed for 23 Y STR loci, including 6 new loci: DYS576, DYS481, DYS549, DYS533, DYS570, and DYS643, which are included in the new PowerPlex® Y 23 amplification kit. Results: The absolute frequency of generated haplotypes was calculated and results showed that 98 samples had unique Y 23 haplotypes, and that only two samples shared the same haplotype. The most polymorphic locus was DYS418, with 14 detected alleles and the least polymorphic loci were DYS389I, DYS391, DYS437, and DYS393. Conclusion: This study showed that increasing the number of highly polymorphic Y STR markers, to include those tested in our analysis, leads to a reduction of repeating haplotypes, which is very important in the application of forensic DNA analysis. PMID:23771760
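
    The haplotype counts reported above (98 unique haplotypes plus one haplotype shared by two men, 100 samples in total) are enough to work through two summary statistics commonly used for Y-STR databases. The sketch below applies Nei's unbiased estimator of haplotype diversity and the discrimination capacity (distinct haplotypes divided by sample size). Neither value is stated in the abstract, so the outputs are only a worked illustration, and the function names are assumptions.

```python
def haplotype_diversity(counts):
    """Nei's unbiased haplotype diversity: n/(n-1) * (1 - sum of p_i^2)."""
    n = sum(counts)
    sum_sq = sum((c / n) ** 2 for c in counts)
    return n / (n - 1) * (1 - sum_sq)


def discrimination_capacity(counts):
    """Number of distinct haplotypes divided by the number of samples."""
    return len(counts) / sum(counts)


# 98 haplotypes seen once, 1 haplotype seen twice (100 men in total).
counts = [1] * 98 + [2]
hd = haplotype_diversity(counts)
dc = discrimination_capacity(counts)
print(f"haplotype diversity = {hd:.6f}")      # 0.999798
print(f"discrimination capacity = {dc:.2f}")  # 0.99
```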

  7. Haplotype data for 23 Y-chromosome markers in a reference sample from Bosnia and Herzegovina.

    PubMed

    Kovačević, Lejla; Fatur-Cerić, Vera; Hadzic, Negra; Čakar, Jasmina; Primorac, Dragan; Marjanović, Damir

    2013-06-01

    To detect polymorphisms of 23 Y-chromosomal short tandem repeat (STR) loci, including 6 new loci, in a reference database of male population of Bosnia and Herzegovina, as well as to assess the importance of increasing the number of Y-STR loci utilized in forensic DNA analysis. The reference sample consisted of 100 healthy, unrelated men originating from Bosnia and Herzegovina. Sample collection using buccal swabs was performed in all geographical regions of Bosnia and Herzegovina in the period from 2010 to 2011. DNA samples were typed for 23 Y STR loci, including 6 new loci: DYS576, DYS481, DYS549, DYS533, DYS570, and DYS643, which are included in the new PowerPlex® Y 23 amplification kit. The absolute frequency of generated haplotypes was calculated and results showed that 98 samples had unique Y 23 haplotypes, and that only two samples shared the same haplotype. The most polymorphic locus was DYS418, with 14 detected alleles and the least polymorphic loci were DYS389I, DYS391, DYS437, and DYS393. This study showed that increasing the number of highly polymorphic Y STR markers, to include those tested in our analysis, leads to a reduction of repeating haplotypes, which is very important in the application of forensic DNA analysis.

  8. snpTree--a web-server to identify and construct SNP trees from whole genome sequence data.

    PubMed

    Leekitcharoenphon, Pimlapas; Kaas, Rolf S; Thomsen, Martin Christen Frølund; Friis, Carsten; Rasmussen, Simon; Aarestrup, Frank M

    2012-01-01

    The advances and decreasing economical cost of whole genome sequencing (WGS) will soon make this technology available for routine infectious disease epidemiology. In epidemiological studies, outbreak isolates have very little diversity and require extensive genomic analysis to differentiate and classify isolates. One of the successfully and broadly used methods is analysis of single nucleotide polymorphisms (SNPs). Currently, there are different tools and methods to identify SNPs including various options and cut-off values. Furthermore, all current methods require bioinformatic skills. Thus, we lack a standard and simple automatic tool to determine SNPs and construct phylogenetic trees from WGS data. Here we introduce snpTree, a server for online-automatic SNP analysis. This tool is composed of different SNP analysis suites, perl and python scripts. snpTree can identify SNPs and construct phylogenetic trees from WGS as well as from assembled genomes or contigs. WGS data in fastq format are aligned to reference genomes by BWA while contigs in fasta format are processed by Nucmer. SNPs are concatenated based on position on the reference genome and a tree is constructed from concatenated SNPs using FastTree and a perl script. The online server was implemented by HTML, Java and python script. The server was evaluated using four published bacterial WGS data sets (V. cholerae, S. aureus CC398, S. Typhimurium and M. tuberculosis). The evaluation results for the first three cases were consistent and concordant for both raw reads and assembled genomes. In the latter case the original publication involved extensive filtering of SNPs, which could not be repeated using snpTree. The snpTree server is an easy to use option for rapid standardised and automatic SNP analysis in epidemiological studies also for users with limited bioinformatic experience. The web server is freely accessible at http://www.cbs.dtu.dk/services/snpTree-1.0/.
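
    The step in which SNPs are concatenated by reference position can be sketched in a few lines. This is not snpTree's own code. It is a minimal illustration under stated assumptions: positions are 0-based, SNP calls arrive as a dictionary per isolate, and an isolate with no call at a variable position is assumed to carry the reference base. The function names and the toy data are hypothetical.

```python
def concatenate_snps(reference, variants_by_isolate):
    """Build one pseudo-sequence per isolate from all variable positions."""
    # Union of every position at which any isolate differs from the reference.
    positions = sorted(
        {pos for calls in variants_by_isolate.values() for pos in calls}
    )
    alignment = {}
    for isolate, calls in variants_by_isolate.items():
        # Assumption: no call at a position means the reference base is kept.
        alignment[isolate] = "".join(
            calls.get(pos, reference[pos]) for pos in positions
        )
    return positions, alignment


def to_fasta(alignment):
    """Serialise the concatenated SNPs as a FASTA alignment."""
    return "".join(f">{name}\n{seq}\n" for name, seq in alignment.items())


reference = "ACGTACGTAC"
variants = {
    "isolate_A": {1: "T", 7: "A"},
    "isolate_B": {7: "A"},
    "isolate_C": {4: "G"},
}
positions, alignment = concatenate_snps(reference, variants)
print(positions)   # [1, 4, 7]
print(to_fasta(alignment), end="")
```

    The resulting FASTA alignment, one short sequence per isolate, is the kind of input a tree builder such as FastTree works from.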

  9. SNPConvert: SNP Array Standardization and Integration in Livestock Species.

    PubMed

    Nicolazzi, Ezequiel Luis; Marras, Gabriele; Stella, Alessandra

    2016-06-09

    One of the main advantages of single nucleotide polymorphism (SNP) array technology is providing genotype calls for a specific number of SNP markers at a relatively low cost. Since its first application in animal genetics, the number of available SNP arrays for each species has been constantly increasing. However, contrary to what is observed in whole genome sequence data analysis, SNP array data does not have a common set of file formats or coding conventions for allele calling. Therefore, the standardization and integration of SNP array data from multiple sources have become an obstacle, especially for users with basic or no programming skills. Here, we describe the difficulties related to handling SNP array data, focusing on file formats, SNP allele coding, and mapping. We also present the SNPConvert suite, a multi-platform, open-source, and user-friendly set of tools to overcome these issues. This tool, which can be integrated with open-source and open-access tools already available, is a first step towards an integrated system to standardize and integrate any type of raw SNP array data. The tool is available at: https://github.com/nicolazzie/SNPConvert.git.

  10. Influences of APOA5 Variants on Plasma Triglyceride Levels in Uyghur Population

    PubMed Central

    Wang, Yi; Wu, Di; Jin, Li; Wang, Xiaofeng

    2014-01-01

    Objective Single nucleotide polymorphisms (SNPs) in apolipoprotein A5 (APOA5) gene are associated with triglyceride (TG) levels. However, the minor allele frequencies and linkage disequilibriums (LDs) of the SNPs in addition to their effects on TG levels vary greatly between Caucasians and East Asians. The distributions of the SNPs/haplotypes and their associations with TG levels in Uyghur population, an admixture population of Caucasians and East Asians, have not been reported to date. Here, we performed a cross-sectional study to address these. Methods Genotyping of four SNPs in APOA5 (rs662799, rs3135506, rs2075291, and rs2266788) was performed in 1174 unrelated Uyghur subjects. SNP/haplotype and TG association analyses were conducted. Results The frequencies of the SNPs in Uyghurs were in between those in Caucasians and East Asians. The LD between rs662799 and rs2266788 in Uyghurs was stronger than that in East Asians but weaker than that in Caucasians, and the four SNPs resulted in four haplotypes (TGGT, CGGC, TCGT, and CGTT arranged in the order of rs662799, rs3135506, rs2075291, and rs2266788) representing 99.2% of the population. All the four SNPs were significantly associated with TG levels. Compared with non-carriers, carriers of rs662799-C, rs3135506-C, rs2075291-T, and rs2266788-C alleles had 16.0%, 15.1%, 17.1%, and 12.4% higher TG levels, respectively. When haplotype TGGT was defined as the reference, the haplotypes CGGC, TCGT, and CGTT resulted in 16.1%, 19.0%, and 19.8% higher TG levels, respectively. The proportions of variance in TG explained by APOA5 locus were 2.5%, 0.3%, 0.4%, and 1.9% for single SNP rs662799, rs3135506, rs2075291, and rs2266788, respectively, and 3.0% for the haplotypes constructed by them. Conclusions The association profiles between the SNPs and haplotypes at APOA5 locus and TG levels in this admixture population differed from those in Caucasians and East Asians. The functions of these SNPs and haplotypes need to be

  11. Association of diamine oxidase and histamine N-methyltransferase polymorphisms with presence of migraine in a group of Mexican mothers of children with allergies.

    PubMed

    Meza-Velázquez, R; López-Márquez, F; Espinosa-Padilla, S; Rivera-Guillen, M; Ávila-Hernández, J; Rosales-González, M

    2017-10-01

    Low histamine metabolism has been suggested to play a role in the pathogenesis of allergy and migraine. We investigated the possible association between 2 single-nucleotide polymorphisms (SNPs), C314T HNMT and C2029G DAO, and the presence and severity of migraine and migraine-related disability. We studied the frequency of C314T HNMT and C2029G DAO allelic variants in 162 mothers of children with allergies (80 with migraine and 82 without) using a TaqMan-based qPCR Assay and a case-control model. We conducted a logistic regression analysis to examine the association between migraine and the allelic and haplotype variants. Mutant C2029G DAO SNP was found significantly more frequently in the group of women with migraine than in controls (OR, 1.6; 95% CI, 1.1-2.1). No significant differences were found in frequencies of genotypes or alleles in the case of C314T HNMT SNP. Both mutated alleles were associated with migraine-related disability. Coexistence of alleles for both SNPs (haplotypes) showed a strong association with migraine. Haplotypes containing both mutated alleles (either heterozygous or homozygous) were very strongly associated with MIDAS grade IV migraine (OR, 45.0; 95% CI, 5.2-358). This suggests that mutant alleles of C314T for HNMT and C2029G for DAO polymorphisms may interact in a way that increases the risk and impact of migraine. We suggest a synergistic association between HNMT and DAO functional polymorphisms and migraine; this hypothesis must be further confirmed by larger studies. However, the characteristics and ethnic differences between analysed populations should be considered when interpreting the results. Copyright © 2016 Sociedad Española de Neurología. Publicado por Elsevier España, S.L.U. All rights reserved.

  12. H1 tau haplotype-related genomic variation at 17q21.3 as an Asian heritage of the European Gypsy population.

    PubMed

    Almos, P Z; Horváth, S; Czibula, A; Raskó, I; Sipos, B; Bihari, P; Béres, J; Juhász, A; Janka, Z; Kálmán, J

    2008-11-01

    In this study, we examine the frequency of a 900 kb inversion at 17q21.3 in the Gypsy and Caucasian populations of Hungary, which may reflect the Asian origin of Gypsy populations. Of the two haplotypes (H1 and H2), H2 is thought to be exclusively of Caucasian origin, and its occurrence in other racial groups is likely to reflect admixture. In our sample, the H1 haplotype was significantly more frequent in the Gypsy population (89.8 vs 75.5%, P<0.001) and was in Hardy-Weinberg disequilibrium (P=0.017). The 17q21.3 region includes the gene of microtubule-associated protein tau, and this result might imply higher sensitivity to H1 haplotype-related multifactorial tauopathies among Gypsies.
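
    A departure from Hardy-Weinberg equilibrium such as the one reported above is typically assessed with a chi-square goodness-of-fit test on the three genotype counts. The sketch below shows that test for a biallelic locus with one degree of freedom. The abstract gives only haplotype frequencies, so the genotype counts are hypothetical, and the test the authors actually used is not stated.

```python
import math


def hardy_weinberg_test(n_aa, n_ab, n_bb):
    """Chi-square test (1 df) of Hardy-Weinberg proportions at a biallelic locus."""
    n = n_aa + n_ab + n_bb
    p = (2 * n_aa + n_ab) / (2 * n)  # frequency of allele A
    q = 1 - p
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_aa, n_ab, n_bb)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    # Survival function of the chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


# Hypothetical genotype counts (NOT from the study above).
chi2, p_value = hardy_weinberg_test(40, 40, 20)
print(f"chi2 = {chi2:.3f}, p = {p_value:.4f}")

# Counts that match Hardy-Weinberg proportions exactly give chi2 close to 0.
chi2_eq, p_eq = hardy_weinberg_test(36, 48, 16)
```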

  13. HaploForge: a comprehensive pedigree drawing and haplotype visualization web application.

    PubMed

    Tekman, Mehmet; Medlar, Alan; Mozere, Monika; Kleta, Robert; Stanescu, Horia

    2017-12-15

    Haplotype reconstruction is an important tool for understanding the aetiology of human disease. Haplotyping infers the most likely phase of observed genotypes conditional on constraints imposed by the genotypes of other pedigree members. The results of haplotype reconstruction, when visualized appropriately, show which alleles are identical by descent despite the presence of untyped individuals. When used in concert with linkage analysis, haplotyping can help delineate a locus of interest and provide a succinct explanation for the transmission of the trait locus. Unfortunately, the design choices made by existing haplotype visualization programs do not scale to large numbers of markers. Indeed, following haplotypes from generation to generation requires excessive scrolling back and forth. In addition, the most widely used program for haplotype visualization produces inconsistent recombination artefacts for the X chromosome. To resolve these issues, we developed HaploForge, a novel web application for haplotype visualization and pedigree drawing. HaploForge takes advantage of HTML5 to be fast, portable and avoid the need for local installation. It can accurately visualize autosomal and X-linked haplotypes from both outbred and consanguineous pedigrees. Haplotypes are coloured based on identity by descent using a novel A* search algorithm and we provide a flexible viewing mode to aid visual inspection. HaploForge can currently process haplotype reconstruction output from Allegro, GeneHunter, Merlin and Simwalk. HaploForge is licensed under GPLv3 and is hosted and maintained via GitHub: https://github.com/mtekman/haploforge. Contact: r.kleta@ucl.ac.uk. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved.

  14. Different DRB1*03:01-DQB1*02:01 haplotypes confer different risk for celiac disease.

    PubMed

    Alshiekh, S; Zhao, L P; Lernmark, Å; Geraghty, D E; Naluai, Å T; Agardh, D

    2017-08-01

    Celiac disease is associated with the HLA-DR3-DQA1*05:01-DQB1*02:01 and DR4-DQA1*03:01-DQB1*03:02 haplotypes. In addition, there are currently over 40 non-HLA loci associated with celiac disease. This study extends previous analyses on different HLA haplotypes in celiac disease using next generation targeted sequencing. Included were 143 patients with celiac disease and 135 non-celiac disease controls investigated at median 9.8 years (1.4-18.3 years). PCR-based amplification of HLA and sequencing with Illumina MiSeq technology were used for extended sequencing of the HLA class II haplotypes HLA-DRB1, DRB3, DRB4, DRB5, DQA1 and DQB1, respectively. Odds ratios were computed marginally for every allele and haplotype as the ratio of allelic frequency in patients and controls as ratio of exposure rates (RR), when comparing a null reference with equal exposure rates in cases and controls. Among the extended HLA haplotypes, the strongest risk haplotype for celiac disease was shown for DRB3*01:01:02 in linkage with DQA1*05:01-DQB1*02:01 (RR = 6.34; P-value < .0001). In a subpopulation analysis, DRB3*01:01:02-DQA1*05:01-DQB1*02:01 remained the most significant in patients with Scandinavian ethnicity (RR = 4.63; P < .0001) whereas DRB1*07:01:01-DRB4*01:03:01-DQA1*02:01-DQB1*02:02:01 presented the highest risk of celiac disease among non-Scandinavians (RR = 7.94; P = .011). The data also revealed 2 distinct celiac disease risk DR3-DQA1*05:01-DQB1*02:01 haplotypes distinguished by either the DRB3*01:01:02 or DRB3*02:02:01 alleles, indicating that different DRB1*03:01-DQB1*02:01 haplotypes confer different risk for celiac disease. The associated risk of celiac disease for DR3-DRB3*01:01:02-DQA1*05:01-DQB1*02:01 is predominant among patients of Scandinavian ethnicity. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  15. Strong association between a splice mutation (IVS12+5G→A) and haplotype 6 in hereditary tyrosinemia type I

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tanguay, R.M.; St-Louis, M.; Gibson, K.

    1994-09-01

    Hereditary tyrosinemia type I (HT I; McKusick 276700) is a severe inborn error of the tyrosine catabolism pathway caused by a deficiency of fumarylacetoacetate hydrolase (FAH). The highest frequency reported is the one in Saguenay-Lac St-Jean (SLSJ; Quebec, Canada) where 1:1,846 births are affected. The FAH gene has been cloned and several mutations have been described. Allele specific oligonucleotide (ASO) hybridization was used to examine the frequency of a recently reported splice mutation (IVS12+5G→A), and RFLP analysis was done to identify haplotypes related to HT I. The splice mutation was found on 45/50 alleles (90%) in patients from SLSJ and 12/66 (18%) alleles from patients world-wide. All 25 patients from the SLSJ region were positive with 20 being homozygous, indicating that this mutation is the major cause of HT I in French Canada. Of these 25 patients, 96% were positive for one haplotype called no 6 which is identified by TaqI, RsaI, BglII, MspI and KpnI digestions. These data show a strong association between the mutation (IVS12+5G→A) and haplotype 6. Among our patients from around the world, approximately 52% were positive for haplotype 6 indicating its strong relation with HT I. These results provide the rationale for DNA-based carrier testing for HT I in the French-Canadian population at risk as well as in HT I patients in general.

  16. A new mathematical modeling for pure parsimony haplotyping problem.

    PubMed

    Feizabadi, R; Bagherian, M; Vaziri, H R; Salahi, M

    2016-11-01

    The pure parsimony haplotyping (PPH) problem is important in bioinformatics because rational haplotype inference plays an important role in the analysis of genetic data and in mapping complex genetic diseases such as Alzheimer's disease and heart disorders. Haplotypes and genotypes are m-length sequences. Although several integer programming models have already been presented for the PPH problem, its NP-hardness makes those models ineffective on real instances, especially instances with many heterozygous sites. In this paper, we assign a corresponding number to each haplotype and genotype and, based on those numbers, set up a mixed integer programming model. Using numbers instead of sequences makes the new model less complex than previous models, in that it contains neither constraints nor variables corresponding to heterozygous nucleotide sites. Experimental results confirm the efficiency of the new model in producing better solutions in comparison with two state-of-the-art haplotyping approaches. Copyright © 2016 Elsevier Inc. All rights reserved.

  17. The role of human leukocyte antigen DRB1-DQB1 haplotypes in the susceptibility to acquired idiopathic thrombotic thrombocytopenic purpura.

    PubMed

    Sinkovits, György; Szilágyi, Ágnes; Farkas, Péter; Inotai, Dóra; Szilvási, Anikó; Tordai, Attila; Rázsó, Katalin; Réti, Marienn; Prohászka, Zoltán

    2017-02-01

    The acquired form of idiopathic thrombotic thrombocytopenic purpura (TTP) is an autoimmune disease, in which the underlying ADAMTS13-deficiency is caused by inhibitory autoantibodies against the protease. Human leukocyte antigens (HLA), responsible for antigen presentation, play an important role in the development of antibodies. The loci coding HLA DR and DQ molecules are inherited in linkage as haplotypes. The c.1858C>T polymorphism of the PTPN22 gene, which codes a protein tyrosine phosphatase important in lymphocyte activation, predisposes to a number of autoimmune diseases. We determined the HLA-DRB1-DQB1 haplotypes and the PTPN22 c.1858C>T genotypes in 75 patients with acquired idiopathic TTP and in healthy controls, in order to assess the role of these genetic factors and their interactions in the susceptibility to TTP. We found that the carrier frequencies of the DRB1*11-DQB1*03 and DRB1*15-DQB1*06 haplotypes were higher, while those of the DRB1*07-DQB1*02 and DRB1*13-DQB1*06 haplotypes were lower in TTP patients. There was no difference in the overall frequency of the PTPN22 c.1858T allele between TTP patients and controls. In conclusion, we identified four HLA-DRB1-DQB1 haplotypes associated with an increased (DRB1*11-DQB1*03 and DRB1*15-DQB1*06) or a decreased (DRB1*07-DQB1*02 and DRB1*13-DQB1*06) susceptibility to acquired idiopathic TTP. Copyright © 2016 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.

  18. Comparative genome-wide mapping versus extreme pool-genotyping and development of diagnostic SNP markers linked to QTL for adult plant resistance to stripe rust in common wheat.

    PubMed

    Wu, Jianhui; Huang, Shuo; Zeng, Qingdong; Liu, Shengjie; Wang, Qilin; Mu, Jingmei; Yu, Shizhou; Han, Dejun; Kang, Zhensheng

    2018-06-16

    A major stripe rust resistance QTL on chromosome 4BL was localized to a 4.5-Mb interval using comparative QTL mapping methods and validated in 276 wheat genotypes by haplotype analysis. CIMMYT-derived wheat line P10103 was previously identified to have adult plant resistance (APR) to stripe rust in the greenhouse and field. The conventional approach for QTL mapping in common wheat is laborious. Here, we performed QTL detection of APR using a combination of genome-wide scanning and extreme pool-genotyping. SNP-based genetic maps were constructed using the Wheat55K SNP array to genotype a recombinant inbred line (RIL) population derived from the cross Mingxian 169 × P10103. Five stable QTL were detected across multiple environments. After comparing SNP profiles from contrasting, extreme DNA pools of RILs, six putative QTL were located to approximate chromosome positions. A major QTL on chromosome 4B was identified in F2:4 contrasting pools from cross Zhengmai 9023 × P10103. A consensus QTL (LOD = 26-40, PVE = 42-55%), named QYr.nwafu-4BL, was defined and localized to a 4.5-Mb interval flanked by SNP markers AX-110963704 and AX-110519862 in chromosome arm 4BL. Based on stripe rust response, marker genotypes, pedigree analysis and mapping data, QYr.nwafu-4BL is likely to be a new APR QTL. The applicability of the SNP-based markers flanking QYr.nwafu-4BL was validated on a diversity panel of 276 wheat lines. The additional minor QTL on chromosomes 4A, 5A, 5B and 6A enhanced the level of resistance conferred by QYr.nwafu-4BL. Marker-assisted pyramiding of QYr.nwafu-4BL and other favorable minor QTL in new wheat cultivars should improve the level of APR to stripe rust.

  19. Haplotype assembly in polyploid genomes and identical by descent shared tracts.

    PubMed

    Aguiar, Derek; Istrail, Sorin

    2013-07-01

    Genome-wide haplotype reconstruction from sequence data, or haplotype assembly, is at the center of major challenges in molecular biology and life sciences. For complex eukaryotic organisms like humans, the genome is vast and the population samples are growing so rapidly that algorithms processing high-throughput sequencing data must scale favorably in terms of both accuracy and computational efficiency. Furthermore, current models and methodologies for haplotype assembly (i) do not consider individuals sharing haplotypes jointly, which reduces the size and accuracy of assembled haplotypes, and (ii) are unable to model genomes having more than two sets of homologous chromosomes (polyploidy). Polyploid organisms are increasingly becoming the target of many research groups interested in the genomics of disease, phylogenetics, botany and evolution but there is an absence of theory and methods for polyploid haplotype reconstruction. In this work, we present a number of results, extensions and generalizations of compass graphs and our HapCompass framework. We prove the theoretical complexity of two haplotype assembly optimizations, thereby motivating the use of heuristics. Furthermore, we present graph theory-based algorithms for the problem of haplotype assembly using our previously developed HapCompass framework for (i) novel implementations of haplotype assembly optimizations (minimum error correction), (ii) assembly of a pair of individuals sharing a haplotype tract identical by descent and (iii) assembly of polyploid genomes. We evaluate our methods on 1000 Genomes Project, Pacific Biosciences and simulated sequence data. HapCompass is available for download at http://www.brown.edu/Research/Istrail_Lab/. Supplementary data are available at Bioinformatics online.
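
    The minimum error correction (MEC) objective named in the abstract above can be illustrated with a small sketch. This is not the HapCompass algorithm, which is graph-based; it is a brute-force toy for the diploid case only, and the read encoding (a dict from SNP index to allele 0/1) and the function names are assumptions made for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical read encoding: SNP index -> observed allele (0 or 1).
Read = Dict[int, int]


def mec_score(reads: List[Read], hap: List[int]) -> int:
    """Number of allele calls that must be flipped so that every read
    agrees with `hap` or with its complement (diploid case)."""
    comp = [1 - a for a in hap]
    total = 0
    for read in reads:
        d_hap = sum(1 for pos, allele in read.items() if hap[pos] != allele)
        d_comp = sum(1 for pos, allele in read.items() if comp[pos] != allele)
        # Assign the read to whichever haplotype it fits best.
        total += min(d_hap, d_comp)
    return total


def best_haplotype_bruteforce(reads: List[Read], n_snps: int) -> Tuple[List[int], int]:
    """Exhaustive MEC search; exponential in n_snps, so toy inputs only."""
    best_hap: List[int] = []
    best_score = -1
    # Fix the first allele to 0: a haplotype and its complement score the same.
    for mask in range(2 ** (n_snps - 1)):
        hap = [0] + [(mask >> i) & 1 for i in range(n_snps - 1)]
        score = mec_score(reads, hap)
        if best_score < 0 or score < best_score:
            best_hap, best_score = hap, score
    return best_hap, best_score


# Five toy reads over four heterozygous SNPs; the last read conflicts with the rest.
toy_reads = [
    {0: 0, 1: 0, 2: 1},
    {1: 0, 2: 1, 3: 1},
    {0: 1, 1: 1, 2: 0},
    {2: 0, 3: 0},
    {0: 0, 1: 1},
]
```

    Calling `best_haplotype_bruteforce(toy_reads, 4)` searches all eight candidate haplotypes. The exponential search space is why heuristics are needed on real data.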

  20. Re-sequencing regions of the ovine Y chromosome in domestic and wild sheep reveals novel paternal haplotypes.

    PubMed

    Meadows, J R S; Kijas, J W

    2009-02-01

    The male-specific region of the ovine Y chromosome (MSY) remains poorly characterized, yet sequence variants from this region have the potential to reveal the wild progenitor of domestic sheep or examples of domestic and wild paternal introgression. The 5' promoter region of the sex-determining gene SRY was re-sequenced using a subset of wild sheep including bighorn (Ovis canadensis), thinhorn (Ovis dalli spp.), urial (Ovis vignei), argali (Ovis ammon), mouflon (Ovis musimon) and domestic sheep (Ovis aries). Seven novel SNPs (oY2-oY8) were revealed; these were polymorphic between but not within species. Re-sequencing and fragment analysis was applied to the MSY microsatellite SRYM18. It contains a complex compound repeat structure and sequencing of three novel size fragments revealed that a pentanucleotide element remained fixed, whilst a dinucleotide element displayed variability within species. Comparison of the sequence between species revealed that urial and argali sheep grouped more closely to the mouflon and domestic breeds than the pachyceriforms (bighorn and thinhorn). SNP and microsatellite data were combined to define six previously undetected haplotypes. Analysis revealed the mouflon as the only species to share a haplotype with domestic sheep, consistent with its status as a feral domesticate that has undergone male-mediated exchange with domestic animals. A comparison of the remaining wild species and domestic sheep revealed that O. aries is free from signatures of wild sheep introgression.

  1. BCL11A Enhancer Haplotypes and Fetal Hemoglobin in Sickle Cell Anemia

    PubMed Central

    Sebastiani, P.; Farrell, J.J.; Alsultan, A.; Wang, S.; Edward, H. L.; Shappell, H.; Bae, H.; Milton, J. N.; Baldwin, C.T.; Al-Rubaish, A.M.; Naserullah, Z.; Al-Muhanna, F.; Alsuliman, A.; Patra, P. K.; Farrer, L.A.; Ngo, D.; Vathipadiekal, V.; Chui, D.H.K.; Al-Ali, A.K.; Steinberg, M.H.

    2015-01-01

    Background Fetal hemoglobin (HbF) levels in sickle cell anemia patients vary. We genotyped polymorphisms in the erythroid-specific enhancer of BCL11A to see if they might account for the very high HbF associated with the Arab-Indian (AI) haplotype and Benin haplotype of sickle cell anemia. Methods and Results Six BCL11A enhancer SNPs and their haplotypes were studied in Saudi Arabs from the Eastern Province and Indian patients with AI haplotype (HbF ~20%), African Americans (HbF ~7%), and Saudi Arabs from the Southwestern Province (HbF ~12%). Four SNPs (rs1427407, rs6706648, rs6738440, and rs7606173) and their haplotypes were consistently associated with HbF levels. The distributions of haplotypes differ in the 3 cohorts but not their genetic effects: the haplotype TCAG was associated with the lowest HbF level and the haplotype GTAC was associated with the highest HbF level, and the difference in HbF levels between carriers of these haplotypes in all cohorts was approximately 6%. Conclusions Common HbF BCL11A enhancer haplotypes in patients with African origin and AI sickle cell anemia have similar effects on HbF but they do not explain their differences in HbF. PMID:25703683

  2. The Holstein Friesian Lethal Haplotype 5 (HH5) Results from a Complete Deletion of TFB1M and Cholesterol Deficiency (CDH) from an ERV-(LTR) Insertion into the Coding Region of APOB

    PubMed Central

    Schütz, Ekkehard; Wehrhahn, Christin; Wanjek, Marius; Bortfeld, Ralf; Wemheuer, Wilhelm E.; Beck, Julia; Brenig, Bertram

    2016-01-01

    Background With the availability of massive SNP data for several economically important cattle breeds, haplotype tests have been performed to identify unknown recessive disorders. A number of so-called lethal haplotypes have been uncovered in Holstein Friesian cattle and, for at least seven of these, the causative mutations have been identified in candidate genes. However, several lethal haplotypes still remain elusive. Here we report the molecular genetic causes of lethal haplotype 5 (HH5) and cholesterol deficiency (CDH). A targeted enrichment for the known genomic regions, followed by massive parallel sequencing, was used to interrogate for causative mutations in a case/control approach. Methods Targeted enrichment for the known genomic regions, followed by massive parallel sequencing, was used in a case/control approach. PCRs for the causative mutations were developed and compared to routine imputing in 2,100 (HH5) and 3,100 (CDH) cattle. Results HH5 is caused by a deletion of 138kbp, spanning position 93,233kb to 93,371kb on chromosome 9 (BTA9), harboring only dimethyl-adenosine transferase 1 (TFB1M). The deletion breakpoints are flanked by bovine long interspersed nuclear elements Bov-B (upstream) and L1ME3 (downstream), suggesting a homologous recombination/deletion event. TFB1M di-methylates adenine residues in the hairpin loop at the 3’-end of mitochondrial 12S rRNA, being essential for synthesis and function of the small ribosomal subunit of mitochondria. Homozygous TFB1M-/- mice reportedly exhibit embryonal lethality with developmental defects. A 2.8% allelic frequency was determined for the German HF population. CDH results from a 1.3kbp insertion of an endogenous retrovirus (ERV2-1-LTR_BT) into exon 5 of the APOB gene at BTA11:77,959kb. The insertion is flanked by 6bp target site duplications as described for insertions mediated by retroviral integrases. A premature stop codon in the open reading frame of APOB is generated, resulting in a truncation of

  3. A genomic portrait of haplotype diversity and signatures of selection in indigenous southern African populations.

    PubMed

    Chimusa, Emile R; Meintjies, Ayton; Tchanga, Milaine; Mulder, Nicola; Seoighe, Cathal; Soodyall, Himla; Ramesar, Rajkumar

    2015-03-01

    We report a study of genome-wide, dense SNP (∼900K) and copy number polymorphism data of indigenous southern Africans. We demonstrate the genetic contribution to southern and eastern African populations, which involved admixture between indigenous San, Niger-Congo-speaking and populations of Eurasian ancestry. This finding illustrates the need to account for stratification in genome-wide association studies, and that admixture mapping would likely be a successful approach in these populations. We developed a strategy to detect the signature of selection prior to and following putative admixture events. Several genomic regions show an unusual excess of Niger-Kordofanian, and unusual deficiency of both San and Eurasian ancestry, which were considered the footprints of selection after population admixture. Several SNPs with strong allele frequency differences were observed predominantly between the admixed indigenous southern African populations and their ancestral Eurasian populations. Interestingly, many candidate genes, which were identified within the genomic regions showing signals for selection, were associated with southern African-specific high-risk, mostly communicable diseases, such as malaria, influenza, tuberculosis, and human immunodeficiency virus/AIDS. This observation suggests a potentially important role that these genes might have played in adapting to the environment. Additionally, our analyses of haplotype structure, linkage disequilibrium, recombination, copy number variation and genome-wide admixture highlight and support the unique position of San relative to both African and non-African populations. This study contributes to a better understanding of population ancestry and selection in south-eastern African populations; and the data and results obtained will support research into the genetic contributions to infectious as well as non-communicable diseases in the region.

  4. A Genomic Portrait of Haplotype Diversity and Signatures of Selection in Indigenous Southern African Populations

    PubMed Central

    Chimusa, Emile R.; Meintjies, Ayton; Tchanga, Milaine; Mulder, Nicola; Seoighe, Cathal; Soodyall, Himla; Ramesar, Rajkumar

    2015-01-01

    We report a study of genome-wide, dense SNP (∼900K) and copy number polymorphism data of indigenous southern Africans. We demonstrate the genetic contribution to southern and eastern African populations, which involved admixture between indigenous San, Niger-Congo-speaking and populations of Eurasian ancestry. This finding illustrates the need to account for stratification in genome-wide association studies, and that admixture mapping would likely be a successful approach in these populations. We developed a strategy to detect the signature of selection prior to and following putative admixture events. Several genomic regions show an unusual excess of Niger-Kordofanian, and unusual deficiency of both San and Eurasian ancestry, which were considered the footprints of selection after population admixture. Several SNPs with strong allele frequency differences were observed predominantly between the admixed indigenous southern African populations and their ancestral Eurasian populations. Interestingly, many candidate genes, which were identified within the genomic regions showing signals for selection, were associated with southern African-specific high-risk, mostly communicable diseases, such as malaria, influenza, tuberculosis, and human immunodeficiency virus/AIDS. This observation suggests a potentially important role that these genes might have played in adapting to the environment. Additionally, our analyses of haplotype structure, linkage disequilibrium, recombination, copy number variation and genome-wide admixture highlight and support the unique position of San relative to both African and non-African populations. This study contributes to a better understanding of population ancestry and selection in south-eastern African populations; and the data and results obtained will support research into the genetic contributions to infectious as well as non-communicable diseases in the region. PMID:25811879

  5. Modeling haplotype block variation using Markov chains.

    PubMed

    Greenspan, G; Geiger, D

    2006-04-01

    Models of background variation in genomic regions form the basis of linkage disequilibrium mapping methods. In this work we analyze a background model that groups SNPs into haplotype blocks and represents the dependencies between blocks by a Markov chain. We develop an error measure to compare the performance of this model against the common model that assumes that blocks are independent. By examining data from the International Haplotype Mapping project, we show how the Markov model over haplotype blocks is most accurate when representing blocks in strong linkage disequilibrium. This contrasts with the independent model, which is rendered less accurate by linkage disequilibrium. We provide a theoretical explanation for this surprising property of the Markov model and relate its behavior to allele diversity.

  6. Modeling Haplotype Block Variation Using Markov Chains

    PubMed Central

    Greenspan, G.; Geiger, D.

    2006-01-01

    Models of background variation in genomic regions form the basis of linkage disequilibrium mapping methods. In this work we analyze a background model that groups SNPs into haplotype blocks and represents the dependencies between blocks by a Markov chain. We develop an error measure to compare the performance of this model against the common model that assumes that blocks are independent. By examining data from the International Haplotype Mapping project, we show how the Markov model over haplotype blocks is most accurate when representing blocks in strong linkage disequilibrium. This contrasts with the independent model, which is rendered less accurate by linkage disequilibrium. We provide a theoretical explanation for this surprising property of the Markov model and relate its behavior to allele diversity. PMID:16361244

  7. Complement factor H gene (CFH) polymorphisms C-257T, G257A and haplotypes are associated with protection against severe dengue phenotype, possible related with high CFH expression

    PubMed Central

    Pastor, André F.; Moura, Laís Rodrigues; Neto, José W.D.; Nascimento, Eduardo J.M.; Calzavara-Silva, Carlos E.; Gomes, Ana Lisa V.; da Silva, Ana Maria; Cordeiro, Marli T.; Braga-Neto, Ulisses; Crovella, Sergio; Gil, Laura H.V.G.; Marques, Ernesto T.A.; Acioli-Santos, Bartolomeu

    2013-01-01

    Four genetic polymorphisms located at the promoter (C-257T) and coding regions of the CFH gene (exon 2 G257A, exon 14 A2089G and exon 19 G2881T) were investigated in 121 dengue patients (DENV-3) in order to assess the relationship between allele/haplotype variants and clinical outcomes. A statistically significant association was found between the CFH -257T allele (TT/TC genotypes) and reduced susceptibility to severe dengue (SD). Statistical associations indicate that individuals bearing a T allele presented significantly higher protein levels in plasma. The -257T variant is located within an NF-κB binding site, suggesting that this variant might have an effect on the ability of the CFH gene to respond to signals via the NF-κB pathway. The G257A allelic variant showed significant protection against severe dengue. When the effect of CFH haplotypes was considered, the ancestral CG/CG promoter-exon 2 SNP genotype showed significant risk of SD both in a general comparison (ancestral × all variant genotypes) and in individual genotype comparisons (ancestral × each variant genotype), where the most prevalent effect was observed in the CG/CG × CA/TG comparison. These findings support the involvement of the -257T and 257A allele variants and haplotypes in protection against the severe dengue phenotype, related to high basal CFH expression. PMID:23747994

  8. Development and evaluation of the first high-throughput SNP array for common carp (Cyprinus carpio)

    PubMed Central

    2014-01-01

    Background A large number of single nucleotide polymorphisms (SNPs) have been identified in common carp (Cyprinus carpio) but, as yet, no high-throughput genotyping platform is available for this species. C. carpio is an important aquaculture species that accounts for nearly 14% of freshwater aquaculture production worldwide. We have developed an array for C. carpio with 250,000 SNPs and evaluated its performance using samples from various strains of C. carpio. Results The SNPs used on the array were selected from two resources: the transcribed sequences from RNA-seq data of four strains of C. carpio, and the genome re-sequencing data of five strains of C. carpio. The 250,000 SNPs on the resulting array are distributed evenly across the reference C. carpio genome with an average spacing of 6.6 kb. To evaluate the SNP array, 1,072 C. carpio samples were collected and tested. Of the 250,000 SNPs on the array, 185,150 (74.06%) were found to be polymorphic sites. Genotyping accuracy was checked using genotyping data from a group of full-siblings and their parents, and over 99.8% of the qualified SNPs were found to be reliable. Analysis of the linkage disequilibrium on all samples and on three domestic C. carpio strains revealed that the latter had the longer haplotype blocks. We also evaluated our SNP array on 80 samples from eight species related to C. carpio, which yielded from 53,526 to 71,984 polymorphic SNPs. An identity by state analysis divided all the samples into three clusters; most of the C. carpio strains formed the largest cluster. Conclusions The Carp SNP array described here is the first high-throughput genotyping platform for C. carpio. Our evaluation of this array indicates that it will be valuable for farmed carp and for genetic and population biology studies in C. carpio and related species. PMID:24762296

  9. Evidence of triple mutant Pfdhps ISGNGA haplotype in Plasmodium falciparum isolates from North-east India: An analysis of sulfadoxine resistant haplotype selection.

    PubMed

    Das, Manuj K; Chetry, Sumi; Kalita, Mohan C; Dutta, Prafulla

    2016-12-01

    The North-east region of India has a consistent role in the spread of multidrug-resistant Plasmodium (P.) falciparum to other parts of Southeast Asia. After rapid clinical treatment failure of Artemisinin based combination therapy-Sulphadoxine/Pyrimethamine (ACT-SP) chemoprophylaxis, Artemether-Lumefantrine (ACT-AL) combination therapy was introduced in the year 2012 in this region for the treatment of uncomplicated P. falciparum malaria. In a DNA sequencing based polymorphism analysis, seven codons of the P. falciparum dihydropteroate synthetase (Pfdhps) gene were screened in a total of 127 P. falciparum isolates collected from Assam, Arunachal Pradesh and Tripura of North-east India during the years 2014 and 2015 to document current sulfadoxine resistant haplotypes. Sequences were analyzed to rearrange both nucleotide and protein haplotypes. Molecular diversity indices were analyzed in DNA Sequence Polymorphism software (DnaSP) on the basis of Pfdhps gene sequences. Departure from selective neutrality was assessed based on the ratio of nonsynonymous to synonymous nucleotide substitutions [dN/dS ratio]. Moreover, a two-tailed Z test was performed to assess the significance of rejecting the null hypothesis of strict neutrality [dN = dS]. Presence of mutant P. falciparum multidrug resistance protein 1 (Pfmdr1) was also checked in those isolates that carried new Pfdhps haplotypes. The phylogenetic relationship based on the Pfdhps gene was reconstructed in Molecular Evolutionary Genetics Analysis (MEGA). Among eight different sulfadoxine resistant haplotypes found, the ISGNGA haplotype was documented in a total of five isolates from Tripura in association with a new mutant M538R allele. Sequence analysis of the Pfmdr1 gene in these five isolates showed that only one isolate was mutant at codon 86 (N86Y; YYSND) in the multidrug resistance protein. Molecular diversity based on Pfdhps haplotypes revealed that P. falciparum

  10. Evaluation of Bovine High-Density SNP Genotyping Array in Indigenous Dairy Cattle Breeds.

    PubMed

    Dash, S; Singh, A; Bhatia, A K; Jayakumar, S; Sharma, A; Singh, S; Ganguly, I; Dixit, S P

    2018-04-03

    In total, 52 samples of Sahiwal (19), Tharparkar (17), and Gir (16) cattle were genotyped using the BovineHD SNP chip to analyze minor allele frequency (MAF), genetic diversity, and linkage disequilibrium among these cattle. The SNPs common to the BovineHD and 54K SNP chips were also extracted and evaluated for their performance. Only 40%-50% of the SNPs on these arrays were found to be informative for genetic analysis in these cattle breeds. The overall mean MAF for SNPs of the BovineHD SNP chip was 0.248 ± 0.006, 0.241 ± 0.007, and 0.242 ± 0.009 in Sahiwal, Tharparkar and Gir, respectively, while that for the 54K SNPs was lower. The average Reynolds' genetic distance between breeds ranged from 0.042 to 0.055 based on the BovineHD BeadChip, and from 0.052 to 0.084 based on the 54K SNP chip. The estimates of genetic diversity based on the HD and 54K chips were almost the same and, hence, the low density chip seems to be good enough to decipher the genetic diversity of these cattle breeds. The linkage disequilibrium started decaying (r² < 0.2) at 140 kb inter-marker distance and, hence, a 20K low density customized SNP array from the HD chip could be designed for genomic selection in these cattle; otherwise the 54K BeadChip as such will be useful.
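
    The two summary statistics used in the abstract above, minor allele frequency and the pairwise LD measure r², can be sketched as follows. This assumes phased, biallelic haplotype data coded 0/1 per chromosome; array studies often estimate r² from unphased genotypes instead, so this illustrates the definitions rather than the authors' pipeline. The function names and sample data are hypothetical.

```python
from typing import List


def maf(alleles: List[int]) -> float:
    """Minor allele frequency for one biallelic SNP coded 0/1 per chromosome."""
    p = sum(alleles) / len(alleles)
    return min(p, 1.0 - p)


def r_squared(a: List[int], b: List[int]) -> float:
    """Pairwise LD r^2 = D^2 / (pA(1-pA) pB(1-pB)) from phased haplotypes,
    where D = P(A=1, B=1) - pA * pB."""
    if len(a) != len(b):
        raise ValueError("both SNPs must be typed on the same chromosomes")
    n = len(a)
    p_a = sum(a) / n
    p_b = sum(b) / n
    p_ab = sum(1 for x, y in zip(a, b) if x == 1 and y == 1) / n
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("r^2 is undefined for a monomorphic SNP")
    d = p_ab - p_a * p_b
    return d * d / denom


# Hypothetical alleles at two SNPs on eight chromosomes.
snp1 = [1, 1, 0, 0, 1, 0, 1, 0]
snp2 = [1, 1, 0, 0, 1, 0, 0, 1]
ld = r_squared(snp1, snp2)
```

    An LD decay distance such as the one reported is typically obtained by binning SNP pairs by physical distance and finding where the mean r² falls below the chosen threshold.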

  11. Longitudinal analysis of haplotypes and polymorphisms of the APOA5 and APOC3 genes associated with variation in serum triglyceride levels: the Bogalusa Heart Study.

    PubMed

    Hallman, D Michael; Srinivasan, Sathanur R; Chen, Wei; Boerwinkle, Eric; Berenson, Gerald S

    2006-12-01

    Polymorphisms in the APOC3 and APOA5 genes, from the APOA1/APOC3/APOA4/APOA5 gene cluster on chromosome 11q23, have been associated with interindividual variation in plasma triglycerides. APOA5 polymorphisms implicated include 2 in the promoter region (-1131 T/C and -3 A/G) and 1 in exon 2 (+56 C/G). APOC3 polymorphisms implicated include 1 (SstI) in the 3' untranslated region and 1 (-2854 G/T) in the APOC3-APOA4 intergenic region. We analyzed the associations of haplotypes and multilocus genotypes of these polymorphisms with longitudinal serum triglyceride profiles in 360 African American and 823 white subjects from the Bogalusa Heart Study. Subjects were examined from 2 to 8 times (mean +/- SD, 5.4 +/- 1.3) between 1973 and 1996, at ages ranging from 4 to 38 years, with 1978 observations in African Americans and 4465 in whites. Serum triglycerides were significantly higher among whites across all ages. Allele frequencies differed significantly between African Americans and whites at all but the APOA5 +56 C/G locus. Linkage disequilibrium among the loci was higher in whites and haplotype diversity lower: 6 haplotypes had estimated frequencies of more than 1% in African Americans, 5 in whites. Individually, all polymorphisms except APOC3 -2854 G/T showed significant associations with triglyceride levels in the full sample. However, genotype models including all 5 loci showed significant triglyceride associations for only 3 (APOC3 SstI, APOA5 -1131 T/C, and APOA5 +56 C/G); significant interactions among them indicated their effects were not independent. Neither APOC3 -2854 G/T nor APOA5 -3 A/G had significant effects when the other 3 loci were in the models. The EM algorithm was used to estimate haplotype frequencies and to assign haplotype probabilities to individuals, conditional on their genotypes; individuals' haplotype probability vectors were then used as predictors in multilevel mixed models of longitudinal triglyceride profiles. Of haplotypes comprising
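
    The EM step mentioned in the abstract above can be illustrated with a minimal sketch restricted to two biallelic loci, where only double heterozygotes have ambiguous phase. The study itself used five loci; the genotype coding (count of the "1" allele per locus) and the function name below are assumptions made for illustration, not the authors' implementation.

```python
from collections import Counter
from typing import Dict, List, Tuple

Hap = Tuple[int, int]
HAPS: List[Hap] = [(0, 0), (0, 1), (1, 0), (1, 1)]


def em_two_locus(genotypes: List[Tuple[int, int]], n_iter: int = 50) -> Dict[Hap, float]:
    """Estimate two-locus haplotype frequencies from unphased genotypes.

    Each genotype is (g1, g2), the count (0, 1 or 2) of allele 1 at each locus.
    """
    if not genotypes:
        raise ValueError("need at least one genotype")
    n_chrom = 2 * len(genotypes)
    fixed: Counter = Counter()  # haplotype counts whose phase is unambiguous
    n_double_het = 0
    for g1, g2 in genotypes:
        if g1 == 1 and g2 == 1:
            n_double_het += 1  # phase unknown: 00/11 (cis) or 01/10 (trans)
            continue
        # With at most one heterozygous locus the haplotype pair is determined.
        fixed[(1 if g1 == 2 else 0, 1 if g2 == 2 else 0)] += 1
        fixed[(1 if g1 >= 1 else 0, 1 if g2 >= 1 else 0)] += 1

    freq = {h: 0.25 for h in HAPS}  # uniform starting point
    for _ in range(n_iter):
        # E-step: posterior probability that a double heterozygote is 00/11.
        cis = freq[(0, 0)] * freq[(1, 1)]
        trans = freq[(0, 1)] * freq[(1, 0)]
        w_cis = cis / (cis + trans) if cis + trans > 0 else 0.5
        # M-step: expected haplotype counts divided by the chromosome total.
        counts = {h: float(fixed[h]) for h in HAPS}
        counts[(0, 0)] += n_double_het * w_cis
        counts[(1, 1)] += n_double_het * w_cis
        counts[(0, 1)] += n_double_het * (1 - w_cis)
        counts[(1, 0)] += n_double_het * (1 - w_cis)
        freq = {h: counts[h] / n_chrom for h in HAPS}
    return freq


# Hypothetical sample: four 00/00, four 11/11 and two double heterozygotes.
sample = [(0, 0)] * 4 + [(2, 2)] * 4 + [(1, 1)] * 2
estimated = em_two_locus(sample)
```

    The weight `w_cis` is the kind of per-individual haplotype probability that the abstract describes carrying forward as a predictor.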

  12. [Relationship between High-Resolution HLA-A,-B,-DRB1 Alleles and Haplotype Polymorphisms with Myeloid Leukemia of Han People in North China].

    PubMed

    Qi, Jun; Wang, Tian-Ju; Chen, Li-Ping; Wang, Man-Ni; Wu, Jun-Hua; Du, Dan

    2018-02-01

    To investigate the potential relationship between the high-resolution HLA-A,-B,-DRB1 alleles and haplotype polymorphism with acute myeloid leukemia (AML) and chronic myeloid leukemia (CML) of Han people in North China. A total of 1241 healthy unrelated Han bone marrow donors in North China served as a control group, and 259 patients with myeloid leukemia were genotyped at high-resolution level by means of PCR-SBT, -SSO and -SSP typing methods for HLA-A,-B,-DRB1 loci. The frequencies of HLA alleles and haplotypes were calculated with the software Arlequin 3.5.2. The different distribution of genes and haplotypes was analyzed by case-control study, and the odds ratio (OR) of leukemia was also calculated. The structural difference of HLA alleles was analyzed by HLA three-dimensional structure modeling and the software Swiss-PdbViewer v4.1. The χ² test with correction showed that increased frequencies of A*02:07 (8.47% vs 5.28%, P'=0.013), A*29:01 (1.85% vs 0.68%, P=0.044), B*07:02 (5.29% vs 3.10%, P=0.029), B*07:05:01G (1.85% vs 0.68%, P=0.044) and B*35:02 (1.06% vs 0.20%, P=0.023) were found in AML patients (n=189) as compared with controls, respectively; whereas A*02:03 was less frequent in AML as compared with controls (0.79% vs 3.10%, P=0.011). The frequency of B*46:01 was lower in CML patients (n=70) as compared with controls (2.86% vs 7.82%, P=0.031). However, the above-mentioned discrepancies were not statistically significant after Bonferroni correction. By Fisher exact test and Bonferroni correction, the frequencies of DRB1*11:28 and its haplotype A*24:02-B*15:01-DRB1*11:28 in the CML group were very significantly higher than in controls (1.43% vs 0.00%, Pc=0.015; 1.43% vs 0.00%, P=0.003). Three-dimensional structure modeling of DRB1*11:28 and DRB1*11:01 presented significant structure differentiation (RMSD=0.09 nm) in the peptide binding region of the backbone calculated by Swiss-PdbViewer v4.1. The haplotype A*03:01-B*50:01-DRB1*07:01 in AML and A*11:01-B*40:06-DRB1

  13. Mineralocorticoid receptor haplotype, oral contraceptives and emotional information processing.

    PubMed

    Hamstra, D A; de Kloet, E R; van Hemert, A M; de Rijk, R H; Van der Does, A J W

    2015-02-12

    Oral contraceptives (OCs) affect mood in some women and may have more subtle effects on emotional information processing in many more users. Female carriers of mineralocorticoid receptor (MR) haplotype 2 have been shown to be more optimistic and less vulnerable to depression. To investigate the effects of oral contraceptives on emotional information processing and a possible moderating effect of MR haplotype. Cross-sectional study in 85 healthy premenopausal women of West-European descent. We found significant main effects of oral contraceptives on facial expression recognition, emotional memory and decision-making. Furthermore, carriers of MR haplotype 1 or 3 were sensitive to the impact of OCs on the recognition of sad and fearful faces and on emotional memory, whereas MR haplotype 2 carriers were not. Different compounds of OCs were included. No hormonal measures were taken. Most naturally cycling participants were assessed in the luteal phase of their menstrual cycle. Carriers of MR haplotype 2 may be less sensitive to depressogenic side-effects of OCs. Copyright © 2015 IBRO. Published by Elsevier Ltd. All rights reserved.

  14. A haplotypic variant at the IRGM locus and rs11747270 are related to the susceptibility for chronic periodontitis.

    PubMed

    Folwaczny, Matthias; Tsekeri, Eleni; Glas, Jürgen

    2018-02-01

    Immunity-regulated GTPase M (IRGM) plays a critical role in the defense against intracellular bacteria by regulating autophagy formation. This direct genetic association study aimed to determine whether variants at the IRGM genetic locus are associated with chronic periodontitis. Using PCR and melting curve analysis, 390 periodontitis patients and 770 healthy controls were genotyped for six polymorphisms in the IRGM gene (rs13361189, rs10065172, rs4958847, rs1000113, rs11747270, rs931058). Frequency distributions of alleles and genotypes for the six polymorphisms were not significantly different between the periodontitis and the control group. Also following stratification according to gender and smoking, no significant association was found between any of the IRGM variants and periodontitis. Analysis of a subsample of patients revealed a significant association for rs11747270 with severe periodontitis (p = 0.003). Pairwise linkage analysis revealed one block composed of rs13361189, rs10065172, rs4958847, rs1000113 and rs11747270 with strong or even complete linkage disequilibrium (r² > 0.9). Four haplotypes showed a frequency of > 1%, among which the haplotype C-T-A-T-G was significantly associated with chronic periodontitis (p = 0.0051; OR 4.66, 95% CI 1.41-15.42). One rare haplotype of the IRGM locus is significantly associated with chronic periodontitis in a German cohort.
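
    An odds ratio with a 95% confidence interval, as reported for the C-T-A-T-G haplotype above, can be computed from a 2×2 table of carrier counts. The sketch below uses the standard log (Woolf) interval; the abstract does not say which method the authors used, and the counts in the example are invented for illustration, not taken from the study.

```python
import math
from typing import Tuple


def odds_ratio_ci(a: int, b: int, c: int, d: int, z: float = 1.96) -> Tuple[float, float, float]:
    """Odds ratio and Woolf (log-scale) confidence interval for a 2x2 table.

    a: cases carrying the haplotype     b: cases not carrying it
    c: controls carrying the haplotype  d: controls not carrying it
    z: normal quantile (1.96 gives an approximate 95% interval)
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cells must be positive for the Woolf interval")
    odds_ratio = (a * d) / (b * c)
    se_log = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)  # SE of ln(OR)
    lower = math.exp(math.log(odds_ratio) - z * se_log)
    upper = math.exp(math.log(odds_ratio) + z * se_log)
    return odds_ratio, lower, upper


# Hypothetical counts, chosen only to exercise the function.
or_value, ci_low, ci_high = odds_ratio_ci(10, 20, 5, 40)
```

    Rare haplotypes give small cell counts, which is why intervals such as the one reported above tend to be wide.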

  15. Globally dispersed Y chromosomal haplotypes in wild and domestic sheep.

    PubMed

    Meadows, J R S; Hanotte, O; Drögemüller, C; Calvo, J; Godfrey, R; Coltman, D; Maddox, J F; Marzanov, N; Kantanen, J; Kijas, J W

    2006-10-01

    To date, investigations of genetic diversity and the origins of domestication in sheep have utilised autosomal microsatellites and variation in the mitochondrial genome. We present the first analysis of both domestic and wild sheep using genetic markers residing on the ovine Y chromosome. Analysis of a single nucleotide polymorphism (oY1) in the SRY promoter region revealed that allele A-oY1 was present in all wild bighorn sheep (Ovis canadensis), two subspecies of thinhorn sheep (Ovis dalli), European Mouflon (Ovis musimon) and the Barbary (Ammotragus lervia). A-oY1 also had the highest frequency (71.4%) within 458 domestic sheep drawn from 65 breeds sampled from Africa, Asia, Australia, the Caribbean, Europe, the Middle East and Central Asia. Sequence analysis of a second locus, microsatellite SRYM18, revealed a compound repeat array displaying fixed differences, which identified bighorn and thinhorn sheep as distinct from the European Mouflon and domestic animals. Combined genotypic data identified 11 male-specific haplotypes that represented at least two separate lineages. Investigation of the geographical distribution of each haplotype revealed that one (H6) was both very common and widespread in the global sample of domestic breeds. The remaining haplotypes each displayed more restricted and informative distributions. For example, H5 was likely founded following the domestication of European breeds and was used to trace the recent transportation of animals to both the Caribbean and Australia. A high rate of Y chromosomal dispersal appears to have taken place during the development of domestic sheep as only 12.9% of the total observed variation was partitioned between major geographical regions.

  16. Ultraaccurate genome sequencing and haplotyping of single human cells.

    PubMed

    Chu, Wai Keung; Edge, Peter; Lee, Ho Suk; Bansal, Vikas; Bafna, Vineet; Huang, Xiaohua; Zhang, Kun

    2017-11-21

    Accurate detection of variants and long-range haplotypes in genomes of single human cells remains very challenging. Common approaches require extensive in vitro amplification of genomes of individual cells using DNA polymerases and high-throughput short-read DNA sequencing. These approaches have two notable drawbacks. First, polymerase replication errors could generate tens of thousands of false-positive calls per genome. Second, relatively short sequence reads contain little to no haplotype information. Here we report a method, which is dubbed SISSOR (single-stranded sequencing using microfluidic reactors), for accurate single-cell genome sequencing and haplotyping. A microfluidic processor is used to separate the Watson and Crick strands of the double-stranded chromosomal DNA in a single cell and to randomly partition megabase-size DNA strands into multiple nanoliter compartments for amplification and construction of barcoded libraries for sequencing. The separation and partitioning of large single-stranded DNA fragments of the homologous chromosome pairs allows for the independent sequencing of each of the complementary and homologous strands. This enables the assembly of long haplotypes and reduction of sequence errors by using the redundant sequence information and haplotype-based error removal. We demonstrated the ability to sequence single-cell genomes with error rates as low as 10⁻⁸ and average 500-kb-long DNA fragments that can be assembled into haplotype contigs with N50 greater than 7 Mb. The performance could be further improved with more uniform amplification and more accurate sequence alignment. The ability to obtain accurate genome sequences and haplotype information from single cells will enable applications of genome sequencing for diverse clinical needs. Copyright © 2017 the Author(s). Published by PNAS.

  17. Haplotype Structure of the ENPP1 Gene and Nominal Association of the K121Q Missense Single Nucleotide Polymorphism With Glycemic Traits in the Framingham Heart Study

    PubMed Central

    Stolerman, Elliot S.; Manning, Alisa K.; McAteer, Jarred B.; Dupuis, Josée; Fox, Caroline S.; Cupples, L. Adrienne; Meigs, James B.; Florez, Jose C.

    2008-01-01

    OBJECTIVE: A recent meta-analysis demonstrated a nominal association of the ectonucleotide pyrophosphatase phosphodiesterase 1 (ENPP1) K→Q missense single nucleotide polymorphism (SNP) at position 121 with type 2 diabetes. We set out to confirm the association of ENPP1 K121Q with hyperglycemia, expand this association to insulin resistance traits, and determine whether the association stems from K121Q or another variant in linkage disequilibrium with it. RESEARCH DESIGN AND METHODS: We characterized the haplotype structure of ENPP1 and selected 39 tag SNPs that captured 96% of common variation in the region (minor allele frequency ≥5%) with an r² value ≥0.80. We genotyped the SNPs in 2,511 Framingham Heart Study participants and used age- and sex-adjusted linear mixed effects (LME) models to test for association with quantitative metabolic traits. We also examined whether interaction between K121Q and BMI affected glycemic trait levels. RESULTS: The Q allele of K121Q (rs1044498) was associated with increased fasting plasma glucose (FPG), A1C, fasting insulin, and insulin resistance by homeostasis model assessment (HOMA-IR; all P = 0.01–0.006). Two noncoding SNPs (rs7775386 and rs7773477) demonstrated similar associations, but LME models indicated that their effects were not independent from K121Q. We found no association of K121Q with obesity, but interaction models suggested that the effect of the Q allele on FPG and HOMA-IR was stronger in those with a higher BMI (P = 0.008 and 0.01 for interaction, respectively). CONCLUSIONS: The Q allele of ENPP1 K121Q is associated with hyperglycemia and insulin resistance in whites. We found an adiposity-SNP interaction, with a stronger association of K121Q with diabetes-related quantitative traits in people with a higher BMI. PMID:18426862
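
    The r² threshold used for tag SNP selection is a measure of pairwise linkage disequilibrium. The following is a minimal sketch of how r² can be computed, assuming phased haplotypes coded 0/1 at two biallelic SNPs. The function name and the toy haplotypes are hypothetical, and this is not the tagging software used in the study.

```python
def ld_r_squared(haplotypes):
    """Pairwise LD r^2 from phased haplotypes given as (allele_a, allele_b) in {0, 1}."""
    n = len(haplotypes)
    p_a = sum(a for a, _ in haplotypes) / n          # frequency of allele 1 at SNP A
    p_b = sum(b for _, b in haplotypes) / n          # frequency of allele 1 at SNP B
    p_ab = sum(1 for a, b in haplotypes if a == 1 and b == 1) / n
    d = p_ab - p_a * p_b                             # disequilibrium coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("both SNPs must be polymorphic")
    return d * d / denom


# Hypothetical data: perfectly correlated SNPs, then fully independent SNPs.
print(ld_r_squared([(0, 0), (0, 0), (1, 1), (1, 1)]))
print(ld_r_squared([(0, 0), (0, 1), (1, 0), (1, 1)]))
```

    Under this definition, a tag SNP with r² ≥ 0.80 to another variant is taken to capture that variant.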

  18. [C677T-SNP of methylenetetrahydrofolate reductase gene and breast cancer in Mexican women].

    PubMed

    Calderón-Garcidueñas, Ana Laura; Cerda-Flores, Ricardo Martín; Castruita-Ávila, Ana Lilia; González-Guerrero, Juan Francisco; Barrera-Saldaña, Hugo Alberto

    2017-01-01

    Low-penetrance susceptibility genes such as the 5,10-methylenetetrahydrofolate reductase gene (MTHFR) have been considered in the progression of breast cancer (BC). Cancer is a result of genetic, environmental and epigenetic interactions; therefore, these genes should be studied in environmental context, because the results can vary between populations and even within the same country. The objective was to analyze the allelic and genotypic frequencies of the MTHFR C677T SNP in Mexican Mestizo patients with BC and controls from Northeastern Mexico. 243 patients and 118 healthy women were studied. The analysis of the polymorphism was performed with a DNA microarray. Once the frequency of the polymorphism was obtained, a Hardy-Weinberg equilibrium test was carried out for the genotypes. A chi-square test was used to compare the distribution of frequencies. The allele frequency in patients was: C = 0.5406; T = 0.4594 and in controls C = 0.5678, T = 0.4322. Genotype in BC patients was: C / C = 29.9%, C / T = 48.3% and T / T = 21.8%. The distribution in controls was: C / C = 31.4%, C / T = 50.8%, T / T = 17.8% (chi squared 0.77, p = 0.6801). Northeastern Mexican women in this study showed no association between MTHFR C677T SNP and the risk of BC. It seems that the contribution of this polymorphism to BC in Mexico varies depending on various factors, both genetic and environmental.
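
    The relation between the reported genotype and allele frequencies, and the genotype proportions expected under Hardy-Weinberg equilibrium, can be sketched as follows. The function names are hypothetical, and the inputs are the rounded patient percentages from the abstract, so the results only approximate the published values. This is an illustration, not the authors' analysis.

```python
def allele_freq_from_genotypes(f_cc, f_ct, f_tt):
    """Frequency of the C allele from genotype proportions (CC, CT, TT)."""
    total = f_cc + f_ct + f_tt
    # Each CC carries two C alleles, each CT carries one.
    return (f_cc + 0.5 * f_ct) / total


def hwe_expected(p_c):
    """Expected (CC, CT, TT) proportions under Hardy-Weinberg equilibrium."""
    q_t = 1.0 - p_c
    return p_c ** 2, 2 * p_c * q_t, q_t ** 2


# Patient genotype proportions as reported (rounded) in the abstract.
p_c = allele_freq_from_genotypes(0.299, 0.483, 0.218)
expected = hwe_expected(p_c)
print(round(p_c, 4), [round(x, 3) for x in expected])
```

    Comparing the observed proportions with `expected` (for example with a chi-square goodness-of-fit statistic on the genotype counts) is the usual way to test for departure from equilibrium.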

  19. Familiality and SNP heritability of age at onset and episodicity in major depressive disorder.

    PubMed

    Ferentinos, P; Koukounari, A; Power, R; Rivera, M; Uher, R; Craddock, N; Owen, M J; Korszun, A; Jones, L; Jones, I; Gill, M; Rice, J P; Ising, M; Maier, W; Mors, O; Rietschel, M; Preisig, M; Binder, E B; Aitchison, K J; Mendlewicz, J; Souery, D; Hauser, J; Henigsberg, N; Breen, G; Craig, I W; Farmer, A E; Müller-Myhsok, B; McGuffin, P; Lewis, C M

    2015-07-01

    Strategies to dissect phenotypic and genetic heterogeneity of major depressive disorder (MDD) have mainly relied on subphenotypes, such as age at onset (AAO) and recurrence/episodicity. Yet, evidence on whether these subphenotypes are familial or heritable is scarce. The aims of this study are to investigate the familiality of AAO and episode frequency in MDD and to assess the proportion of their variance explained by common single nucleotide polymorphisms (SNP heritability). For investigating familiality, we used 691 families with 2-5 full siblings with recurrent MDD from the DeNt study. We fitted (square root) AAO and episode count in a linear and a negative binomial mixed model, respectively, with family as random effect and adjusting for sex, age and center. The strength of familiality was assessed with intraclass correlation coefficients (ICC). For estimating SNP heritabilities, we used 3468 unrelated MDD cases from the RADIANT and GSK Munich studies. After similarly adjusting for covariates, derived residuals were used with the GREML method in GCTA (genome-wide complex trait analysis) software. Significant familial clustering was found for both AAO (ICC = 0.28) and episodicity (ICC = 0.07). We calculated from respective ICC estimates the maximal additive heritability of AAO (0.56) and episodicity (0.15). SNP heritability of AAO was 0.17 (p = 0.04); analysis was underpowered for calculating SNP heritability of episodicity. AAO and episodicity aggregate in families to a moderate and small degree, respectively. AAO is under stronger additive genetic control than episodicity. Larger samples are needed to calculate the SNP heritability of episodicity. The described statistical framework could be useful in future analyses.
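
    The "maximal additive heritability" figures are consistent with doubling the sibling ICC. The sketch below assumes the standard quantitative-genetic relation that full siblings share half of the additive genetic variance, so that heritability is bounded above by 2 × ICC. The abstract does not state the formula, so this relation and the function name are assumptions.

```python
def max_additive_heritability(sibling_icc):
    """Upper bound on additive heritability from a full-sibling intraclass correlation."""
    # Full siblings share on average half of the additive genetic variance,
    # so the sibling correlation is at least h^2 / 2; cap the bound at 1.
    return min(1.0, 2.0 * sibling_icc)


print(max_additive_heritability(0.28))  # age at onset
print(max_additive_heritability(0.07))  # episodicity
```

    With the rounded ICCs this gives about 0.56 for AAO, matching the abstract, and about 0.14 for episodicity; the reported 0.15 presumably reflects an unrounded ICC.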

  20. Mineralocorticoid receptor haplotype, estradiol, progesterone and emotional information processing.

    PubMed

    Hamstra, Danielle A; de Kloet, E Ronald; Quataert, Ina; Jansen, Myrthe; Van der Does, Willem

    2017-02-01

    Carriers of MR-haplotype 1 and 3 (GA/CG; rs5522 and rs2070951) are more sensitive to the influence of oral contraceptives (OC) and menstrual cycle phase on emotional information processing than MR-haplotype 2 (CA) carriers. We investigated whether this effect is associated with estradiol (E2) and/or progesterone (P4) levels. Healthy MR-genotyped premenopausal women were tested twice in a counterbalanced design. Naturally cycling (NC) women were tested in the early-follicular and mid-luteal phase and OC-users during OC-intake and in the pill-free week. At both sessions E2 and P4 were assessed in saliva. Tests included implicit and explicit positive and negative affect, attentional blink accuracy, emotional memory, emotion recognition, and risky decision-making (gambling). MR-haplotype 2 homozygotes had higher implicit happiness scores than MR-haplotype 2 heterozygotes (p=0.031) and MR-haplotype 1/3 carriers (p<0.001). MR-haplotype 2 homozygotes also had longer reaction times to happy faces in an emotion recognition test than MR-haplotype 1/3 (p=0.001). Practice effects were observed for most measures. The pattern of correlations between information processing and P4 or E2 differed between sessions, as well as the moderating effects of the MR genotype. In the first session the MR-genotype moderated the influence of P4 on implicit anxiety (sr=-0.30; p=0.005): higher P4 was associated with reduction in implicit anxiety, but only in MR-haplotype 2 homozygotes (sr=-0.61; p=0.012). In the second session the MR-genotype moderated the influence of E2 on the recognition of facial expressions of happiness (sr=-0.21; p=0.035): only in MR-haplotype 1/3 higher E2 was correlated with happiness recognition (sr=0.29; p=0.005). In the second session higher E2 and P4 were negatively correlated with accuracy in lag 2 trials of the attentional blink task (p<0.001). Thus NC women, compared to OC-users, performed worse on lag 2 trials (p=0.041). The higher implicit happiness scores of MR-haplotype

  1. SNP Discovery in the Transcriptome of White Pacific Shrimp Litopenaeus vannamei by Next Generation Sequencing

    PubMed Central

    Yu, Yang; Wei, Jiankai; Zhang, Xiaojun; Liu, Jingwen; Liu, Chengzhang; Li, Fuhua; Xiang, Jianhai

    2014-01-01

    The application of next generation sequencing technology has greatly facilitated high throughput single nucleotide polymorphism (SNP) discovery and genotyping in genetic research. In the present study, SNPs were discovered based on two transcriptomes of Litopenaeus vannamei (L. vannamei) generated from Illumina sequencing platform HiSeq 2000. One transcriptome of L. vannamei was obtained through sequencing on the RNA from larvae at mysis stage and its reference sequence was de novo assembled. The data from another transcriptome were downloaded from NCBI and the reads of the two transcriptomes were mapped separately to the assembled reference by BWA. SNP calling was performed using SAMtools. A total of 58,717 and 36,277 SNPs with high quality were predicted from the two transcriptomes, respectively. SNP calling was also performed using the reads of two transcriptomes together, and a total of 96,040 SNPs with high quality were predicted. Among these 96,040 SNPs, 5,242 and 29,129 were predicted as non-synonymous and synonymous SNPs respectively. Characterization analysis of the predicted SNPs in L. vannamei showed that the estimated SNP frequency was 0.21% (one SNP per 476 bp) and the estimated ratio for transition to transversion was 2.0. Fifty SNPs were randomly selected for validation by Sanger sequencing after PCR amplification and 76% of SNPs were confirmed, which indicated that the SNPs predicted in this study were reliable. These SNPs will be very useful for genetic study in L. vannamei, especially for the high density linkage map construction and genome-wide association studies. PMID:24498047

  2. The role of the JAK2 GGCC haplotype and the TET2 gene in familial myeloproliferative neoplasms

    PubMed Central

    Olcaydu, Damla; Rumi, Elisa; Harutyunyan, Ashot; Passamonti, Francesco; Pietra, Daniela; Pascutto, Cristiana; Berg, Tiina; Jäger, Roland; Hammond, Emma; Cazzola, Mario; Kralovics, Robert

    2011-01-01

    Background: Myeloproliferative neoplasms constitute a group of diverse chronic myeloid malignancies that share pathogenic features such as acquired mutations in the JAK2, TET2, CBL and MPL genes. There are recent reports that a JAK2 gene haplotype (GGCC or 46/1) confers susceptibility to JAK2 mutation-positive myeloproliferative neoplasms. The aim of this study was to examine the role of the JAK2 GGCC haplotype and germline mutations of TET2, CBL and MPL in familial myeloproliferative neoplasms. Design and Methods: We investigated patients with familial (n=88) or sporadic (n=684) myeloproliferative neoplasms, and a control population (n=203) from the same demographic area in Italy. Association analysis was performed using tagged single nucleotide polymorphisms (rs10974944 and rs12343867) of the JAK2 haplotype. Sequence analysis of TET2, CBL and MPL was conducted in the 88 patients with familial myeloproliferative neoplasms. Results: Association analysis revealed no difference in haplotype frequency between familial and sporadic cases of myeloproliferative neoplasms (P=0.6529). No germline mutations in TET2, CBL or MPL that segregate with the disease phenotype were identified. As we observed variability in somatic mutations in the affected members of a pedigree with myeloproliferative neoplasms, we postulated that somatic mutagenesis is increased in familial myeloproliferative neoplasms. Accordingly, we compared the incidence of malignant disorders between sporadic and familial patients. Although the overall incidence of malignant disorders did not differ significantly between cases of familial and sporadic myeloproliferative neoplasms, malignancies were more frequent in patients with familial disease aged between 50 and 70 years (P=0.0198) than in patients in the same age range with sporadic myeloproliferative neoplasms. Conclusions: We conclude that the JAK2 GGCC haplotype and germline mutations of TET2, CBL or MPL do not explain familial clustering of

  3. Detecting novel SNPs and breed-specific haplotypes at calpastatin gene in Iranian fat- and thin-tailed sheep breeds and their effects on protein structure.

    PubMed

    Aali, Mohsen; Moradi-Shahrbabak, Mohammad; Moradi-Shahrbabak, Hosein; Sadeghi, Mostafa

    2014-03-01

    Calpastatin has been introduced as a potential candidate gene for growth and meat quality traits. In this study, genetic variability was investigated in the exon 6 and its intron boundaries of ovine CAST gene by PCR-SSCP analysis and DNA sequencing. Also a protein sequence and structural analysis were performed to predict the possible impact of amino acid substitutions on physicochemical properties and structure of the CAST protein. A total of 487 animals belonging to four ancient Iranian sheep breeds with different fat metabolisms, Lori-Bakhtiari and Chall (fat-tailed), Zel-Atabay cross-bred (medium fat-tailed) and Zel (thin-tailed), were analyzed. Eight unique SSCP patterns, representing eight different sequences or haplotypes, CAST-1, CAST-2 and CAST-6 to CAST-11, were identified. Haplotypes CAST-1 and CAST-2 were most common with frequency of 0.365 and 0.295. The novel haplotype CAST-8 had considerable frequency in Iranian sheep breeds (0.129). All the consensus sequences showed 98-99%, 94-98%, 92-93% and 82-83% similarity to the published ovine, caprine, bovine and porcine CAST locus sequences, respectively. Sequence analysis revealed four SNPs in intron 5 (C24T, G62A, G65T and T69-) and three SNPs in exon 6 (c.197A>T, c.282G>T and c.296C>G). All three SNPs in exon 6 were missense mutations which would result in p.Gln 66 Leu, p.Glu 94 Asp and p.Pro 99 Arg substitutions, respectively, in CAST protein. All three amino acid substitutions affected the physicochemical properties of ovine CAST protein including hydrophobicity, amphiphilicity and net charge and subsequently might influence its structure and effect on the activity of Ca2+ channels; hence, they might regulate calpain activity and afterwards meat tenderness and growth rate. The Lori-Bakhtiari population showed the highest heterozygosity in the ovine CAST locus (0.802). Frequency difference of haplotypes CAST-10 and CAST-8 between Lori-Bakhtiari (fat-tailed) and Zel (thin-tailed) breeds was highly

  4. Genomic evolution in domestic cattle: ancestral haplotypes and healthy beef.

    PubMed

    Williamson, Joseph F; Steele, Edward J; Lester, Susan; Kalai, Oscar; Millman, John A; Wolrige, Lindsay; Bayard, Dominic; McLure, Craig; Dawkins, Roger L

    2011-05-01

    We have identified numerous Ancestral Haplotypes encoding a 14-Mb region of Bota C19. Three are frequent in Simmental, Angus and Wagyu and have been conserved since common progenitor populations. Others are more relevant to the differences between these 3 breeds including fat content and distribution in muscle. SREBF1 and Growth Hormone, which have been implicated in the production of healthy beef, are included within these haplotypes. However, we conclude that alleles at these 2 loci are less important than other sequences within the haplotypes. Identification of breeds and hybrids is improved by using haplotypes rather than individual alleles. Copyright © 2010 Elsevier Inc. All rights reserved.

  5. Association of Inducible T Cell Costimulator Polymorphisms with Susceptibility and Outcome of Hepatitis B Virus Infection in a Chinese Han Population.

    PubMed

    Hu, J; Li, Q-L; Hou, S-H; Peng, H; Guo, J-J

    2015-09-01

    Inducible T cell costimulator (ICOS) functions to regulate cell-cell signalling, immune responses and cell proliferation. ICOS single nucleotide polymorphism (SNP) may affect protein expression and functions. This study investigated the association of ICOS SNPs with hepatitis B virus (HBV) infection and outcome in a Chinese population. A total of 1290 Chinese Han individuals were enrolled, including 63 asymptomatic HBV carriers, 220 chronic hepatitis B patients (CHB), 249 HBV-related liver cirrhosis patients (LC), 108 patients with HBV-related hepatocellular carcinoma (HCC), 338 patients with natural HBV clearance and 312 healthy subjects (as controls). DNA samples from these subjects were genotyped for four ICOS SNPs (rs11883722, rs10932029, rs1559931 and rs4675379) using TaqMan SNP Genotyping Assay and analysed. The data showed that genotype and allele frequencies of ICOS SNPs in cases and controls followed the Hardy-Weinberg distribution. The CC genotype of rs4675379 was higher in patients with HBV infection (including AC, CHB, LC and HCC) than in patients with HBV clearance (P = 0.006). Furthermore, the genotype 'GA' and the minor allele 'A' of rs1559931 were associated with a decreased HCC susceptibility (P < 0.001). Haplotype analysis data showed that 'GC' haplotype in block 2 (rs1559931 and rs4675379) had a lower frequency in patients than in HBV-cleared subjects (P = 0.034), although its overall frequency was only 1.6%. Our study found that ICOS rs1559931 SNP was associated with decreased HBV-related HCC risk in the studied Chinese Han population, except for patients with natural clearance of HBV. © 2015 The Foundation for the Scandinavian Journal of Immunology.

  6. Case-control study of eczema associated with IL13 genetic polymorphisms in Japanese children.

    PubMed

    Miyake, Yoshihiro; Kiyohara, Chikako; Koyanagi, Midori; Fujimoto, Takahiro; Shirasawa, Senji; Tanaka, Keiko; Sasaki, Satoshi; Hirota, Yoshio

    2011-01-01

    Several association studies have investigated the relationships between single nucleotide polymorphisms (SNPs) in the IL13 gene and eczema, with inconsistent results. We conducted a case-control study of the relationship between the polymorphisms of rs1800925 and rs20541 and the risk of eczema in Japanese children aged 3 years. Included were the 209 cases identified based on criteria of the International Study of Asthma and Allergies in Childhood (ISAAC). Controls were 451 children without eczema based on ISAAC questions who had not been diagnosed by a physician as having asthma or atopic eczema. The minor TT genotype of the rs1800925 SNP and the minor AA genotype of the rs20541 SNP were significantly related to an increased risk of eczema: adjusted odds ratio for the TT genotype was 2.78 (95% confidence interval 1.22-6.30) and that for the AA genotype was 2.38 (95% confidence interval 1.35-4.18). Haplotype analyses showed a protective association between the CG haplotype and eczema, whereas the TA haplotype was positively related to the risk of eczema. Perinatal smoking exposure did not interact with genotypes of the IL13 gene in the etiology of eczema. The significant association of the rs20541 SNP with eczema essentially disappeared after additional adjustment for the rs1800925 SNP, whereas a relationship with the rs1800925 SNP remained significant. A common genetic variation in the IL13 gene at the levels of both single SNPs and haplotypes was associated with eczema. However, the significant association with the rs20541 SNP might be ascribed to the rs1800925 SNP. Copyright © 2010 S. Karger AG, Basel.

  7. Methylenetetrahydrofolate reductase gene haplotypes affect toxicity during maintenance therapy for childhood acute lymphoblastic leukemia in Japanese patients.

    PubMed

    Tanaka, Yoichi; Manabe, Atsushi; Nakadate, Hisaya; Kondoh, Kensuke; Nakamura, Kozue; Koh, Katsuyoshi; Kikuchi, Akira; Komiyama, Takako

    2014-05-01

    The aim of this study was to investigate the influence of daily 6-mercaptopurine (6-MP) and low-dose weekly methotrexate (MTX) combination treatment and methylenetetrahydrofolate reductase (MTHFR) haplotypes on toxicity during maintenance therapy in Japanese childhood acute lymphoblastic leukemia (ALL). We retrospectively analyzed the MTHFR C677T and A1298C polymorphisms and influence of haplotypes on toxicity in 73 patients. The MTHFR 677TT and 677CT + 1298AC genotypes were associated with severe liver toxicity (p = 0.014, odds ratio [OR] = 3.82, 95% confidence interval [CI] = 1.27-11.46) and more rapid onset of liver toxicity (p = 0.010). The same genotypes were associated with a lower frequency of 6-MP and MTX dose reduction due to leukopenia (p < 0.05). No difference in average drug doses was observed among the MTHFR genotypes. In conclusion, the MTHFR C677T and A1298C haplotypes might be useful for monitoring adverse effects in childhood ALL maintenance therapy in Japanese patients.

  8. Detecting SNPs and estimating allele frequencies in clonal bacterial populations by sequencing pooled DNA.

    PubMed

    Holt, Kathryn E; Teo, Yik Y; Li, Heng; Nair, Satheesh; Dougan, Gordon; Wain, John; Parkhill, Julian

    2009-08-15

    Here, we present a method for estimating the frequencies of SNP alleles present within pooled samples of DNA using high-throughput short-read sequencing. The method was tested on real data from six strains of the highly monomorphic pathogen Salmonella Paratyphi A, sequenced individually and in a pool. A variety of read mapping and quality-weighting procedures were tested to determine the optimal parameters, which afforded ≥80% sensitivity of SNP detection and strong correlation with true SNP frequency at a pool-wide read depth of 40x, declining only slightly at read depths of 20-40x. The method was implemented in Perl and relies on the open-source software Maq for read mapping and SNP calling. The Perl script is freely available from ftp://ftp.sanger.ac.uk/pub/pathogens/pools/.
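
    The idea of a quality-weighted allele frequency estimate from pooled reads can be sketched as below. This is not the published Perl script: the weighting scheme (each base call weighted by its probability of being correct, 1 − 10^(−Q/10), from the Phred definition), the function name and the toy pileup are all assumptions chosen for illustration.

```python
def pooled_allele_frequency(observations, alt_base):
    """Estimate the frequency of alt_base at one site of a pooled sample.

    observations: list of (base, phred_quality) pairs, one per aligned read.
    """
    # Weight each call by its probability of being correct under the Phred scale.
    weights = [(base, 1.0 - 10 ** (-q / 10.0)) for base, q in observations]
    total = sum(w for _, w in weights)
    if total == 0:
        raise ValueError("no informative reads at this site")
    return sum(w for base, w in weights if base == alt_base) / total


# Hypothetical pileup: three reads support A, one supports G, all at Q30.
pileup = [("A", 30), ("A", 30), ("A", 30), ("G", 30)]
print(pooled_allele_frequency(pileup, "G"))
```

    With equal qualities the estimate reduces to the plain read-count fraction; low-quality calls contribute proportionally less.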

  9. HLA class-I and class-II allele frequencies and two-locus haplotypes in Melanesians of Vanuatu and New Caledonia.

    PubMed

    Maitland, K; Bunce, M; Harding, R M; Barnardo, M C N M; Clegg, J B; Welsh, K; Bowden, D K; Williams, T N

    2004-12-01

    HLA class-I and class-II allele frequencies and two-locus haplotypes were examined in 367 unrelated Melanesians living on the islands of Vanuatu and New Caledonia. Diversity at all HLA class-I and class-II loci was relatively limited. In class-I loci, three HLA-A allelic groups (HLA-A*24, HLA-A*34 and HLA-A*11), seven HLA-B alleles or allelic groups (HLA-B*1506, HLA-B*5602, HLA-B*13, HLA-B*5601, HLA-B*4001, HLA-B*4002 and HLA-B*2704) and four HLA-C alleles or allelic groups (HLA-Cw*04, HLA-Cw*01, HLA-Cw*0702 and HLA-Cw*15) constituted more than 90% of the alleles observed. In the class-II loci, four HLA-DRB1 alleles (HLA-DRB1*15, HLA-DRB1*11, HLA-DRB1*04 and HLA-DRB1*16), three HLA-DRB3-5 alleles (HLA-DRB3*02, HLA-DRB4*01 and HLA-DRB5*01/02) and five HLA-DQB1 alleles (HLA-DQB1*0301, HLA-DQB1*04, HLA-DQB1*05, HLA-DQB1*0601 and HLA-DQB1*0602) constituted over 93, 97 and 98% of the alleles observed, respectively. Homozygosity showed significant departures from expected levels for neutrality based on allele frequency (i.e. excess diversity) at the HLA-B, HLA-Cw, HLA-DQB1 and HLA-DRB3/5 loci on some islands. The locus with the strongest departure from neutrality was HLA-DQB1, homozygosity being significantly lower than expected on all islands except New Caledonia. No consistent pattern was demonstrated for any HLA locus in relation to malaria endemicity.

  10. Cloud computing-based TagSNP selection algorithm for human genome data.

    PubMed

    Hung, Che-Lun; Chen, Wen-Pei; Hua, Guan-Jie; Zheng, Huiru; Tsai, Suh-Jen Jane; Lin, Yaw-Ling

    2015-01-05

    Single nucleotide polymorphisms (SNPs) play a fundamental role in human genetic variation and are used in medical diagnostics, phylogeny construction, and drug design. They provide the highest-resolution genetic fingerprint for identifying disease associations and human features. Haplotypes are regions of linked genetic variants that are closely spaced on the genome and tend to be inherited together. Genetics research has revealed SNPs within certain haplotype blocks that introduce few distinct common haplotypes into most of the population. Haplotype block structures are used in association-based methods to map disease genes. In this paper, we propose an efficient algorithm for identifying haplotype blocks in the genome. In chromosomal haplotype data retrieved from the HapMap project website, the proposed algorithm identified longer haplotype blocks than an existing algorithm. To enhance its performance, we extended the proposed algorithm into a parallel algorithm that copies data in parallel via the Hadoop MapReduce framework. The proposed MapReduce-paralleled combinatorial algorithm performed well on real-world data obtained from the HapMap dataset; the improvement in computational efficiency was proportional to the number of processors used.

  11. A spatial haplotype copying model with applications to genotype imputation.

    PubMed

    Yang, Wen-Yun; Hormozdiari, Farhad; Eskin, Eleazar; Pasaniuc, Bogdan

    2015-05-01

    Ever since its introduction, the haplotype copy model has proven to be one of the most successful approaches for modeling genetic variation in human populations, with applications ranging from ancestry inference to genotype phasing and imputation. Motivated by coalescent theory, this approach assumes that any chromosome (haplotype) can be modeled as a mosaic of segments copied from a set of chromosomes sampled from the same population. At the core of the model is the assumption that any chromosome from the sample is equally likely to contribute a priori to the copying process. Motivated by recent works that model genetic variation in a geographic continuum, we propose a new spatial-aware haplotype copy model that jointly models geography and the haplotype copying process. We extend hidden Markov models of haplotype diversity such that at any given location, haplotypes that are closest in the genetic-geographic continuum map are a priori more likely to contribute to the copying process than distant ones. Through simulations starting from the 1000 Genomes data, we show that our model achieves superior accuracy in genotype imputation over the standard spatial-unaware haplotype copy model. In addition, we show the utility of our model in selecting a small personalized reference panel for imputation that leads to both improved accuracy as well as to a lower computational runtime than the standard approach. Finally, we show our proposed model can be used to localize individuals on the genetic-geographical map on the basis of their genotype data.
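
    The haplotype copy model described above is a hidden Markov model whose hidden state is the panel haplotype currently being copied. The sketch below is a simplified forward-algorithm likelihood, not the authors' implementation: the parameter names (`rho` for the per-site switch probability, `eps` for the copying error rate) and the way a non-uniform `prior` stands in for spatial proximity are assumptions made for illustration.

```python
def copy_model_likelihood(target, panel, prior=None, rho=0.05, eps=0.01):
    """Likelihood of `target` as a mosaic of haplotypes copied from `panel`.

    target: sequence of alleles; panel: list of sequences of the same length.
    prior:  a priori copying weights per panel haplotype (uniform if None);
            a spatially aware model would favour geographically close haplotypes.
    """
    k = len(panel)
    if prior is None:
        prior = [1.0 / k] * k

    def emit(j, site):
        # Copy faithfully with probability 1 - eps, otherwise emit a mismatch.
        return 1.0 - eps if panel[j][site] == target[site] else eps

    fwd = [prior[j] * emit(j, 0) for j in range(k)]
    for site in range(1, len(target)):
        total = sum(fwd)
        # Stay on the same haplotype with probability 1 - rho, or switch with
        # probability rho to a haplotype drawn from the prior.
        fwd = [((1.0 - rho) * fwd[j] + rho * prior[j] * total) * emit(j, site)
               for j in range(k)]
    return sum(fwd)


panel = ["000", "111"]  # hypothetical reference haplotypes
near = copy_model_likelihood("000", panel, prior=[0.9, 0.1])
far = copy_model_likelihood("000", panel, prior=[0.1, 0.9])
print(near > far)
```

    In this toy case, putting more prior weight on the matching haplotype raises the likelihood of the target, which is the intuition behind weighting by genetic-geographic proximity.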

  12. Significant association between ERCC2 and MTHR polymorphisms and breast cancer susceptibility in Moroccan population: genotype and haplotype analysis in a case-control study.

    PubMed

    Hardi, Hanaa; Melki, Rahma; Boughaleb, Zouhour; El Harroudi, Tijani; Aissaoui, Souria; Boukhatem, Noureddine

    2018-03-15

    Genetic determinants of breast cancer (BC) remain largely unknown in the majority of Moroccan patients. The purpose of this study was to explore the association of ERCC2 and MTHFR polymorphisms with genetic susceptibility to breast cancer in the Moroccan population. We genotyped ERCC2 polymorphisms (rs1799793 (G934A) and rs13181 (A2251C)) and MTHFR polymorphisms (rs1801133 (C677T) and rs1801131 (A1298C)) using TaqMan SNP Genotyping Assays. Genotypes were compared in 151 BC cases and 156 population-matched controls. Allelic, genotypic and haplotype associations with the risk and clinicopathological features of BC were assessed using logistic regression analyses. ERCC2-rs1799793-AA genotype was associated with high risk of BC compared to wild type genotype (recessive model: OR: 2.90, 95% CI: 1.34-6.26, p = 0.0069) even after Bonferroni correction (p < 0.0125). MTHFR rs1801133-TT genotype was associated with increased risk of BC (recessive model, OR: 2.49, 95% CI: 1.17-5.29, p = 0.017) but the association became non-significant after Bonferroni correction. For the remaining SNPs, no statistically significant associations with BC risk were detected. Significant association with clinical features was detected for MTHFR-rs1801133-TC genotype with early age at diagnosis and familial BC. Following Bonferroni correction, only association with familial BC remained significant. MTHFR-rs1801131-CC genotype was associated with sporadic BC. ERCC2-rs1799793-AA genotype correlated with ER+ and PR+ breast cancer. ERCC2-rs13181-CA genotype was significantly associated with large tumors (T ≥ 3) in BC patients. None of these associations passed Bonferroni correction. Haplotype analysis showed that ERCC2 A-C haplotype was significantly associated with increased BC risk (OR: 3.71, 95% CI: 1.7-8.12, p = 0.0002 and p = 0.0008 before and after Bonferroni correction, respectively) and positive expression of ER and PR in BC patients. ERCC2 G-C haplotype was correlated with PR negative and

  13. Rs219780 SNP of Claudin 14 Gene is not Related to Clinical Expression in Primary Hyperparathyroidism.

    PubMed

    Piedra, María; Berja, Ana; García-Unzueta, María Teresa; Ramos, Laura; Valero, Carmen; Amado, José Antonio

    2015-01-01

    The CLDN14 gene encodes a protein involved in the regulation of paracellular permeability or ion transport at epithelial tight junctions as in the nephron. The C allele of the rs219780 SNP (single nucleotide polymorphism) of CLDN14 has been associated with renal lithiasis, high levels of parathormone (PTH), and with low bone mineral density (BMD) in healthy women. Our aim is to study the relationship between rs219780 SNP of CLDN14 and renal lithiasis, fractures, and BMD in patients with primary hyperparathyroidism (PHPT). We enrolled 298 Caucasian patients with PHPT and 328 healthy volunteers in a cross-sectional study. We analysed anthropometric data, history of fractures or kidney stones, biochemical parameters including markers for bone remodelling, abdominal ultrasound, and BMD and genotyping for the rs219780 SNP of CLDN14. We did not find any difference in the frequency of fractures or renal lithiasis between the genotype groups in PHPT patients. Moreover, we did not find any relationship between the T or C alleles and BMD or biochemical parameters. rs219780 SNP of CLDN14 does not appear to be a risk factor for the development of PHPT nor does it seem to influence the clinical expression of PHPT.

  14. Mathematical properties and bounds on haplotyping populations by pure parsimony.

    PubMed

    Wang, I-Lin; Chang, Chia-Yuan

    2011-06-01

    Although the haplotype data can be used to analyze the function of DNA, due to the significant efforts required in collecting the haplotype data, usually the genotype data is collected and then the population haplotype inference (PHI) problem is solved to infer haplotype data from genotype data for a population. This paper investigates the PHI problem based on the pure parsimony criterion (HIPP), which seeks the minimum number of distinct haplotypes to infer a given genotype data. We analyze the mathematical structure and properties for the HIPP problem, propose techniques to reduce the given genotype data into an equivalent one of much smaller size, and analyze the relations of genotype data using a compatible graph. Based on the mathematical properties in the compatible graph, we propose a maximal clique heuristic to obtain an upper bound, and a new polynomial-sized integer linear programming formulation to obtain a lower bound for the HIPP problem. Copyright © 2011 Elsevier Inc. All rights reserved.
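
    The pure parsimony criterion can be made concrete with a brute-force sketch for tiny instances. This is not the paper's clique heuristic or integer programming formulation; it simply enumerates every way of resolving each genotype and keeps the resolution that uses the fewest distinct haplotypes. The genotype coding (0 and 1 for the two homozygous states, 2 for a heterozygous site) and the function names are assumptions.

```python
from itertools import product


def explanations(genotype):
    """All unordered haplotype pairs that are compatible with one genotype."""
    het = [i for i, g in enumerate(genotype) if g == 2]
    pairs = set()
    for bits in product((0, 1), repeat=len(het)):
        h1, h2 = list(genotype), list(genotype)
        for pos, b in zip(het, bits):
            h1[pos], h2[pos] = b, 1 - b  # heterozygous sites carry opposite alleles
        pairs.add(tuple(sorted((tuple(h1), tuple(h2)))))
    return sorted(pairs)


def pure_parsimony(genotypes):
    """Smallest set of haplotypes resolving all genotypes (exhaustive search)."""
    best = None
    for combo in product(*(explanations(g) for g in genotypes)):
        used = {h for pair in combo for h in pair}
        if best is None or len(used) < len(best):
            best = used
    return best


# Hypothetical genotypes at two SNPs.
print(sorted(pure_parsimony([(0, 0), (2, 2), (1, 1)])))
```

    The search space grows exponentially with the number of heterozygous sites, which is why bounds and reductions such as those analyzed in the paper matter for realistic data.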

  15. No evidence for MHC class II-based non-random mating at the gametic haplotype in Atlantic salmon.

    PubMed

    Promerová, M; Alavioon, G; Tusso, S; Burri, R; Immler, S

    2017-06-01

    Genes of the major histocompatibility complex (MHC) are a likely target of mate choice because of their role in inbreeding avoidance and potential benefits for offspring immunocompetence. Evidence for female choice for complementary MHC alleles among competing males exists both for the pre- and the postmating stages. However, it remains unclear whether the latter may involve non-random fusion of gametes depending on gametic haplotypes resulting in transmission ratio distortion or non-random sequence divergence among fused gametes. We tested whether non-random gametic fusion of MHC-II haplotypes occurs in Atlantic salmon Salmo salar. We performed in vitro fertilizations that excluded interindividual sperm competition using a split family design with large clutch sample sizes to test for a possible role of the gametic haplotype in mate choice. We sequenced two MHC-II loci in 50 embryos per clutch to assess allelic frequencies and sequence divergence. We found no evidence for transmission ratio distortion at two linked MHC-II loci, nor for non-random gamete fusion with respect to MHC-II alleles. Our findings suggest that the gametic MHC-II haplotypes play no role in gamete association in Atlantic salmon and that earlier findings of MHC-based mate choice most likely reflect choice among diploid genotypes. We discuss possible explanations for these findings and how they differ from findings in mammals.

  16. Population-specific variation in haplotype composition and heterozygosity at the POLB locus.

    PubMed

    Yamtich, Jennifer; Speed, William C; Straka, Eva; Kidd, Judith R; Sweasy, Joann B; Kidd, Kenneth K

    2009-05-01

    DNA polymerase beta plays a central role in base excision repair (BER), which removes large numbers of endogenous DNA lesions from each cell on a daily basis. Little is currently known about germline polymorphisms within the POLB locus, making it difficult to study the association of variants at this locus with human diseases such as cancer. Yet, approximately thirty percent of human tumor types show variants of DNA polymerase beta. We have assessed the global frequency distributions of coding and common non-coding SNPs in and flanking the POLB gene for a total of 14 sites typed in approximately 2400 individuals from anthropologically defined human populations worldwide. We have found a marked difference between haplotype frequencies in African populations and in non-African populations.

  17. TPH2 -703G/T SNP may have important effect on susceptibility to suicidal behavior in major depression.

    PubMed

    Yoon, Ho-Kyoung; Kim, Yong-Ku

    2009-04-30

    Serotonergic system-related genes can be good candidate genes for both major depressive disorder (MDD) and suicidal behavior. In this study, we aimed to investigate the association of serotonin 2A receptor gene -1438A/G SNP (HTR2A -1438A/G), tryptophan hydroxylase 2 gene -703G/T SNP (TPH2 -703G/T) and serotonin 1A receptor C-1019G (HTR1A C-1019G) with suicidal behavior. One hundred eighty-one suicidal depressed patients and 143 non-suicidal depressed patients who met DSM-IV criteria for major depressive disorder were recruited from patients who were admitted to Korea University Ansan Hospital. One hundred seventy-six normal controls were healthy volunteers who were recruited by local advertisement. Patients and normal controls were genotyped for HTR2A -1438A/G, TPH2 -703G/T and HTR1A C-1019G. The suicidal depressed patients were evaluated by the lethality of individual suicide attempts using Weisman and Worden's risk-rescue rating (RRR) and the Lethality Suicide Attempt Rating Scale-updated (LSARS-II). In order to assess the severity of depressive symptoms of patients, Hamilton's Depression Rating Scale (HDRS) was administered. Genotype and allele frequencies were compared between groups by chi-square statistics. Association of genotype of the candidate genes with the lethality of suicidal behavior was examined with ANOVA by comparing the mean scores of LSARS and RRR according to the genotype. There were statistically significant differences in the genotype distributions and allele frequencies of TPH2 -703G/T between the suicidal depressive group and the normal control group. The homozygous allele G (G/G genotype) frequency was significantly higher in suicidal depressed patients than in controls. However, no differences in either genotype distribution or in allele frequencies of HTR2A -1438A/G and HTR1A C-1019G were observed between the suicidal depressed patients, the non-suicidal depressed patients, and the normal controls. There were no differences in the

  18. Haplotypes and effects on growth traits of bovine Wnt7a gene in Chinese Qinchuan cattle.

    PubMed

    Xue, Jing; Sun, Yujia; Guo, Wenjiao; Yang, Ziqi; Tian, Huibin; Zhang, Chunlei; Lei, Chuzhao; Lan, Xianyong; Chen, Hong

    2013-07-25

    Wnt7a is a member of the WNT gene family, which encodes secreted signaling proteins involved in many biological processes. Specifically, Wnt7a influences satellite stem cells and regulates the regenerative potential of muscle. However, comparable research on the bovine Wnt7a gene is lacking. Therefore, in this study, polymorphisms of the bovine Wnt7a gene were detected in 488 Chinese Qinchuan cattle by DNA pooling, forced PCR-RFLP, and DNA sequencing. Three novel SNPs were identified: two intronic SNPs (g.T4926C and g.A21943G) and one exonic SNP (g.C63777T). Five haplotypes spanning these three variant sites were identified and their effects on growth traits were analyzed. Haplotype 1 had the highest frequency and was significantly associated with body height (P<0.01), body weight (P<0.05), chest width (P<0.05) and height at hip cross (P<0.01). Copyright © 2013 Elsevier B.V. All rights reserved.

  19. Origin and Diversification Dynamics of Self-Incompatibility Haplotypes

    PubMed Central

    Gervais, Camille E.; Castric, Vincent; Ressayre, Adrienne; Billiard, Sylvain

    2011-01-01

    Self-incompatibility (SI) is a genetic system found in some hermaphrodite plants. Recognition of pollen by pistils expressing cognate specificities at two linked genes leads to rejection of self pollen and pollen from close relatives, i.e., to avoidance of self-fertilization and inbred matings, and thus increased outcrossing. These genes generally have many alleles, yet the conditions allowing the evolution of new alleles remain mysterious. Evolutionary changes are clearly necessary in both genes, since any mutation affecting only one of them would result in a nonfunctional self-compatible haplotype. Here, we study diversification at the S-locus (i.e., a stable increase in the total number of SI haplotypes in the population, through the incorporation of new SI haplotypes), both deterministically (by investigating analytically the fate of mutations in an infinite population) and by simulations of finite populations. We show that the conditions allowing diversification are far less stringent in finite populations with recurrent mutations of the pollen and pistil genes, suggesting that diversification is possible in a panmictic population. We find that new SI haplotypes emerge fastest in populations with few SI haplotypes, and we discuss some implications for empirical data on S-alleles. However, allele numbers in our simulations never reach values as high as observed in plants whose SI systems have been studied, and we suggest extensions of our models that may reconcile the theory and data. PMID:21515570

  20. Ancestral inference from haplotypes and mutations.

    PubMed

    Griffiths, Robert C; Tavaré, Simon

    2018-04-25

    We consider inference about the history of a sample of DNA sequences, conditional upon the haplotype counts and the number of segregating sites observed at the present time. After deriving some theoretical results in the coalescent setting, we implement rejection sampling and importance sampling schemes to perform the inference. The importance sampling scheme addresses an extension of the Ewens Sampling Formula for a configuration of haplotypes and the number of segregating sites in the sample. The implementations include both constant and variable population size models. The methods are illustrated by two human Y chromosome datasets. Copyright © 2018. Published by Elsevier Inc.
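
    For orientation, the standard Ewens Sampling Formula that the abstract builds on can be evaluated directly. The sketch below covers only the classical formula for a haplotype configuration, not the authors' extension that also conditions on the number of segregating sites; the function name `ewens_probability` and the argument layout are assumptions.

```python
from math import factorial


def ewens_probability(counts, theta):
    """Classical Ewens Sampling Formula.

    counts[j - 1] is the number of distinct haplotypes observed exactly
    j times in the sample; theta is the scaled mutation rate.
    """
    n = sum(j * a for j, a in enumerate(counts, start=1))
    rising = 1.0
    for i in range(n):
        rising *= theta + i  # theta * (theta + 1) * ... * (theta + n - 1)
    prob = factorial(n) / rising
    for j, a in enumerate(counts, start=1):
        prob *= theta ** a / (j ** a * factorial(a))
    return prob


if __name__ == "__main__":
    # All configurations of a sample of size 3; the probabilities sum to 1.
    configs = [(3, 0, 0), (1, 1, 0), (0, 0, 1)]
    print([ewens_probability(c, theta=1.0) for c in configs])
```

    Summing the formula over every configuration of a fixed sample size gives 1, which is a convenient sanity check before using it as a proposal or target in a sampling scheme.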

  1. Cloud Computing-Based TagSNP Selection Algorithm for Human Genome Data

    PubMed Central

    Hung, Che-Lun; Chen, Wen-Pei; Hua, Guan-Jie; Zheng, Huiru; Tsai, Suh-Jen Jane; Lin, Yaw-Ling

    2015-01-01

    Single nucleotide polymorphisms (SNPs) play a fundamental role in human genetic variation and are used in medical diagnostics, phylogeny construction, and drug design. They provide the highest-resolution genetic fingerprint for identifying disease associations and human features. Haplotypes are regions of linked genetic variants that are closely spaced on the genome and tend to be inherited together. Genetics research has revealed SNPs within certain haplotype blocks that introduce few distinct common haplotypes into most of the population. Haplotype block structures are used in association-based methods to map disease genes. In this paper, we propose an efficient algorithm for identifying haplotype blocks in the genome. In chromosomal haplotype data retrieved from the HapMap project website, the proposed algorithm identified longer haplotype blocks than an existing algorithm. To enhance its performance, we extended the proposed algorithm into a parallel algorithm that copies data in parallel via the Hadoop MapReduce framework. The proposed MapReduce-paralleled combinatorial algorithm performed well on real-world data obtained from the HapMap dataset; the improvement in computational efficiency was proportional to the number of processors used. PMID:25569088

  2. High Frequency of Haplotype HLA-DQ7 in Celiac Disease Patients from South Italy: Retrospective Evaluation of 5,535 Subjects at Risk of Celiac Disease

    PubMed Central

    Tinto, Nadia; Cola, Arturo; Piscopo, Chiara; Capuano, Marina; Galatola, Martina; Greco, Luigi; Sacchetti, Lucia

    2015-01-01

    Background Celiac disease (CD) has a strong genetic component mainly due to HLA DQ2/DQ8 encoding genes. However, a minority of CD patients are DQ2/DQ8-negative. To address this issue, we retrospectively characterized HLA haplotypes in 5,535 subjects at risk of CD (either relatives of CD patients or subjects with CD-like symptoms) referred to our center during a 10-year period. Methods We identified loci DQA1/DQB1/DRB1 by sequence-specific oligonucleotide-PCR and sequence-specific primer-PCR; anti-transglutaminase IgA/IgG and anti-endomysium IgA by ELISA and indirect immunofluorescence, respectively. Results We diagnosed CD in 666/5,535 individuals, 4.2% of whom were DQ2/DQ8-negative. Interestingly, DQ7 was one of the most abundant haplotypes in all CD patients and significantly more frequent in DQ2/DQ8-negative (38%) than in DQ2/DQ8-positive CD patients (24%) (p<0.05). Conclusion Our data lend support to the concept that DQ7 represents an additive or independent CD risk haplotype with respect to DQ2/DQ8 haplotypes but this finding should be verified in other large CD populations. PMID:26398634

  3. PWHATSHAP: efficient haplotyping for future generation sequencing.

    PubMed

    Bracciali, Andrea; Aldinucci, Marco; Patterson, Murray; Marschall, Tobias; Pisanti, Nadia; Merelli, Ivan; Torquati, Massimo

    2016-09-22

    Haplotype phasing is an important problem in the analysis of genomics information. Given a set of DNA fragments of an individual, it consists of determining which one of the possible alleles (alternative forms of a gene) each fragment comes from. Haplotype information is relevant to gene regulation, epigenetics, genome-wide association studies, evolutionary and population studies, and the study of mutations. Haplotyping is currently addressed as an optimisation problem aiming at solutions that minimise, for instance, error correction costs, where costs are a measure of the confidence in the accuracy of the information acquired from DNA sequencing. Solutions have typically an exponential computational complexity. WHATSHAP is a recent optimal approach which moves computational complexity from DNA fragment length to fragment overlap, i.e., coverage, and is hence of particular interest when considering sequencing technology's current trends that are producing longer fragments. Given the potential relevance of efficient haplotyping in several analysis pipelines, we have designed and engineered PWHATSHAP, a parallel, high-performance version of WHATSHAP. PWHATSHAP is embedded in a toolkit developed in Python and supports genomics datasets in standard file formats. Building on WHATSHAP, PWHATSHAP exhibits the same complexity exploring a number of possible solutions which is exponential in the coverage of the dataset. The parallel implementation on multi-core architectures allows for a relevant reduction of the execution time for haplotyping, while the provided results enjoy the same high accuracy as that provided by WHATSHAP, which increases with coverage. Due to its structure and management of the large datasets, the parallelisation of WHATSHAP posed demanding technical challenges, which have been addressed exploiting a high-level parallel programming framework. The result, PWHATSHAP, is a freely available toolkit that improves the efficiency of the analysis of genomics

  4. Summarizing techniques that combine three non-parametric scores to detect disease-associated 2-way SNP-SNP interactions.

    PubMed

    Sengupta Chattopadhyay, Amrita; Hsiao, Ching-Lin; Chang, Chien Ching; Lian, Ie-Bin; Fann, Cathy S J

    2014-01-01

    Identifying susceptibility genes that influence complex diseases is extremely difficult because loci often influence the disease state through genetic interactions. Numerous approaches to detect disease-associated SNP-SNP interactions have been developed, but none consistently generates high-quality results under different disease scenarios. Using summarizing techniques to combine a number of existing methods may provide a solution to this problem. Here we used three popular non-parametric methods (Gini, absolute probability difference (APD), and entropy) to develop two novel summary scores, namely the principal component score (PCS) and the Z-sum score (ZSS), with which to predict disease-associated genetic interactions. We used a simulation study to compare the performance of the non-parametric scores, the summary scores, the scaled-sum score (SSS; used in polymorphism interaction analysis (PIA)), and multifactor dimensionality reduction (MDR). The non-parametric methods achieved high power, but no non-parametric method outperformed all others under a variety of epistatic scenarios. PCS and ZSS, however, outperformed MDR. PCS, ZSS and SSS displayed controlled type I error rates (<0.05), whereas the Gini, APD and entropy scores (GS, APDS, ES) did not (>0.05). A real data study using the Genetic Analysis Workshop 16 (GAW 16) rheumatoid arthritis dataset identified a number of interesting SNP-SNP interactions. © 2013 Elsevier B.V. All rights reserved.

  5. A comprehensive SNP and indel imputability database.

    PubMed

    Duan, Qing; Liu, Eric Yi; Croteau-Chonka, Damien C; Mohlke, Karen L; Li, Yun

    2013-02-15

    Genotype imputation has become an indispensable step in genome-wide association studies (GWAS). Imputation accuracy, which directly influences downstream analysis, has been shown to improve with re-sequencing-based reference panels; however, this comes at the cost of a high computational burden due to the huge number of potentially imputable markers (tens of millions) discovered through sequencing a large number of individuals. Therefore, there is an increasing need for access to imputation quality information without actually conducting imputation. To facilitate this process, we have established a publicly available SNP and indel imputability database, aiming to provide direct access to imputation accuracy information for markers identified by the 1000 Genomes Project across four major populations and covering multiple GWAS genotyping platforms. SNP and indel imputability information can be retrieved through a user-friendly interface by providing the ID(s) of the desired variant(s) or by specifying the desired genomic region. The query results can be refined by selecting relevant GWAS genotyping platform(s). This is the first database providing variant imputability information specific to each continental group and to each genotyping platform. In Filipino individuals from the Cebu Longitudinal Health and Nutrition Survey, our database can achieve an area under the receiver-operating characteristic curve of 0.97, 0.91, 0.88 and 0.79 for markers with minor allele frequency >5%, 3-5%, 1-3% and 0.5-1%, respectively. Specifically, by filtering out 48.6% of markers (corresponding to a reduction of up to 48.6% in computational costs for actual imputation) based on the imputability information in our database, we can remove 77%, 58%, 51% and 42% of the poorly imputed markers at the cost of only 0.3%, 0.8%, 1.5% and 4.6% of the well-imputed markers with minor allele frequency >5%, 3-5%, 1-3% and 0.5-1%, respectively. http://www.unc.edu/~yunmli/imputability.html

  6. Complement factor H gene (CFH) polymorphisms C-257T, G257A and haplotypes are associated with protection against severe dengue phenotype, possible related with high CFH expression.

    PubMed

    Pastor, André F; Rodrigues Moura, Laís; Neto, José W D; Nascimento, Eduardo J M; Calzavara-Silva, Carlos E; Gomes, Ana Lisa V; Silva, Ana Maria da; Cordeiro, Marli T; Braga-Neto, Ulisses; Crovella, Sergio; Gil, Laura H V G; Marques, Ernesto T A; Acioli-Santos, Bartolomeu

    2013-09-01

    Four genetic polymorphisms located in the promoter (C-257T) and coding regions of the CFH gene (exon 2 G257A, exon 14 A2089G and exon 19 G2881T) were investigated in 121 dengue patients (DENV-3) in order to assess the relationship between allele/haplotype variants and clinical outcomes. A statistically significant association was found between the CFH -257T allele (TT/TC genotypes) and reduced susceptibility to severe dengue (SD). Individuals bearing a T allele also presented significantly higher protein levels in plasma. The -257T variant is located within an NF-κB binding site, suggesting that this variant might affect the ability of the CFH gene to respond to signals via the NF-κB pathway. The G257A allelic variant showed significant protection against severe dengue. When the effect of CFH haplotypes was considered, the ancestral CG/CG promoter-exon 2 SNP genotype showed significant risk of SD both in a general comparison (ancestral × all variant genotypes) and in individual genotype comparisons (ancestral × each variant genotype), where the most prevalent effect was observed in the CG/CG × CA/TG comparison. These findings support the involvement of the -257T and 257A allele variants and haplotypes in protection against the severe dengue phenotype, related to high basal CFH expression. Copyright © 2013 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.

  7. Electronic and spectroscopic characterizations of SNP isomers

    NASA Astrophysics Data System (ADS)

    Trabelsi, Tarek; Al Mogren, Muneerah Mogren; Hochlaf, Majdi; Francisco, Joseph S.

    2018-02-01

    High-level ab initio electronic structure calculations were performed to characterize SNP isomers. In addition to the known linear SNP, cyc-PSN, and linear SPN isomers, we identified a fourth isomer, linear PSN, which is located ~2.4 eV above the linear SNP isomer. The low-lying singlet and triplet electronic states of the linear SNP and SPN isomers were investigated using a multi-reference configuration interaction method and large basis set. Several bound electronic states were identified. However, their upper rovibrational levels were predicted to pre-dissociate, leading to S + PN, P + NS products, and multi-step pathways were discovered. For the ground states, a set of spectroscopic parameters were derived using standard and explicitly correlated coupled-cluster methods in conjunction with augmented correlation-consistent basis sets extrapolated to the complete basis set limit. We also considered scalar and core-valence effects. For linear isomers, the rovibrational spectra were deduced after generation of their 3D-potential energy surfaces along the stretching and bending coordinates and variational treatments of the nuclear motions.

  8. Recent Advances in Experimental Whole Genome Haplotyping Methods

    PubMed Central

    Huang, Mengting; Lu, Zuhong

    2017-01-01

    Haplotypes play a vital role in diverse fields; however, current sequencing technologies cannot resolve haplotypes directly. Pioneers demonstrated several approaches to resolving haplotypes in the early years, and these have been extensively reviewed. Since then, numerous methods have been developed that have significantly improved phasing performance. Here, we review experimental methods that have emerged mainly over the past five years and categorize them into five classes according to their maximum scale of contiguity: (i) encapsulation, (ii) 3D structure capture and construction, (iii) compartmentalization, (iv) fluorography, and (v) long-read sequencing. Each class is illustrated with subsections on specific methods. We also discuss the relative advantages and disadvantages of the different classes and compare representative methods of each class. PMID:28891974

  9. Distinct genotype distribution and haplotype profiles in MDR1 gene among Chinese Han, Bai, Wa and Tibetan ethnic groups.

    PubMed

    Lai, Yong; Huang, Min; Li, Hui; Wang, Xue-Ding; Li, Jia-Li

    2012-11-01

    P-Glycoprotein (P-gp, encoded by the MDR1 gene) plays an important role in determining the bioavailability and pharmacologic effects of many drugs. There is increasing evidence that P-gp activity may be genetically determined. In this study, we investigated the genotype distribution and the haplotype profiles of the MDR1 gene in Chinese Han, Bai, Wa and Tibetan subjects. Much lower frequencies of the 1236T allele and the 2677T allele were found in Wa subjects than in the other three ethnic groups, while the 2677A allele was about 6-fold more frequent in Han subjects than in subjects of the other three ethnic groups. The Han, Bai and Tibetan subjects share the same three predominant haplotypes (T-T-T, T-G-C and C-G-C); T-T-T is the most frequent and accounts for more than one third of the haplotypes in each of these three groups. In Wa subjects, however, T-T-T occurred at only 13.8% and was less common than T-G-C, T-G-T and C-G-C; furthermore, higher frequencies of T-G-T, C-T-C, C-G-T and C-T-T were observed in Wa subjects than in the other three ethnic groups. Frequencies of C-A-C and T-A-C in Han subjects were higher than those in the other three ethnic groups. The findings of this study will be of some relevance in predicting MDR1 phenotype and the pharmacokinetic and pharmacodynamic effects of many commonly used drugs that are P-gp substrates in these four Chinese ethnic groups.

  10. Haplotype combination of the bovine INSIG1 gene sequence variants and association with growth traits in Nanyang cattle.

    PubMed

    Sun, Jiajie; Gao, Yuan; Liu, Dong; Ma, Wei; Xue, Jing; Zhang, Chunlei; Lan, Xianyong; Lei, Chuzhao; Chen, Hong

    2012-06-01

    The insulin-induced gene 1 (INSIG1) encodes a protein that blocks proteolytic activation of sterol regulatory element binding proteins, which are transcription factors that activate genes regulating cholesterol, fatty acid, and glucose metabolism. However, comparable research on the bovine INSIG1 gene is lacking. Therefore, in this study, polymorphisms of the bovine INSIG1 gene were detected in 643 individuals from four cattle breeds by DNA pooling, forced PCR-RFLP, PCR-SSCP, and DNA sequencing methods. Ten novel SNPs were identified, which included four mutations in the coding region and the others in introns. In Nanyang individuals, seven common haplotypes were identified based on the four coding region SNPs. The haplotype GACT, with a frequency of 75.4%, was the most prevalent, and the SNPs formed two linkage disequilibrium blocks with strong multi-allelic D' (D' = 1). Additionally, association analysis between mutations of the bovine INSIG1 gene and growth traits in Nanyang cattle at 6, 12, 18, and 24 months of age was performed, and the results indicated that the polymorphisms were not significantly associated with body mass.
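
    The linkage disequilibrium measure D' reported above can be made concrete for the simplest case of two biallelic loci. The sketch below is the textbook two-allele definition rather than the multi-allelic D' used in the study, and the function name `ld_stats` and the example frequencies are hypothetical.

```python
def ld_stats(p_ab, p_a, p_b):
    """Return (D, D', r^2) for two biallelic loci.

    p_ab: frequency of the haplotype carrying allele A and allele B
    p_a, p_b: frequencies of allele A and allele B
    """
    d = p_ab - p_a * p_b
    # D' rescales |D| by the largest value it could take given the allele frequencies.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = abs(d) / d_max if d_max > 0 else 0.0
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    r2 = d * d / denom if denom > 0 else 0.0
    return d, d_prime, r2


if __name__ == "__main__":
    # Hypothetical frequencies: haplotypes AB = 0.5, Ab = 0.0, aB = 0.25, ab = 0.25.
    print(ld_stats(p_ab=0.5, p_a=0.5, p_b=0.75))
```

    In the hypothetical example one of the four possible haplotypes is absent, so D' equals 1 even though r^2 is only about 0.33. This is why D' = 1 indicates no observed historical recombination between sites rather than perfect correlation.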

  11. Increased Frequency of De Novo Copy Number Variations in Congenital Heart Disease by Integrative Analysis of SNP Array and Exome Sequence Data

    PubMed Central

    Rodriguez-Murillo, Laura; Fromer, Menachem; Mazaika, Erica; Vardarajan, Badri; Italia, Michael; Leipzig, Jeremy; DePalma, Steven R.; Golhar, Ryan; Sanders, Stephan J.; Yamrom, Boris; Ronemus, Michael; Iossifov, Ivan; Willsey, A. Jeremy; State, Matthew W.; Kaltman, Jonathan R.; White, Peter S.; Shen, Yufeng; Warburton, Dorothy; Brueckner, Martina; Seidman, Christine; Goldmuntz, Elizabeth; Gelb, Bruce D.; Lifton, Richard; Seidman, Jonathan; Hakonarson, Hakon; Chung, Wendy K.

    2014-01-01

    Rationale Congenital heart disease (CHD) is among the most common birth defects. Most cases are of unknown etiology. Objective To determine the contribution of de novo copy number variants (CNVs) in the etiology of sporadic CHD. Methods and Results We studied 538 CHD trios using genome-wide dense single nucleotide polymorphism (SNP) arrays and/or whole exome sequencing (WES). Results were experimentally validated using digital droplet PCR. We compared validated CNVs in CHD cases to CNVs in 1,301 healthy control trios. The two complementary high-resolution technologies identified 63 validated de novo CNVs in 51 CHD cases. A significant increase in CNV burden was observed when comparing CHD trios with healthy trios, using either SNP array (p=7×10⁻⁵, Odds Ratio (OR)=4.6) or WES data (p=6×10⁻⁴, OR=3.5) and remained after removing 16% of de novo CNV loci previously reported as pathogenic (p=0.02, OR=2.7). We observed recurrent de novo CNVs on 15q11.2 encompassing CYFIP1, NIPA1, and NIPA2 and single de novo CNVs encompassing DUSP1, JUN, JUP, MED15, MED9, PTPRE, SREBF1, TOP2A, and ZEB2, genes that interact with established CHD proteins NKX2-5 and GATA4. Integrating de novo variants in WES and CNV data suggests that ETS1 is the pathogenic gene altered by 11q24.2-q25 deletions in Jacobsen syndrome and that CTBP2 is the pathogenic gene in 10q sub-telomeric deletions. Conclusions We demonstrate a significantly increased frequency of rare de novo CNVs in CHD patients compared with healthy controls and suggest several novel genetic loci for CHD. PMID:25205790

  12. De novo assembly of a haplotype-resolved human genome.

    PubMed

    Cao, Hongzhi; Wu, Honglong; Luo, Ruibang; Huang, Shujia; Sun, Yuhui; Tong, Xin; Xie, Yinlong; Liu, Binghang; Yang, Hailong; Zheng, Hancheng; Li, Jian; Li, Bo; Wang, Yu; Yang, Fang; Sun, Peng; Liu, Siyang; Gao, Peng; Huang, Haodong; Sun, Jing; Chen, Dan; He, Guangzhu; Huang, Weihua; Huang, Zheng; Li, Yue; Tellier, Laurent C A M; Liu, Xiao; Feng, Qiang; Xu, Xun; Zhang, Xiuqing; Bolund, Lars; Krogh, Anders; Kristiansen, Karsten; Drmanac, Radoje; Drmanac, Snezana; Nielsen, Rasmus; Li, Songgang; Wang, Jian; Yang, Huanming; Li, Yingrui; Wong, Gane Ka-Shu; Wang, Jun

    2015-06-01

    The human genome is diploid, and knowledge of the variants on each chromosome is important for the interpretation of genomic information. Here we report the assembly of a haplotype-resolved diploid genome without using a reference genome. Our pipeline relies on fosmid pooling together with whole-genome shotgun strategies, based solely on next-generation sequencing and hierarchical assembly methods. We applied our sequencing method to the genome of an Asian individual and generated a 5.15-Gb assembled genome with a haplotype N50 of 484 kb. Our analysis identified previously undetected indels and 7.49 Mb of novel coding sequences that could not be aligned to the human reference genome, which include at least six predicted genes. This haplotype-resolved genome represents the most complete de novo human genome assembly to date. Application of our approach to identify individual haplotype differences should aid in translating genotypes to phenotypes for the development of personalized medicine.
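
    The N50 statistic quoted above is the length L such that blocks of length at least L contain half or more of the total assembled length. A minimal sketch follows, assuming a plain list of block lengths; the function name `n50` and the example lengths are hypothetical, and applying it to phased-block lengths to get a "haplotype N50" is an assumption about how the figure was derived.

```python
def n50(lengths):
    """Length of the block at which the running total, taken from the
    longest block downward, first reaches half of the summed length."""
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    return 0  # empty input


if __name__ == "__main__":
    # Hypothetical block lengths in kb.
    print(n50([2, 3, 4, 5, 6]))
```

    With the hypothetical lengths in the usage example the total is 20, and the two longest blocks (6 and 5) already cover 11, so the N50 is 5.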

  13. Genome-Wide Association Study for Identification and Validation of Novel SNP Markers for Sr6 Stem Rust Resistance Gene in Bread Wheat.

    PubMed

    Mourad, Amira M I; Sallam, Ahmed; Belamkar, Vikas; Wegulo, Stephen; Bowden, Robert; Jin, Yue; Mahdy, Ezzat; Bakheit, Bahy; El-Wafaa, Atif A; Poland, Jesse; Baenziger, Peter S

    2018-01-01

    Stem rust (caused by Puccinia graminis f. sp. tritici Erikss. & E. Henn.) is a major disease in wheat (Triticum aestivum L.). However, in recent years it has occurred rarely in Nebraska due to weather and the effective selection and pyramiding of resistance genes. To understand the genetic basis of stem rust resistance in Nebraska winter wheat, we applied a genome-wide association study (GWAS) to a set of 270 winter wheat genotypes (A-set). Genotyping was carried out using genotyping-by-sequencing and ∼35,000 high-quality SNPs were identified. The tested genotypes were evaluated for their resistance to the common stem rust race in Nebraska (QFCSC) in two replications. Marker-trait association identified 32 SNP markers on chromosome 2D that were significantly (Bonferroni corrected P < 0.05) associated with the resistance. The chromosomal location of the significant SNPs (chromosome 2D) matched the location of the Sr6 gene, which was expected in these genotypes based on pedigree information. Highly significant linkage disequilibrium (LD, r²) was found between the significant SNPs and the specific SSR marker for the Sr6 gene (Xcfd43). This suggests that the significant SNP markers are tagging the Sr6 gene. Out of the 32 significant SNPs, eight SNPs were in six genes that are annotated as being linked to disease resistance in the IWGSC RefSeq v1.0. The 32 significant SNP markers were located in nine haplotype blocks. All 32 significant SNPs were validated in a set of 60 different genotypes (V-set) using single marker analysis. SNP markers identified in this study can be used in marker-assisted selection, genomic selection, and to develop KASP (Kompetitive Allele Specific PCR) markers for the Sr6 gene. Novel SNPs for the Sr6 gene, an important stem rust resistance gene, were identified and validated in this study. These SNPs can be used to improve stem rust resistance in wheat.
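
    The Bonferroni correction mentioned above divides the significance level by the number of tests. The sketch below assumes one test per SNP and uses the rounded figure of about 35,000 SNPs from the abstract, so the threshold it prints is only indicative; the function names are hypothetical.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold that keeps the family-wise error rate at alpha."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


def bonferroni_adjust(p_values):
    """Bonferroni-adjusted p-values, capped at 1."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]


if __name__ == "__main__":
    # Assumes roughly 35,000 independent marker tests (approximate count from the abstract).
    print(bonferroni_threshold(0.05, 35_000))
```

    Under that assumption a marker would need a raw p-value below roughly 1.4e-6 for its corrected p-value to fall under 0.05.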

  14. Screening of Two ADH4 Variations in a Swedish Cluster Headache Case–Control Material

    PubMed Central

    Fourier, Carmen; Ran, Caroline; Steinberg, Anna; Sjöstrand, Christina; Waldenlind, Elisabet

    2016-01-01

    Background Cluster headache (CH) is a severe neurovascular disorder and an increasing amount of evidence points to a genetic contribution to this disease. When CH was first described, it was observed that alcohol may precipitate an attack during the active phase of the disease. The alcohol dehydrogenase 4 (ADH4) gene encodes an enzyme which contributes to the metabolization of alcohol and is, therefore, an interesting candidate gene for CH. Two Italian groups have reported association of the single nucleotide polymorphism (SNP) rs1126671 located in the ADH4 gene with an increased risk of CH in Italy. In addition, one of the groups found an association between the ADH4 SNP rs1800759 and CH. Objective To perform a replication study on the ADH4 SNPs rs1126671 and rs1800759 in a large homogeneous Swedish case–control cohort in order to further investigate the possible contribution of ADH4 to CH. Methods A total of 390 unrelated patients diagnosed with CH and 389 controls representing a general Swedish population were recruited to the study. DNA samples from patients and controls were genotyped for the two ADH4 SNPs rs1126671 and rs1800759 using quantitative real‐time polymerase chain reaction. Statistical analyses of genotype, allele and haplotype frequencies for the two SNPs were performed and compared between patients and controls. Results For rs1126671, the minor allele frequency (A allele) was 32.8% (n = 254) in controls compared with 31.9% (n = 249) in CH patients. The minor allele frequency (A allele) of rs1800759 was 42.3% (n = 324) in controls and 41.9% (n = 327) in CH patients. Statistical analysis showed no significant differences in allele as well as in genotype or haplotype frequencies between the patient and control group for either SNP. This was also seen after stratifying the patient group for experiencing alcohol as a trigger factor. Conclusions The data did not support an association of the ADH4 SNPs rs1126671 and rs1800759 with CH

  15. World-wide distributions of lactase persistence alleles and the complex effects of recombination and selection.

    PubMed

    Liebert, Anke; López, Saioa; Jones, Bryony Leigh; Montalva, Nicolas; Gerbault, Pascale; Lau, Winston; Thomas, Mark G; Bradman, Neil; Maniatis, Nikolas; Swallow, Dallas M

    2017-11-01

    The genetic trait of lactase persistence (LP) is associated with at least five independent functional single nucleotide variants in a regulatory region about 14 kb upstream of the lactase gene [-13910*T (rs4988235), -13907*G (rs41525747), -13915*G (rs41380347), -14009*G (rs869051967) and -14010*C (rs145946881)]. These alleles have been inferred to have spread recently and present-day frequencies have been attributed to positive selection for the ability of adult humans to digest lactose without risk of symptoms of lactose intolerance. One of the inferential approaches used to estimate the level of past selection has been to determine the extent of haplotype homozygosity (EHH) of the sequence surrounding the SNP of interest. We report here new data on the frequencies of the known LP alleles in the 'Old World' and their haplotype lineages. We examine and confirm EHH of each of the LP alleles in relation to their distinct lineages, but also show marked EHH for one of the older haplotypes that does not carry any of the five LP alleles. The region of EHH of this (B) haplotype exactly coincides with a region of suppressed recombination that is detectable in families as well as in population data, and the results show how such suppression may have exaggerated haplotype-based measures of past selection.
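
As an illustration of the EHH measure mentioned above, the following is a minimal sketch using the commonly cited definition: among chromosomes carrying a core allele, EHH is the probability that two randomly chosen chromosomes are identical across the examined interval. The toy haplotypes, the symmetric window around the core, and the function name `ehh` are assumptions made for illustration; they are not the authors' data or pipeline.

```python
from collections import Counter
from math import comb

def ehh(haplotypes, core_index, core_allele, distance):
    """EHH among carriers of core_allele, over `distance` markers on each side of the core."""
    lo = max(0, core_index - distance)
    hi = core_index + distance + 1
    carriers = [h[lo:hi] for h in haplotypes if h[core_index] == core_allele]
    if len(carriers) < 2:
        return float("nan")  # homozygosity is undefined for fewer than two chromosomes
    groups = Counter(carriers)  # identical extended haplotypes fall in the same group
    return sum(comb(n, 2) for n in groups.values()) / comb(len(carriers), 2)

# Toy phased haplotypes (hypothetical); the core SNP is at index 2.
haps = ["ACTGA", "ACTGA", "ACTGC", "GCTGA", "ACAGA"]
for dist in (0, 1, 2):
    print(dist, ehh(haps, core_index=2, core_allele="T", distance=dist))
```

EHH starts at 1 at the core and decays as recombination and mutation break up the surrounding haplotype. A slow decay is usually read as a sign of recent selection, which is why a region of suppressed recombination can inflate the signal.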

  16. Association of SNP3 polymorphism in the apolipoprotein A-V gene with plasma triglyceride level in Tunisian type 2 diabetes

    PubMed Central

    Chaaba, Raja; Attia, Nebil; Hammami, Sonia; Smaoui, Maha; Mahjoub, Sylvia; Hammami, Mohamed; Masmoudi, Ahmed Slaheddine

    2005-01-01

    Background The apolipoprotein A-V (Apo A-V) gene has recently been identified as encoding a new apolipoprotein involved in triglyceride metabolism. A single nucleotide polymorphism (SNP3) located in the gene promoter (-1131) was associated with triglyceride variation in healthy subjects. In type 2 diabetes the triglyceride level is increased compared to healthy subjects. Hypertriglyceridemia is a risk factor for coronary artery disease. We aimed to examine the interaction between SNP3 and lipid profile and coronary artery disease (CAD) in Tunisian type 2 diabetic patients. Results The genotype frequencies of T/T, T/C and C/C were 0.74, 0.23 and 0.03 respectively in non-diabetic subjects, and 0.71, 0.25 and 0.04 respectively in type 2 diabetic patients. Triglyceride level was higher in the heterozygous genotype (-1131 T/C) of apo A-V (p = 0.024). The heterozygous genotype was more frequent in the high triglyceride group (40.9%) than in the low triglyceride group (18.8%); p = 0.011. Despite the relation between CAD and hypertriglyceridemia, SNP3 was not associated with CAD. Conclusion In type 2 diabetic patients SNP3 is associated with triglyceride level; however, there was no association between SNP3 and coronary artery disease. PMID:15636639
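
The reported genotype frequencies can be converted to allele frequencies, and compared with Hardy-Weinberg expectations, by simple gene counting. The sketch below uses the frequencies quoted in the abstract; the function names are hypothetical and the Hardy-Weinberg comparison is an illustrative addition, not an analysis reported by the authors.

```python
def allele_freqs(f_tt, f_tc, f_cc):
    """Allele frequencies from genotype frequencies (each heterozygote carries one copy of each allele)."""
    total = f_tt + f_tc + f_cc
    p_t = (f_tt + 0.5 * f_tc) / total
    return p_t, 1.0 - p_t

def hwe_expected(p, q):
    """Expected genotype frequencies (TT, TC, CC) under Hardy-Weinberg equilibrium."""
    return p * p, 2 * p * q, q * q

groups = {
    "non-diabetic": (0.74, 0.23, 0.03),
    "type 2 diabetic": (0.71, 0.25, 0.04),
}
for name, genotypes in groups.items():
    p_t, q_c = allele_freqs(*genotypes)
    expected = tuple(round(x, 3) for x in hwe_expected(p_t, q_c))
    print(name, round(p_t, 3), round(q_c, 3), expected)
```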

  17. Mineralocorticoid receptor haplotypes sex-dependently moderate depression susceptibility following childhood maltreatment.

    PubMed

    Vinkers, Christiaan H; Joëls, Marian; Milaneschi, Yuri; Gerritsen, Lotte; Kahn, René S; Penninx, Brenda W J H; Boks, Marco P M

    2015-04-01

    The MR is an important regulator of the hypothalamic-pituitary-adrenal (HPA) axis and a prime target for corticosteroids. There is increasing evidence from both clinical and preclinical studies that the MR has different effects on behavior and mood in males and females. To investigate the hypothesis that the MR sex-dependently influences the relation between childhood maltreatment and depression, we investigated three common and functional MR haplotypes (GA, CA, and CG haplotype, based on rs5522 and rs2070951) in a population-based cohort (N = 665) and an independent clinical cohort from the Netherlands Study of Depression and Anxiety (NESDA) (N = 1639). The CA haplotype sex-dependently moderated the relation between childhood maltreatment and depressive symptoms both in the population-based sample (sex × maltreatment × haplotype: β = -4.07, P = 0.029) and in the clinical sample (sex × maltreatment × haplotype: β = -2.40, P = 0.011). Specifically, female individuals in the population-based sample were protected (β = -4.58, P = 2.0 × 10⁻⁵), whereas males in the clinical sample were at increased risk (β = 2.54, P = 0.0022). In line with these results, female GA haplotype carriers displayed increased vulnerability in the population-based sample (β = 4.58, P = 7.5 × 10⁻⁵) whereas male CG-carriers showed increased resilience in the clinical sample (β = -2.71, P = 0.016). Consistently, we found a decreased lifetime MDD risk for male GA haplotype carriers following childhood maltreatment but an increased risk for male CA haplotype carriers in the clinical sample. In both samples, sex-dependent effects were observed for GA-GA diplotype carriers. In summary, sex plays an important role in determining whether functional genetic variation in MR is beneficial or detrimental, with an apparent female advantage for the CA haplotype but male advantage for the GA and CG haplotype. These sex-dependent effects of MR on depression susceptibility following childhood

  18. Association of an MHC Class II Haplotype with Increased Risk of Polymyositis in Hungarian Vizsla Dogs

    PubMed Central

    Massey, Jonathan; Rothwell, Simon; Rusbridge, Clare; Tauro, Anna; Addicott, Diane; Chinoy, Hector; Cooper, Robert G.; Ollier, William E. R.; Kennedy, Lorna J.

    2013-01-01

    A breed-specific polymyositis is frequently observed in the Hungarian Vizsla. Beneficial clinical response to immunosuppressive therapies has been demonstrated which points to an immune-mediated aetiology. Canine inflammatory myopathies share clinical and histological similarities with the human immune-mediated myopathies. As MHC class II associations have been reported in the human conditions we investigated whether an MHC class II association was present in the canine myopathy seen in this breed. 212 Hungarian Vizsla pedigree dogs were stratified both on disease status and degree of relatedness to an affected dog. This generated a group of 29 cases and 183 “graded” controls: 93 unaffected dogs with a first degree affected relative, 44 unaffected dogs with a second degree affected relative, and 46 unaffected dogs with no known affected relatives. Eleven DLA class II haplotypes were identified, of which, DLA-DRB1*02001/DQA1*00401/DQB1*01303, was at significantly raised frequency in cases compared to controls (OR = 1.92, p = 0.032). When only control dogs with no family history of the disease were compared to cases, the association was further strengthened (OR = 4.08, p = 0.00011). Additionally, a single copy of the risk haplotype was sufficient to increase disease risk, with the risk substantially increasing for homozygotes. There was a trend of increasing frequency of this haplotype with degree of relatedness, indicating low disease penetrance. These findings support the hypothesis of an immune-mediated aetiology for this canine myopathy and give credibility to potentially using the Hungarian Vizsla as a genetic model for comparative studies with human myositis. PMID:23457575

  19. HLA-E regulatory and coding region variability and haplotypes in a Brazilian population sample.

    PubMed

    Ramalho, Jaqueline; Veiga-Castelli, Luciana C; Donadi, Eduardo A; Mendes-Junior, Celso T; Castelli, Erick C

    2017-11-01

    The HLA-E gene is characterized by low but wide expression on different tissues. HLA-E is considered a conserved gene, being one of the least polymorphic class I HLA genes. The HLA-E molecule interacts with Natural Killer cell receptors and T lymphocyte receptors, and might activate or inhibit immune responses depending on the peptide associated with HLA-E and on the receptors with which HLA-E interacts. Variable sites within the HLA-E regulatory and coding segments may influence the gene function by modifying its expression pattern or encoded molecule, thus influencing its interaction with receptors and the peptide. Here we propose an approach to evaluate the gene structure, haplotype pattern and the complete HLA-E variability, including regulatory (promoter and 3'UTR) and coding segments (with introns), by using massively parallel sequencing. We investigated the variability of 420 samples from a very admixed population such as Brazilians by using this approach. Considering a segment of about 7 kb, 63 variable sites were detected, arranged into 75 extended haplotypes. We detected 37 different promoter sequences (but few frequent ones), 27 different coding sequences (15 representing new HLA-E alleles) and 12 haplotypes at the 3'UTR segment, two of them presenting a summed frequency of 90%. Despite the number of coding alleles, they encode mainly two different full-length molecules, known as E*01:01 and E*01:03, which correspond to about 90% of all. In addition, differently from what has been previously observed for other non-classical HLA genes, the relationship among the HLA-E promoter, coding and 3'UTR haplotypes is not straightforward because the same promoter and 3'UTR haplotypes were many times associated with different HLA-E coding haplotypes. This data reinforces the presence of only two main full-length HLA-E molecules encoded by the many HLA-E alleles detected in our population sample. In addition, this data does indicate that the distal HLA-E promoter is by

  20. Toll-like receptor 4 polymorphisms and their haplotypes modulate the risk of developing diabetic retinopathy in type 2 diabetes patients

    PubMed Central

    Singh, Kanhaiya; Kant, Shri; Singh, Vivek Kumar; Agrawal, Neeraj K.; Gupta, Sanjeev K.

    2014-01-01

    Purpose Persistent inflammation and impaired neovascularization in type 2 diabetes mellitus (T2DM) patients may lead to development of macro- and microvascular complications. Diabetic retinopathy (DR) is one of the secondary microvascular complications of T2DM. Improper activation of the innate immune system may be an important contributor in the pathophysiology of DR. Toll-like receptor 4 (TLR4) is an important mediator of innate immunity, and genetic alterations in TLR4 support inflammation in the hyperglycemic condition. The present work was designed to investigate whether the TLR4 single nucleotide polymorphisms (SNPs) rs4986790, rs4986791, rs10759931, rs1927911, and rs1927914 are associated with DR in a north Indian population. Methods The study group of 698 individuals (128 DR, 250 T2DM, 320 controls) was genotyped by PCR-RFLP. Haplotype and linkage disequilibrium between SNPs were determined using Haploview software. Results Combined risk genotypes of TLR4 SNPs rs10759931 (odds ratio [OR] 1.50, p = 0.05) and rs1927914 (OR 1.48, p = 0.05) were found to be significantly associated with pathogenesis of DR. A total of 14 haplotypes with frequency >1% were obtained using Haploview software. Haplotypes ACATC (37.5%) and ACATT (14.8%) were the two most common haplotypes obtained. Conclusions Results of the present case-control study that included 698 north Indian subjects suggested that TLR4 SNPs rs10759931 and rs1927914 modulate the risk of DR in T2DM cases. Association analysis using haplotypes showed none of the haplotypes were associated with either susceptibility or resistance to DR in a north Indian population. PMID:24883015

  1. Haplotype specific alteration of diabetes MHC risk by olfactory receptor gene polymorphism.

    PubMed

    Jahromi, Mohamed M

    2012-12-01

    Evidence for genes associated with risk for Type 1 diabetes (T1D) in the extended region of the major histocompatibility complex (MHC) genes is accumulating. The aim of this study was to investigate the association pattern of the extended MHC region with T1D susceptibility to identify effects independent of well established DR/DQ genes. A total of 394 Europid families with T1D were genotyped for the single nucleotide polymorphism (SNP) in the olfactory receptor family 14, subfamily J, member 1 (OR14J1) gene, rs9257691, in the MHC telomeric region. The OR provides "an internal depiction of our external world" through the capture of odorant molecules in the main OR system by several large families of G-protein coupled receptors (GPCR). These receptors transduce chemosignals into the central nervous system (CNS). This SNP was chosen to identify its association with T1D. Interestingly, the OR14J1C allele was significantly associated with T1D, an association that seems to go with DRB1*0401 (χ² = 10.9, p = 0.0003). However, after fixing both genes of the high-risk DR*0401-DQB1*0302 haplotype, the association of T1D with OR14J1C still existed (χ² = 7.4, p = 0.005). The association of the OR14J1C allele with T1D in patients with DRB1*401/DQB1*0302 is an independent risk for T1D. As accumulating reports suggest a role of OR in the pathogenesis of diabetic microvascular and other diabetic complications, undoubtedly, this haplotype specific alteration of T1D risk is an independent risk for the disease and can point to a promising MHC-linked gene other than DR/DQ. Moreover, nothing rules out that this might be a signal identifying the role of the OR gene in the pathogenesis of T1D in patients who are prone to diabetic complications. Copyright © 2012. Published by Elsevier B.V.

  2. Rice SNP-seek database update: new SNPs, indels, and queries.

    PubMed

    Mansueto, Locedie; Fuentes, Roven Rommel; Borja, Frances Nikki; Detras, Jeffery; Abriol-Santos, Juan Miguel; Chebotarov, Dmytro; Sanciangco, Millicent; Palis, Kevin; Copetti, Dario; Poliakov, Alexandre; Dubchak, Inna; Solovyev, Victor; Wing, Rod A; Hamilton, Ruaraidh Sackville; Mauleon, Ramil; McNally, Kenneth L; Alexandrov, Nickolai

    2017-01-04

    We describe updates to the Rice SNP-Seek Database since its first release. We ran a new SNP-calling pipeline followed by filtering that resulted in complete, base, filtered and core SNP datasets. Besides the Nipponbare reference genome, the pipeline was run on genome assemblies of IR 64, 93-11, DJ 123 and Kasalath. New genotype query and display features are added for reference assemblies, SNP datasets and indels. JBrowse now displays BAM, VCF and other annotation tracks, the additional genome assemblies and an embedded VISTA genome comparison viewer. Middleware is redesigned for improved performance by using a hybrid of HDF5 and RDMS for genotype storage. Query modules for genotypes, varieties and genes are improved to handle various constraints. An integrated list manager allows the user to pass query parameters for further analysis. The SNP Annotator adds traits, ontology terms, effects and interactions to markers in a list. Web-service calls were implemented to access most data. These features enable seamless querying of SNP-Seek across various biological entities, a step toward semi-automated gene-trait association discovery. URL: http://snp-seek.irri.org. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  3. Candidate Gene Approach for Parasite Resistance in Sheep – Variation in Immune Pathway Genes and Association with Fecal Egg Count

    PubMed Central

    Periasamy, Kathiravan; Pichler, Rudolf; Poli, Mario; Cristel, Silvina; Cetrá, Bibiana; Medus, Daniel; Basar, Muladno; A. K., Thiruvenkadan; Ramasamy, Saravanan; Ellahi, Masroor Babbar; Mohammed, Faruque; Teneva, Atanaska; Shamsuddin, Mohammed; Podesta, Mario Garcia; Diallo, Adama

    2014-01-01

    Sheep chromosome 3 (Oar3) has the largest number of QTLs reported to be significantly associated with resistance to gastro-intestinal nematodes. This study aimed to identify single nucleotide polymorphisms (SNPs) within candidate genes located in sheep chromosome 3 as well as genes involved in major immune pathways. A total of 41 SNPs were identified across 38 candidate genes in a panel of unrelated sheep and genotyped in 713 animals belonging to 22 breeds across Asia, Europe and South America. The variations and evolution of immune pathway genes were assessed in sheep populations across these macro-environmental regions that significantly differ in the diversity and load of pathogens. The mean minor allele frequency (MAF) did not vary between Asian and European sheep reflecting the absence of ascertainment bias. Phylogenetic analysis revealed two major clusters with most of South Asian, South East Asian and South West Asian breeds clustering together while European and South American sheep breeds clustered together distinctly. Analysis of molecular variance revealed strong phylogeographic structure at loci located in immune pathway genes, unlike microsatellite and genome wide SNP markers. To understand the influence of natural selection processes, SNP loci located in chromosome 3 were utilized to reconstruct haplotypes, the diversity of which showed significant deviations from selective neutrality. Reduced Median network of reconstructed haplotypes showed balancing selection in force at these loci. Preliminary association of SNP genotypes with phenotypes recorded 42 days post challenge revealed significant differences (P<0.05) in fecal egg count, body weight change and packed cell volume at two, four and six SNP loci respectively. In conclusion, the present study reports strong phylogeographic structure and balancing selection operating at SNP loci located within immune pathway genes. Further, SNP loci identified in the study were found to have potential for

  4. Polymorphism at Expressed DQ and DR Loci in Five Common Equine MHC Haplotypes

    PubMed Central

    Miller, Donald; Tallmadge, Rebecca L.; Binns, Matthew; Zhu, Baoli; Mohamoud, Yasmin Ali; Ahmed, Ayeda; Brooks, Samantha A.; Antczak, Douglas F.

    2016-01-01

    The polymorphism of Major Histocompatibility Complex (MHC) class II DQ and DR genes in five common Equine Leukocyte Antigen (ELA) haplotypes was determined through sequencing of mRNA transcripts isolated from lymphocytes of eight ELA homozygous horses. Ten expressed MHC class II genes were detected in horses of the ELA-A3 haplotype carried by the donor horses of the equine Bacterial Artificial Chromosome (BAC) library and the reference genome sequence: four DR genes and six DQ genes. The other four ELA haplotypes contained at least eight expressed polymorphic MHC class II loci. Next Generation Sequencing (NGS) of genomic DNA of these four MHC haplotypes revealed stop codons in the DQA3 gene in the ELA-A2, ELA-A5, and ELA-A9 haplotypes. Few NGS reads were obtained for the other MHC class II genes that were not amplified in these horses. The amino acid sequences across haplotypes contained locus-specific residues, and the locus clusters produced by phylogenetic analysis were well supported. The MHC class II alleles within the five tested haplotypes were largely non-overlapping between haplotypes. The complement of equine MHC class II DQ and DR genes appears to be well conserved between haplotypes, in contrast to the recently described variation in class I gene loci between equine MHC haplotypes. The identification of allelic series of equine MHC class II loci will aid comparative studies of mammalian MHC conservation and evolution and may also help to interpret associations between the equine MHC class II region and diseases of the horse. PMID:27889800

  5. Report on the development of putative functional SSR and SNP markers in passion fruits.

    PubMed

    da Costa, Zirlane Portugal; Munhoz, Carla de Freitas; Vieira, Maria Lucia Carneiro

    2017-09-06

    Passionflowers Passiflora edulis and Passiflora alata are diploid, outcrossing and understudied fruit bearing species. In Brazil, passion fruit cultivation began relatively recently and has earned the country an outstanding position as the world's top producer of passion fruit. The fruit's main economic value lies in the production of juice, an essential exotic ingredient in juice blends. Currently, crop improvement strategies, including those for underexploited tropical species, tend to incorporate molecular genetic approaches. In this study, we examined a set of P. edulis transcripts expressed in response to infection by Xanthomonas axonopodis, (the passion fruit's main bacterial pathogen that attacks the vines), aiming at the development of putative functional markers, i.e. SSRs (simple sequence repeats) and SNPs (single nucleotide polymorphisms). A total of 210 microsatellites were found in 998 sequences, and trinucleotide repeats were found to be the most frequent (31.4%). Of the sequences selected for designing primers, 80.9% could be used to develop SSR markers, and 60.6% SNP markers for P. alata. SNPs were all biallelic and found within 15 gene fragments of P. alata. Overall, gene fragments generated 10,003 bp. SNP frequency was estimated as one SNP every 294 bp. Polymorphism rates revealed by SSR and SNP loci were 29.4 and 53.6%, respectively. Passiflora edulis transcripts were useful for the development of putative functional markers for P. alata, suggesting a certain level of sequence conservation between these cultivated species. The markers developed herein could be used for genetic mapping purposes and also in diversity studies.
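
Microsatellite (SSR) discovery of the kind described above can be sketched with a regular-expression scan for tandemly repeated short motifs. The abstract does not state the search tool or the minimum repeat numbers, so the thresholds below (6 dinucleotide, 5 trinucleotide, 4 tetranucleotide repeats), the function name `find_ssrs` and the test sequence are all assumptions for illustration.

```python
import re

# Assumed minimum number of tandem copies per motif length (not taken from the study).
DEFAULT_MIN_REPEATS = {2: 6, 3: 5, 4: 4}

def find_ssrs(seq, min_repeats=None):
    """Return sorted (start, motif, copies) tuples for perfect tandem repeats in seq."""
    min_repeats = min_repeats or DEFAULT_MIN_REPEATS
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in min_repeats.items():
        # Group 2 captures the motif; \2{n,} requires at least n further copies.
        pattern = re.compile(r"(([ACGT]{%d})\2{%d,})" % (unit_len, min_rep - 1))
        for m in pattern.finditer(seq):
            unit = m.group(2)
            # Skip motifs that are themselves repeats of a shorter unit (e.g. AA, ATAT).
            if (unit + unit).find(unit, 1) != len(unit):
                continue
            hits.append((m.start(), unit, len(m.group(1)) // unit_len))
    return sorted(hits)

print(find_ssrs("TTCAGCAGCAGCAGCAGTT"))  # one trinucleotide SSR: CAG repeated 5 times
```

Counting hits by motif length over a set of transcripts would give the kind of class distribution reported above, in which trinucleotide repeats were the most frequent.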

  6. Genetic variation in C-reactive protein (CRP) gene may be associated with risk of systemic lupus erythematosus and CRP concentrations.

    PubMed

    Shih, P Betty; Manzi, Susan; Shaw, Penny; Kenney, Margaret; Kao, Amy H; Bontempo, Franklin; Barmada, M Michael; Kammerer, Candace; Kamboh, M Ilyas

    2008-11-01

    The gene coding for C-reactive protein (CRP) is located on chromosome 1q23.2, which falls within a linkage region thought to harbor a systemic lupus erythematosus (SLE) susceptibility gene. Recently, 2 single-nucleotide polymorphisms (SNP) in the CRP gene (+838, +2043) have been shown to be associated with CRP concentrations and/or SLE risk in a British family-based cohort. Our study was done to confirm the reported association in an independent population-based case-control cohort, and also to investigate the influence of 3 additional CRP tagSNP (-861, -390, +90) on SLE risk and serum CRP concentrations. DNA from 337 Caucasian women who met the American College of Rheumatology criteria for definite (n = 324) or probable (n = 13) SLE and 448 Caucasian healthy female controls was genotyped for 5 CRP tagSNP (-861, -390, +90, +838, +2043). Genotyping was performed using restriction fragment length polymorphism-polymerase chain reaction, pyrosequencing, or TaqMan assays. Serum CRP levels were measured using ELISA. Association studies were performed using the chi-squared distribution, Z-test, Fisher's exact test, and analysis of variance. Haplotype analysis was performed using EH software and the haplo.stats package in R 2.1.2. While none of the SNP were found to be associated with SLE risk individually, there was an association with the 5 SNP haplotypes (p < 0.001). Three SNP (-861, -390, +90) were found to significantly influence serum CRP level in SLE cases, both independently and as haplotypes. Our data suggest that unique haplotype combinations in the CRP gene may modify the risk of developing SLE and influence circulating CRP levels.

  7. Analysis of extended human leukocyte antigen haplotype association with Addison's disease in three populations.

    PubMed

    Gombos, Z; Hermann, R; Kiviniemi, M; Nejentsev, S; Reimand, K; Fadeyev, V; Peterson, P; Uibo, R; Ilonen, J

    2007-12-01

    Addison's disease is an organ-specific autoimmune disorder with a polygenic background. The aim of the study was to identify non-class II human leukocyte antigen (HLA) susceptibility genes for Addison's disease. Addison's disease patients from three European populations were analysed for selected HLA-DR-DQ alleles and for 11 microsatellite markers covering approximately 4 Mb over the HLA region. Subjects were 69 patients with Addison's disease from Estonia (24), Finland (14) and Russia (31). Consecutively recruited healthy newborns from the same geographical regions were used as controls (269 Estonian, 1000 Finnish and 413 Russian). Association measures for HLA-DRB1, DQB1, DQA1 and 11 microsatellites between D6S273 and D6S2223 were taken. A low-resolution full-house typing was used for HLA class II genes, while microsatellite markers were studied using fluorescence-based DNA fragment sizing technology. We confirmed that the HLA-DR3-DQ2 and the DQB1*0302-DRB1*0404 haplotypes confer disease susceptibility. In Russian patients, we also found an increase of DRB1*0403 allele, combined with DQB1*0305 allele in three out of six cases (P<0.0001). Analysis of 11 microsatellite markers including STR MICA confirmed the strong linkage in DR3-DQ2 haplotypes but DRB1*0404-DQB1*0302 haplotypes were diverse. MICA5.1 allele was found in 22 out of 24 Estonian patients, but results from Finnish and Russian patients did not support its independent role in disease susceptibility. HLA-DRB1*0403 was identified as a novel susceptibility allele for Addison's disease. Additionally, we found no evidence of a non-class II HLA disease susceptibility locus; however, the HLA-DR3-DQ2 haplotype appeared more conserved in patient groups with high DR-DQ2 frequencies.

  8. Introgression of Neandertal- and Denisovan-like Haplotypes Contributes to Adaptive Variation in Human Toll-like Receptors

    PubMed Central

    Dannemann, Michael; Andrés, Aida M.; Kelso, Janet

    2016-01-01

    Pathogens and the diseases they cause have been among the most important selective forces experienced by humans during their evolutionary history. Although adaptive alleles generally arise by mutation, introgression can also be a valuable source of beneficial alleles. Archaic humans, who lived in Europe and Western Asia for more than 200,000 years, were probably well adapted to this environment and its local pathogens. It is therefore conceivable that modern humans entering Europe and Western Asia who admixed with them obtained a substantial immune advantage from the introgression of archaic alleles. Here we document a cluster of three Toll-like receptors (TLR6-TLR1-TLR10) in modern humans that carries three distinct archaic haplotypes, indicating repeated introgression from archaic humans. Two of these haplotypes are most similar to the Neandertal genome, and the third haplotype is most similar to the Denisovan genome. The Toll-like receptors are key components of innate immunity and provide an important first line of immune defense against bacteria, fungi, and parasites. The unusually high allele frequencies and unexpected levels of population differentiation indicate that there has been local positive selection on multiple haplotypes at this locus. We show that the introgressed alleles have clear functional effects in modern humans; archaic-like alleles underlie differences in the expression of the TLR genes and are associated with reduced microbial resistance and increased allergic disease in large cohorts. This provides strong evidence for recurrent adaptive introgression at the TLR6-TLR1-TLR10 locus, resulting in differences in disease phenotypes in modern humans. PMID:26748514

  9. Intragenic SNP haplotypes associated with 84dup18 mutation in TNFRSF11A in four FEO pedigrees suggest three independent origins for this mutation.

    PubMed

    Elahi, Elahe; Shafaghati, Yousef; Asadi, Sareh; Absalan, Farnaz; Goodarzi, Hani; Gharaii, Nava; Karimi-Nejad, Mohammad Hassan; Shahram, Farhad; Hughes, Anne E

    2007-01-01

    Familial expansile osteolysis (FEO) is a rare disorder causing bone dysplasia. The clinical features of FEO include early-onset hearing loss, tooth destruction, and progressive lytic expansion within limb bones causing pain, fracture, and deformity. An 18-bp duplication in the first exon of the TNFRSF11A gene encoding RANK has been previously identified in four FEO pedigrees. Despite having the identical mutation, phenotypic variations among affected individuals of the same and different pedigrees were noted. Another 18-bp duplication, one base proximal to the duplication previously reported, was subsequently found in two unrelated FEO patients. Finally, mutations overlapping with the mutations found in the FEO pedigrees have been found in ESH and early-onset PDB pedigrees. An Iranian FEO pedigree that contains six affected individuals dispersed in three generations has previously been introduced; here, the clinical features of the proband are reported in greater detail, and the genetic defect of the pedigree is presented. Direct sequencing of the entire coding region and upstream and downstream noncoding regions of TNFRSF11A in her DNA revealed the same 18-bp duplication mutation as previously found in the four FEO pedigrees. Additionally, eight sequence variations as compared to the TNFRSF11A reference sequence were identified, and a haplotype linked to the mutation based on these variations was defined. Although the mutation in the Iranian and four of the previously described FEO pedigrees was the same, haplotypes based on the intragenic SNPs suggest that the mutations do not share a common descent.

  10. When Whole-Genome Alignments Just Won't Work: kSNP v2 Software for Alignment-Free SNP Discovery and Phylogenetics of Hundreds of Microbial Genomes

    PubMed Central

    Gardner, Shea N.; Hall, Barry G.

    2013-01-01

    Effective use of rapid and inexpensive whole genome sequencing for microbes requires fast, memory efficient bioinformatics tools for sequence comparison. The kSNP v2 software finds single nucleotide polymorphisms (SNPs) in whole genome data. kSNP v2 has numerous improvements over kSNP v1 including SNP gene annotation; better scaling for draft genomes available as assembled contigs or raw, unassembled reads; a tool to identify the optimal value of k; distribution of packages of executables for Linux and Mac OS X for ease of installation and user-friendly use; and a detailed User Guide. SNP discovery is based on k-mer analysis, and requires no multiple sequence alignment or the selection of a single reference genome. Most target sets with hundreds of genomes complete in minutes to hours. SNP phylogenies are built by maximum likelihood, parsimony, and distance, based on all SNPs, only core SNPs, or SNPs present in some intermediate user-specified fraction of targets. The SNP-based trees that result are consistent with known taxonomy. kSNP v2 can handle many gigabases of sequence in a single run, and if one or more annotated genomes are included in the target set, SNPs are annotated with protein coding and other information (UTRs, etc.) from Genbank file(s). We demonstrate application of kSNP v2 on sets of viral and bacterial genomes, and discuss in detail analysis of a set of 68 finished E. coli and Shigella genomes and a set of the same genomes to which have been added 47 assemblies and four “raw read” genomes of O104:H4 strains from the recent European E. coli outbreak that resulted in both bloody diarrhea and hemolytic uremic syndrome (HUS), and caused at least 50 deaths. PMID:24349125
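
The idea of alignment-free, k-mer-based SNP discovery can be illustrated with a heavily simplified sketch: k-mers of odd length that share identical flanking bases but differ at the central base across genomes are reported as candidate SNP loci. This is not the kSNP v2 implementation; reverse complements, conflicting alleles within a single genome, and memory-efficient k-mer counting are all ignored, and the function names and toy sequences are hypothetical.

```python
from collections import defaultdict

def kmers(seq, k):
    """Set of all k-mers present in seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}

def kmer_snps(genomes, k=5):
    """Map (left flank, right flank) -> {central base: [genome names]} for variable loci."""
    if k % 2 == 0:
        raise ValueError("k must be odd so that each k-mer has a central base")
    mid = k // 2
    loci = defaultdict(lambda: defaultdict(set))
    for name, seq in genomes.items():
        for km in kmers(seq.upper(), k):
            loci[(km[:mid], km[mid + 1:])][km[mid]].add(name)
    # Keep only flank pairs observed with more than one central base.
    return {
        flanks: {base: sorted(names) for base, names in alleles.items()}
        for flanks, alleles in loci.items()
        if len(alleles) > 1
    }

# Toy genomes (hypothetical) differing by a single A/T substitution.
toy = {"g1": "ACGTACGGA", "g2": "ACGTTCGGA"}
print(kmer_snps(toy, k=5))
```

Because loci are keyed by their flanking sequence rather than by a coordinate, no reference genome or multiple sequence alignment is needed, which is the property the abstract emphasises.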

  11. When whole-genome alignments just won't work: kSNP v2 software for alignment-free SNP discovery and phylogenetics of hundreds of microbial genomes.

    PubMed

    Gardner, Shea N; Hall, Barry G

    2013-01-01

    Effective use of rapid and inexpensive whole genome sequencing for microbes requires fast, memory efficient bioinformatics tools for sequence comparison. The kSNP v2 software finds single nucleotide polymorphisms (SNPs) in whole genome data. kSNP v2 has numerous improvements over kSNP v1 including SNP gene annotation; better scaling for draft genomes available as assembled contigs or raw, unassembled reads; a tool to identify the optimal value of k; distribution of packages of executables for Linux and Mac OS X for ease of installation and user-friendly use; and a detailed User Guide. SNP discovery is based on k-mer analysis, and requires no multiple sequence alignment or the selection of a single reference genome. Most target sets with hundreds of genomes complete in minutes to hours. SNP phylogenies are built by maximum likelihood, parsimony, and distance, based on all SNPs, only core SNPs, or SNPs present in some intermediate user-specified fraction of targets. The SNP-based trees that result are consistent with known taxonomy. kSNP v2 can handle many gigabases of sequence in a single run, and if one or more annotated genomes are included in the target set, SNPs are annotated with protein coding and other information (UTRs, etc.) from Genbank file(s). We demonstrate application of kSNP v2 on sets of viral and bacterial genomes, and discuss in detail analysis of a set of 68 finished E. coli and Shigella genomes and a set of the same genomes to which have been added 47 assemblies and four "raw read" genomes of O104:H4 strains from the recent European E. coli outbreak that resulted in both bloody diarrhea and hemolytic uremic syndrome (HUS), and caused at least 50 deaths.

  12. Identification of rheumatoid arthritis biomarkers based on single nucleotide polymorphisms and haplotype blocks: A systematic review and meta-analysis

    PubMed Central

    Saad, Mohamed N.; Mabrouk, Mai S.; Eldeib, Ayman M.; Shaker, Olfat G.

    2015-01-01

    The genetics of autoimmune diseases is a rapidly progressing field that is yielding a growing number of biomarkers. The exact cause of rheumatoid arthritis (RA) is unknown, but it is thought to have both genetic and environmental bases. Genetic biomarkers could change the management of RA by allowing not only the detection of susceptible individuals, but also early diagnosis, evaluation of disease severity, selection of therapy, and monitoring of response to therapy. This review is concerned with not only the genetic biomarkers of RA but also the methods of identifying them. Many of the genetic biomarkers of RA were identified in populations of European and Asian ancestries. The study of additional human populations may yield novel results. Most researchers identifying RA biomarkers use single nucleotide polymorphism (SNP) approaches to express the significance of their results, although haplotype block methods are expected to play a complementary role in the future of that field. PMID:26843965

  13. HLA-G Haplotypes Are Differentially Associated with Asthmatic Features.

    PubMed

    Ribeyre, Camille; Carlini, Federico; René, Céline; Jordier, François; Picard, Christophe; Chiaroni, Jacques; Abi-Rached, Laurent; Gouret, Philippe; Marin, Grégory; Molinari, Nicolas; Chanez, Pascal; Paganini, Julien; Gras, Delphine; Di Cristofaro, Julie

    2018-01-01

    Human leukocyte antigen (HLA)-G, a HLA class Ib molecule, interacts with receptors on lymphocytes such as T cells, B cells, and natural killer cells to influence immune responses. Unlike classical HLA molecules, HLA-G expression is not found on all somatic cells, but restricted to tissue sites, including human bronchial epithelium cells (HBEC). Individual variation in HLA-G expression is linked to its genetic polymorphism and has been associated with many pathological situations such as asthma, which is characterized by epithelium abnormalities and inflammatory cell activation. Studies reported both higher and equivalent soluble HLA-G (sHLA-G) expression in different cohorts of asthmatic patients. In particular, we recently described impaired local expression of HLA-G and abnormal profiles for alternatively spliced isoforms in HBEC from asthmatic patients. sHLA-G dosage is challenging because of its many levels of polymorphism (dimerization, association with β2-microglobulin, and alternative splicing), thus many clinical studies focused on HLA-G single-nucleotide polymorphisms as predictive biomarkers, but few analyzed HLA-G haplotypes. Here, we aimed to characterize HLA-G haplotypes and describe their association with asthmatic clinical features and sHLA-G peripheral expression and to describe variations in transcription factor (TF) binding sites and alternative splicing sites. HLA-G haplotypes were differentially distributed in 330 healthy and 580 asthmatic individuals. Furthermore, HLA-G haplotypes were associated with asthmatic clinical features. However, we did not confirm an association between sHLA-G and genetic, biological, or clinical parameters. HLA-G haplotypes were phylogenetically split into distinct groups, with each group displaying particular variations in TF binding or RNA splicing sites that could reflect differential HLA-G qualitative or quantitative expression, with tissue-dependent specificities. Our results, based on a multicenter

  14. HLA-G Haplotypes Are Differentially Associated with Asthmatic Features

    PubMed Central

    Ribeyre, Camille; Carlini, Federico; René, Céline; Jordier, François; Picard, Christophe; Chiaroni, Jacques; Abi-Rached, Laurent; Gouret, Philippe; Marin, Grégory; Molinari, Nicolas; Chanez, Pascal; Paganini, Julien; Gras, Delphine; Di Cristofaro, Julie

    2018-01-01

    Human leukocyte antigen (HLA)-G, a HLA class Ib molecule, interacts with receptors on lymphocytes such as T cells, B cells, and natural killer cells to influence immune responses. Unlike classical HLA molecules, HLA-G expression is not found on all somatic cells, but restricted to tissue sites, including human bronchial epithelium cells (HBEC). Individual variation in HLA-G expression is linked to its genetic polymorphism and has been associated with many pathological situations such as asthma, which is characterized by epithelium abnormalities and inflammatory cell activation. Studies reported both higher and equivalent soluble HLA-G (sHLA-G) expression in different cohorts of asthmatic patients. In particular, we recently described impaired local expression of HLA-G and abnormal profiles for alternatively spliced isoforms in HBEC from asthmatic patients. sHLA-G dosage is challenging because of its many levels of polymorphism (dimerization, association with β2-microglobulin, and alternative splicing), thus many clinical studies focused on HLA-G single-nucleotide polymorphisms as predictive biomarkers, but few analyzed HLA-G haplotypes. Here, we aimed to characterize HLA-G haplotypes and describe their association with asthmatic clinical features and sHLA-G peripheral expression and to describe variations in transcription factor (TF) binding sites and alternative splicing sites. HLA-G haplotypes were differentially distributed in 330 healthy and 580 asthmatic individuals. Furthermore, HLA-G haplotypes were associated with asthmatic clinical features. However, we did not confirm an association between sHLA-G and genetic, biological, or clinical parameters. HLA-G haplotypes were phylogenetically split into distinct groups, with each group displaying particular variations in TF binding or RNA splicing sites that could reflect differential HLA-G qualitative or quantitative expression, with tissue-dependent specificities. Our results, based on a multicenter

  15. Association between SLC11A1 (NRAMP1) polymorphisms and susceptibility to tuberculosis in Chinese Holstein cattle.

    PubMed

    Liu, Kaihua; Zhang, Bin; Teng, Zhaochun; Wang, Youtao; Dong, Guodong; Xu, Cong; Qin, Bo; Song, Chunlian; Chai, Jun; Li, Yang; Shi, Xianwei; Shu, Xianghua; Zhang, Yifang

    2017-03-01

    We investigated the associations between SLC11A1 polymorphisms and susceptibility to tuberculosis (TB) in Chinese Holstein cattle, using a case-control study of 136 animals that had positive reactions to TB tests and showed symptoms and 96 animals that had negative reactions to tests and showed no symptoms. Polymerase chain reaction (PCR) sequencing and the restriction fragment length polymorphism (RFLP) technique were used to detect and determine SLC11A1 polymorphisms. Association analysis identified significant correlations between SLC11A1 polymorphisms and susceptibility/resistance to TB, and two genetic markers for SLC11A1 were established using PCR-RFLP. Sequence alignment of SLC11A1 revealed seven single-nucleotide polymorphisms (SNPs). This is the first report of MaeII PCR-RFLP markers for the SLC11A1-SNP3 site and PstI PCR-RFLP markers for the SLC11A1-SNP5 and SLC11A1-SNP6 sites in Chinese Holstein cattle. Logistic regression analysis indicated that SLC11A1-SNP1, SLC11A1-SNP3, and SLC11A1-SNP5 were significantly associated with susceptibility/resistance to TB. Two genotypes of SLC11A1-SNP3 were susceptible to TB, whereas one genotype of SLC11A1-SNP1 and two genotypes of SLC11A1-SNP5 were resistant. Haplotype analysis showed that nine haplotypes were potentially resistant to TB. After Bonferroni correction, three of the haplotypes remained significantly associated with TB resistance. SLC11A1 is a useful candidate gene related to TB in Chinese Holstein cattle. Copyright © 2016 Elsevier Ltd. All rights reserved.

  16. In silico SNP analysis of the breast cancer antigen NY-BR-1.

    PubMed

    Kosaloglu, Zeynep; Bitzer, Julia; Halama, Niels; Huang, Zhiqin; Zapatka, Marc; Schneeweiss, Andreas; Jäger, Dirk; Zörnig, Inka

    2016-11-18

    Breast cancer is one of the most common malignancies, with incidence increasing every year, and a leading cause of death among women. Although early stage breast cancer can be effectively treated, there are limited numbers of treatment options available for patients with advanced and metastatic disease. The novel breast cancer associated antigen NY-BR-1 was identified by SEREX analysis and is expressed in the majority (>70%) of breast tumors as well as metastases, in normal breast tissue, in testis and occasionally in prostate tissue. The biological function and regulation of NY-BR-1 are to date unknown. We performed an in silico analysis on the genetic variations of the NY-BR-1 gene using data available in public SNP databases and the tools SIFT, Polyphen and Provean to find possible functional SNPs. Additionally, we considered the allele frequency of the damaging SNPs found and also analyzed data from an in-house sequencing project of 55 breast cancer samples for recurring SNPs recorded in dbSNP. Over 2800 SNPs are recorded in the dbSNP and NHLBI ESP databases for the NY-BR-1 gene. Of these, 65 (2.07%) are synonymous SNPs, 191 (6.09%) are non-synonymous SNPs, and 2430 (77.48%) are noncoding intronic SNPs. As a result, 69 non-synonymous SNPs were predicted to be damaging by at least two, and 16 SNPs were predicted as damaging by all three of the tools used. The SNPs rs200639888, rs367841401 and rs377750885 were categorized as highly damaging by all three tools. Eight damaging SNPs are located in the ankyrin repeat domain (ANK), a domain known for its frequent involvement in protein-protein interactions. No distinctive features could be observed in the allele frequency of the analyzed SNPs. Considering these results we expect to gain more insights into the variations of the NY-BR-1 gene and their possible impact on giving rise to splice variants and therefore influence the function of NY-BR-1 in healthy tissue as well as in breast cancer.

  17. β3 Integrin Haplotype Influences Gene Regulation and Plasma von Willebrand Factor Activity

    PubMed Central

    Payne, Katie E; Bray, Paul F; Grant, Peter J; Carter, Angela M

    2008-01-01

    The Leu33Pro polymorphism of the gene encoding β3 integrin (ITGB3) is associated with acute coronary syndromes and influences platelet aggregation. Three common promoter polymorphisms have also been identified. The aims of this study were to (1) investigate the influence of the ITGB3 −400C/A, −425A/C and −468G/A promoter polymorphisms on reporter gene expression and nuclear protein binding and (2) determine genotype and haplotype associations with platelet αIIbβ3 receptor density. Promoter haplotypes were introduced into an ITGB3 promoter-pGL3 construct by site directed mutagenesis and luciferase reporter gene expression analysed in HEL and HMEC-1 cells. Binding of nuclear proteins was assessed by electrophoretic mobility shift assay. The association of ITGB3 haplotype with platelet αIIbβ3 receptor density was determined in 223 subjects. Species conserved motifs were identified in the ITGB3 promoter in the vicinity of the 3 polymorphisms. The GAA, GCC, AAC, AAA and ACC constructs induced ~50% increased luciferase expression relative to the GAC construct in both cell types. Haplotype analysis including Leu33Pro indicated 5 common haplotypes; no associations between ITGB3 haplotypes and receptor density were found. However, the GCC-Pro33 haplotype was associated with significantly higher vWF activity (128.6 [112.1–145.1]%) compared with all other haplotypes (107.1 [101.2–113.0]%, p=0.02). In conclusion, the GCC-Pro33 haplotype was associated with increased vWF activity but not with platelet αIIbβ3 receptor density, which may indicate ITGB3 haplotype influences endothelial function. PMID:18045606

  18. [SNP-19 genotypic variants of CAPN10 gene and its relation to diabetes mellitus type 2 in a population of Ciudad Juarez, Mexico].

    PubMed

    Loya Méndez, Yolanda; Reyes Leal, Gilberto; Sánchez González, Adriana; Portillo Reyes, Verónica; Reyes Ruvalcaba, David; Bojórquez Rangel, Guillermo

    2014-09-28

    Diabetes mellitus (DM) type 2 is a common pathology with multifactorial etiology whose exact genetic bases remain unknown. Some studies suggest that single nucleotide polymorphisms (SNPs) in the CAPN10 gene (locus 2q37.3) could be associated with the development of this disease, including the insertion/deletion polymorphism SNP-19 (2R→3R). The present study determined the association between SNP-19 and the risk of developing DM type 2 in the population of Ciudad Juarez. For this study 107 participants were selected: 43 type 2 diabetics (cases) and 64 non-diabetics with no first-degree family history of DM type 2 (controls). Anthropometric measurements were taken, as well as biochemical profiles of lipids, lipoproteins and serum glucose. SNP-19 was genotyped using DNA from peripheral blood lymphocytes, polymerase chain reaction (PCR), and electrophoretic analysis in agarose gels. Once the genotypic and allelic frequencies were obtained, the Hardy-Weinberg equilibrium test (GenAlEx 6.4) was also performed. The χ² analysis identified genotypic differences between cases and controls, with a higher frequency of the homozygous 3R genotype of SNP-19 in the case group (0.418) than in the control group (0.265). An association was also observed between the 2R/3R genotype and elevated weight, body mass index, and waist and hip circumferences, but only in the diabetic group (P ≤ 0.05). The findings of this study suggest that SNP-19 in CAPN10 may participate in the development of DM type 2 in the studied population. Copyright AULA MEDICA EDICIONES 2014. Published by AULA MEDICA. All rights reserved.
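
    The abstract above mentions a Hardy-Weinberg equilibrium test but gives no genotype counts. As a minimal sketch of how such a test is commonly computed for a biallelic locus (a chi-square goodness-of-fit with one degree of freedom), assuming nothing about how GenAlEx implements it, the example below uses purely hypothetical counts for the 2R/2R, 2R/3R and 3R/3R genotypes; the function name is also hypothetical.

```python
import math


def hwe_chi_square(n_aa, n_ab, n_bb):
    """Chi-square goodness-of-fit test for Hardy-Weinberg proportions.

    Returns (frequency of allele a, chi-square statistic, p-value with 1 df).
    """
    n = n_aa + n_ab + n_bb
    p = (2 * n_aa + n_ab) / (2 * n)  # frequency of allele a
    q = 1.0 - p
    if not 0.0 < p < 1.0:
        raise ValueError("locus is monomorphic; the test is undefined")
    expected = (p * p * n, 2 * p * q * n, q * q * n)
    observed = (n_aa, n_ab, n_bb)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    # Survival function of a chi-square variable with one degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return p, chi2, p_value


# Hypothetical counts, not taken from the study: 2R/2R, 2R/3R, 3R/3R.
freq_2r, chi2, p_value = hwe_chi_square(10, 40, 50)
print(f"freq(2R) = {freq_2r:.3f}, chi2 = {chi2:.4f}, p = {p_value:.3f}")
```

    A large p-value means the observed genotype counts are compatible with Hardy-Weinberg proportions; with small expected counts an exact test is usually preferred over the chi-square approximation.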

  19. The evolutionary history of the DMRT3 'Gait keeper' haplotype.

    PubMed

    Staiger, E A; Almén, M S; Promerová, M; Brooks, S; Cothran, E G; Imsland, F; Jäderkvist Fegraeus, K; Lindgren, G; Mehrabani Yeganeh, H; Mikko, S; Vega-Pla, J L; Tozaki, T; Rubin, C J; Andersson, L

    2017-10-01

    A previous study revealed a strong association between the DMRT3:Ser301STOP mutation in horses and alternate gaits as well as performance in harness racing. Several follow-up studies have confirmed a high frequency of the mutation in gaited horse breeds and an effect on gait quality. The aim of this study was to determine when and where the mutation arose, to identify additional potential causal mutations and to determine the coalescence time for contemporary haplotypes carrying the stop mutation. We utilized sequences from 89 horses representing 26 breeds to identify 102 SNPs encompassing the DMRT3 gene that are in strong linkage disequilibrium with the stop mutation. These 102 SNPs were genotyped in an additional 382 horses representing 72 breeds, and we identified 14 unique haplotypes. The results provided conclusive evidence that DMRT3:Ser301STOP is causal, as no other sequence polymorphisms showed an equally strong association to locomotion traits. The low sequence diversity among mutant chromosomes demonstrated that they must have diverged from a common ancestral sequence within the last 10 000 years. Thus, the mutation occurred either just before domestication or more likely some time after domestication and then spread across the world as a result of selection on locomotion traits. © 2017 Stichting International Foundation for Animal Genetics.

  20. Genetic polymorphisms in MDR1 and CYP3A4 genes in Asians and the influence of MDR1 haplotypes on cyclosporin disposition in heart transplant recipients.

    PubMed

    Chowbay, Balram; Cumaraswamy, Sivathasan; Cheung, Yin Bun; Zhou, Qingyu; Lee, Edmund J D

    2003-02-01

    Intestinal cytochrome P450 3A4 (CYP3A4) and P-glycoprotein (P-gp) both play a vital role in the metabolism of oral cyclosporine (CsA). We investigated the genetic polymorphisms in CYP3A4 (promoter region and exons 5, 7 and 9) and MDR1 (exons 12, 21 and 26) genes and the impact of these polymorphisms on the pharmacokinetics of oral CsA in stable heart transplant patients (n = 14). CYP3A4 polymorphisms were rare in the Asian population and transplant patients. Haplotype analysis revealed 12 haplotypes in the Chinese, eight in the Malays and 10 in the Indians. T-T-T was the most common haplotype in all ethnic groups. The frequency of the homozygous mutant genotype at all three loci (TT-TT-TT) was highest in the Indians (31%) compared to 19% and 15% in the Chinese and Malays, respectively. In heart transplant patients, CsA exposure (AUC(0-4 h), AUC(0-12 h) and C(max)) was high in patients with the T-T-T haplotypes compared to those with C-G-C haplotypes. These findings suggest that haplotypes rather than genotypes influence CsA disposition in transplant patients.

  1. Predictive value of interleukin-10 promoter genotypes and haplotypes in determining the susceptibility to nephropathy in type 2 diabetes patients.

    PubMed

    Mtiraoui, Nabil; Ezzidi, Intissar; Kacem, Maha; Ben Hadj Mohamed, Manel; Chaieb, Molka; Haj Jilani, Aoutef Bel; Mahjoub, Touhami; Almawi, Wassim Y

    2009-01-01

    The IL-10 promoter polymorphisms -1082G/A, -819C/T, and -592C/A have been consistently associated with type 2 diabetes (T2DM). We examined whether these polymorphism variants are also associated with progression of diabetic nephropathy (DN). These promoter variants were genotyped in 917 T2DM patients comprising 515 DN patients and 402 control patients without nephropathy (DWN), together with 748 non-diabetic control subjects. Haplotype analysis and multivariate regression analysis were employed in assessing the contribution of IL-10 haplotypes to DN risk, using genotype, clinical and biochemical profile, and their interactions as predictors of DN. Carriers of mutant -592A and -819T alleles, and -819T/T, -592A/A, and -819C/T genotypes were more frequent in T2DM. However, the -819C/T genotype appeared to be protective against DN, since lower frequencies of the -819T allele and -819C/T genotype were seen in DN patients. Regression analysis identified -1082G/-819T/-592A (GTA) and -1082G/-819T/-592C (GTC) haplotypes as DN-protective haplotypes. Relative to the -1082G/-819C/-592C haplotype, GTA [P = 0.044; odds ratio (OR) = 0.54, 95% confidence interval (CI): 0.30-0.98] and GTC (P = 0.045; OR = 0.56, 95% CI: 0.31-0.99) haplotypes were associated with decreased odds of DN, after controlling for a number of covariates (age, sex, body mass index (BMI), hypertension, glucose, HbA(1c), DN duration, total cholesterol). Our results indicate that genetic variations at the IL-10 promoter influence the risk of nephropathy in T2DM patients and thus represent a potential DN genetic-susceptibility locus worthy of replication. Copyright 2009 John Wiley & Sons, Ltd.

  2. Population distribution and ancestry of the cancer protective MDM2 SNP285 (rs117039649).

    PubMed

    Knappskog, Stian; Gansmo, Liv B; Dibirova, Khadizha; Metspalu, Andres; Cybulski, Cezary; Peterlongo, Paolo; Aaltonen, Lauri; Vatten, Lars; Romundstad, Pål; Hveem, Kristian; Devilee, Peter; Evans, Gareth D; Lin, Dongxin; Van Camp, Guy; Manolopoulos, Vangelis G; Osorio, Ana; Milani, Lili; Ozcelik, Tayfun; Zalloua, Pierre; Mouzaya, Francis; Bliznetz, Elena; Balanovska, Elena; Pocheshkova, Elvira; Kučinskas, Vaidutis; Atramentova, Lubov; Nymadawa, Pagbajabyn; Titov, Konstantin; Lavryashina, Maria; Yusupov, Yuldash; Bogdanova, Natalia; Koshel, Sergey; Zamora, Jorge; Wedge, David C; Charlesworth, Deborah; Dörk, Thilo; Balanovsky, Oleg; Lønning, Per E

    2014-09-30

    The MDM2 promoter SNP285C is located on the SNP309G allele. While SNP309G enhances Sp1 transcription factor binding and MDM2 transcription, SNP285C antagonizes Sp1 binding and reduces the risk of breast, ovarian and endometrial cancer. Assessing SNP285 and 309 genotypes across 25 different ethnic populations (>10,000 individuals), the incidence of SNP285C was 6-8% across European populations except for Finns (1.2%) and Saami (0.3%). The incidence decreased towards the Middle East and Eastern Russia, and SNP285C was absent among Han Chinese, Mongolians and African Americans. Interhaplotype variation analyses estimated SNP285C to have originated about 14,700 years ago (95% CI: 8,300 - 33,300). Both this estimate and the geographical distribution suggest SNP285C to have arisen after the separation between Caucasians and modern-day East Asians (17,000 - 40,000 years ago). We observed a strong inverse correlation (r = -0.805; p < 0.001) between the percentage of SNP309G alleles harboring SNP285C and the MAF for SNP309G itself across different populations, suggesting selection and environmental adaptation with respect to MDM2 expression in recent human evolution. In conclusion, we found SNP285C to be a pan-Caucasian variant. Ethnic variation regarding distribution of SNP285C needs to be taken into account when assessing the impact of MDM2 SNPs on cancer risk.

  3. Partitioned learning of deep Boltzmann machines for SNP data.

    PubMed

    Hess, Moritz; Lenz, Stefan; Blätte, Tamara J; Bullinger, Lars; Binder, Harald

    2017-10-15

    Learning the joint distributions of measurements, and in particular identification of an appropriate low-dimensional manifold, has been found to be a powerful ingredient of deep learning approaches. Yet, such approaches have hardly been applied to single nucleotide polymorphism (SNP) data, probably due to the high number of features typically exceeding the number of studied individuals. After a brief overview of how deep Boltzmann machines (DBMs), a deep learning approach, can be adapted to SNP data in principle, we specifically present a way to alleviate the dimensionality problem by partitioned learning. We propose a sparse regression approach to coarsely screen the joint distribution of SNPs, followed by training several DBMs on SNP partitions that were identified by the screening. Aggregate features representing SNP patterns and the corresponding SNPs are extracted from the DBMs by a combination of statistical tests and sparse regression. In simulated case-control data, we show how this can uncover complex SNP patterns and augment results from univariate approaches, while maintaining type 1 error control. Time-to-event endpoints are considered in an application with acute myeloid leukemia patients, where SNP patterns are modeled after a pre-screening based on gene expression data. The proposed approach identified three SNPs that seem to jointly influence survival in a validation dataset. This indicates the added value of jointly investigating SNPs compared to standard univariate analyses and makes partitioned learning of DBMs an interesting complementary approach when analyzing SNP data. A Julia package is provided at 'http://github.com/binderh/BoltzmannMachines.jl'. Contact: binderh@imbi.uni-freiburg.de. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  4. The effects of NAMPT haplotypes and metabolic risk factors on circulating visfatin/NAMPT levels in childhood obesity.

    PubMed

    Belo, V A; Luizon, M R; Lacchini, R; Miranda, J A; Lanna, C M M; Souza-Costa, D C; Tanus-Santos, J E

    2015-01-01

    Polymorphisms in the NAMPT gene, which encodes the adipocytokine visfatin/nicotinamide phosphoribosyltransferase (NAMPT), affect the circulating visfatin/NAMPT levels and are associated with obesity and cardiovascular diseases. However, no study has tested the hypothesis that NAMPT haplotypes could affect visfatin/NAMPT levels in childhood obesity. We investigated the effects of traditional metabolic risk factors (MRFs) and NAMPT polymorphisms T/C (rs1319501) and A/G (rs3801266) or haplotypes on visfatin/NAMPT levels in obese children and adolescents, and whether NAMPT polymorphisms and/or haplotypes are associated with susceptibility to childhood obesity. We studied 175 control, 99 obese and 82 obese with ⩾ 3 MRFs children and adolescents. Genotypes were determined by a Taqman allele discrimination assay and real-time PCR. The plasma visfatin/NAMPT level was measured using an enzyme immunoassay. Obese children and adolescents with ⩾ 3 MRFs had higher plasma visfatin/NAMPT levels in comparison with control children and adolescents (P<0.05). Although positive associations were observed between visfatin/NAMPT and body mass index (rs = 0.157; P = 0.034) as well as visfatin/NAMPT and waist circumference (rs = 0.192; P = 0.011), visfatin/NAMPT and high-density lipoprotein cholesterol were inversely associated (rs = -0.162; P = 0.031). No significant differences in genotype, allele or haplotype frequency distributions for the studied polymorphisms were found when the three groups were compared. However, higher plasma visfatin/NAMPT levels were found in control and obese subjects carrying the GG genotype for the A/G (rs3801266) polymorphism (P<0.05) but not in obese children with ⩾ 3 MRFs. Moreover, control subjects carrying the 'T-G' haplotype showed higher plasma visfatin/NAMPT levels. NAMPT genotypes or haplotypes were not associated with childhood obesity. Obesity in children with ⩾ 3 MRFs increases plasma visfatin/NAMPT levels, and this marker was

  5. Haplotype estimation using sequencing reads.

    PubMed

    Delaneau, Olivier; Howie, Bryan; Cox, Anthony J; Zagury, Jean-François; Marchini, Jonathan

    2013-10-03

    High-throughput sequencing technologies produce short sequence reads that can contain phase information if they span two or more heterozygote genotypes. This information is not routinely used by current methods that infer haplotypes from genotype data. We have extended the SHAPEIT2 method to use phase-informative sequencing reads to improve phasing accuracy. Our model incorporates the read information in a probabilistic model through base quality scores within each read. The method is primarily designed for high-coverage sequence data or data sets that already have genotypes called. One important application is phasing of single samples sequenced at high coverage for use in medical sequencing and studies of rare diseases. Our method can also use existing panels of reference haplotypes. We tested the method by using a mother-father-child trio sequenced at high-coverage by Illumina together with the low-coverage sequence data from the 1000 Genomes Project (1000GP). We found that use of phase-informative reads increases the mean distance between switch errors by 22% from 274.4 kb to 328.6 kb. We also used male chromosome X haplotypes from the 1000GP samples to simulate sequencing reads with varying insert size, read length, and base error rate. When using short 100 bp paired-end reads, we found that using mixtures of insert sizes produced the best results. When using longer reads with high error rates (5-20 kb read with 4%-15% error per base), phasing performance was substantially improved. Copyright © 2013 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
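
    The abstract above reports phasing accuracy through the distance between switch errors. As an illustration of what a switch error is, and not of the SHAPEIT2 evaluation code, the sketch below assumes that each haplotype is given as a string of 0/1 alleles at heterozygous sites only and counts the points at which the inferred haplotype flips between matching the true haplotype and matching its complement. The function name and example strings are hypothetical.

```python
def count_switch_errors(true_hap, inferred_hap):
    """Count switch errors between consecutive heterozygous sites.

    Both arguments hold the 0/1 alleles of one haplotype at heterozygous
    sites only, so the other haplotype is the bitwise complement.
    """
    if len(true_hap) != len(inferred_hap):
        raise ValueError("haplotypes must cover the same heterozygous sites")
    # True where the inferred allele matches the true haplotype,
    # False where it matches the complementary haplotype.
    agrees = [t == i for t, i in zip(true_hap, inferred_hap)]
    # A switch error occurs wherever the agreement state changes.
    return sum(1 for a, b in zip(agrees, agrees[1:]) if a != b)


truth = "0101101"
one_switch = count_switch_errors(truth, "0101010")   # phase flips once after site 4
no_switch = count_switch_errors(truth, "1010010")    # full complement, still correctly phased
print(one_switch, no_switch)
```

    Dividing the genomic span covered by the heterozygous sites by the number of switch errors gives a mean distance between switch errors of the kind quoted in the abstract.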

  6. SNPdbe: constructing an nsSNP functional impacts database.

    PubMed

    Schaefer, Christian; Meier, Alice; Rost, Burkhard; Bromberg, Yana

    2012-02-15

    Many existing databases annotate experimentally characterized single nucleotide polymorphisms (SNPs). Each non-synonymous SNP (nsSNP) changes one amino acid in the gene product (single amino acid substitution; SAAS). This change can either affect protein function or be neutral in that respect. Most polymorphisms lack experimental annotation of their functional impact. Here, we introduce SNPdbe (SNP database of effects), with predictions of computationally annotated functional impacts of SNPs. Database entries represent nsSNPs in dbSNP and the 1000 Genomes collection, as well as variants from UniProt and PMD. SAASs come from >2600 organisms, 'human' being the most prevalent. The impact of each SAAS on protein function is predicted using the SNAP and SIFT algorithms and augmented with experimentally derived function/structure information and disease associations from PMD, OMIM and UniProt. SNPdbe is consistently updated and easily augmented with new sources of information. The database is available as a MySQL dump and via a web front end that allows searches with any combination of organism names, sequences and mutation IDs. http://www.rostlab.org/services/snpdbe.

  7. Design of a High Density SNP Genotyping Assay in the Pig Using SNPs Identified and Characterized by Next Generation Sequencing Technology

    PubMed Central

    Ramos, Antonio M.; Crooijmans, Richard P. M. A.; Affara, Nabeel A.; Amaral, Andreia J.; Archibald, Alan L.; Beever, Jonathan E.; Bendixen, Christian; Churcher, Carol; Clark, Richard; Dehais, Patrick; Hansen, Mark S.; Hedegaard, Jakob; Hu, Zhi-Liang; Kerstens, Hindrik H.; Law, Andy S.; Megens, Hendrik-Jan; Milan, Denis; Nonneman, Danny J.; Rohrer, Gary A.; Rothschild, Max F.; Smith, Tim P. L.; Schnabel, Robert D.; Van Tassell, Curt P.; Taylor, Jeremy F.; Wiedmann, Ralph T.; Schook, Lawrence B.; Groenen, Martien A. M.

    2009-01-01

    Background: The dissection of complex traits of economic importance to the pig industry requires the availability of a significant number of genetic markers, such as single nucleotide polymorphisms (SNPs). This study was conducted to discover several hundreds of thousands of porcine SNPs using next generation sequencing technologies and use these SNPs, as well as others from different public sources, to design a high-density SNP genotyping assay. Methodology/Principal Findings: A total of 19 reduced representation libraries derived from four swine breeds (Duroc, Landrace, Large White, Pietrain) and a Wild Boar population and three restriction enzymes (AluI, HaeIII and MspI) were sequenced using Illumina's Genome Analyzer (GA). The SNP discovery effort resulted in the de novo identification of over 372K SNPs. More than 549K SNPs were used to design the Illumina Porcine 60K+SNP iSelect Beadchip, now commercially available as the PorcineSNP60. A total of 64,232 SNPs were included on the Beadchip. Results from genotyping the 158 individuals used for sequencing showed a high overall SNP call rate (97.5%). Of the 62,621 loci that could be reliably scored, 58,994 were polymorphic yielding a SNP conversion success rate of 94%. The average minor allele frequency (MAF) for all scorable SNPs was 0.274. Conclusions/Significance: Overall, the results of this study indicate the utility of using next generation sequencing technologies to identify large numbers of reliable SNPs. In addition, the validation of the PorcineSNP60 Beadchip demonstrated that the assay is an excellent tool that will likely be used in a variety of future studies in pigs. PMID:19654876
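
    The conversion success rate quoted above follows directly from the two locus counts in the abstract, and the minor allele frequency (MAF) is the frequency of the rarer allele at a locus. The short sketch below reproduces the first figure and shows a MAF calculation; the genotype counts passed to the function are hypothetical and the function name is an illustrative choice.

```python
# Locus counts taken from the abstract above.
scorable_loci = 62_621
polymorphic_loci = 58_994

# Conversion success rate: share of reliably scored loci that are polymorphic.
conversion_rate = polymorphic_loci / scorable_loci
print(f"conversion rate = {conversion_rate:.1%}")


def minor_allele_frequency(n_aa, n_ab, n_bb):
    """Frequency of the rarer allele at a biallelic locus, from genotype counts."""
    total_alleles = 2 * (n_aa + n_ab + n_bb)
    freq_a = (2 * n_aa + n_ab) / total_alleles
    return min(freq_a, 1.0 - freq_a)


# Hypothetical genotype counts for a single locus (not from the study).
maf = minor_allele_frequency(50, 40, 10)
print(f"MAF = {maf:.3f}")
```

    Averaging such per-locus values over all scorable SNPs would give a summary comparable to the average MAF reported in the abstract.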

  8. MGMT DNA repair gene promoter/enhancer haplotypes alter transcription factor binding and gene expression.

    PubMed

    Xu, Meixiang; Cross, Courtney E; Speidel, Jordan T; Abdel-Rahman, Sherif Z

    2016-10-01

    The O6-methylguanine-DNA methyltransferase (MGMT) protein removes O6-alkylguanine adducts from DNA. MGMT expression can thus alter the sensitivity of cells and tissues to environmental and chemotherapeutic alkylating agents. Previously, we defined the haplotype structure encompassing single nucleotide polymorphisms (SNPs) in the MGMT promoter/enhancer (P/E) region and found that haplotypes, rather than individual SNPs, alter MGMT promoter activity. The exact mechanism(s) by which these haplotypes exert their effect on MGMT promoter activity is currently unknown, but we noted that many of the SNPs comprising the MGMT P/E haplotypes are located within or in close proximity to putative transcription factor binding sites. Thus, these haplotypes could potentially affect transcription factor binding and, subsequently, alter MGMT promoter activity. In this study, we test the hypothesis that MGMT P/E haplotypes affect MGMT promoter activity by altering transcription factor (TF) binding to the P/E region. We used a promoter binding TF profiling array and a reporter assay to evaluate the effect of different P/E haplotypes on TF binding and MGMT expression, respectively. Our data revealed a significant difference in TF binding profiles between the different haplotypes evaluated. We identified TFs that consistently showed significant haplotype-dependent binding alterations (p ≤ 0.01) and revealed their role in regulating MGMT expression using siRNAs and a dual-luciferase reporter assay system. The data generated support our hypothesis that promoter haplotypes alter the binding of TFs to the MGMT P/E and, subsequently, affect their regulatory function on MGMT promoter activity and expression level.

  9. Analysis and visualization of chromosomal abnormalities in SNP data with SNPscan

    PubMed Central

    Ting, Jason C; Ye, Ying; Thomas, George H; Ruczinski, Ingo; Pevsner, Jonathan

    2006-01-01

    Background A variety of diseases are caused by chromosomal abnormalities such as aneuploidies (having an abnormal number of chromosomes), microdeletions, microduplications, and uniparental disomy. High density single nucleotide polymorphism (SNP) microarrays provide information on chromosomal copy number changes, as well as genotype (heterozygosity and homozygosity). SNP array studies generate multiple types of data for each SNP site, some with more than 100,000 SNPs represented on each array. The identification of different classes of anomalies within SNP data has been challenging. Results We have developed SNPscan, a web-accessible tool to analyze and visualize high density SNP data. It enables researchers (1) to visually and quantitatively assess the quality of user-generated SNP data relative to a benchmark data set derived from a control population, (2) to display SNP intensity and allelic call data in order to detect chromosomal copy number anomalies (duplications and deletions), (3) to display uniparental isodisomy based on loss of heterozygosity (LOH) across genomic regions, (4) to compare paired samples (e.g. tumor and normal), and (5) to generate a file type for viewing SNP data in the University of California, Santa Cruz (UCSC) Human Genome Browser. SNPscan accepts data exported from Affymetrix Copy Number Analysis Tool as its input. We validated SNPscan using data generated from patients with known deletions, duplications, and uniparental disomy. We also inspected previously generated SNP data from 90 apparently normal individuals from the Centre d'Étude du Polymorphisme Humain (CEPH) collection, and identified three cases of uniparental isodisomy, four females having an apparently mosaic X chromosome, two mislabelled SNP data sets, and one microdeletion on chromosome 2 with mosaicism from an apparently normal female. These previously unrecognized abnormalities were all detected using SNPscan. The microdeletion was independently confirmed by

  10. Haplotype Variation of Flowering Time Genes of Sugar Beet and Its Wild Relatives and the Impact on Life Cycle Regimes.

    PubMed

    Höft, Nadine; Dally, Nadine; Hasler, Mario; Jung, Christian

    2017-01-01

    The species Beta vulgaris encompasses wild and cultivated members with a broad range of phenological development. The annual life cycle is commonly found in sea beets (ssp. maritima) from Mediterranean environments which germinate, bolt, and flower within one season under long day conditions. Biennials such as the cultivated sugar beet (B. vulgaris ssp. vulgaris) as well as sea beets from northern latitudes require prolonged exposure to cold temperature over winter to acquire floral competence. Sugar beet is mainly cultivated for sugar production in Europe and is likely to have originated from sea beet. Flowering time strongly affects seed yield and yield potential and is thus a trait of high agronomic relevance. Besides environmental cues, there are complex genetic networks known to impact life cycle switch in flowering plants. In sugar beet, BTC1, BvBBX19, BvFT1, and BvFT2 are major flowering time regulators. In this study, we phenotyped plants from a diversity Beta panel encompassing cultivated and wild species from different geographical origins. Plants were grown under different day length regimes with and without vernalization. Haplotype analysis of BTC1, BvBBX19, BvFT1, and BvFT2 was performed to identify natural diversity of these genes and their impact on flowering. We found that accessions from northern latitudes flowered significantly later than those from southern latitudes. Some plants did not flower at all, indicating a strong impact of latitude of origin on life cycle. Haplotype analysis revealed a high conservation of the CCT-, REC-, BBX-, and PEBP-domains with regard to SNP occurrence. We identified sequence variation which may impact life cycle adaptation in beet. Our data endorse the importance of BTC1 in the domestication process of cultivated beets and contribute to the understanding of distribution and adaptation of Beta species to different life cycle regimes in response to different environments.
Moreover, our data provide a resource for

  11. Development and Evaluation of a 9K SNP Array for Peach by Internationally Coordinated SNP Detection and Validation in Breeding Germplasm

    PubMed Central

    Scalabrin, Simone; Gilmore, Barbara; Lawley, Cynthia T.; Gasic, Ksenija; Micheletti, Diego; Rosyara, Umesh R.; Cattonaro, Federica; Vendramin, Elisa; Main, Dorrie; Aramini, Valeria; Blas, Andrea L.; Mockler, Todd C.; Bryant, Douglas W.; Wilhelm, Larry; Troggio, Michela; Sosinski, Bryon; Aranzana, Maria José; Arús, Pere; Iezzoni, Amy; Morgante, Michele; Peace, Cameron

    2012-01-01

    Although a large number of single nucleotide polymorphism (SNP) markers covering the entire genome are needed to enable molecular breeding efforts such as genome wide association studies, fine mapping, genomic selection and marker-assisted selection in peach [Prunus persica (L.) Batsch] and related Prunus species, only a limited number of genetic markers, including simple sequence repeats (SSRs), have been available to date. To address this need, an international consortium (The International Peach SNP Consortium; IPSC) has pursued a coordinated effort to perform genome-scale SNP discovery in peach using next generation sequencing platforms to develop and characterize a high-throughput Illumina Infinium® SNP genotyping array platform. We performed whole genome re-sequencing of 56 peach breeding accessions using the Illumina and Roche/454 sequencing technologies. Polymorphism detection algorithms identified a total of 1,022,354 SNPs. Validation with the Illumina GoldenGate® assay was performed on a subset of the predicted SNPs, verifying ∼75% of genic (exonic and intronic) SNPs, whereas only about a third of intergenic SNPs were verified. Conservative filtering was applied to arrive at a set of 8,144 SNPs that were included on the IPSC peach SNP array v1, distributed over all eight peach chromosomes with an average spacing of 26.7 kb between SNPs. Use of this platform to screen a total of 709 accessions of peach in two separate evaluation panels identified a total of 6,869 (84.3%) polymorphic SNPs. The almost 7,000 SNPs verified as polymorphic through extensive empirical evaluation represent an excellent source of markers for future studies in genetic relatedness, genetic mapping, and dissecting the genetic architecture of complex agricultural traits. The IPSC peach SNP array v1 is commercially available and we expect that it will be used worldwide for genetic studies in peach and related stone fruit and nut species. PMID:22536421

  12. Single nucleotide polymorphism coverage and inference of N-acetyltransferase-2 acetylator phenotypes in worldwide population groups.

    PubMed

    Suarez-Kurtz, Guilherme; Fuchshuber-Moraes, Mateus; Struchiner, Claudio J; Parra, Esteban J

    2016-08-01

    Several algorithms have been proposed to reduce the genotyping effort and cost, while retaining the accuracy of N-acetyltransferase-2 (NAT2) phenotype prediction. Data from the 1000 Genomes (1KG) project and an admixed cohort of Black Brazilians were used to assess the accuracy of NAT2 phenotype prediction using algorithms based on paired single nucleotide polymorphisms (SNPs) (rs1041983 and rs1801280) or a tag SNP (rs1495741). NAT2 haplotypes comprising SNPs rs1801279, rs1041983, rs1801280, rs1799929, rs1799930, rs1208 and rs1799931 were assigned according to the arylamine N-acetyltransferases database. Contingency tables were used to visualize the agreement between the NAT2 acetylator phenotypes on the basis of these haplotypes versus phenotypes inferred by the prediction algorithms. The paired and tag SNP algorithms provided more than 96% agreement with the 7-SNP derived phenotypes in Europeans, East Asians, South Asians and Admixed Americans, but discordance of phenotype prediction occurred in 30.2 and 24.8% of 1KG Africans and in 14.4 and 18.6% of Black Brazilians, respectively. Paired SNP panel misclassification occurs in carriers of NAT2 haplotypes *13A (282T alone), *12B (282T and 803G), *6B (590A alone) and *14A (191A alone), whereas haplotype *14, defined by the 191A allele, is the major culprit of misclassification by the tag allele. Both the paired SNP and the tag SNP algorithms may be used, with economy of scale, to infer NAT2 acetylator phenotypes, including the ultra-slow phenotype, in European, East Asian, South Asian and American populations represented in the 1KG cohort. Both algorithms, however, perform poorly in populations of predominant African descent, including admixed African-Americans, African Caribbeans and Black Brazilians.

  13. Association of single-nucleotide polymorphisms of the tau gene with late-onset Parkinson disease.

    PubMed

    Martin, E R; Scott, W K; Nance, M A; Watts, R L; Hubble, J P; Koller, W C; Lyons, K; Pahwa, R; Stern, M B; Colcher, A; Hiner, B C; Jankovic, J; Ondo, W G; Allen, F H; Goetz, C G; Small, G W; Masterman, D; Mastaglia, F; Laing, N G; Stajich, J M; Ribble, R C; Booze, M W; Rogala, A; Hauser, M A; Zhang, F; Gibson, R A; Middleton, L T; Roses, A D; Haines, J L; Scott, B L; Pericak-Vance, M A; Vance, J M

    2001-11-14

    The human tau gene, which promotes assembly of neuronal microtubules, has been associated with several rare neurologic diseases that clinically include parkinsonian features. We recently observed linkage in idiopathic Parkinson disease (PD) to a region on chromosome 17q21 that contains the tau gene. These factors make tau a good candidate for investigation as a susceptibility gene for idiopathic PD, the most common form of the disease. The objective was to investigate whether the tau gene is involved in idiopathic PD. Among a sample of 1056 individuals from 235 families selected from 13 clinical centers in the United States and Australia and from a family ascertainment core center, we tested 5 single-nucleotide polymorphisms (SNPs) within the tau gene for association with PD, using family-based tests of association. Both affected (n = 426) and unaffected (n = 579) family members were included; 51 individuals had unclear PD status. Analyses were conducted to test individual SNPs and SNP haplotypes within the tau gene. The main outcome measure was family-based tests of association, calculated using asymptotic distributions. Analysis of association between the SNPs and PD yielded significant evidence of association for 3 of the 5 SNPs tested: SNP 3, P = .03; SNP 9i, P = .04; and SNP 11, P = .04. The 2 other SNPs did not show evidence of significant association (SNP 9ii, P = .11, and SNP 9iii, P = .87). Strong evidence of association was found with haplotype analysis, with a positive association with one haplotype (P = .009) and a negative association with another haplotype (P = .007). Substantial linkage disequilibrium (P < .001) was detected between 4 of the 5 SNPs (SNPs 3, 9i, 9ii, and 11). This integrated approach of genetic linkage and positional association analyses implicates tau as a susceptibility gene for idiopathic PD.

  14. Association of Single-Nucleotide Polymorphisms of the Tau Gene With Late-Onset Parkinson Disease

    PubMed Central

    Martin, Eden R.; Scott, William K.; Nance, Martha A.; Watts, Ray L.; Hubble, Jean P.; Koller, William C.; Lyons, Kelly; Pahwa, Rajesh; Stern, Matthew B.; Colcher, Amy; Hiner, Bradley C.; Jankovic, Joseph; Ondo, William G.; Allen, Fred H.; Goetz, Christopher G.; Small, Gary W.; Masterman, Donna; Mastaglia, Frank; Laing, Nigel G.; Stajich, Jeffrey M.; Ribble, Robert C.; Booze, Michael W.; Rogala, Allison; Hauser, Michael A.; Zhang, Fengyu; Gibson, Rachel A.; Middleton, Lefkos T.; Roses, Allen D.; Haines, Jonathan L.; Scott, Burton L.; Pericak-Vance, Margaret A.; Vance, Jeffery M.

    2013-01-01

    Context The human tau gene, which promotes assembly of neuronal microtubules, has been associated with several rare neurologic diseases that clinically include parkinsonian features. We recently observed linkage in idiopathic Parkinson disease (PD) to a region on chromosome 17q21 that contains the tau gene. These factors make tau a good candidate for investigation as a susceptibility gene for idiopathic PD, the most common form of the disease. Objective To investigate whether the tau gene is involved in idiopathic PD. Design, Setting, and Participants Among a sample of 1056 individuals from 235 families selected from 13 clinical centers in the United States and Australia and from a family ascertainment core center, we tested 5 single-nucleotide polymorphisms (SNPs) within the tau gene for association with PD, using family-based tests of association. Both affected (n = 426) and unaffected (n = 579) family members were included; 51 individuals had unclear PD status. Analyses were conducted to test individual SNPs and SNP haplotypes within the tau gene. Main Outcome Measure Family-based tests of association, calculated using asymptotic distributions. Results Analysis of association between the SNPs and PD yielded significant evidence of association for 3 of the 5 SNPs tested: SNP 3, P = .03; SNP 9i, P = .04; and SNP 11, P = .04. The 2 other SNPs did not show evidence of significant association (SNP 9ii, P = .11, and SNP 9iii, P = .87). Strong evidence of association was found with haplotype analysis, with a positive association with one haplotype (P = .009) and a negative association with another haplotype (P = .007). Substantial linkage disequilibrium (P < .001) was detected between 4 of the 5 SNPs (SNPs 3, 9i, 9ii, and 11). Conclusions This integrated approach of genetic linkage and positional association analyses implicates tau as a susceptibility gene for idiopathic PD. PMID:11710889

  15. Optimized Next-Generation Sequencing Genotype-Haplotype Calling for Genome Variability Analysis

    PubMed Central

    Navarro, Javier; Nevado, Bruno; Hernández, Porfidio; Vera, Gonzalo; Ramos-Onsins, Sebastián E

    2017-01-01

    The accurate estimation of nucleotide variability using next-generation sequencing data is challenged by the high number of sequencing errors produced by new sequencing technologies, especially for nonmodel species, where reference sequences may not be available and the read depth may be low due to limited budgets. The most popular single-nucleotide polymorphism (SNP) callers are designed to obtain a high SNP recovery and low false discovery rate but are not designed to account appropriately for the frequency of the variants. Instead, algorithms designed to account for the frequency of SNPs give precise results for estimating the levels and the patterns of variability. These algorithms are focused on the unbiased estimation of the variability and not on the high recovery of SNPs. Here, we implemented a fast and optimized parallel algorithm that includes the method developed by Roesti et al. and Lynch, which estimates the genotype of each individual at each site, considering the possibility to call both bases from the genotype, a single one or none. This algorithm does not consider the reference and therefore is independent of biases related to the reference nucleotide specified. The pipeline starts from a BAM file converted to pileup or mpileup format and the software outputs a FASTA file. The new program not only reduces the running times but also, given the improved use of resources, it allows its usage with smaller computers and large parallel computers, expanding its benefits to a wider range of researchers. The output file can be analyzed using software for population genetics analysis, such as the R library PopGenome, the software VariScan, and the program mstatspop for analysis considering positions with missing data. PMID:28894353
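The idea of calling both bases, a single base, or none at a site can be sketched as below. This is not the Roesti et al. or Lynch method and not the program described in the abstract; it is a simplified, reference-free likelihood-ratio rule written for illustration. The function name `call_site`, the error rate, the depth cutoff and the likelihood-ratio threshold are all assumed values.

```python
import math

BASES = "ACGT"


def call_site(counts, error=0.01, min_depth=2, min_ratio=10.0):
    """Call a diploid genotype at one site from per-base read counts.

    `counts` maps a base to its read count, as could be tallied from one
    pileup column. Returns a two-character string: "AG" (both bases called),
    "AA" (homozygote called), "AN" (one base called, the other left missing)
    or "NN" (nothing called). All thresholds are illustrative assumptions.
    """
    depth = sum(counts.get(b, 0) for b in BASES)
    if depth < min_depth:
        return "NN"
    # Rank bases by read count; ties are broken alphabetically.
    ranked = sorted(BASES, key=lambda b: (-counts.get(b, 0), b))
    b1, b2 = ranked[0], ranked[1]
    n1, n2 = counts.get(b1, 0), counts.get(b2, 0)
    p_match = 1.0 - error            # read shows the true base
    p_err = error / 3.0              # read shows one specific wrong base
    p_het = 0.5 * p_match + 0.5 * p_err
    # Log-likelihood of the reads under homozygote b1b1 and heterozygote b1b2.
    ll_hom = n1 * math.log(p_match) + (depth - n1) * math.log(p_err)
    ll_het = (n1 + n2) * math.log(p_het) + (depth - n1 - n2) * math.log(p_err)
    threshold = math.log(min_ratio)
    if ll_het - ll_hom > threshold:
        return "".join(sorted((b1, b2)))   # both bases called
    if ll_hom - ll_het > threshold:
        return b1 + b1                     # confident homozygote
    return b1 + "N"                        # only one base can be called


print(call_site({"A": 5, "G": 5}), call_site({"A": 3}), call_site({"A": 1}))
```

No reference base enters the decision, which mirrors the reference-independence described in the abstract; the three-way outcome is what allows low-depth sites to contribute one allele instead of being discarded.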

  16. Molecular Characterization of Bovine SMO Gene and Effects of Its Genetic Variations on Body Size Traits in Qinchuan Cattle (Bos taurus).

    PubMed

    Zhang, Ya-Ran; Gui, Lin-Sheng; Li, Yao-Kun; Jiang, Bi-Jie; Wang, Hong-Cheng; Zhang, Ying-Ying; Zan, Lin-Sen

    2015-07-27

    The Smoothened (Smo)-mediated Hedgehog (Hh) signaling pathway governs the patterning, morphogenesis and growth of many different regions within animal body plans. This study evaluated the effects of genetic variations of the bovine SMO gene on economically important body size traits in Chinese Qinchuan cattle. Altogether, eight single nucleotide polymorphisms (SNPs: 1-8) were identified and genotyped via direct sequencing covering most of the coding region and 3'UTR of the bovine SMO gene. Both the p.698Ser.>Ser. synonymous mutation resulting from SNP1 and the p.700Ser.>Pro. non-synonymous mutation caused by SNP2 mapped to the intracellular C-terminal tail of bovine Smo protein; the other six SNPs were non-coding variants located in the 3'UTR. The linkage disequilibrium was analyzed, and five haplotypes were discovered in 520 Qinchuan cattle. Association analyses showed that SNP2, SNP3/5, SNP4 and SNP6/7 were significantly associated with some body size traits (p < 0.05), whereas SNP1/8 were not (p > 0.05). Meanwhile, cattle with the wild-type combined haplotype Hap1/Hap1 had significantly (p < 0.05) greater body length than those with Hap2/Hap2. Our results indicate that variations in the SMO gene could affect body size traits of Qinchuan cattle, and the wild-type haplotype Hap1 together with the wild-type alleles of these detected SNPs in the SMO gene could be used to breed cattle with superior body size traits. Therefore, our results could be helpful for marker-assisted selection in beef cattle breeding programs.

  17. SNP-Based Typing: A Useful Tool to Study Bordetella pertussis Populations

    PubMed Central

    van der Heide, Han G. J.; Heuvelman, Kees J.; Kallonen, Teemu; He, Qiushui; Mertsola, Jussi; Advani, Abdolreza; Hallander, Hans O.; Janssens, Koen; Hermans, Peter W.; Mooi, Frits R.

    2011-01-01

    To monitor changes in Bordetella pertussis populations, two typing methods are mainly used: Pulsed-Field Gel Electrophoresis (PFGE) and Multiple-Locus Variable-Number Tandem Repeat Analysis (MLVA). In this study, a single nucleotide polymorphism (SNP) typing method, based on 87 SNPs, was developed and compared with PFGE and MLVA. The discriminatory indices of SNP typing, PFGE and MLVA were found to be 0.85, 0.95 and 0.83, respectively. Phylogenetic analysis, using SNP typing as Gold Standard, revealed false homoplasies in the PFGE and MLVA trees. Further, in contrast to the SNP-based tree, the PFGE- and MLVA-based trees did not reveal a positive correlation between root-to-tip distance and the isolation year of strains. Thus PFGE and MLVA do not allow an estimation of the relative age of the selected strains. In conclusion, SNP typing was found to be phylogenetically more informative than PFGE and more discriminative than MLVA. Further, in contrast to PFGE, it is readily standardized allowing interlaboratory comparisons. We applied SNP typing to study strains with a novel allele for the pertussis toxin promoter, ptxP3, which have a worldwide distribution and which have replaced the resident ptxP1 strains in the last 20 years. Previously, we showed that ptxP3 strains showed increased pertussis toxin expression and that their emergence was associated with increased notification in the Netherlands. SNP typing showed that the ptxP3 strains isolated in the Americas, Asia, Australia and Europe formed a monophyletic branch which recently diverged from ptxP1 strains. Two predominant ptxP3 SNP types were identified which spread worldwide. The widespread use of SNP typing will enhance our understanding of the evolution and global epidemiology of B. pertussis. PMID:21647370
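The abstract does not say how the discriminatory indices were computed. A common choice for typing methods is Simpson's index of diversity in the Hunter-Gaston form, D = 1 - Σ n_j(n_j - 1) / (N(N - 1)), where n_j is the number of isolates assigned to type j and N is the total number of isolates. The sketch below assumes that definition; the function name and the toy labels are hypothetical.

```python
from collections import Counter


def discriminatory_index(type_labels):
    """Hunter-Gaston discriminatory index for a list of type assignments.

    `type_labels` holds one label per isolate (for example a SNP type, a PFGE
    profile or an MLVA type). The result is the probability that two isolates
    drawn at random without replacement fall into different types.
    """
    n_isolates = len(type_labels)
    if n_isolates < 2:
        raise ValueError("at least two isolates are needed")
    type_sizes = Counter(type_labels).values()
    same_type_pairs = sum(size * (size - 1) for size in type_sizes)
    return 1.0 - same_type_pairs / (n_isolates * (n_isolates - 1))


# Toy example with made-up labels: four isolates, three types.
print(round(discriminatory_index(["T1", "T1", "T2", "T3"]), 3))
```

An index near 1 means almost every isolate gets its own type, while an index of 0 means the method cannot separate any isolates.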

  18. Admixture patterns and genetic differentiation in negrito groups from West Malaysia estimated from genome-wide SNP data.

    PubMed

    Jinam, Timothy A; Phipps, Maude E; Saitou, Naruya

    2013-01-01

    Southeast Asia houses various culturally and linguistically diverse ethnic groups. In Malaysia, where the Malay, Chinese, and Indian ethnic groups form the majority, there exist minority groups such as the "negritos" who are believed to be descendants of the earliest settlers of Southeast Asia. Here we report patterns of genetic substructure and admixture in two Malaysian negrito populations (Jehai and Kensiu), using ~50,000 genome-wide single-nucleotide polymorphism (SNP) data. We found traces of recent admixture in both the negrito populations, particularly in the Jehai, with the Malay through principal component analysis and STRUCTURE analysis software, which suggested that the admixture was as recent as one generation ago. We also identified significantly differentiated nonsynonymous SNPs and haplotype blocks related to intracellular transport, metabolic processes, and detection of stimulus. These results highlight the different levels of admixture experienced by the two Malaysian negritos. Delineating admixture and differentiated genomic regions should be of importance in designing and interpretation of molecular anthropology and disease association studies. Copyright © 2013 Wayne State University Press, Detroit, Michigan 48201-1309.

  19. Novel SNP markers in InvGE and SssI genes are associated with natural variation of sugar contents and frying color in Solanum tuberosum Group Phureja.

    PubMed

    Duarte-Delgado, Diana; Juyó, Deissy; Gebhardt, Christiane; Sarmiento, Felipe; Mosquera-Vásquez, Teresa

    2017-03-09

    Potato frying color is an agronomic trait influenced by the sugar content of tubers. The candidate gene approach was employed to elucidate the molecular basis of this trait in Solanum tuberosum Group Phureja, which is mainly diploid and represents an important genetic resource for potato breeding. The objective of this research was to identify novel genetic variants related with frying quality in loci with key functions in carbohydrate metabolism, with the purpose of discovering genetic variability useful in breeding programs. Therefore, an association analysis was implemented with 109 SNP markers identified in ten candidate genes. The analyses revealed four associations in the locus InvGE coding for an apoplastic invertase and one association in the locus SssI coding for a soluble starch synthase. The SNPs SssI-C 45711901 T and InvGE-C 2475454 T were associated with sucrose content and frying color, respectively, and were not found previously in tetraploid genotypes. The rare haplotype InvGE-A 2475187 C 2475295 A 2475344 was associated with higher fructose contents. Our study allowed a more detailed analysis of the sequence variation of exon 3 from InvGE, which was not possible in previous studies because of the high frequency of insertion-deletion polymorphisms in tetraploid potatoes. The association mapping strategy using a candidate gene approach in Group Phureja allowed the identification of novel SNP markers in InvGE and SssI associated with frying color and the tuber sugar content measured by High Performance Liquid Chromatography (HPLC). These novel associations might be useful in potato breeding programs for improving quality traits and to increase crop genetic variability. The results suggest that some genes involved in the natural variation of tuber sugar content and frying color are conserved in both Phureja and tetraploid germplasm. Nevertheless, the associated variants in both types of germplasm were present in different regions of these genes. 
This

  20. Determination of βS haplotypes in patients with sickle-cell anemia in the state of Rio Grande do Norte, Brazil

    PubMed Central

    Cabral, Cynthia Hatsue Kitayama; Serafim, Édvis Santos Soares; de Medeiros, Waleska Rayane Dantas Bezerra; de Medeiros Fernandes, Thales Allyrio Araújo; Kimura, Elza Miyuki; Costa, Fernando Ferreira; de Fátima Sonati, Maria; Rebecchi, Ivanise Marina Moretti; de Medeiros, Tereza Maria Dantas

    2011-01-01

    βS haplotypes were studied in 47 non-related patients with sickle-cell anemia from the state of Rio Grande do Norte, Brazil. Molecular analysis was conducted by PCR/RFLP using restriction endonucleases XmnI, HindIII, HincII and HinfI to analyze six polymorphic sites from the beta cluster. Twenty-seven patients (57.5%) were identified with genotype CAR/CAR, 9 (19.1%) CAR/BEN, 6 (12.8%) CAR/CAM, 1 (2.1%) BEN/BEN, 2 (4.3%) CAR/Atp, 1 (2.1%) BEN/Atp and 1 (2.1%) with genotype Atp/Atp. The greater frequency of Cameroon haplotypes compared to other Brazilian states suggests the existence of a peculiarity of African origin in the state of Rio Grande do Norte. PMID:21931513
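Haplotype (chromosome-level) frequencies can be derived from the genotype counts listed in this abstract by counting each patient's two haplotypes. The sketch below does only that arithmetic; the function name and data layout are assumptions, and the resulting frequencies are a derivation from the quoted counts, not values reported in the abstract.

```python
from collections import Counter

# Genotype counts as given in the abstract (47 patients in total).
GENOTYPE_COUNTS = {
    ("CAR", "CAR"): 27,
    ("CAR", "BEN"): 9,
    ("CAR", "CAM"): 6,
    ("BEN", "BEN"): 1,
    ("CAR", "Atp"): 2,
    ("BEN", "Atp"): 1,
    ("Atp", "Atp"): 1,
}


def haplotype_frequencies(genotype_counts):
    """Convert diploid genotype counts into haplotype frequencies.

    Each patient contributes two chromosomes, so a homozygote adds two
    copies of the same haplotype and a heterozygote adds one of each.
    """
    chromosomes = Counter()
    for (hap1, hap2), n_patients in genotype_counts.items():
        chromosomes[hap1] += n_patients
        chromosomes[hap2] += n_patients
    total = sum(chromosomes.values())
    return {hap: count / total for hap, count in chromosomes.items()}


for hap, freq in sorted(haplotype_frequencies(GENOTYPE_COUNTS).items()):
    print(f"{hap}: {freq:.1%}")
```

The 47 patients correspond to 94 chromosomes, so every frequency is a count out of 94.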