Sample records for haplotype assembly problem

  1. Sparse Tensor Decomposition for Haplotype Assembly of Diploids and Polyploids.

    PubMed

    Hashemi, Abolfazl; Zhu, Banghua; Vikalo, Haris

    2018-03-21

    Haplotype assembly is the task of reconstructing haplotypes of an individual from a mixture of sequenced chromosome fragments. Haplotype information enables studies of the effects of genetic variations on an organism's phenotype. Most of the mathematical formulations of haplotype assembly are known to be NP-hard and haplotype assembly becomes even more challenging as the sequencing technology advances and the length of the paired-end reads and inserts increases. Assembly of haplotypes of polyploid organisms is considerably more difficult than in the case of diploids. Hence, scalable and accurate schemes with provable performance are desired for haplotype assembly of both diploid and polyploid organisms. We propose a framework that formulates haplotype assembly from sequencing data as a sparse tensor decomposition. We cast the problem as that of decomposing a tensor having special structural constraints and missing a large fraction of its entries into a product of two factors, U and [Formula: see text]; tensor [Formula: see text] reveals haplotype information while U is a sparse matrix encoding the origin of erroneous sequencing reads. An algorithm, AltHap, which reconstructs haplotypes of either diploid or polyploid organisms by iteratively solving this decomposition problem is proposed. The performance and convergence properties of AltHap are theoretically analyzed and, in doing so, guarantees on the achievable minimum error correction scores and correct phasing rate are established. The developed framework is applicable to diploid, biallelic and polyallelic polyploid species. The code for AltHap is freely available from https://github.com/realabolfazl/AltHap. AltHap was tested in a number of different scenarios and was shown to compare favorably to state-of-the-art methods in applications to haplotype assembly of diploids, and to significantly outperform existing techniques when applied to haplotype assembly of polyploids.

  2. Haplotype assembly in polyploid genomes and identical by descent shared tracts.

    PubMed

    Aguiar, Derek; Istrail, Sorin

    2013-07-01

    Genome-wide haplotype reconstruction from sequence data, or haplotype assembly, is at the center of major challenges in molecular biology and life sciences. For complex eukaryotic organisms like humans, the genome is vast and the population samples are growing so rapidly that algorithms processing high-throughput sequencing data must scale favorably in terms of both accuracy and computational efficiency. Furthermore, current models and methodologies for haplotype assembly (i) do not consider individuals sharing haplotypes jointly, which reduces the size and accuracy of assembled haplotypes, and (ii) are unable to model genomes having more than two sets of homologous chromosomes (polyploidy). Polyploid organisms are increasingly becoming the target of many research groups interested in the genomics of disease, phylogenetics, botany and evolution but there is an absence of theory and methods for polyploid haplotype reconstruction. In this work, we present a number of results, extensions and generalizations of compass graphs and our HapCompass framework. We prove the theoretical complexity of two haplotype assembly optimizations, thereby motivating the use of heuristics. Furthermore, we present graph theory-based algorithms for the problem of haplotype assembly using our previously developed HapCompass framework for (i) novel implementations of haplotype assembly optimizations (minimum error correction), (ii) assembly of a pair of individuals sharing a haplotype tract identical by descent and (iii) assembly of polyploid genomes. We evaluate our methods on 1000 Genomes Project, Pacific Biosciences and simulated sequence data. HapCompass is available for download at http://www.brown.edu/Research/Istrail_Lab/. Supplementary data are available at Bioinformatics online.

  3. Better ILP models for haplotype assembly.

    PubMed

    Etemadi, Maryam; Bagherian, Mehri; Chen, Zhi-Zhong; Wang, Lusheng

    2018-02-19

    The haplotype assembly problem for diploids is to find a pair of haplotypes from a given set of aligned Single Nucleotide Polymorphism (SNP) fragments (reads). It has many applications in association studies, drug design, and genetic research. Since this problem is computationally hard, both heuristic and exact algorithms have been designed for it. Although exact algorithms are much slower, they are still of great interest because they usually output significantly better solutions than heuristic algorithms in terms of popular measures such as the Minimum Error Correction (MEC) score, the number of switch errors, and the QAN50 score. Exact algorithms are also valuable because they can be used to assess how good a heuristic algorithm is. The best known exact algorithm is based on integer linear programming (ILP) and it is known that ILP can also be used to improve the output quality of every heuristic algorithm with a little decline in speed. Therefore, faster ILP models for the problem are in high demand. As in previous studies, we consider not only the general case of the problem but also its all-heterozygous case where we assume that if a column of the input read matrix contains at least one 0 and one 1, then it corresponds to a heterozygous SNP site. For both cases, we design new ILP models for the haplotype assembly problem which aim at minimizing the MEC score. The new models are theoretically better because they contain significantly fewer constraints. More importantly, our experimental results show that for both simulated and real datasets, the new model for the all-heterozygous (respectively, general) case can usually be solved via CPLEX (an ILP solver) at least 5 times (respectively, twice) faster than the best previous models. Indeed, the running time can sometimes be 41 times better. This paper proposes a new ILP model for the haplotype assembly problem and its all-heterozygous case, respectively. Experiments with both real and simulated datasets show that the new models can be solved by CPLEX in much less time than the best previous models. We believe that the models can be used to improve heuristic algorithms as well.
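
    To make the MEC objective in the record above concrete, the following is a minimal Python sketch, not the ILP model from the paper. It assumes reads are encoded as strings over '0', '1' and '-' (a hypothetical encoding, with '-' for an uncovered SNP site) and solves the all-heterozygous case by exhaustive search, which is only feasible for a handful of sites.

```python
from itertools import product


def mec_score(reads, h1, h2):
    """MEC score of a haplotype pair: each read is charged the number of
    mismatches against whichever haplotype it fits better."""
    def mismatches(read, hap):
        # '-' marks a site the read does not cover, so it never mismatches.
        return sum(1 for r, h in zip(read, hap) if r != '-' and r != h)

    return sum(min(mismatches(r, h1), mismatches(r, h2)) for r in reads)


def best_all_heterozygous(reads):
    """Exhaustive search in the all-heterozygous case, where the second
    haplotype is the bitwise complement of the first (illustrative only)."""
    n_sites = len(reads[0])
    best = None
    for bits in product('01', repeat=n_sites):
        h1 = ''.join(bits)
        h2 = ''.join('1' if b == '0' else '0' for b in bits)
        score = mec_score(reads, h1, h2)
        if best is None or score < best[0]:
            best = (score, h1, h2)
    return best


# Hypothetical toy read matrix: five reads over four SNP sites.
reads = ["010-", "0101", "101-", "-010", "1011"]
score, h1, h2 = best_all_heterozygous(reads)
```

    The search visits 2^n candidate haplotypes for n sites, which is why exact methods in practice rely on ILP solvers or problem decomposition rather than enumeration.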

  4. Exact algorithms for haplotype assembly from whole-genome sequence data.

    PubMed

    Chen, Zhi-Zhong; Deng, Fei; Wang, Lusheng

    2013-08-15

    Haplotypes play a crucial role in genetic analysis and have many applications such as gene disease diagnoses, association studies, ancestry inference and so forth. The development of DNA sequencing technologies makes it possible to obtain haplotypes from a set of aligned reads originating from both copies of a chromosome of a single individual. This approach is often known as haplotype assembly. Exact algorithms that can give optimal solutions to the haplotype assembly problem are in high demand. Unfortunately, previous algorithms for this problem either fail to output optimal solutions or take too long even when executed on a PC cluster. We develop an approach to finding optimal solutions for the haplotype assembly problem under the minimum-error-correction (MEC) model. Most of the previous approaches assume that the columns in the input matrix correspond to (putative) heterozygous sites. This all-heterozygous assumption is correct for most columns, but it may be incorrect for a small number of columns. In this article, we consider the MEC model with or without the all-heterozygous assumption. In our approach, we first use new methods to decompose the input read matrix into small independent blocks and then model the problem for each block as an integer linear programming problem, which is then solved by an integer linear programming solver. We have tested our program on a single PC [a Linux (x64) desktop PC with i7-3960X CPU], using the filtered HuRef and the NA12878 datasets (after applying some variant calling methods). With the all-heterozygous assumption, our approach can optimally solve the whole HuRef dataset within a total time of 31 h (26 h for the most difficult block of the 15th chromosome and only 5 h for the other blocks). To our knowledge, this is the first time that MEC optimal solutions are completely obtained for the filtered HuRef dataset. Moreover, in the general case (without the all-heterozygous assumption), for the HuRef dataset our approach can optimally solve all the chromosomes except the most difficult block in chromosome 15 within a total time of 12 days. For both the HuRef and NA12878 datasets, the optimal costs in the general case are sometimes much smaller than those in the all-heterozygous case. This implies that some columns in the input matrix (after applying certain variant calling methods) still correspond to false-heterozygous sites. Our program and the optimal solutions found for the HuRef dataset are available at http://rnc.r.dendai.ac.jp/hapAssembly.html.
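
    The idea of splitting the read matrix into independent blocks can be illustrated with a short Python sketch. This shows only the standard connected-components decomposition (two SNP columns belong to the same block when some chain of reads links them), not the new decomposition methods the record refers to. The read encoding over '0', '1' and '-' is an assumption made for illustration.

```python
def independent_blocks(reads):
    """Group SNP columns into blocks so that no read covers two blocks.

    reads: equal-length strings over '0', '1', '-' ('-' = site not covered).
    Returns a sorted list of column-index lists, one list per block.
    """
    n_sites = len(reads[0])
    parent = list(range(n_sites))  # union-find forest over column indices

    def find(x):
        # Follow parent links to the root, halving the path along the way.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for read in reads:
        covered = [j for j, c in enumerate(read) if c != '-']
        # Every column covered by the same read joins the same block.
        for j in covered[1:]:
            parent[find(j)] = find(covered[0])

    blocks = {}
    for j in range(n_sites):
        blocks.setdefault(find(j), []).append(j)
    return sorted(blocks.values())


# Hypothetical example: columns 0-2 are chained by two reads, 3-4 by a third.
example_reads = ["01---", "-10--", "---01"]
blocks = independent_blocks(example_reads)
```

    Each block can then be optimized separately, because a read assigned within one block places no constraint on the haplotypes of another.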

  5. Determination of haplotypes at structurally complex regions using emulsion haplotype fusion PCR.

    PubMed

    Tyson, Jess; Armour, John A L

    2012-12-11

    Genotyping and massively-parallel sequencing projects result in a vast amount of diploid data that is only rarely resolved into its constituent haplotypes. It is nevertheless this phased information that is transmitted from one generation to the next and is most directly associated with biological function and the genetic causes of biological effects. Despite progress made in genome-wide sequencing and phasing algorithms and methods, problems assembling (and reconstructing linear haplotypes in) regions of repetitive DNA and structural variation remain. These dynamic and structurally complex regions are often poorly understood from a sequence point of view. Regions such as these that are highly similar in their sequence tend to be collapsed onto the genome assembly. This in turn means downstream determination of the true sequence haplotype in these regions poses a particular challenge. For structurally complex regions, a more focussed approach to assembling haplotypes may be required. In order to investigate reconstruction of spatial information at structurally complex regions, we have used an emulsion haplotype fusion PCR approach to reproducibly link sequences of up to 1 kb in length to allow phasing of multiple variants from neighbouring loci, using allele-specific PCR and sequencing to detect the phase. By using emulsion systems linking flanking regions to amplicons within the CNV, this led to the reconstruction of a 59 kb haplotype across the DEFA1A3 CNV in HapMap individuals. This study has demonstrated a novel use for emulsion haplotype fusion PCR in addressing the issue of reconstructing structural haplotypes at multiallelic copy variable regions, using the DEFA1A3 locus as an example.

  6. Determination of haplotypes at structurally complex regions using emulsion haplotype fusion PCR

    PubMed Central

    2012-01-01

    Background: Genotyping and massively-parallel sequencing projects result in a vast amount of diploid data that is only rarely resolved into its constituent haplotypes. It is nevertheless this phased information that is transmitted from one generation to the next and is most directly associated with biological function and the genetic causes of biological effects. Despite progress made in genome-wide sequencing and phasing algorithms and methods, problems assembling (and reconstructing linear haplotypes in) regions of repetitive DNA and structural variation remain. These dynamic and structurally complex regions are often poorly understood from a sequence point of view. Regions such as these that are highly similar in their sequence tend to be collapsed onto the genome assembly. This in turn means downstream determination of the true sequence haplotype in these regions poses a particular challenge. For structurally complex regions, a more focussed approach to assembling haplotypes may be required. Results: In order to investigate reconstruction of spatial information at structurally complex regions, we have used an emulsion haplotype fusion PCR approach to reproducibly link sequences of up to 1 kb in length to allow phasing of multiple variants from neighbouring loci, using allele-specific PCR and sequencing to detect the phase. By using emulsion systems linking flanking regions to amplicons within the CNV, this led to the reconstruction of a 59 kb haplotype across the DEFA1A3 CNV in HapMap individuals. Conclusion: This study has demonstrated a novel use for emulsion haplotype fusion PCR in addressing the issue of reconstructing structural haplotypes at multiallelic copy variable regions, using the DEFA1A3 locus as an example. PMID:23231411

  7. TUMOR HAPLOTYPE ASSEMBLY ALGORITHMS FOR CANCER GENOMICS

    PubMed Central

    AGUIAR, DEREK; WONG, WENDY S.W.; ISTRAIL, SORIN

    2014-01-01

    The growing availability of inexpensive high-throughput sequence data is enabling researchers to sequence tumor populations within a single individual at high coverage. However, cancer genome sequence evolution and mutational phenomena like driver mutations and gene fusions are difficult to investigate without first reconstructing tumor haplotype sequences. Haplotype assembly of single individual tumor populations is an exceedingly difficult task complicated by tumor haplotype heterogeneity, tumor or normal cell sequence contamination, polyploidy, and complex patterns of variation. While computational and experimental haplotype phasing of diploid genomes has seen much progress in recent years, haplotype assembly in cancer genomes remains uncharted territory. In this work, we describe HapCompass-Tumor, a computational modeling and algorithmic framework for haplotype assembly of copy number variable cancer genomes containing haplotypes at different frequencies and complex variation. We extend our polyploid haplotype assembly model and present novel algorithms for (1) complex variations, including copy number changes, as varying numbers of disjoint paths in an associated graph, (2) variable haplotype frequencies and contamination, and (3) computation of tumor haplotypes using simple cycles of the compass graph which constrain the space of haplotype assembly solutions. The model and algorithm are implemented in the software package HapCompass-Tumor, which is available for download from http://www.brown.edu/Research/Istrail_Lab/. PMID:24297529

  8. An accurate algorithm for the detection of DNA fragments from dilution pool sequencing experiments.

    PubMed

    Bansal, Vikas

    2018-01-01

    The short read lengths of current high-throughput sequencing technologies limit the ability to recover long-range haplotype information. Dilution pool methods for preparing DNA sequencing libraries from high molecular weight DNA fragments enable the recovery of long DNA fragments from short sequence reads. These approaches require computational methods for identifying the DNA fragments using aligned sequence reads and assembling the fragments into long haplotypes. Although a number of computational methods have been developed for haplotype assembly, the problem of identifying DNA fragments from dilution pool sequence data has not received much attention. We formulate the problem of detecting DNA fragments from dilution pool sequencing experiments as a genome segmentation problem and develop an algorithm that uses dynamic programming to optimize a likelihood function derived from a generative model for the sequence reads. This algorithm uses an iterative approach to automatically infer the mean background read depth and the number of fragments in each pool. Using simulated data, we demonstrate that our method, FragmentCut, has 25-30% greater sensitivity compared with an HMM based method for fragment detection and can also detect overlapping fragments. On a whole-genome human fosmid pool dataset, the haplotypes assembled using the fragments identified by FragmentCut had greater N50 length, 16.2% lower switch error rate and 35.8% lower mismatch error rate compared with two existing methods. We further demonstrate the greater accuracy of our method using two additional dilution pool datasets. FragmentCut is available from https://bansal-lab.github.io/software/FragmentCut. Contact: vibansal@ucsd.edu. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press.
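
    The switch error rate quoted in the record above can be sketched in a few lines of Python. This is one common definition, stated here as an assumption: a switch error is a flip in agreement with the true phasing between adjacent heterozygous sites. It is not necessarily the exact measure used in the FragmentCut evaluation, and some tools treat two adjacent switches as a single mismatch error instead.

```python
def switch_errors(truth, assembled):
    """Count switch errors between two phasings of the same heterozygous sites.

    truth, assembled: equal-length strings over '0' and '1', each giving one
    haplotype of a diploid phasing (the other haplotype is its complement).
    """
    # agree[i] is True when the assembled haplotype matches the truth at site i.
    agree = [t == a for t, a in zip(truth, assembled)]
    # A switch is a change of agreement status between neighbouring sites.
    return sum(1 for x, y in zip(agree, agree[1:]) if x != y)


def switch_error_rate(truth, assembled):
    """Switch errors divided by the number of adjacent site pairs."""
    n = len(truth)
    return switch_errors(truth, assembled) / (n - 1) if n > 1 else 0.0


# Hypothetical example: the phase flips once, between the third and fourth site.
rate = switch_error_rate("010101", "010010")
```

    Because the count depends only on where agreement flips, reporting the complementary haplotype of a correct phasing yields zero switch errors, as it should.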

  9. De novo assembly of a haplotype-resolved human genome.

    PubMed

    Cao, Hongzhi; Wu, Honglong; Luo, Ruibang; Huang, Shujia; Sun, Yuhui; Tong, Xin; Xie, Yinlong; Liu, Binghang; Yang, Hailong; Zheng, Hancheng; Li, Jian; Li, Bo; Wang, Yu; Yang, Fang; Sun, Peng; Liu, Siyang; Gao, Peng; Huang, Haodong; Sun, Jing; Chen, Dan; He, Guangzhu; Huang, Weihua; Huang, Zheng; Li, Yue; Tellier, Laurent C A M; Liu, Xiao; Feng, Qiang; Xu, Xun; Zhang, Xiuqing; Bolund, Lars; Krogh, Anders; Kristiansen, Karsten; Drmanac, Radoje; Drmanac, Snezana; Nielsen, Rasmus; Li, Songgang; Wang, Jian; Yang, Huanming; Li, Yingrui; Wong, Gane Ka-Shu; Wang, Jun

    2015-06-01

    The human genome is diploid, and knowledge of the variants on each chromosome is important for the interpretation of genomic information. Here we report the assembly of a haplotype-resolved diploid genome without using a reference genome. Our pipeline relies on fosmid pooling together with whole-genome shotgun strategies, based solely on next-generation sequencing and hierarchical assembly methods. We applied our sequencing method to the genome of an Asian individual and generated a 5.15-Gb assembled genome with a haplotype N50 of 484 kb. Our analysis identified previously undetected indels and 7.49 Mb of novel coding sequences that could not be aligned to the human reference genome, which include at least six predicted genes. This haplotype-resolved genome represents the most complete de novo human genome assembly to date. Application of our approach to identify individual haplotype differences should aid in translating genotypes to phenotypes for the development of personalized medicine.
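
    Several records in this listing report N50 values for contigs, scaffolds or phased blocks. As a reminder of what the statistic measures, here is a minimal Python sketch using the usual definition: the largest length L such that pieces of length at least L together cover at least half of the total. The input lengths are hypothetical.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or haplotype block lengths."""
    total = sum(lengths)
    running = 0
    # Walk from the longest piece down until half of the total is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    return 0  # empty input


# Hypothetical block lengths in kb; 6 + 5 = 11 covers at least half of 20.
block_n50 = n50([2, 3, 4, 5, 6])
```

    A haplotype N50 applies the same calculation to phased block lengths rather than contig lengths.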

  10. BAsE-Seq: a method for obtaining long viral haplotypes from short sequence reads.

    PubMed

    Hong, Lewis Z; Hong, Shuzhen; Wong, Han Teng; Aw, Pauline P K; Cheng, Yan; Wilm, Andreas; de Sessions, Paola F; Lim, Seng Gee; Nagarajan, Niranjan; Hibberd, Martin L; Quake, Stephen R; Burkholder, William F

    2014-01-01

    We present a method for obtaining long haplotypes, of over 3 kb in length, using a short-read sequencer, Barcode-directed Assembly for Extra-long Sequences (BAsE-Seq). BAsE-Seq relies on transposing a template-specific barcode onto random segments of the template molecule and assembling the barcoded short reads into complete haplotypes. We applied BAsE-Seq on mixed clones of hepatitis B virus and accurately identified haplotypes occurring at frequencies greater than or equal to 0.4%, with >99.9% specificity. Applying BAsE-Seq to a clinical sample, we obtained over 9,000 viral haplotypes, which provided an unprecedented view of hepatitis B virus population structure during chronic infection. BAsE-Seq is readily applicable for monitoring quasispecies evolution in viral diseases.

  11. Reference quality assembly of the 3.5-Gb genome of Capsicum annuum from a single linked-read library.

    PubMed

    Hulse-Kemp, Amanda M; Maheshwari, Shamoni; Stoffel, Kevin; Hill, Theresa A; Jaffe, David; Williams, Stephen R; Weisenfeld, Neil; Ramakrishnan, Srividya; Kumar, Vijay; Shah, Preyas; Schatz, Michael C; Church, Deanna M; Van Deynze, Allen

    2018-01-01

    Linked-Read sequencing technology has recently been employed successfully for de novo assembly of human genomes; however, the utility of this technology for complex plant genomes is unproven. We evaluated the technology for this purpose by sequencing the 3.5-gigabase (Gb) diploid pepper (Capsicum annuum) genome with a single Linked-Read library. Plant genomes, including pepper, are characterized by long, highly similar repetitive sequences. Accordingly, significant effort is used to ensure that the sequenced plant is highly homozygous and the resulting assembly is a haploid consensus. With a phased assembly approach, we targeted a heterozygous F1 derived from a wide cross to assess the ability to derive both haplotypes and characterize a pungency gene with a large insertion/deletion. The Supernova software generated a highly ordered, more contiguous sequence assembly than all currently available C. annuum reference genomes. Over 83% of the final assembly was anchored and oriented using four publicly available de novo linkage maps. A comparison of the annotation of conserved eukaryotic genes indicated the completeness of assembly. The validity of the phased assembly is further demonstrated with the complete recovery of both 2.5-Kb insertion/deletion haplotypes of the PUN1 locus in the F1 sample that represents pungent and nonpungent peppers, as well as nearly full recovery of the BUSCO2 gene set within each of the two haplotypes. The most contiguous pepper genome assembly to date has been generated, which demonstrates that Linked-Read library technology provides a tool to de novo assemble complex highly repetitive heterozygous plant genomes. This technology can provide an opportunity to cost-effectively develop high-quality genome assemblies for other complex plants and compare structural and gene differences through accurate haplotype reconstruction.

  12. A Near-Complete Haplotype-Phased Genome of the Dikaryotic Wheat Stripe Rust Fungus Puccinia striiformis f. sp. tritici Reveals High Interhaplotype Diversity

    PubMed Central

    Sperschneider, Jana; Garnica, Diana P.; Miller, Marisa E.; Taylor, Jennifer M.; Dodds, Peter N.; Park, Robert F.

    2018-01-01

    A long-standing biological question is how evolution has shaped the genomic architecture of dikaryotic fungi. To answer this, high-quality genomic resources that enable haplotype comparisons are essential. Short-read genome assemblies for dikaryotic fungi are highly fragmented and lack haplotype-specific information due to the high heterozygosity and repeat content of these genomes. Here, we present a diploid-aware assembly of the wheat stripe rust fungus Puccinia striiformis f. sp. tritici based on long reads using the FALCON-Unzip assembler. Transcriptome sequencing data sets were used to infer high-quality gene models and identify virulence genes involved in plant infection referred to as effectors. This represents the most complete Puccinia striiformis f. sp. tritici genome assembly to date (83 Mb, 156 contigs, N50 of 1.5 Mb) and provides phased haplotype information for over 92% of the genome. Comparisons of the phase blocks revealed high interhaplotype diversity of over 6%. More than 25% of all genes lack a clear allelic counterpart. When we investigated genome features that potentially promote the rapid evolution of virulence, we found that candidate effector genes are spatially associated with conserved genes commonly found in basidiomycetes. Yet, candidate effectors that lack an allelic counterpart are more distant from conserved genes than allelic candidate effectors and are less likely to be evolutionarily conserved within the P. striiformis species complex and Pucciniales. In summary, this haplotype-phased assembly enabled us to discover novel genome features of a dikaryotic plant-pathogenic fungus previously hidden in collapsed and fragmented genome assemblies. PMID:29463659

  13. De novo assembly and phasing of a Korean human genome.

    PubMed

    Seo, Jeong-Sun; Rhie, Arang; Kim, Junsoo; Lee, Sangjin; Sohn, Min-Hwan; Kim, Chang-Uk; Hastie, Alex; Cao, Han; Yun, Ji-Young; Kim, Jihye; Kuk, Junho; Park, Gun Hwa; Kim, Juhyeok; Ryu, Hanna; Kim, Jongbum; Roh, Mira; Baek, Jeonghun; Hunkapiller, Michael W; Korlach, Jonas; Shin, Jong-Yeon; Kim, Changhoon

    2016-10-13

    Advances in genome assembly and phasing provide an opportunity to investigate the diploid architecture of the human genome and reveal the full range of structural variation across population groups. Here we report the de novo assembly and haplotype phasing of the Korean individual AK1 (ref. 1) using single-molecule real-time sequencing, next-generation mapping, microfluidics-based linked reads, and bacterial artificial chromosome (BAC) sequencing approaches. Single-molecule sequencing coupled with next-generation mapping generated a highly contiguous assembly, with a contig N50 size of 17.9 Mb and a scaffold N50 size of 44.8 Mb, resolving 8 chromosomal arms into single scaffolds. The de novo assembly, along with local assemblies and spanning long reads, closes 105 and extends into 72 out of 190 euchromatic gaps in the reference genome, adding 1.03 Mb of previously intractable sequence. High concordance between the assembly and paired-end sequences from 62,758 BAC clones provides strong support for the robustness of the assembly. We identify 18,210 structural variants by direct comparison of the assembly with the human reference, identifying thousands of breakpoints that, to our knowledge, have not been reported before. Many of the insertions are reflected in the transcriptome and are shared across the Asian population. We performed haplotype phasing of the assembly with short reads, long reads and linked reads from whole-genome sequencing and with short reads from 31,719 BAC clones, thereby achieving phased blocks with an N50 size of 11.6 Mb. Haplotigs assembled from single-molecule real-time reads assigned to haplotypes on phased blocks covered 89% of genes. The haplotigs accurately characterized the hypervariable major histocompatibility complex region as well as demonstrating allele configuration in clinically relevant genes such as CYP2D6. This work presents the most contiguous diploid human genome assembly so far, with extensive investigation of unreported and Asian-specific structural variants, and high-quality haplotyping of clinically relevant alleles for precision medicine.

  14. De Novo Assembly and Phasing of Dikaryotic Genomes from Two Isolates of Puccinia coronata f. sp. avenae, the Causal Agent of Oat Crown Rust

    PubMed Central

    Miller, Marisa E.; Zhang, Ying; Omidvar, Vahid; Sperschneider, Jana; Raley, Castle; Palmer, Jonathan M.; Garnica, Diana; Upadhyaya, Narayana; Rathjen, John; Taylor, Jennifer M.; Park, Robert F.; Dodds, Peter N.; Hirsch, Cory D.

    2018-01-01

    Oat crown rust, caused by the fungus Puccinia coronata f. sp. avenae, is a devastating disease that impacts worldwide oat production. For much of its life cycle, P. coronata f. sp. avenae is dikaryotic, with two separate haploid nuclei that may vary in virulence genotype, highlighting the importance of understanding haplotype diversity in this species. We generated highly contiguous de novo genome assemblies of two P. coronata f. sp. avenae isolates, 12SD80 and 12NC29, from long-read sequences. In total, we assembled 603 primary contigs for 12SD80, for a total assembly length of 99.16 Mbp, and 777 primary contigs for 12NC29, for a total length of 105.25 Mbp; approximately 52% of each genome was assembled into alternate haplotypes. This revealed structural variation between haplotypes in each isolate equivalent to more than 2% of the genome size, in addition to about 260,000 and 380,000 heterozygous single-nucleotide polymorphisms in 12SD80 and 12NC29, respectively. Transcript-based annotation identified 26,796 and 28,801 coding sequences for isolates 12SD80 and 12NC29, respectively, including about 7,000 allele pairs in haplotype-phased regions. Furthermore, expression profiling revealed clusters of coexpressed secreted effector candidates, and the majority of orthologous effectors between isolates showed conservation of expression patterns. However, a small subset of orthologs showed divergence in expression, which may contribute to differences in virulence between 12SD80 and 12NC29. This study provides the first haplotype-phased reference genome for a dikaryotic rust fungus as a foundation for future studies into virulence mechanisms in P. coronata f. sp. avenae. PMID:29463655

  15. De Novo Assembly and Phasing of Dikaryotic Genomes from Two Isolates of Puccinia coronata f. sp. avenae, the Causal Agent of Oat Crown Rust.

    PubMed

    Miller, Marisa E; Zhang, Ying; Omidvar, Vahid; Sperschneider, Jana; Schwessinger, Benjamin; Raley, Castle; Palmer, Jonathan M; Garnica, Diana; Upadhyaya, Narayana; Rathjen, John; Taylor, Jennifer M; Park, Robert F; Dodds, Peter N; Hirsch, Cory D; Kianian, Shahryar F; Figueroa, Melania

    2018-02-20

    Oat crown rust, caused by the fungus Puccinia coronata f. sp. avenae, is a devastating disease that impacts worldwide oat production. For much of its life cycle, P. coronata f. sp. avenae is dikaryotic, with two separate haploid nuclei that may vary in virulence genotype, highlighting the importance of understanding haplotype diversity in this species. We generated highly contiguous de novo genome assemblies of two P. coronata f. sp. avenae isolates, 12SD80 and 12NC29, from long-read sequences. In total, we assembled 603 primary contigs for 12SD80, for a total assembly length of 99.16 Mbp, and 777 primary contigs for 12NC29, for a total length of 105.25 Mbp; approximately 52% of each genome was assembled into alternate haplotypes. This revealed structural variation between haplotypes in each isolate equivalent to more than 2% of the genome size, in addition to about 260,000 and 380,000 heterozygous single-nucleotide polymorphisms in 12SD80 and 12NC29, respectively. Transcript-based annotation identified 26,796 and 28,801 coding sequences for isolates 12SD80 and 12NC29, respectively, including about 7,000 allele pairs in haplotype-phased regions. Furthermore, expression profiling revealed clusters of coexpressed secreted effector candidates, and the majority of orthologous effectors between isolates showed conservation of expression patterns. However, a small subset of orthologs showed divergence in expression, which may contribute to differences in virulence between 12SD80 and 12NC29. This study provides the first haplotype-phased reference genome for a dikaryotic rust fungus as a foundation for future studies into virulence mechanisms in P. coronata f. sp. avenae. IMPORTANCE: Disease management strategies for oat crown rust are challenged by the rapid evolution of Puccinia coronata f. sp. avenae, which renders resistance genes in oat varieties ineffective. Despite the economic importance of understanding P. coronata f. sp. avenae, resources to study the molecular mechanisms underpinning pathogenicity and the emergence of new virulence traits are lacking. Such limitations are partly due to the obligate biotrophic lifestyle of P. coronata f. sp. avenae as well as the dikaryotic nature of the genome, features that are also shared with other important rust pathogens. This study reports the first release of a haplotype-phased genome assembly for a dikaryotic fungal species and demonstrates the amenability of using emerging technologies to investigate genetic diversity in populations of P. coronata f. sp. avenae. Copyright © 2018 Miller et al.

  16. Extended Islands of Tractability for Parsimony Haplotyping

    NASA Astrophysics Data System (ADS)

    Fleischer, Rudolf; Guo, Jiong; Niedermeier, Rolf; Uhlmann, Johannes; Wang, Yihui; Weller, Mathias; Wu, Xi

    Parsimony haplotyping is the problem of finding a smallest size set of haplotypes that can explain a given set of genotypes. The problem is NP-hard, and many heuristic and approximation algorithms as well as polynomial-time solvable special cases have been discovered. We propose improved fixed-parameter tractability results with respect to the parameter "size of the target haplotype set" k by presenting an O*(k^(4k))-time algorithm. This also applies to the practically important constrained case, where we can only use haplotypes from a given set. Furthermore, we show that the problem becomes polynomial-time solvable if the given set of genotypes is complete, i.e., contains all possible genotypes that can be explained by the set of haplotypes.
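
    The notion of a haplotype set "explaining" a genotype set can be made concrete with a small Python sketch. It assumes the common encoding in which a genotype is a string over '0', '1' and '2', with '2' marking a heterozygous site, and it finds a smallest explaining set by brute force. This is not the fixed-parameter algorithm of the record above and is practical only for a handful of sites.

```python
from itertools import combinations, combinations_with_replacement, product


def explains(h1, h2, genotype):
    """True if the haplotype pair (h1, h2) resolves the genotype."""
    for a, b, g in zip(h1, h2, genotype):
        if g == '2':
            if a == b:  # a heterozygous site needs two different alleles
                return False
        elif not (a == b == g):  # a homozygous site needs both alleles equal to g
            return False
    return True


def smallest_explaining_set(genotypes):
    """Brute-force search for a minimum-size haplotype set (illustrative only)."""
    n_sites = len(genotypes[0])
    all_haps = [''.join(bits) for bits in product('01', repeat=n_sites)]
    for k in range(1, len(all_haps) + 1):
        for subset in combinations(all_haps, k):
            # Pairs are drawn with replacement so that a fully homozygous
            # genotype can be explained by using one haplotype twice.
            pairs = list(combinations_with_replacement(subset, 2))
            if all(any(explains(a, b, g) for a, b in pairs) for g in genotypes):
                return list(subset)
    return None


# Hypothetical instance with three genotypes over two sites.
haps = smallest_explaining_set(["02", "20", "22"])
```

    Here no two haplotypes suffice, because the genotype "02" forces both 00 and 01 into the set and neither "20" nor "22" can then be resolved without a third haplotype.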

  17. Haplotype resolution of leukocyte receptor complex in cattle through targeted enrichment and SMRT sequencing

    USDA-ARS's Scientific Manuscript database

    The highly repetitive nature of cattle leukocyte receptor complex (LRC) has made it difficult to assemble and fully characterize this region with short reads used by second-generation sequencing. Previously, we reported the first two cattle killer immunoglobulin-like receptors (KIR) haplotypes; one ...

  18. Ultraaccurate genome sequencing and haplotyping of single human cells.

    PubMed

    Chu, Wai Keung; Edge, Peter; Lee, Ho Suk; Bansal, Vikas; Bafna, Vineet; Huang, Xiaohua; Zhang, Kun

    2017-11-21

    Accurate detection of variants and long-range haplotypes in genomes of single human cells remains very challenging. Common approaches require extensive in vitro amplification of genomes of individual cells using DNA polymerases and high-throughput short-read DNA sequencing. These approaches have two notable drawbacks. First, polymerase replication errors could generate tens of thousands of false-positive calls per genome. Second, relatively short sequence reads contain little to no haplotype information. Here we report a method, which is dubbed SISSOR (single-stranded sequencing using microfluidic reactors), for accurate single-cell genome sequencing and haplotyping. A microfluidic processor is used to separate the Watson and Crick strands of the double-stranded chromosomal DNA in a single cell and to randomly partition megabase-size DNA strands into multiple nanoliter compartments for amplification and construction of barcoded libraries for sequencing. The separation and partitioning of large single-stranded DNA fragments of the homologous chromosome pairs allows for the independent sequencing of each of the complementary and homologous strands. This enables the assembly of long haplotypes and reduction of sequence errors by using the redundant sequence information and haplotype-based error removal. We demonstrated the ability to sequence single-cell genomes with error rates as low as 10^-8 and average 500-kb-long DNA fragments that can be assembled into haplotype contigs with N50 greater than 7 Mb. The performance could be further improved with more uniform amplification and more accurate sequence alignment. The ability to obtain accurate genome sequences and haplotype information from single cells will enable applications of genome sequencing for diverse clinical needs. Copyright © 2017 the Author(s). Published by PNAS.

  19. Honey bee-inspired algorithms for SNP haplotype reconstruction problem

    NASA Astrophysics Data System (ADS)

    PourkamaliAnaraki, Maryam; Sadeghi, Mehdi

    2016-03-01

    Reconstructing haplotypes from SNP fragments is an important problem in computational biology. The field has attracted considerable interest because haplotypes have been shown to contain promising data for disease association research. Haplotype reconstruction under the Minimum Error Correction model has been proved to be NP-hard. Therefore, several methods such as clustering techniques, evolutionary algorithms, neural networks and swarm intelligence approaches have been proposed to solve this problem in reasonable time. In this paper, we focus on various evolutionary clustering techniques and try to find an efficient technique for solving the haplotype reconstruction problem. It can be inferred from our experiments that clustering methods relying on the behaviour of honey bee colonies in nature, specifically the bees algorithm and artificial bee colony methods, are expected to yield more efficient solutions. An application program implementing the methods is available at http://www.bioinf.cs.ipm.ir/software/haprs/
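
    The Minimum Error Correction (MEC) objective named in the abstract above can be illustrated with a short Python sketch. This is not the authors' bee-inspired method; it only scores one candidate solution. It assumes a diploid, biallelic setting in which every site is heterozygous, so the second haplotype is the complement of the first, and it uses `None` for positions a fragment does not cover. All names are hypothetical.

```python
def mismatches(fragment, haplotype):
    """Count covered positions where the fragment disagrees with the haplotype."""
    return sum(1 for f, h in zip(fragment, haplotype)
               if f is not None and f != h)

def mec_score(fragments, haplotype):
    """MEC score of a candidate haplotype under the all-heterozygous assumption.

    Each fragment is assigned to whichever of the two haplotypes
    (the candidate or its complement) it matches better; the score is
    the total number of entries that would have to be corrected.
    """
    complement = [1 - a for a in haplotype]
    return sum(min(mismatches(f, haplotype), mismatches(f, complement))
               for f in fragments)

if __name__ == "__main__":
    fragments = [
        [0, 0, None],
        [1, 1, 1],
        [0, 1, 0],
        [None, 0, 0],
    ]
    print(mec_score(fragments, [0, 0, 0]))
```

    A search heuristic such as those surveyed in the abstract would call a scoring function of this kind repeatedly while exploring candidate haplotypes.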

  20. Mathematical properties and bounds on haplotyping populations by pure parsimony.

    PubMed

    Wang, I-Lin; Chang, Chia-Yuan

    2011-06-01

    Although the haplotype data can be used to analyze the function of DNA, due to the significant efforts required in collecting the haplotype data, usually the genotype data is collected and then the population haplotype inference (PHI) problem is solved to infer haplotype data from genotype data for a population. This paper investigates the PHI problem based on the pure parsimony criterion (HIPP), which seeks the minimum number of distinct haplotypes to infer a given genotype data. We analyze the mathematical structure and properties for the HIPP problem, propose techniques to reduce the given genotype data into an equivalent one of much smaller size, and analyze the relations of genotype data using a compatible graph. Based on the mathematical properties in the compatible graph, we propose a maximal clique heuristic to obtain an upper bound, and a new polynomial-sized integer linear programming formulation to obtain a lower bound for the HIPP problem. Copyright © 2011 Elsevier Inc. All rights reserved.

  1. Efficient algorithms for polyploid haplotype phasing.

    PubMed

    He, Dan; Saha, Subrata; Finkers, Richard; Parida, Laxmi

    2018-05-09

    Inference of haplotypes, or the sequence of alleles along the same chromosomes, is a fundamental problem in genetics and is a key component for many analyses including admixture mapping, identifying regions of identity by descent and imputation. Haplotype phasing based on sequencing reads has attracted much attention. Diploid haplotype phasing, where the two haplotypes are complementary, has been studied extensively. In this work, we focused on polyploid haplotype phasing, where we aim to phase more than two haplotypes at the same time from sequencing data. The problem is much more complicated as the search space becomes much larger and the haplotypes no longer need to be complementary. We proposed two algorithms: (1) Poly-Harsh, a Gibbs sampling based algorithm which alternately samples haplotypes and the read assignments to minimize the mismatches between the reads and the phased haplotypes; (2) an efficient algorithm to concatenate haplotype blocks into contiguous haplotypes. Our experiments showed that our method is able to improve the quality of the phased haplotypes over the state-of-the-art methods. To our knowledge, our algorithm for haplotype blocks concatenation is the first algorithm that leverages the shared information across multiple individuals to construct contiguous haplotypes. Our experiments showed that it is both efficient and effective.

  2. Comparative structural analysis of Bru1 region homeologs in Saccharum spontaneum and S. officinarum

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhang, Jisen; Sharma, Anupma; Yu, Qingyi

    Here, sugarcane is a major sugar and biofuel crop, but genomic research and molecular breeding have lagged behind other major crops due to the complexity of auto-allopolyploid genomes. Sugarcane cultivars are frequently aneuploid with chromosome number ranging from 100 to 130, consisting of 70-80 % S. officinarum, 10-20 % S. spontaneum, and 10 % recombinants between these two species. Analysis of a genomic region in the progenitor autoploid genomes of sugarcane hybrid cultivars will reveal the nature and divergence of homologous chromosomes. As a result, to investigate the origin and evolution of haplotypes in the Bru1 genomic regions in sugarcane cultivars, we identified two BAC clones from S. spontaneum and four from S. officinarum and compared to seven haplotype sequences from sugarcane hybrid R570. The results clarified the origin of seven homologous haplotypes in R570, four haplotypes originated from S. officinarum, two from S. spontaneum and one recombinant. Retrotransposon insertions and sequences variations among the homologous haplotypes sequence divergence ranged from 18.2 % to 60.5 % with an average of 33.7 %. Gene content and gene structure were relatively well conserved among the homologous haplotypes. Exon splitting occurred in haplotypes of the hybrid genome but not in its progenitor genomes. Tajima's D analysis revealed that S. spontaneum haplotypes in the Bru1 genomic regions were under strong directional selection. Numerous inversions, deletions, insertions and translocations were found between haplotypes within each genome. In conclusion, this is the first comparison among haplotypes of a modern sugarcane hybrid and its two progenitors. Tajima's D results emphasized the crucial role of this fungal disease resistance gene for enhancing the fitness of this species and indicating that the brown rust resistance gene in R570 is from S. spontaneum.
Species-specific InDel, sequences similarity and phylogenetic analysis of homologous genes can be used for identifying the origin of S. spontaneum and S. officinarum haplotype in Saccharum hybrids. Comparison of exon splitting among the homologous haplotypes suggested that the genome rearrangements in Saccharum hybrids S. officinarum would be sufficient for proper genome assembly of this autopolyploid genome. Retrotransposon insertions and sequences variations among the homologous haplotypes sequence divergence may allow sequencing and assembling the autopolyploid Saccharum genomes and the auto-allopolyploid hybrid genomes using whole genome shotgun sequencing.

  3. Comparative structural analysis of Bru1 region homeologs in Saccharum spontaneum and S. officinarum

    DOE PAGES

    Zhang, Jisen; Sharma, Anupma; Yu, Qingyi; ...

    2016-06-10

    Here, sugarcane is a major sugar and biofuel crop, but genomic research and molecular breeding have lagged behind other major crops due to the complexity of auto-allopolyploid genomes. Sugarcane cultivars are frequently aneuploid with chromosome number ranging from 100 to 130, consisting of 70-80 % S. officinarum, 10-20 % S. spontaneum, and 10 % recombinants between these two species. Analysis of a genomic region in the progenitor autoploid genomes of sugarcane hybrid cultivars will reveal the nature and divergence of homologous chromosomes. As a result, to investigate the origin and evolution of haplotypes in the Bru1 genomic regions in sugarcane cultivars, we identified two BAC clones from S. spontaneum and four from S. officinarum and compared to seven haplotype sequences from sugarcane hybrid R570. The results clarified the origin of seven homologous haplotypes in R570, four haplotypes originated from S. officinarum, two from S. spontaneum and one recombinant. Retrotransposon insertions and sequences variations among the homologous haplotypes sequence divergence ranged from 18.2 % to 60.5 % with an average of 33.7 %. Gene content and gene structure were relatively well conserved among the homologous haplotypes. Exon splitting occurred in haplotypes of the hybrid genome but not in its progenitor genomes. Tajima's D analysis revealed that S. spontaneum haplotypes in the Bru1 genomic regions were under strong directional selection. Numerous inversions, deletions, insertions and translocations were found between haplotypes within each genome. In conclusion, this is the first comparison among haplotypes of a modern sugarcane hybrid and its two progenitors. Tajima's D results emphasized the crucial role of this fungal disease resistance gene for enhancing the fitness of this species and indicating that the brown rust resistance gene in R570 is from S. spontaneum.
Species-specific InDel, sequences similarity and phylogenetic analysis of homologous genes can be used for identifying the origin of S. spontaneum and S. officinarum haplotype in Saccharum hybrids. Comparison of exon splitting among the homologous haplotypes suggested that the genome rearrangements in Saccharum hybrids S. officinarum would be sufficient for proper genome assembly of this autopolyploid genome. Retrotransposon insertions and sequences variations among the homologous haplotypes sequence divergence may allow sequencing and assembling the autopolyploid Saccharum genomes and the auto-allopolyploid hybrid genomes using whole genome shotgun sequencing.

  4. Haplotyping for disease association: a combinatorial approach.

    PubMed

    Lancia, Giuseppe; Ravi, R; Rizzi, Romeo

    2008-01-01

    We consider a combinatorial problem derived from haplotyping a population with respect to a genetic disease, either recessive or dominant. Given a set of individuals, partitioned into healthy and diseased, and the corresponding sets of genotypes, we want to infer "bad'' and "good'' haplotypes to account for these genotypes and for the disease. Assume e.g. the disease is recessive. Then, the resolving haplotypes must consist of bad and good haplotypes, so that (i) each genotype belonging to a diseased individual is explained by a pair of bad haplotypes and (ii) each genotype belonging to a healthy individual is explained by a pair of haplotypes of which at least one is good. We prove that the associated decision problem is NP-complete. However, we also prove that there is a simple solution, provided the data satisfy a very weak requirement.
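
    Conditions (i) and (ii) for the recessive case can be checked mechanically once candidate "bad" and "good" haplotype sets are given. The Python sketch below is only a verifier under assumed conventions, not the authors' construction: genotypes use the encoding 0/1 for homozygous sites and 2 for heterozygous sites, and the function names are hypothetical.

```python
def explains(h1, h2, g):
    """True if haplotypes h1 and h2 together explain genotype g.

    Assumed encoding: g[i] in {0, 1} is homozygous for that allele,
    g[i] == 2 is heterozygous.
    """
    return all((a == b == s) if s in (0, 1) else (a != b)
               for a, b, s in zip(h1, h2, g))

def consistent_recessive(bad, good, diseased, healthy):
    """Check conditions (i) and (ii) for a recessive disease.

    (i)  every diseased genotype is explained by a pair of bad haplotypes;
    (ii) every healthy genotype is explained by a pair containing
         at least one good haplotype.
    """
    everything = list(bad) + list(good)
    cond_i = all(any(explains(a, b, g) for a in bad for b in bad)
                 for g in diseased)
    cond_ii = all(any(explains(a, b, g) for a in good for b in everything)
                  for g in healthy)
    return cond_i and cond_ii

if __name__ == "__main__":
    bad = [(1, 1), (1, 0)]
    good = [(0, 0)]
    print(consistent_recessive(bad, good, diseased=[(1, 2)], healthy=[(2, 2)]))
```

    Verifying a proposed labelling is cheap; the hardness result in the abstract concerns finding such a labelling.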

  5. A Primary Assembly of a Bovine Haplotype Block Map Based on a 15,036-Single-Nucleotide Polymorphism Panel Genotyped in Holstein–Friesian Cattle

    PubMed Central

    Khatkar, Mehar S.; Zenger, Kyall R.; Hobbs, Matthew; Hawken, Rachel J.; Cavanagh, Julie A. L.; Barris, Wes; McClintock, Alexander E.; McClintock, Sara; Thomson, Peter C.; Tier, Bruce; Nicholas, Frank W.; Raadsma, Herman W.

    2007-01-01

    Analysis of data on 1000 Holstein–Friesian bulls genotyped for 15,036 single-nucleotide polymorphisms (SNPs) has enabled genomewide identification of haplotype blocks and tag SNPs. A final subset of 9195 SNPs in Hardy–Weinberg equilibrium and mapped on autosomes on the bovine sequence assembly (release Btau 3.1) was used in this study. The average intermarker spacing was 251.8 kb. The average minor allele frequency (MAF) was 0.29 (0.05–0.5). Following recent precedents in human HapMap studies, a haplotype block was defined where 95% of combinations of SNPs within a region are in very high linkage disequilibrium. A total of 727 haplotype blocks consisting of ≥3 SNPs were identified. The average block length was 69.7 ± 7.7 kb, which is ∼5–10 times larger than in humans. These blocks comprised a total of 2964 SNPs and covered 50,638 kb of the sequence map, which constitutes 2.18% of the length of all autosomes. A set of tag SNPs, which will be useful for further fine-mapping studies, has been identified. Overall, the results suggest that as many as 75,000–100,000 tag SNPs would be needed to track all important haplotype blocks in the bovine genome. This would require ∼250,000 SNPs in the discovery phase. PMID:17435229
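
    The coverage figures in the abstract above can be cross-checked with simple arithmetic. The Python sketch below uses only numbers quoted in the abstract; the derived quantities (implied total autosome length, approximate coverage from the mean block length, SNPs per block) are back-of-the-envelope estimates, not values reported by the authors, and the variable names are hypothetical.

```python
# Values quoted in the abstract
blocks = 727              # haplotype blocks with >= 3 SNPs
mean_block_kb = 69.7      # average block length in kb
covered_kb = 50_638       # total sequence covered by blocks, in kb
covered_fraction = 0.0218 # 2.18% of the length of all autosomes
snps_in_blocks = 2964     # SNPs contained in the blocks

# Derived estimates (not stated in the abstract)
implied_autosome_kb = covered_kb / covered_fraction  # total autosome length implied by 2.18%
approx_covered_kb = blocks * mean_block_kb           # blocks x mean length, should be near covered_kb
snps_per_block = snps_in_blocks / blocks             # average SNPs per block

if __name__ == "__main__":
    print(f"implied autosome length: {implied_autosome_kb / 1e6:.2f} Gb")
    print(f"blocks x mean length:    {approx_covered_kb:.1f} kb")
    print(f"SNPs per block:          {snps_per_block:.2f}")
```

    The product of block count and mean block length lands within about 0.1% of the reported 50,638 kb, so the quoted figures are mutually consistent.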

  6. A new mathematical modeling for pure parsimony haplotyping problem.

    PubMed

    Feizabadi, R; Bagherian, M; Vaziri, H R; Salahi, M

    2016-11-01

    The pure parsimony haplotyping (PPH) problem is important in bioinformatics because rational haplotype inference plays an important role in the analysis of genetic data and in mapping complex genetic diseases such as Alzheimer's disease and heart disorders. Haplotypes and genotypes are m-length sequences. Although several integer programming models have already been presented for the PPH problem, its NP-hardness makes those models ineffective on real instances, especially instances with many heterozygous sites. In this paper, we assign a corresponding number to each haplotype and genotype and, based on those numbers, we set up a mixed integer programming model. Using numbers instead of sequences makes the new model less complex than previous models, in that it contains neither constraints nor variables corresponding to heterozygous nucleotide sites. Experimental results confirm the efficiency of the new model in producing better solutions in comparison to two state-of-the-art haplotyping approaches. Copyright © 2016 Elsevier Inc. All rights reserved.

  7. Variation analysis and gene annotation of eight MHC haplotypes: The MHC Haplotype Project

    PubMed Central

    Horton, Roger; Gibson, Richard; Coggill, Penny; Miretti, Marcos; Allcock, Richard J.; Almeida, Jeff; Forbes, Simon; Gilbert, James G. R.; Halls, Karen; Harrow, Jennifer L.; Hart, Elizabeth; Howe, Kevin; Jackson, David K.; Palmer, Sophie; Roberts, Anne N.; Sims, Sarah; Stewart, C. Andrew; Traherne, James A.; Trevanion, Steve; Wilming, Laurens; Rogers, Jane; de Jong, Pieter J.; Elliott, John F.; Sawcer, Stephen; Todd, John A.; Trowsdale, John

    2008-01-01

    The human major histocompatibility complex (MHC) is contained within about 4 Mb on the short arm of chromosome 6 and is recognised as the most variable region in the human genome. The primary aim of the MHC Haplotype Project was to provide a comprehensively annotated reference sequence of a single, human leukocyte antigen-homozygous MHC haplotype and to use it as a basis against which variations could be assessed from seven other similarly homozygous cell lines, representative of the most common MHC haplotypes in the European population. Comparison of the haplotype sequences, including four haplotypes not previously analysed, resulted in the identification of >44,000 variations, both substitutions and indels (insertions and deletions), which have been submitted to the dbSNP database. The gene annotation uncovered haplotype-specific differences and confirmed the presence of more than 300 loci, including over 160 protein-coding genes. Combined analysis of the variation and annotation datasets revealed 122 gene loci with coding substitutions of which 97 were non-synonymous. The haplotype (A3-B7-DR15; PGF cell line) designated as the new MHC reference sequence, has been incorporated into the human genome assembly (NCBI35 and subsequent builds), and constitutes the largest single-haplotype sequence of the human genome to date. The extensive variation and annotation data derived from the analysis of seven further haplotypes have been made publicly available and provide a framework and resource for future association studies of all MHC-associated diseases and transplant medicine. PMID:18193213

  8. A parsimonious tree-grow method for haplotype inference.

    PubMed

    Li, Zhenping; Zhou, Wenfeng; Zhang, Xiang-Sun; Chen, Luonan

    2005-09-01

    Haplotype information has become increasingly important in analyzing fine-scale molecular genetics data, such as disease gene mapping and drug design. Parsimony haplotyping is one of the haplotyping problems belonging to the NP-hard class. In this paper, we aim to develop a novel algorithm for the haplotype inference problem with the parsimony criterion, based on a parsimonious tree-grow method (PTG). PTG is a heuristic algorithm that can find the minimum number of distinct haplotypes based on the criterion of keeping all genotypes resolved during the tree-grow process. In addition, a block-partitioning method is also proposed to improve the computational efficiency. We show that the proposed approach is not only effective with a high accuracy, but also very efficient with the computational complexity in the order of O(m^2 n) time for n single nucleotide polymorphism sites in m individual genotypes. The software is available upon request from the authors, or from http://zhangroup.aporc.org/bioinfo/ptg/ (contact: chen@elec.osaka-sandai.ac.jp). Supporting material is available from http://zhangroup.aporc.org/bioinfo/ptg/bti572supplementary.pdf

  9. Long-range barcode labeling-sequencing

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chen, Feng; Zhang, Tao; Singh, Kanwar K.

    Methods for sequencing single large DNA molecules by clonal multiple displacement amplification using barcoded primers. Sequences are binned based on barcode sequences and sequenced using a microdroplet-based method for sequencing large polynucleotide templates to enable assembly of haplotype-resolved complex genomes and metagenomes.

  10. Mitochondrial Genome Sequence of the Scabies Mite Provides Insight into the Genetic Diversity of Individual Scabies Infections

    PubMed Central

    Mofiz, Ehtesham; Seemann, Torsten; Bahlo, Melanie; Holt, Deborah; Currie, Bart J.

    2016-01-01

    The scabies mite, Sarcoptes scabiei, is an obligate parasite of the skin that infects humans and other animal species, causing scabies, a contagious disease characterized by extreme itching. Scabies infections are a major health problem, particularly in remote Indigenous communities in Australia, where co-infection of epidermal scabies lesions by Group A Streptococci or Staphylococcus aureus is thought to be responsible for the high rate of rheumatic heart disease and chronic kidney disease. We collected and separately sequenced mite DNA from several pools of thousands of whole mites from a porcine model of scabies (S. scabiei var. suis) and two human patients (S. scabiei var. hominis) living in different regions of northern Australia. Our sequencing samples the mite and its metagenome, including the mite gut flora and the wound micro-environment. Here, we describe the mitochondrial genome of the scabies mite. We developed a new de novo assembly pipeline based on a bait-and-reassemble strategy, which produced a 14 kilobase mitochondrial genome sequence assembly. We also annotated 35 genes and have compared these to other Acari mites. We identified single nucleotide polymorphisms (SNPs) and used these to infer the presence of six haplogroups in our samples. Remarkably, these fall into two closely-related clades with one clade including both human and pig varieties. This supports earlier findings that only limited genetic differences may separate some human and animal varieties, and raises the possibility of cross-host infections. Finally, we used these mitochondrial haplotypes to show that the genetic diversity of individual infections is typically small with 1–3 distinct haplotypes per infestation. PMID:26872064

  11. Mitochondrial Genome Sequence of the Scabies Mite Provides Insight into the Genetic Diversity of Individual Scabies Infections.

    PubMed

    Mofiz, Ehtesham; Seemann, Torsten; Bahlo, Melanie; Holt, Deborah; Currie, Bart J; Fischer, Katja; Papenfuss, Anthony T

    2016-02-01

    The scabies mite, Sarcoptes scabiei, is an obligate parasite of the skin that infects humans and other animal species, causing scabies, a contagious disease characterized by extreme itching. Scabies infections are a major health problem, particularly in remote Indigenous communities in Australia, where co-infection of epidermal scabies lesions by Group A Streptococci or Staphylococcus aureus is thought to be responsible for the high rate of rheumatic heart disease and chronic kidney disease. We collected and separately sequenced mite DNA from several pools of thousands of whole mites from a porcine model of scabies (S. scabiei var. suis) and two human patients (S. scabiei var. hominis) living in different regions of northern Australia. Our sequencing samples the mite and its metagenome, including the mite gut flora and the wound micro-environment. Here, we describe the mitochondrial genome of the scabies mite. We developed a new de novo assembly pipeline based on a bait-and-reassemble strategy, which produced a 14 kilobase mitochondrial genome sequence assembly. We also annotated 35 genes and have compared these to other Acari mites. We identified single nucleotide polymorphisms (SNPs) and used these to infer the presence of six haplogroups in our samples. Remarkably, these fall into two closely-related clades with one clade including both human and pig varieties. This supports earlier findings that only limited genetic differences may separate some human and animal varieties, and raises the possibility of cross-host infections. Finally, we used these mitochondrial haplotypes to show that the genetic diversity of individual infections is typically small with 1-3 distinct haplotypes per infestation.

  12. Observation of quantum criticality with ultracold atoms in optical lattices

    NASA Astrophysics Data System (ADS)

    Zhang, Xibo

    As biological problems become more complex and data grow at a rate much faster than that of computer hardware, new and faster algorithms are required. This dissertation investigates computational problems arising in two fields, comparative genomics and epigenomics, and employs a variety of computational techniques to address the problems. One fundamental question in the studies of chromosome evolution is whether the rearrangement breakpoints are happening at random positions or along certain hotspots. We investigate the breakpoint reuse phenomenon, and show the analyses that support the more recently proposed fragile breakage model as opposed to the conventional random breakage models for chromosome evolution. The identification of syntenic regions between chromosomes forms the basis for studies of genome architectures, comparative genomics, and evolutionary genomics. The previous synteny block reconstruction algorithms could not be scaled to a large number of mammalian genomes being sequenced; neither did they address the issue of generating non-overlapping synteny blocks suitable for analyzing rearrangements and evolutionary history of large-scale duplications prevalent in plant genomes. We present a new unified synteny block generation algorithm based on the A-Bruijn graph framework that overcomes these shortcomings. In epigenome sequencing, a sample may contain a mixture of epigenomes and there is a need to resolve the distinct methylation patterns from the mixture. Many sequencing applications, such as haplotype inference for diploid or polyploid genomes, and metagenomic sequencing, share a similar objective: to infer a set of distinct assemblies from reads that are sequenced from a heterogeneous sample and subsequently aligned to a reference genome. We model the problem from both combinatorial and statistical angles. First, we describe a theoretical framework.
A linear-time algorithm is then given to resolve a minimum number of assemblies that are consistent with all reads, substantially improving on previous algorithms. An efficient algorithm is also described to determine a set of assemblies that is consistent with a maximum subset of the reads, a previously untreated problem. We then prove that allowing nested reads or permitting mismatches between reads and their assemblies renders these problems NP-hard. Second, we describe a mixture model-based approach, and apply the model to the detection of allele-specific methylations.

  13. Capturing haplotypes in germplasm core collections

    USDA-ARS's Scientific Manuscript database

    Genomewide data sets of single nucleotide polymorphisms (SNPs) offer great potential to improve ex situ conservation. Two factors impede their use for producing core collections. First, due to the large number of SNPs, the assembly of collections that maximize diversity may be intractable using ex...

  14. Inferring mechanisms of copy number change from haplotype structures at the human DEFA1A3 locus.

    PubMed

    Black, Holly A; Khan, Fayeza F; Tyson, Jess; Al Armour, John

    2014-07-21

    The determination of structural haplotypes at copy number variable regions can indicate the mechanisms responsible for changes in copy number, as well as explain the relationship between gene copy number and expression. However, obtaining spatial information at regions displaying extensive copy number variation, such as the DEFA1A3 locus, is complex, because of the difficulty in the phasing and assembly of these regions. The DEFA1A3 locus is intriguing in that it falls within a region of high linkage disequilibrium, despite its high variability in copy number (n = 3-16); hence, the mechanisms responsible for changes in copy number at this locus are unclear. In this study, a region flanking the DEFA1A3 locus was sequenced across 120 independent haplotypes with European ancestry, identifying five common classes of DEFA1A3 haplotype. Assigning DEFA1A3 class to haplotypes within the 1000 Genomes project highlights a significant difference in DEFA1A3 class frequencies between populations with different ancestry. The features of each DEFA1A3 class, for example, the associated DEFA1A3 copy numbers, were initially assessed in a European cohort (n = 599) and replicated in the 1000 Genomes samples, showing within-class similarity, but between-class and between-population differences in the features of the DEFA1A3 locus. Emulsion haplotype fusion-PCR was used to generate 61 structural haplotypes at the DEFA1A3 locus, showing a high within-class similarity in structure. Structural haplotypes across the DEFA1A3 locus indicate that intra-allelic rearrangement is the predominant mechanism responsible for changes in DEFA1A3 copy number, explaining the conservation of linkage disequilibrium across the locus. The identification of common structural haplotypes at the DEFA1A3 locus could aid studies into how DEFA1A3 copy number influences expression, which is currently unclear.

  15. The diploid genome sequence of an Asian individual

    PubMed Central

    Wang, Jun; Wang, Wei; Li, Ruiqiang; Li, Yingrui; Tian, Geng; Goodman, Laurie; Fan, Wei; Zhang, Junqing; Li, Jun; Zhang, Juanbin; Guo, Yiran; Feng, Binxiao; Li, Heng; Lu, Yao; Fang, Xiaodong; Liang, Huiqing; Du, Zhenglin; Li, Dong; Zhao, Yiqing; Hu, Yujie; Yang, Zhenzhen; Zheng, Hancheng; Hellmann, Ines; Inouye, Michael; Pool, John; Yi, Xin; Zhao, Jing; Duan, Jinjie; Zhou, Yan; Qin, Junjie; Ma, Lijia; Li, Guoqing; Yang, Zhentao; Zhang, Guojie; Yang, Bin; Yu, Chang; Liang, Fang; Li, Wenjie; Li, Shaochuan; Li, Dawei; Ni, Peixiang; Ruan, Jue; Li, Qibin; Zhu, Hongmei; Liu, Dongyuan; Lu, Zhike; Li, Ning; Guo, Guangwu; Zhang, Jianguo; Ye, Jia; Fang, Lin; Hao, Qin; Chen, Quan; Liang, Yu; Su, Yeyang; san, A.; Ping, Cuo; Yang, Shuang; Chen, Fang; Li, Li; Zhou, Ke; Zheng, Hongkun; Ren, Yuanyuan; Yang, Ling; Gao, Yang; Yang, Guohua; Li, Zhuo; Feng, Xiaoli; Kristiansen, Karsten; Wong, Gane Ka-Shu; Nielsen, Rasmus; Durbin, Richard; Bolund, Lars; Zhang, Xiuqing; Li, Songgang; Yang, Huanming; Wang, Jian

    2009-01-01

    Here we present the first diploid genome sequence of an Asian individual. The genome was sequenced to 36-fold average coverage using massively parallel sequencing technology. We aligned the short reads onto the NCBI human reference genome to 99.97% coverage, and guided by the reference genome, we used uniquely mapped reads to assemble a high-quality consensus sequence for 92% of the Asian individual's genome. We identified approximately 3 million single-nucleotide polymorphisms (SNPs) inside this region, of which 13.6% were not in the dbSNP database. Genotyping analysis showed that SNP identification had high accuracy and consistency, indicating the high sequence quality of this assembly. We also carried out heterozygote phasing and haplotype prediction against HapMap CHB and JPT haplotypes (Chinese and Japanese, respectively), sequence comparison with the two available individual genomes (J. D. Watson and J. C. Venter), and structural variation identification. These variations were considered for their potential biological impact. Our sequence data and analyses demonstrate the potential usefulness of next-generation sequencing technologies for personal genomics. PMID:18987735

  16. Proof-of-principle rapid noninvasive prenatal diagnosis of autosomal recessive founder mutations

    PubMed Central

    Zeevi, David A.; Altarescu, Gheona; Weinberg-Shukron, Ariella; Zahdeh, Fouad; Dinur, Tama; Chicco, Gaya; Herskovitz, Yair; Renbaum, Paul; Elstein, Deborah; Levy-Lahad, Ephrat; Rolfs, Arndt; Zimran, Ari

    2015-01-01

    BACKGROUND. Noninvasive prenatal testing can be used to accurately detect chromosomal aneuploidies in circulating fetal DNA; however, the necessity of parental haplotype construction is a primary drawback to noninvasive prenatal diagnosis (NIPD) of monogenic disease. Family-specific haplotype assembly is essential for accurate diagnosis of minuscule amounts of circulating cell-free fetal DNA; however, current haplotyping techniques are too time-consuming and laborious to be carried out within the limited time constraints of prenatal testing, hampering practical application of NIPD in the clinic. Here, we have addressed this pitfall and devised a universal strategy for rapid NIPD of a prevalent mutation in the Ashkenazi Jewish (AJ) population. METHODS. Pregnant AJ couples, carrying mutation(s) in GBA, which encodes acid β-glucosidase, were recruited at the SZMC Gaucher Clinic. Targeted next-generation sequencing of GBA-flanking SNPs was performed on peripheral blood samples from each couple, relevant mutation carrier family members, and unrelated individuals who are homozygotes for an AJ founder mutation. Allele-specific haplotypes were constructed based on linkage, and a consensus Gaucher disease–associated founder mutation–flanking haplotype was fine mapped. Together, these haplotypes were used for NIPD. All test results were validated by conventional prenatal or postnatal diagnostic methods. RESULTS. Ten parental alleles in eight unrelated fetuses were diagnosed successfully based on the noninvasive method developed in this study. The consensus mutation–flanking haplotype aided diagnosis for 6 of 9 founder mutation alleles. CONCLUSIONS. The founder NIPD method developed and described here is rapid, economical, and readily adaptable for prenatal testing of prevalent autosomal recessive disease-causing mutations in an assortment of worldwide populations. FUNDING. SZMC, Protalix Biotherapeutics Inc., and Centogene AG. PMID:26426075

  17. Haplotype Sharing Provides Insights into Fine-Scale Population History and Disease in Finland.

    PubMed

    Martin, Alicia R; Karczewski, Konrad J; Kerminen, Sini; Kurki, Mitja I; Sarin, Antti-Pekka; Artomov, Mykyta; Eriksson, Johan G; Esko, Tõnu; Genovese, Giulio; Havulinna, Aki S; Kaprio, Jaakko; Konradi, Alexandra; Korányi, László; Kostareva, Anna; Männikkö, Minna; Metspalu, Andres; Perola, Markus; Prasad, Rashmi B; Raitakari, Olli; Rotar, Oxana; Salomaa, Veikko; Groop, Leif; Palotie, Aarno; Neale, Benjamin M; Ripatti, Samuli; Pirinen, Matti; Daly, Mark J

    2018-05-03

    Finland provides unique opportunities to investigate population and medical genomics because of its adoption of unified national electronic health records, detailed historical and birth records, and serial population bottlenecks. We assembled a comprehensive view of recent population history (≤100 generations), the timespan during which most rare-disease-causing alleles arose, by comparing pairwise haplotype sharing from 43,254 Finns to that of 16,060 Swedes, Estonians, Russians, and Hungarians from geographically and linguistically adjacent countries with different population histories. We find much more extensive sharing in Finns, with at least one ≥ 5 cM tract on average between pairs of unrelated individuals. By coupling haplotype sharing with fine-scale birth records from more than 25,000 individuals, we find that although haplotype sharing broadly decays with geographical distance, there are pockets of excess haplotype sharing; individuals from northeast Finland typically share several-fold more of their genome in identity-by-descent segments than individuals from southwest regions. We estimate recent effective population-size changes through time across regions of Finland, and we find that there was more continuous gene flow as Finns migrated from southwest to northeast between the early- and late-settlement regions than was dichotomously described previously. Lastly, we show that haplotype sharing is locally enriched by an order of magnitude among pairs of individuals sharing rare alleles and especially among pairs sharing rare disease-causing variants. Our work provides a general framework for using haplotype sharing to reconstruct an integrative view of recent population history and gain insight into the evolutionary origins of rare variants contributing to disease. Copyright © 2018 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  18. A single nucleotide polymorphism in osteonectin 3’ untranslated region regulates bone volume and is targeted by miR-433

    PubMed Central

    Dole, Neha S.; Kapinas, Kristina; Kessler, Catherine B.; Yee, Siu-Pok; Adams, Douglas J.; Pereira, Renata C.; Delany, Anne M.

    2014-01-01

    Osteonectin/SPARC is one of the most abundant non-collagenous extracellular matrix proteins in bone, regulating collagen fiber assembly and promoting osteoblast differentiation. Osteonectin-null and –haploinsufficient mice have low turnover osteopenia, indicating that osteonectin contributes to normal bone formation. In male idiopathic osteoporosis patients, osteonectin 3’ UTR single nucleotide polymorphism (SNP) haplotypes that differed only at SNP1599 (rs1054204) were previously associated with bone mass. Haplotype A (containing SNP1599G) was more frequent in severely affected patients, whereas haplotype B (containing SNP1599C) was more frequent in less affected patients and healthy controls. We hypothesized that SNP1599 contributes to variability in bone mass by modulating osteonectin levels. Osteonectin 3’UTR reporter constructs demonstrated that haplotype A has a repressive effect on gene expression compared to B. We found that SNP1599G contributed to a miR-433 binding site and miR-433 inhibitor relieved repression of the haplotype A, but not B, 3’ UTR reporter construct. We tested our hypothesis in vivo, using a knock-in approach to replace the mouse osteonectin 3’ UTR with human haplotype A or B 3’ UTR. Compared to haplotype A mice, bone osteonectin levels were higher in haplotype B mice. B mice displayed higher bone formation rate and gained more trabecular bone with age. When parathyroid hormone was administered intermittently, haplotype B mice gained more cortical bone area than A mice. Cultured marrow stromal cells from B mice deposited more mineralized matrix and had higher osteocalcin mRNA compared with A mice, demonstrating a cell-autonomous effect on differentiation. Altogether, SNP1599 differentially regulates osteonectin expression and contributes to variability in bone mass, by a mechanism that may involve differential targeting by miR-433. 
    This work validates the findings of the previous candidate gene study, and it assigns a physiological function to a common osteonectin allele, providing support for its role in the complex trait of skeletal phenotype. PMID:25262637

  19. H-PoP and H-PoPG: heuristic partitioning algorithms for single individual haplotyping of polyploids.

    PubMed

    Xie, Minzhu; Wu, Qiong; Wang, Jianxin; Jiang, Tao

    2016-12-15

    Some economically important plants including wheat and cotton have more than two copies of each chromosome. With the decreasing cost and increasing read length of next-generation sequencing technologies, reconstructing the multiple haplotypes of a polyploid genome from its sequence reads becomes practical. However, the computational challenge in polyploid haplotyping is much greater than that in diploid haplotyping, and there are few related methods. This article models the polyploid haplotyping problem as an optimal poly-partition problem of the reads, called the Polyploid Balanced Optimal Partition model. For the reads sequenced from a k-ploid genome, the model tries to divide the reads into k groups such that the difference between the reads of the same group is minimized while the difference between the reads of different groups is maximized. When the genotype information is available, the model is extended to the Polyploid Balanced Optimal Partition with Genotype constraint problem. These models are all NP-hard. We propose two heuristic algorithms, H-PoP and H-PoPG, based on dynamic programming and a strategy of limiting the number of intermediate solutions at each iteration, to solve the two models, respectively. Extensive experimental results on simulated and real data show that our algorithms can solve the models effectively, and are much faster and more accurate than the recent state-of-the-art polyploid haplotyping algorithms. The experiments also show that our algorithms can deal with long reads and deep read coverage effectively and accurately. Furthermore, H-PoP might be applied to help determine the ploidy of an organism. https://github.com/MinzhuXie/H-PoPG Contact: xieminzhu@hotmail.com Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  20. Assembly and diploid architecture of an individual human genome via single-molecule technologies

    PubMed Central

    Pendleton, Matthew; Sebra, Robert; Pang, Andy Wing Chun; Ummat, Ajay; Franzen, Oscar; Rausch, Tobias; Stütz, Adrian M; Stedman, William; Anantharaman, Thomas; Hastie, Alex; Dai, Heng; Fritz, Markus Hsi-Yang; Cao, Han; Cohain, Ariella; Deikus, Gintaras; Durrett, Russell E; Blanchard, Scott C; Altman, Roger; Chin, Chen-Shan; Guo, Yan; Paxinos, Ellen E; Korbel, Jan O; Darnell, Robert B; McCombie, W Richard; Kwok, Pui-Yan; Mason, Christopher E; Schadt, Eric E; Bashir, Ali

    2015-01-01

    We present the first comprehensive analysis of a diploid human genome that combines single-molecule sequencing with single-molecule genome maps. Our hybrid assembly markedly improves upon the contiguity observed from traditional shotgun sequencing approaches, with scaffold N50 values approaching 30 Mb, and we identified complex structural variants (SVs) missed by other high-throughput approaches. Furthermore, by combining Illumina short-read data with long reads, we phased both single-nucleotide variants and SVs, generating haplotypes with over 99% consistency with previous trio-based studies. Our work shows that it is now possible to integrate single-molecule and high-throughput sequence data to generate de novo assembled genomes that approach reference quality. PMID:26121404
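    The contiguity figure quoted in this abstract, scaffold N50, is the length of the scaffold at which the running total of scaffold lengths, taken from longest to shortest, first reaches half of the total assembly length. A minimal sketch of that calculation follows; the function name `n50` and the example lengths are illustrative assumptions and are not taken from the paper.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold (or contig) lengths."""
    ordered = sorted(lengths, reverse=True)   # longest scaffold first
    half = sum(ordered) / 2                   # half of the total assembly length
    running = 0
    for length in ordered:
        running += length
        if running >= half:                   # first scaffold that crosses the halfway mark
            return length
    raise ValueError("lengths must be non-empty")


# Hypothetical scaffold lengths (arbitrary units), for illustration only.
example = [2, 3, 4, 5, 6]
print(n50(example))
```

    An N50 approaching 30 Mb therefore means that at least half of the assembled bases lie in scaffolds of roughly that length or longer.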

  1. Assembly and diploid architecture of an individual human genome via single-molecule technologies.

    PubMed

    Pendleton, Matthew; Sebra, Robert; Pang, Andy Wing Chun; Ummat, Ajay; Franzen, Oscar; Rausch, Tobias; Stütz, Adrian M; Stedman, William; Anantharaman, Thomas; Hastie, Alex; Dai, Heng; Fritz, Markus Hsi-Yang; Cao, Han; Cohain, Ariella; Deikus, Gintaras; Durrett, Russell E; Blanchard, Scott C; Altman, Roger; Chin, Chen-Shan; Guo, Yan; Paxinos, Ellen E; Korbel, Jan O; Darnell, Robert B; McCombie, W Richard; Kwok, Pui-Yan; Mason, Christopher E; Schadt, Eric E; Bashir, Ali

    2015-08-01

    We present the first comprehensive analysis of a diploid human genome that combines single-molecule sequencing with single-molecule genome maps. Our hybrid assembly markedly improves upon the contiguity observed from traditional shotgun sequencing approaches, with scaffold N50 values approaching 30 Mb, and we identified complex structural variants (SVs) missed by other high-throughput approaches. Furthermore, by combining Illumina short-read data with long reads, we phased both single-nucleotide variants and SVs, generating haplotypes with over 99% consistency with previous trio-based studies. Our work shows that it is now possible to integrate single-molecule and high-throughput sequence data to generate de novo assembled genomes that approach reference quality.

  2. The recovery and analysis of mitochondrial DNA from exploded pipe bombs.

    PubMed

    Foran, David R; Gehring, Michael E; Stallworth, Shawn E

    2009-01-01

    Improvised explosive devices (IEDs) represent one of the most common modes of arbitrarily injuring or killing human beings. Because of the heat generated by, and destruction to, an IED postconflagration, most methods for identifying who assembled the device are ineffective. In the research presented, steel pipe bombs were mock-assembled by volunteers, and the bombs detonated under controlled conditions. The resultant shrapnel was collected and swabbed for residual cellular material. Mitochondrial DNA profiles were generated and compared blind to the pool of individuals who assembled the bombs. Assemblers were correctly identified 50% of the time, while another 19% could be placed into a group of three individuals with shared haplotypes. Only one bomb was assigned incorrectly. In some instances a contaminating profile (mixture) was also observed. Taken together, the results speak to the extreme sensitivity the methods have for identifying those who assemble IEDs, along with precautions needed when collecting and processing such evidence.

  3. Evolutionary and functional mitogenomics associated with the genetic restoration of the Florida panther

    USGS Publications Warehouse

    Ochoa, Alexander; Onorato, David P.; Fitak, Robert R.; Roelke-Parker, Melody; Culver, Melanie

    2017-01-01

    Florida panthers are endangered pumas that currently persist in reduced patches of habitat in South Florida, USA. We performed mitogenome reference-based assemblies for most parental lines of the admixed Florida panthers that resulted from the introduction of female Texas pumas into South Florida in 1995. With the addition of 2 puma mitogenomes, we characterized 174 single nucleotide polymorphisms (SNPs) across 12 individuals. We defined 5 haplotypes (Pco1–Pco5), one of which (Pco1) had a geographic origin exclusive to Costa Rica and Panama and was possibly introduced into the Everglades National Park, Florida, prior to 1995. Haplotype Pco2 was native to Florida. Haplotypes Pco3 and Pco4 were exclusive to Texas, whereas haplotype Pco5 had an undetermined geographic origin. Phylogenetic inference suggests that haplotypes Pco1–Pco4 diverged ~202000 (95% HPDI = 83000–345000) years ago and that haplotypes Pco2–Pco4 diverged ~61000 (95% HPDI = 9000–127000) years ago. These results are congruent with a south-to-north continental expansion and with a recent North American colonization by pumas. Furthermore, pumas may have migrated from Texas to Florida no earlier than ~44000 (95% HPDI = 2000–98000) years ago. Synonymous mutations presented a greater mean substitution rate than other mitochondrial functional regions: nonsynonymous mutations, tRNAs, rRNAs, and control region. Similarly, all protein-coding genes were under predominant negative selection constraints. We directly and indirectly assessed the presence of potential deleterious SNPs in the ND2 and ND5 genes in Florida panthers prior to and as a consequence of the introduction of Texas pumas. Screenings for such variants are recommended in extant Florida panthers.

  4. The Association of DRD2 with Insight Problem Solving.

    PubMed

    Zhang, Shun; Zhang, Jinghuan

    2016-01-01

    Although the insight phenomenon has attracted great attention from psychologists, it is still largely unknown whether its variation in well-functioning human adults has a genetic basis. Several lines of evidence suggest that genes involved in dopamine (DA) transmission might be potential candidates. The present study explored for the first time the association of dopamine D2 receptor gene (DRD2) with insight problem solving. Fifteen single-nucleotide polymorphisms (SNPs) covering DRD2 were genotyped in 425 unrelated healthy Chinese undergraduates, and were further tested for association with insight problem solving. Both single SNP and haplotype analysis revealed several associations of DRD2 SNPs and haplotypes with insight problem solving. In conclusion, the present study provides the first evidence for the involvement of DRD2 in insight problem solving; future studies are necessary to validate these findings.

  5. The Association of DRD2 with Insight Problem Solving

    PubMed Central

    Zhang, Shun; Zhang, Jinghuan

    2016-01-01

    Although the insight phenomenon has attracted great attention from psychologists, it is still largely unknown whether its variation in well-functioning human adults has a genetic basis. Several lines of evidence suggest that genes involved in dopamine (DA) transmission might be potential candidates. The present study explored for the first time the association of dopamine D2 receptor gene (DRD2) with insight problem solving. Fifteen single-nucleotide polymorphisms (SNPs) covering DRD2 were genotyped in 425 unrelated healthy Chinese undergraduates, and were further tested for association with insight problem solving. Both single SNP and haplotype analysis revealed several associations of DRD2 SNPs and haplotypes with insight problem solving. In conclusion, the present study provides the first evidence for the involvement of DRD2 in insight problem solving; future studies are necessary to validate these findings. PMID:27933030

  6. PWHATSHAP: efficient haplotyping for future generation sequencing.

    PubMed

    Bracciali, Andrea; Aldinucci, Marco; Patterson, Murray; Marschall, Tobias; Pisanti, Nadia; Merelli, Ivan; Torquati, Massimo

    2016-09-22

    Haplotype phasing is an important problem in the analysis of genomics information. Given a set of DNA fragments of an individual, it consists of determining which one of the possible alleles (alternative forms of a gene) each fragment comes from. Haplotype information is relevant to gene regulation, epigenetics, genome-wide association studies, evolutionary and population studies, and the study of mutations. Haplotyping is currently addressed as an optimisation problem aiming at solutions that minimise, for instance, error correction costs, where costs are a measure of the confidence in the accuracy of the information acquired from DNA sequencing. Solutions have typically an exponential computational complexity. WHATSHAP is a recent optimal approach which moves computational complexity from DNA fragment length to fragment overlap, i.e., coverage, and is hence of particular interest when considering sequencing technology's current trends that are producing longer fragments. Given the potential relevance of efficient haplotyping in several analysis pipelines, we have designed and engineered PWHATSHAP, a parallel, high-performance version of WHATSHAP. PWHATSHAP is embedded in a toolkit developed in Python and supports genomics datasets in standard file formats. Building on WHATSHAP, PWHATSHAP exhibits the same complexity exploring a number of possible solutions which is exponential in the coverage of the dataset. The parallel implementation on multi-core architectures allows for a relevant reduction of the execution time for haplotyping, while the provided results enjoy the same high accuracy as that provided by WHATSHAP, which increases with coverage. Due to its structure and management of the large datasets, the parallelisation of WHATSHAP posed demanding technical challenges, which have been addressed exploiting a high-level parallel programming framework. 
    The result, PWHATSHAP, is a freely available toolkit that improves the efficiency of the analysis of genomics information.
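    The error-correction objective mentioned in this abstract can be illustrated on a toy diploid instance. The sketch below is not the WHATSHAP or PWHATSHAP algorithm: it is an exhaustive search over all bipartitions of the reads, with every correction given unit cost, and the read encoding (a dict from variant position to observed allele) and all function names are assumptions made for illustration. It only shows what is being minimised.

```python
from collections import Counter
from itertools import product


def group_cost(reads):
    """Corrections needed so that all reads in one group agree at every covered site."""
    cost = 0
    positions = {pos for read in reads for pos in read}
    for pos in positions:
        counts = Counter(read[pos] for read in reads if pos in read)
        # Keep the majority allele; every other observation must be corrected.
        cost += sum(counts.values()) - max(counts.values())
    return cost


def mec_brute_force(reads):
    """Return (best_cost, assignment) over all bipartitions; exponential, toy sizes only."""
    best = None
    for assignment in product((0, 1), repeat=len(reads)):
        cost = sum(
            group_cost([r for r, g in zip(reads, assignment) if g == side])
            for side in (0, 1)
        )
        if best is None or cost < best[0]:
            best = (cost, assignment)
    return best


# Hypothetical reads: variant position -> observed allele (0 or 1).
reads = [
    {0: 0, 1: 0},
    {0: 0, 1: 0, 2: 1},
    {1: 1, 2: 0},
    {0: 1, 1: 1, 2: 0},
    {0: 0, 1: 1},  # conflicts with both groups, so at least one correction is needed
]
cost, assignment = mec_brute_force(reads)
print(cost, assignment)
```

    The exhaustive loop makes the exponential cost visible; according to the abstract, WHATSHAP instead confines the exponential factor to the coverage rather than the fragment length.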

  7. A DRD1 haplotype is associated with risk for autism spectrum disorders in male-only affected sib-pair families.

    PubMed

    Hettinger, Joe A; Liu, Xudong; Schwartz, Charles E; Michaelis, Ron C; Holden, Jeanette J A

    2008-07-05

    Individuals with autism spectrum disorders (ASDs) have impairments in executive function and social cognition, with males generally being more severely affected in these areas than females. Because the dopamine D1 receptor (encoded by DRD1) is integral to the neural circuitry mediating these processes, we examined the DRD1 gene for its role in susceptibility to ASDs by performing single marker and haplotype case-control comparisons, family-based association tests, and genotype-phenotype assessments (quantitative transmission disequilibrium tests: QTDT) using three DRD1 polymorphisms, rs265981C/T, rs4532A/G, and rs686T/C. Our previous findings suggested that the dopaminergic system may be more integrally involved in families with affected males only than in other families. We therefore restricted our study to families with two or more affected males (N = 112). There was over-transmission of rs265981-C and rs4532-A in these families (P = 0.040, P = 0.038), with haplotype TDT analysis showing over-transmission of the C-A-T haplotype (P = 0.022) from mothers to affected sons (P = 0.013). In addition, haplotype case-control comparisons revealed an increase of this putative risk haplotype in affected individuals relative to a comparison group (P = 0.004). QTDT analyses showed associations of the rs265981-C, rs4532-A, rs686-T alleles, and the C-A-T haplotype with more severe problems in social interaction, greater difficulties with nonverbal communication and increased stereotypies compared to individuals with other haplotypes. Preferential haplotype transmission of markers at the DRD1 locus and an increased frequency of a specific haplotype support the DRD1 gene as a risk gene for core symptoms of ASD in families having only affected males. Copyright 2008 Wiley-Liss, Inc.

  8. Very long haplotype tracts characterized at high resolution from HLA homozygous cell lines

    PubMed Central

    Norman, Paul J.; Norberg, Steve; Nemat-Gorgani, Neda; Royce, Thomas; Hollenbach, Jill A.; Won, Melissa Shults; Guethlein, Lisbeth A.; Gunderson, Kevin L.; Ronaghi, Mostafa; Parham, Peter

    2015-01-01

    The HLA region of chromosome 6 contains the most polymorphic genes in humans. Spanning ~5Mbp the densely packed region encompasses approximately 175 expressed genes including the highly polymorphic HLA class I and II loci. Most of the other genes and functional elements are also polymorphic, and many of them are directly implicated in immune function or immune-related disease. For these reasons this complex genomic region is subject to intense scrutiny by researchers with the common goal of aiding further understanding and diagnoses of multiple immune-related diseases and syndromes. To aid assay development and characterization of the classical loci, a panel of cell lines partially or fully homozygous for HLA class I and II was assembled over time by the International Histocompatibility Working Group (IHWG). Containing a minimum of 88 unique HLA haplotypes, we show this panel represents a significant proportion of European HLA allelic and haplotype diversity (60–95%). Using a high-density whole genome array that includes 13,331 HLA region SNPs, we analyzed 99 IHWG cells to map the coordinates of the homozygous tracts at a fine scale. The mean homozygous tract length within chromosome 6 from these individuals is 21Mbp. Within HLA the mean haplotype length is 4.3Mbp, and 65% of the cell lines were shown to be homozygous throughout the entire region. In addition, four cell lines are homozygous throughout the complex KIR region of chromosome 19 (~250kbp). The data we describe will provide a valuable resource for characterizing haplotypes, designing and refining imputation algorithms and developing assay controls. PMID:26198775

  9. Computational intelligence in bioinformatics: SNP/haplotype data in genetic association study for common diseases.

    PubMed

    Kelemen, Arpad; Vasilakos, Athanasios V; Liang, Yulan

    2009-09-01

    Comprehensive evaluation of common genetic variations through association of single-nucleotide polymorphism (SNP) structure with common complex disease in the genome-wide scale is currently a hot area in human genome research due to the recent development of the Human Genome Project and HapMap Project. Computational science, which includes computational intelligence (CI), has recently become the third method of scientific enquiry besides theory and experimentation. There have been fast growing interests in developing and applying CI in disease mapping using SNP and haplotype data. Some of the recent studies have demonstrated the promise and importance of CI for common complex diseases in genomic association study using SNP/haplotype data, especially for tackling challenges, such as gene-gene and gene-environment interactions, and the notorious "curse of dimensionality" problem. This review provides coverage of recent developments of CI approaches for complex diseases in genetic association study with SNP/haplotype data.

  10. Facilitated sequence counting and assembly by template mutagenesis

    PubMed Central

    Levy, Dan; Wigler, Michael

    2014-01-01

    Presently, inferring the long-range structure of the DNA templates is limited by short read lengths. Accurate template counts suffer from distortions occurring during PCR amplification. We explore the utility of introducing random mutations in identical or nearly identical templates to create distinguishable patterns that are inherited during subsequent copying. We simulate the applications of this process under assumptions of error-free sequencing and perfect mapping, using cytosine deamination as a model for mutation. The simulations demonstrate that within readily achievable conditions of nucleotide conversion and sequence coverage, we can accurately count the number of otherwise identical molecules as well as connect variants separated by long spans of identical sequence. We discuss many potential applications, such as transcript profiling, isoform assembly, haplotype phasing, and de novo genome assembly. PMID:25313059

  11. No shortcut solution to the problem of Y-STR match probability calculation.

    PubMed

    Caliebe, Amke; Jochens, Arne; Willuweit, Sascha; Roewer, Lutz; Krawczak, Michael

    2015-03-01

    Match probability calculation is deemed much more intricate for lineage genetic markers, including Y-chromosomal short tandem repeats (Y-STRs), than for autosomal markers. This is because, owing to the lack of recombination, strong interdependence between markers is likely, which implies that haplotype frequency estimates cannot simply be obtained through the multiplication of allele frequency estimates. As yet, however, the practical relevance of this problem has not been studied in much detail using real data. In fact, such scrutiny appears well warranted because the high mutation rates of Y-STRs and the possibility of backward mutation should have worked against the statistical association of Y-STRs. We examined haplotype data of 21 markers included in the PowerPlex® Y23 set (PPY23, Promega Corporation, Madison, WI) originating from six different populations (four European and two Asian). Assessing the conditional entropies of the markers, given different subsets of markers from the same panel, we demonstrate that the PowerPlex® Y23 set cannot be decomposed into smaller marker subsets that would be (conditionally) independent. Nevertheless, in all six populations, >94% of the joint entropy of the 21 markers is explained by the seven most rapidly mutating markers. Although this result might render a reduction in marker number a sensible option for practical casework, the partial haplotypes would still be almost as diverse as the full haplotypes. Therefore, match probability calculation remains difficult and calls for the improvement of currently available methods of haplotype frequency estimation. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
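    The conditional-entropy argument in this abstract can be made concrete with a small sketch. It estimates H(target | given) = H(target, given) - H(given) from observed haplotype counts; a value near zero means the target marker is almost fully determined by the others, so allele frequencies cannot simply be multiplied. The function names and the two-marker repeat counts are hypothetical, and this plug-in estimator is an assumption rather than the estimator used in the study.

```python
from collections import Counter
from math import log2


def entropy(items):
    """Plug-in Shannon entropy (in bits) of the empirical distribution of items."""
    counts = Counter(items)
    n = sum(counts.values())
    return -sum(c / n * log2(c / n) for c in counts.values())


def conditional_entropy(haplotypes, target, given):
    """H(target | given) = H(target, given) - H(given); arguments index into each haplotype."""
    joint = [tuple(h[i] for i in (target, *given)) for h in haplotypes]
    cond = [tuple(h[i] for i in given) for h in haplotypes]
    return entropy(joint) - entropy(cond)


# Hypothetical two-marker haplotypes (repeat counts), for illustration only.
linked = [(14, 30), (14, 30), (15, 31), (15, 31)]        # marker 1 determined by marker 0
independent = [(14, 30), (14, 31), (15, 30), (15, 31)]   # marker 1 unrelated to marker 0

print(conditional_entropy(linked, target=1, given=(0,)))
print(conditional_entropy(independent, target=1, given=(0,)))
```

    In the linked sample the product of allele frequencies (0.5 × 0.5 = 0.25) understates the observed haplotype frequency (0.5), which is the kind of interdependence the abstract describes.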

  12. Analysis of betaS and betaA genes in a Mexican population with African roots.

    PubMed

    Magaña, María Teresa; Ongay, Zoyla; Tagle, Juan; Bentura, Gilberto; Cobián, José G; Perea, F Javier; Casas-Castañeda, Maricela; Sánchez-López, Yoaly J; Ibarra, Bertha

    2002-01-01

    To investigate the origin of the beta(A) and beta(S) genes in a Mexican population with African roots and a high frequency of hemoglobin S, we analyzed 467 individuals (288 unrelated) from different towns in the states of Guerrero and Oaxaca in the Costa Chica region. The frequency of the sickle-cell trait was 12.8%, which may represent a public health problem. The frequencies of the beta-haplotypes were determined from 350 nonrelated chromosomes (313 beta(A) and 37 beta(S)). We observed 15 different beta(A) haplotypes, the most common of which were haplotypes 1 (48.9%), 2 (13.4%), and 3 (13.4%). The calculation of pairwise distributions and Nei's genetic distance analysis using 32 worldwide populations showed that the beta(A) genes are more closely related to those of Mexican Mestizos and North Africans. Bantu and Benin haplotypes and haplotype 9 were related to the beta(S) genes, with frequencies of 78.8, 18.2, and 3.0%, respectively. Comparison of these haplotypes with 17 other populations revealed a high similitude with the population of the Central African Republic. These data suggest distinct origins for the beta(A) and beta(S) genes in Mexican individuals from the Costa Chica region.

  13. All 17 S-locus F-box proteins of the S2- and S3-haplotypes of Petunia inflata are assembled into similar SCF complexes with a specific function in self-incompatibility.

    PubMed

    Li, Shu; Williams, Justin S; Sun, Penglin; Kao, Teh-Hui

    2016-09-01

    The collaborative non-self-recognition model for S-RNase-based self-incompatibility predicts that multiple S-locus F-box proteins (SLFs) produced by pollen of a given S-haplotype collectively mediate ubiquitination and degradation of all non-self S-RNases, but not self S-RNases, in the pollen tube, thereby resulting in cross-compatible pollination but self-incompatible pollination. We had previously used pollen extracts containing GFP-fused S2-SLF1 (SLF1 with an S2-haplotype) of Petunia inflata for co-immunoprecipitation (Co-IP) and mass spectrometry (MS), and identified PiCUL1-P (a pollen-specific Cullin1), PiSSK1 (a pollen-specific Skp1-like protein) and PiRBX1 (a conventional Rbx1) as components of the SCF(S2-SLF1) complex. Using pollen extracts containing PiSSK1:FLAG:GFP for Co-IP/MS, we identified two additional SLFs (SLF4 and SLF13) that were assembled into SCF(SLF) complexes. As 17 SLF genes (SLF1 to SLF17) have been identified in S2 and S3 pollen, here we examined whether all 17 SLFs are assembled into similar complexes and, if so, whether these complexes are unique to SLFs. We modified the previous Co-IP/MS procedure, including the addition of style extracts from four different S-genotypes to pollen extracts containing PiSSK1:FLAG:GFP, to perform four separate experiments. The results taken together show that all 17 SLFs and an SLF-like protein, SLFLike1 (encoded by an S-locus-linked gene), co-immunoprecipitated with PiSSK1:FLAG:GFP. Moreover, of the 179 other F-box proteins predicted by S2 and S3 pollen transcriptomes, only a pair with 94.9% identity and another pair with 99.7% identity co-immunoprecipitated with PiSSK1:FLAG:GFP. These results suggest that SCF(SLF) complexes have evolved specifically to function in self-incompatibility. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.

  14. RENT+: an improved method for inferring local genealogical trees from haplotypes with recombination

    PubMed Central

    Mirzaei, Sajad; Wu, Yufeng

    2017-01-01

    Motivation: Haplotypes from one or multiple related populations share a common genealogical history. If this shared genealogy can be inferred from haplotypes, it can be very useful for many population genetics problems. However, with the presence of recombination, the genealogical history of haplotypes is complex and cannot be represented by a single genealogical tree. Therefore, inference of genealogical history with recombination is much more challenging than the case of no recombination. Results: In this paper, we present a new approach called RENT+ for the inference of local genealogical trees from haplotypes with the presence of recombination. RENT+ builds on a previous genealogy inference approach called RENT, which infers a set of related genealogical trees at different genomic positions. RENT+ represents a significant improvement over RENT in the sense that it is more effective in extracting information contained in the haplotype data about the underlying genealogy than RENT. The key components of RENT+ are several greatly enhanced genealogy inference rules. Through simulation, we show that RENT+ is more efficient and accurate than several existing genealogy inference methods. As an application, we apply RENT+ in the inference of population demographic history from haplotypes, which outperforms several existing methods. Availability and Implementation: RENT+ is implemented in Java, and is freely available for download from: https://github.com/SajadMirzaei/RentPlus. Contacts: sajad@engr.uconn.edu or ywu@engr.uconn.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:28065901

  15. A mitochondrial analysis reveals distinct founder effect signatures in Canarian and Balearic goats.

    PubMed

    Ferrando, A; Manunza, A; Jordana, J; Capote, J; Pons, A; Pais, J; Delgado, T; Atoche, P; Cabrera, B; Martínez, A; Landi, V; Delgado, J V; Argüello, A; Vidal, O; Lalueza-Fox, C; Ramírez, O; Amills, M

    2015-08-01

    In the course of human migrations, domestic animals often have been translocated to islands with the aim of assuring food availability. These founder events are expected to leave a genetic footprint that may be recognised nowadays. Herewith, we have examined the mitochondrial diversity of goat populations living in the Canarian and Balearic archipelagos. Median-joining network analysis produced very distinct network topologies for these two populations. Indeed, a majority of Canarian goats shared a single ancestral haplotype that segregated in all sampled islands, suggesting a single founder effect followed by a stepping-stone pattern of diffusion. This haplotype also was present in samples collected from archaeological assemblies at Gran Canaria and Lanzarote, making evident its widespread distribution in ancient times. In stark contrast, goats from Majorca and Ibiza did not share any mitochondrial haplotypes, indicating the occurrence of two independent founder events. Furthermore, in Majorcan goats, we detected the segregation of the mitochondrial G haplogroup that has only been identified in goats from Egypt, Iran and Turkey. This finding suggests the translocation of Asian and/or African goats to Majorca, possibly as a consequence of the Phoenician and Carthaginian colonisations of this island. © 2015 Stichting International Foundation for Animal Genetics.

  16. Towards a comprehensive barcode library for arctic life - Ephemeroptera, Plecoptera, and Trichoptera of Churchill, Manitoba, Canada

    PubMed Central

    2009-01-01

    Background This study reports progress in assembling a DNA barcode reference library for Ephemeroptera, Plecoptera, and Trichoptera ("EPTs") from a Canadian subarctic site, which is the focus of a comprehensive biodiversity inventory using DNA barcoding. These three groups of aquatic insects exhibit a moderate level of species diversity, making them ideal for testing the feasibility of DNA barcoding for routine biotic surveys. We explore the correlation between the morphological species delineations, DNA barcode-based haplotype clusters delimited by a sequence threshold (2%), and a threshold-free approach to biodiversity quantification--phylogenetic diversity. Results A DNA barcode reference library is built for 112 EPT species for the focal region, consisting of 2277 COI sequences. Close correspondence was found between EPT morphospecies and haplotype clusters as designated using a standard threshold value. Similarly, the shapes of taxon accumulation curves based upon haplotype clusters were very similar to those generated using phylogenetic diversity accumulation curves, but were much more computationally efficient. Conclusion The results of this study will facilitate other lines of research on northern EPTs and also bode well for rapidly conducting initial biodiversity assessments in unknown EPT faunas. PMID:20003245
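    The threshold-based haplotype clustering mentioned in this record can be illustrated with a minimal sketch. The abstract gives only the 2% sequence threshold; the distance measure (uncorrected p-distance), the single-linkage grouping rule, and the toy sequences and names (`hapA` to `hapD`, `mutate`, `threshold_clusters`) below are assumptions made for illustration, not the authors' pipeline.

```python
from itertools import combinations


def p_distance(a, b):
    """Proportion of differing sites between two aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def threshold_clusters(seqs, threshold=0.02):
    """Single-linkage clusters: two sequences are linked if p-distance <= threshold."""
    parent = {name: name for name in seqs}

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in combinations(seqs, 2):
        if p_distance(seqs[a], seqs[b]) <= threshold:
            parent[find(a)] = find(b)

    clusters = {}
    for name in seqs:
        clusters.setdefault(find(name), set()).add(name)
    return sorted(clusters.values(), key=lambda c: sorted(c))


def mutate(seq, positions):
    """Hypothetical helper: substitute the base at each listed position."""
    swap = {"A": "C", "C": "G", "G": "T", "T": "A"}
    chars = list(seq)
    for i in positions:
        chars[i] = swap[chars[i]]
    return "".join(chars)


# Toy 100-bp "barcodes" (invented data, not COI sequences from the study).
base = "ACGT" * 25
divergent = mutate(base, range(10, 20))      # 10% away from base
toy = {
    "hapA": base,
    "hapB": mutate(base, [5]),               # 1% away from hapA
    "hapC": divergent,
    "hapD": mutate(divergent, [50]),         # 1% away from hapC
}
clusters = threshold_clusters(toy)
print(clusters)
```

    With these toy sequences, hapA and hapB fall into one cluster and hapC and hapD into another, because only within-pair distances are at or below the 2% threshold.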

  17. Genotype-based association models of complex diseases to detect gene-gene and gene-environment interactions.

    PubMed

    Lobach, Iryna; Fan, Ruzong; Manga, Prashiela

    A central problem in genetic epidemiology is to identify and rank genetic markers involved in a disease. Complex diseases, such as cancer, hypertension, and diabetes, are thought to be caused by an interaction of a panel of genetic factors that can be identified by markers, which modulate environmental factors. Moreover, the effect of each genetic marker may be small. Hence, the association signal may be missed unless a large sample is considered, or a priori biomedical data are used. Recent advances generated a vast variety of a priori information, including linkage maps and information about gene regulatory dependence assembled into curated pathway databases. We propose a genotype-based approach that takes into account linkage disequilibrium (LD) information between genetic markers that are in moderate LD while modeling gene-gene and gene-environment interactions. A major advantage of our method is that the observed genetic information enters a model directly, thus eliminating the need to estimate haplotype-phase. Our approach results in an algorithm that is inexpensive computationally and does not suffer from bias induced by haplotype-phase ambiguity. We investigated our model in a series of simulation experiments and demonstrated that the proposed approach results in estimates that are nearly unbiased and have small variability. We applied our method to the analysis of data from a melanoma case-control study and investigated interaction between a set of pigmentation genes and environmental factors defined by age and gender. Furthermore, an application of our method is demonstrated using a study of Alcohol Dependence.

  18. RENT+: an improved method for inferring local genealogical trees from haplotypes with recombination.

    PubMed

    Mirzaei, Sajad; Wu, Yufeng

    2017-04-01

    Motivation: Haplotypes from one or multiple related populations share a common genealogical history. If this shared genealogy can be inferred from haplotypes, it can be very useful for many population genetics problems. However, with the presence of recombination, the genealogical history of haplotypes is complex and cannot be represented by a single genealogical tree. Therefore, inference of genealogical history with recombination is much more challenging than the case of no recombination. Results: In this paper, we present a new approach called RENT+ for the inference of local genealogical trees from haplotypes with the presence of recombination. RENT+ builds on a previous genealogy inference approach called RENT, which infers a set of related genealogical trees at different genomic positions. RENT+ represents a significant improvement over RENT in the sense that it is more effective in extracting information contained in the haplotype data about the underlying genealogy than RENT. The key components of RENT+ are several greatly enhanced genealogy inference rules. Through simulation, we show that RENT+ is more efficient and accurate than several existing genealogy inference methods. As an application, we apply RENT+ in the inference of population demographic history from haplotypes, which outperforms several existing methods. Availability and Implementation: RENT+ is implemented in Java, and is freely available for download from: https://github.com/SajadMirzaei/RentPlus. Contacts: sajad@engr.uconn.edu or ywu@engr.uconn.edu. Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  19. Tag SNP selection via a genetic algorithm.

    PubMed

    Mahdevar, Ghasem; Zahiri, Javad; Sadeghi, Mehdi; Nowzari-Dalini, Abbas; Ahrabian, Hayedeh

    2010-10-01

    Single Nucleotide Polymorphisms (SNPs) provide valuable information on human evolutionary history and may lead us to identify genetic variants responsible for human complex diseases. Unfortunately, molecular haplotyping methods are costly, laborious, and time consuming; therefore, algorithms for constructing full haplotype patterns from small available data through computational methods (the Tag SNP selection problem) are convenient and attractive. This problem has been proved to be NP-hard, so heuristic methods may be useful. In this paper we present a heuristic method based on a genetic algorithm to find a reasonable solution within acceptable time. The algorithm was tested on a variety of simulated and experimental data. In comparison with the exact algorithm, based on a brute force approach, results show that our method can obtain optimal solutions in almost all cases and runs much faster than the exact algorithm when the number of SNP sites is large. Our software is available upon request to the corresponding author.
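    To make the idea of a genetic algorithm for tag SNP selection concrete, here is a minimal sketch. The abstract does not state the objective function or the genetic operators, so everything below is an assumption: the goal is taken to be the smallest set of SNP columns that still keeps every haplotype distinct, and the operators are truncation selection with elitism, one-point crossover, and bit-flip mutation. The function names and the toy haplotypes are hypothetical.

```python
import random


def distinguishes_all(haplotypes, mask):
    """True if the SNPs selected by mask keep every distinct haplotype distinct."""
    projected = {tuple(h[i] for i, keep in enumerate(mask) if keep) for h in haplotypes}
    return len(projected) == len(set(haplotypes))


def fitness(haplotypes, mask):
    """Lower is better: number of tag SNPs, plus a penalty when the mask is infeasible."""
    penalty = 0 if distinguishes_all(haplotypes, mask) else len(mask) + 1
    return sum(mask) + penalty


def select_tag_snps(haplotypes, pop_size=30, generations=60, mutation_rate=0.1, seed=0):
    """Assumed GA formulation; returns indices of the selected tag SNPs."""
    rng = random.Random(seed)
    n = len(haplotypes[0])
    # Include the all-SNPs mask so at least one feasible individual always exists.
    population = [tuple([1] * n)] + [
        tuple(rng.randint(0, 1) for _ in range(n)) for _ in range(pop_size - 1)
    ]
    for _ in range(generations):
        population.sort(key=lambda m: fitness(haplotypes, m))
        survivors = population[: pop_size // 2]          # elitist truncation selection
        children = []
        while len(survivors) + len(children) < pop_size:
            p1, p2 = rng.sample(survivors, 2)
            cut = rng.randrange(1, n)                    # one-point crossover
            child = p1[:cut] + p2[cut:]
            # Bit-flip mutation applied independently to each position.
            child = tuple(b ^ 1 if rng.random() < mutation_rate else b for b in child)
            children.append(child)
        population = survivors + children
    best = min(population, key=lambda m: fitness(haplotypes, m))
    return [i for i, keep in enumerate(best) if keep]


# Toy data (invented): four haplotypes over six biallelic SNPs.
haps = ["000000", "110011", "001111", "111100"]
tags = select_tag_snps(haps)
print("tag SNP indices:", tags)
```

    Because the best individual is never discarded and the all-SNPs mask is feasible, the returned set always distinguishes all haplotypes; whether it is also the smallest such set is not guaranteed, which is the usual trade-off of a heuristic against brute-force enumeration.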

  20. De-Novo Assembly and Analysis of the Heterozygous Triploid Genome of the Wine Spoilage Yeast Dekkera bruxellensis AWRI1499

    PubMed Central

    Chambers, Paul J.; Pretorius, Isak S.

    2012-01-01

    Despite its industrial importance, the yeast species Dekkera (Brettanomyces) bruxellensis has remained poorly understood at the genetic level. In this study we describe whole genome sequencing and analysis for a prevalent wine spoilage strain, AWRI1499. The 12.7 Mb assembly, consisting of 324 contigs in 99 scaffolds (super-contigs) at 26-fold coverage, exhibits a relatively high density of single nucleotide polymorphisms (SNPs). Haplotype sampling for 1.2% of open reading frames suggested that the D. bruxellensis AWRI1499 genome is comprised of a moderately heterozygous diploid genome, in combination with a divergent haploid genome. Gene content analysis revealed enrichment in membrane proteins, particularly transporters, along with oxidoreductase enzymes. Availability of this assembly and annotation provides a resource for further investigation of genomic organization in this species, and functional characterization of genes that may confer important phenotypic traits. PMID:22470482

  1. The whole genome sequences and experimentally phased haplotypes of over 100 personal genomes.

    PubMed

    Mao, Qing; Ciotlos, Serban; Zhang, Rebecca Yu; Ball, Madeleine P; Chin, Robert; Carnevali, Paolo; Barua, Nina; Nguyen, Staci; Agarwal, Misha R; Clegg, Tom; Connelly, Abram; Vandewege, Ward; Zaranek, Alexander Wait; Estep, Preston W; Church, George M; Drmanac, Radoje; Peters, Brock A

    2016-10-11

    Since the completion of the Human Genome Project in 2003, it is estimated that more than 200,000 individual whole human genomes have been sequenced, a stunning accomplishment in such a short period of time. However, most of these were sequenced without experimental haplotype data and are therefore missing an important aspect of genome biology. In addition, much of the genomic data is not available to the public and lacks phenotypic information. As part of the Personal Genome Project, blood samples from 184 participants were collected and processed using Complete Genomics' Long Fragment Read technology. Here, we present the experimental whole genome haplotyping and sequencing of these samples to an average read coverage depth of 100X. This is approximately three-fold higher than the read coverage applied to most whole human genome assemblies and ensures the highest quality results. Currently, 114 genomes from this dataset are freely available in the GigaDB repository and are associated with rich phenotypic data; the remaining 70 should be added in the near future as they are approved through the PGP data release process. For reproducibility analyses, 20 genomes were sequenced at least twice using independent LFR barcoded libraries. Seven genomes were also sequenced using Complete Genomics' standard non-barcoded library process. In addition, we report 2.6 million high-quality, rare variants not previously identified in the Single Nucleotide Polymorphisms database or the 1000 Genomes Project Phase 3 data. These genomes represent a unique source of haplotype and phenotype data for the scientific community and should help to expand our understanding of human genome evolution and function.

  2. The versican gene and the risk of intracranial aneurysms.

    PubMed

    Ruigrok, Ynte M; Rinkel, Gabriël J E; Wijmenga, Cisca

    2006-09-01

    The proteoglycan versican is an excellent candidate gene for intracranial aneurysms (IAs) because it plays an important role in extracellular matrix assembly and is localized in a previously implicated locus for IAs on chromosome 5q. We analyzed all the common variations using 16-tag single nucleotide polymorphisms (SNPs) and haplotypes in the versican gene using a 2-stage genotyping approach. For stage 1, 16 SNPs were genotyped in 307 cases and 639 controls. For stage 2, the two SNPs yielding the most significant associations (P<0.01) were genotyped in a second independent cohort of 310 cases for confirmation of the associations. In stage 1, we found several SNPs in strong linkage disequilibrium and haplotypes constituting these SNPs associated with IAs in the Dutch population (strongest SNP association for rs173686 with odds ratio=1.34, 95% CI=1.09 to 1.65, P=0.004). In stage 2, we confirmed association for the 2 SNPs with the most significant associations (strongest SNP association for rs173686 with odds ratio=1.36, 95% CI=1.11 to 1.67, P=0.003). SNPs in strong linkage disequilibrium and haplotypes constituting these SNPs in the versican gene are associated with IAs suggesting that variation in or near the versican gene plays a role in susceptibility to IAs.
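    The odds ratios, confidence intervals, and P values reported in this record can be checked for internal consistency with a short calculation. The sketch below assumes the intervals are Wald intervals on the log odds ratio scale and that the P values are two-sided normal-approximation values; the abstract does not state which test was used, so this is only an approximate cross-check, and the function name `p_from_or_ci` is hypothetical.

```python
import math


def p_from_or_ci(odds_ratio, ci_low, ci_high, z_crit=1.96):
    """Standard error, z statistic, and two-sided p-value implied by an OR and its 95% CI.

    Assumes a Wald interval: log(OR) +/- z_crit * SE.
    """
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = math.log(odds_ratio) / se
    p = math.erfc(abs(z) / math.sqrt(2))  # equals 2 * (1 - Phi(|z|))
    return se, z, p


# Values quoted in the abstract for rs173686.
reported = {
    "stage 1": (1.34, 1.09, 1.65),
    "stage 2": (1.36, 1.11, 1.67),
}
results = {}
for label, (or_, low, high) in reported.items():
    results[label] = p_from_or_ci(or_, low, high)
    se, z, p = results[label]
    print(f"{label}: SE(log OR)={se:.3f}, z={z:.2f}, p={p:.4f}")
```

    Under these assumptions the implied p-values are of the same order as the reported P=0.004 and P=0.003; small differences are expected from rounding of the published odds ratios and interval limits.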

  3. A mixed integer linear programming model to reconstruct phylogenies from single nucleotide polymorphism haplotypes under the maximum parsimony criterion

    PubMed Central

    2013-01-01

    Background Phylogeny estimation from aligned haplotype sequences has attracted more and more attention in recent years due to its importance in analysis of many fine-scale genetic data. Its application fields range from medical research, to drug discovery, to epidemiology, to population dynamics. The literature on molecular phylogenetics proposes a number of criteria for selecting a phylogeny from among plausible alternatives. Usually, such criteria can be expressed by means of objective functions, and the phylogenies that optimize them are referred to as optimal. One of the most important estimation criteria is parsimony, which states that the optimal phylogeny T∗ for a set H of n haplotype sequences over a common set of variable loci is the one that satisfies the following requirements: (i) it has the shortest length and (ii) it is such that, for each pair of distinct haplotypes hi, hj ∈ H, the sum of the edge weights belonging to the path from hi to hj in T∗ is not smaller than the observed number of changes between hi and hj. Finding the most parsimonious phylogeny for H involves solving an optimization problem, called the Most Parsimonious Phylogeny Estimation Problem (MPPEP), which is NP-hard in many of its versions. Results In this article we investigate a recent version of the MPPEP that arises when input data consist of single nucleotide polymorphism haplotypes extracted from a population of individuals on a common genomic region. Specifically, we explore the prospects for improving on the implicit enumeration strategy used in previous work using a novel problem formulation and a series of strengthening valid inequalities and preliminary symmetry breaking constraints to more precisely bound the solution space and accelerate implicit enumeration of possible optimal phylogenies. We present the basic formulation and then introduce a series of provable valid constraints to reduce the solution space. We then prove that these constraints can often lead to significant reductions in the gap between the optimal solution and its non-integral linear programming bound relative to the prior art as well as often substantially faster processing of moderately hard problem instances. Conclusion We provide an indication of the conditions under which such an optimal enumeration approach is likely to be feasible, suggesting that these strategies are usable for relatively large numbers of taxa, although with stricter limits on numbers of variable sites. The work thus provides methodology suitable for provably optimal solution of some harder instances that resist all prior approaches. PMID:23343437

  4. Single haplotype assembly of the human genome from a hydatidiform mole.

    PubMed

    Steinberg, Karyn Meltz; Schneider, Valerie A; Graves-Lindsay, Tina A; Fulton, Robert S; Agarwala, Richa; Huddleston, John; Shiryev, Sergey A; Morgulis, Aleksandr; Surti, Urvashi; Warren, Wesley C; Church, Deanna M; Eichler, Evan E; Wilson, Richard K

    2014-12-01

    A complete reference assembly is essential for accurately interpreting individual genomes and associating variation with phenotypes. While the current human reference genome sequence is of very high quality, gaps and misassemblies remain due to biological and technical complexities. Large repetitive sequences and complex allelic diversity are the two main drivers of assembly error. Although increasing the length of sequence reads and library fragments can improve assembly, even the longest available reads do not resolve all regions. In order to overcome the issue of allelic diversity, we used genomic DNA from an essentially haploid hydatidiform mole, CHM1. We utilized several resources from this DNA including a set of end-sequenced and indexed BAC clones and 100× Illumina whole-genome shotgun (WGS) sequence coverage. We used the WGS sequence and the GRCh37 reference assembly to create an assembly of the CHM1 genome. We subsequently incorporated 382 finished BAC clone sequences to generate a draft assembly, CHM1_1.1 (NCBI AssemblyDB GCA_000306695.2). Analysis of gene, repetitive element, and segmental duplication content show this assembly to be of excellent quality and contiguity. However, comparison to assembly-independent resources, such as BAC clone end sequences and PacBio long reads, indicate misassembled regions. Most of these regions are enriched for structural variation and segmental duplication, and can be resolved in the future. This publicly available assembly will be integrated into the Genome Reference Consortium curation framework for further improvement, with the ultimate goal being a completely finished gap-free assembly. © 2014 Steinberg et al.; Published by Cold Spring Harbor Laboratory Press.

  5. Single haplotype assembly of the human genome from a hydatidiform mole

    PubMed Central

    Steinberg, Karyn Meltz; Schneider, Valerie A.; Graves-Lindsay, Tina A.; Fulton, Robert S.; Agarwala, Richa; Huddleston, John; Shiryev, Sergey A.; Morgulis, Aleksandr; Surti, Urvashi; Warren, Wesley C.; Church, Deanna M.; Eichler, Evan E.; Wilson, Richard K.

    2014-01-01

    A complete reference assembly is essential for accurately interpreting individual genomes and associating variation with phenotypes. While the current human reference genome sequence is of very high quality, gaps and misassemblies remain due to biological and technical complexities. Large repetitive sequences and complex allelic diversity are the two main drivers of assembly error. Although increasing the length of sequence reads and library fragments can improve assembly, even the longest available reads do not resolve all regions. In order to overcome the issue of allelic diversity, we used genomic DNA from an essentially haploid hydatidiform mole, CHM1. We utilized several resources from this DNA including a set of end-sequenced and indexed BAC clones and 100× Illumina whole-genome shotgun (WGS) sequence coverage. We used the WGS sequence and the GRCh37 reference assembly to create an assembly of the CHM1 genome. We subsequently incorporated 382 finished BAC clone sequences to generate a draft assembly, CHM1_1.1 (NCBI AssemblyDB GCA_000306695.2). Analysis of gene, repetitive element, and segmental duplication content show this assembly to be of excellent quality and contiguity. However, comparison to assembly-independent resources, such as BAC clone end sequences and PacBio long reads, indicate misassembled regions. Most of these regions are enriched for structural variation and segmental duplication, and can be resolved in the future. This publicly available assembly will be integrated into the Genome Reference Consortium curation framework for further improvement, with the ultimate goal being a completely finished gap-free assembly. PMID:25373144

  6. Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly

    PubMed Central

    Schneider, Valerie A.; Graves-Lindsay, Tina; Howe, Kerstin; Bouk, Nathan; Chen, Hsiu-Chuan; Kitts, Paul A.; Murphy, Terence D.; Pruitt, Kim D.; Thibaud-Nissen, Françoise; Albracht, Derek; Fulton, Robert S.; Kremitzki, Milinn; Magrini, Vincent; Markovic, Chris; McGrath, Sean; Steinberg, Karyn Meltz; Auger, Kate; Chow, William; Collins, Joanna; Harden, Glenn; Hubbard, Timothy; Pelan, Sarah; Simpson, Jared T.; Threadgold, Glen; Torrance, James; Wood, Jonathan M.; Clarke, Laura; Koren, Sergey; Boitano, Matthew; Peluso, Paul; Li, Heng; Chin, Chen-Shan; Phillippy, Adam M.; Durbin, Richard; Wilson, Richard K.; Flicek, Paul; Eichler, Evan E.; Church, Deanna M.

    2017-01-01

    The human reference genome assembly plays a central role in nearly all aspects of today's basic and clinical research. GRCh38 is the first coordinate-changing assembly update since 2009; it reflects the resolution of roughly 1000 issues and encompasses modifications ranging from thousands of single base changes to megabase-scale path reorganizations, gap closures, and localization of previously orphaned sequences. We developed a new approach to sequence generation for targeted base updates and used data from new genome mapping technologies and single haplotype resources to identify and resolve larger assembly issues. For the first time, the reference assembly contains sequence-based representations for the centromeres. We also expanded the number of alternate loci to create a reference that provides a more robust representation of human population variation. We demonstrate that the updates render the reference an improved annotation substrate, alter read alignments in unchanged regions, and impact variant interpretation at clinically relevant loci. We additionally evaluated a collection of new de novo long-read haploid assemblies and conclude that although the new assemblies compare favorably to the reference with respect to continuity, error rate, and gene completeness, the reference still provides the best representation for complex genomic regions and coding sequences. We assert that the collected updates in GRCh38 make the newer assembly a more robust substrate for comprehensive analyses that will promote our understanding of human biology and advance our efforts to improve health. PMID:28396521

  7. Y-Chromosome Markers for the Red Fox.

    PubMed

    Rando, Halie M; Stutchman, Jeremy T; Bastounes, Estelle R; Johnson, Jennifer L; Driscoll, Carlos A; Barr, Christina S; Trut, Lyudmila N; Sacks, Benjamin N; Kukekova, Anna V

    2017-09-01

    The de novo assembly of the red fox (Vulpes vulpes) genome has facilitated the development of genomic tools for the species. Efforts to identify the population history of red foxes in North America have previously been limited by a lack of information about the red fox Y-chromosome sequence. However, a megabase of red fox Y-chromosome sequence was recently identified over 2 scaffolds in the reference genome. Here, these scaffolds were scanned for repeated motifs, revealing 194 likely microsatellites. Twenty-three of these loci were selected for primer development and, after testing, produced a panel of 11 novel markers that were analyzed alongside 2 markers previously developed for the red fox from dog Y-chromosome sequence. The markers were genotyped in 76 male red foxes from 4 populations: 7 foxes from Newfoundland (eastern Canada), 12 from Maryland (eastern United States), and 9 from the island of Great Britain, as well as 48 foxes of known North American origin maintained on an experimental farm in Novosibirsk, Russia. The full marker panel revealed 22 haplotypes among these red foxes, whereas the 2 previously known markers alone would have identified only 10 haplotypes. The haplotypes from the 4 populations clustered primarily by continent, but unidirectional gene flow from Great Britain and farm populations may influence haplotype diversity in the Maryland population. The development of new markers has increased the resolution at which red fox Y-chromosome diversity can be analyzed and provides insight into the contribution of males to red fox population diversity and patterns of phylogeography. © The American Genetic Association 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  8. Association of single-nucleotide polymorphisms of the tau gene with late-onset Parkinson disease.

    PubMed

    Martin, E R; Scott, W K; Nance, M A; Watts, R L; Hubble, J P; Koller, W C; Lyons, K; Pahwa, R; Stern, M B; Colcher, A; Hiner, B C; Jankovic, J; Ondo, W G; Allen, F H; Goetz, C G; Small, G W; Masterman, D; Mastaglia, F; Laing, N G; Stajich, J M; Ribble, R C; Booze, M W; Rogala, A; Hauser, M A; Zhang, F; Gibson, R A; Middleton, L T; Roses, A D; Haines, J L; Scott, B L; Pericak-Vance, M A; Vance, J M

    2001-11-14

    The human tau gene, which promotes assembly of neuronal microtubules, has been associated with several rare neurologic diseases that clinically include parkinsonian features. We recently observed linkage in idiopathic Parkinson disease (PD) to a region on chromosome 17q21 that contains the tau gene. These factors make tau a good candidate for investigation as a susceptibility gene for idiopathic PD, the most common form of the disease. Our objective was to investigate whether the tau gene is involved in idiopathic PD. Among a sample of 1056 individuals from 235 families selected from 13 clinical centers in the United States and Australia and from a family ascertainment core center, we tested 5 single-nucleotide polymorphisms (SNPs) within the tau gene for association with PD, using family-based tests of association. Both affected (n = 426) and unaffected (n = 579) family members were included; 51 individuals had unclear PD status. Analyses were conducted to test individual SNPs and SNP haplotypes within the tau gene. The main outcome measures were family-based tests of association, calculated using asymptotic distributions. Analysis of association between the SNPs and PD yielded significant evidence of association for 3 of the 5 SNPs tested: SNP 3, P = .03; SNP 9i, P = .04; and SNP 11, P = .04. The 2 other SNPs did not show evidence of significant association (SNP 9ii, P = .11, and SNP 9iii, P = .87). Strong evidence of association was found with haplotype analysis, with a positive association with one haplotype (P = .009) and a negative association with another haplotype (P = .007). Substantial linkage disequilibrium (P < .001) was detected between 4 of the 5 SNPs (SNPs 3, 9i, 9ii, and 11). This integrated approach of genetic linkage and positional association analyses implicates tau as a susceptibility gene for idiopathic PD.

  9. Genomic Analyses Reveal the Influence of Geographic Origin, Migration, and Hybridization on Modern Dog Breed Development.

    PubMed

    Parker, Heidi G; Dreger, Dayna L; Rimbault, Maud; Davis, Brian W; Mullen, Alexandra B; Carpintero-Ramirez, Gretchen; Ostrander, Elaine A

    2017-04-25

    There are nearly 400 modern domestic dog breeds with unique histories and genetic profiles. To track the genetic signatures of breed development, we have assembled the most diverse dataset of dog breeds, reflecting their extensive phenotypic variation and heritage. Combining genetic distance, migration, and genome-wide haplotype sharing analyses, we uncover geographic patterns of development and independent origins of common traits. Our analyses reveal the hybrid history of breeds and elucidate the effects of immigration, revealing for the first time a suggestion of New World dog within some modern breeds. Finally, we used cladistics and haplotype sharing to show that some common traits have arisen more than once in the history of the dog. These analyses characterize the complexities of breed development, resolving longstanding questions regarding individual breed origination, the effect of migration on geographically distinct breeds, and, by inference, transfer of trait and disease alleles among dog breeds. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.

  10. DNA origami-based shape IDs for single-molecule nanomechanical genotyping

    NASA Astrophysics Data System (ADS)

    Zhang, Honglu; Chao, Jie; Pan, Dun; Liu, Huajie; Qiang, Yu; Liu, Ke; Cui, Chengjun; Chen, Jianhua; Huang, Qing; Hu, Jun; Wang, Lianhui; Huang, Wei; Shi, Yongyong; Fan, Chunhai

    2017-04-01

    Variations on DNA sequences profoundly affect how we develop diseases and respond to pathogens and drugs. Atomic force microscopy (AFM) provides a nanomechanical imaging approach for genetic analysis with nanometre resolution. However, unlike fluorescence imaging that has wavelength-specific fluorophores, the lack of shape-specific labels largely hampers widespread applications of AFM imaging. Here we report the development of a set of differentially shaped, highly hybridizable self-assembled DNA origami nanostructures serving as shape IDs for magnified nanomechanical imaging of single-nucleotide polymorphisms. Using these origami shape IDs, we directly genotype single molecules of human genomic DNA with an ultrahigh resolution of ~10 nm and the multiplexing ability. Further, we determine three types of disease-associated, long-range haplotypes in samples from the Han Chinese population. Single-molecule analysis allows robust haplotyping even for samples with low labelling efficiency. We expect this generic shape ID-based nanomechanical approach to hold great potential in genetic analysis at the single-molecule level.

  11. DNA origami-based shape IDs for single-molecule nanomechanical genotyping

    PubMed Central

    Zhang, Honglu; Chao, Jie; Pan, Dun; Liu, Huajie; Qiang, Yu; Liu, Ke; Cui, Chengjun; Chen, Jianhua; Huang, Qing; Hu, Jun; Wang, Lianhui; Huang, Wei; Shi, Yongyong; Fan, Chunhai

    2017-01-01

    Variations on DNA sequences profoundly affect how we develop diseases and respond to pathogens and drugs. Atomic force microscopy (AFM) provides a nanomechanical imaging approach for genetic analysis with nanometre resolution. However, unlike fluorescence imaging that has wavelength-specific fluorophores, the lack of shape-specific labels largely hampers widespread applications of AFM imaging. Here we report the development of a set of differentially shaped, highly hybridizable self-assembled DNA origami nanostructures serving as shape IDs for magnified nanomechanical imaging of single-nucleotide polymorphisms. Using these origami shape IDs, we directly genotype single molecules of human genomic DNA with an ultrahigh resolution of ∼10 nm and the multiplexing ability. Further, we determine three types of disease-associated, long-range haplotypes in samples from the Han Chinese population. Single-molecule analysis allows robust haplotyping even for samples with low labelling efficiency. We expect this generic shape ID-based nanomechanical approach to hold great potential in genetic analysis at the single-molecule level. PMID:28382928

  12. A haplotype map of genomic variations and genome-wide association studies of agronomic traits in foxtail millet (Setaria italica).

    PubMed

    Jia, Guanqing; Huang, Xuehui; Zhi, Hui; Zhao, Yan; Zhao, Qiang; Li, Wenjun; Chai, Yang; Yang, Lifang; Liu, Kunyan; Lu, Hengyun; Zhu, Chuanrang; Lu, Yiqi; Zhou, Congcong; Fan, Danlin; Weng, Qijun; Guo, Yunli; Huang, Tao; Zhang, Lei; Lu, Tingting; Feng, Qi; Hao, Hangfei; Liu, Hongkuan; Lu, Ping; Zhang, Ning; Li, Yuhui; Guo, Erhu; Wang, Shujun; Wang, Suying; Liu, Jinrong; Zhang, Wenfei; Chen, Guoqiu; Zhang, Baojin; Li, Wei; Wang, Yongfang; Li, Haiquan; Zhao, Baohua; Li, Jiayang; Diao, Xianmin; Han, Bin

    2013-08-01

    Foxtail millet (Setaria italica) is an important grain crop that is grown in arid regions. Here we sequenced 916 diverse foxtail millet varieties, identified 2.58 million SNPs and used 0.8 million common SNPs to construct a haplotype map of the foxtail millet genome. We classified the foxtail millet varieties into two divergent groups that are strongly correlated with early and late flowering times. We phenotyped the 916 varieties under five different environments and identified 512 loci associated with 47 agronomic traits by genome-wide association studies. We performed a de novo assembly of deeply sequenced genomes of a Setaria viridis accession (the wild progenitor of S. italica) and an S. italica variety and identified complex interspecies and intraspecies variants. We also identified 36 selective sweeps that seem to have occurred during modern breeding. This study provides fundamental resources for genetics research and genetic improvement in foxtail millet.

  13. Evaluating allopolyploid origins in strawberries (Fragaria) using haplotypes generated from target capture sequencing.

    PubMed

    Kamneva, Olga K; Syring, John; Liston, Aaron; Rosenberg, Noah A

    2017-08-04

    Hybridization is observed in many eukaryotic lineages and can lead to the formation of polyploid species. The study of hybridization and polyploidization faces challenges both in data generation and in accounting for population-level phenomena such as coalescence processes in phylogenetic analysis. Genus Fragaria is one example of a set of plant taxa in which a range of ploidy levels is observed across species, but phylogenetic origins are unknown. Here, using 20 diploid and polyploid Fragaria species, we combine approaches from NGS data analysis and phylogenetics to infer evolutionary origins of polyploid strawberries, taking into account coalescence processes. We generate haplotype sequences for 257 low-copy nuclear markers assembled from Illumina target capture sequence data. We then identify putative hybridization events by analyzing gene tree topologies, and further test predicted hybridizations in a coalescence framework. This approach confirms the allopolyploid ancestry of F. chiloensis and F. virginiana, and provides new allopolyploid ancestry hypotheses for F. iturupensis, F. moschata, and F. orientalis. Evidence of gene flow between diploids F. bucharica and F. vesca is also detected, suggesting that it might be appropriate to consider these groups as conspecifics. This study is one of the first in which target capture sequencing followed by computational deconvolution of individual haplotypes is used for tracing origins of polyploid taxa. The study also provides new perspectives on the evolutionary history of Fragaria.

  14. Whole genome resequencing of a laboratory-adapted Drosophila melanogaster population sample

    PubMed Central

    Gilks, William P.; Pennell, Tanya M.; Flis, Ilona; Webster, Matthew T.; Morrow, Edward H.

    2016-01-01

    As part of a study into the molecular genetics of sexually dimorphic complex traits, we used high-throughput sequencing to obtain data on genomic variation in an outbred laboratory-adapted fruit fly (Drosophila melanogaster) population. We successfully resequenced the whole genome of 220 hemiclonal females that were heterozygous for the same Berkeley reference line genome (BDGP6/dm6), and a unique haplotype from the outbred base population (LH M). The use of a static and known genetic background enabled us to obtain sequences from whole-genome phased haplotypes. We used a BWA-Picard-GATK pipeline for mapping sequence reads to the dm6 reference genome assembly, at a median depth of coverage of 31X, and have made the resulting data publicly available in the NCBI Short Read Archive (Accession number SRP058502). We used Haplotype Caller to discover and genotype 1,726,931 small genomic variants (SNPs and indels, <200bp). Additionally, we detected and genotyped 167 large structural variants (1-100Kb in size) using GenomeStrip/2.0. Sequence and genotype data are publicly available at the corresponding NCBI databases: Short Read Archive, dbSNP and dbVar (BioProject PRJNA282591). We have also released the unfiltered genotype data, and the code and logs for data processing and summary statistics (https://zenodo.org/communities/sussex_drosophila_sequencing/). PMID:27928499

  15. Two haplotype clusters of Echinococcus granulosus sensu stricto in northern Iraq (Kurdistan region) support the hypothesis of a parasite cradle in the Middle East.

    PubMed

    Hassan, Zuber Ismael; Meerkhan, Azad Abdullah; Boufana, Belgees; Hama, Abdullah A; Ahmed, Bayram Dawod; Mero, Wijdan Mohammed Salih; Orsten, Serra; Interisano, Maria; Pozio, Edoardo; Casulli, Adriano

    2017-08-01

    Human cystic echinococcosis (CE) caused by Echinococcus granulosus s.s. is a major public health problem in Iraqi Kurdistan with a reported surgical incidence of 6.3 per 100,000 Arbil inhabitants. A total of 125 Echinococcus isolates retrieved from sheep, goats and cattle were used in this study. Our aim was to determine species/genotypes infecting livestock in Iraqi Kurdistan and examine intraspecific variation and population structure of Echinococcus granulosus s.s. in this region and relate it to that of other regions worldwide. Using nucleotide sequences of the mitochondrial cytochrome c oxidase subunit 1 (cox 1) we identified E. granulosus s.s. as the cause of hydatidosis in all examined animals. The haplotype network displayed a double-clustered topology with two main E. granulosus s.s. haplotypes, (KU05) and (KU33). The 'founder' haplotype (KU05) confirmed the presence of a common lineage of non-genetically differentiated populations as inferred by the low non-significant fixation index values. Overall diversity and neutrality indices indicated demographic expansion. We used E. granulosus s.s. nucleotide sequences from GenBank to draw haplotype networks for the Middle East (Iran, Jordan and Turkey), Europe (Albania, Greece, Italy, Romania and Spain), China, Mongolia, Russia, South America (Argentina, Brazil, Chile and Mexico) and Tunisia. Networks with two haplotype clusters like that reported here for Iraqi Kurdistan were seen for the Middle East, Europe, Mongolia, Russia and Tunisia using both 827bp and 1609bp cox1 nucleotide sequences, whereas a star-like network was observed for China and South America. 
    We hypothesize that the double clustering seen at what is generally assumed to be the cradle of domestication may have emerged independently and dispersed from the Middle East to other regions and that haplotype (KU33) may be the main haplotype within a second cluster in the Middle East from where it has spread into Europe, Mongolia, Russia and North Africa. Further studies using metacestodes of human origin are required to investigate the biological importance of E. granulosus s.s. haplotypes/clusters and their association, if any, with clinical manifestations of CE infection. Copyright © 2017 Elsevier B.V. All rights reserved.

  16. Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly.

    PubMed

    Schneider, Valerie A; Graves-Lindsay, Tina; Howe, Kerstin; Bouk, Nathan; Chen, Hsiu-Chuan; Kitts, Paul A; Murphy, Terence D; Pruitt, Kim D; Thibaud-Nissen, Françoise; Albracht, Derek; Fulton, Robert S; Kremitzki, Milinn; Magrini, Vincent; Markovic, Chris; McGrath, Sean; Steinberg, Karyn Meltz; Auger, Kate; Chow, William; Collins, Joanna; Harden, Glenn; Hubbard, Timothy; Pelan, Sarah; Simpson, Jared T; Threadgold, Glen; Torrance, James; Wood, Jonathan M; Clarke, Laura; Koren, Sergey; Boitano, Matthew; Peluso, Paul; Li, Heng; Chin, Chen-Shan; Phillippy, Adam M; Durbin, Richard; Wilson, Richard K; Flicek, Paul; Eichler, Evan E; Church, Deanna M

    2017-05-01

    The human reference genome assembly plays a central role in nearly all aspects of today's basic and clinical research. GRCh38 is the first coordinate-changing assembly update since 2009; it reflects the resolution of roughly 1000 issues and encompasses modifications ranging from thousands of single base changes to megabase-scale path reorganizations, gap closures, and localization of previously orphaned sequences. We developed a new approach to sequence generation for targeted base updates and used data from new genome mapping technologies and single haplotype resources to identify and resolve larger assembly issues. For the first time, the reference assembly contains sequence-based representations for the centromeres. We also expanded the number of alternate loci to create a reference that provides a more robust representation of human population variation. We demonstrate that the updates render the reference an improved annotation substrate, alter read alignments in unchanged regions, and impact variant interpretation at clinically relevant loci. We additionally evaluated a collection of new de novo long-read haploid assemblies and conclude that although the new assemblies compare favorably to the reference with respect to continuity, error rate, and gene completeness, the reference still provides the best representation for complex genomic regions and coding sequences. We assert that the collected updates in GRCh38 make the newer assembly a more robust substrate for comprehensive analyses that will promote our understanding of human biology and advance our efforts to improve health. © 2017 Schneider et al.; Published by Cold Spring Harbor Laboratory Press.

  17. Diversification of Fijian halictine bees: insights into a recent island radiation.

    PubMed

    Groom, Scott V C; Stevens, Mark I; Schwarz, Michael P

    2013-09-01

    Although bees form a key pollinator suite for flowering plants, very few studies have examined the evolutionary radiation of non-domesticated bees over human time-scales. This is surprising given the importance of bees for crop pollination and the effect of humans in transforming ecosystems via agriculture. In the Pacific, where the bee fauna appears depauperate, their importance as pollinators is not clear, particularly in Fiji where species diversity is even lower than neighbouring archipelagos. Here we explore the radiation of halictine bees in Fiji using phylogenetic analyses of mtDNA COI sequence data. Our analyses indicate the existence of several 'deep' clades whose divergences are close to the crown node, along with a highly derived 'broom' clade showing very high haplotype diversity, and mostly limited to low-lying agricultural regions. This derived clade is very abundant, whereas the more basal clades were relatively rare. Although nearly all haplotype diversity in Fijian Homalictus comprises synonymous substitutions, a small number of amino acid changes are associated with the major clades, including the hyper-diverse clade. Analyses of haplotype lineage accumulation show a steep increase in selectively neutral COI haplotypes corresponding to the emergence of this 'broom' clade. We explore three possible scenarios for this dramatic increase: (i) a key change in adaptedness to the environment, (ii) a large-scale extinction event, or (iii) a dramatic increase in suitable habitats leading to rapid population expansion. Using estimated mutation rates of mitochondrial DNA in other invertebrates, we argue that Homalictus first colonised the Fijian archipelago in the middle-late Pleistocene, and the rapid accumulation of haplotypes in the hyper-diverse clade occurred in the Holocene, but prior to recorded human presence in the Fijian region. Our results indicate that bees have not been important pollinators of Fijian ecosystems until very recent times. 
    Post-Pleistocene climate change and anthropogenic effects on Fijian ecosystems are likely to have greatly transformed pollinator suites from the conditions when those ecosystems were first being assembled. Copyright © 2013 Elsevier Inc. All rights reserved.

  18. Reference-based phasing using the Haplotype Reference Consortium panel.

    PubMed

    Loh, Po-Ru; Danecek, Petr; Palamara, Pier Francesco; Fuchsberger, Christian; A Reshef, Yakir; K Finucane, Hilary; Schoenherr, Sebastian; Forer, Lukas; McCarthy, Shane; Abecasis, Goncalo R; Durbin, Richard; L Price, Alkes

    2016-11-01

    Haplotype phasing is a fundamental problem in medical and population genetics. Phasing is generally performed via statistical phasing in a genotyped cohort, an approach that can yield high accuracy in very large cohorts but attains lower accuracy in smaller cohorts. Here we instead explore the paradigm of reference-based phasing. We introduce a new phasing algorithm, Eagle2, that attains high accuracy across a broad range of cohort sizes by efficiently leveraging information from large external reference panels (such as the Haplotype Reference Consortium; HRC) using a new data structure based on the positional Burrows-Wheeler transform. We demonstrate that Eagle2 attains a ∼20× speedup and ∼10% increase in accuracy compared to reference-based phasing using SHAPEIT2. On European-ancestry samples, Eagle2 with the HRC panel achieves >2× the accuracy of 1000 Genomes-based phasing. Eagle2 is open source and freely available for HRC-based phasing via the Sanger Imputation Service and the Michigan Imputation Server.
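
    As a rough illustration of the positional Burrows-Wheeler transform idea mentioned above, the following sketch builds positional prefix arrays for a small binary haplotype panel: after each site, haplotypes are kept sorted by their reversed prefixes, so haplotypes that share long matches ending at that site end up adjacent. This is only the basic textbook construction, not Eagle2's data structure or code; the function name and the toy panel are hypothetical.

```python
def pbwt_prefix_arrays(haplotypes):
    """Build positional prefix arrays for a binary haplotype panel.

    haplotypes: list of equal-length lists of 0/1 alleles (hypothetical input).
    Returns a list of orderings; entry k lists haplotype indices sorted by
    their reversed prefixes over sites 0..k-1 (entry 0 is the input order).
    """
    n_sites = len(haplotypes[0]) if haplotypes else 0
    order = list(range(len(haplotypes)))
    arrays = [order[:]]
    for k in range(n_sites):
        zeros, ones = [], []
        # Stable partition by the allele at site k: haplotypes carrying 0
        # come first, and the previous relative order is preserved.
        for idx in order:
            (zeros if haplotypes[idx][k] == 0 else ones).append(idx)
        order = zeros + ones
        arrays.append(order[:])
    return arrays


if __name__ == "__main__":
    panel = [
        [0, 1, 0],
        [1, 1, 0],
        [0, 0, 1],
        [0, 1, 0],
    ]
    for k, ordering in enumerate(pbwt_prefix_arrays(panel)):
        print(k, ordering)
```

    Each update is a single stable pass over the panel, so building all orderings takes time proportional to the number of haplotypes times the number of sites. In the toy panel, haplotypes 0 and 3 are identical and therefore stay adjacent in the final ordering.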

  19. Linear-time reconstruction of zero-recombinant Mendelian inheritance on pedigrees without mating loops.

    PubMed

    Liu, Lan; Jiang, Tao

    2007-01-01

    With the launch of the international HapMap project, the haplotype inference problem has recently attracted a great deal of attention in the computational biology community. In this paper, we study the question of how to efficiently infer haplotypes from genotypes of individuals related by a pedigree without mating loops, assuming that the hereditary process was free of mutations (i.e. the Mendelian law of inheritance) and recombinants. We model the haplotype inference problem as a system of linear equations as in [10] and present an (optimal) linear-time (i.e. O(mn) time) algorithm to generate a particular solution to the haplotype inference problem, where m is the number of loci (or markers) in a genotype and n is the number of individuals in the pedigree. (A particular solution of a linear system is an assignment of numerical values to the variables that satisfies the equations in the system.) Moreover, the algorithm also provides a general solution in O(mn^2) time, which is optimal because the size of a general solution could be as large as Θ(mn^2). (A general solution of a linear system is the span of a basis of the solution space of its associated homogeneous system, offset from the origin by any particular solution. A general solution for ZRHC is very useful in practice because it allows the end user to efficiently enumerate all solutions for ZRHC and perform tasks such as random sampling.) The key ingredients of our construction are (i) a fast consistency checking procedure for the system of linear equations introduced in [10], based on a careful investigation of the relationship between the equations, and (ii) a novel linear-time method for solving linear equations without invoking the Gaussian elimination method.
    Although such a fast method for solving equations is not known for general systems of linear equations, we take advantage of the underlying loop-free pedigree graph and some special properties of the linear equations.
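
    The distinction between a particular solution and a general solution can be made concrete with a small sketch. It assumes, purely for illustration, that the equations are over GF(2) (binary variables, addition modulo 2), which the abstract does not state, and it uses ordinary Gaussian elimination, which is exactly the step the paper's linear-time method avoids. The function name and example system are hypothetical.

```python
def solve_gf2(A, b):
    """Solve A x = b over GF(2) by Gauss-Jordan elimination.

    Returns (particular, basis), where `particular` is one solution and
    `basis` spans the solutions of the homogeneous system A x = 0, or
    None if the system is inconsistent.
    """
    n_vars = len(A[0])
    rows = [row[:] + [rhs] for row, rhs in zip(A, b)]  # augmented matrix
    pivots = []  # pivots[i] is the pivot column of reduced row i
    r = 0
    for c in range(n_vars):
        pivot = next((i for i in range(r, len(rows)) if rows[i][c]), None)
        if pivot is None:
            continue  # column c corresponds to a free variable
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(len(rows)):
            if i != r and rows[i][c]:
                rows[i] = [x ^ y for x, y in zip(rows[i], rows[r])]
        pivots.append(c)
        r += 1
    # A remaining row reading 0 = 1 means no solution exists.
    if any(row[-1] for row in rows[r:]):
        return None
    particular = [0] * n_vars  # all free variables set to 0
    for i, c in enumerate(pivots):
        particular[c] = rows[i][-1]
    basis = []
    for f in (c for c in range(n_vars) if c not in pivots):
        vec = [0] * n_vars
        vec[f] = 1  # switch on one free variable at a time
        for i, c in enumerate(pivots):
            vec[c] = rows[i][f]
        basis.append(vec)
    return particular, basis


if __name__ == "__main__":
    # x0 + x1 = 1 and x1 + x2 = 0 (mod 2)
    print(solve_gf2([[1, 1, 0], [0, 1, 1]], [1, 0]))
```

    Every solution is the particular solution XOR-ed with some subset of the basis vectors, which is what makes enumeration and random sampling of all solutions straightforward once a general solution is available.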

  20. Estimating trace-suspect match probabilities for singleton Y-STR haplotypes using coalescent theory.

    PubMed

    Andersen, Mikkel Meyer; Caliebe, Amke; Jochens, Arne; Willuweit, Sascha; Krawczak, Michael

    2013-02-01

    Estimation of match probabilities for singleton haplotypes of lineage markers, i.e. for haplotypes observed only once in a reference database augmented by a suspect profile, is an important problem in forensic genetics. We compared the performance of four estimators of singleton match probabilities for Y-STRs, namely the count estimate, both with and without Brenner's so-called 'kappa correction', the surveying estimate, and a previously proposed, but rarely used, coalescent-based approach implemented in the BATWING software. Extensive simulation with BATWING of the underlying population history, haplotype evolution and subsequent database sampling revealed that the coalescent-based approach is characterized by lower bias and lower mean squared error than the uncorrected count estimator and the surveying estimator. Moreover, in contrast to the two count estimators, both the surveying and the coalescent-based approach exhibited a good correlation between the estimated and true match probabilities. However, although its overall performance is thus better than that of any other recognized method, the coalescent-based estimator is still computation-intense on the verge of general impracticability. Its application in forensic practice therefore will have to be limited to small reference databases, or to isolated cases of particular interest, until more powerful algorithms for coalescent simulation have become available. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.

  1. Replication and meta-analysis of TMEM132D gene variants in panic disorder

    PubMed Central

    Erhardt, A; Akula, N; Schumacher, J; Czamara, D; Karbalai, N; Müller-Myhsok, B; Mors, O; Borglum, A; Kristensen, A S; Woldbye, D P D; Koefoed, P; Eriksson, E; Maron, E; Metspalu, A; Nurnberger, J; Philibert, R A; Kennedy, J; Domschke, K; Reif, A; Deckert, J; Otowa, T; Kawamura, Y; Kaiya, H; Okazaki, Y; Tanii, H; Tokunaga, K; Sasaki, T; Ioannidis, J P A; McMahon, F J; Binder, E B

    2012-01-01

    A recent genome-wide association study in patients with panic disorder (PD) identified a risk haplotype consisting of two single-nucleotide polymorphisms (SNPs) (rs7309727 and rs11060369) located in intron 3 of TMEM132D to be associated with PD in three independent samples. Now we report a subsequent confirmation study using five additional PD case–control samples (n=1670 cases and n=2266 controls) assembled as part of the Panic Disorder International Consortium (PanIC) study for a total of 2678 cases and 3262 controls in the analysis. In the new independent samples of European ancestry (EA), the association of rs7309727 and the risk haplotype rs7309727–rs11060369 was, indeed, replicated, with the strongest signal coming from patients with primary PD, that is, patients without major psychiatric comorbidities (n=1038 cases and n=2411 controls). This finding was paralleled by the results of the meta-analysis across all samples, in which the risk haplotype and rs7309727 reached P-levels of P=1.4e−8 and P=1.1e−8, respectively, when restricting the samples to individuals of EA with primary PD. In the Japanese sample no associations with PD could be found. The present results support the initial finding that TMEM132D gene contributes to genetic susceptibility for PD in individuals of EA. Our results also indicate that patient ascertainment and genetic background could be important sources of heterogeneity modifying this association signal in different populations. PMID:22948381

  2. [Identification of Tibetan medicine "Dida" of Gentianaceae using DNA barcoding].

    PubMed

    Liu, Chuan; Zhang, Yu-Xin; Liu, Yue; Chen, Yi-Long; Fan, Gang; Xiang, Li; Xu, Jiang; Zhang, Yi

    2016-02-01

    The ITS2 barcode was used to identify the Tibetan medicine "Dida" and to secure its quality and safety in medication. A total of 151 experimental samples representing 13 species from the Tibetan Plateau, including the Gentianaceae genera Swertia, Halenia, Gentianopsis, Comastoma and Lomatogonium, were studied. ITS2 sequences were amplified, and purified PCR products were sequenced. Sequence assembly and consensus sequence generation were performed using CodonCode Aligner V3.7.1. The Kimura 2-Parameter (K2P) distances were calculated using MEGA 6.0, and neighbor-joining (NJ) phylogenetic trees were constructed. After alignment, the ITS2 sequences comprised 31 haplotypes over 231 bp, with an average G+C content of 61.40%. The NJ tree strongly supported the clustering of each species into its own clade, giving a high identification success rate, except that Swertia bifolia and Swertia wolfangiana could not be distinguished from each other based on the sequence divergences. DNA barcoding could be used as a fast and accurate identification method to distinguish Tibetan medicine "Dida" to ensure its safe use. Copyright © by the Chinese Pharmaceutical Association.
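
    For readers unfamiliar with the Kimura 2-Parameter distance used above, here is a minimal sketch of the standard formula, d = -1/2 ln((1 - 2P - Q) sqrt(1 - 2Q)), where P and Q are the proportions of transitional and transversional differences between two aligned sequences. It is not MEGA's implementation; the function name is hypothetical, and skipping gap or ambiguous positions pair by pair is an assumption made here for simplicity.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq1, seq2):
    """Kimura 2-parameter distance between two aligned DNA sequences."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    compared = transitions = transversions = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue  # assumption: ignore gaps and ambiguity codes
        compared += 1
        if a == b:
            continue
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1  # A<->G or C<->T
        else:
            transversions += 1  # purine <-> pyrimidine
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    # math.log raises ValueError when the sequences are too divergent
    # for the model (argument <= 0).
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))


if __name__ == "__main__":
    print(k2p_distance("AAAAAAAAAA", "GAAAAAAAAA"))  # one transition in ten sites
```

    For closely related sequences the distance is close to the raw proportion of differing sites; the correction grows as divergence increases.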

  3. Assembly of a micro-hotspot of caenogastropod endemism in the southern Nevada desert, with a description of a new species of Tryonia (Truncatelloidea, Cochliopidae).

    PubMed

    Hershler, Robert; Liu, Hsiu-Ping; Simpson, Jeffrey S

    2015-01-01

    Newly obtained and previously published sequences of the cytochrome c oxidase subunit I (COI) gene were analyzed to examine the biogeographic assembly of the caenogastropod fauna (belonging to the families Assimineidae, Cochliopidae, and Hydrobiidae) of an isolated spring along the lower Colorado River in southern Nevada (Blue Point Spring). Based on available COI clock calibrations, the three lineages that comprise this fauna are 2.78-1.42 million years old, which is roughly coeval or slightly younger than the age of Blue Point Spring (inferred from local fossil spring deposits). Two of the lineages, endemic Pyrgulopsis coloradensis and Assiminea aff. infima, are most closely related to snails in the Death Valley area (well to the west) and likely colonized Blue Point Spring by transport on birds. A single haplotype was detected in both of these snails, suggesting that they may have only recently colonized Blue Point Spring. The third lineage, endemic Tryonia infernalis (newly described herein based on morphological and molecular evidence), is most closely related to a geographically proximal species in a lower Colorado River tributary (Tryonia clathrata); the split between these taxa may be the product of vicariance (severance of a prior drainage connection) or a separate jump dispersal event. The considerable genetic diversity in Tryonia infernalis (three haplotypes differing by 0.6% mean sequence divergence) suggests a possibly lengthy history of local differentiation. Our findings also identify Blue Point Spring as a new micro-hotspot of groundwater-dependent biodiversity in Nevada and will assist ongoing efforts to protect and conserve these imperiled ecosystems.

  4. Shedding genomic light on Aristotle's lantern.

    PubMed

    Sodergren, Erica; Shen, Yufeng; Song, Xingzhi; Zhang, Lan; Gibbs, Richard A; Weinstock, George M

    2006-12-01

    Sea urchins have proved fascinating to biologists since the time of Aristotle who compared the appearance of their bony mouth structure to a lantern in The History of Animals. Throughout modern times it has been a model system for research in developmental biology. Now, the genome of the sea urchin Strongylocentrotus purpuratus is the first echinoderm genome to be sequenced. A high quality draft sequence assembly was produced using the Atlas assembler to combine whole genome shotgun sequences with sequences from a collection of BACs selected to form a minimal tiling path along the genome. A formidable challenge was presented by the high degree of heterozygosity between the two haplotypes of the selected male representative of this marine organism. This was overcome by use of the BAC tiling path backbone, in which each BAC represents a single haplotype, as well as by improvements in the Atlas software. Another innovation introduced in this project was the sequencing of pools of tiling path BACs rather than individual BAC sequencing. The Clone-Array Pooled Shotgun Strategy greatly reduced the cost and time devoted to preparing shotgun libraries from BAC clones. The genome sequence was analyzed with several gene prediction methods to produce a comprehensive gene list that was then manually refined and annotated by a volunteer team of sea urchin experts. This latter annotation community edited over 9000 gene models and uncovered many unexpected aspects of the sea urchin genetic content impacting transcriptional regulation, immunology, sensory perception, and an organism's development. Analysis of the basic deuterostome genetic complement supports the sea urchin's role as a model system for deuterostome and, by extension, chordate development.

  5. Association of Single-Nucleotide Polymorphisms of the Tau Gene With Late-Onset Parkinson Disease

    PubMed Central

    Martin, Eden R.; Scott, William K.; Nance, Martha A.; Watts, Ray L.; Hubble, Jean P.; Koller, William C.; Lyons, Kelly; Pahwa, Rajesh; Stern, Matthew B.; Colcher, Amy; Hiner, Bradley C.; Jankovic, Joseph; Ondo, William G.; Allen, Fred H.; Goetz, Christopher G.; Small, Gary W.; Masterman, Donna; Mastaglia, Frank; Laing, Nigel G.; Stajich, Jeffrey M.; Ribble, Robert C.; Booze, Michael W.; Rogala, Allison; Hauser, Michael A.; Zhang, Fengyu; Gibson, Rachel A.; Middleton, Lefkos T.; Roses, Allen D.; Haines, Jonathan L.; Scott, Burton L.; Pericak-Vance, Margaret A.; Vance, Jeffery M.

    2013-01-01

    Context The human tau gene, which promotes assembly of neuronal microtubules, has been associated with several rare neurologic diseases that clinically include parkinsonian features. We recently observed linkage in idiopathic Parkinson disease (PD) to a region on chromosome 17q21 that contains the tau gene. These factors make tau a good candidate for investigation as a susceptibility gene for idiopathic PD, the most common form of the disease. Objective To investigate whether the tau gene is involved in idiopathic PD. Design, Setting, and Participants Among a sample of 1056 individuals from 235 families selected from 13 clinical centers in the United States and Australia and from a family ascertainment core center, we tested 5 single-nucleotide polymorphisms (SNPs) within the tau gene for association with PD, using family-based tests of association. Both affected (n = 426) and unaffected (n = 579) family members were included; 51 individuals had unclear PD status. Analyses were conducted to test individual SNPs and SNP haplotypes within the tau gene. Main Outcome Measure Family-based tests of association, calculated using asymptotic distributions. Results Analysis of association between the SNPs and PD yielded significant evidence of association for 3 of the 5 SNPs tested: SNP 3, P = .03; SNP 9i, P = .04; and SNP 11, P = .04. The 2 other SNPs did not show evidence of significant association (SNP 9ii, P = .11, and SNP 9iii, P = .87). Strong evidence of association was found with haplotype analysis, with a positive association with one haplotype (P = .009) and a negative association with another haplotype (P = .007). Substantial linkage disequilibrium (P<.001) was detected between 4 of the 5 SNPs (SNPs 3,9i, 9ii, and 11). Conclusions This integrated approach of genetic linkage and positional association analyses implicates tau as a susceptibility gene for idiopathic PD. PMID:11710889

  6. High-Resolution Genotyping of the Endemic Salmonella Typhi Population during a Vi (Typhoid) Vaccination Trial in Kolkata

    PubMed Central

    Manna, Byomkesh; Bhattacharya, Sujit K.; Bhaduri, Barnali; Pickard, Derek J.; Ochiai, R. Leon; Ali, Mohammad; Clemens, John D.; Dougan, Gordon

    2012-01-01

    Background Typhoid fever, caused by Salmonella enterica serovar Typhi (S. Typhi), is a major health problem especially in developing countries. Vaccines against typhoid are commonly used by travelers but less so by residents of endemic areas. Methodology We used single nucleotide polymorphism (SNP) typing to investigate the population structure of 372 S. Typhi isolated during a typhoid disease burden study and Vi vaccine trial in Kolkata, India. Approximately sixty thousand people were enrolled for fever surveillance for 19 months prior to, and 24 months following, Vi vaccination of one third of the study population (May 2003–December 2006, vaccinations given December 2004). Principal Findings A diverse S. Typhi population was detected, including 21 haplotypes. The most common were of the H58 haplogroup (69%), which included all multidrug resistant isolates (defined as resistance to chloramphenicol, ampicillin and co-trimoxazole). Quinolone resistance was particularly high among H58-G isolates (97% Nalidixic acid resistant, 30% with reduced susceptibility to ciprofloxacin). Multiple typhoid fever episodes were detected in 22 households, however household clustering was not associated with specific S. Typhi haplotypes. Conclusions Typhoid fever in Kolkata is caused by a diverse population of S. Typhi, however H58 haplotypes dominate and are associated with multidrug and quinolone resistance. Vi vaccination did not obviously impact on the haplotype population structure of the S. Typhi circulating during the study period. PMID:22303491

  7. IDP-ASE: haplotyping and quantifying allele-specific expression at the gene and gene isoform level by hybrid sequencing

    PubMed Central

    Deonovic, Benjamin; Wang, Yunhao; Weirather, Jason; Wang, Xiu-Jie; Au, Kin Fai

    2017-01-01

    Allele-specific expression (ASE) is a fundamental problem in studying gene regulation and diploid transcriptome profiles, with two key challenges: (i) haplotyping and (ii) estimation of ASE at the gene isoform level. Existing ASE analysis methods are limited by a dependence on haplotyping from laborious experiments or extra genome/family trio data. In addition, there is a lack of methods for gene isoform level ASE analysis. We developed a tool, IDP-ASE, for full ASE analysis. By innovative integration of Third Generation Sequencing (TGS) long reads with Second Generation Sequencing (SGS) short reads, the accuracy of haplotyping and ASE quantification at the gene and gene isoform level was greatly improved, as demonstrated by the gold standard GM12878 data and semi-simulation data. In addition to methodology development, applications of IDP-ASE to human embryonic stem cells and breast cancer cells indicate that the imbalance of ASE and non-uniformity of gene isoform ASE is widespread, including tumorigenesis relevant genes and pluripotency markers. These results show that gene isoform expression and allele-specific expression cooperate to provide high diversity and complexity of gene regulation and expression, highlighting the importance of studying ASE at the gene isoform level. Our study provides a robust bioinformatics solution to understand ASE using RNA sequencing data only. PMID:27899656

  8. Sickle cell disease in the Kurdish population of northern Iraq.

    PubMed

    Al-Allawi, Nasir A S; Jalal, Sana D; Nerwey, Farida F; Al-Sayan, Galawezh O O; Al-Zebari, Sahima S M; Alshingaly, Awny A; Markous, Raji D; Jubrael, Jaladet M S; Hamamy, Hanan

    2012-01-01

    Epidemiological studies have revealed that sickle cell disease patients are clustered in two geographical areas in Iraq, one among the Arabs in the extreme south, another among the Kurdish population in the extreme north, where they constitute major health problems. However, no studies have focused on the genotypes responsible for sickle cell disease or the β-globin gene haplotypes associated with it. For the latter purpose, a total of 103 unrelated Kurdish sickle cell disease patients were evaluated by restriction fragment length polymorphism (RFLP) for the sickle cell mutation, followed by multiplex polymerase chain reaction (PCR) and reverse hybridization for β- and α-thalassemia (β- and α-thal) mutations, whenever indicated. Results showed that the most common genotype was sickle cell anemia (68.0%) followed by Hb S/β(0)-thal and Hb S/β(+)-thal at frequencies of 24.2 and 7.8%, respectively. Eight β-thal mutations were associated with the latter two genotypes including: IVS-II-1 (G>A), IVS-I-110 (G>A), codon 8 (-AA), codon 44 (-C), codon 22 (-7 bp), IVS-I-1 (G>A), codon 30 (G>C) and IVS-I-6 (T>C). In Hb SS patients, the -α(3.7) deletion was documented in 10.0% and was the only α-thal mutation detected. Furthermore, 5' β-globin gene cluster haplotyping of 128 β(S) chromosomes revealed that the most common haplotype seen in 69.5% was the Benin haplotype, followed by the Arab-Indian haplotype in 12.5%. These latter findings closely resemble reports from neighboring Turkey, Syria, Jordan, Lebanon and Mediterranean countries, suggesting a possible common origin, but are in contrast to findings from the Eastern Arabian Peninsula and Iran.

  9. A genome-wide association study of production traits in a commercial population of Large White pigs: evidence of haplotypes affecting meat quality

    PubMed Central

    2014-01-01

    Background Numerous quantitative trait loci (QTL) have been detected in pigs over the past 20 years using microsatellite markers. However, due to the low density of these markers, the accuracy of QTL location has generally been poor. Since 2009, the dense genome coverage provided by the Illumina PorcineSNP60 BeadChip has made it possible to more accurately map QTL using genome-wide association studies (GWAS). Our objective was to perform high-density GWAS in order to identify genomic regions and corresponding haplotypes associated with production traits in a French Large White population of pigs. Methods Animals (385 Large White pigs from 106 sires) were genotyped using the PorcineSNP60 BeadChip and evaluated for 19 traits related to feed intake, growth, carcass composition and meat quality. Of the 64 432 SNPs on the chip, 44 412 were used for GWAS with an animal mixed model that included a regression coefficient for the tested SNPs and a genomic kinship matrix. SNP haplotype effects in QTL regions were then tested for association with phenotypes following phase reconstruction based on the Sscrofa10.2 pig genome assembly. Results Twenty-three QTL regions were identified on autosomes and their effects ranged from 0.25 to 0.75 phenotypic standard deviation units for feed intake and feed efficiency (four QTL), carcass (12 QTL) and meat quality traits (seven QTL). The 10 most significant QTL regions had effects on carcass (chromosomes 7, 10, 16, 17 and 18) and meat quality traits (two regions on chromosome 1 and one region on chromosomes 8, 9 and 13). Thirteen of the 23 QTL regions had not been previously described. A haplotype block of 183 kb on chromosome 1 (six SNPs) was identified and displayed three distinct haplotypes with significant (0.0001 < P < 0.03) associations with all evaluated meat quality traits. 
Conclusions GWAS analyses with the PorcineSNP60 BeadChip enabled the detection of 23 QTL regions that affect feed consumption, carcass and meat quality traits in a LW population, of which 13 were novel QTL. The proportionally larger number of QTL found for meat quality traits suggests a specific opportunity for improving these traits in the pig by genomic selection. PMID:24528607

  10. Salt tolerance underlies the cryptic invasion of North American salt marshes by an introduced haplotype of the common reed Phragmites australis (Poaceae)

    USGS Publications Warehouse

    Vasquez, Edward A.; Glenn, Edward P.; Brown, J. Jed; Guntenspergen, Glenn R.; Nelson, Stephen G.

    2005-01-01

    A distinct, non-native haplotype of the common reed Phragmites australis has become invasive in Atlantic coastal Spartina marshes. We compared the salt tolerance and other growth characteristics of the invasive M haplotype with 2 native haplotypes (F and AC) in greenhouse experiments. The M haplotype retained 50% of its growth potential up to 0.4 M NaCl, whereas the F and AC haplotypes did not grow above 0.1 M NaCl. The M haplotype produced more shoots per gram of rhizome tissue and had higher relative growth rates than the native haplotypes on both freshwater and saline water treatments. The M haplotype also differed from the native haplotypes in shoot water content and the biometrics of shoots and rhizomes. The results offer an explanation for how the M haplotype is able to spread in coastal salt marshes and support the conclusion of DNA analyses that the M haplotype is a distinct ecotype of P. australis.

  11. Implication of common and disease specific variants in CLU, CR1, and PICALM.

    PubMed

    Ferrari, Raffaele; Moreno, Jorge H; Minhajuddin, Abu T; O'Bryant, Sid E; Reisch, Joan S; Barber, Robert C; Momeni, Parastoo

    2012-08-01

    Two recent genome-wide association studies (GWAS) for late onset Alzheimer's disease (LOAD) revealed 3 new genes: clusterin (CLU), phosphatidylinositol binding clathrin assembly protein (PICALM), and complement receptor 1 (CR1). In order to evaluate association with these genome-wide association study-identified genes and to isolate the variants contributing to the pathogenesis of LOAD, we genotyped the top single nucleotide polymorphisms (SNPs), rs11136000 (CLU), rs3818361 (CR1), and rs3851179 (PICALM), and sequenced the entire coding regions of these genes in our cohort of 342 LOAD patients and 277 control subjects. We confirmed the association of rs3851179 (PICALM) (p = 7.4 × 10⁻³) with the disease status. Through sequencing we identified 18 variants in CLU, 3 of which were found exclusively in patients; 8 variants (out of 65) in CR1 gene were only found in patients and the 16 variants identified in PICALM gene were present in both patients and controls. In silico analysis of the variants in PICALM did not predict any damaging effect on the protein. The haplotype analysis of the variants in each gene predicted a common haplotype when the 3 single nucleotide polymorphisms rs11136000 (CLU), rs3818361 (CR1), and rs3851179 (PICALM), respectively, were included. For each gene the haplotype structure and size differed between patients and controls. In conclusion, we confirmed association of CLU, CR1, and PICALM genes with the disease status in our cohort through identification of a number of disease-specific variants among patients through the sequencing of the coding region of these genes. Published by Elsevier Inc.

  12. Hb S [β6(A3)Glu→Val, GAG>GTG] and β-globin gene cluster haplotype distribution in Mauritania.

    PubMed

    Veten, Fatimetou M; Abdelhamid, Isselmou O; Meiloud, Ghlana M; Ghaber, Sidi M; Salem, Mohamed L; Abbes, Salem; Houmeida, Ahmed O

    2012-01-01

    Of 1050 Mauritanian blood donors screened from the two main racial groups, i.e., the Moors and Black Africans, 60 were found to carry Hb S [β6(A3)Glu→Val, GAG>GTG], giving a global frequency of 5.71%. The prevalence observed in the Black African Mauritanians (10.69%) is almost five times that found in the Moor group (2.25%). Four of the five main β(S) haplotypes were detected in this study: Senegal (77.8%), Benin (8.8%), Arab-Indian (5.5%) and Bantu (4.4%). These data showed that Hb S is a serious public health problem in Mauritania. They also confirm the ethnic heterogeneity of the Mauritanian population.

  13. Ancestral Asian source(s) of new world Y-chromosome founder haplotypes.

    PubMed Central

    Karafet, T M; Zegura, S L; Posukh, O; Osipova, L; Bergen, A; Long, J; Goldman, D; Klitz, W; Harihara, S; de Knijff, P; Wiebe, V; Griffiths, R C; Templeton, A R; Hammer, M F

    1999-01-01

    Haplotypes constructed from Y-chromosome markers were used to trace the origins of Native Americans. Our sample consisted of 2,198 males from 60 global populations, including 19 Native American and 15 indigenous North Asian groups. A set of 12 biallelic polymorphisms gave rise to 14 unique Y-chromosome haplotypes that were unevenly distributed among the populations. Combining multiallelic variation at two Y-linked microsatellites (DYS19 and DXYS156Y) with the unique haplotypes results in a total of 95 combination haplotypes. Contra previous findings based on Y-chromosome data, our new results suggest the possibility of more than one Native American paternal founder haplotype. We postulate that, of the nine unique haplotypes found in Native Americans, haplotypes 1C and 1F are the best candidates for major New World founder haplotypes, whereas haplotypes 1B, 1I, and 1U may either be founder haplotypes and/or have arrived in the New World via recent admixture. Two of the other four haplotypes (YAP+ haplotypes 4 and 5) are probably present because of post-Columbian admixture, whereas haplotype 1G may have originated in the New World, and the Old World source of the final New World haplotype (1D) remains unresolved. The contrasting distribution patterns of the two major candidate founder haplotypes in Asia and the New World, as well as the results of a nested cladistic analysis, suggest the possibility of more than one paternal migration from the general region of Lake Baikal to the Americas. PMID:10053017

  14. A simulation-based approach for solving assembly line balancing problem

    NASA Astrophysics Data System (ADS)

    Wu, Xiaoyu

    2017-09-01

    The assembly line balancing problem is directly related to production efficiency. It has been discussed since the last century and is still widely studied. In this paper, the problem is studied by establishing a mathematical model and by simulation. First, a model for determining the smallest production beat (cycle time) for a given number of workstations is analyzed. Based on this model, the exponential smoothing approach is applied to improve the efficiency of the algorithm. A gas Stirling engine assembly line balancing problem is then discussed as a case study. Both algorithms are implemented in the Lingo programming environment, and the simulation results demonstrate the validity of the new methods.

  15. On the comparison of population-level estimates of haplotype and nucleotide diversity: a case study using the gene cox1 in animals.

    PubMed

    Goodall-Copestake, W P; Tarling, G A; Murphy, E J

    2012-07-01

    Estimates of genetic diversity represent a valuable resource for biodiversity assessments and are increasingly used to guide conservation and management programs. The most commonly reported estimates of DNA sequence diversity in animal populations are haplotype diversity (h) and nucleotide diversity (π) for the mitochondrial gene cytochrome c oxidase subunit I (cox1). However, several issues relevant to the comparison of h and π within and between studies remain to be assessed. We used population-level cox1 data from peer-reviewed publications to quantify the extent to which data sets can be re-assembled, to provide a standardized summary of h and π estimates, to explore the relationship between these metrics and to assess their sensitivity to under-sampling. Only 19 out of 42 selected publications had archived data that could be unambiguously re-assembled; this comprised 127 population-level data sets (n ≥ 15) from 23 animal species. Estimates of h and π were calculated using a 456-base region of cox1 that was common to all the data sets (median h = 0.70130, median π = 0.00356). Non-linear regression methods and Bayesian information criterion analysis revealed that the most parsimonious model describing the relationship between the estimates of h and π was π = 0.0081h². Deviations from this model can be used to detect outliers due to biological processes or methodological issues. Subsampling analyses indicated that samples of n > 5 were sufficient to discriminate extremes of high from low population-level cox1 diversity, but samples of n ≥ 25 are recommended for greater accuracy.
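
    The two statistics compared in this record can be sketched in a few lines of Python. This is a minimal illustration, not the authors' pipeline: it assumes the textbook definitions (Nei's haplotype diversity and the mean number of pairwise differences per site), aligned sequences of equal length, and hypothetical function names. `predicted_pi` simply evaluates the fitted model quoted in the abstract.

```python
from collections import Counter
from itertools import combinations


def haplotype_diversity(seqs):
    """Nei's haplotype diversity: h = n/(n-1) * (1 - sum of squared haplotype frequencies)."""
    n = len(seqs)
    if n < 2:
        raise ValueError("need at least two sequences")
    counts = Counter(seqs)
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1.0 - sum_sq)


def nucleotide_diversity(seqs):
    """Mean number of pairwise differences per site across all sequence pairs."""
    n = len(seqs)
    if n < 2:
        raise ValueError("need at least two sequences")
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("sequences must be aligned to the same length")
    total_diffs = sum(
        sum(a != b for a, b in zip(s, t)) for s, t in combinations(seqs, 2)
    )
    n_pairs = n * (n - 1) / 2
    return total_diffs / (n_pairs * length)


def predicted_pi(h):
    """Evaluate the relationship reported in the abstract: pi = 0.0081 * h^2."""
    return 0.0081 * h ** 2


# Toy alignment (made-up data, four sequences of four bases).
toy = ["AAAA", "AAAA", "AAAT", "AATT"]
print(haplotype_diversity(toy), nucleotide_diversity(toy))
```

    A population whose observed π lies far from `predicted_pi(h)` would be a candidate outlier in the sense described above; what counts as "far" is left open here because the abstract does not give a threshold.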

  16. Chromosome-level assembly of Arabidopsis thaliana Ler reveals the extent of translocation and inversion polymorphisms.

    PubMed

    Zapata, Luis; Ding, Jia; Willing, Eva-Maria; Hartwig, Benjamin; Bezdan, Daniela; Jiao, Wen-Biao; Patel, Vipul; Velikkakam James, Geo; Koornneef, Maarten; Ossowski, Stephan; Schneeberger, Korbinian

    2016-07-12

    Resequencing or reference-based assemblies reveal large parts of the small-scale sequence variation. However, they typically fail to separate such local variation into colinear and rearranged variation, because they usually do not recover the complement of large-scale rearrangements, including transpositions and inversions. Besides the availability of hundreds of genomes of diverse Arabidopsis thaliana accessions, there is so far only one full-length assembled genome: the reference sequence. We have assembled 117 Mb of the A. thaliana Landsberg erecta (Ler) genome into five chromosome-equivalent sequences using a combination of short Illumina reads, long PacBio reads, and linkage information. Whole-genome comparison against the reference sequence revealed 564 transpositions and 47 inversions comprising ∼3.6 Mb, in addition to 4.1 Mb of nonreference sequence, mostly originating from duplications. Although rearranged regions are not different in local divergence from colinear regions, they are drastically depleted for meiotic recombination in heterozygotes. Using a 1.2-Mb inversion as an example, we show that such rearrangement-mediated reduction of meiotic recombination can lead to genetically isolated haplotypes in the worldwide population of A. thaliana. Moreover, we found 105 single-copy genes, which were only present in the reference sequence or the Ler assembly, and 334 single-copy orthologs, which showed an additional copy in only one of the genomes. To our knowledge, this work gives first insights into the degree and type of variation, which will be revealed once complete assemblies replace resequencing or other reference-dependent methods.

  17. Association Between Chloroplast DNA and Mitochondrial DNA Haplotypes in Prunus spinosa L. (Rosaceae) Populations across Europe

    PubMed Central

    MOHANTY, APARAJITA; MARTÍN, JUAN PEDRO; GONZÁLEZ, LUIS MIGUEL; AGUINAGALDE, ITZIAR

    2003-01-01

    Chloroplast DNA (cpDNA) and mitochondrial DNA (mtDNA) were studied in 24 populations of Prunus spinosa sampled across Europe. The cpDNA and mtDNA fragments were amplified using universal primers and subsequently digested with restriction enzymes to obtain the polymorphisms. Combinations of all the polymorphisms resulted in 33 cpDNA haplotypes and two mtDNA haplotypes. Strict association between the cpDNA haplotypes and the mtDNA haplotypes was detected in most cases, indicating conjoint inheritance of the two genomes. The most frequent and abundant cpDNA haplotype (C20; frequency, 51 %) is always associated with the more frequent and abundant mtDNA haplotype (M1; frequency, 84 %). All but two of the cpDNA haplotypes associated with the less frequent mtDNA haplotype (M2) are private haplotypes. These private haplotypes are phylogenetically related but geographically unrelated. They form a separate cluster on the minimum‐length spanning tree. PMID:14534199

  18. [Construction of haplotype and haplotype block based on tag single nucleotide polymorphisms and their applications in association studies].

    PubMed

    Gu, Ming-liang; Chu, Jia-you

    2007-12-01

    The human genome has haplotype and haplotype block structures, which provide valuable information on human evolutionary history and may lead to the development of more efficient strategies to identify genetic variants that increase susceptibility to complex diseases. Haplotypes can be divided into discrete blocks of limited haplotype diversity. In each block, a small fraction of "tag SNPs" can be used to distinguish a large fraction of the haplotypes. These tag SNPs can be potentially useful for the construction of haplotypes and haplotype blocks, and for association studies in complex diseases. There are two general classes of methods for constructing haplotypes and haplotype blocks, based on genotypes from large pedigrees and on statistical algorithms, respectively. The authors evaluate several construction methods to assess the power of different association tests with a variety of disease models and block-partitioning criteria. The advantages, limitations and applications of each method, and its use in association studies, are discussed. With the completion of the HapMap and the development of statistical algorithms for haplotype reconstruction, haplotype construction approaches that combine mathematics, physics, and computer science will have a profound impact on population genetics, on the mapping and cloning of susceptibility genes for complex diseases, and on related areas of the life sciences.

  19. Genetic predisposition for femoral neck stress fractures in military conscripts.

    PubMed

    Korvala, Johanna; Hartikka, Heini; Pihlajamäki, Harri; Solovieva, Svetlana; Ruohola, Juha-Petri; Sahi, Timo; Barral, Sandra; Ott, Jürg; Ala-Kokko, Leena; Männikkö, Minna

    2010-10-21

    Stress fractures are a significant problem among athletes and soldiers and may result in devastating complications or even permanent handicap. Genetic factors may increase the risk, but no major susceptibility genes have been identified. The purpose of this study was to search for possible genetic factors predisposing military conscripts to femoral neck stress fractures. Eight genes involved in bone metabolism or pathology (COL1A1, COL1A2, OPG, ESR1, VDR, CTR, LRP5, IL-6) were examined in 72 military conscripts with a femoral neck stress fracture and 120 controls. The risk of femoral neck stress fracture was significantly higher in subjects with low weight and body mass index (BMI). An interaction between the CTR (rs1801197) minor allele C and the VDR C-A haplotype was observed, and subjects lacking the C allele in CTR and/or the C-A haplotype in VDR had a 3-fold higher risk of stress fracture than subjects carrying both (OR = 3.22, 95% CI 1.38-7.49, p = 0.007). In addition, the LRP5 haplotype A-G-G-C alone and in combination with the VDR haplotype C-A was associated with stress fractures through reduced body weight and BMI. Our findings suggest that genetic factors play a role in the development of stress fractures in individuals subjected to heavy exercise and mechanical loading. The present results can be applied to the design of future studies that will further elucidate the genetics of stress fractures.

  20. Haplotype diversity and linkage disequilibrium at DRD2 locus--a study on four population groups of Andhra Pradesh, India.

    PubMed

    Saraswathy, Kallur Nava; Mukhopadhyay, Rupak; Shukla, Deepti; Kaur, Harpreet; Sachdeva, Mohinder Pal; Rao, A P; Saksena, Deepti; Kalla, Aloke Kumar

    2009-02-01

    Dopamine receptor D2 (DRD2) is expressed in the central nervous system and has a high affinity for many antipsychotic drugs. Besides several epidemiological investigations on association of DRD2 locus polymorphism(s) with neuropsychiatric problems and addictive behavior, a few polymorphisms in this locus have also been used to understand genomic diversity and population migratory histories globally. The present study attempts to understand the genomic diversity/affinity among four endogamous groups of Andhra Pradesh (India) against the backdrop of diversity studies from other parts of India and the rest of the world, with special reference to DRD2 locus. The four population groups from Adilabad District of Andhra Pradesh, namely, Brahmin (n=50), Nayakpod (n=49), Thoti (n=52), and Kolam (n=53), were included in the study. The DRD2 markers typed for the present study are three biallelic restriction fragments, that is, TaqI A (rs1800497), TaqI B (rs1079597), and TaqI D (rs1800498). Scoring of DRD2 haplotypes with respect to the three TaqI sites shows that five out of eight possible haplotypes are shared by the four populations. Ancestral haplotype B2D2A1 is most frequent among Thotis (0.359). The results of the present study indicate a differential gene flow into South India followed by certain important demographic events resulting in diversified peopling of India.
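
    The count of "eight possible haplotypes" follows from three biallelic sites (2 × 2 × 2). A small sketch makes the enumeration explicit. The function name is hypothetical, and the 1/2 allele coding is an assumption extrapolated from the abstract's "B2D2A1" label.

```python
from itertools import product


def enumerate_haplotypes(sites):
    """Return every allele combination across the given biallelic sites.

    `sites` maps a site label to its two allele labels; haplotype strings are
    built in the dictionary's insertion order.
    """
    allele_sets = [tuple(alleles) for alleles in sites.values()]
    return ["".join(combo) for combo in product(*allele_sets)]


# Site order B, D, A mirrors the "B2D2A1" label in the abstract; the allele
# names other than B2, D2 and A1 are assumed for illustration.
drd2_sites = {
    "TaqI B": ("B1", "B2"),
    "TaqI D": ("D1", "D2"),
    "TaqI A": ("A1", "A2"),
}
haplotypes = enumerate_haplotypes(drd2_sites)
print(len(haplotypes), haplotypes)
```

    The same function applied to k biallelic sites yields 2^k combinations, so five sites give the 32 possible haplotypes mentioned in the beta-globin cluster record later in this listing.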

  1. Classical sickle beta-globin haplotypes exhibit a high degree of long-range haplotype similarity in African and Afro-Caribbean populations

    PubMed Central

    Hanchard, Neil; Elzein, Abier; Trafford, Clare; Rockett, Kirk; Pinder, Margaret; Jallow, Muminatou; Harding, Rosalind; Kwiatkowski, Dominic; McKenzie, Colin

    2007-01-01

    Background The sickle (βs) mutation in the beta-globin gene (HBB) occurs on five "classical" βs haplotype backgrounds in ethnic groups of African ancestry. Strong selection in favour of the βs allele – a consequence of protection from severe malarial infection afforded by heterozygotes – has been associated with a high degree of extended haplotype similarity. The relationship between classical βs haplotypes and long-range haplotype similarity may have both anthropological and clinical implications, but to date has not been explored. Here we evaluate the haplotype similarity of classical βs haplotypes over 400 kb in population samples from Jamaica, The Gambia, and among the Yoruba of Nigeria (Hapmap YRI). Results The most common βs sub-haplotype among Jamaicans and the Yoruba was the Benin haplotype, while in The Gambia the Senegal haplotype was observed most commonly. Both subtypes exhibited a high degree of long-range haplotype similarity extending across approximately 400 kb in all three populations. This long-range similarity was significantly greater than that seen for other haplotypes sampled in these populations (P < 0.001), and was independent of marker choice and marker density. Among the Yoruba, Benin haplotypes were highly conserved, with very strong linkage disequilibrium (LD) extending a megabase across the βs mutation. Conclusion Two different classical βs haplotypes, sampled from different populations, exhibit comparable and extensive long-range haplotype similarity and strong LD. This LD extends across the adjacent recombination hotspot, and is discernable at distances in excess of 400 kb. Although the multi-centric geographic distribution of βs haplotypes indicates strong subdivision among early Holocene sub-Saharan populations, we find no evidence that selective pressures imposed by falciparum malaria varied in intensity or timing between these subpopulations. 
Our observations also suggest that cis-acting loci, which may influence outcomes in sickle cell disease, could lie considerable distances away from β-globin. PMID:17688704

  2. Classical sickle beta-globin haplotypes exhibit a high degree of long-range haplotype similarity in African and Afro-Caribbean populations.

    PubMed

    Hanchard, Neil; Elzein, Abier; Trafford, Clare; Rockett, Kirk; Pinder, Margaret; Jallow, Muminatou; Harding, Rosalind; Kwiatkowski, Dominic; McKenzie, Colin

    2007-08-10

    The sickle (betas) mutation in the beta-globin gene (HBB) occurs on five "classical" betas haplotype backgrounds in ethnic groups of African ancestry. Strong selection in favour of the betas allele - a consequence of protection from severe malarial infection afforded by heterozygotes - has been associated with a high degree of extended haplotype similarity. The relationship between classical betas haplotypes and long-range haplotype similarity may have both anthropological and clinical implications, but to date has not been explored. Here we evaluate the haplotype similarity of classical betas haplotypes over 400 kb in population samples from Jamaica, The Gambia, and among the Yoruba of Nigeria (Hapmap YRI). The most common betas sub-haplotype among Jamaicans and the Yoruba was the Benin haplotype, while in The Gambia the Senegal haplotype was observed most commonly. Both subtypes exhibited a high degree of long-range haplotype similarity extending across approximately 400 kb in all three populations. This long-range similarity was significantly greater than that seen for other haplotypes sampled in these populations (P < 0.001), and was independent of marker choice and marker density. Among the Yoruba, Benin haplotypes were highly conserved, with very strong linkage disequilibrium (LD) extending a megabase across the betas mutation. Two different classical betas haplotypes, sampled from different populations, exhibit comparable and extensive long-range haplotype similarity and strong LD. This LD extends across the adjacent recombination hotspot, and is discernable at distances in excess of 400 kb. Although the multi-centric geographic distribution of betas haplotypes indicates strong subdivision among early Holocene sub-Saharan populations, we find no evidence that selective pressures imposed by falciparum malaria varied in intensity or timing between these subpopulations. 
Our observations also suggest that cis-acting loci, which may influence outcomes in sickle cell disease, could lie considerable distances away from beta-globin.

  3. Heuristics for Multiobjective Optimization of Two-Sided Assembly Line Systems

    PubMed Central

    Jawahar, N.; Ponnambalam, S. G.; Sivakumar, K.; Thangadurai, V.

    2014-01-01

    Products such as cars, trucks, and heavy machinery are assembled on two-sided assembly lines. Assembly line balancing has a significant impact on the performance and productivity of flow line manufacturing systems and has been an active research area for several decades. This paper addresses the two-sided assembly line balancing problem (TALBP), in which each task is to be assigned to the L side, the R side, or either side (denoted E). Two objectives, minimum number of workstations and minimum unbalance time among workstations, are considered for balancing the assembly line. There are two approaches to solving a multiobjective optimization problem: the first combines all the objectives into a single composite function or moves all but one objective to the constraint set; the second determines the Pareto optimal solution set. This paper proposes two heuristics to evolve the optimal Pareto front for the TALBP under consideration: an Enumerative Heuristic Algorithm (EHA) to handle problems of small and medium size and a Simulated Annealing Algorithm (SAA) for large-sized problems. The proposed approaches are illustrated with example problems and their performances are compared on a set of test problems. PMID:24790568
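
    The two approaches to multiobjective optimization described above can be contrasted on a toy example. The sketch below is neither the EHA nor the SAA from the paper; it only filters a hand-made list of candidate line balances, each scored as (number of workstations, unbalance time), with both objectives minimized. All names and numbers are illustrative assumptions.

```python
def dominates(a, b):
    """True if `a` is no worse than `b` in every objective and strictly better in one."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def pareto_front(solutions):
    """Second approach: keep every solution that no other solution dominates."""
    return [s for s in solutions if not any(dominates(o, s) for o in solutions)]


def weighted_sum_best(solutions, weights):
    """First approach: collapse the objectives into one composite score."""
    return min(solutions, key=lambda s: sum(w * v for w, v in zip(weights, s)))


# Made-up candidates: (workstations, unbalance time).
candidates = [(4, 9.0), (5, 3.0), (5, 6.0), (6, 3.0), (6, 1.0)]

print(pareto_front(candidates))                  # the trade-off set
print(weighted_sum_best(candidates, (1, 1)))     # one answer per weight choice
print(weighted_sum_best(candidates, (10, 1)))
```

    The composite score returns a single solution that shifts with the chosen weights, whereas the Pareto filter returns the whole trade-off set and leaves the final choice to the decision maker.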

  4. Animal selection for whole genome sequencing by quantifying the unique contribution of homozygous haplotypes sequenced

    USDA-ARS's Scientific Manuscript database

    Major whole genome sequencing projects promise to identify rare and causal variants within livestock species; however, the efficient selection of animals for sequencing remains a major problem within these surveys. The goal of this project was to develop a library of high accuracy genetic variants f...

  5. Detecting local haplotype sharing and haplotype association

    USDA-ARS's Scientific Manuscript database

    A novel haplotype association method is presented, and its power is demonstrated. Relying on a statistical model for linkage disequilibrium (LD), the method first infers ancestral haplotypes and their loadings at each marker for each individual. The loadings are then used to quantify local haplotype...

  6. Inverse Problem in Self-assembly

    NASA Astrophysics Data System (ADS)

    Tkachenko, Alexei

    2012-02-01

    By decorating colloids and nanoparticles with DNA, one can introduce highly selective key-lock interactions between them. This leads to a new class of systems and problems in soft condensed matter physics. In particular, this opens a possibility to solve the inverse problem in self-assembly: how to build an arbitrary desired structure with the bottom-up approach? I will present a theoretical and computational analysis of the hierarchical strategy in attacking this problem. It involves self-assembly of particular building blocks ("octopus particles"), that in turn would assemble into the target structure. On a conceptual level, our approach combines elements of three different brands of programmable self-assembly: DNA nanotechnology, nanoparticle-DNA assemblies and patchy colloids. I will discuss the general design principles, theoretical and practical limitations of this approach, and illustrate them with our simulation results. Our crucial result is that not only is it possible to design a system that has a given nanostructure as a ground state, but one can also program and optimize the kinetic pathway for its self-assembly.

  7. Canis mtDNA HV1 database: a web-based tool for collecting and surveying Canis mtDNA HV1 haplotype in public database.

    PubMed

    Thai, Quan Ke; Chung, Dung Anh; Tran, Hoang-Dung

    2017-06-26

    Canine and wolf mitochondrial DNA haplotypes, which can be used for forensic or phylogenetic analyses, have been defined in various schemes depending on the region analyzed. In recent studies, the 582 bp fragment of the HV1 region is most commonly used. 317 different canine HV1 haplotypes have been reported in the rapidly growing public database GenBank. These reported haplotypes contain several inconsistencies in their haplotype information. To overcome this issue, we have developed a Canis mtDNA HV1 database. This database collects data on the HV1 582 bp region in dog mitochondrial DNA from GenBank to screen and correct the inconsistencies. It also supports users in the detection of novel mutation profiles and the assignment of new haplotypes. The Canis mtDNA HV1 database (CHD) contains 5567 nucleotide entries originating from 15 subspecies in the species Canis lupus. Of these entries, 3646 were haplotypes and grouped into 804 distinct sequences. 319 sequences were recognized as previously assigned haplotypes, while the remaining 485 sequences had new mutation profiles and were marked as new haplotype candidates awaiting further analysis for haplotype assignment. Of the 3646 nucleotide entries, only 414 were annotated with correct haplotype information, while 3232 had insufficient or lacked haplotype information and were corrected or modified before storing in the CHD. The CHD can be accessed at http://chd.vnbiology.com . It provides sequences, haplotype information, and a web-based tool for mtDNA HV1 haplotyping. The CHD is updated monthly and supplies all data for download. The Canis mtDNA HV1 database contains information about canine mitochondrial DNA HV1 sequences with reconciled annotation. It serves as a tool for detecting inconsistencies in GenBank and helps identify new HV1 haplotypes. Thus, it supports the scientific community in naming new HV1 haplotypes and in reconciling existing annotation of HV1 582 bp sequences.

  8. Analysis of the Aedes albopictus C6/36 genome provides insight into cell line utility for viral propagation.

    PubMed

    Miller, Jason R; Koren, Sergey; Dilley, Kari A; Puri, Vinita; Brown, David M; Harkins, Derek M; Thibaud-Nissen, Françoise; Rosen, Benjamin; Chen, Xiao-Guang; Tu, Zhijian; Sharakhov, Igor V; Sharakhova, Maria V; Sebra, Robert; Stockwell, Timothy B; Bergman, Nicholas H; Sutton, Granger G; Phillippy, Adam M; Piermarini, Peter M; Shabman, Reed S

    2018-03-01

    The 50-year-old Aedes albopictus C6/36 cell line is a resource for the detection, amplification, and analysis of mosquito-borne viruses including Zika, dengue, and chikungunya. The cell line is derived from an unknown number of larvae from an unspecified strain of Aedes albopictus mosquitoes. Toward improved utility of the cell line for research in virus transmission, we present an annotated assembly of the C6/36 genome. The C6/36 genome assembly has the largest contig N50 (3.3 Mbp) of any mosquito assembly, presents the sequences of both haplotypes for most of the diploid genome, reveals independent null mutations in both alleles of the Dicer locus, and indicates a male-specific genome. Gene annotation was computed with publicly available mosquito transcript sequences. Gene expression data from cell line RNA sequence identified enrichment of growth-related pathways and conspicuous deficiency in aquaporins and inward rectifier K+ channels. As a test of utility, RNA sequence data from Zika-infected cells were mapped to the C6/36 genome and transcriptome assemblies. Host subtraction reduced the data set by 89%, enabling faster characterization of nonhost reads. The C6/36 genome sequence and annotation should enable additional uses of the cell line to study arbovirus vector interactions and interventions aimed at restricting the spread of human disease.
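
    The contig N50 quoted above is a summary of assembly contiguity: the length L such that contigs of length at least L together cover at least half of the assembled bases. A minimal sketch of that common definition follows. The function name is hypothetical and the input lengths are made up, not taken from the C6/36 assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    Contigs are visited from longest to shortest; the N50 is the length of the
    contig at which the running total first reaches half of the assembly size.
    """
    if not lengths:
        raise ValueError("need at least one contig length")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Toy contig lengths in arbitrary units.
print(n50([2, 3, 4, 5, 6]))
```

    Because the statistic depends only on the sorted length distribution, it says nothing about whether the contigs are correctly assembled, which is why it is usually reported alongside other quality measures.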

  9. Beta-globin gene cluster haplotypes of Amerindian populations from the Brazilian Amazon region.

    PubMed

    Guerreiro, J F; Figueiredo, M S; Zago, M A

    1994-01-01

    We have determined the beta-globin cluster haplotypes for 80 Indians from four Brazilian Amazon tribes: Kayapó, Wayampí, Wayana-Apalaí, and Arára. The results are analyzed together with 20 Yanomámi previously studied. From 2 to 4 different haplotypes were identified for each tribe, and 7 of the possible 32 haplotypes were found in a sample of 172 chromosomes for which the beta haplotypes were directly determined or derived from family studies. The haplotype distribution does not differ significantly among the five populations. The two most common haplotypes in all tribes were haplotypes 2 and 6, with average frequencies of 0.843 and 0.122, respectively. The genetic affinities between Brazilian Indians and other human populations were evaluated by estimates of genetic distance based on haplotype data. The lowest values were observed in relation to Asians, especially Chinese, Polynesians, and Micronesians.

  10. Mitochondrial haplotype variation and phylogeography of Iberian brown trout populations.

    PubMed

    MacHordom, A; Suárez, J; Almodóvar, A; Bautista, J M

    2000-09-01

    The biogeographical distribution of brown trout mitochondrial DNA haplotypes throughout the Iberian Peninsula was established by polymerase chain reaction-restriction fragment polymorphism analysis. The study of 507 specimens from 58 localities representing eight widely separated Atlantic-slope (north and west Iberian coasts) and six Mediterranean drainage systems served to identify five main groups of mitochondrial haplotypes: (i) haplotypes corresponding to non-native, hatchery-reared brown trout that were widely distributed but also found in wild populations of northern Spain (Cantabrian slope); (ii) a widespread Atlantic haplotype group; (iii) a haplotype restricted to the Duero Basin; (iv) a haplotype shown by southern Iberian populations; and (v) a Mediterranean haplotype. The Iberian distribution of these haplotypes reflects both the current fishery management policy of introducing non-native brown trout, and Messinian palaeobiogeography. Our findings complement and extend previous allozyme studies on Iberian brown trout and improve present knowledge of glacial refugia and postglacial movement of brown trout lineages.

  11. How Have Self-Incompatibility Haplotypes Diversified? Generation of New Haplotypes during the Evolution of Self-Incompatibility from Self-Compatibility.

    PubMed

    Sakai, Satoki

    2016-08-01

    I developed a gametophytic self-incompatibility (SI) model to study the conditions leading to diversification in SI haplotypes. In the model, the SI system is assumed to be incomplete, and the pollen expressing a given specificity is not fully rejected by the pistils expressing the same specificity. I also assumed that mutations can occur that enhance the rejection of pollen by pistils with the same haplotype variant and reduce rejection by pistils with other variants in the same haplotype. I found that if such mutations occur, the new haplotypes (mutant variants) can stably coexist with the ancestral haplotype in which the mutant arose. This is because pollen bearing the new haplotype is most strongly rejected by pistils bearing the same new haplotype among the pistils in the population; hence, negative frequency-dependent selection prevents their fixation. I also performed simulations and found that the nearly complete SI system evolves from completely self-compatible populations and that SI haplotypes can increase to about 40-50 within a few thousand generations. On the basis of my findings, I propose that diversification of SI haplotypes occurred during the evolution of SI from self-compatibility.

  12. Sequences of 95 human MHC haplotypes reveal extreme coding variation in genes other than highly polymorphic HLA class I and II

    PubMed Central

    Norman, Paul J.; Norberg, Steven J.; Guethlein, Lisbeth A.; Nemat-Gorgani, Neda; Royce, Thomas; Wroblewski, Emily E.; Dunn, Tamsen; Mann, Tobias; Alicata, Claudia; Hollenbach, Jill A.; Chang, Weihua; Shults Won, Melissa; Gunderson, Kevin L.; Abi-Rached, Laurent; Ronaghi, Mostafa; Parham, Peter

    2017-01-01

    The most polymorphic part of the human genome, the MHC, encodes over 160 proteins of diverse function. Half of them, including the HLA class I and II genes, are directly involved in immune responses. Consequently, the MHC region strongly associates with numerous diseases and clinical therapies. Notoriously, the MHC region has been intractable to high-throughput analysis at complete sequence resolution, and current reference haplotypes are inadequate for large-scale studies. To address these challenges, we developed a method that specifically captures and sequences the 4.8-Mbp MHC region from genomic DNA. For 95 MHC homozygous cell lines we assembled, de novo, a set of high-fidelity contigs and a sequence scaffold, representing a mean 98% of the target region. Included are six alternative MHC reference sequences of the human genome that we completed and refined. Characterization of the sequence and structural diversity of the MHC region shows the approach accurately determines the sequences of the highly polymorphic HLA class I and HLA class II genes and the complex structural diversity of complement factor C4A/C4B. It has also uncovered extensive and unexpected diversity in other MHC genes; an example is MUC22, which encodes a lung mucin and exhibits more coding sequence alleles than any HLA class I or II gene studied here. More than 60% of the coding sequence alleles analyzed were previously uncharacterized. We have created a substantial database of robust reference MHC haplotype sequences that will enable future population scale studies of this complicated and clinically important region of the human genome. PMID:28360230

  13. The effect of using genealogy-based haplotypes for genomic prediction

    PubMed Central

    2013-01-01

    Background Genomic prediction uses two sources of information: linkage disequilibrium between markers and quantitative trait loci, and additive genetic relationships between individuals. One way to increase the accuracy of genomic prediction is to capture more linkage disequilibrium by regression on haplotypes instead of regression on individual markers. The aim of this study was to investigate the accuracy of genomic prediction using haplotypes based on local genealogy information. Methods A total of 4429 Danish Holstein bulls were genotyped with the 50K SNP chip. Haplotypes were constructed using local genealogical trees. Effects of haplotype covariates were estimated with two types of prediction models: (1) assuming that effects had the same distribution for all haplotype covariates, i.e. the GBLUP method and (2) assuming that a large proportion (π) of the haplotype covariates had zero effect, i.e. a Bayesian mixture method. Results About 7.5 times more covariate effects were estimated when fitting haplotypes based on local genealogical trees compared to fitting individual markers. Genealogy-based haplotype clustering slightly increased the accuracy of genomic prediction and, in some cases, decreased the bias of prediction. With the Bayesian method, accuracy of prediction was less sensitive to parameter π when fitting haplotypes compared to fitting markers. Conclusions Use of haplotypes based on genealogy can slightly increase the accuracy of genomic prediction. Improved methods to cluster the haplotypes constructed from local genealogy could lead to additional gains in accuracy. PMID:23496971

  14. The effect of using genealogy-based haplotypes for genomic prediction.

    PubMed

    Edriss, Vahid; Fernando, Rohan L; Su, Guosheng; Lund, Mogens S; Guldbrandtsen, Bernt

    2013-03-06

    Genomic prediction uses two sources of information: linkage disequilibrium between markers and quantitative trait loci, and additive genetic relationships between individuals. One way to increase the accuracy of genomic prediction is to capture more linkage disequilibrium by regression on haplotypes instead of regression on individual markers. The aim of this study was to investigate the accuracy of genomic prediction using haplotypes based on local genealogy information. A total of 4429 Danish Holstein bulls were genotyped with the 50K SNP chip. Haplotypes were constructed using local genealogical trees. Effects of haplotype covariates were estimated with two types of prediction models: (1) assuming that effects had the same distribution for all haplotype covariates, i.e. the GBLUP method and (2) assuming that a large proportion (π) of the haplotype covariates had zero effect, i.e. a Bayesian mixture method. About 7.5 times more covariate effects were estimated when fitting haplotypes based on local genealogical trees compared to fitting individual markers. Genealogy-based haplotype clustering slightly increased the accuracy of genomic prediction and, in some cases, decreased the bias of prediction. With the Bayesian method, accuracy of prediction was less sensitive to parameter π when fitting haplotypes compared to fitting markers. Use of haplotypes based on genealogy can slightly increase the accuracy of genomic prediction. Improved methods to cluster the haplotypes constructed from local genealogy could lead to additional gains in accuracy.

  15. Recombinant structures expand and contract inter and intragenic diversification at the KIR locus

    PubMed Central

    2013-01-01

    Background The human KIR genes are arranged in at least six major gene-content haplotypes, all of which are combinations of four centromeric and two telomeric motifs. Several less frequent or minor haplotypes also exist, including insertions, deletions, and hybridization of KIR genes derived from the major haplotypes. These haplotype structures and their concomitant linkage disequilibrium among KIR genes suggest that more meaningful correlative data from studies of KIR genetics and complex disease may be achieved by measuring haplotypes of the KIR region in total. Results Towards that end, we developed a KIR haplotyping method that reports unambiguous combinations of KIR gene-content haplotypes, including both phase and copy number for each KIR. A total of 37 different gene content haplotypes were detected from 4,512 individuals and new sequence data was derived from haplotypes where the detailed structure was not previously available. Conclusions These new structures suggest a number of specific recombinant events during the course of KIR evolution, and add to an expanding diversity of potential new KIR haplotypes derived from gene duplication, deletion, and hybridization. PMID:23394822

  16. High-throughput physical mapping of chromosomes using automated in situ hybridization.

    PubMed

    George, Phillip; Sharakhova, Maria V; Sharakhov, Igor V

    2012-06-28

    Projects to obtain whole-genome sequences for 10,000 vertebrate species and for 5,000 insect and related arthropod species are expected to take place over the next 5 years. For example, the sequencing of the genomes for 15 malaria mosquito species is currently being done using an Illumina platform. This Anopheles species cluster includes both vectors and non-vectors of malaria. When the genome assemblies become available, researchers will have the unique opportunity to perform comparative analysis for inferring evolutionary changes relevant to vector ability. However, it has proven difficult to use next-generation sequencing reads to generate high-quality de novo genome assemblies. Moreover, the existing genome assemblies for Anopheles gambiae, although obtained using the Sanger method, are gapped or fragmented. Success of comparative genomic analyses will be limited if researchers deal with numerous sequencing contigs, rather than with chromosome-based genome assemblies. Fragmented, unmapped sequences create problems for genomic analyses because: (i) unidentified gaps cause incorrect or incomplete annotation of genomic sequences; (ii) unmapped sequences lead to confusion between paralogous genes and genes from different haplotypes; and (iii) the lack of chromosome assignment and orientation of the sequencing contigs does not allow for reconstructing rearrangement phylogeny and studying chromosome evolution. Developing high-resolution physical maps for species with newly sequenced genomes is a timely and cost-effective investment that will facilitate genome annotation, evolutionary analysis, and re-sequencing of individual genomes from natural populations. Here, we present innovative approaches to chromosome preparation, fluorescent in situ hybridization (FISH), and imaging that facilitate rapid development of physical maps. Using An. gambiae as an example, we demonstrate that the development of physical chromosome maps can potentially improve genome assemblies and, thus, the quality of genomic analyses. First, we use a high-pressure method to prepare polytene chromosome spreads. This method, originally developed for Drosophila, allows the user to visualize more details on chromosomes than the regular squashing technique. Second, a fully automated, front-end system for FISH is used for high-throughput physical genome mapping. The automated slide staining system runs multiple assays simultaneously and dramatically reduces hands-on time. Third, an automatic fluorescent imaging system, which includes a motorized slide stage, automatically scans and photographs labeled chromosomes after FISH. This system is especially useful for identifying and visualizing multiple chromosomal plates on the same slide. In addition, the scanning process captures a more uniform FISH result. Overall, the automated high-throughput physical mapping protocol is more efficient than a standard manual protocol.

  17. BCL11A Enhancer Haplotypes and Fetal Hemoglobin in Sickle Cell Anemia

    PubMed Central

    Sebastiani, P.; Farrell, J.J.; Alsultan, A.; Wang, S.; Edward, H. L.; Shappell, H.; Bae, H.; Milton, J. N.; Baldwin, C.T.; Al-Rubaish, A.M.; Naserullah, Z.; Al-Muhanna, F.; Alsuliman, A.; Patra, P. K.; Farrer, L.A.; Ngo, D.; Vathipadiekal, V.; Chui, D.H.K.; Al-Ali, A.K.; Steinberg, M.H.

    2015-01-01

    Background Fetal hemoglobin (HbF) levels in sickle cell anemia patients vary. We genotyped polymorphisms in the erythroid-specific enhancer of BCL11A to see if they might account for the very high HbF associated with the Arab-Indian (AI) haplotype and Benin haplotype of sickle cell anemia. Methods and Results Six BCL11A enhancer SNPs and their haplotypes were studied in Saudi Arabs from the Eastern Province and Indian patients with AI haplotype (HbF ~20%), African Americans (HbF ~7%), and Saudi Arabs from the Southwestern Province (HbF ~12%). Four SNPs (rs1427407, rs6706648, rs6738440, and rs7606173) and their haplotypes were consistently associated with HbF levels. The distributions of haplotypes differ in the 3 cohorts but not their genetic effects: the haplotype TCAG was associated with the lowest HbF level and the haplotype GTAC was associated with the highest HbF level, and the difference in HbF levels between carriers of these haplotypes in all cohorts was approximately 6%. Conclusions Common HbF BCL11A enhancer haplotypes in patients of African origin and with AI sickle cell anemia have similar effects on HbF but they do not explain their differences in HbF. PMID:25703683

  18. Association of MBL2 Gene Polymorphism with Dental Caries in Saudi Children.

    PubMed

    Alyousef, Yousef M; Borgio, J Francis; AbdulAzeez, Sayed; Al-Masoud, Naif; Al-Ali, Ali A; Al-Shwaimi, Emad; Al-Ali, Amein K

    2017-01-01

    The high prevalence of dental caries in children worldwide is a major oral health problem which requires early intervention. Dental caries is mainly caused by the action of acids produced by bacteria in addition to many other factors. Recent genetic studies have reported that a number of genes are associated with the susceptibility to dental caries. The majority of these genes are associated with inflammation, increased susceptibility to infection, and dentine matrix formation. Using the TaqMan assay and direct DNA sequencing, the prevalence of 6 single-nucleotide polymorphisms (SNPs) in MMP9, MBL2, MMP2, and TIMP2 genes was determined in 102 children with caries and in 100 age-matched caries-free controls. Out of the 6 SNPs tested in the 4 selected genes, only rs11003125 in the MBL2 gene was shown to be associated with a high prevalence of caries in our cohort. In addition, haplotype analysis of the 6 SNPs tested revealed that certain haplotypes, namely GT of rs11003125G and rs7501477T and GT of rs7096206G and rs7501477T, were found to be associated with a high prevalence of dental caries in our cohort, while haplotype AG of rs17576A and rs7501477G was found to have a protective effect against dental caries. In conclusion, the data indicate that rs11003125 in the MBL2 gene was shown to be associated with a high prevalence of caries in our cohort, and 2 haplotypes are also involved in the increased susceptibility to dental caries. © 2016 S. Karger AG, Basel.

  19. Genetic predisposition for femoral neck stress fractures in military conscripts

    PubMed Central

    2010-01-01

    Background Stress fractures are a significant problem among athletes and soldiers and may result in devastating complications or even permanent handicap. Genetic factors may increase the risk, but no major susceptibility genes have been identified. The purpose of this study was to search for possible genetic factors predisposing military conscripts to femoral neck stress fractures. Results Eight genes involved in bone metabolism or pathology (COL1A1, COL1A2, OPG, ESR1, VDR, CTR, LRP5, IL-6) were examined in 72 military conscripts with a femoral neck stress fracture and 120 controls. The risk of femoral neck stress fracture was significantly higher in subjects with low weight and body mass index (BMI). An interaction between the CTR (rs1801197) minor allele C and the VDR C-A haplotype was observed, and subjects lacking the C allele in CTR and/or the C-A haplotype in VDR had a 3-fold higher risk of stress fracture than subjects carrying both (OR = 3.22, 95% CI 1.38-7.49, p = 0.007). In addition, the LRP5 haplotype A-G-G-C alone and in combination with the VDR haplotype C-A was associated with stress fractures through reduced body weight and BMI. Conclusions Our findings suggest that genetic factors play a role in the development of stress fractures in individuals subjected to heavy exercise and mechanical loading. The present results can be applied to the design of future studies that will further elucidate the genetics of stress fractures. PMID:20961463
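
    The abstract above reports an odds ratio with a 95% confidence interval but not the underlying 2x2 table. As a hedged illustration of how such a figure is usually obtained, the sketch below computes an odds ratio and a Wald (Woolf) interval on the log scale. The function name and the counts in the usage line are hypothetical and are not taken from the study, and the abstract does not state which interval method the authors used.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio and Wald (Woolf) confidence interval for a 2x2 table.

    a: cases with the risk genotype      b: controls with the risk genotype
    c: cases without the risk genotype   d: controls without the risk genotype
    All four counts must be positive. z=1.96 gives an approximate 95% interval.
    """
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) under the usual large-sample approximation.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts, chosen only to make the arithmetic easy to follow.
estimate, lower, upper = odds_ratio_ci(20, 10, 10, 20)
print(f"OR = {estimate:.2f}, 95% CI {lower:.2f}-{upper:.2f}")
```

    An interval that excludes 1, as the one reported in the abstract does, corresponds to a significant association at roughly the 5% level.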

  20. African-American mitochondrial DNAs often match mtDNAs found in multiple African ethnic groups

    PubMed Central

    Ely, Bert; Wilson, Jamie Lee; Jackson, Fatimah; Jackson, Bruce A

    2006-01-01

    Background Mitochondrial DNA (mtDNA) haplotypes have become popular tools for tracing maternal ancestry, and several companies offer this service to the general public. Numerous studies have demonstrated that human mtDNA haplotypes can be used with confidence to identify the continent where the haplotype originated. Ideally, mtDNA haplotypes could also be used to identify a particular country or ethnic group from which the maternal ancestor emanated. However, the geographic distribution of mtDNA haplotypes is greatly influenced by the movement of both individuals and population groups. Consequently, common mtDNA haplotypes are shared among multiple ethnic groups. We have studied the distribution of mtDNA haplotypes among West African ethnic groups to determine how often mtDNA haplotypes can be used to reconnect Americans of African descent to a country or ethnic group of a maternal African ancestor. The nucleotide sequence of the mtDNA hypervariable segment I (HVS-I) usually provides sufficient information to assign a particular mtDNA to the proper haplogroup, and it contains most of the variation that is available to distinguish a particular mtDNA haplotype from closely related haplotypes. In this study, samples of general African-American and specific Gullah/Geechee HVS-I haplotypes were compared with two databases of HVS-I haplotypes from sub-Saharan Africa, and the incidence of perfect matches recorded for each sample. Results When two independent African-American samples were analyzed, more than half of the sampled HVS-I mtDNA haplotypes exactly matched common haplotypes that were shared among multiple African ethnic groups. Another 40% did not match any sequence in the database, and fewer than 10% were an exact match to a sequence from a single African ethnic group. Differences in the regional distribution of haplotypes were observed in the African database, and the African-American haplotypes were more likely to match haplotypes found in ethnic groups from West or West Central Africa than those found in eastern or southern Africa. Fewer than 14% of the African-American mtDNA sequences matched sequences from only West Africa or only West Central Africa. Conclusion Our database of sub-Saharan mtDNA sequences includes the most common haplotypes that are shared among ethnic groups from multiple regions of Africa. These common haplotypes have been found in half of all sub-Saharan Africans. More than 60% of the remaining haplotypes differ from the common haplotypes at a single nucleotide position in the HVS-I region, and they are likely to occur at varying frequencies within sub-Saharan Africa. However, the finding that 40% of the African-American mtDNAs analyzed had no match in the database indicates that only a small fraction of the total number of African haplotypes has been identified. In addition, the finding that fewer than 10% of African-American mtDNAs matched mtDNA sequences from a single African region suggests that few African Americans might be able to trace their mtDNA lineages to a particular region of Africa, and even fewer will be able to trace their mtDNA to a single ethnic group. However, no firm conclusions should be made until a much larger database is available. It is clear, however, that when identical mtDNA haplotypes are shared among many ethnic groups from different parts of Africa, it is impossible to determine which single ethnic group was the source of a particular maternal ancestor based on the mtDNA sequence. PMID:17038170

  1. New HLA haplotype frequency reference standards: high-resolution and large sample typing of HLA DR-DQ haplotypes in a sample of European Americans.

    PubMed

    Klitz, W; Maiers, M; Spellman, S; Baxter-Lowe, L A; Schmeckpeper, B; Williams, T M; Fernandez-Viña, M

    2003-10-01

    In a collaborative study, a large sample of European Americans was typed for the histocompatibility loci of the HLA DR-DQ region and subjected to intensive typing validation measures in order to accurately determine haplotype composition and frequency. The resulting tables have immediate application to HLA typing and allogeneic transplantation. The loci within the DR-DQ region are especially valuable for such an undertaking because of their tight linkage and high linkage disequilibrium. The 3798 haplotypes, derived from 1899 unrelated individuals, had a total of 75 distinct DRB1-DQA1-DQB1 haplotypes. The frequency distribution of the haplotypes was right skewed: 59 haplotypes occurred at a frequency of less than 1%, yet together they constituted less than 12% of the total sample. Given DRB1 typing, it was possible to infer the exact DQA1 and DQB1 composition of a haplotype with high confidence (>90% likelihood) in 21 of the 35 high-resolution DRB1 alleles present in the sample. Of the DRB1 alleles without high reliability for DQ haplotype inference, only *0401, *0701 and *1302 were common, the remaining 11 DRB1 alleles constituting less than 5% of the total sample. This approach failed for the 13 serologically equivalent DR alleles, for which only 33% of DQ haplotypes could be reliably inferred. The 36 DQA1-DQB1 haplotypes present in the total sample conformed to the known pattern of permissible heterodimers. Four DQA1-DQB1 haplotypes, all rare, are reported here for the first time. The haplotype frequency tables are suitable as a reference standard for HLA typing of the DR and DQ loci in European Americans.

  2. Sex differences in TTC12/ANKK1 haplotype associations with daily tobacco smoking in Black and White Americans.

    PubMed

    David, Sean P; Mezuk, Briana; Zandi, Peter P; Strong, David; Anthony, James C; Niaura, Raymond; Uhl, George R; Eaton, William W

    2010-03-01

    The 11q23.1 genomic region has been associated with nicotine dependence in Black and White Americans. By conducting linkage disequilibrium analyses of 7 informative single nucleotide polymorphisms (SNPs) within the tetratricopeptide repeat domain 12 (TTC12)/ankyrin repeat and kinase containing 1 (ANKK1)/dopamine (D2) receptor gene cluster, we identified haplotype block structures in 270 Black and 368 White (n = 638) participants, from the Baltimore Epidemiologic Catchment Area cohort study, spanning the TTC12 and ANKK1 genes consisting of three SNPs (rs2303380-rs4938015-rs11604671). Informative haplotypes were examined for sex-specific associations with daily tobacco smoking initiation and cessation using longitudinal data from 1993-1994 and 2004-2005 interviews. There was a Haplotype x Sex interaction such that Black men possessing the GTG haplotype who were smokers in 1993-2004 were more likely to have stopped smoking by 2004-2005 (55.6% GTG vs. 22.0% other haplotypes), while Black women were less likely to have quit smoking if they possessed the GTG (20.8%) versus other haplotypes (24.0%; p = .028). In Whites, the GTG haplotype (vs. other haplotypes) was associated with lifetime history of daily smoking (smoking initiation; odds ratio = 1.6; 95% CI = 1.1-2.4; p = .013). Moreover, there was a Haplotype x Sex interaction such that there was higher prevalence of smoking initiation with GTG (77.6%) versus other haplotypes (57.0%; p = .043). In 2 different ethnic American populations, we observed man-woman variation in the influence of the rs2303380-rs4938015-rs11604671 GTG haplotype on smoking initiation and cessation. These results should be replicated in larger cohorts to establish the relationship among the rs2303380-rs4938015-rs11604671 haplotype block, sex, and smoking behavior.

  3. Modeling coverage gaps in haplotype frequencies via Bayesian inference to improve stem cell donor selection.

    PubMed

    Louzoun, Yoram; Alter, Idan; Gragert, Loren; Albrecht, Mark; Maiers, Martin

    2018-05-01

    Regardless of sampling depth, accurate genotype imputation is limited in regions of high polymorphism which often have a heavy-tailed haplotype frequency distribution. Many rare haplotypes are thus unobserved. Statistical methods to improve imputation by extending reference haplotype distributions using linkage disequilibrium patterns that relate allele and haplotype frequencies have not yet been explored. In the field of unrelated stem cell transplantation, imputation of highly polymorphic human leukocyte antigen (HLA) genes has an important application in identifying the best-matched stem cell donor when searching large registries totaling over 28,000,000 donors worldwide. Despite these large registry sizes, a significant proportion of searched patients present novel HLA haplotypes. Supporting this observation, HLA population genetic models have indicated that many extant HLA haplotypes remain unobserved. The absent haplotypes are a significant cause of error in haplotype matching. We have applied a Bayesian inference methodology for extending haplotype frequency distributions, using a model where new haplotypes are created by recombination of observed alleles. Applications of this joint probability model offer significant improvement in frequency distribution estimates over the best existing alternative methods, as we illustrate using five-locus HLA frequency data from the National Marrow Donor Program registry. Transplant matching algorithms and disease association studies involving phasing and imputation of rare variants may benefit from this statistical inference framework.

  4. Research to Assembly Scheme for Satellite Deck Based on Robot Flexibility Control Principle

    NASA Astrophysics Data System (ADS)

    Guo, Tao; Hu, Ruiqin; Xiao, Zhengyi; Zhao, Jingjing; Fang, Zhikai

    2018-03-01

    Deck assembly is a critical quality control point in the final satellite assembly process, and cable extrusion and structural collisions during assembly directly affect the development quality and progress of the satellite. To address these problems, this paper proposes an assembly scheme for the satellite deck based on the robot flexibility control principle. First, the scheme is introduced; second, the key technologies of end force perception and flexible docking control in the scheme are studied; then, the implementation process of the assembly scheme is described in detail; finally, an actual application case is given. The results show that, compared with the traditional assembly scheme, the proposed scheme has clear advantages in work efficiency, reliability, and universality.

  5. Kullback-Leibler divergence for detection of rare haplotype common disease association.

    PubMed

    Lin, Shili

    2015-11-01

    Rare haplotypes may tag rare causal variants of common diseases; hence, detection of such rare haplotypes may also contribute to our understanding of complex disease etiology. Because rare haplotypes frequently result from common single-nucleotide polymorphisms (SNPs), focusing on rare haplotypes is much more economical compared with using rare single-nucleotide variants (SNVs) from sequencing, as SNPs are available and 'free' from already amassed genome-wide studies. Further, associated haplotypes may shed light on the underlying disease causal mechanism, a feat unmatched by SNV-based collapsing methods. In recent years, data mining approaches have been adapted to detect rare haplotype association. However, as they rely on an assumed underlying disease model and require the specification of a null haplotype, results can be erroneous if such assumptions are violated. In this paper, we present a haplotype association method based on Kullback-Leibler divergence (hapKL) for case-control samples. The idea is to compare haplotype frequencies for the cases versus the controls by computing symmetrical divergence measures. An important property of such measures is that both the frequencies and logarithms of the frequencies contribute in parallel, thus balancing the contributions from rare and common, and accommodating both deleterious and protective, haplotypes. A simulation study under various scenarios shows that hapKL has well-controlled type I error rates and good power compared with existing data mining methods. Application of hapKL to age-related macular degeneration (AMD) shows a strong association of the complement factor H (CFH) gene with AMD, identifying several individual rare haplotypes with strong signals.
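
    To make the idea of a symmetrical divergence concrete, the sketch below computes the textbook symmetrised Kullback-Leibler (Jeffreys) divergence between case and control haplotype frequencies. Each term is a frequency difference multiplied by a log-frequency difference, which is one way to read the abstract's remark that frequencies and their logarithms contribute in parallel. This is an illustration only: the abstract does not give the exact hapKL statistic or its significance test, and the function name, the pseudo-frequency `eps`, and the example frequencies are assumptions.

```python
import math


def symmetric_kl(case_freqs, control_freqs, eps=1e-9):
    """Jeffreys divergence KL(P||Q) + KL(Q||P) between two frequency dicts.

    case_freqs / control_freqs map haplotype labels to relative frequencies.
    eps is a small pseudo-frequency (an assumption of this sketch) that keeps
    haplotypes unseen in one group from producing log(0).
    """
    haplotypes = set(case_freqs) | set(control_freqs)
    p = {h: case_freqs.get(h, 0.0) + eps for h in haplotypes}
    q = {h: control_freqs.get(h, 0.0) + eps for h in haplotypes}
    p_total, q_total = sum(p.values()), sum(q.values())
    divergence = 0.0
    for h in haplotypes:
        ph, qh = p[h] / p_total, q[h] / q_total
        # (p - q) * (log p - log q) is non-negative for every haplotype.
        divergence += (ph - qh) * (math.log(ph) - math.log(qh))
    return divergence


# Hypothetical frequencies: one rare haplotype is enriched in cases.
cases = {"ACG": 0.60, "ATG": 0.35, "GTA": 0.05}
controls = {"ACG": 0.62, "ATG": 0.37, "GTA": 0.01}
print(symmetric_kl(cases, controls))
```

    In this toy example most of the total comes from the rare haplotype, because its log-frequency ratio is large even though its absolute frequency difference is small.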

  6. Population Structure With Localized Haplotype Clusters

    PubMed Central

    Browning, Sharon R.; Weir, Bruce S.

    2010-01-01

    We propose a multilocus version of FST and a measure of haplotype diversity using localized haplotype clusters. Specifically, we use haplotype clusters identified with BEAGLE, which is a program implementing a hidden Markov model for localized haplotype clustering and performing several functions including inference of haplotype phase. We apply this methodology to HapMap phase 3 data. With this haplotype-cluster approach, African populations have highest diversity and lowest divergence from the ancestral population, East Asian populations have lowest diversity and highest divergence, and other populations (European, Indian, and Mexican) have intermediate levels of diversity and divergence. These relationships accord with expectation based on other studies and accepted models of human history. In contrast, the population-specific FST estimates obtained directly from single-nucleotide polymorphisms (SNPs) do not reflect such expected relationships. We show that ascertainment bias of SNPs has less impact on the proposed haplotype-cluster-based FST than on the SNP-based version, which provides a potential explanation for these results. Thus, these new measures of FST and haplotype-cluster diversity provide an important new tool for population genetic analysis of high-density SNP data. PMID:20457877
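
    As a rough illustration of what a haplotype-based differentiation measure looks like, the sketch below computes a Nei-style G_ST (an FST analogue) from haplotype-cluster frequencies at a single locus. It is not the population-specific, multilocus estimator the authors propose and it does not use BEAGLE output. The cluster labels, frequencies, and function names are hypothetical, and populations are weighted equally.

```python
def heterozygosity(freqs):
    """Expected heterozygosity 1 - sum(p_i^2) for one frequency dict."""
    return 1.0 - sum(f * f for f in freqs.values())


def gst(populations):
    """Nei-style G_ST = (H_T - H_S) / H_T over equally weighted populations.

    populations: non-empty list of dicts mapping cluster label -> frequency.
    """
    n = len(populations)
    clusters = set().union(*populations)
    # Frequencies in the pooled population (unweighted mean across populations).
    mean_freqs = {c: sum(pop.get(c, 0.0) for pop in populations) / n
                  for c in clusters}
    h_s = sum(heterozygosity(pop) for pop in populations) / n  # within populations
    h_t = heterozygosity(mean_freqs)                           # total
    return (h_t - h_s) / h_t if h_t > 0 else 0.0


# Hypothetical haplotype-cluster frequencies for two populations.
pop_a = {"c1": 0.5, "c2": 0.3, "c3": 0.2}
pop_b = {"c1": 0.1, "c2": 0.3, "c3": 0.6}
print(gst([pop_a, pop_b]))
```

    Identical populations give 0 and populations sharing no clusters give 1, which are the two limits any FST-like measure should respect.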

  7. An analysis of variation in the long-range genomic organization of the human major histocompatibility complex class II region by pulsed-field gel electrophoresis.

    PubMed

    Dunham, I; Sargent, C A; Dawkins, R L; Campbell, R D

    1989-11-01

    The class II region of the human major histocompatibility complex in seven common HLA haplotypes has been analyzed using pulsed-field gel electrophoresis, restriction enzymes that cut genomic DNA infrequently, and Southern blotting. This analysis has revealed that there are differences in the amount of DNA present in the DQ and DR subregions dependent on the haplotype. The class II region of the DR3 haplotype spans approximately 750 kb and has the same amount of DNA as the class II region of the DR5 and DR6 haplotypes. However, the DR2 haplotype has approximately 30 kb more DNA within the DR subregion. The DR4 haplotype has an additional approximately 110 kb of DNA within the DQ or DR subregions compared to the DR3, DR5, and DR6 haplotypes. These haplotype-specific differences could have some bearing both on the analysis of disease susceptibility and on the ability of chromosomes possessing different HLA haplotypes to recombine within the DQ/DR subregions.

  8. A Multiple Objective Test Assembly Approach for Exposure Control Problems in Computerized Adaptive Testing

    ERIC Educational Resources Information Center

    Veldkamp, Bernard P.; Verschoor, Angela J.; Eggen, Theo J. H. M.

    2010-01-01

    Overexposure and underexposure of items in the bank are serious problems in operational computerized adaptive testing (CAT) systems. These exposure problems might result in item compromise, or point at a waste of investments. The exposure control problem can be viewed as a test assembly problem with multiple objectives. Information in the test has…

  9. Analysis of the type II robotic mixed-model assembly line balancing problem

    NASA Astrophysics Data System (ADS)

    Çil, Zeynel Abidin; Mete, Süleyman; Ağpak, Kürşad

    2017-06-01

    In recent years, there has been an increasing trend towards using robots in production systems. Robots are used in different areas such as packaging, transportation, loading/unloading and especially assembly lines. One important step in taking advantage of robots on the assembly line is considering them while balancing the line. On the other hand, market conditions have increased the importance of mixed-model assembly lines. Therefore, in this article, the robotic mixed-model assembly line balancing problem is studied. The aim of this study is to develop a new efficient heuristic algorithm based on beam search in order to minimize the sum of cycle times over all models. In addition, mathematical models of the problem are presented for comparison. The proposed heuristic is tested on benchmark problems and compared with the optimal solutions. The results show that the algorithm is very competitive and is a promising tool for further research.

  10. Genome structure and emerging evidence of an incipient sex chromosome in Populus

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yin, Tongming; DiFazio, Stephen P; Gunter, Lee E

    The genus Populus consists of dioecious woody species with largely unknown genetic mechanisms for gender determination. We have discovered genetic and genomic features in the peritelomeric region of chromosome XIX that suggest this region of the Populus genome is in the process of developing characteristics of a sex chromosome. We have identified a gender-associated locus that consistently maps to this region. Furthermore, comparison of genetic maps across multiple Populus families reveals consistently distorted segregation within this region. We have intensively characterized this region using an F1 interspecific cross involving the female genotype that was used for genome sequencing. This region shows suppressed recombination and high divergence between the alternate haplotypes, as revealed by dense map-based genome assembly using microsatellite markers. The suppressed recombination, distorted segregation, and haplotype divergence were observed only for the maternal parent in this cross. Furthermore, the progeny of this cross showed a strongly male-biased sex ratio, in agreement with Haldane's rule that postulates that the heterogametic sex is more likely to be absent, rare, or sterile in interspecific crosses. Together, these results support the role of chromosome XIX in sex determination and suggest that sex determination in Populus occurs through a ZW system in which the female is the heterogametic gender.

  11. Analyses to help identify individuals from a historical mass grave in Kassel, Germany.

    PubMed

    v Grumbkow, Philipp; Zipp, Anna; Grosskopf, Birgit; Fueldner, Kai; Hummel, Susanne

    2012-01-01

    In 2008, the skeletal remains of more than 60 human individuals were found in a mass grave on the grounds of the University of Kassel, Germany. There was no evidence helping to identify them or throwing light on the cause of their death. Mainly due to 14C age determination and initial hints on age and sex distribution, historians hypothesized that they had been soldiers of Napoleon's army who died in an epidemic in the winter of 1813/14. To test this assumption, morphological and molecular analyses were carried out on a sample. The morphological analyses comprised an age and sex determination as well as a macro- and micro-morphological inspection for pathological deviations after the commingled bones had been assembled as individuals. The molecular investigations aimed to identify the geographic origin of the remains. For this, mitochondrial and Y-chromosomal haplotypings were carried out. The results point to a group of mainly young men, some of them suffering from systemic inflammation of the periosteum. Others revealed severe aberrations in bone microstructure. The greatest similarities revealed by Y-haplogroup and -haplotype distribution were to populations that live in what are now the Benelux countries. All aspects support the thesis that these were soldiers of the Napoleonic army.

  12. Extraction of High Molecular Weight DNA from Fungal Rust Spores for Long Read Sequencing.

    PubMed

    Schwessinger, Benjamin; Rathjen, John P

    2017-01-01

    Wheat rust fungi are complex organisms with a complete life cycle that involves two different host plants and five different spore types. During the asexual infection cycle on wheat, rusts produce massive amounts of dikaryotic urediniospores. These spores are dikaryotic (two nuclei) with each nucleus containing one haploid genome. This dikaryotic state is likely to contribute to their evolutionary success, making them some of the major wheat pathogens globally. Despite this, most published wheat rust genomes are highly fragmented and contain very little haplotype-specific sequence information. Current long-read sequencing technologies hold great promise to provide more contiguous and haplotype-phased genome assemblies. Long reads are able to span repetitive regions and phase structural differences between the haplomes. This increased genome resolution enables the identification of complex loci and the study of genome evolution beyond simple nucleotide polymorphisms. Long-read technologies require pure high molecular weight DNA as an input for sequencing. Here, we describe a DNA extraction protocol for rust spores that yields pure double-stranded DNA molecules with molecular weight of >50 kilo-base pairs (kbp). The isolated DNA is of sufficient purity for PacBio long-read sequencing, but may require additional purification for other sequencing technologies such as Nanopore and 10× Genomics.

  13. Novel strategies to mine alcoholism-related haplotypes and genes by combining existing knowledge framework.

    PubMed

    Zhang, RuiJie; Li, Xia; Jiang, YongShuai; Liu, GuiYou; Li, ChuanXing; Zhang, Fan; Xiao, Yun; Gong, BinSheng

    2009-02-01

    High-throughput single nucleotide polymorphism detection technology and existing knowledge provide strong support for mining disease-related haplotypes and genes. In this study, first, we apply four haplotype block identification methods (Confidence Intervals, Four Gamete Tests, Solid Spine of LD and a fusing method of haplotype blocks) to high-throughput SNP genotype data to identify blocks, then use cluster analysis to verify the effectiveness of the four methods, and select the alcoholism-related SNP haplotypes through risk analysis. Second, we establish a mapping from haplotypes to alcoholism-related genes. Third, we query the NCBI SNP and gene databases to locate the blocks and identify the candidate genes. Finally, we perform gene function annotation using the KEGG, Biocarta, and GO databases. We find 159 haplotype blocks most likely related to alcoholism on chromosomes 1 to 22, including 227 haplotypes, of which 102 SNP haplotypes may increase the risk of alcoholism. We obtain 121 alcoholism-related genes and verify their reliability by functional annotation. In summary, combining our novel strategies for mining alcoholism-related haplotypes and genes with the existing knowledge framework makes it possible both to handle the SNP data easily and to locate the disease-related genes precisely.

  14. Cis-acting mutation and duplication: History of molecular evolution in a P450 haplotype responsible for insecticide resistance in Culex quinquefasciatus.

    PubMed

    Itokawa, Kentaro; Komagata, Osamu; Kasai, Shinji; Masada, Masahiro; Tomita, Takashi

    2011-07-01

    A cytochrome P450 gene, Cyp9m10, is more than 200-fold overexpressed in a pyrethroid resistant strain of Culex quinquefasciatus, JPal-per. The haplotype of this strain contains two copies of Cyp9m10 resulting from a recent tandem duplication. In this study, we discovered and isolated a Cyp9m10 haplotype closely related to this duplicated Cyp9m10 haplotype from JHB, a strain used for the recent genome project for this mosquito species. The isolated haplotype (JHB-NIID-B haplotype) shared the same insertion of a transposable element upstream of the coding region with the JPal-per strain but was not duplicated. The JHB-NIID-B haplotype was considered to have diverged from the JPal-per lineage just before the duplication event. Cyp9m10 was moderately overexpressed in larvae with the JHB-NIID-B haplotype. The overexpression in the JHB-NIID-B and JPal-per haplotypes was developmentally regulated in a similar pattern, indicating that both haplotypes share a common cis-acting mutation responsible for the overexpression. The isolated moderately overexpressed haplotype conferred resistance; however, its efficacy was relatively small. We hypothesized that the first cis-acting mutation modified the consequence of the subsequent duplication in the JPal-per lineage, so that the duplication conferred a stronger phenotypic effect than it would have if it had occurred before the first cis-acting mutation. Copyright © 2011 Elsevier Ltd. All rights reserved.

  15. A Candidate Trans-acting Modulator of Fetal Hemoglobin Gene Expression in the Arab-Indian Haplotype of Sickle Cell Anemia

    PubMed Central

    Vathipadiekal, Vinod; Farrell, John J.; Wang, Shuai; Edward, Heather L.; Shappell, Heather; Al-Rubaish, A.M.; Al-Muhanna, Fahad; Naserullah, Z.; Alsuliman, A.; Qutub, Hatem Othman; Simkin, Irene; Farrer, Lindsay A.; Jiang, Zhihua; Luo, Hong-Yuan; Huang, Shengwen; Mostoslavsky, Gustavo; Murphy, George J.; Patra, Pradeep K.; Chui, David H.K.; Alsultan, Abdulrahman; Al-Ali, Amein K.; Sebastiani, Paola; Steinberg, Martin H.

    2016-01-01

    Fetal hemoglobin (HbF) levels are higher in the Arab-Indian (AI) β-globin gene haplotype of sickle cell anemia compared with African-origin haplotypes. To study genetic elements that affect HbF expression in the AI haplotype we completed whole genome sequencing in 14 Saudi AI haplotype sickle hemoglobin homozygotes: seven selected for low HbF (8.2±1.3%) and seven selected for high HbF (23.5±2.6%). An intronic single nucleotide polymorphism (SNP) in ANTXR1, an anthrax toxin receptor (chromosome 2p13), was associated with HbF. These results were replicated in two independent Saudi AI haplotype cohorts of 120 and 139 patients, but not in 76 Saudi Benin haplotype, 894 African origin haplotype and 44 Arab Indian haplotype patients of Indian descent, suggesting that this association is effective only in the Saudi AI haplotype background. ANTXR1 variants explained 10% of the HbF variability compared with 8% for BCL11A. These two genes had independent, additive effects on HbF and together explained about 15% of HbF variability in Saudi AI sickle cell anemia patients. ANTXR1 was expressed at mRNA and protein levels in erythroid progenitors derived from induced pluripotent stem cells (iPSCs) and CD34+ cells. As CD34+ cells matured and their HbF decreased, ANTXR1 expression increased; as iPSCs differentiated and their HbF increased, ANTXR1 expression decreased. Along with elements in cis to the HbF genes, ANTXR1 contributes to the variation in HbF in Saudi AI haplotype sickle cell anemia and is the first gene in trans to HBB that is associated with HbF only in carriers of the Saudi AI haplotype. PMID:27501013

  16. Complement factor H polymorphisms in Japanese population with age-related macular degeneration.

    PubMed

    Okamoto, Haru; Umeda, Shinsuke; Obazawa, Minoru; Minami, Masayoshi; Noda, Toru; Mizota, Atsushi; Honda, Miki; Tanaka, Minoru; Koyama, Risa; Takagi, Ikue; Sakamoto, Yoshihiro; Saito, Yoshihiro; Miyake, Yozo; Iwata, Takeshi

    2006-03-06

    To study the frequency of five haplotypes previously reported in the complement factor H (CFH) gene for Japanese patients with age-related macular degeneration (AMD). Genomic DNA was isolated from peripheral blood samples taken from 96 Japanese AMD patients and 89 age-matched controls. All patients were diagnosed as having exudative (wet-type) AMD. The amplified polymerase chain reaction (PCR) products of CFH exons 2, 9, and 13, and intron 6 were analyzed by temperature gradient capillary electrophoresis (TGCE) and by direct sequencing. The haplotypes were identified, and their frequencies were calculated and compared with reported results. Five haplotypes were identified in the Japanese population including four already reported in the American population. The frequencies of these haplotypes were significantly different between Japanese and American in both control and case groups. The haplotype containing Y402H, which was previously reported to be associated with AMD, was only 4% in the control and case population, with a p value of 0.802. However, two other haplotypes were found as risk factors, which gave an increased likelihood of AMD of 1.9 and 2.5 fold (95% CI 1.12-3.69 and 1.42-6.38). One protective haplotype that decreased the likelihood of AMD by 1.6 fold (95% CI 0.26-0.67) was identified. The frequencies for five haplotypes previously identified were analyzed in a Japanese population with AMD. Four previously found haplotypes were identified and one additional haplotype was found. The frequencies of each haplotype were significantly different from those found in Americans affected with AMD. Two of the haplotypes were identified as risk factors and one was considered protective.
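
    The fold changes and 95% confidence intervals quoted above can be read as odds ratios from a case/control table. The sketch below shows one common way to obtain such figures, a Woolf (log-scale) interval on a 2x2 table. The abstract does not state which method the authors used, so this is an assumption; the function name and the counts are hypothetical and are not taken from the study.

```python
import math


def odds_ratio_ci(case_with, case_without, ctrl_with, ctrl_without, z=1.96):
    """Odds ratio and Woolf (log-scale) confidence interval for a 2x2 table.

    Arguments are counts of chromosomes with/without the haplotype in
    cases and controls. z=1.96 gives an approximate 95% interval.
    """
    cells = (case_with, case_without, ctrl_with, ctrl_without)
    if min(cells) <= 0:
        raise ValueError("all four cell counts must be positive")
    odds_ratio = (case_with * ctrl_without) / (case_without * ctrl_with)
    # Standard error of log(OR) is sqrt(1/a + 1/b + 1/c + 1/d).
    se_log = math.sqrt(sum(1.0 / c for c in cells))
    low = math.exp(math.log(odds_ratio) - z * se_log)
    high = math.exp(math.log(odds_ratio) + z * se_log)
    return odds_ratio, low, high


# Hypothetical counts for illustration only (not data from the study).
or_, low, high = odds_ratio_ci(20, 10, 10, 20)
print(f"OR = {or_:.2f}, 95% CI {low:.2f}-{high:.2f}")
```

    An interval that lies entirely above 1 suggests a risk haplotype, and one entirely below 1 a protective haplotype. A protective effect is sometimes quoted as the reciprocal of the odds ratio, which may be how a "1.6 fold" decrease sits next to an interval below 1.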

  17. HaploForge: a comprehensive pedigree drawing and haplotype visualization web application.

    PubMed

    Tekman, Mehmet; Medlar, Alan; Mozere, Monika; Kleta, Robert; Stanescu, Horia

    2017-12-15

    Haplotype reconstruction is an important tool for understanding the aetiology of human disease. Haplotyping infers the most likely phase of observed genotypes conditional on constraints imposed by the genotypes of other pedigree members. The results of haplotype reconstruction, when visualized appropriately, show which alleles are identical by descent despite the presence of untyped individuals. When used in concert with linkage analysis, haplotyping can help delineate a locus of interest and provide a succinct explanation for the transmission of the trait locus. Unfortunately, the design choices made by existing haplotype visualization programs do not scale to large numbers of markers. Indeed, following haplotypes from generation to generation requires excessive scrolling back and forth. In addition, the most widely used program for haplotype visualization produces inconsistent recombination artefacts for the X chromosome. To resolve these issues, we developed HaploForge, a novel web application for haplotype visualization and pedigree drawing. HaploForge takes advantage of HTML5 to be fast, portable and avoid the need for local installation. It can accurately visualize autosomal and X-linked haplotypes from both outbred and consanguineous pedigrees. Haplotypes are coloured based on identity by descent using a novel A* search algorithm and we provide a flexible viewing mode to aid visual inspection. HaploForge can currently process haplotype reconstruction output from Allegro, GeneHunter, Merlin and Simwalk. HaploForge is licensed under GPLv3 and is hosted and maintained via GitHub. https://github.com/mtekman/haploforge. r.kleta@ucl.ac.uk. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  18. Influence of promoter/enhancer region haplotypes on MGMT transcriptional regulation: a potential biomarker for human sensitivity to alkylating agents.

    PubMed

    Xu, Meixiang; Nekhayeva, Ilona; Cross, Courtney E; Rondelli, Catherine M; Wickliffe, Jeffrey K; Abdel-Rahman, Sherif Z

    2014-03-01

    The O6-methylguanine-DNA methyltransferase gene (MGMT) encodes the direct reversal DNA repair protein that removes alkyl adducts from the O6 position of guanine. Several single-nucleotide polymorphisms (SNPs) exist in the MGMT promoter/enhancer (P/E) region. However, the haplotype structure encompassing these SNPs and their functional/biological significance are currently unknown. We hypothesized that MGMT P/E haplotypes, rather than individual SNPs, alter MGMT transcription and can thus alter human sensitivity to alkylating agents. To identify the haplotype structure encompassing the MGMT P/E region SNPs, we sequenced 104 DNA samples from healthy individuals and inferred the haplotypes using the data generated. We identified eight SNPs in this region, namely T7C (rs180989103), T135G (rs1711646), G290A (rs61859810), C485A (rs1625649), C575A (rs113813075), G666A (rs34180180), C777A (rs34138162) and C1099T (rs16906252). Phylogenetics and Sequence Evolution analysis predicted 21 potential haplotypes that encompass these SNPs ranging in frequencies from 0.000048 to 0.39. Of these, 10 were identified in our study population as 20 paired haplotype combinations. To determine the functional significance of these haplotypes, luciferase reporter constructs representing these haplotypes were transfected into glioblastoma cells and their effect on MGMT promoter activity was determined. Compared with the most common (reference) haplotype 1, seven haplotypes significantly upregulated MGMT promoter activity (18-119% increase; P < 0.05), six significantly downregulated MGMT promoter activity (29-97% decrease; P < 0.05) and one haplotype had no effect. Mechanistic studies conducted support the conclusion that MGMT P/E haplotypes, rather than individual SNPs, differentially regulate MGMT transcription and could thus play a significant role in human sensitivity to environmental and therapeutic alkylating agents.

  19. Haplotype-Based Genotyping in Polyploids.

    PubMed

    Clevenger, Josh P; Korani, Walid; Ozias-Akins, Peggy; Jackson, Scott

    2018-01-01

    Accurate identification of polymorphisms from sequence data is crucial to unlocking the potential of high throughput sequencing for genomics. Single nucleotide polymorphisms (SNPs) are difficult to accurately identify in polyploid crops due to the duplicative nature of polyploid genomes leading to low confidence in the true alignment of short reads. Implementing a haplotype-based method in contrasting subgenome-specific sequences leads to higher accuracy of SNP identification in polyploids. To test this method, a large-scale 48K SNP array (Axiom Arachis2) was developed for Arachis hypogaea (peanut), an allotetraploid, in which 1,674 haplotype-based SNPs were included. Results of the array show that 74% of the haplotype-based SNP markers could be validated, which is considerably higher than previous methods used for peanut. The haplotype method has been implemented in a standalone program, HAPLOSWEEP, which takes as input bam files and a vcf file and identifies haplotype-based markers. Haplotype discovery can be made within single reads or span paired reads, and can leverage long read technology by targeting any length of haplotype. Haplotype-based genotyping is applicable in all allopolyploid genomes and provides confidence in marker identification and in silico-based genotyping for polyploid genomics.

  20. Identification of parental line specific effects of MLF2 on resistance to coccidiosis in chickens

    PubMed Central

    2011-01-01

    Background MLF2 was a candidate gene associated with coccidiosis resistance in chickens. Although single marker analysis supported the association between MLF2 and coccidiosis resistance, a causative mutation relevant to coccidiosis has not yet been identified. Thus, this study carried out segregation analysis of MLF2 haplotypes and association tests of the other candidate genes using improved data transformation. Results Of the 4 major haplotypes of MLF2, one probably originated from a single parental line. The frequency of this haplotype was 0.2 in parental chickens and their offspring in 12 families. The allele substitution effect of the MLF2 haplotype that originated from a specific line was associated with increased body weight and fecal egg count, explaining coccidiosis resistance. Although Box-Cox transformation was able to improve normality, the association test did not produce obviously different results compared with analysis of the log-transformed phenotype. Conclusion Allele substitution effect analysis and classification of MLF2 haplotypes identified the segregation of a haplotype associated with coccidiosis resistance. The haplotype that originated from a specific parental line was associated with improved disease resistance. Estimating the effect of the MLF2 haplotype on coccidiosis resistance will provide useful information for selecting animals or lines for future study. PMID:21645301

  1. Haplotype diversity in 11 candidate genes across four populations.

    PubMed

    Beaty, T H; Fallin, M D; Hetmanski, J B; McIntosh, I; Chong, S S; Ingersoll, R; Sheng, X; Chakraborty, R; Scott, A F

    2005-09-01

    Analysis of haplotypes based on multiple single-nucleotide polymorphisms (SNP) is becoming common for both candidate gene and fine-mapping studies. Before embarking on studies of haplotypes from genetically distinct populations, however, it is important to consider variation both in linkage disequilibrium (LD) and in haplotype frequencies within and across populations, as both vary. Such diversity will influence the choice of "tagging" SNPs for candidate gene or whole-genome association studies because some markers will not be polymorphic in all samples and some haplotypes will be poorly represented or completely absent. Here we analyze 11 genes, originally chosen as candidate genes for oral clefts, where multiple markers were genotyped on individuals from four populations. Estimated haplotype frequencies, measures of pairwise LD, and genetic diversity were computed for 135 European-Americans, 57 Chinese-Singaporeans, 45 Malay-Singaporeans, and 46 Indian-Singaporeans. Patterns of pairwise LD were compared across these four populations and haplotype frequencies were used to assess genetic variation. Although these populations are fairly similar in allele frequencies and overall patterns of LD, both haplotype frequencies and genetic diversity varied significantly across populations. Such haplotype diversity has implications for designing studies of association involving samples from genetically distinct populations.
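
    For readers unfamiliar with the "measures of pairwise LD" mentioned above, the sketch below computes three widely used ones (D, D' and r²) from phased two-SNP haplotypes. The abstract does not say which measures were used, so the choice here is an assumption. The function name and the 0/1 allele encoding are illustrative only.

```python
def pairwise_ld(haplotypes):
    """Return (D, D', r^2) for two biallelic SNPs.

    haplotypes: sequence of (a, b) pairs, where a and b are 0 or 1 and
    give the allele carried at the first and second SNP on one chromosome.
    """
    n = len(haplotypes)
    p_a = sum(h[0] for h in haplotypes) / n   # frequency of allele 1 at SNP 1
    p_b = sum(h[1] for h in haplotypes) / n   # frequency of allele 1 at SNP 2
    p_ab = sum(1 for h in haplotypes if h[0] == 1 and h[1] == 1) / n

    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        # LD is undefined when either marker is monomorphic in the sample.
        raise ValueError("both SNPs must be polymorphic in this sample")

    d = p_ab - p_a * p_b
    r2 = d * d / denom
    # D' scales D by its maximum possible magnitude given the allele frequencies.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    return d, abs(d) / d_max, r2


# Two SNPs in complete LD: only the 1-1 and 0-0 haplotypes occur.
print(pairwise_ld([(1, 1)] * 4 + [(0, 0)] * 4))
```

    The error raised for monomorphic markers mirrors the abstract's caution that some markers will not be polymorphic in every population, in which case LD for that pair cannot be estimated in that sample.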

  2. Detecting structure of haplotypes and local ancestry

    USDA-ARS's Scientific Manuscript database

    We present a two-layer hidden Markov model to detect the structure of haplotypes for unrelated individuals. This allows us to model two scales of linkage disequilibrium (one within a group of haplotypes and one between groups), thereby taking advantage of rich haplotype information to infer local an...

  3. Reconstruction of Haplotype-Blocks Selected during Experimental Evolution.

    PubMed

    Franssen, Susanne U; Barton, Nicholas H; Schlötterer, Christian

    2017-01-01

    The genetic analysis of experimentally evolving populations typically relies on short reads from pooled individuals (Pool-Seq). While this method provides reliable allele frequency estimates, the underlying haplotype structure remains poorly characterized. With small population sizes and adaptive variants that start from low frequencies, the interpretation of selection signatures in most Evolve and Resequencing studies remains challenging. To facilitate the characterization of selection targets, we propose a new approach that reconstructs selected haplotypes from replicated time series, using Pool-Seq data. We identify selected haplotypes through the correlated frequencies of alleles carried by them. Computer simulations indicate that selected haplotype-blocks of several Mb can be reconstructed with high confidence and low error rates, even when allele frequencies change only by 20% across three replicates. Applying this method to real data from D. melanogaster populations adapting to a hot environment, we identify a selected haplotype-block of 6.93 Mb. We confirm the presence of this haplotype-block in evolved populations by experimental haplotyping, demonstrating the power and accuracy of our haplotype reconstruction from Pool-Seq data. We propose that the combination of allele frequency estimates with haplotype information will provide the key to understanding the dynamics of adaptive alleles. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
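
    The core idea above, that alleles sitting on the same selected haplotype show correlated frequency changes, can be illustrated with a deliberately simplified sketch. It groups SNPs whose frequency trajectories have a Pearson correlation above a threshold, using single-linkage grouping. This is not the authors' published procedure. The function names, the threshold and the toy frequencies are all hypothetical.

```python
def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # a flat trajectory carries no correlation signal
    return cov / (var_x * var_y) ** 0.5


def group_correlated_snps(freqs, min_r=0.9):
    """Group SNPs whose allele-frequency trajectories are highly correlated.

    freqs: dict mapping SNP id -> list of allele frequencies, concatenated
    over replicates and time points in the same order for every SNP.
    Returns a sorted list of groups (single-linkage via union-find).
    """
    ids = list(freqs)
    parent = {s: s for s in ids}

    def find(s):
        while parent[s] != s:
            parent[s] = parent[parent[s]]  # path halving
            s = parent[s]
        return s

    for i, a in enumerate(ids):
        for b in ids[i + 1:]:
            if pearson(freqs[a], freqs[b]) >= min_r:
                parent[find(a)] = find(b)

    groups = {}
    for s in ids:
        groups.setdefault(find(s), []).append(s)
    return sorted(groups.values())


# Toy trajectories (hypothetical): snp1 and snp2 rise together, snp3 does not.
toy = {
    "snp1": [0.10, 0.20, 0.30, 0.40],
    "snp2": [0.12, 0.22, 0.31, 0.42],
    "snp3": [0.50, 0.40, 0.45, 0.50],
}
print(group_correlated_snps(toy))
```

    Real Pool-Seq data would also need handling of sampling noise and of physical linkage along the chromosome. The pairwise loop is quadratic in the number of SNPs, so it only suits small illustrations.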

  4. Native and European haplotypes of Phragmites Australis (common reed) in the central Platte River, Nebraska

    USGS Publications Warehouse

    Larson, D.L.; Galatowitsch, S.M.; Larson, J.L.

    2011-01-01

    Phragmites australis (common reed) is known to have occurred along the Platte River historically, but recent rapid increases in both distribution and density have begun to impact habitat for migrating sandhill cranes and nesting piping plovers and least terns. Invasiveness in Phragmites has been associated with the incursion of a European genotype (haplotype M) in other areas; determining the genotype of Phragmites along the central Platte River has implications for proper management of the river system. In 2008 we sampled Phragmites patches along the central Platte River from Lexington to Chapman, NE, stratified by bridge segments, to determine the current distribution of haplotype E (native) and haplotype M genotypes. In addition, we did a retrospective analysis of historical Phragmites collections from the central Platte watershed (1902-2006) at the Bessey Herbarium. Fresh tissue from the 2008 survey and dried tissue from the herbarium specimens were classified as haplotype M or E using the restriction fragment length polymorphism procedure. The European haplotype was predominant in the 2008 samples: only 14 Phragmites shoots were identified as native haplotype E; 224 were non-native haplotype M. The retrospective analysis revealed primarily native haplotype individuals. Only collections made in Lancaster County, near Lincoln, NE, were haplotype M, and the earliest of these was collected in 1973. © 2011 Copyright by the Center for Great Plains Studies, University of Nebraska-Lincoln.

  5. Transport synthetic acceleration for long-characteristics assembly-level transport problems

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zika, M.R.; Adams, M.L.

    2000-02-01

    The authors apply the transport synthetic acceleration (TSA) scheme to the long-characteristics spatial discretization for the two-dimensional assembly-level transport problem. This synthetic method employs a simplified transport operator as its low-order approximation. Thus, in the acceleration step, the authors take advantage of features of the long-characteristics discretization that make it particularly well suited to assembly-level transport problems. The main contribution is to address difficulties unique to the long-characteristics discretization and produce a computationally efficient acceleration scheme. The combination of the long-characteristics discretization, opposing reflecting boundary conditions (which are present in assembly-level transport problems), and TSA presents several challenges. The authors devise methods for overcoming each of them in a computationally efficient way. Since the boundary angular data exist on different grids in the high- and low-order problems, they define restriction and prolongation operations specific to the method of long characteristics to map between the two grids. They implement the conjugate gradient (CG) method in the presence of opposing reflection boundary conditions to solve the TSA low-order equations. The CG iteration may be applied only to symmetric positive definite (SPD) matrices; they prove that the long-characteristics discretization yields an SPD matrix. They present results of the acceleration scheme on a simple test problem, a typical pressurized water reactor assembly, and a typical boiling water reactor assembly.
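
    As background to the SPD requirement mentioned above, here is a minimal sketch of the textbook conjugate gradient iteration on a small dense matrix. It is not the TSA low-order solver described in the record. The matrix, right-hand side and function names are illustrative assumptions.

```python
def dot(u, v):
    """Inner product of two vectors stored as lists."""
    return sum(a * b for a, b in zip(u, v))


def matvec(a, v):
    """Dense matrix-vector product; a is a list of rows."""
    return [dot(row, v) for row in a]


def conjugate_gradient(a, b, tol=1e-10, max_iter=1000):
    """Solve a x = b for a symmetric positive definite matrix a."""
    x = [0.0] * len(b)
    r = list(b)            # residual b - a x for the zero initial guess
    p = list(r)            # first search direction
    rs_old = dot(r, r)
    for _ in range(max_iter):
        if rs_old ** 0.5 < tol:
            break
        ap = matvec(a, p)
        # dot(p, ap) > 0 is guaranteed only when a is SPD.
        alpha = rs_old / dot(p, ap)
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * api for ri, api in zip(r, ap)]
        rs_new = dot(r, r)
        beta = rs_new / rs_old
        p = [ri + beta * pi for ri, pi in zip(r, p)]
        rs_old = rs_new
    return x


# Small SPD example: the exact solution is x = (1/11, 7/11).
print(conjugate_gradient([[4.0, 1.0], [1.0, 3.0]], [1.0, 2.0]))
```

    The step length divides by the quadratic form p·(A p). Positive definiteness keeps that quantity strictly positive for every nonzero search direction, which is why a proof that the discretized operator is SPD is needed before CG can be applied safely.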

  6. Mice, humans and haplotypes--the hunt for disease genes in SLE.

    PubMed

    Rigby, R J; Fernando, M M A; Vyse, T J

    2006-09-01

    Defining the polymorphisms that contribute to the development of complex genetic disease traits is a challenging, although increasingly tractable problem. Historically, the technical difficulties in conducting association studies across the entire human genome are such that murine models have been used to generate candidate genes for analysis in human complex diseases, such as SLE. In this article we discuss the advantages and disadvantages of this approach and specifically address some assumptions made in the transition from studying one species to another, using lupus as an example. These issues include differences in genetic structure and genetic organisation which are a reflection on the population history. Clearly there are major differences in the histories of the human population and inbred laboratory strains of mice. Both human and murine genomes do exhibit structure at the genetic level. That is to say, they comprise haplotypes which are genomic regions that carry runs of polymorphisms that are not independently inherited. Haplotypes therefore reduce the number of combinations of the polymorphisms in the DNA in that region and facilitate the identification of disease susceptibility genes in both mice and humans. There are now novel means of generating candidate genes in SLE using mutagenesis (with ENU) in mice and identifying mice that generate antinuclear autoimmunity. In addition, murine models still provide a valuable means of exploring the functional consequences of genetic variation. However, advances in technology are such that human geneticists can now screen large fractions of the human genome for disease associations using microchip technologies that provide information on upwards of 100,000 different polymorphisms. These approaches are aimed at identifying haplotypes that carry disease susceptibility mutations and rely less on the generation of candidate genes.

  7. Mapping of the Pim-1 oncogene in mouse t-haplotypes and its use to define the relative map positions of the tcl loci t0(t6) and tw12 and the marker tf (tufted).

    PubMed

    Ark, B; Gummere, G; Bennett, D; Artzt, K

    1991-06-01

    Pim-1 is an oncogene activated in mouse T-cell lymphomas induced by Moloney and AKR mink cell focus (MCF) viruses. Pim-1 was previously mapped to chromosome 17 by somatic cell hybrids, and subsequently to the region between the hemoglobin alpha-chain pseudogene 4 (Hba-4ps) and the alpha-crystalline gene (Crya-1) by Southern blot analysis of DNA obtained from panels of recombinant inbred strains. We have now mapped Pim-1 more accurately in t-haplotypes by analysis of recombinant t-chromosomes. The recombinants were derived from Tts6tf/t12 parents backcrossed to + tf/ + tf, and scored for recombination between the loci of T and tf. For simplicity all t-complex lethal genes properly named tcl-tx are shortened to tx. The Pim-1 gene was localized 0.6 cM proximal to the tw12 lethal gene, thus placing the Pim-1 gene 5.2 cM distal to the H-2 region in t-haplotypes. Once mapped, the Pim-1 gene was used as a marker for further genetic analysis of t-haplotypes. tw12 is so close to tf that even with a large number of recombinants it was not possible to determine whether it is proximal or distal to tf. Southern blot analysis of DNA from T-tf recombinants with a separation of tw12 and tf indicated that tw12 is proximal to tf. The mapping of two allelic t-lethals, t0 and t6 with respect to tw12 and tf has also been a problem.(ABSTRACT TRUNCATED AT 250 WORDS)

  8. Associations Between Neurotransmitter Genes and Fatigue and Energy Levels in Women Following Breast Cancer Surgery

    PubMed Central

    Eshragh, Jasmine; Dhruva, Anand; Paul, Steven M.; Cooper, Bruce A.; Mastick, Judy; Hamolsky, Deborah; Levine, Jon D.; Miaskowski, Christine; Kober, Kord M.

    2016-01-01

    Context: Fatigue is a common problem in oncology patients. Less is known about decrements in energy levels and the mechanisms that underlie both fatigue and energy. Objectives: In patients with breast cancer, variations in neurotransmitter genes between Lower and Higher Fatigue latent classes and between the Higher and Lower Energy latent classes were evaluated. Methods: Patients completed assessments prior to and monthly for 6 months following surgery. Growth mixture modeling was used to identify distinct latent classes for fatigue severity and energy levels. Thirty candidate genes involved in various aspects of neurotransmission were evaluated. Results: Eleven single nucleotide polymorphisms (SNPs) or haplotypes (i.e., ADRB2 rs1042718, BDNF rs6265, COMT rs9332377, CYP3A4 rs4646437, GALR1 rs949060, GCH1 rs3783642, NOS1 rs9658498, NOS1 rs2293052, NPY1R Haplotype A04, SLC6A2 rs17841327 and 5HTTLPR + rs25531 in SLC6A4) were associated with latent class membership for fatigue. Seven SNPs or haplotypes (i.e., NOS1 rs471871, SLC6A1 rs2675163, SLC6A1 Haplotype D01, SLC6A2 rs36027, SLC6A3 rs37022, SLC6A4 rs2020942, and TAC1 rs2072100) were associated with latent class membership for energy. Three of thirteen genes (i.e., NOS1, SLC6A2, SLC6A4) were associated with latent class membership for both fatigue and energy. Conclusions: Molecular findings support the hypothesis that fatigue and energy are distinct, yet related symptoms. Results suggest that a large number of neurotransmitters play a role in the development and maintenance of fatigue and energy levels in breast cancer patients. PMID:27720787

  9. MGMT DNA repair gene promoter/enhancer haplotypes alter transcription factor binding and gene expression.

    PubMed

    Xu, Meixiang; Cross, Courtney E; Speidel, Jordan T; Abdel-Rahman, Sherif Z

    2016-10-01

    The O6-methylguanine-DNA methyltransferase (MGMT) protein removes O6-alkyl-guanine adducts from DNA. MGMT expression can thus alter the sensitivity of cells and tissues to environmental and chemotherapeutic alkylating agents. Previously, we defined the haplotype structure encompassing single nucleotide polymorphisms (SNPs) in the MGMT promoter/enhancer (P/E) region and found that haplotypes, rather than individual SNPs, alter MGMT promoter activity. The exact mechanism(s) by which these haplotypes exert their effect on MGMT promoter activity is currently unknown, but we noted that many of the SNPs comprising the MGMT P/E haplotypes are located within or in close proximity to putative transcription factor binding sites. Thus, these haplotypes could potentially affect transcription factor binding and, subsequently, alter MGMT promoter activity. In this study, we test the hypothesis that MGMT P/E haplotypes affect MGMT promoter activity by altering transcription factor (TF) binding to the P/E region. We used a promoter binding TF profiling array and a reporter assay to evaluate the effect of different P/E haplotypes on TF binding and MGMT expression, respectively. Our data revealed a significant difference in TF binding profiles between the different haplotypes evaluated. We identified TFs that consistently showed significant haplotype-dependent binding alterations (p ≤ 0.01) and revealed their role in regulating MGMT expression using siRNAs and a dual-luciferase reporter assay system. The data generated support our hypothesis that promoter haplotypes alter the binding of TFs to the MGMT P/E and, subsequently, affect their regulatory function on MGMT promoter activity and expression level.

  10. Phylogenetic status of brown trout Salmo trutta populations in five rivers from the southern Caspian Sea and two inland lake basins, Iran: a morphogenetic approach.

    PubMed

    Hashemzadeh Segherloo, I; Farahmand, H; Abdoli, A; Bernatchez, L; Primmer, C R; Swatdipong, A; Karami, M; Khalili, B

    2012-10-01

    Interrelationships, origin and phylogenetic affinities of brown trout Salmo trutta populations from the southern Caspian Sea basin, Orumieh and Namak Lake basins in Iran were analysed from complete mtDNA control region sequences, 12 microsatellite loci and morphological characters. Among 129 specimens from six populations, seven haplotypes were observed. Based on mtDNA haplotype data, the Orumieh and southern Caspian populations did not differ significantly, but the Namak basin-Karaj population presented a unique haplotype closely related to the haplotypes of the other populations (0·1% Kimura two-parameter, K2P divergence). All Iranian haplotypes clustered as a distinct group within the Danube phylogenetic grouping, with an average K2P distance of 0·41% relative to other Danubian haplotypes. The Karaj haplotype in the Namak basin was related to a haplotype (Da26) formerly identified in the Tigris basin in Turkey, to a Salmo trutta oxianus haplotype from the Aral Sea basin, and to haplotype Da1a with two mutational steps, as well as to other Iranian haplotypes with one to two mutational steps, which may indicate a centre of origin in the Caspian basin. In contrast to results of the mtDNA analysis, more pronounced differentiation was observed among the populations studied in the morphological and microsatellite DNA data, except for the two populations from the Orumieh basin, which were similar, possibly due to anthropogenic causes. © 2012 The Authors. Journal of Fish Biology © 2012 The Fisheries Society of the British Isles.
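
    The Kimura two-parameter (K2P) divergences quoted above can be illustrated with a minimal sketch. It assumes two pre-aligned sequences of equal length and skips any site that is not A, C, G or T; the function name `k2p_distance` is hypothetical and this is not the software used in the study.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq_a: str, seq_b: str) -> float:
    """Kimura two-parameter distance between two aligned sequences.

    d = -1/2 * ln((1 - 2P - Q) * sqrt(1 - 2Q)), where P and Q are the
    proportions of transitions and transversions among compared sites.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = transitions = transversions = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue  # skip gaps and ambiguous bases
        compared += 1
        if a == b:
            continue
        pair = {a, b}
        if pair <= PURINES or pair <= PYRIMIDINES:
            transitions += 1    # A<->G or C<->T
        else:
            transversions += 1  # purine <-> pyrimidine
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))
```

    For closely related haplotypes the K2P distance is close to the raw proportion of differing sites; the correction only becomes noticeable as divergence grows.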

  11. Polymorphism at Expressed DQ and DR Loci in Five Common Equine MHC Haplotypes

    PubMed Central

    Miller, Donald; Tallmadge, Rebecca L.; Binns, Matthew; Zhu, Baoli; Mohamoud, Yasmin Ali; Ahmed, Ayeda; Brooks, Samantha A.; Antczak, Douglas F.

    2016-01-01

    The polymorphism of Major Histocompatibility Complex (MHC) class II DQ and DR genes in five common Equine Leukocyte Antigen (ELA) haplotypes was determined through sequencing of mRNA transcripts isolated from lymphocytes of eight ELA homozygous horses. Ten expressed MHC class II genes were detected in horses of the ELA-A3 haplotype carried by the donor horses of the equine Bacterial Artificial Chromosome (BAC) library and the reference genome sequence: four DR genes and six DQ genes. The other four ELA haplotypes contained at least eight expressed polymorphic MHC class II loci. Next Generation Sequencing (NGS) of genomic DNA of these four MHC haplotypes revealed stop codons in the DQA3 gene in the ELA-A2, ELA-A5, and ELA-A9 haplotypes. Few NGS reads were obtained for the other MHC class II genes that were not amplified in these horses. The amino acid sequences across haplotypes contained locus-specific residues, and the locus clusters produced by phylogenetic analysis were well supported. The MHC class II alleles within the five tested haplotypes were largely non-overlapping between haplotypes. The complement of equine MHC class II DQ and DR genes appears to be well conserved between haplotypes, in contrast to the recently described variation in class I gene loci between equine MHC haplotypes. The identification of allelic series of equine MHC class II loci will aid comparative studies of mammalian MHC conservation and evolution and may also help to interpret associations between the equine MHC class II region and diseases of the horse. PMID:27889800

  12. Y-SNPs haplotype diversity in four Chinese cattle breeds.

    PubMed

    Zhang, Runfeng; Cheng, Ming; Li, Xiaofeng; Chen, Fuying; Zheng, Jing; Wang, Xiaofei; Meng, Quanke

    2013-01-01

    To investigate the genetic diversity of Chinese cattle, 96 male samples of 4 Chinese native cattle breeds were investigated using 5 single nucleotide polymorphisms specific to the bovine Y chromosome. Two previously described haplotypes (taurine Y2 and indicine Y3) were detected in 74 and 22 animals, respectively. The haplotype frequencies varied amongst the four native breeds. The taurine Y2 haplotype dominated in the Qinchuan, Dabieshan, and Yunba breeds. However, the indicine Y3 haplotype occurred in high frequency in the Enshi breed. Among the four native breeds, Yunba had the highest haplotype diversity (0.4330 ± 0.0750), followed by Qinchuan (0.2899 ± 0.1028) and Enshi (0.2222 ± 0.1662); Dabieshan had the lowest (0.1079 ± 0.0680). Compared with some foreign cattle breeds, a low level of haplotype diversity was detected in our breeds (0.2633 ± 0.1030).
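
    Haplotype diversity of the kind reported above is commonly computed with Nei's unbiased estimator, h = n/(n - 1) × (1 - Σ p_i²). The sketch below assumes that estimator (the abstract does not state which formula was used) and applies it to the pooled counts of 74 Y2 and 22 Y3 animals. The pooled value is a different quantity from the per-breed figures and should not be expected to match them.

```python
def haplotype_diversity(counts):
    """Nei's unbiased haplotype diversity from a list of haplotype counts."""
    n = sum(counts)
    if n < 2:
        raise ValueError("need at least two sampled chromosomes")
    sum_sq_freq = sum((c / n) ** 2 for c in counts)
    return n / (n - 1) * (1.0 - sum_sq_freq)


# Pooled counts from the abstract: 74 animals with Y2, 22 with Y3.
pooled = haplotype_diversity([74, 22])
print(f"pooled haplotype diversity: {pooled:.4f}")
```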

  13. Genetic variation of 'Candidatus Liberibacter solanacearum' haplotype C and identification of a novel haplotype from Trioza urticae and stinging nettle.

    PubMed

    Haapalainen, Minna L; Wang, Jinhui; Latvala, Satu; Lehtonen, Mikko T; Pirhonen, Minna; Nissinen, Anne I

    2018-03-30

    'Candidatus Liberibacter solanacearum' (CLso) haplotype C is associated with disease in carrots and transmitted by the carrot psyllid Trioza apicalis. To identify possible other sources and vectors of this pathogen in Finland, samples were taken of wild plants within and near the carrot fields, the psyllids feeding on these plants, parsnips growing next to carrots, and carrot seeds. For analyzing the genotype of the CLso positive samples, a multi-locus sequence typing (MLST) scheme was developed. CLso haplotype C was detected in 11% of the Trioza anthrisci samples, in 35% of the Anthriscus sylvestris plants with discoloration, and in parsnips showing leaf discoloration. MLST revealed that the CLso in T. anthrisci and most A. sylvestris plants represent different strains than the bacteria found in T. apicalis and the cultivated plants. CLso haplotype D was detected in two of the 34 carrot seed lots tested, but was not detected in the plants grown from these seeds. Phylogenetic analysis by UPGMA clustering suggested that the haplotype D is more closely related to the haplotype A than to C. A novel, sixth haplotype of CLso, most closely related to A and D, was found in the psyllid Trioza urticae and stinging nettle (Urtica dioica, Urticaceae), and named as haplotype U.

  14. Discovery, evaluation and distribution of haplotypes of the wheat Ppd-D1 gene.

    PubMed

    Guo, Zhiai; Song, Yanxia; Zhou, Ronghua; Ren, Zhenglong; Jia, Jizeng

    2010-02-01

    Ppd-D1 is one of the most potent genes affecting the photoperiod response of wheat (Triticum aestivum). Only two alleles, insensitive Ppd-D1a and sensitive Ppd-D1b, were known previously, and these did not adequately explain the broad adaptation of wheat to photoperiod variation. In this study, five diagnostic molecular markers were employed to identify Ppd-D1 haplotypes in 492 wheat varieties from diverse geographic locations and 55 accessions of Aegilops tauschii, the D genome donor species of wheat. Six Ppd-D1 haplotypes, designated I-VI, were identified. Types II, V and VI were considered to be more ancient and types I, III and IV were considered to be derived from type II. The transcript abundances of the Ppd-D1 haplotypes showed continuous variation, being highest for haplotype I, lowest for haplotype III, and correlating negatively with varietal differences in heading time. These haplotypes also significantly affected other agronomic traits. The distribution frequency of Ppd-D1 haplotypes showed partial correlations with both latitudes and altitudes of wheat cultivation regions. The evidence on the evolution, expression and distribution of Ppd-D1 haplotypes was mutually consistent. What was regarded as a pair of alleles in the past can now be considered a series of alleles leading to continuous variation.

  15. Hierarchical Scaffolding With Bambus

    PubMed Central

    Pop, Mihai; Kosack, Daniel S.; Salzberg, Steven L.

    2004-01-01

    The output of a genome assembler generally comprises a collection of contiguous DNA sequences (contigs) whose relative placement along the genome is not defined. A procedure called scaffolding is commonly used to order and orient these contigs using paired read information. This ordering of contigs is an essential step when finishing and analyzing the data from a whole-genome shotgun project. Most recent assemblers include a scaffolding module; however, users have little control over the scaffolding algorithm or the information produced. We thus developed a general-purpose scaffolder, called Bambus, which affords users significant flexibility in controlling the scaffolding parameters. Bambus was used recently to scaffold the low-coverage draft dog genome data. Most significantly, Bambus enables the use of linking data other than that inferred from mate-pair information. For example, the sequence of a completed genome can be used to guide the scaffolding of a related organism. We present several applications of Bambus: support for finishing, comparative genomics, analysis of the haplotype structure of genomes, and scaffolding of a mammalian genome at low coverage. Bambus is available as an open-source package from our Web site. PMID:14707177

  16. Hierarchical scaffolding with Bambus.

    PubMed

    Pop, Mihai; Kosack, Daniel S; Salzberg, Steven L

    2004-01-01

    The output of a genome assembler generally comprises a collection of contiguous DNA sequences (contigs) whose relative placement along the genome is not defined. A procedure called scaffolding is commonly used to order and orient these contigs using paired read information. This ordering of contigs is an essential step when finishing and analyzing the data from a whole-genome shotgun project. Most recent assemblers include a scaffolding module; however, users have little control over the scaffolding algorithm or the information produced. We thus developed a general-purpose scaffolder, called Bambus, which affords users significant flexibility in controlling the scaffolding parameters. Bambus was used recently to scaffold the low-coverage draft dog genome data. Most significantly, Bambus enables the use of linking data other than that inferred from mate-pair information. For example, the sequence of a completed genome can be used to guide the scaffolding of a related organism. We present several applications of Bambus: support for finishing, comparative genomics, analysis of the haplotype structure of genomes, and scaffolding of a mammalian genome at low coverage. Bambus is available as an open-source package from our Web site.

  17. Contributions of Zea mays subspecies mexicana haplotypes to modern maize.

    PubMed

    Yang, Ning; Xu, Xi-Wen; Wang, Rui-Ru; Peng, Wen-Lei; Cai, Lichun; Song, Jia-Ming; Li, Wenqiang; Luo, Xin; Niu, Luyao; Wang, Yuebin; Jin, Min; Chen, Lu; Luo, Jingyun; Deng, Min; Wang, Long; Pan, Qingchun; Liu, Feng; Jackson, David; Yang, Xiaohong; Chen, Ling-Ling; Yan, Jianbing

    2017-11-30

    Maize was domesticated from lowland teosinte (Zea mays ssp. parviglumis), but the contribution of highland teosinte (Zea mays ssp. mexicana, hereafter mexicana) to modern maize is not clear. Here, two genomes for Mo17 (a modern maize inbred) and mexicana are assembled using a meta-assembly strategy after sequencing of 10 lines derived from a maize-teosinte cross. Comparative analyses reveal a high level of diversity between Mo17, B73, and mexicana, including three Mb-size structural rearrangements. The maize spontaneous mutation rate is estimated to be 2.17 × 10⁻⁸ to 3.87 × 10⁻⁸ per site per generation with a nonrandom distribution across the genome. A higher deleterious mutation rate is observed in the pericentromeric regions, and might be caused by differences in recombination frequency. Over 10% of the maize genome shows evidence of introgression from the mexicana genome, suggesting that mexicana contributed to maize adaptation and improvement. Our data offer a rich resource for constructing the pan-genome of Zea mays and genetic improvement of modern maize varieties.

  18. Haplotype Phasing and Inheritance of Copy Number Variants in Nuclear Families

    PubMed Central

    Palta, Priit; Kaplinski, Lauris; Nagirnaja, Liina; Veidenberg, Andres; Möls, Märt; Nelis, Mari; Esko, Tõnu; Metspalu, Andres; Laan, Maris; Remm, Maido

    2015-01-01

    DNA copy number variants (CNVs) that alter the copy number of a particular DNA segment in the genome play an important role in human phenotypic variability and disease susceptibility. A number of CNVs overlapping with genes have been shown to confer risk to a variety of human diseases, thus highlighting the relevance of addressing the variability of CNVs at a higher resolution. So far, it has not been possible to deterministically infer the allelic composition of different haplotypes present within the CNV regions. We have developed a novel computational method, called PiCNV, which resolves the haplotype sequence composition within CNV regions in nuclear families based on SNP genotyping microarray data. The algorithm can i) phase normal and CNV-carrying haplotypes in the copy number variable regions, ii) resolve the allelic copies of rearranged DNA sequence within the haplotypes and iii) infer the heritability of identified haplotypes in trios or larger nuclear families. To our knowledge this is the first program available that can deterministically phase null, mono-, di-, tri- and tetraploid genotypes in CNV loci. We applied our method to study the composition and inheritance of haplotypes in CNV regions of 30 HapMap Yoruban trios and 34 Estonian families. For 93.6% of the CNV loci, PiCNV unambiguously phased normal and CNV-carrying haplotypes and followed their transmission in the corresponding families. Furthermore, allelic composition analysis identified the co-occurrence of alternative allelic copies within 66.7% of haplotypes carrying copy number gains. We also observed less frequent transmission of CNV-carrying haplotypes from parents to children compared to normal haplotypes and identified the emergence of several de novo deletions and duplications in the offspring. PMID:25853576
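
    The idea of deterministic phasing from family data can be sketched for the simplest case only: one diploid biallelic SNP in a father-mother-child trio. This is a deliberately simplified stand-in, not the PiCNV algorithm, which also handles null, mono-, tri- and tetraploid genotypes in CNV regions. The function name `phase_child` and the tuple encoding of genotypes are assumptions made for illustration.

```python
def phase_child(child, father, mother):
    """Phase one diploid child genotype using parental genotypes.

    Each genotype is an unordered pair of alleles, e.g. (0, 1).
    Returns (paternal_allele, maternal_allele), or None when both
    assignments are compatible with the parents (ambiguous site).
    Raises ValueError when no assignment obeys Mendelian inheritance.
    """
    a, b = child
    a_from_father = a in father and b in mother
    b_from_father = b in father and a in mother
    if not a_from_father and not b_from_father:
        raise ValueError("Mendelian inconsistency")
    if a == b:
        return (a, b)  # homozygous child: phase is trivial
    if a_from_father and b_from_father:
        return None    # e.g. all three individuals heterozygous
    return (a, b) if a_from_father else (b, a)
```

    For example, a heterozygous child (0, 1) with a homozygous (0, 0) father and a (1, 1) mother phases as paternal 0, maternal 1, whereas a trio in which all three individuals are heterozygous stays unresolved at that site.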

  19. Haplotype phasing and inheritance of copy number variants in nuclear families.

    PubMed

    Palta, Priit; Kaplinski, Lauris; Nagirnaja, Liina; Veidenberg, Andres; Möls, Märt; Nelis, Mari; Esko, Tõnu; Metspalu, Andres; Laan, Maris; Remm, Maido

    2015-01-01

    DNA copy number variants (CNVs) that alter the copy number of a particular DNA segment in the genome play an important role in human phenotypic variability and disease susceptibility. A number of CNVs overlapping with genes have been shown to confer risk to a variety of human diseases, thus highlighting the relevance of addressing the variability of CNVs at a higher resolution. So far, it has not been possible to deterministically infer the allelic composition of different haplotypes present within the CNV regions. We have developed a novel computational method, called PiCNV, which resolves the haplotype sequence composition within CNV regions in nuclear families based on SNP genotyping microarray data. The algorithm can i) phase normal and CNV-carrying haplotypes in the copy number variable regions, ii) resolve the allelic copies of rearranged DNA sequence within the haplotypes and iii) infer the heritability of identified haplotypes in trios or larger nuclear families. To our knowledge this is the first program available that can deterministically phase null, mono-, di-, tri- and tetraploid genotypes in CNV loci. We applied our method to study the composition and inheritance of haplotypes in CNV regions of 30 HapMap Yoruban trios and 34 Estonian families. For 93.6% of the CNV loci, PiCNV unambiguously phased normal and CNV-carrying haplotypes and followed their transmission in the corresponding families. Furthermore, allelic composition analysis identified the co-occurrence of alternative allelic copies within 66.7% of haplotypes carrying copy number gains. We also observed less frequent transmission of CNV-carrying haplotypes from parents to children compared to normal haplotypes and identified the emergence of several de novo deletions and duplications in the offspring.

  20. Global spread and genetic variants of the two CYP9M10 haplotype forms associated with insecticide resistance in Culex quinquefasciatus Say.

    PubMed

    Itokawa, K; Komagata, O; Kasai, S; Kawada, H; Mwatele, C; Dida, G O; Njenga, S M; Mwandawiro, C; Tomita, T

    2013-09-01

    Insecticide resistance develops as a genetic factor (allele) conferring lower susceptibility to insecticides proliferates within a target insect population under strong positive selection. Intriguingly, a resistance allele pre-existing in a population often bears a series of further adaptive allelic variants through new mutations. This phenomenon occasionally results in replacement of the predominating resistance allele by fitter new derivatives, and consequently, development of greater resistance at the population level. The overexpression of the cytochrome P450 gene CYP9M10 is associated with pyrethroid resistance in the southern house mosquito Culex quinquefasciatus. Previously, we have found two genealogically related overexpressing CYP9M10 haplotypes, which differ in gene copy number (duplicated and non-duplicated). The duplicated haplotype was probably derived recently from the non-duplicated overproducer. In the present study, we investigated allelic series of CYP9M10 involved in three C. quinquefasciatus laboratory colonies recently collected from three different localities. Duplicated and non-duplicated overproducing haplotypes coexisted in African and Asian colonies, indicating a global distribution of both haplotype lineages. The duplicated haplotypes in both the Asian and African colonies were associated with higher expression levels and stronger resistance than non-duplicated overproducing haplotypes. There was slight variation in expression level among the non-duplicated overproducing haplotypes. The nucleotide sequences in coding and upstream regions among members of this group also showed a little diversity. Non-duplicated overproducing haplotypes with relatively higher expression were genealogically closer to the duplicated haplotypes than the other non-duplicated overproducing haplotypes, suggesting multiple cis-acting mutations before duplication.

  1. [A total of 362 HLA different haplotypes and HLA recombination haplotypes based on analysis of their family pedigree in Chinese partial Han populations].

    PubMed

    Gao, Su-Qing; Cheng, Xi; Li, Qian; Li, Yu-Zhu; Deng, Zhi-Hui

    2009-06-01

    This study aimed to discover novel HLA recombination haplotypes and to investigate the distribution of haplotypes in the Chinese Han population. Based on the HLA-A, B, DRB1 typing results of 179 family members, 791 haplotypes were assigned by the mode of inheritance. A total of 4 novel recombinant haplotypes in the HLA-DRB1 locus region were observed in 4 families, with a ratio of paternal to maternal chromosomes of 3:1. The recombination ratio between HLA-DRB1 and HLA-A or B loci was 0.92% (4/433). A total of 362 kinds of HLA-A, -B, -DRB1 haplotypes were confirmed in a part of the Chinese Han population. A33-B58-DR17, A2-B46-DR9, A30-B13-DR7, A11-B13-DR15, A11-B75-DR12 and A2-B46-DR14 were the most common haplotypes, which was consistent with the distribution of HLA alleles in unrelated donors. Haplotypes such as A1-B63-DR12, A29-B46-DR15, A1-B61-DR10, A34-B35-DR9, A29-B54-DR4, A23-B13-DR16 and A34-B62-DR15 were rare haplotypes not yet reported in Chinese populations. It is concluded that HLA-A-B-DRB1 haplotypes can be confirmed by analysis of family pedigrees. The results obtained in this study are basic data for the study of Chinese anthropology, organ transplantation and disease correlation analysis.

  2. Haplotype-Based Association Analysis via Variance-Components Score Test

    PubMed Central

    Tzeng, Jung-Ying; Zhang, Daowen

    2007-01-01

    Haplotypes provide a more informative format of polymorphisms for genetic association analysis than do individual single-nucleotide polymorphisms. However, the practical efficacy of haplotype-based association analysis is challenged by a trade-off between the benefits of modeling abundant variation and the cost of the extra degrees of freedom. To reduce the degrees of freedom, several strategies have been considered in the literature. They include (1) clustering evolutionarily close haplotypes, (2) modeling the level of haplotype sharing, and (3) smoothing haplotype effects by introducing a correlation structure for haplotype effects and studying the variance components (VC) for association. Although the first two strategies enjoy a fair extent of power gain, empirical evidence showed that VC methods may exhibit only similar or less power than the standard haplotype regression method, even in cases of many haplotypes. In this study, we report possible reasons that cause the underpowered phenomenon and show how the power of the VC strategy can be improved. We construct a score test based on the restricted maximum likelihood or the marginal likelihood function of the VC and identify its nontypical limiting distribution. Through simulation, we demonstrate the validity of the test and investigate the power performance of the VC approach and that of the standard haplotype regression approach. With suitable choices for the correlation structure, the proposed method can be directly applied to unphased genotypic data. Our method is applicable to a wide-ranging class of models and is computationally efficient and easy to implement. The broad coverage and the fast and easy implementation of this method make the VC strategy an effective tool for haplotype analysis, even in modern genomewide association studies. PMID:17924336

  3. Mapping of HLA-DQ haplotypes in a group of Danish patients with celiac disease.

    PubMed

    Lund, Flemming; Hermansen, Mette N; Pedersen, Merete F; Hillig, Thore; Toft-Hansen, Henrik; Sölétormos, György

    2015-10-01

    A cost-effective identification of HLA-DQ risk haplotypes using the single nucleotide polymorphism (SNP) technique has recently been applied in the diagnosis of celiac disease (CD) in four European populations. The objective of the study was to map risk HLA-DQ haplotypes in a group of Danish CD patients using the SNP technique. Cohort A: Among 65 patients with gastrointestinal symptoms we compared the HLA-DQ2 and HLA-DQ8 risk haplotypes obtained by the SNP technique (method 1) with results based on a sequence specific primer amplification technique (method 2) and a technique used in an assay from BioDiagene (method 3). Cohort B: 128 patients with histologically verified CD were tested for CD risk haplotypes (method 1). Patients with negative results were further tested for sub-haplotypes of HLA-DQ2 (methods 2 and 3). Cohort A: The three applied methods provided the same HLA-DQ2 and HLA-DQ8 results among 61 patients. Four patients were negative for the HLA-DQ2 and HLA-DQ8 haplotypes (method 1) but were positive for the HLA-DQ2.5-trans and HLA-DQ2.2 haplotypes (methods 2 and 3). Cohort B: A total of 120 patients were positive for the HLA-DQ2.5-cis and HLA-DQ8 haplotypes (method 1). The remaining seven patients were positive for HLA-DQ2.5-trans or HLA-DQ2.2 haplotypes (methods 2 and 3). One patient was negative with all three HLA methods. The HLA-DQ risk haplotypes were detected in 93.8% of the CD patients using the SNP technique (method 1). The sensitivity increased to 99.2% by combining methods 1-3.
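
    The two sensitivity figures follow directly from the Cohort B counts given above (120 patients positive by method 1, seven more by methods 2 and 3, out of 128 with verified CD). A short worked check, with a hypothetical helper name:

```python
def sensitivity_percent(detected: int, affected: int) -> float:
    """Percentage of affected patients in whom a risk haplotype was detected."""
    if affected <= 0 or not 0 <= detected <= affected:
        raise ValueError("counts must satisfy 0 <= detected <= affected, affected > 0")
    return 100.0 * detected / affected


# Counts taken from the abstract (Cohort B, 128 patients with verified CD).
snp_only = sensitivity_percent(120, 128)      # method 1 alone
combined = sensitivity_percent(120 + 7, 128)  # methods 1-3 combined

print(f"SNP technique alone: {snp_only:.1f}%")
print(f"Methods 1-3 combined: {combined:.1f}%")
```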

  4. Haplotype-based approach to known MS-associated regions increases the amount of explained risk

    PubMed Central

    Khankhanian, Pouya; Gourraud, Pierre-Antoine; Lizee, Antoine; Goodin, Douglas S

    2015-01-01

    Genome-wide association studies (GWAS), using single nucleotide polymorphisms (SNPs), have yielded 110 non-human leucocyte antigen genomic regions that are associated with multiple sclerosis (MS). Despite this large number of associations, however, only 28% of MS-heritability can currently be explained. Here we compare the use of multi-SNP-haplotypes to the use of single-SNPs as alternative methods to describe MS genetic risk. SNP-haplotypes (of various lengths from 1 up to 15 contiguous SNPs) were constructed at each of the 110 previously identified, MS-associated, genomic regions. Even after correcting for the larger number of statistical comparisons made when using the haplotype-method, in 32 of the regions, the SNP-haplotype based model was markedly more significant than the single-SNP based model. By contrast, in no region was the single-SNP based model similarly more significant than the SNP-haplotype based model. Moreover, when we included the 932 MS-associated SNP-haplotypes (that we identified from 102 regions) as independent variables into a logistic linear model, the amount of MS-heritability, as assessed by Nagelkerke's R-squared, was 38%, which was considerably better than 29%, which was obtained by using only single-SNPs. This study demonstrates that SNP-haplotypes can be used to fine-map the genetic associations within regions of interest previously identified by single-SNP GWAS. Moreover, the amount of the MS genetic risk explained by the SNP-haplotype associations in the 110 MS-associated genomic regions was considerably greater when using SNP-haplotypes than when using single-SNPs. Also, the use of SNP-haplotypes can lead to the discovery of new regions of interest, which have not been identified by a single-SNP GWAS. PMID:26185143
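
    Nagelkerke's R-squared, the measure of explained risk used above, rescales the Cox-Snell pseudo-R² of a logistic model so that its maximum is 1. A minimal sketch from log-likelihoods follows; the function name and arguments are illustrative, and the log-likelihood values would come from whatever fitting software is used.

```python
import math


def nagelkerke_r2(loglik_null: float, loglik_model: float, n: int) -> float:
    """Nagelkerke's R-squared for a logistic regression.

    loglik_null  : log-likelihood of the intercept-only model
    loglik_model : log-likelihood of the fitted model
    n            : number of observations
    """
    if n <= 0:
        raise ValueError("n must be positive")
    # Cox-Snell R^2 = 1 - (L_null / L_model)^(2/n)
    cox_snell = 1.0 - math.exp(2.0 * (loglik_null - loglik_model) / n)
    # Largest value Cox-Snell can reach: 1 - L_null^(2/n)
    max_cox_snell = 1.0 - math.exp(2.0 * loglik_null / n)
    return cox_snell / max_cox_snell
```

    A model that adds nothing over the intercept gives 0, and a model that predicts every outcome perfectly (log-likelihood 0) gives 1.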

  5. β3 Integrin Haplotype Influences Gene Regulation and Plasma von Willebrand Factor Activity

    PubMed Central

    Payne, Katie E; Bray, Paul F; Grant, Peter J; Carter, Angela M

    2008-01-01

    The Leu33Pro polymorphism of the gene encoding β3 integrin (ITGB3) is associated with acute coronary syndromes and influences platelet aggregation. Three common promoter polymorphisms have also been identified. The aims of this study were to (1) investigate the influence of the ITGB3 −400C/A, −425A/C and −468G/A promoter polymorphisms on reporter gene expression and nuclear protein binding and (2) determine genotype and haplotype associations with platelet αIIbβ3 receptor density. Promoter haplotypes were introduced into an ITGB3 promoter-pGL3 construct by site directed mutagenesis and luciferase reporter gene expression analysed in HEL and HMEC-1 cells. Binding of nuclear proteins was assessed by electrophoretic mobility shift assay. The association of ITGB3 haplotype with platelet αIIbβ3 receptor density was determined in 223 subjects. Species conserved motifs were identified in the ITGB3 promoter in the vicinity of the 3 polymorphisms. The GAA, GCC, AAC, AAA and ACC constructs induced ~50% increased luciferase expression relative to the GAC construct in both cell types. Haplotype analysis including Leu33Pro indicated 5 common haplotypes; no associations between ITGB3 haplotypes and receptor density were found. However, the GCC-Pro33 haplotype was associated with significantly higher vWF activity (128.6 [112.1–145.1]%) compared with all other haplotypes (107.1 [101.2–113.0]%, p=0.02). In conclusion, the GCC-Pro33 haplotype was associated with increased vWF activity but not with platelet αIIbβ3 receptor density, which may indicate ITGB3 haplotype influences endothelial function. PMID:18045606

  6. Single Marker and Haplotype-Based Association Analysis of Semolina and Pasta Colour in Elite Durum Wheat Breeding Lines Using a High-Density Consensus Map.

    PubMed

    N'Diaye, Amidou; Haile, Jemanesh K; Cory, Aron T; Clarke, Fran R; Clarke, John M; Knox, Ron E; Pozniak, Curtis J

    2017-01-01

    Association mapping is usually performed by testing the correlation between a single marker and phenotypes. However, because patterns of variation within genomes are inherited as blocks, clustering markers into haplotypes for genome-wide scans could be a worthwhile approach to improve statistical power to detect associations. The availability of high-density molecular data allows the possibility to assess the potential of both approaches to identify marker-trait associations in durum wheat. In the present study, we used single marker- and haplotype-based approaches to identify loci associated with semolina and pasta colour in durum wheat, the main objective being to evaluate the potential benefits of haplotype-based analysis for identifying quantitative trait loci. One hundred sixty-nine durum lines were genotyped using the Illumina 90K Infinium iSelect assay, and 12,234 polymorphic single nucleotide polymorphism (SNP) markers were generated and used to assess the population structure and the linkage disequilibrium (LD) patterns. A total of 8,581 SNPs previously localized to a high-density consensus map were clustered into 406 haplotype blocks based on the average LD distance of 5.3 cM. Combining multiple SNPs into haplotype blocks increased the average polymorphism information content (PIC) from 0.27 per SNP to 0.50 per haplotype. The haplotype-based analysis identified 12 loci associated with grain pigment colour traits, including the five loci identified by the single marker-based analysis. Furthermore, the haplotype-based analysis resulted in an increase of the phenotypic variance explained (50.4% on average) and the allelic effect (33.7% on average) when compared to single marker analysis. The presence of multiple allelic combinations within each haplotype locus offers potential for screening the most favorable haplotype series and may facilitate marker-assisted selection of grain pigment colour in durum wheat. 
These results suggest a benefit of haplotype-based analysis over single marker analysis to detect loci associated with colour traits in durum wheat.
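
The jump in polymorphism information content reported above (0.27 per SNP versus 0.50 per haplotype) can be illustrated with a short sketch. It assumes the widely used Botstein et al. (1980) definition, PIC = 1 - sum(p_i^2) - sum over i<j of 2 p_i^2 p_j^2; the abstract does not say which estimator the authors used. The function name `pic` and the allele frequencies are hypothetical and chosen only for illustration.

```python
from itertools import combinations


def pic(freqs):
    """Polymorphism information content of one locus (Botstein et al. 1980 form).

    `freqs` holds the population frequencies of every allele (or haplotype)
    at the locus and must sum to 1. Assumed definition, not taken from the study.
    """
    if abs(sum(freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    # Probability that two randomly drawn alleles are identical.
    homozygosity = sum(p * p for p in freqs)
    # Correction for uninformative matings between identical heterozygotes.
    cross = sum(2 * (p * p) * (q * q) for p, q in combinations(freqs, 2))
    return 1.0 - homozygosity - cross


if __name__ == "__main__":
    # A biallelic SNP is capped at PIC = 0.375 (both alleles at 0.5).
    print("biallelic SNP:", pic([0.5, 0.5]))
    # A block with four equally frequent haplotypes behaves like a
    # four-allele marker and scores higher.
    print("four-haplotype block:", pic([0.25, 0.25, 0.25, 0.25]))
```

Under this definition a single biallelic SNP can never exceed 0.375, whereas a multi-allelic haplotype block can, which is consistent with the direction of the reported increase.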

  7. Phylogeography of the Qinghai-Tibetan Plateau endemic Juniperus przewalskii (Cupressaceae) inferred from chloroplast DNA sequence variation.

    PubMed

    Zhang, Q; Chiang, T Y; George, M; Liu, J Q; Abbott, R J

    2005-10-01

    The vegetation of the northeast Qinghai-Tibetan Plateau is dominated by alpine meadow and desert-steppe with sparse forests scattered within it. To obtain a better understanding of the phylogeography of one constituent species of the forests in this region, we examined chloroplast trnT-trnF and trnS-trnG sequence variation within Juniperus przewalskii, a key endemic tree species. Sequence data were obtained from 392 trees in 20 populations covering the entire distribution range of the species. Six cpDNA haplotypes were identified. Significant population subdivision was detected (G(ST) = 0.772, N(ST) = 0.834), suggesting low levels of recurrent gene flow among populations and significant phylogeographic structure (N(ST) > G(ST), P < 0.05). Eight of the nine disjunct populations surveyed on the high-elevation northeast plateau were fixed for a single haplotype (A), while the remaining, more westerly population, contained the same haplotype at high frequency together with two low frequency haplotypes (C and F). In contrast, most populations that occurred at lower altitudes at the plateau edge were fixed or nearly fixed for one of two haplotypes, A or E. However, two plateau edge populations had haplotype compositions different from the rest. In one, four haplotypes (A, B, D and E) were present at approximately equivalent frequencies, which might reflect a larger refugium in the area of this population during the last glacial period. Phylogenetic analysis indicated that the most widely distributed haplotype A is not ancestral to other haplotypes. The contrasting phylogeographic structures of the haplotype-rich plateau edge area and the almost haplotype-uniform plateau platform region indicate that the plateau platform was recolonized by J. przewalskii during the most recent postglacial period. 
This is supported by the findings of a nested clade analysis, which inferred that postglacial range expansion from the plateau edge followed by recent fragmentation is largely responsible for the present-day spatial distribution of cpDNA haplotypes within the species.

  8. Solving Infeasibility Problems in Computerized Test Assembly.

    ERIC Educational Resources Information Center

    Timminga, Ellen

    1998-01-01

    Discusses problems of diagnosing and repairing infeasible linear-programming models in computerized test assembly. Demonstrates that it is possible to localize the causes of infeasibility, although this is not always easy. (SLD)

  9. Three potato centromeres are associated with distinct haplotypes with or without megabase-sized satellite repeat arrays.

    PubMed

    Wang, Linsheng; Zeng, Zixian; Zhang, Wenli; Jiang, Jiming

    2014-02-01

    We report discoveries of different haplotypes associated with the centromeres of three potato chromosomes, including haplotypes composed of long arrays of satellite repeats and haplotypes lacking the same repeats. These results are in favor of the hypothesis that satellite repeat-based centromeres may originate from neocentromeres that lack repeats.

  10. Blocks of limited haplotype diversity revealed by high-resolution scanning of human chromosome 21.

    PubMed

    Patil, N; Berno, A J; Hinds, D A; Barrett, W A; Doshi, J M; Hacker, C R; Kautzer, C R; Lee, D H; Marjoribanks, C; McDonough, D P; Nguyen, B T; Norris, M C; Sheehan, J B; Shen, N; Stern, D; Stokowski, R P; Thomas, D J; Trulson, M O; Vyas, K R; Frazer, K A; Fodor, S P; Cox, D R

    2001-11-23

    Global patterns of human DNA sequence variation (haplotypes) defined by common single nucleotide polymorphisms (SNPs) have important implications for identifying disease associations and human traits. We have used high-density oligonucleotide arrays, in combination with somatic cell genetics, to identify a large fraction of all common human chromosome 21 SNPs and to directly observe the haplotype structure defined by these SNPs. This structure reveals blocks of limited haplotype diversity in which more than 80% of a global human sample can typically be characterized by only three common haplotypes.

  11. COMT haplotypes, catecholamine metabolites in plasma and clinical response in schizophrenic and bipolar patients.

    PubMed

    Zumárraga, Mercedes; Arrúe, Aurora; Basterreche, Nieves; Macías, Isabel; Catalán, Ana; Madrazo, Arantza; Bustamante, Sonia; Zamalloa, María I; Erkoreka, Leire; Gordo, Estibaliz; Arnaiz, Ainara; Olivas, Olga; Arroita, Ariane; Marín, Elena; González-Torres, Miguel A

    2016-06-01

    We examined the association of COMT haplotypes and plasma metabolites of catecholamines in relation to the clinical response to antipsychotics in schizophrenic and bipolar patients. We studied 165 patients before and after four weeks of treatment, and 163 healthy controls. We assessed four COMT haplotypes and the plasma concentrations of HVA, DOPAC and MHPG. In bipolar patients, haplotypes are associated with age at onset and clinical evolution. In schizophrenic patients, a haplotype previously associated with increased risk is related to a better response of negative symptoms. Haplotypes would be good indicators of the clinical status and the treatment response in bipolar and schizophrenic patients. Larger studies are required to elucidate the clinical usefulness of these findings.

  12. Multi-objective problem of the modified distributed parallel machine and assembly scheduling problem (MDPMASP) with eligibility constraints

    NASA Astrophysics Data System (ADS)

    Amallynda, I.; Santosa, B.

    2017-11-01

    This paper proposes a new generalization of the distributed parallel machine and assembly scheduling problem (DPMASP) with eligibility constraints, referred to as the modified distributed parallel machine and assembly scheduling problem (MDPMASP) with eligibility constraints. Within this generalization, we assume that there is a set of non-identical factories or production lines, each with a set of unrelated parallel machines of different processing speeds that feed a single assembly machine in series. A set of different products is manufactured through an assembly program of a set of components (jobs) according to the requested demand. Each product requires several kinds of jobs with different sizes. In addition, we consider the multi-objective problem (MOP) of minimizing mean flow time and the number of tardy products simultaneously. The problem is known to be NP-hard and is important in practice, as these criteria reflect the customer's demand and the manufacturer's perspective. Because this is a realistic and complex problem with a wide range of possible solutions, we propose four simple heuristics and two metaheuristics to solve it. Various parameters of the proposed metaheuristic algorithms are discussed and calibrated by means of the Taguchi technique. All proposed algorithms are tested in Matlab. Our computational experiments indicate that the proposed algorithms can be implemented and used to solve moderately sized instances, giving efficient solutions that are close to optimum in most cases.

  13. FamLBL: detecting rare haplotype disease association based on common SNPs using case-parent triads.

    PubMed

    Wang, Meng; Lin, Shili

    2014-09-15

    In recent years, there has been an increasing interest in using common single-nucleotide polymorphisms (SNPs) amassed in genome-wide association studies to investigate rare haplotype effects on complex diseases. Evidence has suggested that rare haplotypes may tag rare causal single-nucleotide variants, making SNP-based rare haplotype analysis not only cost effective, but also more valuable for detecting causal variants. Although a number of methods for detecting rare haplotype association have been proposed in recent years, they are population based and thus susceptible to population stratification. We propose family-triad-based logistic Bayesian Lasso (famLBL) for estimating effects of haplotypes on complex diseases using SNP data. By choosing an appropriate prior distribution, effect sizes of unassociated haplotypes can be shrunk toward zero, allowing for more precise estimation of associated haplotypes, especially those that are rare, thereby achieving greater detection power. We evaluate famLBL using simulation to gauge its type I error and power. Comparison with its population counterpart, LBL, highlights famLBL's robustness in the presence of population substructure. Further investigation comparing famLBL with the Family-Based Association Test (FBAT) reveals its advantage for detecting rare haplotype association. famLBL is implemented as an R package available at http://www.stat.osu.edu/~statgen/SOFTWARE/LBL/.

  14. Frequency and origin of haplotypes associated with the beta-globin gene cluster in individuals with trait and sickle cell anemia in the Atlantic and Pacific coastal regions of Colombia

    PubMed Central

    Fong, Cristian; Lizarralde-Iragorri, María Alejandra; Rojas-Gallardo, Diana; Barreto, Guillermo

    2013-01-01

    Sickle cell anemia is a genetic disease with high prevalence in people of African descent. There are five typical haplotypes associated with this disease and the haplotypes associated with the beta-globin gene cluster have been used to establish the origin of African-descendant people in America. In this work, we determined the frequency and the origin of haplotypes associated with hemoglobin S in a sample of individuals with sickle cell anemia (HbSS) and sickle cell hemoglobin trait (HbAS) in coastal regions of Colombia. Blood samples from 71 HbAS and 79 HbSS individuals were obtained. Haplotypes were determined based on the presence of variable restriction sites within the β-globin gene cluster. On the Pacific coast of Colombia the most frequent haplotype was Benin, while on the Atlantic coast Bantu was marginally higher than Benin. Eight atypical haplotypes were observed on both coasts, being more diverse in the Atlantic than in the Pacific region. These results suggest a differential settlement of the coasts, dependent on where slaves were brought from, either from the Gulf of Guinea or from Angola, where the haplotype distributions are similar. Atypical haplotypes probably originated from point mutations that lost or gained a restriction site and/or by recombination events. PMID:24385850

  15. Inheritance of Hetero-Diploid Pollen S-Haplotype in Self-Compatible Tetraploid Chinese Cherry (Prunus pseudocerasus Lindl)

    PubMed Central

    Gu, Chao; Liu, Qing-Zhong; Yang, Ya-Nan; Zhang, Shu-Jun; Khan, Muhammad Awais; Wu, Jun; Zhang, Shao-Ling

    2013-01-01

    The breakdown of self-incompatibility, which could result from the accumulation of non-functional S-haplotypes or competitive interaction between two different functional S-haplotypes, has been studied extensively at the molecular level in tetraploid Rosaceae species. In this study, two tetraploid Chinese cherry (Prunus pseudocerasus) cultivars and one diploid sweet cherry (Prunus avium) cultivar were used to investigate the ploidy of pollen grains and inheritance of pollen-S alleles. Genetic analysis of the S-genotypes of two intercross-pollinated progenies showed that the pollen grains derived from Chinese cherry cultivars were hetero-diploid, and that the two S-haplotypes were made up of every combination of two of the four possible S-haplotypes. Moreover, the distributions of single S-haplotypes expressed in self- and intercross-pollinated progenies were in disequilibrium. The number of individuals of the two different S-haplotypes was unequal in two self-pollinated and two intercross-pollinated progenies. Notably, the number of individuals containing two different S-haplotypes (S1- and S5-, S5- and S8-, S1- and S4-haplotype) was larger than that of other individuals in the two self-pollinated progenies, indicating that some of these hetero-diploid pollen grains may have the capability to inactivate stylar S-RNase inside the pollen tube and grow better into the ovaries. PMID:23596519

  16. Relationship of the bovine growth hormone gene to carcass traits in Japanese black cattle.

    PubMed

    Tatsuda, K; Oka, A; Iwamoto, E; Kuroda, Y; Takeshita, H; Kataoka, H; Kouno, S

    2008-02-01

    The bovine growth hormone gene (bGH) possesses three haplotypes, A, B and C, that differ by amino acid mutations at positions 127 and 172 in the fifth exon: (leucine 127, threonine 172), (valine 127, threonine 172) and (valine 127, methionine 172) respectively. The correlation between meat quality or carcass weight and these haplotypes was investigated in Japanese black cattle. Altogether, 940 bGH haplotypes were compared with respect to six carcass traits: carcass weight, longissimus muscle area, rib thickness, subcutaneous fat thickness, beef marbling score and beef colour. The frequency of the B haplotype was higher (0.421) than that of A (0.269) and C (0.311). High carcass weight and low beef marbling were associated with haplotype A (p < 0.05 and p < 0.01 respectively), whereas beef marbling was increased by haplotype C (p < 0.05). The estimated regression coefficients of the A haplotype substitution effect for carcass weight and beef marbling score were 5.55 (13.1% of the phenotypic SD) and -0.31 (17.0%) respectively. That of the C haplotype for beef marbling score was 0.20 (11.0%). The other traits showed no relationship to the haplotypes examined. The results of this investigation suggest that information pertaining to bGH polymorphisms in Japanese black cattle could be used to improve the selection of meat traits.

  17. Fetal hemoglobin in sickle cell anemia: The Arab-Indian haplotype and new therapeutic agents.

    PubMed

    Habara, Alawi H; Shaikho, Elmutaz M; Steinberg, Martin H

    2017-11-01

    Fetal hemoglobin (HbF) has well-known tempering effects on the symptoms of sickle cell disease and its levels vary among patients with different haplotypes of the sickle hemoglobin gene. Compared with sickle cell anemia haplotypes found in patients of African descent, HbF levels in Saudi and Indian patients with the Arab-Indian (AI) haplotype exceed those in any other haplotype by nearly twofold. Genetic association studies have identified some loci associated with high HbF in the AI haplotype but these observations require functional confirmation. Saudi patients with the Benin haplotype have HbF levels almost twice as high as African patients with this haplotype but this difference is unexplained. Hydroxyurea is still the only FDA-approved drug for HbF induction in sickle cell disease. While most patients treated with hydroxyurea have an increase in HbF and some clinical improvement, 10 to 20% of adults show little response to this agent. We review the genetic basis of HbF regulation focusing on sickle cell anemia in Saudi Arabia and discuss new drugs that can induce increased levels of HbF.

  18. A phased SNP-based classification of sickle cell anemia HBB haplotypes.

    PubMed

    Shaikho, Elmutaz M; Farrell, John J; Alsultan, Abdulrahman; Qutub, Hatem; Al-Ali, Amein K; Figueiredo, Maria Stella; Chui, David H K; Farrer, Lindsay A; Murphy, George J; Mostoslavsky, Gustavo; Sebastiani, Paola; Steinberg, Martin H

    2017-08-11

    Sickle cell anemia causes severe complications and premature death. Five common β-globin gene cluster haplotypes are each associated with characteristic fetal hemoglobin (HbF) levels. As HbF is the major modulator of disease severity, classifying patients according to haplotype is useful. The first method of haplotype classification used restriction fragment length polymorphisms (RFLPs) to detect single nucleotide polymorphisms (SNPs) in the β-globin gene cluster. This is labor intensive, and error prone. We used genome-wide SNP data imputed to the 1000 Genomes reference panel to obtain phased data distinguishing parental alleles. We successfully haplotyped 813 sickle cell anemia patients previously classified by RFLPs with a concordance >98%. Four SNPs (rs3834466, rs28440105, rs10128556, and rs968857) marking four different restriction enzyme sites unequivocally defined most haplotypes. We were able to assign a haplotype to 86% of samples that were either partially or misclassified using RFLPs. Phased data using only four SNPs allowed unequivocal assignment of a haplotype that was not always possible using a larger number of RFLPs. Given the availability of genome-wide SNP data, our method is rapid and does not require high computational resources.

  19. Practical interpretation of CYP2D6 haplotypes: Comparison and integration of automated and expert calling.

    PubMed

    Ruaño, Gualberto; Kocherla, Mohan; Graydon, James S; Holford, Theodore R; Makowski, Gregory S; Goethe, John W

    2016-05-01

    We describe a population genetic approach to compare samples interpreted with expert calling (EC) versus automated calling (AC) for CYP2D6 haplotyping. The analysis represents 4812 haplotype calls based on signal data generated by the Luminex xMap analyzers from 2406 patients referred to a high-complexity molecular diagnostics laboratory for CYP450 testing. DNA was extracted from buccal swabs. We compared the results of EC and AC with regard to haplotype number and frequency. The ratio of EC to AC was 1:3. Haplotype frequencies from EC and AC samples were convergent across haplotypes, and their distribution was not statistically different between the groups. Most duplications required EC, as only expansions with homozygous or hemizygous haplotypes could be called automatically. High-complexity laboratories can offer equivalent interpretation to automated calling for non-expanded CYP2D6 loci, and superior interpretation for duplications. We have validated scientific expert calling specified by scoring rules as standard operating procedure integrated with an automated calling algorithm. The integration of EC with AC is a practical strategy for CYP2D6 clinical haplotyping.

  20. The effects of old and recent migration waves in the distribution of HBB*S globin gene haplotypes

    PubMed Central

    Lindenau, Juliana D.; Wagner, Sandrine C.; de Castro, Simone M.; Hutz, Mara H.

    2016-01-01

    Sickle cell hemoglobin is the result of a mutation at the sixth amino acid position of the beta (β) globin chain. The HBB*S gene is in linkage disequilibrium with five main haplotypes in the β-globin-like gene cluster named according to their ethnic and geographic origins: Bantu (CAR), Benin (BEN), Senegal (SEN), Cameroon (CAM) and Arabian-Indian (ARAB). These haplotypes demonstrated that the sickle cell mutation arose independently at least five times in human history. The distribution of βS haplotypes among Brazilian populations showed a predominance of the CAR haplotype. American populations were clustered in two groups defined by CAR or BEN haplotype frequencies. This scenario is compatible with historical records about the slave trade in the Americas. When all world populations where the sickle cell gene occurs were analyzed, three clusters were disclosed based on CAR, BEN or ARAB haplotype predominance. These patterns may change in the next decades due to recent migration waves. Since these haplotypes show different clinical characteristics, these recent migration events raise the necessity to develop optimized public health programs for sickle cell disease screening and management. PMID:27706371

  1. Haplotype Reconstruction in Large Pedigrees with Many Untyped Individuals

    NASA Astrophysics Data System (ADS)

    Li, Xin; Li, Jing

    Haplotypes, as they specify the linkage patterns between dispersed genetic variations, provide important information for understanding the genetics of human traits. However, haplotypes are not directly available from current genotyping platforms, and hence there are extensive investigations of computational methods to recover such information. Two major computational challenges arising in current family-based disease studies are large family sizes and many ungenotyped family members. Traditional haplotyping methods can handle neither large families nor families with missing members. In this paper, we propose a method which addresses these issues by integrating multiple novel techniques. The method consists of three major components: pairwise identical-by-descent (IBD) inference, global IBD reconstruction and haplotype restoring. By reconstructing the global IBD of a family from pairwise IBD and then restoring the haplotypes based on the inferred IBD, this method can scale to large pedigrees, and more importantly it can handle families with missing members. Compared with existing methods, this method demonstrates much higher power to recover haplotype information, especially in families with many untyped individuals.

  2. Population-specific recombination sites within the human MHC region.

    PubMed

    Lam, T H; Shen, M; Chia, J-M; Chan, S H; Ren, E C

    2013-08-01

    Genetic rearrangement by recombination is one of the major driving forces for genome evolution, and recombination is known to occur in non-random, discrete recombination sites within the genome. Mapping of recombination sites has proved to be difficult, particularly in the human MHC region, which is complicated by both population variation and highly polymorphic HLA genes. To overcome these problems, HLA-typed individuals from three representative populations (Asian, European and African) were used to generate phased HLA haplotypes. Extended haplotype homozygosity (EHH) plots constructed from the phased haplotype data revealed discrete EHH drops corresponding to recombination events, and these signatures were observed to be different for each population. Surprisingly, the majority of recombination sites detected are unique to each population, rather than being common. Unique recombination sites account for 56.8% (21/37 of total sites) in the Asian cohort, 50.0% (15/30 sites) in Europeans and 63.2% (24/38 sites) in Africans. Validation carried out at a known sperm typing recombination site of 45 kb (HLA-F-telomeric) showed that EHH was an efficient method to narrow the recombination region to 826 bp, and this was further refined to 660 bp by resequencing. This approach significantly enhanced mapping of the genomic architecture within the human MHC, and will be useful in studies to identify disease risk genes.

  3. The influence of maternal lineages on social affiliations among humpback whales (Megaptera novaeangliae) on their feeding grounds in the southern gulf of Maine.

    PubMed

    Weinrich, Mason T; Rosenbaum, Howard; Scott Baker, C; Blackmer, Alexis L; Whitehead, Hal

    2006-01-01

    Humpback whales on their feeding grounds in the Gulf of Maine typically form fluid fission/fusion groups of two to three individuals characterized by noncompetitive and, at times, cooperative behavior. Here we test the hypothesis that, despite the apparent absence of close kinship bonds, the fluid associations between feeding whales are influenced by "maternal lineages" as represented by mtDNA haplotypes. Using skin samples collected with a biopsy dart, variation in the hypervariable segment of the mtDNA control region identified 17 unique haplotypes among 159 individually identified whales from the southern Gulf of Maine. The haplotypes of a further 143 individuals were inferred from known direct maternal (cow-calf) relationships. The frequencies of associations among these 302 individuals were calculated from 21,617 sighting records collected from 1980 to 1995, excluding associations between a cow and her dependent calf. For groups of two where the haplotypes of both individuals were known (n = 3,151), individuals with the same haplotype were together significantly more often (26%) than expected by random association (20%). To account for different group sizes and associations with individuals of unknown haplotype and sex, we used Monte Carlo simulations to test for nonrandom associations in the full data set, as well as known female-only (n = 1,512), male-only (n = 730), and mixed-sex (n = 2,745) groups. Within-haplotype associations were significantly more frequent than expected at random for all groups (P = .002) and female-only groups (P = .011) but not male-only groups, while mixed-sex groups approached significance (P = .062). A Mantel test of individual pairwise association indices and haplotype identity confirmed that within-haplotype associations were more frequent than expected for all sex combinations except male-male associations, with females forming within-haplotype associations 1.7 times more often than expected by random assortment. 
Partial matrix correlations and permutation analyses indicated that the skew toward within-haplotype associations could not be accounted for by short-term temporal co-occurrence or fine-scale spatial distributions of individuals with shared haplotypes. While the mechanism by which individuals with a common mtDNA haplotype assort remains unknown, our results strongly suggest an influence of maternal lineages on the social organization of humpback whales within a regional feeding ground.

  4. Batch Scheduling for Hybrid Assembly Differentiation Flow Shop to Minimize Total Actual Flow Time

    NASA Astrophysics Data System (ADS)

    Maulidya, R.; Suprayogi; Wangsaputra, R.; Halim, A. H.

    2018-03-01

    A hybrid assembly differentiation flow shop is a three-stage flow shop consisting of machining, assembly and differentiation stages and producing different types of products. In the machining stage, parts are processed in batches on different (unrelated) machines. In the assembly stage, the different parts are assembled into an assembly product. Finally, the assembled products are further processed into different types of final products in the differentiation stage. In this paper, we develop a batch scheduling model for a hybrid assembly differentiation flow shop to minimize the total actual flow time, defined as the total time parts spend on the shop floor from their arrival times until their due dates. We also propose a heuristic algorithm for solving the problem. The proposed algorithm is tested using a set of hypothetical data. The solution shows that the algorithm can solve the problem effectively.

  5. Single molecule sequencing-guided scaffolding and correction of draft assemblies.

    PubMed

    Zhu, Shenglong; Chen, Danny Z; Emrich, Scott J

    2017-12-06

    Although single molecule sequencing is still improving, the lengths of the generated sequences are inevitably an advantage in genome assembly. Prior work that utilizes long reads to conduct genome assembly has mostly focused on correcting sequencing errors and improving contiguity of de novo assemblies. We propose a disassembling-reassembling approach for both correcting structural errors in the draft assembly and scaffolding a target assembly based on error-corrected single molecule sequences. To achieve this goal, we formulate a maximum alternating path cover problem. We prove that this problem is NP-hard, and solve it by a 2-approximation algorithm. Our experimental results show that our approach can improve the structural correctness of target assemblies at the cost of some contiguity, even with smaller amounts of long reads. In addition, our reassembling process can also serve as a competitive scaffolder relative to well-established assembly benchmarks.

  6. Single Marker and Haplotype-Based Association Analysis of Semolina and Pasta Colour in Elite Durum Wheat Breeding Lines Using a High-Density Consensus Map

    PubMed Central

    Haile, Jemanesh K.; Cory, Aron T.; Clarke, Fran R.; Clarke, John M.; Knox, Ron E.; Pozniak, Curtis J.

    2017-01-01

    Association mapping is usually performed by testing the correlation between a single marker and phenotypes. However, because patterns of variation within genomes are inherited as blocks, clustering markers into haplotypes for genome-wide scans could be a worthwhile approach to improve statistical power to detect associations. The availability of high-density molecular data allows the possibility to assess the potential of both approaches to identify marker-trait associations in durum wheat. In the present study, we used single marker- and haplotype-based approaches to identify loci associated with semolina and pasta colour in durum wheat, the main objective being to evaluate the potential benefits of haplotype-based analysis for identifying quantitative trait loci. One hundred sixty-nine durum lines were genotyped using the Illumina 90K Infinium iSelect assay, and 12,234 polymorphic single nucleotide polymorphism (SNP) markers were generated and used to assess the population structure and the linkage disequilibrium (LD) patterns. A total of 8,581 SNPs previously localized to a high-density consensus map were clustered into 406 haplotype blocks based on the average LD distance of 5.3 cM. Combining multiple SNPs into haplotype blocks increased the average polymorphism information content (PIC) from 0.27 per SNP to 0.50 per haplotype. The haplotype-based analysis identified 12 loci associated with grain pigment colour traits, including the five loci identified by the single marker-based analysis. Furthermore, the haplotype-based analysis resulted in an increase of the phenotypic variance explained (50.4% on average) and the allelic effect (33.7% on average) when compared to single marker analysis. The presence of multiple allelic combinations within each haplotype locus offers potential for screening the most favorable haplotype series and may facilitate marker-assisted selection of grain pigment colour in durum wheat. 
These results suggest a benefit of haplotype-based analysis over single marker analysis to detect loci associated with colour traits in durum wheat. PMID:28135299

  7. DLA Class II Alleles Are Associated with Risk for Canine Symmetrical Lupoid Onychodystropy (SLO)

    PubMed Central

    Wilbe, Maria; Ziener, Martine Lund; Aronsson, Anita; Harlos, Charlotte; Sundberg, Katarina; Norberg, Elin; Andersson, Lisa; Lindblad-Toh, Kerstin; Hedhammar, Åke; Andersson, Göran; Lingaas, Frode

    2010-01-01

    Symmetrical lupoid onychodystrophy (SLO) is an immune-mediated disease in dogs affecting the claws with a suggested autoimmune aetiology. Sequence-based genotyping of the polymorphic exon 2 from DLA-DRB1, -DQA1, and -DQB1 class II loci was performed in a total of 98 SLO Gordon setter cases and 98 healthy controls. A risk haplotype (DRB1*01801/DQA1*00101/DQB1*00802) was present in 53% of cases and 34% of controls and conferred an elevated risk of developing SLO with an odds ratio (OR) of 2.1. When dogs homozygous for the risk haplotype were compared to all dogs not carrying the haplotype the OR was 5.4. However, a stronger protective haplotype (DRB1*02001/DQA1*00401/DQB1*01303, OR = 0.03, 1/OR = 33) was present in 16.8% of controls, but only in a single case (0.5%). The effect of the protective haplotype was clearly stronger than that of the risk haplotype, since 11.2% of the controls were heterozygous for the risk and protective haplotypes, whereas this combination was absent from cases. When the dogs with the protective haplotype were excluded, an OR of 2.5 was obtained when dogs homozygous for the risk haplotype were compared to those heterozygous for the risk haplotype, suggesting a co-dominant effect of the risk haplotype. In smaller samples of the bearded collie and giant schnauzer breeds we found the same or similar haplotypes, sharing the same DQA1 allele, over-represented among the cases, suggesting that the risk is associated primarily with DLA-DQ. We obtained conclusive results that DLA class II is significantly associated with risk of developing SLO in Gordon setters, thus supporting that SLO is an immune-mediated disease. Further studies of SLO in dogs may provide important insight into immune privilege of the nail apparatus and also knowledge about a number of inflammatory disorders of the nail apparatus like lichen planus, psoriasis, alopecia areata and onycholysis. PMID:20808798

  8. Solving Assembly Sequence Planning using Angle Modulated Simulated Kalman Filter

    NASA Astrophysics Data System (ADS)

    Mustapa, Ainizar; Yusof, Zulkifli Md.; Adam, Asrul; Muhammad, Badaruddin; Ibrahim, Zuwairie

    2018-03-01

    This paper presents an implementation of the Simulated Kalman Filter (SKF) algorithm for optimizing an Assembly Sequence Planning (ASP) problem. The SKF search strategy contains three simple steps: predict, measure, and estimate. The main objective of ASP is to determine the sequence of component installation that shortens assembly time or saves assembly costs. Initially, a permutation sequence is generated to represent each agent. Each agent is then subjected to a precedence matrix constraint to produce a feasible assembly sequence. Next, the Angle Modulated SKF (AMSKF) is proposed for solving the ASP problem. The main idea of the angle modulated approach to solving a combinatorial optimization problem is to use a function, g(x), to create a continuous signal. The performance of the proposed AMSKF is compared against previous works that solved ASP by applying BGSA, BPSO, and MSPSO. Using a case study of ASP, the results show that AMSKF outperformed all the algorithms in obtaining the best solution.
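
    The precedence-matrix constraint mentioned above can be sketched as a feasibility check on a candidate sequence. This is only an illustration: the matrix orientation, the four-component example, and the name `is_feasible` are assumptions, and the abstract does not say how the authors repair infeasible agents.

```python
from typing import List, Sequence


def is_feasible(sequence: Sequence[int], precedence: List[List[int]]) -> bool:
    """Return True if every component is installed after all of its predecessors.

    Assumed convention: precedence[i][j] == 1 means component j must be
    installed before component i.
    """
    installed = set()
    for part in sequence:
        for other, required in enumerate(precedence[part]):
            if required and other not in installed:
                return False  # a required predecessor is still missing
        installed.add(part)
    return True


# Hypothetical product: 0 before 1, 0 before 2, and both 1 and 2 before 3.
PRECEDENCE = [
    [0, 0, 0, 0],
    [1, 0, 0, 0],
    [1, 0, 0, 0],
    [0, 1, 1, 0],
]

feasible = is_feasible([0, 2, 1, 3], PRECEDENCE)
infeasible = is_feasible([1, 0, 2, 3], PRECEDENCE)
print(feasible, infeasible)
```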

  9. Automated assembly in space

    NASA Technical Reports Server (NTRS)

    Srivastava, Sandanand; Dwivedi, Suren N.; Soon, Toh Teck; Bandi, Reddy; Banerjee, Soumen; Hughes, Cecilia

    1989-01-01

    The installation of robots and their use of assembly in space will create an exciting and promising future for the U.S. Space Program. The concept of assembly in space is very complicated and error prone and it is not possible unless the various parts and modules are suitably designed for automation. Certain guidelines are developed for part designing and for an easy precision assembly. Major design problems associated with automated assembly are considered and solutions to resolve these problems are evaluated in the guidelines format. Methods for gripping and methods for part feeding are developed with regard to the absence of gravity in space. The guidelines for part orientation, adjustments, compliances and various assembly construction are discussed. Design modifications of various fasteners and fastening methods are also investigated.

  10. Denoising DNA deep sequencing data—high-throughput sequencing errors and their correction

    PubMed Central

    Laehnemann, David; Borkhardt, Arndt

    2016-01-01

    Characterizing the errors generated by common high-throughput sequencing platforms and telling true genetic variation from technical artefacts are two interdependent steps, essential to many analyses such as single nucleotide variant calling, haplotype inference, sequence assembly and evolutionary studies. Both random and systematic errors can show a specific occurrence profile for each of the six prominent sequencing platforms surveyed here: 454 pyrosequencing, Complete Genomics DNA nanoball sequencing, Illumina sequencing by synthesis, Ion Torrent semiconductor sequencing, Pacific Biosciences single-molecule real-time sequencing and Oxford Nanopore sequencing. There is a large variety of programs available for error removal in sequencing read data, which differ in the error models and statistical techniques they use, the features of the data they analyse, the parameters they determine from them and the data structures and algorithms they use. We highlight the assumptions they make and for which data types these hold, providing guidance which tools to consider for benchmarking with regard to the data properties. While no benchmarking results are included here, such specific benchmarks would greatly inform tool choices and future software development. The development of stand-alone error correctors, as well as single nucleotide variant and haplotype callers, could also benefit from using more of the knowledge about error profiles and from (re)combining ideas from the existing approaches presented here. PMID:26026159

  11. Temporal fluctuation in North East Baltic Sea region cattle population revealed by mitochondrial and Y-chromosomal DNA analyses.

    PubMed

    Niemi, Marianna; Bläuer, Auli; Iso-Touru, Terhi; Harjula, Janne; Nyström Edmark, Veronica; Rannamäe, Eve; Lõugas, Lembi; Sajantila, Antti; Lidén, Kerstin; Taavitsainen, Jussi-Pekka

    2015-01-01

    Ancient DNA analysis offers a way to detect changes in populations over time. To date, most studies of ancient cattle have focused on their domestication in prehistory, while only a limited number of studies have analysed later periods. Conversely, the genetic structure of modern cattle populations is well known given the undertaking of several molecular and population genetic studies. Bones and teeth from ancient cattle populations from the North-East Baltic Sea region dated to the Prehistoric (Late Bronze and Iron Age, 5 samples), Medieval (14), and Post-Medieval (26) periods were investigated by sequencing 667 base pairs (bp) from the mitochondrial DNA (mtDNA) and 155 bp of intron 19 in the Y-chromosomal UTY gene. Comparison of maternal (mtDNA haplotypes) genetic diversity in ancient cattle (45 samples) with modern cattle populations in Europe and Asia (2094 samples) revealed 30 ancient mtDNA haplotypes, 24 of which were shared with modern breeds, while 6 were unique to the ancient samples. Of seven Y-chromosomal sequences determined from ancient samples, six were Y2 and one Y1 haplotype. Combined data including Swedish samples from the same periods (64 samples) was compared with the occurrence of Y-chromosomal haplotypes in modern cattle (1614 samples). The diversity of haplogroups was highest in the Prehistoric samples, where many haplotypes were unique. The Medieval and Post-Medieval samples also show a high diversity with new haplotypes. Some of these haplotypes have become frequent in modern breeds in the Nordic Countries and North-Western Russia while other haplotypes have remained in only a few local breeds or seem to have been lost. A temporal shift in Y-chromosomal haplotypes from Y2 to Y1 was detected that corresponds with the appearance of new mtDNA haplotypes in the Medieval and Post-Medieval period. This suggests a replacement of the Prehistoric mtDNA and Y chromosomal haplotypes by new types of cattle.

  12. Effects of IL-10 haplotype and atomic bomb radiation exposure on gastric cancer risk.

    PubMed

    Hayashi, Tomonori; Ito, Reiko; Cologne, John; Maki, Mayumi; Morishita, Yukari; Nagamura, Hiroko; Sasaki, Keiko; Hayashi, Ikue; Imai, Kazue; Yoshida, Kengo; Kajimura, Junko; Kyoizumi, Seishi; Kusunoki, Yoichiro; Ohishi, Waka; Fujiwara, Saeko; Akahoshi, Masazumi; Nakachi, Kei

    2013-07-01

    Gastric cancer (GC) is one of the cancers that reveal increased risk of mortality and incidence in atomic bomb survivors. The incidence of gastric cancer in the Life Span Study cohort of the Radiation Effects Research Foundation (RERF) increased with radiation dose (gender-averaged excess relative risk per Gy = 0.28) and remains high more than 65 years after exposure. To assess a possible role of gene-environment interaction, we examined the dose response for gastric cancer incidence based on immunosuppression-related IL-10 genotype, in a cohort study with 200 cancer cases (93 intestinal, 96 diffuse and 11 other types) among 4,690 atomic bomb survivors participating in an immunological substudy. Using a single haplotype block composed of four haplotype-tagging SNPs (comprising the major haplotype allele IL-10-ATTA and the minor haplotype allele IL-10-GGCG, which are categorized by IL-10 polymorphisms at -819A>G and -592T>G, +1177T>C and +1589A>G), multiplicative and additive models for joint effects of radiation and this IL-10 haplotyping were examined. The IL-10 minor haplotype allele(s) was a risk factor for intestinal type gastric cancer but not for diffuse type gastric cancer. Radiation was not associated with intestinal type gastric cancer. In diffuse type gastric cancer, the haplotype-specific excess relative risk (ERR) for radiation was statistically significant only in the major homozygote category of IL-10 (ERR = 0.46/Gy, P = 0.037), whereas estimated ERR for radiation with the minor IL-10 homozygotes was close to 0 and nonsignificant. Thus, the minor IL-10 haplotype might act to reduce the radiation related risk of diffuse-type gastric cancer. The results suggest that this IL-10 haplotyping might be involved in development of radiation-associated gastric cancer of the diffuse type, and that IL-10 haplotypes may explain individual differences in the radiation-related risk of gastric cancer. © 2013 by Radiation Research Society

  13. Temporal Fluctuation in North East Baltic Sea Region Cattle Population Revealed by Mitochondrial and Y-Chromosomal DNA Analyses

    PubMed Central

    Niemi, Marianna; Bläuer, Auli; Iso-Touru, Terhi; Harjula, Janne; Nyström Edmark, Veronica; Rannamäe, Eve; Lõugas, Lembi; Sajantila, Antti; Lidén, Kerstin; Taavitsainen, Jussi-Pekka

    2015-01-01

    Background: Ancient DNA analysis offers a way to detect changes in populations over time. To date, most studies of ancient cattle have focused on their domestication in prehistory, while only a limited number of studies have analysed later periods. Conversely, the genetic structure of modern cattle populations is well known given the undertaking of several molecular and population genetic studies. Results: Bones and teeth from ancient cattle populations from the North-East Baltic Sea region dated to the Prehistoric (Late Bronze and Iron Age, 5 samples), Medieval (14), and Post-Medieval (26) periods were investigated by sequencing 667 base pairs (bp) from the mitochondrial DNA (mtDNA) and 155 bp of intron 19 in the Y-chromosomal UTY gene. Comparison of maternal (mtDNA haplotypes) genetic diversity in ancient cattle (45 samples) with modern cattle populations in Europe and Asia (2094 samples) revealed 30 ancient mtDNA haplotypes, 24 of which were shared with modern breeds, while 6 were unique to the ancient samples. Of seven Y-chromosomal sequences determined from ancient samples, six were Y2 and one Y1 haplotype. Combined data including Swedish samples from the same periods (64 samples) was compared with the occurrence of Y-chromosomal haplotypes in modern cattle (1614 samples). Conclusions: The diversity of haplogroups was highest in the Prehistoric samples, where many haplotypes were unique. The Medieval and Post-Medieval samples also show a high diversity with new haplotypes. Some of these haplotypes have become frequent in modern breeds in the Nordic Countries and North-Western Russia while other haplotypes have remained in only a few local breeds or seem to have been lost. A temporal shift in Y-chromosomal haplotypes from Y2 to Y1 was detected that corresponds with the appearance of new mtDNA haplotypes in the Medieval and Post-Medieval period. This suggests a replacement of the Prehistoric mtDNA and Y chromosomal haplotypes by new types of cattle. PMID:25992976

  14. SLC22A1-ABCB1 haplotype profiles predict imatinib pharmacokinetics in Asian patients with chronic myeloid leukemia.

    PubMed

    Singh, Onkar; Chan, Jason Yongsheng; Lin, Keegan; Heng, Charles Chuah Thuan; Chowbay, Balram

    2012-01-01

    This study aimed to explore the influence of SLC22A1, PXR, ABCG2, ABCB1 and CYP3A5*3 genetic polymorphisms on imatinib mesylate (IM) pharmacokinetics in Asian patients with chronic myeloid leukemia (CML). Healthy subjects belonging to three Asian populations (Chinese, Malay, Indian; n = 70 each) and CML patients (n = 38) were enrolled in a prospective pharmacogenetics study. Imatinib trough (C(0h)) and clearance (CL) were determined in the patients at steady state. Haplowalk method was applied to infer the haplotypes and generalized linear model (GLM) to estimate haplotypic effects on IM pharmacokinetics. Association of haplotype copy numbers with IM pharmacokinetics was defined by Mann-Whitney U test. Global haplotype score statistics revealed a SLC22A1 sub-haplotypic region encompassing three polymorphisms (rs3798168, rs628031 and IVS7+850C>T), to be significantly associated with IM clearance (p = 0.013). Haplotype-specific GLM estimated that the haplotypes AGT and CGC were both associated with 22% decrease in clearance compared to CAC [CL (10^-2 L/hr/mg): CAC vs AGT: 4.03 vs 3.16, p = 0.017; CAC vs CGC: 4.03 vs 3.15, p = 0.017]. Patients harboring 2 copies of AGT or CGC haplotypes had 33.4% lower clearance and 50% higher C(0h) than patients carrying 0 or 1 copy [CL (10^-2 L/hr/mg): 2.19 vs 3.29, p = 0.026; C(0h) (10^-6 1/ml): 4.76 vs 3.17, p = 0.013, respectively]. Further subgroup analysis revealed SLC22A1 and ABCB1 haplotypic combinations to be significantly associated with clearance and C(0h) (p = 0.002 and 0.009, respectively). This exploratory study suggests that SLC22A1-ABCB1 haplotypes may influence IM pharmacokinetics in Asian CML patients.
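
    The percentage differences quoted in this abstract follow from the bracketed values by simple arithmetic. The snippet below only re-derives them from the numbers printed above; the helper name `percent_change` is hypothetical.

```python
def percent_change(reference: float, value: float) -> float:
    """Relative change of `value` with respect to `reference`, in percent."""
    return 100.0 * (value - reference) / reference


# Clearance, AGT haplotype versus CAC (4.03 vs 3.16).
cl_agt = percent_change(4.03, 3.16)

# Clearance, 2 copies versus 0 or 1 copy (3.29 vs 2.19).
cl_two_copies = percent_change(3.29, 2.19)

# Trough concentration, 2 copies versus 0 or 1 copy (3.17 vs 4.76).
c0h_two_copies = percent_change(3.17, 4.76)

print(round(cl_agt, 1), round(cl_two_copies, 1), round(c0h_two_copies, 1))
```

    The results are about -21.6%, -33.4% and +50.2%, consistent with the "22% decrease", "33.4% lower clearance" and "50% higher C(0h)" stated in the abstract.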

  15. Molecular identification and first report of mitochondrial COI gene haplotypes in the hawksbill turtle Eretmochelys imbricata (Testudines: Cheloniidae) in the Colombian Caribbean nesting colonies.

    PubMed

    Daza-Criado, L; Hernández-Fernández, J

    2014-02-21

    Hawksbill sea turtles Eretmochelys imbricata are found extensively around the world, including the Atlantic, Pacific, and Indian Oceans; the Persian Gulf, and the Red and Mediterranean Seas. Populations of this species are affected by international trafficking of their shields, meat, and eggs, making it a critically endangered animal. We determined the haplotypes of 17 hawksbill foraging turtles of Islas del Rosario (Bolivar) and of the nesting beach Don Diego (Magdalena) in the Colombian Caribbean based on amplification and sequencing of the mitochondrial gene cytochrome oxidase c subunit I (COI). We identified 5 haplotypes, including EI-A1 previously reported in Puerto Rico, which was similar to 10 of the study samples. To our knowledge, the remaining 4 haplotypes have not been described. Samples EICOI11 and EICOI3 showed 0.2% divergence from EI-A1, by a single nucleotide change, and were classified as the EI-A2 haplotype. EICOI6, EICOI14, and EICOI12 samples showed 0.2% divergence from EI-A1 and 0.3% divergence from EI-A2 and were classified as EI-A3 haplotype. Samples EICOI16 and EICOI15 presented 5 nucleotide changes each and were classified as 2 different haplotypes, EI-A4 and EI-A5, respectively. The last 2 haplotypes had higher nucleotide diversity (K2P=1.7%) than that by the first 3 haplotypes. EI-A1 and EI-A2 occurred in nesting individuals, and EI-A2, EI-A3, EI-A4, and EI-A5 occurred in foraging individuals. The description of the haplotypes may be associated with reproductive migrations or foraging and could support the hypothesis of natal homing. Furthermore, they can be used in phylogeographic studies.

  16. Performance of Single Nucleotide Polymorphisms versus Haplotypes for Genome-Wide Association Analysis in Barley

    PubMed Central

    Jannink, Jean-Luc

    2010-01-01

    Genome-wide association studies (GWAS) may benefit from utilizing haplotype information for making marker-phenotype associations. Several rationales for grouping single nucleotide polymorphisms (SNPs) into haplotype blocks exist, but any advantage may depend on such factors as genetic architecture of traits, patterns of linkage disequilibrium in the study population, and marker density. The objective of this study was to explore the utility of haplotypes for GWAS in barley (Hordeum vulgare) to offer a first detailed look at this approach for identifying agronomically important genes in crops. To accomplish this, we used genotype and phenotype data from the Barley Coordinated Agricultural Project and constructed haplotypes using three different methods. Marker-trait associations were tested by the efficient mixed-model association algorithm (EMMA). When QTL were simulated using single SNPs dropped from the marker dataset, a simple sliding window performed as well or better than single SNPs or the more sophisticated methods of blocking SNPs into haplotypes. Moreover, the haplotype analyses performed better 1) when QTL were simulated as polymorphisms that arose subsequent to marker variants, and 2) in analysis of empirical heading date data. These results demonstrate that the information content of haplotypes is dependent on the particular mutational and recombinational history of the QTL and nearby markers. Analysis of the empirical data also confirmed our intuition that the distribution of QTL alleles in nature is often unlike the distribution of marker variants, and hence utilizing haplotype information could capture associations that would elude single SNPs. We recommend routine use of both single SNP and haplotype markers for GWAS to take advantage of the full information content of the genotype data. PMID:21124933
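
    A simple sliding window of the kind mentioned above can be sketched as follows. This is an illustration under stated assumptions, not the study's pipeline: each line is assumed to be phased (or fully inbred) and encoded as a string of SNP alleles, and the function name and toy data are hypothetical.

```python
from typing import List


def sliding_window_haplotypes(lines: List[str], window: int) -> List[List[str]]:
    """For each window start, return the haplotype carried by every line.

    lines[k] is one phased chromosome (or inbred line) written as a string
    of SNP alleles; all strings are assumed to have the same length.
    """
    n_snps = len(lines[0])
    return [
        [line[start:start + window] for line in lines]
        for start in range(n_snps - window + 1)
    ]


# Three hypothetical lines genotyped at four SNPs, window of three SNPs.
windows = sliding_window_haplotypes(["AACG", "AATG", "GACG"], window=3)
print(windows)
```

    Each inner list can then be treated as a multi-allelic marker in the association test, in place of the individual SNPs it spans.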

  17. Molecular phylogenetic identification of Fasciola flukes in Nepal.

    PubMed

    Shoriki, Takuya; Ichikawa-Seki, Madoka; Devkota, Bhuminand; Rana, Hari B; Devkota, Shiva P; Humagain, Sudeep K; Itagaki, Tadashi

    2014-12-01

    Eighty-one Fasciola flukes collected from 8 districts in Nepal were analyzed for their species identification on the basis of their spermatogenic status and nuclear ribosomal internal transcribed spacer 1 (ITS1) and for their phylogenetic relation with Fasciola flukes from other Asian countries on the basis of the mitochondrial NADH dehydrogenase subunit 1 (nad1) gene. Sixty-one flukes (75.3%) were aspermic Fasciola sp., and 20 flukes (24.7%) were identified as Fasciola gigantica. All of the aspermic flukes displayed the Fh/Fg type in ITS1, which was predominant in aspermic Fasciola sp. from China, and most (60 flukes) displayed the Fsp-ND1-N1 haplotype in the nad1, which had an identical nucleotide sequence to the major haplotype (Fg-C2) of the aspermic flukes from China. These results suggest that aspermic Fasciola sp. was introduced into Nepal from China. Furthermore, the results of the diversity indices, neutrality indices, and median-joining network analysis with reference haplotypes from Asian countries suggest that aspermic Fasciola sp. rapidly expanded its distribution. In contrast, F. gigantica displayed 10 nad1 haplotypes, which showed higher population diversity indices than the haplotypes of aspermic flukes, indicating that the F. gigantica population was clearly distributed in Nepal earlier than the aspermic Fasciola population. Although the F. gigantica haplotypes from Nepal formed a star-like phylogeny consisting of a main founder haplotype (Fg-ND1-N1), together with some F. gigantica haplotypes from Myanmar and Thailand, the Nepal population differed genetically from F. gigantica populations of neighboring countries as each country had distinct founder haplotype(s). Copyright © 2014 Elsevier Inc. All rights reserved.

  18. β-globin haplotypes in normal and hemoglobinopathic individuals from Reconcavo Baiano, State of Bahia, Brazil.

    PubMed

    Dos Santos Silva, Wellington; de Nazaré Klautau-Guimarães, Maria; Grisolia, Cesar Koppe

    2010-07-01

    Five restriction site polymorphisms in the β-globin gene cluster (HincII-5' ε, HindIII-(G) γ, HindIII-(A) γ, HincII- ψβ1 and HincII-3' ψβ1) were analyzed in three populations (n = 114) from Reconcavo Baiano, State of Bahia, Brazil. The groups included two urban populations from the towns of Cachoeira and Maragojipe and one rural Afro-descendant population, known as the "quilombo community", from Cachoeira municipality. The number of haplotypes found in the populations ranged from 10 to 13, which indicated higher diversity than in the parental populations. The haplotypes 2 (+ - - - -), 3 (- - - - +), 4 (- + - - +) and 6 (- + + - +) on the β(A) chromosomes were the most common, and two haplotypes, 9 (- + + + +) and 14 (+ + - - +), were found exclusively in the Maragojipe population. The other haplotypes (1, 5, 9, 11, 12, 13, 14 and 16) had lower frequencies. Restriction site analysis and the derived haplotypes indicated homogeneity among the populations. Thirty-two individuals with hemoglobinopathies (17 sickle cell disease, 12 HbSC disease and 3 HbCC disease) were also analyzed. The haplotype frequencies of these patients differed significantly from those of the general population. In the sickle cell disease subgroup, the predominant haplotypes were BEN (Benin) and CAR (Central African Republic), with frequencies of 52.9% and 32.4%, respectively. The high frequency of the BEN haplotype agreed with the historical origin of the afro-descendant population in the state of Bahia. However, this frequency differed from that of Salvador, the state capital, where the CAR and BEN haplotypes have similar frequencies, probably as a consequence of domestic slave trade and subsequent internal migrations to other regions of Brazil.

  19. β-globin haplotypes in normal and hemoglobinopathic individuals from Reconcavo Baiano, State of Bahia, Brazil

    PubMed Central

    2010-01-01

    Five restriction site polymorphisms in the β-globin gene cluster (HincII-5' ε, HindIII-G γ, HindIII-A γ, HincII- ψβ1 and HincII-3' ψβ1) were analyzed in three populations (n = 114) from Reconcavo Baiano, State of Bahia, Brazil. The groups included two urban populations from the towns of Cachoeira and Maragojipe and one rural Afro-descendant population, known as the “quilombo community”, from Cachoeira municipality. The number of haplotypes found in the populations ranged from 10 to 13, which indicated higher diversity than in the parental populations. The haplotypes 2 (+ - - - -), 3 (- - - - +), 4 (- + - - +) and 6 (- + + - +) on the βA chromosomes were the most common, and two haplotypes, 9 (- + + + +) and 14 (+ + - - +), were found exclusively in the Maragojipe population. The other haplotypes (1, 5, 9, 11, 12, 13, 14 and 16) had lower frequencies. Restriction site analysis and the derived haplotypes indicated homogeneity among the populations. Thirty-two individuals with hemoglobinopathies (17 sickle cell disease, 12 HbSC disease and 3 HbCC disease) were also analyzed. The haplotype frequencies of these patients differed significantly from those of the general population. In the sickle cell disease subgroup, the predominant haplotypes were BEN (Benin) and CAR (Central African Republic), with frequencies of 52.9% and 32.4%, respectively. The high frequency of the BEN haplotype agreed with the historical origin of the afro-descendant population in the state of Bahia. However, this frequency differed from that of Salvador, the state capital, where the CAR and BEN haplotypes have similar frequencies, probably as a consequence of domestic slave trade and subsequent internal migrations to other regions of Brazil. PMID:21637405

  20. The putative oncogene Pim-1 in the mouse: its linkage and variation among t haplotypes.

    PubMed

    Nadeau, J H; Phillips, S J

    1987-11-01

    Pim-1, a putative oncogene involved in T-cell lymphomagenesis, was mapped between the pseudo-alpha globin gene Hba-4ps and the alpha-crystallin gene Crya-1 on mouse chromosome 17 and therefore within the t complex. Pim-1 restriction fragment variants were identified among t haplotypes. Analysis of restriction fragment sizes obtained with 12 endonucleases demonstrated that the Pim-1 genes in some t haplotypes were indistinguishable from the sizes for the Pim-1b allele in BALB/c inbred mice. There are now three genes, Pim-1, Crya-1 and H-2 I-E, that vary among independently derived t haplotypes and that have indistinguishable alleles in t haplotypes and inbred strains. These genes are closely linked within the distal inversion of the t complex. Because it is unlikely that these variants arose independently in t haplotypes and their wild-type homologues, we propose that an exchange of chromosomal segments, probably through double crossing over, was responsible for indistinguishable Pim-1 genes shared by certain t haplotypes and their wild-type homologues. There was, however, no apparent association between variant alleles of these three genes among t haplotypes as would be expected if a single exchange introduced these alleles into t haplotypes. If these variant alleles can be shown to be identical to the wild-type allele, then lack of association suggests that multiple exchanges have occurred during the evolution of the t complex.

  1. High variation and strong phylogeographic pattern among cpDNA haplotypes in Taxus wallichiana (Taxaceae) in China and North Vietnam.

    PubMed

    Gao, L M; Möller, M; Zhang, X-M; Hollingsworth, M L; Liu, J; Mill, R R; Gibby, M; Li, D-Z

    2007-11-01

    We studied the phylogeography of Chinese yew (Taxus wallichiana), a tree species distributed over most of southern China and adjacent regions. A total of 1235 individuals from 50 populations from China and North Vietnam were analysed for chloroplast DNA variation using polymerase chain reaction-restriction fragment length polymorphism of the trnL-F intron-spacer region. A total of 19 different haplotypes were distinguished. We found a very high level of population differentiation and a strong phylogeographic pattern, suggesting low levels of recurrent gene flow among populations. Haplotype differentiation was most marked along the boundary between the Sino-Himalayan and Sino-Japanese Forest floristic subkingdoms, with only one haplotype being shared among these two subkingdoms. The Malesian and Sino-Himalayan Forest subkingdoms had five and 10 haplotypes, respectively, while the relatively large Sino-Japanese Forest subkingdom had only eight. The strong geography-haplotype correlation persisted at the regional floristic level, with most regions possessing a unique set of haplotypes, except for the central China region. Strong landscape effects were observed in the Hengduan and Dabashan mountains, where steep mountains and valleys might have been natural dispersal barriers. The molecular phylogenetic data, together with the geographic distribution of the haplotypes, suggest the existence of several localized refugia during the last glaciation from which the present-day distribution may be derived. The pattern of haplotype distribution across China and North Vietnam corresponded well with the current taxonomic delineation of the three intraspecific varieties of T. wallichiana.

  2. Genetic diversity and geographical structure of the pitcher plant Nepenthes vieillardii in New Caledonia: A chloroplast DNA haplotype analysis.

    PubMed

    Kurata, Kaoruko; Jaffré, Tanguy; Setoguchi, Hiroaki

    2008-12-01

    Among the many species that grow in New Caledonia, the pitcher plant Nepenthes vieillardii (Nepenthaceae) has a high degree of morphological variation. In this study, we present the patterns of genetic differentiation of pitcher plant populations based on chloroplast DNA haplotype analysis using the sequences of five spacers. We analyzed 294 samples from 16 populations covering the entire range of the species, using 4660 bp of sequence. Our analysis identified 17 haplotypes, including one that is widely distributed across the islands, as well as regional and private haplotypes. The greatest haplotype diversity was detected on the eastern coast of the largest island and included several private haplotypes, while haplotype diversity was low in the southern plains region. The parsimony network analysis of the 17 haplotypes suggested that the genetic divergence is the result of long-term isolation of individual populations. Results from a spatial analysis of molecular variance and a cluster analysis suggest that the plants once covered the entire serpentine area of New Caledonia and that subsequent regional fragmentation resulted in the isolation of each population and significantly restricted seed flow. This isolation may have been an important factor in the development of the morphological and genetic variation among pitcher plants in New Caledonia.
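
    Haplotype diversity, as compared between regions above, is commonly summarised with Nei's unbiased estimator, h = n/(n-1) * (1 - sum of squared haplotype frequencies). The abstract does not state which estimator the authors used, so the sketch below is an assumption, and the sample labels are hypothetical.

```python
from collections import Counter
from typing import Iterable


def haplotype_diversity(haplotypes: Iterable[str]) -> float:
    """Nei's unbiased haplotype diversity for one population sample."""
    labels = list(haplotypes)
    n = len(labels)
    if n < 2:
        raise ValueError("at least two sampled individuals are required")
    counts = Counter(labels)
    sum_sq_freq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1.0 - sum_sq_freq)


# Hypothetical population: 6 individuals with H1, 3 with H2, 1 with H3.
sample = ["H1"] * 6 + ["H2"] * 3 + ["H3"]
diversity = haplotype_diversity(sample)
print(round(diversity, 3))
```

    A population fixed for one haplotype scores 0, and a population in which every individual carries a different haplotype scores 1.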

  3. A Genome-Wide Scan for Breast Cancer Risk Haplotypes among African American Women

    PubMed Central

    Song, Chi; Chen, Gary K.; Millikan, Robert C.; Ambrosone, Christine B.; John, Esther M.; Bernstein, Leslie; Zheng, Wei; Hu, Jennifer J.; Ziegler, Regina G.; Nyante, Sarah; Bandera, Elisa V.; Ingles, Sue A.; Press, Michael F.; Deming, Sandra L.; Rodriguez-Gil, Jorge L.; Chanock, Stephen J.; Wan, Peggy; Sheng, Xin; Pooler, Loreall C.; Van Den Berg, David J.; Le Marchand, Loic; Kolonel, Laurence N.; Henderson, Brian E.; Haiman, Chris A.; Stram, Daniel O.

    2013-01-01

    Genome-wide association studies (GWAS) simultaneously investigating hundreds of thousands of single nucleotide polymorphisms (SNP) have become a powerful tool in the investigation of new disease susceptibility loci. Haplotypes are sometimes thought to be superior to SNPs and are promising in genetic association analyses. The application of genome-wide haplotype analysis, however, is hindered by the complexity of haplotypes themselves and sophistication in computation. We systematically analyzed the haplotype effects for breast cancer risk among 5,761 African American women (3,016 cases and 2,745 controls) using a sliding window approach on the genome-wide scale. Three regions on chromosomes 1, 4 and 18 exhibited moderate haplotype effects. Furthermore, among 21 breast cancer susceptibility loci previously established in European populations, 10p15 and 14q24 are likely to harbor novel haplotype effects. We also proposed a heuristic of determining the significance level and the effective number of independent tests by the permutation analysis on chromosome 22 data. It suggests that the effective number was approximately half of the total (7,794 out of 15,645), thus the half number could serve as a quick reference to evaluating genome-wide significance if a similar sliding window approach of haplotype analysis is adopted in similar populations using similar genotype density. PMID:23468962
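
    The effect of using an effective number of tests can be shown with a short calculation. The sketch assumes a Bonferroni-style threshold at a family-wise level of 0.05, which the abstract does not specify; only the two test counts (15,645 and 7,794) come from the text.

```python
ALPHA = 0.05            # assumed family-wise error rate
TOTAL_TESTS = 15_645    # sliding-window tests on chromosome 22 (from the abstract)
EFFECTIVE_TESTS = 7_794  # effective number of independent tests (from the abstract)

# Per-test significance thresholds under a Bonferroni-style correction.
naive_threshold = ALPHA / TOTAL_TESTS
effective_threshold = ALPHA / EFFECTIVE_TESTS

# Fraction of tests that behave as independent.
effective_fraction = EFFECTIVE_TESTS / TOTAL_TESTS

print(f"{naive_threshold:.2e} {effective_threshold:.2e} {effective_fraction:.3f}")
```

    Because the effective number is about half of the total, the per-test threshold is roughly twice as permissive as a correction based on the raw number of windows.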

  4. β-globin gene cluster haplotypes in ethnic minority populations of southwest China

    PubMed Central

    Sun, Hao; Liu, Hongxian; Huang, Kai; Lin, Keqin; Huang, Xiaoqin; Chu, Jiayou; Ma, Shaohui; Yang, Zhaoqing

    2017-01-01

    The genetic diversity and relationships among ethnic minority populations of southwest China were investigated using seven polymorphic restriction enzyme sites in the β-globin gene cluster. The haplotypes of 1392 chromosomes from ten ethnic populations living in southwest China were determined. Linkage equilibrium and recombination hotspot were found between the 5′ sites and 3′ sites of the β-globin gene cluster. 5′ haplotypes 2 (+−−−−), 6 (−++−+), 9 (−++++) and 3′ haplotype FW3 (−+) were the predominant haplotypes. Notably, haplotype 9 frequency was significantly high in the southwest populations, indicating their difference with other Chinese. The interpopulation differentiation of southwest Chinese minority populations is less than those in populations of northern China and other continents. Phylogenetic analysis shows that populations sharing same ethnic origin or language clustered to each other, indicating current β-globin cluster diversity in the Chinese populations reflects their ethnic origin and linguistic affiliations to a great extent. This study characterizes β-globin gene cluster haplotypes in southwest Chinese minorities for the first time, and reveals the genetic variability and affinity of these populations using β-globin cluster haplotype frequencies. The results suggest that ethnic origin plays an important role in shaping variations of the β-globin gene cluster in the southwestern ethnic populations of China. PMID:28205625

  5. Ancient mitochondrial haplotypes and evidence for intragenic recombination in a gynodioecious plant.

    PubMed

    Städler, Thomas; Delph, Lynda F

    2002-09-03

    Because of their extremely low nucleotide mutation rates, plant mitochondrial genes are generally not expected to show variation within species. Remarkably, we found nine distinct cytochrome b sequence haplotypes in the gynodioecious alpine plant Silene acaulis, with two or more haplotypes coexisting locally in each of three sampled regions. Moreover, there is evidence for intragenic recombination in the history of the haplotype sample, implying at least transient heteroplasmy of mitochondrial DNA (mtDNA). Heteroplasmy might be achieved by one of two potential mechanisms, either continuous coexistence of subgenomic fragments in low stoichiometry, or occasional paternal leakage of mtDNA. On the basis of levels of synonymous nucleotide substitutions, the average divergence time between haplotypes is estimated to be at least 15 million years. Ancient coalescence of extant haplotypes is further indicated by the paucity of fixed differences in haplotypes obtained from related species, a pattern expected under trans-specific evolution. Our data are consistent with models of frequency-dependent selection on linked cytoplasmic male-sterility factors, the putative molecular basis of females in gynodioecious populations. However, associations between marker loci and the inferred male-sterility genes can be maintained only with very low rates of recombination. Heteroplasmy and recombination between divergent haplotypes imply unexplored consequences for the evolutionary dynamics of gynodioecy, a widespread plant breeding system.

  6. Identification and genetic effect of haplotype in the bovine BMP7 gene.

    PubMed

    Huang, Yong-Zhen; Wang, Xin-Lei; He, Hua; Lan, Xian-Yong; Lei, Chu-Zhao; Zhang, Chun-Lei; Chen, Hong

    2013-12-15

    Bone morphogenetic proteins (BMPs) are peptide growth factors belonging to the transforming growth factor-beta (TGF-β) superfamily, and some members of the BMP family support white adipocyte differentiation. In this study, we focused on the BMP7 which singularly promotes the differentiation of brown preadipocytes. Haplotypes involving 5 single nucleotide polymorphism (SNP) sites in the bovine BMP7 gene were identified and their effect on body weight was analyzed. 16 haplotypes and 18 combined haplotypes were revealed and the linkage disequilibrium was assessed in the cattle population with 602 individuals representing three main cattle breeds from China. The results showed that haplotypes 3, 10 and 14 were predominant and accounted for 75.64%, 69.85%, and 83.36% in Nanyang, Qinchuan and Jiaxian cattle breeds, respectively. The statistical analyses indicated that the SNP 1, 4, and 5 are associated with the body weight, body length, and heart girth at 12 and 24 months in Nanyang cattle population (P<0.05), whereas there is no significant association between their 16 haplotypes and 18 combined haplotypes. Our results provide evidence that some SNPs and haplotypes in BMP7 are associated with growth traits, and may be utilized as a genetic marker in marker-assisted selection for beef cattle breeding programs. Copyright © 2013. Published by Elsevier B.V.

  7. The JAK2 GGCC (46/1) Haplotype in Myeloproliferative Neoplasms: Causal or Random?

    PubMed

    Anelli, Luisa; Zagaria, Antonella; Specchia, Giorgina; Albano, Francesco

    2018-04-11

    The germline JAK2 haplotype known as "GGCC or 46/1 haplotype" (haplotype GGCC_46/1) consists of a combination of single nucleotide polymorphisms (SNPs) mapping in a region of about 250 kb, extending from the JAK2 intron 10 to the Insulin-like 4 (INSL4) gene. Four main SNPs (rs3780367, rs10974944, rs12343867, and rs1159782) generating a "GGCC" combination are more frequently indicated to represent the JAK2 haplotype. These SNPs are inherited together and are frequently associated with the onset of myeloproliferative neoplasms (MPN) positive for both JAK2 V617F and exon 12 mutations. The association between the JAK2 haplotype GGCC_46/1 and mutations in other genes, such as thrombopoietin receptor (MPL) and calreticulin (CALR), or the association with triple negative MPN, is still controversial. This review provides an overview of the frequency and the role of the JAK2 haplotype GGCC_46/1 in the pathogenesis of different myeloid neoplasms and describes the hypothetical mechanisms at the basis of the association with JAK2 gene mutations. Moreover, possible clinical implications are discussed, as different papers reported contrasting data about the correlation between the JAK2 haplotype GGCC_46/1 and blood cell count, survival, or disease progression.

  8. Massively parallel haplotyping on microscopic beads for the high-throughput phase analysis of single molecules.

    PubMed

    Boulanger, Jérôme; Muresan, Leila; Tiemann-Boege, Irene

    2012-01-01

    In spite of the many advances in haplotyping methods, it is still very difficult to characterize rare haplotypes in tissues and different environmental samples or to accurately assess the haplotype diversity in large mixtures. This would require a haplotyping method capable of analyzing the phase of single molecules with an unprecedented throughput. Here we describe such a haplotyping method capable of analyzing in parallel hundreds of thousands single molecules in one experiment. In this method, multiple PCR reactions amplify different polymorphic regions of a single DNA molecule on a magnetic bead compartmentalized in an emulsion drop. The allelic states of the amplified polymorphisms are identified with fluorescently labeled probes that are then decoded from images taken of the arrayed beads by a microscope. This method can evaluate the phase of up to 3 polymorphisms separated by up to 5 kilobases in hundreds of thousands single molecules. We tested the sensitivity of the method by measuring the number of mutant haplotypes synthesized by four different commercially available enzymes: Phusion, Platinum Taq, Titanium Taq, and Phire. The digital nature of the method makes it highly sensitive to detecting haplotype ratios of less than 1:10,000. We also accurately quantified chimera formation during the exponential phase of PCR by different DNA polymerases.

  9. Characterization of swine leukocyte antigen alleles and haplotypes on a novel miniature pig line, Microminipig.

    PubMed

    Ando, A; Imaeda, N; Ohshima, S; Miyamoto, A; Kaneko, N; Takasu, M; Shiina, T; Kulski, J K; Inoko, H; Kitagawa, H

    2014-12-01

    Microminipigs are extremely small-sized, novel miniature pigs that were recently developed for medical research. The inbred Microminipigs with defined swine leukocyte antigen (SLA) haplotypes are expected to be useful for allo- and xenotransplantation studies and also for association analyses between SLA haplotypes and immunological traits. To establish SLA-defined Microminipig lines, we characterized the polymorphic SLA alleles for three class I (SLA-1, SLA-2 and SLA-3) and two class II (SLA-DRB1 and SLA-DQB1) genes of 14 parental Microminipigs using a high-resolution nucleotide sequence-based typing method. Eleven class I and II haplotypes, including three recombinant haplotypes, were found in the offspring of the parental Microminipigs. Two class I and class II haplotypes, Hp-31.0 (SLA-1*1502-SLA-3*070102-SLA-2*1601) and Hp-0.37 (SLA-DRB1*0701-SLA-DQB1*0502), are novel and have not so far been reported in other pig breeds. Crossover regions were defined by the analysis of 22 microsatellite markers within the SLA class III region of three recombinant haplotypes. The SLA allele and haplotype information of Microminipigs in this study will be useful to establish SLA homozygous lines including three recombinants for transplantation and immunological studies. © 2014 Stichting International Foundation for Animal Genetics.

  10. Genomic evolution in domestic cattle: ancestral haplotypes and healthy beef.

    PubMed

    Williamson, Joseph F; Steele, Edward J; Lester, Susan; Kalai, Oscar; Millman, John A; Wolrige, Lindsay; Bayard, Dominic; McLure, Craig; Dawkins, Roger L

    2011-05-01

    We have identified numerous Ancestral Haplotypes encoding a 14-Mb region of Bota C19. Three are frequent in Simmental, Angus and Wagyu and have been conserved since common progenitor populations. Others are more relevant to the differences between these 3 breeds including fat content and distribution in muscle. SREBF1 and Growth Hormone, which have been implicated in the production of healthy beef, are included within these haplotypes. However, we conclude that alleles at these 2 loci are less important than other sequences within the haplotypes. Identification of breeds and hybrids is improved by using haplotypes rather than individual alleles. Copyright © 2010 Elsevier Inc. All rights reserved.

  11. Y-STR haplotypes of Native American populations from the Brazilian Amazon region.

    PubMed

    Palha, Teresinha Jesus Brabo Ferreira; Rodrigues, Elzemar Martins Ribeiro; dos Santos, Sidney Emanuel Batista

    2010-10-01

    The allele and haplotype frequencies of nine Y-STRs (DYS19, DYS389 I, DYS389 II, DYS390, DYS391, DYS392, DYS393, DYS385 I/II) were determined in a sample of six native tribes from the Brazilian Amazon (Tiriyó, Awa-Guajá, Waiãpi, Urubu-Kaapor, Zoé and Parakanã). Forty-eight different haplotypes were identified, 28 of which unique. Five haplotypes are very frequent and were shared by over 10 individuals. The estimated haplotype diversity (0.9114) was very low compared to other geographic groups, including Africans, Europeans and Asians. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.
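
    The abstract reports a haplotype diversity value but not the estimator behind it. As a minimal sketch, assuming the commonly used Nei estimator h = n/(n-1) * (1 - sum of squared haplotype frequencies), the calculation could look like the following. The function name and the toy sample are hypothetical and are not taken from the study.

```python
from collections import Counter


def haplotype_diversity(haplotypes):
    """Nei's unbiased haplotype diversity for a list of haplotype labels.

    Assumption: this is the estimator used; the abstract does not say so.
    """
    n = len(haplotypes)
    if n < 2:
        raise ValueError("need at least two sampled haplotypes")
    counts = Counter(haplotypes)
    # Sum of squared relative frequencies of each distinct haplotype.
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1.0 - sum_sq)


# Toy sample (hypothetical): four chromosomes carrying three distinct haplotypes.
sample = ["H1", "H1", "H2", "H3"]
h = haplotype_diversity(sample)
```

    With every sampled haplotype distinct the estimator gives 1, and with all haplotypes identical it gives 0, which is why values close to 1 indicate many haplotypes at low frequency.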

  12. Cloud computing-based TagSNP selection algorithm for human genome data.

    PubMed

    Hung, Che-Lun; Chen, Wen-Pei; Hua, Guan-Jie; Zheng, Huiru; Tsai, Suh-Jen Jane; Lin, Yaw-Ling

    2015-01-05

    Single nucleotide polymorphisms (SNPs) play a fundamental role in human genetic variation and are used in medical diagnostics, phylogeny construction, and drug design. They provide the highest-resolution genetic fingerprint for identifying disease associations and human features. Haplotypes are regions of linked genetic variants that are closely spaced on the genome and tend to be inherited together. Genetics research has revealed SNPs within certain haplotype blocks that introduce few distinct common haplotypes into most of the population. Haplotype block structures are used in association-based methods to map disease genes. In this paper, we propose an efficient algorithm for identifying haplotype blocks in the genome. In chromosomal haplotype data retrieved from the HapMap project website, the proposed algorithm identified longer haplotype blocks than an existing algorithm. To enhance its performance, we extended the proposed algorithm into a parallel algorithm that copies data in parallel via the Hadoop MapReduce framework. The proposed MapReduce-paralleled combinatorial algorithm performed well on real-world data obtained from the HapMap dataset; the improvement in computational efficiency was proportional to the number of processors used.

  13. iXora: exact haplotype inferencing and trait association.

    PubMed

    Utro, Filippo; Haiminen, Niina; Livingstone, Donald; Cornejo, Omar E; Royaert, Stefan; Schnell, Raymond J; Motamayor, Juan Carlos; Kuhn, David N; Parida, Laxmi

    2013-06-06

    We address the task of extracting accurate haplotypes from genotype data of individuals of large F1 populations for mapping studies. While methods for inferring parental haplotype assignments on large F1 populations exist in theory, these approaches do not work in practice at high levels of accuracy. We have designed iXora (Identifying crossovers and recombining alleles), a robust method for extracting reliable haplotypes of a mapping population, as well as parental haplotypes, that runs in linear time. Each allele in the progeny is assigned not just to a parent, but more precisely to a haplotype inherited from the parent. iXora shows an improvement of at least 15% in accuracy over similar systems in literature. Furthermore, iXora provides an easy-to-use, comprehensive environment for association studies and hypothesis checking in populations of related individuals. iXora provides detailed resolution in parental inheritance, along with the capability of handling very large populations, which allows for accurate haplotype extraction and trait association. iXora is available for non-commercial use from http://researcher.ibm.com/project/3430.

  14. Cloud Computing-Based TagSNP Selection Algorithm for Human Genome Data

    PubMed Central

    Hung, Che-Lun; Chen, Wen-Pei; Hua, Guan-Jie; Zheng, Huiru; Tsai, Suh-Jen Jane; Lin, Yaw-Ling

    2015-01-01

    Single nucleotide polymorphisms (SNPs) play a fundamental role in human genetic variation and are used in medical diagnostics, phylogeny construction, and drug design. They provide the highest-resolution genetic fingerprint for identifying disease associations and human features. Haplotypes are regions of linked genetic variants that are closely spaced on the genome and tend to be inherited together. Genetics research has revealed SNPs within certain haplotype blocks that introduce few distinct common haplotypes into most of the population. Haplotype block structures are used in association-based methods to map disease genes. In this paper, we propose an efficient algorithm for identifying haplotype blocks in the genome. In chromosomal haplotype data retrieved from the HapMap project website, the proposed algorithm identified longer haplotype blocks than an existing algorithm. To enhance its performance, we extended the proposed algorithm into a parallel algorithm that copies data in parallel via the Hadoop MapReduce framework. The proposed MapReduce-paralleled combinatorial algorithm performed well on real-world data obtained from the HapMap dataset; the improvement in computational efficiency was proportional to the number of processors used. PMID:25569088

  15. VNTR alleles associated with the α-globin locus are haplotype and population related

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Martinson, J.J.; Clegg, J.B.; Boyce, A.J.

    1994-09-01

    The human α-globin complex contains several polymorphic restriction-enzyme sites (i.e., RFLPs) linked to form haplotypes and is flanked by two hypervariable VNTR loci, the 5′ hypervariable region (HVR) and the more highly polymorphic 3′HVR. Using a combination of RFLP analysis and PCR, the authors have characterized the 5′HVR and 3′HVR alleles associated with the α-globin haplotypes of 133 chromosomes, and they here show that specific α-globin haplotypes are each associated with discrete subsets of the alleles observed at these two VNTR loci. This statistically highly significant association is observed over a region spanning approximately 100 kb. With the exception of closely related haplotypes, different haplotypes do not share identically sized 3′HVR alleles. Earlier studies have shown that α-globin haplotype distributions differ between populations; the current findings also reveal extensive population substructure in the repertoire of α-globin VNTRs. If similar features are characteristic of other VNTR loci, this will have important implications for forensic and anthropological studies. 42 refs., 5 figs., 5 tabs.

  16. A novel haplotype of spinocerebellar ataxia type 6 contributes to the highest prevalence in western Japan.

    PubMed

    Terasawa, Hideo; Oda, Masaya; Morino, Hiroyuki; Miyachi, Takafumi; Izumi, Yuishin; Maruyama, Hirofumi; Matsumoto, Masayasu; Kawakami, Hideshi

    2004-03-25

    The highest prevalence rate of spinocerebellar ataxia type 6 (SCA6) in the worldwide population is in the Chugoku and Kansai areas of Western Japan, but the reason for this geographic characteristic is unclear. We investigated the predisposing haplotypes and their geographic distribution. Genotyping of five microsatellite markers and three single nucleotide polymorphisms linked to the CACNA1A gene in 150 Japanese SCA6 patients from 118 unrelated families revealed three major haplotypes, carrying a pool of one common haplotype core. A founder chromosome was thought to have historically diverged into at least three types. One of the major haplotypes newly identified showed a strong geographical cluster around the Seto Inland Sea in the Chugoku and Kansai areas of Western Japan, whereas the others were widely distributed throughout Japan. The distribution of predisposing haplotypes contributes to the geographical differences in prevalence of SCA6.

  17. Integrating sequence and array data to create an improved 1000 Genomes Project haplotype reference panel.

    PubMed

    Delaneau, Olivier; Marchini, Jonathan

    2014-06-13

    A major use of the 1000 Genomes Project (1000 GP) data is genotype imputation in genome-wide association studies (GWAS). Here we develop a method to estimate haplotypes from low-coverage sequencing data that can take advantage of single-nucleotide polymorphism (SNP) microarray genotypes on the same samples. First the SNP array data are phased to build a backbone (or 'scaffold') of haplotypes across each chromosome. We then phase the sequence data 'onto' this haplotype scaffold. This approach can take advantage of relatedness between sequenced and non-sequenced samples to improve accuracy. We use this method to create a new 1000 GP haplotype reference set for use by the human genetic community. Using a set of validation genotypes at SNP and bi-allelic indels we show that these haplotypes have lower genotype discordance and improved imputation performance into downstream GWAS samples, especially at low-frequency variants.

  18. Near East mtDNA haplotype variants in Roman cattle from Augusta Raurica, Switzerland, and in the Swiss Evolène breed.

    PubMed

    Schlumbaum, A; Turgay, M; Schibler, J

    2006-08-01

    Typical Near East mitochondrial haplotypes of the T2 lineage were found in one cattle metacarpus sample from the Roman period and in two present-day Evolène cattle in Switzerland. Sequences from eight additional Evolène and four Raetian Grey aligned to the European haplotype T3. Analysis of nucleotide diversity within the mitochondrial D-loop of both studied Swiss cattle breeds revealed high haplotype diversity and similar diversity to a European cattle reference group. Mitochondrial T3 haplotypes radiated star-like from two similarly frequent haplotypes, possibly indicating two different expansion routes. The breed structure of Evolène cattle can be explained either by an introduction of diverse female lineages from the domestication centre or by later admixture. The introduction of the Near East lineage to Switzerland must have happened during the Roman time or earlier.

  19. Estrogen Receptor 1 (ESR1) Gene Polymorphisms and Obesity Phenotypes in a Population of Young Adults.

    PubMed

    Correa-Rodríguez, María; Schmidt-RioValle, Jacqueline; González-Jiménez, Emilio; Rueda-Medina, Blanca

    2017-06-01

    Obesity is considered an increasingly serious health problem determined by multiple genetic and environmental factors. Estrogens have been found to play a major role in body weight and adiposity regulation through estrogen receptor 1 (ESR1). The aim of this study was to determine whether genotype and haplotype frequencies of ESR1 polymorphisms are associated with body composition measures in a population of 572 young adults. A lack of significant association between genotypes of ESR1 gene polymorphisms and obesity phenotypes was seen after adjustment for confounding factors. Linkage disequilibrium (LD) analysis identified a single LD block for the ESR1 gene including PvuII and XbaI single-nucleotide polymorphisms (SNPs) (pairwise r² = .66). None of the haplotypes identified revealed statistically significant associations with any of the obesity phenotypes. Our results suggest that polymorphisms of the ESR1 gene do not contribute significantly to the genetic risk for obesity phenotypes in a population of young Caucasian adults.
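
    The pairwise LD statistic quoted in the abstract can be illustrated with the standard definition: D = p(AB) - p(A)p(B), and r squared is D squared divided by p(A)(1-p(A))p(B)(1-p(B)). The sketch below assumes phased two-locus haplotypes are already available, which is a simplification; the study worked from genotypes, and how it estimated haplotype frequencies is not stated. The function name and toy data are hypothetical.

```python
def pairwise_r2(haplotypes, allele_a, allele_b):
    """Squared correlation (r^2) between two biallelic loci.

    `haplotypes` is a list of (locus1_allele, locus2_allele) pairs, one per
    phased chromosome. Assumption: phase is known.
    """
    n = len(haplotypes)
    p_a = sum(1 for h in haplotypes if h[0] == allele_a) / n
    p_b = sum(1 for h in haplotypes if h[1] == allele_b) / n
    p_ab = sum(1 for h in haplotypes if h[0] == allele_a and h[1] == allele_b) / n
    d = p_ab - p_a * p_b  # linkage disequilibrium coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("at least one locus is monomorphic in this sample")
    return d * d / denom


# Toy data (hypothetical): the two loci are perfectly correlated.
linked = [("A", "B"), ("A", "B"), ("a", "b"), ("a", "b")]
# Toy data (hypothetical): all four combinations are equally frequent.
unlinked = [("A", "B"), ("A", "b"), ("a", "B"), ("a", "b")]
r2_linked = pairwise_r2(linked, "A", "B")
r2_unlinked = pairwise_r2(unlinked, "A", "B")
```

    Values near 1 mean that one SNP almost fully predicts the other, and values near 0 mean that the two SNPs are close to independent.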

  20. [Fine mapping of complex disease susceptibility loci].

    PubMed

    Song, Qingfeng; Zhang, Hongxing; Ma, Yilong; Zhou, Gangqiao

    2014-01-01

    Genome-wide association studies (GWAS) using single nucleotide polymorphism (SNP) markers have identified more than 3800 susceptibility loci for more than 660 diseases or traits. However, the most significantly associated variants or causative variants in these loci and their biological functions have remained to be clarified. These causative variants can help to elucidate the pathogenesis and discover new biomarkers of complex diseases. One of the main goals in the post-GWAS era is to identify the causative variants and susceptibility genes, and clarify their functional aspects by fine mapping. For common variants, imputation or re-sequencing based strategies were implemented to increase the number of analyzed variants and help to identify the most significantly associated variants. In addition, functional element, expression quantitative trait locus (eQTL) and haplotype analyses were performed to identify functional common variants and susceptibility genes. For rare variants, fine mapping was carried out by re-sequencing, rare haplotype analysis, family-based analysis, burden test, etc. This review summarizes the strategies and problems for fine mapping.

  1. Minimum Description Length Block Finder, a Method to Identify Haplotype Blocks and to Compare the Strength of Block Boundaries

    PubMed Central

    Mannila, H.; Koivisto, M.; Perola, M.; Varilo, T.; Hennah, W.; Ekelund, J.; Lukk, M.; Peltonen, L.; Ukkonen, E.

    2003-01-01

    We describe a new probabilistic method for finding haplotype blocks that is based on the use of the minimum description length (MDL) principle. We give a rigorous definition of the quality of a segmentation of a genomic region into blocks and describe a dynamic programming algorithm for finding the optimal segmentation with respect to this measure. We also describe a method for finding the probability of a block boundary for each pair of adjacent markers: this gives a tool for evaluating the significance of each block boundary. We have applied the method to the published data of Daly and colleagues. The results expose some problems that exist in the current methods for the evaluation of the significance of predicted block boundaries. Our method, MDL block finder, can be used to compare block borders in different sample sets, and we demonstrate this by applying the MDL-based method to define the block structure in chromosomes from population isolates. PMID:12761696
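
    The dynamic programming idea mentioned in the abstract, finding the segmentation that minimizes a total description length, can be sketched generically. The cost function below is a toy stand-in and is NOT the MDL criterion of the paper: it charges one bit per marker for each distinct haplotype pattern in a block (a codebook) plus log2(k) bits per chromosome to say which of the k patterns it carries. All names are hypothetical.

```python
import math


def block_cost(haplotypes, start, end):
    """Toy description length (bits) of markers [start, end) treated as one block.

    Assumption: an illustrative cost only, not the paper's MDL measure.
    """
    distinct = {h[start:end] for h in haplotypes}
    k = len(distinct)
    codebook_bits = k * (end - start)  # spell out each distinct pattern
    index_bits = len(haplotypes) * math.log2(k) if k > 1 else 0.0
    return codebook_bits + index_bits


def segment(haplotypes):
    """Return (total_cost, blocks) for the cheapest segmentation into blocks."""
    m = len(haplotypes[0])
    best = [0.0] + [math.inf] * m  # best[j]: cheapest cost of the first j markers
    back = [0] * (m + 1)           # back[j]: start index of the last block
    for j in range(1, m + 1):
        for i in range(j):
            cost = best[i] + block_cost(haplotypes, i, j)
            if cost < best[j]:
                best[j], back[j] = cost, i
    # Walk the back-pointers to recover the block boundaries.
    blocks, j = [], m
    while j > 0:
        blocks.append((back[j], j))
        j = back[j]
    blocks.reverse()
    return best[m], blocks


# Toy data (hypothetical): markers 0-1 and markers 2-3 each vary together.
haps = ["0000", "0011", "1100", "1111"]
total, blocks = segment(haps)
```

    On this toy input, splitting between markers 1 and 2 is cheapest because each half has only two patterns, whereas a single four-marker block would need four. The recurrence runs in time quadratic in the number of markers, times the cost of evaluating one block.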

  2. Minimum description length block finder, a method to identify haplotype blocks and to compare the strength of block boundaries.

    PubMed

    Mannila, H; Koivisto, M; Perola, M; Varilo, T; Hennah, W; Ekelund, J; Lukk, M; Peltonen, L; Ukkonen, E

    2003-07-01

    We describe a new probabilistic method for finding haplotype blocks that is based on the use of the minimum description length (MDL) principle. We give a rigorous definition of the quality of a segmentation of a genomic region into blocks and describe a dynamic programming algorithm for finding the optimal segmentation with respect to this measure. We also describe a method for finding the probability of a block boundary for each pair of adjacent markers: this gives a tool for evaluating the significance of each block boundary. We have applied the method to the published data of Daly and colleagues. The results expose some problems that exist in the current methods for the evaluation of the significance of predicted block boundaries. Our method, MDL block finder, can be used to compare block borders in different sample sets, and we demonstrate this by applying the MDL-based method to define the block structure in chromosomes from population isolates.

  3. Founder haplotype analysis of Fanconi anemia in the Korean population finds common ancestral haplotypes for a FANCG variant.

    PubMed

    Park, Joonhong; Kim, Myungshin; Jang, Woori; Chae, Hyojin; Kim, Yonggoo; Chung, Nack-Gyun; Lee, Jae-Wook; Cho, Bin; Jeong, Dae-Chul; Park, In Yang; Park, Mi Sun

    2015-05-01

    A common ancestral haplotype is strongly suggested in the Korean and Japanese patients with Fanconi anemia (FA), because common mutations have been frequently found: c.2546delC and c.3720_3724delAAACA of FANCA; c.307+1G>C, c.1066C>T, and c.1589_1591delATA of FANCG. Our aim in this study was to investigate the origin of these common mutations of FANCA and FANCG. We genotyped 13 FA patients consisting of five FA-A patients and eight FA-G patients from the Korean FA population. Microsatellite markers used for haplotype analysis included four CA repeat markers which are closely linked with FANCA and eight CA repeat markers which are contiguous with FANCG. As a result, Korean FA-A patients carrying c.2546delC or c.3720_3724delAAACA did not share the same haplotypes. However, three unique haplotypes carrying c.307+1G>C, c.1066C > T, or c.1589_1591delATA, that consisted of eight polymorphic loci covering a flanking region were strongly associated with Korean FA-G, consistent with founder haplotypes reported previously in the Japanese FA-G population. Our finding confirmed the common ancestral haplotypes on the origins of the East Asian FA-G patients, which will improve our understanding of the molecular population genetics of FA-G. To the best of our knowledge, this is the first report on the association between disease-linked mutations and common ancestral haplotypes in the Korean FA population. © 2015 John Wiley & Sons Ltd/University College London.

  4. Mitochondrial DNA haplotype distribution patterns in Pinus ponderosa (Pinaceae): range-wide evolutionary history and implications for conservation.

    PubMed

    Potter, Kevin M; Hipkins, Valerie D; Mahalovich, Mary F; Means, Robert E

    2013-08-01

    Ponderosa pine (Pinus ponderosa Douglas ex P. Lawson & C. Lawson) exhibits complicated patterns of morphological and genetic variation across its range in western North America. This study aims to clarify P. ponderosa evolutionary history and phylogeography using a highly polymorphic mitochondrial DNA marker, with results offering insights into how geographical and climatological processes drove the modern evolutionary structure of tree species in the region. We amplified the mtDNA nad1 second intron minisatellite region for 3,100 trees representing 104 populations, and sequenced all length variants. We estimated population-level haplotypic diversity and determined diversity partitioning among varieties, races and populations. After aligning sequences of minisatellite repeat motifs, we evaluated evolutionary relationships among haplotypes. The geographical structuring of the 10 haplotypes corresponded with division between Pacific and Rocky Mountain varieties. Pacific haplotypes clustered with high bootstrap support, and appear to have descended from Rocky Mountain haplotypes. A greater proportion of diversity was partitioned between Rocky Mountain races than between Pacific races. Areas of highest haplotypic diversity were the southern Sierra Nevada mountain range in California, northwestern California, and southern Nevada. Pinus ponderosa haplotype distribution patterns suggest a complex phylogeographic history not revealed by other genetic and morphological data, or by the sparse paleoecological record. The results appear consistent with long-term divergence between the Pacific and Rocky Mountain varieties, along with more recent divergences not well-associated with race. Pleistocene refugia may have existed in areas of high haplotypic diversity, as well as the Great Basin, Southwestern United States/northern Mexico, and the High Plains.

  5. Haplotypes composed of minor frequency single nucleotide polymorphisms of the TNF gene protect from progression into sepsis: A study using the new sepsis classification.

    PubMed

    Retsas, Theodoros; Huse, Klaus; Lazaridis, Lazaros-Dimitrios; Karampela, Niki; Bauer, Michael; Platzer, Matthias; Kolonia, Virginia; Papageorgiou, Eirini; Giamarellos-Bourboulis, Evangelos J; Dimopoulos, George

    2018-02-01

    Several articles have provided conflicting results regarding the role of single nucleotide polymorphisms (SNPs) in the promoter region of the TNF gene in susceptibility to sepsis. Former articles have been based on previous definitions of sepsis. This study investigated the influence of TNF haplotypes on the development of sepsis using the new Sepsis-3 definitions. DNA was isolated from patients suffering from infection and systemic inflammatory response syndrome. Haplotyping was performed for six SNPs of TNF. The serum levels of tumour necrosis factor alpha (TNF-α) of these patients were measured using an enzyme immunosorbent assay. Patients were classified into infection and sepsis categories using the Sepsis-3 definitions. Associations between the TNF haplotypes and the clinical characteristics and serum TNF-α levels of the patients were examined. The most common TNF haplotype h1 was composed of major alleles of the studied SNPs. Carriage of haplotypes composed of minor frequency alleles was associated with a lower risk of developing sepsis (odds ratio 0.41, 95% confidence interval 0.19-0.88, p=0.022), but this did not affect the 28-day outcome. Serum TNF-α levels were significantly higher among patients homozygous for h1 haplotypes who developed sepsis compared to infection (p=0.032); a similar result was not observed for patients carrying other haplotypes. Haplotypes containing minor frequency SNP alleles of TNF protect against the development of sepsis without affecting the outcome. Copyright © 2017 The Author(s). Published by Elsevier Ltd. All rights reserved.

  6. Mitochondrial Haplotype Diversity in Zambian Lions: Bridging a Gap in the Biogeography of an Iconic Species

    PubMed Central

    Curry, Caitlin J.; White, Paula A.; Derr, James N.

    2015-01-01

    Analysis of DNA sequence diversity at the 12S to 16S mitochondrial genes of 165 African lions (Panthera leo) from five main areas in Zambia has uncovered haplotypes which link Southern Africa with East Africa. Phylogenetic analysis suggests Zambia may serve as a bridge connecting the lion populations in southern Africa to eastern Africa, supporting earlier hypotheses that eastern-southern Africa may represent the evolutionary cradle for the species. Overall gene diversity throughout the Zambian lion population was 0.7319 +/- 0.0174 with eight haplotypes found; three haplotypes previously described and the remaining five novel. The addition of these five novel haplotypes, so far only found within Zambia, nearly doubles the number of haplotypes previously reported for any given geographic location of wild lions. However, based on an AMOVA analysis of these haplotypes, there is little to no matrilineal gene flow (Fst = 0.47) when the eastern and western regions of Zambia are considered as two regional sub-populations. Crossover haplotypes (H9, H11, and Z1) appear in both populations as rare in one but common in the other. This pattern is a possible result of the lion mating system in which predominately males disperse, as all individuals with crossover haplotypes were male. The determination and characterization of lion sub-populations, such as done in this study for Zambia, represent a higher-resolution of knowledge regarding both the genetic health and connectivity of lion populations, which can serve to inform conservation and management of this iconic species. PMID:26674533

  7. The targetable A1 Huntington disease haplotype has distinct Amerindian and European origins in Latin America

    PubMed Central

    Kay, Chris; Tirado-Hurtado, Indira; Cornejo-Olivas, Mario; Collins, Jennifer A; Wright, Galen; Inca-Martinez, Miguel; Veliz-Otani, Diego; Ketelaar, Maria E; Slama, Ramy A; Ross, Colin J; Mazzetti, Pilar; Hayden, Michael R

    2017-01-01

    Huntington disease (HD) is a dominant neurodegenerative disorder caused by a CAG repeat expansion in the Huntingtin (HTT) gene. HD occurs worldwide, but the causative mutation is found on different HTT haplotypes in distinct ethnic groups. In Latin America, HD is thought to have European origins, but indigenous Amerindian ancestry has not been investigated. Here, we report dense HTT haplotypes in 62 mestizo Peruvian HD families, 17 HD families from across Latin America, and 42 controls of defined Peruvian Amerindian ethnicity to determine the origin of HD in populations of admixed Amerindian and European descent. HD in Peru occurs most frequently on the A1 HTT haplotype (73%), as in Europe, but on an unexpected indigenous variant also found in Amerindian controls. This Amerindian A1 HTT haplotype predominates over the European A1 variant among geographically disparate Latin American controls and in HD families from across Latin America, supporting an indigenous origin of the HD mutation in mestizo American populations. We also show that a proportion of HD mutations in Peru occur on a C1 HTT haplotype of putative Amerindian origin (14%). The majority of HD mutations in Latin America may therefore occur on haplotypes of Amerindian ancestry rather than on haplotypes resulting from European admixture. Despite the distinct ethnic ancestry of Amerindian and European A1 HTT, alleles on the parent A1 HTT haplotype allow for development of identical antisense molecules to selectively silence the HD mutation in the greatest proportion of patients in both Latin American and European populations. PMID:28000697

  8. Mitochondrial Haplotype Diversity in Zambian Lions: Bridging a Gap in the Biogeography of an Iconic Species.

    PubMed

    Curry, Caitlin J; White, Paula A; Derr, James N

    2015-01-01

    Analysis of DNA sequence diversity at the 12S to 16S mitochondrial genes of 165 African lions (Panthera leo) from five main areas in Zambia has uncovered haplotypes which link Southern Africa with East Africa. Phylogenetic analysis suggests Zambia may serve as a bridge connecting the lion populations in southern Africa to eastern Africa, supporting earlier hypotheses that eastern-southern Africa may represent the evolutionary cradle for the species. Overall gene diversity throughout the Zambian lion population was 0.7319 +/- 0.0174 with eight haplotypes found; three haplotypes previously described and the remaining five novel. The addition of these five novel haplotypes, so far only found within Zambia, nearly doubles the number of haplotypes previously reported for any given geographic location of wild lions. However, based on an AMOVA analysis of these haplotypes, there is little to no matrilineal gene flow (Fst = 0.47) when the eastern and western regions of Zambia are considered as two regional sub-populations. Crossover haplotypes (H9, H11, and Z1) appear in both populations as rare in one but common in the other. This pattern is a possible result of the lion mating system in which predominately males disperse, as all individuals with crossover haplotypes were male. The determination and characterization of lion sub-populations, such as done in this study for Zambia, represent a higher-resolution of knowledge regarding both the genetic health and connectivity of lion populations, which can serve to inform conservation and management of this iconic species.
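
    The gene diversity value quoted in this abstract is the kind of statistic commonly obtained from Nei's unbiased estimator, H = n/(n-1) * (1 - sum of p_i^2), where n is the number of sampled sequences and p_i is the frequency of haplotype i. The abstract does not say which estimator or software the authors used, so the sketch below is an illustration under that assumption; the function name and the haplotype labels in the usage are hypothetical.

```python
from collections import Counter


def haplotype_diversity(haplotypes):
    """Nei's unbiased haplotype (gene) diversity for a list of haplotype labels.

    Assumption: one label per sampled individual (mtDNA is haploid).
    """
    n = len(haplotypes)
    if n < 2:
        raise ValueError("at least two sampled haplotypes are required")
    counts = Counter(haplotypes)
    # Sum of squared haplotype frequencies (expected homozygosity).
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    # The n / (n - 1) factor corrects for small sample size.
    return n / (n - 1) * (1.0 - sum_sq)


if __name__ == "__main__":
    # Hypothetical sample: labels are illustrative, not data from the study.
    sample = ["H9", "H9", "H11", "Z1", "H9", "H11"]
    print(round(haplotype_diversity(sample), 4))
```

    A sample in which every individual carries the same haplotype gives 0, and a sample in which every individual carries a different haplotype gives 1.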

  9. [Simulation and Design of Infant Incubator Assembly Line].

    PubMed

    Ke, Huqi; Hu, Xiaoyong; Ge, Xia; Hu, Yanhai; Chen, Zaihong

    2015-11-01

    Based on the current assembly situation of infant incubators at company A, basic industrial engineering methods such as time study were used to analyze actual product assembly, and an assembly line was designed. The assembly line was modeled and simulated with the Flexsim software. Problems in the assembly line were identified by comparing the simulation results with actual data, and the line was then optimized to obtain a high-efficiency assembly line.

  10. Reproductive status of overwintering potato psyllid: absence of photoperiod effects

    USDA-ARS's Scientific Manuscript database

    We examined the effects of photoperiod on reproductive diapause of three haplotypes of potato psyllid, Bactericera cockerelli (Hemiptera: Triozidae), collected from three geographic locations: south Texas (Central haplotype), California (Western haplotype), and Washington State (Northwestern haploty...

  11. MtDNA diversity among four Portuguese autochthonous dog breeds: a fine-scale characterisation

    PubMed Central

    van Asch, Barbara; Pereira, Luísa; Pereira, Filipe; Santa-Rita, Pedro; Lima, Manuela; Amorim, António

    2005-01-01

    Background: The picture of dog mtDNA diversity, as obtained from geographically wide samplings but from a small number of individuals per region or breed, has revealed weak geographic correlation and a high degree of haplotype sharing between very distant breeds. We aimed at a more detailed picture through extensive sampling (n = 143) of four Portuguese autochthonous breeds – Castro Laboreiro Dog, Serra da Estrela Mountain Dog, Portuguese Sheepdog and Azores Cattle Dog – and comparatively reanalysing published worldwide data. Results: Fifteen haplotypes belonging to four major haplogroups were found in these breeds, of which five are newly reported. The Castro Laboreiro Dog presented a 95% frequency of a new A haplotype, while all other breeds contained a diverse pool of existing lineages. The Serra da Estrela Mountain Dog, the most heterogeneous of the four Portuguese breeds, shared haplotypes with the other mainland breeds, while Azores Cattle Dog shared no haplotypes with the other Portuguese breeds. A review of mtDNA haplotypes in dogs across the world revealed that: (a) breeds tend to display haplotypes belonging to different haplogroups; (b) haplogroup A is present in all breeds, and even uncommon haplogroups are highly dispersed among breeds and continental areas; (c) haplotype sharing between breeds of the same region is lower than between breeds of different regions and (d) genetic distances between breeds do not correlate with geography. Conclusion: MtDNA haplotype sharing occurred between Serra da Estrela Mountain dogs (with putative origin in the centre of Portugal) and two breeds in the north and south of the country – with the Castro Laboreiro Dog (which behaves, at the mtDNA level, as a sub-sample of the Serra da Estrela Mountain Dog) and the southern Portuguese Sheepdog. In contrast, the Azores Cattle Dog did not share any haplotypes with the other Portuguese breeds, but with dogs sampled in Northern Europe. This suggested that the Azores Cattle Dog descended maternally from Northern European dogs rather than Portuguese mainland dogs. A review of published mtDNA haplotypes identified thirteen non-Portuguese breeds with sufficient data for comparison. Comparisons between these thirteen breeds, and the four Portuguese breeds, demonstrated widespread haplotype sharing, with the greatest diversity among Asian dogs, in accordance with the central role of Asia in canine domestication. PMID:15972107

  12. Genetic differences in the two main groups of the Japanese population based on autosomal SNPs and haplotypes.

    PubMed

    Yamaguchi-Kabata, Yumi; Tsunoda, Tatsuhiko; Kumasaka, Natsuhiko; Takahashi, Atsushi; Hosono, Naoya; Kubo, Michiaki; Nakamura, Yusuke; Kamatani, Naoyuki

    2012-05-01

    Although the Japanese population has a rather low genetic diversity, we recently confirmed the presence of two main clusters (the Hondo and Ryukyu clusters) through principal component analysis of genome-wide single-nucleotide polymorphism (SNP) genotypes. Understanding the genetic differences between the two main clusters requires further genome-wide analyses based on a dense SNP set and comparison of haplotype frequencies. In the present study, we determined haplotypes for the Hondo cluster of the Japanese population by detecting SNP homozygotes with 388,591 autosomal SNPs from 18,379 individuals and estimated the haplotype frequencies. Haplotypes for the Ryukyu cluster were inferred by a statistical approach using the genotype data from 504 individuals. We then compared the haplotype frequencies between the Hondo and Ryukyu clusters. In most genomic regions, the haplotype frequencies in the Hondo and Ryukyu clusters were very similar. However, in addition to the human leukocyte antigen region on chromosome 6, other genomic regions (chromosomes 3, 4, 5, 7, 10 and 12) showed dissimilarities in haplotype frequency. These regions were enriched for genes involved in the immune system, cell-cell adhesion and the intracellular signaling cascade. These differentiated genomic regions between the Hondo and Ryukyu clusters are of interest because they (1) should be examined carefully in association studies and (2) likely contain genes responsible for morphological or physiological differences between the two groups.
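
    The idea of reading haplotypes directly from SNP homozygotes can be sketched as follows: an individual who is homozygous at every SNP in a region carries two identical copies of one haplotype, so no statistical phasing is needed. This is only a minimal illustration. The 0/1/2 genotype coding (count of the alternate allele), the function names, and the toy genotypes are assumptions, and the abstract does not describe how region boundaries were chosen or how counts were turned into frequency estimates.

```python
from collections import Counter


def haplotype_from_homozygote(genotype):
    """Return the haplotype of a fully homozygous genotype, else None.

    genotype: sequence of alternate-allele counts (0, 1 or 2), one per SNP.
    """
    if any(g not in (0, 1, 2) for g in genotype):
        raise ValueError("genotypes must be coded 0, 1 or 2")
    if any(g == 1 for g in genotype):
        return None  # at least one heterozygous site: phase is not read off directly
    # 0 -> reference allele (0), 2 -> alternate allele (1) on both chromosomes.
    return tuple(g // 2 for g in genotype)


def count_homozygote_haplotypes(genotypes):
    """Count haplotype copies observed in fully homozygous individuals."""
    counts = Counter()
    for genotype in genotypes:
        hap = haplotype_from_homozygote(genotype)
        if hap is not None:
            counts[hap] += 2  # both chromosomes carry the same haplotype
    return counts


if __name__ == "__main__":
    # Hypothetical three-SNP region, four individuals.
    region = [(0, 2, 0), (0, 1, 0), (2, 2, 2), (0, 2, 0)]
    print(count_homozygote_haplotypes(region))
```

    Note that these are raw counts among homozygous individuals only; they are not by themselves unbiased population frequencies.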

  13. Two families from New England with Usher syndrome type IC with distinct haplotypes.

    PubMed

    DeAngelis, M M; McGee, T L; Keats, B J; Slim, R; Berson, E L; Dryja, T P

    2001-03-01

    To search for patients with Usher syndrome type IC among those with Usher syndrome type I who reside in New England. Genotype analysis of microsatellite markers closely linked to the USH1C locus was done using the polymerase chain reaction. We compared the haplotype of our patients who were homozygous in the USH1C region with the haplotypes found in previously reported USH1C Acadian families who reside in southwestern Louisiana and from a single family residing in Lebanon. Of 46 unrelated cases of Usher syndrome type I residing in New England, two were homozygous at genetic markers in the USH1C region. Of these, one carried the Acadian USH1C haplotype and had Acadian ancestors (that is, from Nova Scotia) who did not participate in the 1755 migration of Acadians to Louisiana. The second family had a haplotype that proved to be the same as that of a family with USH1C residing in Lebanon. Each of the two families had haplotypes distinct from the other. This is the first report that some patients residing in New England have Usher syndrome type IC. Patients with Usher syndrome type IC can have the Acadian haplotype or the Lebanese haplotype compatible with the idea that at least two independently arising pathogenic mutations have occurred in the yet-to-be identified USH1C gene.

  14. Hapl-o-Mat: open-source software for HLA haplotype frequency estimation from ambiguous and heterogeneous data.

    PubMed

    Schäfer, Christian; Schmidt, Alexander H; Sauter, Jürgen

    2017-05-30

    Knowledge of HLA haplotypes is helpful in many settings, such as disease association studies, population genetics, or hematopoietic stem cell transplantation. Regarding the recruitment of unrelated hematopoietic stem cell donors, HLA haplotype frequencies of specific populations are used to optimize both donor searches for individual patients and strategic donor registry planning. However, the estimation of haplotype frequencies from HLA genotyping data is challenged by the large amount of genotype data, the complex HLA nomenclature, and the heterogeneous and ambiguous nature of typing records. To meet these challenges, we have developed the open-source software Hapl-o-Mat. It estimates haplotype frequencies from population data including an arbitrary number of loci using an expectation-maximization algorithm. Its key features are the processing of different HLA typing resolutions within a given population sample and the handling of ambiguities recorded via multiple allele codes or genotype list strings. Implemented in C++, Hapl-o-Mat facilitates efficient haplotype frequency estimation from large amounts of genotype data. We demonstrate its accuracy and performance on the basis of artificial and real genotype data. Hapl-o-Mat is a versatile and efficient software for HLA haplotype frequency estimation. Its capability of processing various forms of HLA genotype data allows for a straightforward haplotype frequency estimation from typing records usually found in stem cell donor registries.
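
    To make the expectation-maximization idea concrete, here is a minimal sketch of EM haplotype frequency estimation from unphased genotypes. It is not Hapl-o-Mat's code or data model: it assumes biallelic loci coded 0/1/2 (count of the alternate allele), whereas HLA loci are highly multi-allelic and typing records carry ambiguity codes. All names and the toy data are hypothetical.

```python
from itertools import product


def compatible_pairs(genotype):
    """All ordered haplotype pairs consistent with an unphased genotype."""
    per_locus = {0: [(0, 0)], 1: [(0, 1), (1, 0)], 2: [(1, 1)]}
    pairs = []
    for combo in product(*(per_locus[g] for g in genotype)):
        h1 = tuple(a for a, _ in combo)
        h2 = tuple(b for _, b in combo)
        pairs.append((h1, h2))
    return pairs


def em_haplotype_frequencies(genotypes, n_iter=100):
    """Estimate haplotype frequencies by EM, assuming random mating."""
    n_loci = len(genotypes[0])
    haps = list(product((0, 1), repeat=n_loci))
    freq = {h: 1.0 / len(haps) for h in haps}  # uniform starting point
    for _ in range(n_iter):
        counts = {h: 0.0 for h in haps}
        # E-step: split each individual over its compatible haplotype pairs,
        # weighted by the current frequency estimates.
        for g in genotypes:
            pairs = compatible_pairs(g)
            weights = [freq[h1] * freq[h2] for h1, h2 in pairs]
            total = sum(weights)
            for (h1, h2), w in zip(pairs, weights):
                counts[h1] += w / total
                counts[h2] += w / total
        # M-step: expected counts divided by the number of chromosomes.
        freq = {h: c / (2 * len(genotypes)) for h, c in counts.items()}
    return freq


if __name__ == "__main__":
    # Toy data: homozygotes support haplotypes 00 and 11; the double
    # heterozygotes (1, 1) are ambiguous and get resolved by EM.
    data = [(0, 0)] * 4 + [(2, 2)] * 4 + [(1, 1)] * 2
    for hap, f in sorted(em_haplotype_frequencies(data).items()):
        print(hap, round(f, 4))
```

    In the toy data the ambiguous double heterozygotes are assigned almost entirely to the 00/11 pair, because the unambiguous homozygotes already support those two haplotypes.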

  15. Factor IX gene haplotypes in Amerindians.

    PubMed

    Franco, R F; Araújo, A G; Zago, M A; Guerreiro, J F; Figueiredo, M S

    1997-02-01

    We have determined the haplotypes of the factor IX gene for 95 Indians from 5 Brazilian Amazon tribes: Wayampí, Wayana-Apalaí, Kayapó, Arára, and Yanomámi. Eight polymorphisms linked to the factor IX gene were investigated: MseI (at 5', nt -698), BamHI (at 5', nt -561), DdeI (intron 1), BamHI (intron 2), XmnI (intron 3), TaqI (intron 4), MspI (intron 4), and HhaI (at 3', approximately 8 kb). The results of the haplotype distribution and the allele frequencies for each of the factor IX gene polymorphisms in Amerindians were similar to the results reported for Asian populations but differed from results for other ethnic groups. Only five haplotypes were identified within the entire Amerindian study population, and the haplotype distribution was significantly different among the five tribes, with one (Arára) to four (Wayampí) haplotypes being found per tribe. These findings indicate a significant heterogeneity among the Indian tribes and contrast with the homogeneous distribution of the beta-globin gene cluster haplotypes but agree with our recent findings on the distribution of alpha-globin gene cluster haplotypes and the allele frequencies for six VNTRs in the same Amerindian tribes. Our data represent the first study of factor IX-associated polymorphisms in Amerindian populations and emphasize the applicability of these genetic markers for population and human evolution studies.

  16. Spatial and temporal distribution of the neutral polymorphisms in the last ZFX intron: analysis of the haplotype structure and genealogy.

    PubMed Central

    Jaruzelska, J; Zietkiewicz, E; Batzer, M; Cole, D E; Moisan, J P; Scozzari, R; Tavaré, S; Labuda, D

    1999-01-01

    With 10 segregating sites (simple nucleotide polymorphisms) in the last intron (1089 bp) of the ZFX gene we have observed 11 haplotypes in 336 chromosomes representing a worldwide array of 15 human populations. Two haplotypes representing 77% of all chromosomes were distributed almost evenly among four continents. Five of the remaining haplotypes were detected in Africa and 4 others were restricted to Eurasia and the Americas. Using the information about the ancestral state of the segregating positions (inferred from human-great ape comparisons), we applied coalescent analysis to estimate the age of the polymorphisms and the resulting haplotypes. The oldest haplotype, with the ancestral alleles at all the sites, was observed at low frequency only in two groups of African origin. Its estimated age of 740 to 1100 kyr corresponded to the time to the most recent common ancestor. The two most frequent worldwide distributed haplotypes were estimated at 550 to 840 and 260 to 400 kyr, respectively, while the age of the continentally restricted polymorphisms was 120 to 180 kyr and smaller. Comparison of spatial and temporal distribution of the ZFX haplotypes suggests that modern humans diverged from the common ancestral stock in the Middle Paleolithic era. Subsequent range expansion prevented substantial gene flow among continents, separating African groups from populations that colonized Eurasia and the New World. PMID:10388827

  17. Spatial and temporal distribution of the neutral polymorphisms in the last ZFX intron: analysis of the haplotype structure and genealogy.

    PubMed

    Jaruzelska, J; Zietkiewicz, E; Batzer, M; Cole, D E; Moisan, J P; Scozzari, R; Tavaré, S; Labuda, D

    1999-07-01

    With 10 segregating sites (simple nucleotide polymorphisms) in the last intron (1089 bp) of the ZFX gene we have observed 11 haplotypes in 336 chromosomes representing a worldwide array of 15 human populations. Two haplotypes representing 77% of all chromosomes were distributed almost evenly among four continents. Five of the remaining haplotypes were detected in Africa and 4 others were restricted to Eurasia and the Americas. Using the information about the ancestral state of the segregating positions (inferred from human-great ape comparisons), we applied coalescent analysis to estimate the age of the polymorphisms and the resulting haplotypes. The oldest haplotype, with the ancestral alleles at all the sites, was observed at low frequency only in two groups of African origin. Its estimated age of 740 to 1100 kyr corresponded to the time to the most recent common ancestor. The two most frequent worldwide distributed haplotypes were estimated at 550 to 840 and 260 to 400 kyr, respectively, while the age of the continentally restricted polymorphisms was 120 to 180 kyr and smaller. Comparison of spatial and temporal distribution of the ZFX haplotypes suggests that modern humans diverged from the common ancestral stock in the Middle Paleolithic era. Subsequent range expansion prevented substantial gene flow among continents, separating African groups from populations that colonized Eurasia and the New World.

  18. Mineralocorticoid receptor haplotype moderates the effects of oral contraceptives and menstrual cycle on emotional information processing.

    PubMed

    Hamstra, Danielle A; de Kloet, E Ronald; Tollenaar, Marieke; Verkuil, Bart; Manai, Meriem; Putman, Peter; Van der Does, Willem

    2016-10-01

    The processing of emotional information is affected by menstrual cycle phase and by the use of oral contraceptives (OCs). The stress hormone cortisol is known to affect emotional information processing via the limbic mineralocorticoid receptor (MR). We investigated in an exploratory study whether the MR-genotype moderates the effect of both OC-use and menstrual cycle phase on emotional cognition. Healthy premenopausal volunteers (n=93) of West-European descent completed a battery of emotional cognition tests. Forty-nine participants were OC users and 44 naturally cycling, 21 of whom were tested in the early follicular (EF) and 23 in the mid-luteal (ML) phase of the menstrual cycle. In MR-haplotype 1/3 carriers, ML women gambled more than EF women when their risk to lose was relatively small. In MR-haplotype 2, ML women gambled more than EF women, regardless of their odds of winning. OC-users with MR-haplotype 1/3 recognised fewer facial expressions than ML women with MR-haplotype 1/3. MR-haplotype 1/3 carriers may be more sensitive to the influence of their female hormonal status. MR-haplotype 2 carriers showed more risky decision-making. As this may reflect optimistic expectations, this finding may support previous observations in female carriers of MR-haplotype 2 in a naturalistic cohort study. © The Author(s) 2016.

  19. Evaluation of haplotype diversity of Achatina fulica (Lissachatina) [Bowdich] from Indian sub-continent by means of 16S rDNA sequence and its phylogenetic relationships with other global populations.

    PubMed

    Ayyagari, Vijaya Sai; Sreerama, Krupanidhi

    2017-08-01

    Achatina fulica (Lissachatina fulica) is one of the most invasive species found across the globe causing a significant damage to crops, vegetables, and horticultural plants. This terrestrial snail is native to east Africa and spread to different parts of the world by introductions. India, a hot spot for biodiversity of several endemic gastropods, has witnessed an outburst of this snail population in several parts of the country posing a serious threat to crop loss and also to human health. With an objective to evaluate the genetic diversity of this snail, we have sampled this snail from different parts of India and analyzed its haplotype diversity by means of 16S rDNA sequence information. Apart from this, we have studied the phylogenetic relationships of the isolates sequenced in the present study in relation with other global populations by Bayesian and Maximum-likelihood approaches. Of the isolates sequenced, haplotype 'C' is the predominant one. A new haplotype 'S' from the state of Odisha was observed. The isolates sequenced in the present study clustered with its conspecifics from the Indian sub-continent. Haplotype network analyses were also carried out for studying the evolution of different haplotypes. It was observed that haplotype 'S' was associated with a Mauritius haplotype 'H', indicating the possibility of multiple introductions of A. fulica to India.

  20. A TNF region haplotype offers protection from typhoid fever in Vietnamese patients

    PubMed Central

    2009-01-01

    The genomic region surrounding the TNF locus on human chromosome 6 has previously been associated with typhoid fever in Vietnam. We used a haplotypic approach to understand this association further. Eighty single nucleotide polymorphisms (SNPs) spanning a 150 kb region were genotyped in 95 Vietnamese individuals (typhoid case/mother/father trios). A subset of data from 33 SNPs with a minor allele frequency of >4.3% was used to construct haplotypes. Fifteen SNPs, which tagged the 42 constructed haplotypes were selected. The haplotype tagging SNPs (T1-T15) were genotyped in 380 confirmed typhoid cases and 380 Vietnamese ethnically matched controls. Allelic frequencies of seven SNPs (T1, T2, T3, T5, T6, T7, T8) were significantly different between typhoid cases and controls. Logistic regression results support the hypothesis that there is just one signal associated with disease at this locus. Haplotype-based analysis of the tag SNPs provided positive evidence of association with typhoid (posterior probability 0.821). The analysis highlighted a low-risk cluster of haplotypes that each carry the minor allele of T1 or T7, but not both, and otherwise carry the combination of alleles *12122*1111 at T1-T11, further supporting the one associated signal hypothesis. Finally, individuals that carry the typhoid fever protective haplotype *12122*1111 also produce a relatively low TNF-α response to LPS. PMID:17503085
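
    The minor-allele-frequency filter mentioned in this abstract (SNPs with MAF above 4.3% retained for haplotype construction) can be sketched as below. This is a simplified illustration, not the authors' pipeline: it assumes genotypes coded as alternate-allele counts 0/1/2 with None for missing calls, treats all genotyped individuals alike rather than handling the trio structure, and uses hypothetical SNP names and data.

```python
def minor_allele_frequency(genotypes):
    """MAF of one biallelic SNP from 0/1/2 genotype calls (None = missing)."""
    called = [g for g in genotypes if g is not None]
    if not called:
        raise ValueError("no called genotypes for this SNP")
    alt_freq = sum(called) / (2 * len(called))
    # The minor allele is whichever of the two alleles is rarer.
    return min(alt_freq, 1.0 - alt_freq)


def filter_snps_by_maf(genotype_table, threshold=0.043):
    """Return names of SNPs whose MAF is strictly above the threshold."""
    return [
        snp
        for snp, genotypes in genotype_table.items()
        if minor_allele_frequency(genotypes) > threshold
    ]


if __name__ == "__main__":
    # Hypothetical genotype table: SNP name -> calls for four individuals.
    table = {
        "snpA": [0, 0, 0, 1],
        "snpB": [0, 0, 0, 0],      # monomorphic, filtered out
        "snpC": [2, 2, 1, None],   # one missing call
    }
    print(filter_snps_by_maf(table))
```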

  1. Mineralocorticoid receptor haplotypes sex-dependently moderate depression susceptibility following childhood maltreatment.

    PubMed

    Vinkers, Christiaan H; Joëls, Marian; Milaneschi, Yuri; Gerritsen, Lotte; Kahn, René S; Penninx, Brenda W J H; Boks, Marco P M

    2015-04-01

    The MR is an important regulator of the hypothalamic-pituitary-adrenal (HPA) axis and a prime target for corticosteroids. There is increasing evidence from both clinical and preclinical studies that the MR has different effects on behavior and mood in males and females. To investigate the hypothesis that the MR sex-dependently influences the relation between childhood maltreatment and depression, we investigated three common and functional MR haplotypes (GA, CA, and CG haplotype, based on rs5522 and rs2070951) in a population-based cohort (N = 665) and an independent clinical cohort from the Netherlands Study of Depression and Anxiety (NESDA) (N = 1639). The CA haplotype sex-dependently moderated the relation between childhood maltreatment and depressive symptoms both in the population-based sample (sex × maltreatment × haplotype: β = -4.07, P = 0.029) and in the clinical sample (sex × maltreatment × haplotype, β = -2.40, P = 0.011). Specifically, female individuals in the population-based sample were protected (β = -4.58, P = 2.0 × 10⁻⁵), whereas males in the clinical sample were at increased risk (β = 2.54, P = 0.0022). In line with these results, female GA haplotype carriers displayed increased vulnerability in the population-based sample (β = 4.58, P = 7.5 × 10⁻⁵) whereas male CG-carriers showed increased resilience in the clinical sample (β = -2.71, P = 0.016). Consistently, we found a decreased lifetime MDD risk for male GA haplotype carriers following childhood maltreatment but an increased risk for male CA haplotype carriers in the clinical sample. In both samples, sex-dependent effects were observed for GA-GA diplotype carriers. In summary, sex plays an important role in determining whether functional genetic variation in MR is beneficial or detrimental, with an apparent female advantage for the CA haplotype but male advantage for the GA and CG haplotype. These sex-dependent effects of MR on depression susceptibility following childhood maltreatment are relevant in light of the increased prevalence of mood disorders in women and point to a sex-specific role of MR in the etiology of depression following childhood maltreatment. Copyright © 2015 Elsevier Ltd. All rights reserved.

  2. HLA DPA1, DPB1 alleles and haplotypes contribute to the risk associated with type 1 diabetes: analysis of the type 1 diabetes genetics consortium families.

    PubMed

    Varney, Michael D; Valdes, Ana Maria; Carlson, Joyce A; Noble, Janelle A; Tait, Brian D; Bonella, Persia; Lavant, Eva; Fear, Anna Lisa; Louey, Anthony; Moonsamy, Priscilla; Mychaleckyj, Josyf C; Erlich, Henry

    2010-08-01

    To determine the relative risk associated with DPA1 and DPB1 alleles and haplotypes in type 1 diabetes. The frequency of DPA1 and DPB1 alleles and haplotypes in type 1 diabetic patients was compared to the family based control frequency in 1,771 families directly and conditional on HLA (B)-DRB1-DQA1-DQB1 linkage disequilibrium. A relative predispositional analysis (RPA) was performed in the presence or absence of the primary HLA DR-DQ associations and the contribution of DP haplotype to individual DR-DQ haplotype risks examined. Eight DPA1 and thirty-eight DPB1 alleles forming seventy-four DPA1-DPB1 haplotypes were observed; nineteen DPB1 alleles were associated with multiple DPA1 alleles. Following both analyses, type 1 diabetes susceptibility was significantly associated with DPB1*0301 (DPA1*0103-DPB1*0301) and protection with DPB1*0402 (DPA1*0103-DPB1*0402) and DPA1*0103-DPB1*0101 but not DPA1*0201-DPB1*0101. In addition, DPB1*0202 (DPA1*0103-DPB1*0202) and DPB1*0201 (DPA1*0103-DPB1*0201) were significantly associated with susceptibility in the presence of the high risk and protective DR-DQ haplotypes. Three associations (DPB1*0301, *0402, and *0202) remained statistically significant when only the extended HLA-A1-B8-DR3 haplotype was considered, suggesting that DPB1 alone may delineate the risk associated with this otherwise conserved haplotype. HLA DP allelic and haplotypic diversity contributes significantly to the risk for type 1 diabetes; DPB1*0301 (DPA1*0103-DPB1*0301) is associated with susceptibility and DPB1*0402 (DPA1*0103-DPB1*0402) and DPA1*0103-DPB1*0101 with protection. Additional evidence is presented for the susceptibility association of DPB1*0202 (DPA1*0103-DPB1*0202) and for a contributory role of individual amino acids and DPA1 or a gene in linkage disequilibrium in DR3-DPB1*0101 positive haplotypes.

  3. Biological impact of α genes, β haplotypes, and G6PD activity in sickle cell anemia at baseline and with hydroxyurea

    PubMed Central

    Arnaud, Cécile; Kamdem, Annie; Hau, Isabelle; Lelong, Françoise; Epaud, Ralph; Pondarré, Corinne; Pissard, Serge

    2018-01-01

    Sickle cell anemia (SCA), albeit monogenic, has heterogeneous phenotypic expression, mainly related to the level of hemoglobin F (HbF). No large cohort studies have ever compared biological parameters in patients with major β-globin haplotypes; ie, Senegal (SEN), Benin (BEN), and Bantu/Central African Republic (CAR). The aim of this study was to evaluate the biological impact of α genes, β haplotypes, and glucose-6-phosphate dehydrogenase (G6PD) activity at baseline and with hydroxyurea (HU). Homozygous HbS patients from the Créteil pediatric cohort with available α-gene and β-haplotype data were included (n = 580; 301 females and 279 males) in this retrospective study. Homozygous β-haplotype patients represented 74% of cases (37.4% CAR/CAR, 24.3% BEN/BEN, and 12.1% SEN/SEN). HU was given to 168 cohort SCA children. Hematological parameters were recorded when HbF was maximal, and changes (ΔHU-T0) were calculated. At baseline, CAR-haplotype and α-gene numbers were independently and negatively correlated with Hb and positively correlated with lactate dehydrogenase. HbF was negatively correlated with CAR-haplotype numbers and positively with BEN- and SEN-haplotype numbers. The BCL11A/rs1427407 “T” allele, which is favorable for HbF expression, was positively correlated with BEN- and negatively correlated with CAR-haplotype numbers. With HU treatment, Δ and HbF values were positively correlated with the BEN-haplotype number. BEN/BEN patients had higher HbF and Hb levels than CAR/CAR and SEN/SEN patients. In conclusion, we show that BEN/BEN patients have the best response on HU and suggest that this could be related to the higher prevalence of the favorable BCL11A/rs1427407/T/allele for HbF expression in these patients. PMID:29555644

  4. A comprehensive literature review of haplotyping software and methods for use with unrelated individuals.

    PubMed

    Salem, Rany M; Wessel, Jennifer; Schork, Nicholas J

    2005-03-01

    Interest in the assignment and frequency analysis of haplotypes in samples of unrelated individuals has increased immeasurably as a result of the emphasis placed on haplotype analyses by, for example, the International HapMap Project and related initiatives. Although there are many available computer programs for haplotype analysis applicable to samples of unrelated individuals, many of these programs have limitations and/or very specific uses. In this paper, the key features of available haplotype analysis software for use with unrelated individuals, as well as pooled DNA samples from unrelated individuals, are summarised. Programs for haplotype analysis were identified through keyword searches on PUBMED and various internet search engines, a review of citations from retrieved papers and personal communications, up to June 2004. Priority was given to functioning computer programs, rather than theoretical models and methods. The available software was considered in light of a number of factors: the algorithm(s) used, algorithm accuracy, assumptions, the accommodation of genotyping error, implementation of hypothesis testing, handling of missing data, software characteristics and web-based implementations. Review papers comparing specific methods and programs are also summarised. Forty-six haplotyping programs were identified and reviewed. The programs were divided into two groups: those designed for individual genotype data (a total of 43 programs) and those designed for use with pooled DNA samples (a total of three programs). The accuracy of programs using various criteria are assessed and the programs are categorised and discussed in light of: algorithm and method, accuracy, assumptions, genotyping error, hypothesis testing, missing data, software characteristics and web implementation. 
Many available programs have limitations (eg some cannot accommodate missing data) and/or are designed with specific tasks in mind (eg estimating haplotype frequencies rather than assigning most likely haplotypes to individuals). It is concluded that the selection of an appropriate haplotyping program for analysis purposes should be guided by what is known about the accuracy of estimation, as well as by the limitations and assumptions built into a program.

  5. Phylogeography and connectivity of molluscan parasites: Perkinsus spp. in Panama and beyond.

    PubMed

    Pagenkopp Lohan, Katrina M; Hill-Spanik, Kristina M; Torchin, Mark E; Fleischer, Robert C; Carnegie, Ryan B; Reece, Kimberly S; Ruiz, Gregory M

    2018-02-01

    Panama is a major hub for commercial shipping between two oceans, making it an ideal location to examine parasite biogeography, potential invasions, and the spread of infectious agents. Our goals were to (i) characterise the diversity and genetic connectivity of Perkinsus spp. haplotypes across the Panamanian Isthmus and (ii) combine these data with sequences from around the world to evaluate the current phylogeography and genetic connectivity of these widespread molluscan parasites. We collected 752 bivalves from 12 locations along the coast of Panama including locations around the Bocas del Toro archipelago and the Caribbean and Pacific entrances to the Panama Canal, from December 2012 to February 2013. We used molecular genetic methods to screen for Perkinsus spp. and obtained internal transcribed spacer region (ITS) ribosomal DNA (rDNA) sequences for all positive samples. Our sequence data were used to evaluate regional haplotype diversity and distribution across both coasts of Panama, and were then combined with publicly available sequences to create global haplotype networks. We found 26 ITS haplotypes from four Perkinsus spp. (1-12 haplotypes per species) in Panama. Perkinsus beihaiensis haplotypes had the highest genetic diversity, were the most regionally widespread, and were associated with the greatest number of hosts. On a global scale, network analyses demonstrated that some haplotypes found in Panama were cosmopolitan (Perkinsus chesapeaki, Perkinsus marinus), while others were more geographically restricted (Perkinsus olseni, P. beihaiensis), indicating different levels of genetic connectivity and dispersal. We found some Perkinsus haplotypes were shared across the Isthmus of Panama and several regions around the world, including across ocean basins. We also found that haplotype diversity is currently underestimated and directly related to the number of sequences. 
Nevertheless, our results demonstrate long-range dispersal and global connectivity for many haplotypes, suggesting that dispersal through shipping probably contributes to these biogeographical patterns. Published by Elsevier Ltd.

  6. Variation in the prion protein sequence in Dutch goat breeds.

    PubMed

    Windig, J J; Hoving, R A H; Priem, J; Bossers, A; van Keulen, L J M; Langeveld, J P M

    2016-10-01

    Scrapie is a neurodegenerative disease occurring in goats and sheep. Several haplotypes of the prion protein increase resistance to scrapie infection and may be used in selective breeding to help eradicate scrapie. In this study, frequencies of the allelic variants of the PrP gene are determined for six goat breeds in the Netherlands. Overall frequencies in Dutch goats were determined from 768 brain tissue samples in 2005, 766 in 2008 and 300 in 2012, derived from random sampling for the national scrapie surveillance without knowledge of the breed. Breed-specific frequencies were determined in the winter of 2013/2014 by sampling 300 breeding animals from the main breeders of the different breeds. Detailed analysis of the scrapie-resistant K222 haplotype was carried out in 2014 for 220 Dutch Toggenburger goats and in 2015 for 942 goats from the Saanen-derived White Goat breed. Nine haplotypes were identified in the Dutch breeds. Frequencies for non-wild type haplotypes were generally low. Exceptions were the K222 haplotype in the Dutch Toggenburger (29%) and the S146 haplotype in the Nubian and Boer breeds (7% and 31%, respectively). The frequency of the K222 haplotype in the Toggenburger was higher than for any other breed reported in the literature, while for the White Goat breed it was, at 3.1%, similar to frequencies of other Saanen or Saanen-derived breeds. Further evidence was found for the existence of two M142 haplotypes, M142/S240 and M142/P240. Breeds vary in haplotype frequencies, but frequencies of resistant genotypes are generally low; consequently, selective breeding for scrapie resistance can only be slow but will benefit from animals identified in this study. The unexpectedly high frequency of the K222 haplotype in the Dutch Toggenburger underlines the need for conservation of rare breeds in order to conserve genetic diversity that is rare or absent in other breeds. © 2016 Blackwell Verlag GmbH.

  7. Contribution of HLA-A/B/C/DRB1/DQB1 common haplotypes to donor search outcome in unrelated hematopoietic stem cell transplantation.

    PubMed

    Pédron, Béatrice; Guérin-El Khourouj, Valérie; Dalle, Jean-Hugues; Ouachée-Chardin, Marie; Yakouben, Karima; Corroyez, France; Auvrignon, Anne; Petit, Arnaud; Landman-Parker, Judith; Leverger, Guy; Baruchel, André; Sterkers, Ghislaine

    2011-11-01

    In unrelated hematopoietic stem cell transplantation (HSCT), the prediction of donor search outcome at the time of search initiation is of great value for physicians in delineating the strategy of patient care. The probability of finding an unrelated donor is high for patients who carry at least 1 of the 10 most common HLA haplotypes in Caucasians. As only 10% to 20% of patients meet this criterion, here we aimed at finding additional common haplotypes to improve the prediction of a successful search. Broad HLA-A/B/DRB1 haplotypes that were observed with frequencies ≥0.19% in patient families of European origin and that split into ≤2 predominant 4-digit HLA-A/B/C/DRB1/DQB1 haplotypes were considered as common. Carriage of at least 1 of those in 168 patients of various geographic areas with no family donor was compared with the chance of finding ≥9/10 HLA-matched unrelated donors. Fifty common 4-digit haplotypes were identified. A higher (P < 5 × 10⁻⁶) chance of finding a suitable donor was found for 55 of 170 (32%) recipients that carried at least 1 of these common haplotypes. Up to now, estimates classified patients into ≥3 groups of probability with ≥1 intermediate group of poor utility for the clinicians. Considering carriage of these common haplotypes together with the frequencies of alleles and of B/C and DRB1/DQB1 associations, which are carried by patient HLA haplotypes, we could classify the patients into 2 groups of probability with a 98% and 26% chance of finding a donor, respectively. Prediction of search outcome could be improved by including the 50 most common HLA haplotypes in the current approaches. Copyright © 2011 American Society for Blood and Marrow Transplantation. Published by Elsevier Inc. All rights reserved.

  8. Association between platelet P2Y12 haplotype and risk of cardiovascular events in chronic coronary disease.

    PubMed

    Schettert, Isolmar T; Pereira, Alexandre C; Lopes, Neuza H; Hueb, Whady A; Krieger, Jose E

    2006-01-01

    A positive association was recently described between P2Y12 platelet receptor H1 and H2 haplotypes and peripheral artery disease. We tested the described P2Y12 receptor haplotypes in a group of patients with coronary artery disease. The P2Y12 platelet receptor H1 and H2 haplotypes were tested in a group of 540 patients enrolled in the Medical, Angioplasty, or Surgery Study II (MASS II), a randomized trial comparing treatments for patients with coronary artery disease (CAD) and preserved left ventricular function. After a 3-year follow-up period, the incidence of the composite end point of cardiac death, myocardial infarction, and refractory angina requiring revascularization was determined in the H1/H1, H1/H2 and H2/H2 haplotype groups. We used Student's t-test and the chi-square test to analyze the differences among groups and the Kaplan-Meier method to calculate survival curves. Risk was assessed with the use of a Cox proportional-hazards model. The frequencies of haplotypes among studied patients were 410 (75.9%) H1/H1, 119 (22.0%) H1/H2 and 11 (2.1%) H2/H2. The baseline clinical characteristics, mean clinical follow-up time and received treatment of each genotype group were similar. We did not find any association between haplotype group and the incidence of any of the studied cardiovascular end points. This is the first report studying the association of P2Y12 platelet receptor H1 and H2 haplotypes and cardiovascular events. Our findings do not provide evidence for a strong association between H1/H1 and H1/H2 haplotypes and an increased risk of cardiovascular events in a population with CAD. Future work should address the role of the H2/H2 haplotype as a genetic marker for cardiovascular events.

  9. Association of KIR genotypes and haplotypes with susceptibility to chronic hepatitis B virus infection in Chinese Han population.

    PubMed

    Lu, Zhiming; Zhang, Bingchang; Chen, Shijun; Gai, Zhongtao; Feng, Zhaolei; Liu, Xiangdong; Liu, Yiqing; Wen, Xin; Li, Li; Jiao, Yulian; Ma, Chunyan; Shao, Song; Cui, Xiangfa; Chen, Guojian; Li, Jianfeng; Zhao, Yueran

    2008-12-01

    Killer immunoglobulin-like receptor (KIR) genes can regulate the activation of NK and T cells upon interaction with HLA class I molecules. Hepatitis B virus (HBV) infection has been regarded as a multifactorial disease. Previous studies revealed that KIRs were involved in HCV and HIV infection or clearance. The aim of this study was to explore whether inherited KIR genotypes and haplotypes are candidates for susceptibility to persistent HBV infection or for HBV clearance. Sequence-specific primer polymerase chain reaction (SSP-PCR) was employed to identify the KIR genes and pseudogenes in 150 chronic hepatitis B (CHB) patients, 251 spontaneously recovered (SR) controls, and 412 healthy controls. The frequencies of genotypes G, M, and FZ1 were increased in CHB patients compared with healthy control subjects. The frequency of genotype AH was higher in SR controls than in both CHB patients and healthy controls. The carriage frequencies of genotypes G and AH were higher, while the frequencies of AF and AJ were lower, in SR controls than in healthy control subjects. The frequency of the A haplotype was lower, whereas the frequency of the B haplotype was higher, in CHB patients and SR controls than in healthy controls. In healthy controls, the frequency of haplotype 4 was lower than in CHB patients and SR controls, and the frequency of haplotype 5 was higher in SR controls than in the other two groups. Based on these findings, it seems that genotypes M and FZ1 are HBV-susceptible genotypes; AH, on the other hand, may be a protective genotype that facilitates the clearance of HBV. It appears that haplotype 4 is an HBV-susceptible haplotype, whereas haplotype 5 may be a protective haplotype that facilitates the clearance of HBV.

  10. Impacts of TNF-LTA SNPs/Haplotypes and Lifestyle Factors on Oral Carcinoma in an Indian Population.

    PubMed

    Bandil, Kapil; Singhal, Pallavi; Sharma, Upma; Hussain, Showket; Basu, Surojit; Parashari, Aditya; Singh, Veena; Sehgal, Ashok; Shivam, Animesh; Ahuja, Puneet; Bharadwaj, Mausumi; Banerjee, Basu Dev; Mehrotra, Ravi

    2016-10-01

    To investigate a potential association between single-nucleotide polymorphisms (SNPs) and haplotypes at the TNFA-LTA locus and the development of oral cancer in an Indian population. In this study, 150 oral precancer/cancer samples (50 precancer and 100 cancer), along with an equal number of control samples, were genotyped. Six SNPs at the TNF-LTA locus (i.e., -238G/A, -308G/A, -857C/T, -863C/A, -1031T/C, and +252A/G) were analyzed by use of a polymerase chain reaction-restriction fragment length polymorphism method; the assay was validated by sequencing 10% of samples. The allelic frequencies of TNFA and LTA SNPs were found to be significantly associated with the risk of oral cancer and precancerous lesions in comparison with controls (P < 0.0003). Further haplotypic analysis showed that two haplotypes (ATCTGG and ACACGG) served as risk haplotypes for oral cancer. These haplotypes were also found to be significantly and positively associated with lifestyle habits (tobacco chewing P = 0.04, odds ratio [OR] 3.4) and socioeconomic status (P = 0.01, OR 3.4). We noticed an increased percentage of risk haplotypes correlating with the aggressiveness of oral cancer. The percentages of risk haplotypes were found to be threefold higher in precancer and fourfold higher in advanced stages of oral cancer in comparison with controls. Five SNPs at the TNF-LTA locus (i.e., -308G>A, -857C>T, -863C>A, -1031T>C, and +252A>G) were found to be associated with the development of oral cancer. Two haplotypes (ATCTGG and ACACGG) emerged as major risk haplotypes for oral carcinoma progression and were also found to be associated with lifestyle factors and clinical aggressiveness. These findings make the TNF-LTA locus a suitable candidate for a future biomarker, which may be used either for early detection or for helping to improve treatment efficacy and effectiveness.

  11. A Comparison of Phasing Algorithms for Trios and Unrelated Individuals

    PubMed Central

    Marchini, Jonathan; Cutler, David; Patterson, Nick; Stephens, Matthew; Eskin, Eleazar; Halperin, Eran; Lin, Shin; Qin, Zhaohui S.; Munro, Heather M.; Abecasis, Gonçalo R.; Donnelly, Peter

    2006-01-01

    Knowledge of haplotype phase is valuable for many analysis methods in the study of disease, population, and evolutionary genetics. Considerable research effort has been devoted to the development of statistical and computational methods that infer haplotype phase from genotype data. Although a substantial number of such methods have been developed, they have focused principally on inference from unrelated individuals, and comparisons between methods have been rather limited. Here, we describe the extension of five leading algorithms for phase inference for handling father-mother-child trios. We performed a comprehensive assessment of the methods applied to both trios and to unrelated individuals, with a focus on genomic-scale problems, using both simulated data and data from the HapMap project. The most accurate algorithm was PHASE (v2.1). For this method, the percentages of genotypes whose phase was incorrectly inferred were 0.12%, 0.05%, and 0.16% for trios from simulated data, HapMap Centre d'Etude du Polymorphisme Humain (CEPH) trios, and HapMap Yoruban trios, respectively, and 5.2% and 5.9% for unrelated individuals in simulated data and the HapMap CEPH data, respectively. The other methods considered in this work had comparable but slightly worse error rates. The error rates for trios are similar to the levels of genotyping error and missing data expected. We thus conclude that all the methods considered will provide highly accurate estimates of haplotypes when applied to trio data sets. Running times differ substantially between methods. Although it is one of the slowest methods, PHASE (v2.1) was used to infer haplotypes for the 1 million–SNP HapMap data set. Finally, we evaluated methods of estimating the value of r² between a pair of SNPs and concluded that all methods estimated r² well when the estimated value was ≥0.8. PMID:16465620
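
    As an illustration of the pairwise linkage disequilibrium statistic mentioned in this abstract, the following Python sketch computes it from already-phased haplotypes using the standard definition: D = p_AB - p_A * p_B, and the statistic is D squared divided by p_A(1 - p_A)p_B(1 - p_B). The function name r_squared and the 0/1 allele coding are assumptions made for this example; the code is not taken from PHASE or any of the other programs compared in the paper.

```python
from collections import Counter


def r_squared(haplotypes):
    """Return r^2 between two biallelic SNPs.

    `haplotypes` is a list of (allele_at_snp1, allele_at_snp2) tuples,
    coded 0/1, one tuple per phased chromosome (hypothetical input format).
    """
    n = len(haplotypes)
    counts = Counter(haplotypes)
    p_a = sum(a for a, _ in haplotypes) / n   # frequency of allele 1 at SNP 1
    p_b = sum(b for _, b in haplotypes) / n   # frequency of allele 1 at SNP 2
    p_ab = counts[(1, 1)] / n                 # frequency of the 1-1 haplotype
    d = p_ab - p_a * p_b                      # disequilibrium coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("r^2 is undefined when a SNP is monomorphic")
    return d * d / denom


# Two SNPs in complete LD, then two SNPs in linkage equilibrium.
complete_ld = [(0, 0)] * 5 + [(1, 1)] * 5
equilibrium = [(0, 0), (0, 1), (1, 0), (1, 1)]
print(r_squared(complete_ld), r_squared(equilibrium))
```

    When haplotypes must first be inferred from unphased genotypes, as in the comparison above, the accuracy of this statistic depends on the accuracy of the phasing step.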

  12. Molecular tracing of confiscated pangolin scales for conservation and illegal trade monitoring in Southeast Asia

    USGS Publications Warehouse

    Zhang, Huarong; Miller, Mark P.; Yang, Feng; Chan, Hon Ki; Gaubert, Philippe; Ades, Gary; Fischer, Gunter A

    2015-01-01

    Despite being protected by both international and national regulations, pangolins are threatened by illegal trade. Here we report mitochondrial DNA identification and haplotype richness estimation, using 239 pangolin scale samples from two confiscations in Hong Kong. We found a total of 13 genetically distinct cytochrome c oxidase I (COI) haplotypes in two confiscations (13 and ten haplotypes respectively, with ten shared haplotypes between confiscations). These haplotypes clustered in two distinct clades with one clade representing the Sunda pangolin (Manis javanica). The other clade did not match any known Asian pangolin sequences, and likely represented a cryptic pangolin lineage in Asia. By fitting sample coverage and rarefaction/regression models to our sample data, we predicted that the total numbers of COI haplotypes in the two confiscations were 14.86 and 11.06 respectively, suggesting that our sampling caught the majority of haplotypes and that we had adequately characterized each confiscation. We detected substantial sequence divergence among the seized scales, likely evidencing that the Sunda pangolins were harvested over wide geographical areas across Southeast Asia. Our study illustrates the value of applying DNA forensics for illegal wildlife trade monitoring.

  13. Whole-loop mitochondrial DNA D-loop sequence variability in Egyptian Arabian equine matrilines

    PubMed Central

    Hudson, William

    2017-01-01

    Background Egyptian Arabian horses have been maintained in a state of genetic isolation for over a hundred years. There is only limited genetic proof that the studbook records of female lines of Egyptian Arabian pedigrees are reliable. This study characterized the mitochondrial DNA (mtDNA) signatures of 126 horses representing 14 matrilines in the Egyptian Agricultural Organization (EAO) horse-breeding program. Findings Analysis of the whole D-loop sequence yielded additional information compared to hypervariable region-1 (HVR1) analysis alone, with 42 polymorphic sites representing ten haplotypes compared to 16 polymorphic sites representing nine haplotypes, respectively. Most EAO haplotypes belonged to ancient haplogroups, suggesting origin from a wide geographical area over many thousands of years, although one haplotype was novel. Conclusions Historical families share haplotypes and some individuals from different strains belonged to the same haplogroup: the classical EAO strain designation is not equivalent to modern monophyletic matrilineal groups. Phylogenetic inference showed that the foundation mares of the historical haplotypes were highly likely to have the same haplotypes as the animals studied (p > 0.998 in all cases), confirming the reliability of EAO studbook records and providing the opportunity for breeders to confirm the ancestry of their horses. PMID:28859174

  14. The Geographic Distribution of Human Y Chromosome Variation

    PubMed Central

    Hammer, M. F.; Spurdle, A. B.; Karafet, T.; Bonner, M. R.; Wood, E. T.; Novelletto, A.; Malaspina, P.; Mitchell, R. J.; Horai, S.; Jenkins, T.; Zegura, S. L.

    1997-01-01

    We examined variation on the nonrecombining portion of the human Y chromosome to investigate human evolution during the last 200,000 years. The Y-specific polymorphic sites included the Y Alu insertional polymorphism or "YAP" element (DYS287), the poly(A) tail associated with the YAP element, three point mutations in close association with the YAP insertion site, an A-G polymorphic transition (DYS271), and a tetranucleotide microsatellite (DYS19). Global variation at the five bi-allelic sites (DYS271, DYS287, and the three point mutations) gave rise to five "YAP haplotypes" in 60 populations from Africa, Europe, Asia, Australasia, and the New World (n = 1500). Combining the multi-allelic variation at the microsatellite loci (poly(A) tail and DYS19) with the YAP haplotypes resulted in a total of 27 "combination haplotypes". All five of the YAP haplotypes and 21 of the 27 combination haplotypes were found in African populations, which had greater haplotype diversity than did populations from other geographical locations. Only subsets of the five YAP haplotypes were found outside of Africa. Patterns of observed variation were compatible with a variety of hypotheses, including multiple human migrations and range expansions. PMID:9055088

  15. MHC Class II haplotypes of Colombian Amerindian tribes

    PubMed Central

    Yunis, Juan J.; Yunis, Edmond J.; Yunis, Emilio

    2013-01-01

    We analyzed 1041 individuals belonging to 17 Amerindian tribes of Colombia for MHC class II haplotypes (HLA-DRB1, DQA1, DQB1). The tribes were Chimila, Bari and Tunebo (Chibcha linguistic family), Embera, Waunana (Choco linguistic family), Puinave and Nukak (Maku-Puinave linguistic families), Cubeo, Guanano, Tucano, Desano and Piratapuyo (Tukano linguistic family), Guahibo and Guayabero (Guayabero linguistic family), Curripaco and Piapoco (Arawak linguistic family) and Yucpa (Karib linguistic family). Approximately 90% of the MHC class II haplotypes found among these tribes are haplotypes frequently encountered in other Amerindian tribes. Nonetheless, striking differences were observed between Chibcha and non-Chibcha speaking tribes. The DRB1*04:04, DRB1*04:11, DRB1*09:01 carrying haplotypes were frequently found among non-Chibcha speaking tribes, while the DRB1*04:07 haplotype showed significant frequencies among Chibcha speaking tribes, and only marginal frequencies among non-Chibcha speaking tribes. Our results suggest that the differences in MHC class II haplotype frequency found between Chibcha and non-Chibcha speaking tribes could be due to genetic differentiation in Mesoamerica of the ancestral Amerindian population into Chibcha and non-Chibcha speaking populations before they entered South America. PMID:23885196

  16. HIV-1 Vertical Transmission in Zimbabwe in 622 Mother and Infant Pairs: Rethinking the Contribution of Mannose Binding Lectin Deficiency in Africa.

    PubMed

    Zinyama-Gutsire, Rutendo B L; Christiansen, Michael; Hedley, Paula L; Rusakaniko, Simbarashe; Hagen, Christian; Stray-Pedersen, Babill; Buzdugan, Raluca; Cowan, Frances; Chasela, Charles

    2016-07-01

    Vertical transmission of human immunodeficiency virus (HIV) remains a major global health problem. We assessed the association of mannose binding lectin (MBL) deficiency and vertical transmission of HIV. Novel diagnostics would be a major breakthrough in this regard. MBL is a liver-derived protein and a key component of the innate immune system. MBL levels may be classified as normal, intermediate, or deficient in the plasma and can use MBL2 haplotypes as a proxy. These haplotypes comprise polymorphisms in the MBL2 gene and promoter region and are known to result in varying levels of MBL deficiency. MBL deficiency can be defined as presence of A/O and O/O genotypes in the mothers and their children. MBL deficiency leads to defective opsonization activities of the innate immune system and increased susceptibility to several infections, including HIV-1. We determined the prevalence of MBL deficiency, using MBL2 haplotypes among 622 HIV-positive Zimbabwean mothers and their children aged 9-18 months old, in relation to the HIV-1 vertical transmission risk. The median age of the mothers was 30 (26-34, interquartile range [IQR]) years, and the babies' median age was 13 (11-15, IQR) months old at the time of enrollment. From the sample of 622 mothers who were HIV-1 infected, 574 babies were HIV negative and 48 were HIV-1-positive babies, giving a transmission rate of 7.7%. MBL2 normal structural allele A and variants B (codon 5 A>G), C (codon 57 A>G), and promoter region SNPs -550(H/L) and -221(X/Y) were detected. Prevalence of haplotype-predicted MBL deficiency was 34% among the mothers and 32% among the children. We found no association between maternal MBL2 deficiency and HIV-1 transmission to their children. We found no difference in the distribution of HIV-1 infected and uninfected children between the MBL2 genotypes of the mothers and those of the children. 
Taken together, the present study in a large sample of mother-infant pairs in Zimbabwe adds to the emerging literature and the hypothesis that MBL2 variation as predicted by haplotypes does not influence the vertical transmission risk for HIV. Research from other populations from the African continent is called for to test this hypothesis further.

  17. Optimal use of human and machine resources for Space Station assembly operations

    NASA Technical Reports Server (NTRS)

    Parrish, Joseph C.

    1988-01-01

    This paper investigates the issues involved in determining the best mix of human and machine resources for assembly of the Space Station. It presents the current Station assembly sequence, along with descriptions of the available assembly resources. A number of methodologies for optimizing the human/machine tradeoff problem have been developed, but the Space Station assembly offers some unique issues that have not yet been addressed. These include a strong constraint on available EVA time for early flights and a phased deployment of assembly resources over time. A methodology for incorporating the previously developed decision methods to the special case of the Space Station is presented. This methodology emphasizes an application of multiple qualitative and quantitative techniques, including simulation and decision analysis, for producing an objective, robust solution to the tradeoff problem.

  18. The distribution of HLA haplotypes in the ethnic groups that make up the Brazilian Bone Marrow Volunteer Donor Registry (REDOME).

    PubMed

    Halagan, Michael; Oliveira, Danielli Cristina; Maiers, Martin; Fabreti-Oliveira, Raquel A; Moraes, Maria Elisa Hue; Visentainer, Jeane Eliete Laguila; Pereira, Noemi Farah; Romero, Matilde; Cardoso, Juliana Fernandes; Porto, Luís Cristóvão

    2018-04-26

    The Registries of Bone Marrow Donors around the world include more than 30 million volunteer donors from 57 different countries, and were responsible for over 17,000 hematopoietic stem cell transplants in 2016. The Brazilian Bone Marrow Volunteer Donor Registry (REDOME) was established in 1993 and is the third largest registry in the world with more than 4.3 million donors. We characterized HLA allele and haplotype frequencies from REDOME, comparing them with the donor self-reported race group classification. Five-locus haplotype frequencies (A~C~B~DRB1~DQB1) were estimated for each of the six race groups, resolving phase and allelic ambiguity using the expectation-maximization (EM) algorithm. The top 100 haplotypes in the race groups were separated into eight clusters of haplotypes, based on haplotype similarity, using CLUTO. We present HLA allele and haplotype frequency data from six race groups from 2,938,259 individuals from REDOME. The most frequent haplotype was the same for all groups: A*01:01g~C*07:01g~B*08:01g~DRB1*03:01g~DQB1*02:01g. Some frequent haplotypes, such as A*02:01g~C*16:01g~B*44:03~DRB1*07:01g~DQB1*02:01g, were not found in the Preta (Sub-Saharan African descent) group. A cluster including Branca (European) and Parda or non-informed (admixed) could be distinguished from both Preta (Sub-Saharan) and Indígena (Amerindian) groups, and from the Amarela (Asian) ones, which clustered with their original population. These results have implications for cross-population matching and can help in donor searches and population-based recruitment strategies.
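
    The EM step named in this abstract can be illustrated with a deliberately reduced Python sketch. It assumes two biallelic loci and genotypes coded as counts of allele 1 at each locus, whereas the study used five multi-allelic HLA loci with allelic ambiguity. The names HAPS, compatible_pairs and em_haplotype_freqs are hypothetical and do not come from the registry's software.

```python
# The four possible haplotypes over two biallelic loci (alleles coded 0/1).
HAPS = [(0, 0), (0, 1), (1, 0), (1, 1)]


def compatible_pairs(genotype):
    """List unordered haplotype pairs whose allele counts match the genotype."""
    pairs = []
    for i, h1 in enumerate(HAPS):
        for h2 in HAPS[i:]:
            if all(h1[k] + h2[k] == genotype[k] for k in range(2)):
                pairs.append((h1, h2))
    return pairs


def em_haplotype_freqs(genotypes, n_iter=100):
    """Estimate haplotype frequencies from unphased genotypes by EM."""
    freqs = {h: 0.25 for h in HAPS}  # start from uniform frequencies
    for _ in range(n_iter):
        expected = {h: 0.0 for h in HAPS}
        for g in genotypes:
            pairs = compatible_pairs(g)
            # E-step: probability of each phase resolution under the current
            # frequencies (factor 2 for the two orderings of a heterozygous pair).
            weights = [(1 if h1 == h2 else 2) * freqs[h1] * freqs[h2]
                       for h1, h2 in pairs]
            total = sum(weights)
            for (h1, h2), w in zip(pairs, weights):
                expected[h1] += w / total
                expected[h2] += w / total
        # M-step: expected haplotype counts divided by the number of chromosomes.
        n_chrom = 2 * len(genotypes)
        freqs = {h: expected[h] / n_chrom for h in HAPS}
    return freqs


# Four 00/00 homozygotes, four 11/11 homozygotes, two double heterozygotes.
sample = [(0, 0)] * 4 + [(2, 2)] * 4 + [(1, 1)] * 2
print(em_haplotype_freqs(sample))
```

    Only the double heterozygotes are phase-ambiguous in this toy sample; the unambiguous homozygotes pull the estimate toward resolving them as 0-0 plus 1-1, which is how EM borrows information across individuals.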

  19. Molecular analysis and association with clinical and laboratory manifestations in children with sickle cell anemia

    PubMed Central

    Camilo-Araújo, Roberta Faria; Amancio, Olga Maria Silverio; Figueiredo, Maria Stella; Cabanãs-Pedro, Ana Carolina; Braga, Josefina Aparecida Pellegrini

    2014-01-01

    Objectives To analyze the frequency of βS-globin haplotypes and alpha-thalassemia, and their influence on clinical manifestations and the hematological profile of children with sickle cell anemia. Method The frequency of βS-globin haplotypes and alpha-thalassemia and any association with clinical and laboratory manifestations were determined in 117 sickle cell anemia children aged 3–71 months. The confirmation of hemoglobin SS and determination of the haplotypes were achieved by polymerase chain reaction-restriction fragment length polymorphism, and alpha-thalassemia genotyping was by single-tube multiplex polymerase chain reaction. Results The genotype distribution of haplotypes was 43 (36.7%) Central African Republic/Benin, 41 (35.0%) Central African Republic/Central African Republic, 20 (17.0%) Rare/atypical, and 13 (11.1%) Benin/Benin. The frequency of the α3.7 deletion was 1.71% as homozygous (−α3.7/−α3.7) and 11.9% as heterozygous (−α3.7/αα). The only significant association with respect to haplotypes was related to the mean corpuscular volume. The presence of alpha-thalassemia was significantly associated with decreases in mean corpuscular volume, mean corpuscular hemoglobin and reticulocyte count and with an increase in the red blood cell count. There were no significant associations of βS-globin haplotypes and alpha-thalassemia with clinical manifestations. Conclusions In the study population, the frequency of alpha-thalassemia was similar to published data in Brazil with the Central African Republic haplotype being the most common, followed by the Benin haplotype. βS-globin haplotypes and interaction between alpha-thalassemia and sickle cell anemia did not influence fetal hemoglobin concentrations or the number of clinical manifestations. PMID:25305165

  20. The "Sardinian" HLA-A30,B18,DR3,DQw2 haplotype constantly lacks the 21-OHA and C4B genes. Is it an ancestral haplotype without duplication?

    PubMed

    Contu, L; Carcassi, C; Dausset, J

    1989-01-01

    The C4 and 21-OH loci of the class III HLA have been studied by specific DNA probes and the restriction enzyme Taq 1 in 24 unrelated Sardinian individuals selected from completely HLA-typed families. All 24 individuals had the HLA extended haplotype A30,Cw5,B18, BfF1,DR3,DRw52,DQw2, named "Sardinian" in the present paper because of its frequency of 15% in the Sardinian population. Eighteen of these were homozygous for the entire haplotype, and six were heterozygous at the A locus and blank (or homozygous) at all the other loci. In all completely homozygous cells and in four heterozygous cells at the A locus, the restriction fragments of the 21-OHA (3.2 kb) and C4B (5.8 kb or 5.4 kb) genes were absent, and the fragments of the C4A (7.0 kb) and 21-OHB (3.7 kb) genes were present. It is suggested that the "Sardinian" haplotype is an ancestral haplotype without duplication of the C4 and 21-OH genes, practically always identical in its structure, also in unrelated individuals. The diversity of this haplotype in the class III region (about 30 kb less) may be at least partially responsible for its misalignment with most haplotypes, which have duplicated C4 and 21-OH genes, and therefore also for its decreased probability to recombine. This can help explain its high stability and frequency in the Sardinian population. The same conclusion can be suggested for the Caucasian extended haplotype A1,B8,DR3 that always seems to lack the C4A and 21-OHA genes.

  1. Imputation of microsatellite alleles from dense SNP genotypes for parentage verification across multiple Bos taurus and Bos indicus breeds

    PubMed Central

    McClure, Matthew C.; Sonstegard, Tad S.; Wiggans, George R.; Van Eenennaam, Alison L.; Weber, Kristina L.; Penedo, Cecilia T.; Berry, Donagh P.; Flynn, John; Garcia, Jose F.; Carmo, Adriana S.; Regitano, Luciana C. A.; Albuquerque, Milla; Silva, Marcos V. G. B.; Machado, Marco A.; Coffey, Mike; Moore, Kirsty; Boscher, Marie-Yvonne; Genestout, Lucie; Mazza, Raffaele; Taylor, Jeremy F.; Schnabel, Robert D.; Simpson, Barry; Marques, Elisa; McEwan, John C.; Cromie, Andrew; Coutinho, Luiz L.; Kuehn, Larry A.; Keele, John W.; Piper, Emily K.; Cook, Jim; Williams, Robert; Van Tassell, Curtis P.

    2013-01-01

    To assist cattle producers in the transition from microsatellite (MS) to single nucleotide polymorphism (SNP) genotyping for parental verification, we previously devised an effective and inexpensive method to impute MS alleles from SNP haplotypes. While the reported method was verified with only a limited data set (N = 479) from Brown Swiss, Guernsey, Holstein, and Jersey cattle, some of the MS-SNP haplotype associations were concordant across these phylogenetically diverse breeds. This implied that some haplotypes predate modern breed formation and remain in strong linkage disequilibrium. To expand the utility of MS allele imputation across breeds, MS and SNP data from more than 8000 animals representing 39 breeds (Bos taurus and B. indicus) were used to predict 9410 SNP haplotypes, incorporating an average of 73 SNPs per haplotype, for which alleles from 12 MS markers could be accurately imputed. Approximately 25% of the MS-SNP haplotypes were present in multiple breeds (N = 2 to 36 breeds). These shared haplotypes allowed for MS imputation in breeds that were not represented in the reference population with only a small increase in Mendelian inheritance inconsistencies. Our reported reference haplotypes can be used for any cattle breed and the reported methods can be applied to any species to aid the transition from MS to SNP genetic markers. While ~91% of the animals with imputed alleles for 12 MS markers had ≤1 Mendelian inheritance conflicts with their parents' reported MS genotypes, this figure was 96% for our reference animals, indicating potential errors in the reported MS genotypes. The workflow we suggest autocorrects for genotyping errors and rare haplotypes by MS genotyping animals whose imputed MS alleles fail parentage verification, and then incorporating those animals into the reference dataset. PMID:24065982
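
    The Mendelian inheritance conflicts counted in this abstract can be illustrated with a short Python sketch. It assumes that each genotype is a pair of microsatellite alleles recorded as fragment sizes, and that a marker is consistent when one offspring allele can come from the sire and the other from the dam. The function name mendelian_conflicts and the marker names MS1 and MS2 are hypothetical; the paper's actual verification pipeline is not described in the abstract.

```python
def mendelian_conflicts(offspring, sire, dam):
    """Count markers where the offspring genotype cannot come from the parents.

    Each argument maps a marker name to a pair of alleles (hypothetical format).
    Markers not genotyped in both parents are skipped.
    """
    conflicts = 0
    for marker, (a, b) in offspring.items():
        if marker not in sire or marker not in dam:
            continue
        s, d = set(sire[marker]), set(dam[marker])
        # Consistent if one allele is found in the sire and the other in the dam.
        consistent = (a in s and b in d) or (b in s and a in d)
        if not consistent:
            conflicts += 1
    return conflicts


calf = {"MS1": (180, 182), "MS2": (100, 104)}
sire = {"MS1": (180, 184), "MS2": (102, 106)}
dam = {"MS1": (182, 182), "MS2": (104, 108)}
print(mendelian_conflicts(calf, sire, dam))  # MS2 conflicts with the sire
```

    Under the threshold quoted in the abstract, this calf, with one conflict across its markers, would still fall in the "≤1 conflict" group.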

  2. Association of Per1 and Npas2 with autistic disorder: support for the clock genes/social timing hypothesis.

    PubMed

    Nicholas, B; Rudrasingham, V; Nash, S; Kirov, G; Owen, M J; Wimpory, D C

    2007-06-01

    Clock gene anomalies have been suggested as causative factors in autism. We screened eleven clock/clock-related genes in a predominantly high-functioning Autism Genetic Resource Exchange sample of strictly diagnosed autistic disorder progeny and their parents (110 trios) for association of clock gene variants with autistic disorder. We found significant association (P<0.05) for two single-nucleotide polymorphisms in per1 and two in npas2. Analysis of all possible combinations of two-marker haplotypes for each gene showed that in npas2 40 out of the 136 possible two-marker combinations were significant at the P<0.05 level, with the best result between markers rs1811399 and rs2117714, P=0.001. Haplotype analysis within per1 gave a single significant result: a global P=0.027 for the markers rs2253820-rs885747. No two-marker haplotype was significant in any of the other genes, despite the large number of tests performed. Our findings support the hypothesis that these epistatic clock genes may be involved in the etiology of autistic disorder. Problems in sleep, memory and timing are all characteristics of autistic disorder and aspects of sleep, memory and timing are each clock-gene-regulated in other species. We identify how our findings may be relevant to theories of autism that focus on the amygdala, cerebellum, memory and temporal deficits. We outline possible implications of these findings for developmental models of autism involving temporal synchrony/social timing.

  3. Assessing transmission of ‘Candidatus Liberibacter solanacearum’ haplotypes through seed potato

    USDA-ARS's Scientific Manuscript database

    Conflicting data has previously been reported concerning the impact of zebra chip disease transmission through seed tubers. These discrepancies may be due to the experimental design of each study, whereby different pathogen haplotypes, insect vector haplotypes, and potato plant varieties were used....

  4. Association between endothelin type A receptor haplotypes and mortality in coronary heart disease.

    PubMed

    Ellis, Katrina L; Pilbrow, Anna P; Potter, Howard C; Frampton, Chris M; Doughty, Rob N; Whalley, Gillian A; Ellis, Chris J; Palmer, Barry R; Skelton, Lorraine; Yandle, Tim G; Troughton, Richard W; Richards, A Mark; A Cameron, Vicky

    2012-05-01

    The endothelin type A receptor, encoded by EDNRA, mediates the effects of endothelin-1 to promote vasoconstriction, vascular cell growth, adhesion, fibrosis and thrombosis. We investigated the association between EDNRA haplotype and cardiovascular outcomes in patients with coronary artery disease. Coronary disease patients (n = 1007) were genotyped for the His323His (rs5333) variant and one tag SNP from each of the major EDNRA haplotype blocks (rs6537484, rs1568136, rs5335 and rs10003447). EDNRA haplotype associations with clinical history, natriuretic peptides, cardiac function and cardiovascular outcomes were tested over a median 3.8 years. Univariate analysis identified a 'low-risk' EDNRA haplotype associated with later age of Type 2 diabetes onset (p = 0.004), smaller BMI (p = 0.021), and reduced mortality (log rank p = 0.001). Cox proportional hazards analysis including established cardiovascular risk factors revealed an independent association between haplotype and mortality (p < 0.0001). These data highlight the potential importance of the endothelin system, and in particular EDNRA, in coronary disease.

  5. A powerful approach reveals numerous expression quantitative trait haplotypes in multiple tissues.

    PubMed

    Ying, Dingge; Li, Mulin Jun; Sham, Pak Chung; Li, Miaoxin

    2018-04-26

    Many recent studies have shown that single nucleotide polymorphisms (SNPs) affect gene expression and contribute to the development of complex traits/diseases in a tissue context-dependent manner. However, little is known about the influence of haplotypes, which reflect the interaction effect between SNPs, on gene expression and complex traits. In the present study, we first propose a regulatory-region-guided eQTL haplotype association analysis approach and then use it to systematically investigate expression quantitative trait loci (eQTL) haplotypes in 20 different tissues. The approach reduces the computational burden by using regulatory predictions for candidate SNP selection and by applying multiple testing corrections to non-independent haplotypes. The application results in multiple tissues showed that haplotype-based eQTLs not only increased the number of eQTL genes in a tissue-specific manner, but were also enriched in loci associated with complex traits in a tissue-matched manner. In addition, we found that tag SNPs of eQTL haplotypes from whole blood were selectively enriched in certain combinations of regulatory elements (e.g. promoters and enhancers) according to predicted chromatin states. In summary, this eQTL haplotype detection approach, together with the application results, sheds light on the synergistic effect of sequence variants on gene expression and their susceptibility to complex diseases. The executable application "eHaplo" is implemented in Java and is publicly available at http://grass.cgs.hku.hk/limx/ehaplo/. Contact: jonsonfox@gmail.com, limiaoxin@mail.sysu.edu.cn. Supplementary data are available at Bioinformatics online.

  6. Linkage Disequilibrium and Haplotype Diversity in the Genes of the Renin–Angiotensin System: Findings From the Family Blood Pressure Program

    PubMed Central

    Zhu, Xiaofeng; Yan, Denise; Cooper, Richard S.; Luke, Amy; Ikeda, Morna A.; Chang, Yen-Pei C.; Weder, Alan; Chakravarti, Aravinda

    2003-01-01

    Association studies of candidate genes with complex traits have generally used one or a few single nucleotide polymorphisms (SNPs), although variation in the extent of linkage disequilibrium (LD) within genes markedly influences the sensitivity and precision of association studies. The extent of LD and the underlying haplotype structure for most candidate genes are still unavailable. We sampled 193 blacks (African-Americans) and 160 whites (European-Americans) and estimated the intragenic LD and the haplotype structure in four genes of the renin–angiotensin system. We genotyped 25 SNPs, with all but one of the pairs spaced between 1 and 20 kb, thus providing resolution at small scale. The pattern of LD within a gene was very heterogeneous. Using a robust method to define haplotype blocks, blocks of limited haplotype diversity were identified at each locus; between these blocks, LD was lost owing to the history of recombination events. As anticipated, there was less LD among blacks, the number of haplotypes was substantially larger, and shorter haplotype segments were found, compared with whites. These findings have implications for candidate-gene association studies and indicate that variation between populations of European and African origin in haplotype diversity is characteristic of most genes. [The sequence data described in this paper are available in GenBank under the following accession nos: AGT, MIM 106150; Renin, MIM 179820; ACE, MIM 106180; Angiotensin receptor I, MIM 106165. Supplementary material is available online at http://www.genome.org.] PMID:12566395
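
    The abstract does not say which LD statistic was used. As a minimal sketch, the two most common pairwise measures, |D'| and r², can be computed from haplotype and allele frequencies as follows; the function name and the example frequencies are hypothetical.

```python
def pairwise_ld(p_ab, p_a, p_b):
    """Return (D, |D'|, r^2) for two biallelic SNPs.

    p_ab: frequency of the haplotype carrying allele A at SNP 1 and allele B at SNP 2
    p_a, p_b: frequencies of allele A and allele B
    """
    d = p_ab - p_a * p_b
    # D is normalised by its largest attainable magnitude given the allele frequencies.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime_abs = abs(d) / d_max if d_max > 0 else 0.0
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    r2 = d * d / denom if denom > 0 else 0.0
    return d, d_prime_abs, r2


# Complete LD: allele A is always found together with allele B.
complete = pairwise_ld(p_ab=0.5, p_a=0.5, p_b=0.5)
# Linkage equilibrium: haplotype frequency equals the product of allele frequencies.
equilibrium = pairwise_ld(p_ab=0.25, p_a=0.5, p_b=0.5)
print(complete, equilibrium)
```

    Computing such values for every SNP pair within a gene is one way to expose the heterogeneous LD pattern the abstract describes.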

  7. Alpha-globin gene haplotypes in South American Indians.

    PubMed

    Zago, M A; Melo Santos, E J; Clegg, J B; Guerreiro, J F; Martinson, J J; Norwich, J; Figueiredo, M S

    1995-08-01

    The haplotypes of the alpha-globin gene cluster were determined for 99 Indians from the Brazilian Amazon region who belong to 5 tribes: Wayampí, Wayana-Apalaí, Kayapó, Arára, and Yanomámi. Three predominant haplotypes were identified: Ia (present in 38.9% of chromosomes), IIIa (25.8%), and IIe (22.1%). The only alpha-globin gene rearrangement detected was alpha alpha alpha 3.7 I gene triplication associated with haplotype IIIa, found in high frequencies (5.6% and 10.6%) in two tribes and absent in the others. alpha-Globin gene deletions that cause alpha-thalassemia were not seen, supporting the argument that malaria was absent in these populations until recently. The heterogeneous distribution of alpha-globin gene haplotypes and rearrangements among the different tribes differs markedly from the homogeneous distribution of beta-globin gene cluster haplotypes and reflects the action of various genetic mechanisms (genetic drift, founder effect, consanguinity) on small isolated population groups with a complicated history of divergence-fusion events. The alpha-globin gene haplotype distribution has some similarities to distributions observed in Southeast Asian and Pacific Island populations, indicating that these populations have considerable genetic affinities. However, the absence of several features of the alpha-globin gene cluster that are consistently present among the Pacific Islanders suggests that the similarity of haplotypes between Brazilian Indians and people from Polynesia, Micronesia, and Melanesia is more likely the result of ancient common ancestry than the consequence of recent direct genetic contribution through immigration.

  8. Investigating biogeographic boundaries of the Sunda shelf: A phylogenetic analysis of two island populations of Macaca fascicularis.

    PubMed

    Klegarth, A R; Sanders, S A; Gloss, A D; Lane-deGraaf, K E; Jones-Engel, L; Fuentes, A; Hollocher, H

    2017-08-01

    Cyclical submergence and re-emergence of the Sunda Shelf throughout the Pleistocene served as a dynamic biogeographic landscape, across which long-tailed macaques (Macaca fascicularis) have migrated and evolved. Here, we tested the integrity of the continental-insular haplotype divide previously reported among Y and mitochondrial DNA lineages across multiple studies. The divide was tested by heavily sampling wild macaques from two important biogeographic regions within Sundaland: (1) Singapore, the southernmost tip of continental Asia and (2) Bali, Indonesia, the southeastern edge of the Indonesian archipelago, immediately west of Wallace's line. Y DNA was haplotyped for samples from Bali, deep within the Indonesian archipelago. Mitochondrial D-loop from both islands was analyzed against existing data using Maximum Likelihood and Bayesian approaches. We uncovered both "continental" and "insular" Y DNA haplotypes in Bali. Between Singapore and Bali we found 52 unique mitochondrial haplotypes, none of which had been previously described. Phylogenetic analyses confirmed a major haplogroup division within Singapore and identified five new Singapore subclades and two primary subclades in Bali. While we confirmed the continental-insular divide among mtDNA haplotypes, maintenance of both Y DNA haplotypes on Bali, deep within the Indonesian archipelago, calls into question the mechanism by which Y DNA diversity has been maintained. It also suggests the continental-insular designation is less appropriate for Y DNA, leading us to propose geographically neutral Y haplotype designations. © 2017 Wiley Periodicals, Inc.

  9. Tryptophan Hydroxylase 2 haplotype association with borderline personality disorder and aggression in a sample of patients with personality disorders and healthy controls

    PubMed Central

    Perez-Rodriguez, M. Mercedes; Weinstein, Shauna; New, Antonia S.; Bevilacqua, Laura; Yuan, Qiaoping; Zhou, Zhifeng; Hodgkinson, Colin; Goodman, Marianne; Koenigsberg, Harold W.; Goldman, David; Siever, Larry J.

    2010-01-01

    Background: There is decreased serotonergic function in impulsive aggression and borderline personality disorder (BPD), and genetic association studies suggest a role of serotonergic genes in impulsive aggression and BPD. Only one study has analyzed the association between the tryptophan-hydroxylase 2 (TPH2) gene and BPD. A TPH2 “risk” haplotype has been described that is associated with anxiety, depression and suicidal behavior. Methods: We assessed the relationship between the previously identified “risk” haplotype at the TPH2 locus and BPD diagnosis, impulsive aggression, affective lability, and suicidal/parasuicidal behaviors, in a well-characterized clinical sample of 103 healthy controls (HCs) and 251 patients with personality disorders (109 with BPD). A logistic regression including measures of depression, affective lability and aggression scores in predicting “risk” haplotype was conducted. Results: The prevalence of the “risk” haplotype was significantly higher in patients with BPD compared to HCs. Those with the “risk” haplotype have higher aggression and affect lability scores and more suicidal/parasuicidal behaviors than those without it. In the logistic regression model, affect lability was the only significant predictor and it correctly classified 83.1% of the subjects as “risk” or “non-risk” haplotype carriers. Conclusions: We found an association between the previously described TPH2 “risk” haplotype and BPD diagnosis, affective lability, suicidal/parasuicidal behavior, and aggression scores. PMID:20451217

  10. A spatial haplotype copying model with applications to genotype imputation.

    PubMed

    Yang, Wen-Yun; Hormozdiari, Farhad; Eskin, Eleazar; Pasaniuc, Bogdan

    2015-05-01

    Ever since its introduction, the haplotype copy model has proven to be one of the most successful approaches for modeling genetic variation in human populations, with applications ranging from ancestry inference to genotype phasing and imputation. Motivated by coalescent theory, this approach assumes that any chromosome (haplotype) can be modeled as a mosaic of segments copied from a set of chromosomes sampled from the same population. At the core of the model is the assumption that any chromosome from the sample is equally likely to contribute a priori to the copying process. Motivated by recent works that model genetic variation in a geographic continuum, we propose a new spatial-aware haplotype copy model that jointly models geography and the haplotype copying process. We extend hidden Markov models of haplotype diversity such that at any given location, haplotypes that are closest in the genetic-geographic continuum map are a priori more likely to contribute to the copying process than distant ones. Through simulations starting from the 1000 Genomes data, we show that our model achieves superior accuracy in genotype imputation over the standard spatial-unaware haplotype copy model. In addition, we show the utility of our model in selecting a small personalized reference panel for imputation that leads to both improved accuracy as well as to a lower computational runtime than the standard approach. Finally, we show our proposed model can be used to localize individuals on the genetic-geographical map on the basis of their genotype data.
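
    To make the copying process concrete, here is a minimal sketch of the forward algorithm for a Li and Stephens style haplotype copy HMM in which the prior over reference haplotypes is non-uniform. It is an illustration under stated assumptions, not the authors' model: the abstract gives no parametrization, so the function name, the switch probability `rho`, the mismatch probability `eps`, and the way `weights` would be derived from geographic proximity are all hypothetical.

```python
def copy_model_likelihood(target, panel, weights, rho=0.01, eps=0.01):
    """Likelihood of `target` as a mosaic of haplotypes copied from `panel`.

    target:  list of 0/1 alleles
    panel:   list of reference haplotypes, each a list of 0/1 alleles
    weights: prior probability of copying from each reference haplotype (sums to 1);
             a uniform vector gives the standard model, while a spatially informed
             vector favours haplotypes sampled near the target (assumed form)
    rho:     per-site probability of switching the copied haplotype
    eps:     probability that the copied allele is observed incorrectly
    """
    k = len(panel)

    def emit(j, t):
        return 1.0 - eps if panel[j][t] == target[t] else eps

    forward = [weights[j] * emit(j, 0) for j in range(k)]
    for t in range(1, len(target)):
        total = sum(forward)
        # Either keep copying haplotype j, or switch and redraw the source from the prior.
        forward = [
            emit(j, t) * ((1.0 - rho) * forward[j] + rho * weights[j] * total)
            for j in range(k)
        ]
    return sum(forward)


panel = [[0, 0, 0], [1, 1, 1]]
target = [0, 0, 0]
uniform = copy_model_likelihood(target, panel, [0.5, 0.5])
informed = copy_model_likelihood(target, panel, [0.9, 0.1])
print(uniform, informed)
```

    In this toy case the prior that favours the matching haplotype yields the higher likelihood. Real panels span many sites, so a practical implementation would rescale the forward vector at each step to avoid numerical underflow.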

  11. Genetic variability of populations of Nyssomyia neivai in the Northern State of Paraná, Brazil

    PubMed Central

    Gasparotto, Jaqueline de Carvalho; da Costa-Ribeiro, Magda Clara Vieira; Thomaz-Soccol, Vanete; Liebel, Sandra Mara Rodrigues da Silva; Neitzke-Abreu, Herintha Coeto; Reinhold-Castro, Kárin Rosi; Cristovão, Edilson Colhera; Teodoro, Ueslei

    2017-01-01

    ABSTRACT The genetic study of sandfly populations needs to be further explored given the importance of these insects for public health. We sequenced the NDH4 mitochondrial gene from populations of Nyssomyia neivai from Doutor Camargo, Lobato, Japira, and Porto Rico, municipalities in the State of Paraná, Brazil, to understand the genetic structure and gene flow. Eighty specimens of Ny. neivai were sequenced, 20 from each municipality, and 269 base pairs were obtained. A total of 27 haplotypes and 28 polymorphic sites were found, along with a haplotypic diversity of 0.80696 and a nucleotide diversity of 0.00567. Haplotype H5, with 33 specimens, was the most common among the four populations. Only haplotypes H5 and H7 were present in all four populations. The population from Doutor Camargo showed the highest genetic diversity, and only this population shared haplotypes with those from the other municipalities. The highest number of haplotypes was shared with Lobato, which also had the highest number of unique haplotypes. This probably occurred because of constant anthropic changes that happened in the environment during the first half of the twentieth century, mainly after 1998. There was no significant correlation between genetic and geographical distances regarding these populations. However, the highest genetic and geographical distances, and the lowest gene flow were observed between Japira and Porto Rico. Geographical distance is a possible barrier between these municipalities through the blocking of haplotype sharing. PMID:28380111
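
    The abstract does not state which estimator produced the haplotypic diversity value. A common choice is Nei's unbiased estimator, Hd = n/(n-1) * (1 - sum of squared haplotype frequencies), sketched below. The counts in the example are hypothetical and are not the study's data, so the output is not meant to reproduce 0.80696.

```python
def haplotype_diversity(counts):
    """Nei's unbiased haplotype diversity from per-haplotype specimen counts."""
    n = sum(counts)
    if n < 2:
        raise ValueError("at least two sequences are required")
    sum_sq_freq = sum((c / n) ** 2 for c in counts)
    return n / (n - 1) * (1.0 - sum_sq_freq)


# Hypothetical sample: 10 sequences falling into three haplotypes.
hd = haplotype_diversity([5, 3, 2])
print(round(hd, 4))
```

    The value is the probability that two sequences drawn at random from the sample carry different haplotypes, so a sample dominated by one haplotype scores low even when many rare haplotypes are present.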

  12. Mutation Analysis in Classical Phenylketonuria Patients Followed by Detecting Haplotypes Linked to Some PAH Mutations.

    PubMed

    Dehghanian, Fatemeh; Silawi, Mohammad; Tabei, Seyed M B

    2017-02-01

    Deficiency of phenylalanine hydroxylase (PAH) enzyme and elevation of phenylalanine in body fluids cause phenylketonuria (PKU). The gold standard for confirming PKU and PAH deficiency is detecting causal mutations by direct sequencing of the coding exons and splicing-related sequences of the PAH gene. Furthermore, haplotype analysis could be considered as an auxiliary approach for detecting PKU causative mutations before direct sequencing of the PAH gene by making comparisons between prior detected mutation-linked haplotypes and new PKU case haplotypes with undetermined mutations. In this study, 13 unrelated classical PKU patients were screened for causative mutations. Mutations were identified by polymerase chain reaction (PCR) and direct sequencing in all patients. After that, haplotype analysis was performed by studying VNTR and PAHSTR markers (linked genetic markers of the PAH gene) through application of PCR and capillary electrophoresis (CE). Mutation analysis was performed successfully and the detected mutations were as follows: c.782G>A, c.754C>T, c.842C>G, c.113-115delTCT, c.688G>A, and c.696A>G. Additionally, PAHSTR/VNTR haplotypes were detected to discover haplotypes linked to each mutation. Mutation detection is the best approach for confirming PAH enzyme deficiency in PKU patients. Due to the relatively large size of the PAH gene and high cost of direct sequencing in developing countries, haplotype analysis could be used before DNA sequencing and mutation detection as a faster and cheaper way of identifying probable mutated exons.

  13. Scaffolding of long read assemblies using long range contact information

    USDA-ARS's Scientific Manuscript database

    Long read technologies have made a revolution in de novo genome assembly by generating long contigs. Although the assembly contiguity has increased, it may not span a chromosome, resulting in an unfinished chromosome level assembly. To address this problem, we develop a scaffolding method that can b...

  14. A review on simple assembly line balancing type-e problem

    NASA Astrophysics Data System (ADS)

    Jusop, M.; Rashid, M. F. F. Ab

    2015-12-01

    Simple assembly line balancing (SALB) is the problem of assigning tasks to the workstations along a line so that the precedence relations are satisfied and some performance measure is optimised. Because SALB is NP-hard, advanced algorithms are needed to solve large-scale instances. Only a few studies focus on the Type-E variant (SALB-E), since it is a general and complex problem. SALB-E considers the number of workstations and the cycle time simultaneously in order to maximise line efficiency. This paper reviews previous work on optimising SALB-E, including the Genetic Algorithm approaches that have been applied to it. The review found that none of the existing works addresses resource constraints in SALB-E, especially machine and tool constraints. Research on SALB-E will contribute to improving productivity in real industrial applications.
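
    The abstract names line efficiency as the objective without giving a formula. In the line balancing literature it is usually defined as the total task time divided by the number of workstations times the cycle time, which the sketch below computes for a given assignment. The function name and task data are hypothetical, precedence constraints are not checked, and every task is assumed to be assigned to exactly one station.

```python
def line_efficiency(task_times, stations):
    """Return (efficiency, cycle_time) for an assignment of tasks to workstations.

    task_times: dict mapping task name to processing time
    stations:   list of lists of task names, one inner list per workstation
    """
    loads = [sum(task_times[task] for task in station) for station in stations]
    cycle_time = max(loads)                  # the most heavily loaded station sets the pace
    total_time = sum(task_times.values())
    return total_time / (len(stations) * cycle_time), cycle_time


task_times = {"a": 4, "b": 3, "c": 5, "d": 2, "e": 6}
efficiency, cycle_time = line_efficiency(task_times, [["a", "b"], ["c", "d"], ["e"]])
print(round(efficiency, 3), cycle_time)
```

    Because both the station count and the cycle time appear in the denominator, SALB-E has to trade one against the other, which is what distinguishes it from the variants that fix one of the two.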

  15. Reducing assembly complexity of microbial genomes with single-molecule sequencing.

    PubMed

    Koren, Sergey; Harhay, Gregory P; Smith, Timothy P L; Bono, James L; Harhay, Dayna M; Mcvey, Scott D; Radune, Diana; Bergman, Nicholas H; Phillippy, Adam M

    2013-01-01

    The short reads output by first- and second-generation DNA sequencing instruments cannot completely reconstruct microbial chromosomes. Therefore, most genomes have been left unfinished due to the significant resources required to manually close gaps in draft assemblies. Third-generation, single-molecule sequencing addresses this problem by greatly increasing sequencing read length, which simplifies the assembly problem. To measure the benefit of single-molecule sequencing on microbial genome assembly, we sequenced and assembled the genomes of six bacteria and analyzed the repeat complexity of 2,267 complete bacteria and archaea. Our results indicate that the majority of known bacterial and archaeal genomes can be assembled without gaps, at finished-grade quality, using a single PacBio RS sequencing library. These single-library assemblies are also more accurate than typical short-read assemblies and hybrid assemblies of short and long reads. Automated assembly of long, single-molecule sequencing data reduces the cost of microbial finishing to $1,000 for most genomes, and future advances in this technology are expected to drive the cost lower. This is expected to increase the number of completed genomes, improve the quality of microbial genome databases, and enable high-fidelity, population-scale studies of pan-genomes and chromosomal organization.

  16. Missing data imputation and haplotype phase inference for genome-wide association studies

    PubMed Central

    Browning, Sharon R.

    2009-01-01

    Imputation of missing data and the use of haplotype-based association tests can improve the power of genome-wide association studies (GWAS). In this article, I review methods for haplotype inference and missing data imputation, and discuss their application to GWAS. I discuss common features of the best algorithms for haplotype phase inference and missing data imputation in large-scale data sets, as well as some important differences between classes of methods, and highlight the methods that provide the highest accuracy and fastest computational performance. PMID:18850115

  17. Short communication: casein haplotype variability in Sicilian dairy goat breeds.

    PubMed

    Gigli, I; Maizon, D O; Riggio, V; Sardina, M T; Portolano, B

    2008-09-01

    In the Mediterranean region, goat milk production is an important economic activity. In the present study, 4 casein genes were genotyped in 5 Sicilian goat breeds to 1) identify casein haplotypes present in the Argentata dell'Etna, Girgentana, Messinese, Derivata di Siria, and Maltese goat breeds; and 2) describe the structure of the Sicilian goat breeds based on casein haplotypes and allele frequencies. In a sample of 540 dairy goats, 67 different haplotypes with frequency ≥0.01 and 27 with frequency ≥0.03 were observed. The most common CSN1S1-CSN2-CSN1S2-CSN3 haplotype for Derivata di Siria and Maltese was FCFB (0.17 and 0.22, respectively), whereas for Argentata dell'Etna, Girgentana and Messinese it was ACAB (0.06, 0.23, and 0.10, respectively). According to the haplotype reconstruction, Argentata dell'Etna, Girgentana, and Messinese breeds presented the most favorable haplotype for cheese production, because the casein concentration in milk of these breeds might be greater than that in Derivata di Siria and Maltese breeds. Based on a cluster analysis, the breeds formed 2 main groups: Derivata di Siria and Maltese in one group, and Argentata dell'Etna and Messinese in the other; the Girgentana breed was between these groups but closer to the latter.

  18. The JAK2 GGCC (46/1) Haplotype in Myeloproliferative Neoplasms: Causal or Random?

    PubMed Central

    Anelli, Luisa; Zagaria, Antonella; Specchia, Giorgina

    2018-01-01

    The germline JAK2 haplotype known as “GGCC or 46/1 haplotype” (haplotypeGGCC_46/1) consists of a combination of single nucleotide polymorphisms (SNPs) mapping in a region of about 250 kb, extending from the JAK2 intron 10 to the Insulin-like 4 (INSL4) gene. Four main SNPs (rs3780367, rs10974944, rs12343867, and rs1159782) generating a “GGCC” combination are more frequently indicated to represent the JAK2 haplotype. These SNPs are inherited together and are frequently associated with the onset of myeloproliferative neoplasms (MPN) positive for both JAK2 V617F and exon 12 mutations. The association between the JAK2 haplotypeGGCC_46/1 and mutations in other genes, such as thrombopoietin receptor (MPL) and calreticulin (CALR), or the association with triple negative MPN, is still controversial. This review provides an overview of the frequency and the role of the JAK2 haplotypeGGCC_46/1 in the pathogenesis of different myeloid neoplasms and describes the hypothetical mechanisms at the basis of the association with JAK2 gene mutations. Moreover, possible clinical implications are discussed, as different papers reported contrasting data about the correlation between the JAK2 haplotypeGGCC_46/1 and blood cell count, survival, or disease progression. PMID:29641446

  19. Genetic polymorphisms in MDR1 and CYP3A4 genes in Asians and the influence of MDR1 haplotypes on cyclosporin disposition in heart transplant recipients.

    PubMed

    Chowbay, Balram; Cumaraswamy, Sivathasan; Cheung, Yin Bun; Zhou, Qingyu; Lee, Edmund J D

    2003-02-01

    Intestinal cytochrome P450 3A4 (CYP3A4) and P-glycoprotein (P-gp) both play a vital role in the metabolism of oral cyclosporine (CsA). We investigated the genetic polymorphisms in CYP3A4 (promoter region and exons 5, 7 and 9) and MDR1 (exons 12, 21 and 26) genes and the impact of these polymorphisms on the pharmacokinetics of oral CsA in stable heart transplant patients (n = 14). CYP3A4 polymorphisms were rare in the Asian population and transplant patients. Haplotype analysis revealed 12 haplotypes in the Chinese, eight in the Malays and 10 in the Indians. T-T-T was the most common haplotype in all ethnic groups. The frequency of the homozygous mutant genotype at all three loci (TT-TT-TT) was highest in the Indians (31%) compared to 19% and 15% in the Chinese and Malays, respectively. In heart transplant patients, CsA exposure (AUC(0-4 h), AUC(0-12 h) and C(max)) was higher in patients with the T-T-T haplotypes than in those with C-G-C haplotypes. These findings suggest that haplotypes rather than genotypes influence CsA disposition in transplant patients.

  20. Haplotype frequency distribution for 7 microsatellites in chromosome 8 and 11 in relation to the metabolic syndrome in four ethnic groups: Tehran Lipid and Glucose Study.

    PubMed

    Daneshpour, Maryam Sadat; Hosseinzadeh, Nima; Zarkesh, Maryam; Azizi, Fereidoun

    2012-03-01

    Different variants of haplotype frequencies may lead to various frequencies of the same variants in individuals with drug resistance and disease susceptibility at the population level. In this study, the haplotype frequencies of 4 STR loci including the D8S1132, D8S1779, D8S514 and D8S1743, and 3 STR loci including D11S1304, D11S1998 and D11S934 were investigated in 563 individuals of four Iranian ethnic groups in the capital city of Iran, Tehran. One hundred thirty subjects had the metabolic syndrome. Haplotype frequencies of all markers were calculated. There were significant differences in the haplotype frequencies in short and long alleles between the metabolic affected subjects and controls. In addition, haplotype frequencies were significant in the four ethnic groups in both chromosomes 8 and 11. Our findings show a relation between the short allele of D8S1743 in all related haplotype frequencies of subjects with metabolic syndrome. These findings may require more studies of some candidate genes, including the lipoprotein lipase gene, in this chromosomal region. Copyright © 2011. Published by Elsevier B.V.

  1. Huntingtin Haplotypes Provide Prioritized Target Panels for Allele-specific Silencing in Huntington Disease Patients of European Ancestry

    PubMed Central

    Kay, Chris; Collins, Jennifer A; Skotte, Niels H; Southwell, Amber L; Warby, Simon C; Caron, Nicholas S; Doty, Crystal N; Nguyen, Betty; Griguoli, Annamaria; Ross, Colin J; Squitieri, Ferdinando; Hayden, Michael R

    2015-01-01

    Huntington disease (HD) is a dominant neurodegenerative disorder caused by a CAG repeat expansion in the Huntingtin gene (HTT). Heterozygous polymorphisms in cis with the mutation allow for allele-specific suppression of the pathogenic HTT transcript as a therapeutic strategy. To prioritize target selection, precise heterozygosity estimates are needed across diverse HD patient populations. Here we present the first comprehensive investigation of all common target alleles across the HTT gene, using 738 reference haplotypes from the 1000 Genomes Project and 2364 haplotypes from HD patients and relatives in Canada, Sweden, France, and Italy. The most common HD haplotypes (A1, A2, and A3a) define mutually exclusive sets of polymorphisms for allele-specific therapy in the greatest number of patients. Across all four populations, a maximum of 80% are treatable using these three target haplotypes. We identify a novel deletion found exclusively on the A1 haplotype, enabling potent and selective silencing of mutant HTT in approximately 40% of the patients. Antisense oligonucleotides complementary to the deletion reduce mutant A1 HTT mRNA by 78% in patient cells while sparing wild-type HTT expression. By suppressing specific haplotypes on which expanded CAG occurs, we demonstrate a rational approach to the development of allele-specific therapy for a monogenic disorder. PMID:26201449

  2. Multi-objective Analysis for a Sequencing Planning of Mixed-model Assembly Line

    NASA Astrophysics Data System (ADS)

    Shimizu, Yoshiaki; Waki, Toshiya; Yoo, Jae Kyu

    Diversified customer demands are raising importance of just-in-time and agile manufacturing much more than before. Accordingly, introduction of mixed-model assembly lines becomes popular to realize the small-lot-multi-kinds production. Since it produces various kinds on the same assembly line, a rational management is of special importance. With this point of view, this study focuses on a sequencing problem of mixed-model assembly line including a paint line as its preceding process. By taking into account the paint line together, reducing work-in-process (WIP) inventory between these heterogeneous lines becomes a major concern of the sequencing problem besides improving production efficiency. Finally, we have formulated the sequencing problem as a bi-objective optimization problem to prevent various line stoppages, and to reduce the volume of WIP inventory simultaneously. Then we have proposed a practical method for the multi-objective analysis. For this purpose, we applied the weighting method to derive the Pareto front. Actually, the resulting problem is solved by a meta-heuristic method like SA (Simulated Annealing). Through numerical experiments, we verified the validity of the proposed approach, and discussed the significance of trade-off analysis between the conflicting objectives.
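
    The weighting method mentioned above can be sketched as follows: the two objectives are combined into one weighted sum, the weight is swept from 0 to 1, and the minimizer at each weight is recorded. This is an illustration only. The candidate values are hypothetical, and plain enumeration stands in for the simulated annealing search the authors used to solve each scalarized problem.

```python
def weighted_sum_front(candidates, n_weights=11):
    """Collect the minimizers of w*f1 + (1-w)*f2 as the weight w sweeps from 0 to 1.

    candidates: list of (f1, f2) objective pairs, both to be minimised, for example
                (line stoppage measure, WIP inventory) of candidate sequences
    """
    chosen = set()
    for i in range(n_weights):
        w = i / (n_weights - 1)
        best = min(candidates, key=lambda f: w * f[0] + (1 - w) * f[1])
        chosen.add(best)
    return sorted(chosen)


# Hypothetical objective values for five candidate production sequences.
candidates = [(1, 9), (3, 4), (8, 1), (6, 6), (9, 9)]
front = weighted_sum_front(candidates)
print(front)
```

    A known limitation of this scheme is that it can only reach Pareto-optimal points lying on the convex hull of the objective space, so trade-offs in non-convex regions of the front are missed however finely the weight is sampled.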

  3. Optimisation of assembly scheduling in VCIM systems using genetic algorithm

    NASA Astrophysics Data System (ADS)

    Dao, Son Duy; Abhary, Kazem; Marian, Romeo

    2017-09-01

    Assembly plays an important role in any production system as it constitutes a significant portion of the lead time and cost of a product. Virtual computer-integrated manufacturing (VCIM) system is a modern production system being conceptually developed to extend the application of traditional computer-integrated manufacturing (CIM) system to global level. Assembly scheduling in VCIM systems is quite different from one in traditional production systems because of the difference in the working principles of the two systems. In this article, the assembly scheduling problem in VCIM systems is modeled and then an integrated approach based on genetic algorithm (GA) is proposed to search for a global optimised solution to the problem. Because of dynamic nature of the scheduling problem, a novel GA with unique chromosome representation and modified genetic operations is developed herein. Robustness of the proposed approach is verified by a numerical example.

  4. A vertebrate case study of the quality of assemblies derived from next-generation sequences

    PubMed Central

    2011-01-01

    The unparalleled efficiency of next-generation sequencing (NGS) has prompted widespread adoption, but significant problems remain in the use of NGS data for whole genome assembly. We explore the advantages and disadvantages of chicken genome assemblies generated using a variety of sequencing and assembly methodologies. NGS assemblies are equivalent in some ways to a Sanger-based assembly yet deficient in others. Nonetheless, these assemblies are sufficient for the identification of the majority of genes and can reveal novel sequences when compared to existing assembly references. PMID:21453517

  5. Mechanisms of haplotype divergence at the RGA08 nucleotide-binding leucine-rich repeat gene locus in wild banana (Musa balbisiana).

    PubMed

    Baurens, Franc-Christophe; Bocs, Stéphanie; Rouard, Mathieu; Matsumoto, Takashi; Miller, Robert N G; Rodier-Goud, Marguerite; MBéguié-A-MBéguié, Didier; Yahiaoui, Nabila

    2010-07-16

    Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M. balbisiana haplotypes. A large RGA08 gene cluster identified in wild banana corresponds to a highly variable genomic region between haplotypes surrounded by conserved flanking regions. High level of sequence identity (70 to 99%) of the genic and intergenic regions suggests a recent and rapid evolution of this cluster in M. balbisiana.

  6. Multi-allelic haplotype model based on genetic partition for genomic prediction and variance component estimation using SNP markers.

    PubMed

    Da, Yang

    2015-12-18

    The amount of functional genomic information has been growing rapidly but remains largely unused in genomic selection. Genomic prediction and estimation using haplotypes in genome regions with functional elements such as all genes of the genome can be an approach to integrate functional and structural genomic information for genomic selection. Towards this goal, this article develops a new haplotype approach for genomic prediction and estimation. A multi-allelic haplotype model treating each haplotype as an 'allele' was developed for genomic prediction and estimation based on the partition of a multi-allelic genotypic value into additive and dominance values. Each additive value is expressed as a function of h - 1 additive effects, where h = number of alleles or haplotypes, and each dominance value is expressed as a function of h(h - 1)/2 dominance effects. For a sample of q individuals, the limit number of effects is 2q - 1 for additive effects and is the number of heterozygous genotypes for dominance effects. Additive values are factorized as a product between the additive model matrix and the h - 1 additive effects, and dominance values are factorized as a product between the dominance model matrix and the h(h - 1)/2 dominance effects. Genomic additive relationship matrix is defined as a function of the haplotype model matrix for additive effects, and genomic dominance relationship matrix is defined as a function of the haplotype model matrix for dominance effects. Based on these results, a mixed model implementation for genomic prediction and variance component estimation that jointly use haplotypes and single markers is established, including two computing strategies for genomic prediction and variance component estimation with identical results. 
The multi-allelic genetic partition fills a theoretical gap in genetic partition by providing general formulations for partitioning multi-allelic genotypic values and provides a haplotype method based on the quantitative genetics model towards the utilization of functional and structural genomic information for genomic prediction and estimation.
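
The effect counts quoted in this abstract (h - 1 additive effects and h(h - 1)/2 dominance effects for h haplotypes) can be checked with a short calculation. This is only an illustrative sketch; the function name `effect_counts` is hypothetical and is not taken from the article.

```python
def effect_counts(h: int) -> tuple[int, int]:
    """Return (additive, dominance) effect counts for h haplotype 'alleles'."""
    if h < 1:
        raise ValueError("h must be at least 1")
    additive = h - 1                 # h - 1 additive effects
    dominance = h * (h - 1) // 2     # one effect per heterozygous pair
    return additive, dominance


for h in (2, 3, 4):
    print(h, effect_counts(h))
```

The stated upper limit of 2q - 1 additive effects is consistent with this formula: q diploid individuals carry at most 2q distinct haplotypes, so h - 1 cannot exceed 2q - 1.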

  7. Evaluation of the influence of dominance rules for the assembly line design problem under consideration of product design alternatives

    NASA Astrophysics Data System (ADS)

    Oesterle, Jonathan; Lionel, Amodeo

    2018-06-01

    The current competitive situation increases the importance of realistically estimating product costs during the early phases of product and assembly line planning projects. In this article, several multi-objective algorithms using different dominance rules are proposed to solve the problem associated with the selection of the most effective combination of product and assembly lines. The list of developed algorithms includes variants of ant colony algorithms, evolutionary algorithms and imperialist competitive algorithms. The performance of each algorithm and dominance rule is analysed by five multi-objective quality indicators and fifty problem instances. The algorithms and dominance rules are ranked using a non-parametric statistical test.

  8. Dominant Sequences of Human Major Histocompatibility Complex Conserved Extended Haplotypes from HLA-DQA2 to DAXX

    PubMed Central

    Larsen, Charles E.; Alford, Dennis R.; Trautwein, Michael R.; Jalloh, Yanoh K.; Tarnacki, Jennifer L.; Kunnenkeri, Sushruta K.; Fici, Dolores A.; Yunis, Edmond J.; Awdeh, Zuheir L.; Alper, Chester A.

    2014-01-01

    We resequenced and phased 27 kb of DNA within 580 kb of the MHC class II region in 158 population chromosomes, most of which were conserved extended haplotypes (CEHs) of European descent or contained their centromeric fragments. We determined the single nucleotide polymorphism and deletion-insertion polymorphism alleles of the dominant sequences from HLA-DQA2 to DAXX for these CEHs. Nine of 13 CEHs remained sufficiently intact to possess a dominant sequence extending at least to DAXX, 230 kb centromeric to HLA-DPB1. We identified the regions centromeric to HLA-DQB1 within which single instances of eight “common” European MHC haplotypes previously sequenced by the MHC Haplotype Project (MHP) were representative of those dominant CEH sequences. Only two MHP haplotypes had a dominant CEH sequence throughout the centromeric and extended class II region and one MHP haplotype did not represent a known European CEH anywhere in the region. We identified the centromeric recombination transition points of other MHP sequences from CEH representation to non-representation. Several CEH pairs or groups shared sequence identity in small blocks but had significantly different (although still conserved for each separate CEH) sequences in surrounding regions. These patterns partly explain strong calculated linkage disequilibrium over only short (tens to hundreds of kilobases) distances in the context of a finite number of observed megabase-length CEHs comprising half a population's haplotypes. Our results provide a clearer picture of European CEH class II allelic structure and population haplotype architecture, improved regional CEH markers, and raise questions concerning regional recombination hotspots. PMID:25299700

  9. Neuropsychiatric systemic lupus erythematosus is associated with imbalance in interleukin 10 promoter haplotypes

    PubMed Central

    Rood, M; Keijsers, V; van der Linden, M W; Tong, T; Borggreve, S; Verweij, C; Breedveld, F; Huizinga, T

    1999-01-01

    OBJECTIVE—To investigate the association of interleukin 10 (IL10) promoter polymorphisms and neuropsychiatric manifestations of systemic lupus erythematosus (SLE).
METHODS—IL10 haplotypes of 11 healthy volunteers were cloned to confirm that in the Dutch population, only the three common haplotypes (-1082/-819/-592) GCC, ACC and ATA exist. The IL10 promoter polymorphisms of 92 SLE patients and 162 healthy controls were determined. The medical records of the SLE patients were screened for the presence of neuropsychiatric involvement.
RESULTS—All cloned haplotypes were either GCC, ACC or ATA. Forty two SLE patients had suffered from neuropsychiatric manifestations (NP-SLE). In NP-SLE patients, the frequency of the ATA haplotype is 30% versus 18% in the controls and 17% in the non-NP-SLE group (odds ratios 1.9, p=0.02, and 2.1, p=0.04, respectively), whereas the GCC haplotype frequency is lower in the NP-SLE group compared with controls and non-NP-SLE patients (40% versus 55% and 61%, odds ratios 0.6, p=0.02 and 0.4 p=0.006). The odds ratio for the presence of NP-SLE is inversely proportional to the number of GCC haplotypes per genotype when the NP-SLE group is compared with non-NP-SLE patients.
CONCLUSIONS—The IL10 locus is associated with neuropsychiatric manifestations in SLE. This suggests that IL10 is implicated in the immunopathogenesis of neuropsychiatric manifestations in SLE.

 Keywords: systemic lupus erythematosus; neuropsychiatric manifestations; genetics; interleukin 10 promoter haplotypes PMID:10343522
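
The odds ratios in the RESULTS paragraph can be approximately recomputed from the reported haplotype frequencies. The sketch below assumes the odds ratios were derived directly from those frequencies, which the abstract does not state; the function name `odds_ratio` is hypothetical.

```python
def odds_ratio(p_case: float, p_control: float) -> float:
    """Odds ratio for a haplotype given its frequency in two groups."""
    if not (0 < p_case < 1 and 0 < p_control < 1):
        raise ValueError("frequencies must lie strictly between 0 and 1")
    return (p_case / (1 - p_case)) / (p_control / (1 - p_control))


# Frequencies as reported in the abstract (rounded percentages).
print(odds_ratio(0.30, 0.18))  # ATA: NP-SLE vs controls
print(odds_ratio(0.30, 0.17))  # ATA: NP-SLE vs non-NP-SLE
print(odds_ratio(0.40, 0.55))  # GCC: NP-SLE vs controls
print(odds_ratio(0.40, 0.61))  # GCC: NP-SLE vs non-NP-SLE
```

The results land close to the reported values of 1.9, 2.1, 0.6 and 0.4. Small differences are expected because the abstract gives the percentages only to the nearest whole number.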

  10. African gene flow to north Brazil as revealed by HBB*S gene haplotype analysis.

    PubMed

    Lemos Cardoso, Greice; Farias Guerreiro, João

    2006-01-01

    Haplotypes linked to the HBB*S gene were analyzed in a sample of 260 chromosomes of Brazilian sickle cell anemia patients from the population of Belém, state of Pará, to evaluate if the present-day haplotype frequencies correlate as well as expected with historical information on the geographic origin of African slaves sent directly to Northern Brazil. The HBB*S gene haplotype distribution (66% Bantu, 21.8% Benin, 10.9% Senegal, and 1.3% Cameroon) is in agreement with those observed for other Brazilian populations regarding the highest proportion of the Bantu type, followed by the Benin type, but it differs significantly concerning the Senegal type as this haplotype is rare or absent in samples from other Brazilian regions already studied. In addition, our results are in accordance with historical records that establish that about 90% of the slaves sent to Northern Brazil were from Angola, Congo, and Mozambique, where the Bantu haplotype predominates, in contrast to 10% of slaves from Senegambia, Guinea-Bissau, and Cape Verde, where the Senegal haplotype is the most common. On the other hand, the observed frequency of the Benin haplotype in Belém was much higher than that expected by historical data. This fact corroborates the suggestion that the high prevalence of the Benin type in Belém is due to domestic slave trade and later internal migrations, mainly from the Northeast, since there are no historical records of direct slave trade from Central West Africa to North Brazil. Am. J. Hum. Biol. 18:93-98, 2006. (c) 2005 Wiley-Liss, Inc.

  11. Cluster analysis of European Y-chromosomal STR haplotypes using the discrete Laplace method.

    PubMed

    Andersen, Mikkel Meyer; Eriksen, Poul Svante; Morling, Niels

    2014-07-01

    The European Y-chromosomal short tandem repeat (STR) haplotype distribution has previously been analysed in various ways. Here, we introduce a new way of analysing population substructure using a new method based on clustering within the discrete Laplace exponential family that models the probability distribution of the Y-STR haplotypes. Creating a consistent statistical model of the haplotypes enables us to perform a wide range of analyses. Previously, haplotype frequency estimation using the discrete Laplace method has been validated. In this paper we investigate how the discrete Laplace method can be used for cluster analysis to further validate the discrete Laplace method. A very important practical fact is that the calculations can be performed on a normal computer. We identified two sub-clusters of the Eastern and Western European Y-STR haplotypes similar to results of previous studies. We also compared pairwise distances (between geographically separated samples) with those obtained using the AMOVA method and found good agreement. Further analyses that are impossible with AMOVA were made using the discrete Laplace method: analysis of the homogeneity in two different ways and calculating marginal STR distributions. We found that the Y-STR haplotypes from e.g. Finland were relatively homogeneous as opposed to the relatively heterogeneous Y-STR haplotypes from e.g. Lublin, Eastern Poland and Berlin, Germany. We demonstrated that the observed distributions of alleles at each locus were similar to the expected ones. We also compared pairwise distances between geographically separated samples from Africa with those obtained using the AMOVA method and found good agreement. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  12. F8 haplotype and inhibitor risk: results from the Hemophilia Inhibitor Genetics Study (HIGS) Combined Cohort

    PubMed Central

    Schwarz, John; Astermark, Jan; Menius, Erika D.; Carrington, Mary; Donfield, Sharyne M.; Gomperts, Edward D.; Nelson, George W.; Oldenburg, Johannes; Pavlova, Anna; Shapiro, Amy D.; Winkler, Cheryl A.; Berntorp, Erik

    2012-01-01

    Background Ancestral background, specifically African descent, confers higher risk for development of inhibitory antibodies to factor VIII (FVIII) in hemophilia A. It has been suggested that differences in the distribution of factor VIII gene (F8) haplotypes, and mismatch between endogenous F8 haplotypes and those comprising products used for treatment could contribute to risk. Design and Methods Data from the HIGS Combined Cohort were used to determine the association between F8 haplotype 3 (H3) vs. haplotypes 1 and 2 (H1+H2) and inhibitor risk among individuals of genetically-determined African descent. Other variables known to affect inhibitor risk including type of F8 mutation and HLA were included in the analysis. A second research question regarding risk related to mismatch in endogenous F8 haplotype and recombinant FVIII products used for treatment was addressed. Results H3 was associated with higher inhibitor risk among those genetically-identified (N=49) as of African ancestry, but the association did not remain significant after adjustment for F8 mutation type and the HLA variables. Among subjects of all racial ancestries enrolled in HIGS who reported early use of recombinant products (N=223), mismatch in endogenous haplotype and the FVIII proteins constituting the products used did not confer greater risk for inhibitor development. Conclusion H3 was not an independent predictor of inhibitor risk. Further, our findings did not support a higher risk of inhibitors in the presence of a haplotype mismatch between the FVIII molecule infused and that of the individual. PMID:22958194

  13. Dual African Origins of Global Aedes aegypti s.l. Populations Revealed by Mitochondrial DNA

    PubMed Central

    Moore, Michelle; Sylla, Massamba; Goss, Laura; Burugu, Marion Warigia; Sang, Rosemary; Kamau, Luna W.; Kenya, Eucharia Unoma; Bosio, Chris; Munoz, Maria de Lourdes; Sharakova, Maria; Black, William Cormack

    2013-01-01

    Background Aedes aegypti is the primary global vector to humans of yellow fever and dengue flaviviruses. Over the past 50 years, many population genetic studies have documented large genetic differences among global populations of this species. These studies initially used morphological polymorphisms, followed later by allozymes, and most recently various molecular genetic markers including microsatellites and mitochondrial markers. In particular, since 2000, fourteen publications and four unpublished datasets have used sequence data from the NADH dehydrogenase subunit 4 mitochondrial gene to compare Ae. aegypti collections and collectively 95 unique mtDNA haplotypes have been found. Phylogenetic analyses in these many studies consistently resolved two clades but no comprehensive study of mtDNA haplotypes has been made in Africa, the continent in which the species originated. Methods and Findings ND4 haplotypes were sequenced in 426 Ae. aegypti s.l. from Senegal, West Africa and Kenya, East Africa. In Senegal 15 and in Kenya 7 new haplotypes were discovered. When added to the 95 published haplotypes and including 6 African Aedes species as outgroups, phylogenetic analyses showed that all but one Senegal haplotype occurred in a basal clade while most East African haplotypes occurred in a second clade arising from the basal clade. Globally distributed haplotypes occurred in both clades demonstrating that populations outside Africa consist of mixtures of mosquitoes from both clades. Conclusions Populations of Ae. aegypti outside Africa consist of mosquitoes arising from one of two ancestral clades. One clade is basal and primarily associated with West Africa while the second arises from the first and contains primarily mosquitoes from East Africa. PMID:23638196

  14. Discovery of novel MHC-class I alleles and haplotypes in Filipino cynomolgus macaques (Macaca fascicularis) by pyrosequencing and Sanger sequencing: Mafa-class I polymorphism.

    PubMed

    Shiina, Takashi; Yamada, Yukiho; Aarnink, Alice; Suzuki, Shingo; Masuya, Anri; Ito, Sayaka; Ido, Daisuke; Yamanaka, Hisashi; Iwatani, Chizuru; Tsuchiya, Hideaki; Ishigaki, Hirohito; Itoh, Yasushi; Ogasawara, Kazumasa; Kulski, Jerzy K; Blancher, Antoine

    2015-10-01

    Although the low polymorphism of the major histocompatibility complex (MHC) transplantation genes in the Filipino cynomolgus macaque (Macaca fascicularis) is expected to have important implications in the selection and breeding of animals for medical research, detailed polymorphism information is still lacking for many of the duplicated class I genes. To better elucidate the degree and types of MHC polymorphisms and haplotypes in the Filipino macaque population, we genotyped 127 unrelated animals by the Sanger sequencing method and high-resolution pyrosequencing and identified 112 different alleles, 28 at cynomolgus macaque MHC (Mafa)-A, 54 at Mafa-B, 12 at Mafa-I, 11 at Mafa-E, and seven at Mafa-F alleles, of which 56 were newly described. Of them, the newly discovered Mafa-A8*01:01 lineage allele had low nucleotide similarities (<86%) with primate MHC class I genes, and it was also conserved in the Vietnamese and Indonesian populations. In addition, haplotype estimations revealed 17 Mafa-A, 23 Mafa-B, and 12 Mafa-E haplotypes integrated with 84 Mafa-class I haplotypes and Mafa-F alleles. Of these, the two Mafa-class I haplotypes, F/A/E/B-Hp1 and F/A/E/B-Hp2, had the highest haplotype frequencies at 10.6 and 10.2%, respectively. This suggests that large scale genetic screening of the Filipino macaque population would identify these and other high-frequency Mafa-class I haplotypes that could be used as MHC control animals for the benefit of biomedical research.

  15. Discovery of a haplotype affecting fertility in Ayrshire dairy cattle and identification of a putative causal variant

    USDA-ARS's Scientific Manuscript database

    Initial genomic test results for US Ayrshire dairy cattle became available in January of 2013. Several haplotypes that showed a deficiency of homozygotes were investigated to determine if they had an effect on fertility. A haplotype on chromosome 17 was determined to affect fertility, indicating tha...

  16. Submegabase Clusters of Unstable Tandem Repeats Unique to the Tla Region of Mouse T Haplotypes

    PubMed Central

    Uehara, H.; Ebersole, T.; Bennett, D.; Artzt, K.

    1990-01-01

    We describe here the identification and genomic organization of mouse t haplotype-specific elements (TSEs) 7.8 and 5.8 kb in length. The TSEs exist as submegabase-long clusters of tandem repeats localized in the Tla region of the major histocompatibility complex of all t haplotype chromosomes examined. In contrast, no such clusters were detected among 12 inbred strains of Mus musculus and other Mus species; thus, clusters of TSEs represent the first absolutely qualitative difference between t haplotypes and wild-type chromosomes. Pulsed field gel electrophoresis shows that the number of clusters, and the number of repeats in each cluster are extremely variable. Dramatic quantitative differences of TSEs uniquely distinguish every independent t haplotype from any other. The complete nucleotide sequence of one 7.8-kb TSE reveals significant homology to the ETn (a major transcript in the early embryo of the mouse), and some homologies to intracisternal A-particles and the mammary tumor virus env gene. Apart from the diagnostic relevance to t haplotypes, evolutionary and functional significances are discussed with respect to chromosome structure and genetic recombination. PMID:2076812

  17. Phylogeography of Japanese horse chestnut (Aesculus turbinata) in the Japanese Archipelago based on chloroplast DNA haplotypes.

    PubMed

    Sugahara, Kanako; Kaneko, Yuko; Ito, Satoshi; Yamanaka, Keisuke; Sakio, Hitoshi; Hoshizaki, Kazuhiko; Suzuki, Wajiro; Yamanaka, Norikazu; Setoguchi, Hiroaki

    2011-01-01

    Japanese horse chestnut (Aesculus turbinata: Hippocastanaceae) is one of the typical woody plants that grow in temperate riparian forests in the Japanese Archipelago. To analyze the phylogeography of this plant in the Japanese Archipelago, we determined cpDNA haplotypes for 337 samples from 55 populations covering the entire distribution range. Based on 1,313 bp of two spacers, we determined ten haplotypes that are distinguished from adjacent haplotypes by one or two steps. Most of the populations had a single haplotype, suggesting low diversity. Spatial analysis of molecular variance suggested three obvious phylogeographic structures in western Japan, where Japanese horse chestnut is scattered and isolated in mountainous areas. Conversely, no clear phylogeographic structure was observed from the northern to the southern limit of this species, including eastern Japan, where this plant is more common. Rare and private haplotypes were also found in southwestern Japan, where Japanese horse chestnuts are distributed sparsely. These findings imply that western Japan might have maintained a relatively large habitat for A. turbinata during the Quaternary climatic oscillations, while northerly regions could not.

  18. In Vivo Characterization of Human APOA5 Haplotypes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ahituv, Nadav; Akiyama, Jennifer; Chapman-Helleboid, Audrey

    2006-10-01

    Increased plasma triglyceride concentrations are an independent risk factor for cardiovascular disease. Numerous studies support a reproducible genetic association between two minor haplotypes in the human apolipoprotein A5 gene (APOA5) and increased plasma triglyceride concentrations. We thus sought to investigate the effect of these minor haplotypes (APOA5*2 and APOA5*3) on ApoAV plasma levels through the precise insertion of single-copy intact APOA5 haplotypes at a targeted location in the mouse genome. While we found no difference in the amount of human plasma ApoAV in mice containing the common APOA5*1 and minor APOA5*2 haplotype, the introduction of the single APOA5*3 defining allele (19W) resulted in 3-fold lower ApoAV plasma levels consistent with existing genetic association studies. These results indicate that S19W polymorphism is likely to be functional and explain the strong association of this variant with plasma triglycerides supporting the value of sensitive in vivo assays to define the functional nature of human haplotypes.

  19. Mineralocorticoid receptor haplotype, oral contraceptives and emotional information processing.

    PubMed

    Hamstra, D A; de Kloet, E R; van Hemert, A M; de Rijk, R H; Van der Does, A J W

    2015-02-12

    Oral contraceptives (OCs) affect mood in some women and may have more subtle effects on emotional information processing in many more users. Female carriers of mineralocorticoid receptor (MR) haplotype 2 have been shown to be more optimistic and less vulnerable to depression. To investigate the effects of oral contraceptives on emotional information processing and a possible moderating effect of MR haplotype. Cross-sectional study in 85 healthy premenopausal women of West-European descent. We found significant main effects of oral contraceptives on facial expression recognition, emotional memory and decision-making. Furthermore, carriers of MR haplotype 1 or 3 were sensitive to the impact of OCs on the recognition of sad and fearful faces and on emotional memory, whereas MR haplotype 2 carriers were not. Different compounds of OCs were included. No hormonal measures were taken. Most naturally cycling participants were assessed in the luteal phase of their menstrual cycle. Carriers of MR haplotype 2 may be less sensitive to depressogenic side-effects of OCs. Copyright © 2015 IBRO. Published by Elsevier Ltd. All rights reserved.

  20. Fast and accurate genotype imputation in genome-wide association studies through pre-phasing

    PubMed Central

    Howie, Bryan; Fuchsberger, Christian; Stephens, Matthew; Marchini, Jonathan; Abecasis, Gonçalo R.

    2013-01-01

    Sequencing efforts, including the 1000 Genomes Project and disease-specific efforts, are producing large collections of haplotypes that can be used for genotype imputation in genome-wide association studies (GWAS). Imputing from these reference panels can help identify new risk alleles, but the use of large panels with existing methods imposes a high computational burden. To keep imputation broadly accessible, we introduce a strategy called “pre-phasing” that maintains the accuracy of leading methods while cutting computational costs by orders of magnitude. In brief, we first statistically estimate the haplotypes for each GWAS individual (“pre-phasing”) and then impute missing genotypes into these estimated haplotypes. This reduces the computational cost because: (i) the GWAS samples must be phased only once, whereas standard methods would implicitly re-phase with each reference panel update; (ii) it is much faster to match a phased GWAS haplotype to one reference haplotype than to match unphased GWAS genotypes to a pair of reference haplotypes. This strategy will be particularly valuable for repeated imputation as reference panels evolve. PMID:22820512

  1. Multiple genetic origins of histidine-rich protein 2 gene deletion in Plasmodium falciparum parasites from Peru

    PubMed Central

    Akinyi, Sheila; Hayden, Tonya; Gamboa, Dionicia; Torres, Katherine; Bendezu, Jorge; Abdallah, Joseph F.; Griffing, Sean M.; Quezada, Wilmer Marquiño; Arrospide, Nancy; De Oliveira, Alexandre Macedo; Lucas, Carmen; Magill, Alan J.; Bacon, David J.; Barnwell, John W.; Udhayakumar, Venkatachalam

    2013-01-01

    The majority of malaria rapid diagnostic tests (RDTs) detect Plasmodium falciparum histidine-rich protein 2 (PfHRP2), encoded by the pfhrp2 gene. Recently, P. falciparum isolates from Peru were found to lack pfhrp2 leading to false-negative RDT results. We hypothesized that pfhrp2-deleted parasites in Peru derived from a single genetic event. We evaluated the parasite population structure and pfhrp2 haplotype of samples collected between 1998 and 2005 using seven neutral and seven chromosome 8 microsatellite markers, respectively. Five distinct pfhrp2 haplotypes, corresponding to five neutral microsatellite-based clonal lineages, were detected in 1998-2001; pfhrp2 deletions occurred within four haplotypes. In 2003-2005, outcrossing among the parasite lineages resulted in eight population clusters that inherited the five pfhrp2 haplotypes seen previously and a new haplotype; pfhrp2 deletions occurred within four of these haplotypes. These findings indicate that the genetic origin of pfhrp2 deletion in Peru was not a single event, but likely occurred multiple times. PMID:24077522

  2. Gene Flow Patterns of the Mayfly Fallceon quilleri in San Diego County, California.

    NASA Astrophysics Data System (ADS)

    Zickovich, J.; Bohonak, A. J.

    2005-05-01

    Management decisions and conservation strategies for freshwater invertebrates critically depend on an understanding of gene flow and genetic structure. We collected the mayfly Fallceon quilleri (Ephemeroptera: Baetidae) from 15 streams across three geographically distinct watersheds in San Diego County, California (San Dieguito, Santa Margarita, and Tijuana) and one site in Anza-Borrego desert. We sequenced a 667 base pair region of the mitochondrial DNA (COI) to assess genetic structure and gene flow. We found eight haplotypes across all populations. San Dieguito and Santa Margarita each contained six haplotypes. Tijuana and Anza Borrego each contained four haplotypes. The expected heterozygosity for San Dieguito, Santa Margarita, Tijuana, and Anza Borrego was 0.81, 0.83, 0.75, and 1.0, respectively. A hierarchical AMOVA analysis indicated restricted gene flow and a pairwise comparison indicated that Tijuana watershed differs significantly from San Dieguito and Anza Borrego. A haplotype cladogram revealed two internal ancestral haplotypes and six derived tip haplotypes that are unique to particular watersheds. These results suggest that Tijuana (the southernmost and the most impacted watershed) is more genetically distinct and isolated than the other watersheds sampled.
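
The expected heterozygosity values reported above can be illustrated with a small calculation. The abstract does not say which estimator was used; this sketch assumes Nei's unbiased haplotype diversity, H = n/(n - 1) * (1 - sum of squared haplotype frequencies). The function name `haplotype_diversity` and the sample labels are hypothetical.

```python
from collections import Counter


def haplotype_diversity(haplotypes):
    """Nei's unbiased haplotype diversity for a list of haplotype labels."""
    n = len(haplotypes)
    if n < 2:
        raise ValueError("need at least two sampled individuals")
    counts = Counter(haplotypes)
    # Sum of squared sample frequencies of each haplotype.
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1 - sum_sq)


# Hypothetical samples: labels stand for distinct COI haplotypes.
print(haplotype_diversity(["A", "B", "C", "D"]))        # all different
print(haplotype_diversity(["A", "A", "A", "B", "C"]))   # one common haplotype
```

Under this estimator a value of 1.0 arises only when every sampled individual carries a different haplotype, so values at that ceiling from small samples should be read with caution.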

  3. Interactions Between Serotonin Transporter Gene Haplotypes and Quality of Mothers’ Parenting Predict the Development of Children’s Noncompliance

    PubMed Central

    Sulik, Michael J.; Eisenberg, Nancy; Lemery-Chalfant, Kathryn; Spinrad, Tracy L.; Silva, Kassondra M.; Eggum, Natalie D.; Betkowski, Jennifer A.; Kupfer, Anne; Smith, Cynthia L.; Gaertner, Bridget; Stover, Daryn A.; Verrelli, Brian C.

    2012-01-01

    The LPR and STin2 polymorphisms of the serotonin transporter gene (SLC6A4) were combined into haplotypes that, together with quality of maternal parenting, were used to predict initial levels and linear change in children’s (N = 138) noncompliance and aggression from age 18–54 months. Quality of mothers’ parenting behavior was observed when children were 18 months old, and nonparental caregivers’ reports of noncompliance and aggression were collected annually from 18 to 54 months of age. Quality of early parenting was negatively related to the slope of noncompliance only for children with the LPR-S/STin2-10 haplotype and to 18-month noncompliance only for children with haplotypes that did not include LPR-S. The findings support the notion that SLC6A4 haplotypes index differential susceptibility to variability in parenting quality, with certain haplotypes showing greater reactivity to both supportive and unsupportive environments. These different genetic backgrounds likely reflect an evolutionary response to variation in the parenting environment. PMID:22059451

  4. Use of an Automatic Problem Generator to Teach Basic Skills in a First Course in Assembly Language.

    ERIC Educational Resources Information Center

    Benander, Alan; And Others

    1989-01-01

    Discussion of the use of computer aided instruction (CAI) and instructional software in college level courses highlights an automatic problem generator, AUTOGEN, that was written for computer science students learning assembly language. Design of the software is explained, and student responses are reported. (nine references) (LRW)

  5. Phylogenetic analysis of mtDNA lineages in South American mummies.

    PubMed

    Monsalve, M V; Cardenas, F; Guhl, F; Delaney, A D; Devine, D V

    1996-07-01

    Some studies of mtDNA propose that contemporary Amerindians have descended from four haplotype groups, each defined by specific sets of polymorphisms. One recent study also found evidence of other potential founder haplotypes. We wanted to determine whether the four haplotypes in modern populations were also present in ancient South American aboriginals. We subjected mtDNA from Colombian mummies (470 to 1849 AD) to PCR amplification and restriction endonuclease analysis. The mtDNA D-loop region was surveyed for sequence variation by restriction analysis and a segment of this region was sequenced for each mummy to characterize the haplotypes. Our mummies exhibited three of the four major characteristic haplotypes of Amerindian populations defined by four markers. With sequence data obtained in the ancient samples and published data on contemporary Amerindians it was possible to infer the origin of these six mummies.

  6. The effect of missing data on linkage disequilibrium mapping and haplotype association analysis in the GAW14 simulated datasets

    PubMed Central

    McCaskie, Pamela A; Carter, Kim W; McCaskie, Simon R; Palmer, Lyle J

    2005-01-01

    We used our newly developed linkage disequilibrium (LD) plotting software, JLIN, to plot linkage disequilibrium between pairs of single-nucleotide polymorphisms (SNPs) for three chromosomes of the Genetic Analysis Workshop 14 Aipotu simulated population to assess the effect of missing data on LD calculations. Our haplotype analysis program, SIMHAP, was used to assess the effect of missing data on haplotype-phenotype association. Genotype data was removed at random, at levels of 1%, 5%, and 10%, and the LD calculations and haplotype association results for these levels of missingness were compared to those for the complete dataset. It was concluded that ignoring individuals with missing data substantially affects the number of regions of LD detected which, in turn, could affect tagging SNPs chosen to generate haplotypes. PMID:16451612
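
    The abstract above does not say how JLIN computes linkage disequilibrium, so the following is only a minimal sketch of the textbook pairwise measures (D, D′ and r²) for two biallelic SNPs. The function name `ld_from_haplotypes`, the 0/1 allele coding, and the choice to drop any haplotype with a missing allele (complete-case deletion) are assumptions made for illustration, not details taken from the study.

```python
def ld_from_haplotypes(haps):
    """Return (D, D_prime, r2) for two biallelic SNPs.

    haps: list of (a, b) tuples, one per phased chromosome, with alleles
    coded 0/1 and None marking a missing call (assumed coding).
    Chromosomes with any missing allele are skipped (complete-case deletion).
    """
    complete = [(a, b) for a, b in haps if a is not None and b is not None]
    n = len(complete)
    if n == 0:
        raise ValueError("no complete haplotypes to analyse")

    p_a = sum(a for a, _ in complete) / n          # frequency of allele 1 at SNP 1
    p_b = sum(b for _, b in complete) / n          # frequency of allele 1 at SNP 2
    p_ab = sum(1 for a, b in complete if a == 1 and b == 1) / n

    d = p_ab - p_a * p_b                           # raw disequilibrium coefficient
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = d / d_max if d_max > 0 else 0.0      # normalised D

    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    r2 = d * d / denom if denom > 0 else 0.0       # squared correlation
    return d, d_prime, r2


# Illustrative data only: four complete chromosomes and one with a missing call.
example = [(1, 1), (1, 1), (0, 0), (0, 0), (None, 1)]
print(ld_from_haplotypes(example))
```

    Rerunning such a function after masking a random 1%, 5% or 10% of calls would mimic, in a very simplified way, the kind of comparison the abstract describes.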

  7. Haplotype Frequency Distribution in Northeastern European Saduria entomon (Crustacea: Isopoda) Populations. A Phylogeographic Approach

    NASA Astrophysics Data System (ADS)

    Sell, Jerzy

    2003-11-01

    The distribution pattern of mtDNA haplotypes in distinct populations of the glacial relict crustacean Saduria entomon was examined to assess phylogeographic relationships among them. Populations from the Baltic, the White Sea and the Barents Sea were screened for mtDNA variation using PCR-based RFLP analysis of a 1150 bp fragment containing part of the CO I and CO II genes. Five mtDNA haplotypes were recorded. An analysis of geographical heterogeneity in haplotype frequency distributions revealed significant differences among populations. The isolated populations of S. entomon have diverged since the retreat of the last glaciation. The geographical pattern of variation is most likely the result of stochastic (founder effect, genetic drift) mechanisms and suggests that the haplotype differentiation observed is probably older than the isolation of the Baltic and Arctic seas.

  8. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Andrew, S.E.; Goldberg, Y.P.; Squitieri, F.

    Huntington disease (HD) is one of 7 disorders now known to be caused by expansion of a trinucleotide repeat. The HD mutation is a polymorphic trinucleotide (CAG) repeat in the 5′ region of a novel gene that expands beyond the normal range of 10-35 repeats in persons destined to develop the disease. Haplotype analysis of other dynamic mutation disorders such as myotonic dystrophy and fragile X has suggested that a rare ancestral expansion event on a normal chromosome is followed by subsequent expansion events, resulting in a pool of chromosomes in the premutation range, which is inherently unstable and prone to further multiple expansion events leading to disease range chromosomes. Haplotype analysis of 67 HD and 84 control chromosomes using 5 polymorphic markers, both intragenic and 5′ to the disease mutation, demonstrates that multiple haplotypes underlie HD. However, 94% of the chromosomes can be grouped under two major haplotypes. These two haplotypes are also present in the normal population. A third major haplotype is seen on 38% of normal chromosomes but rarely on HD chromosomes (6%). CAG lengths on the normal chromosomes with the two haplotypes seen in the HD population are higher than those seen on the normal chromosomes with the haplotype rarely seen on HD chromosomes. Furthermore, in populations with a diminished frequency of HD, CAG length on normal chromosomes is significantly less than in other populations with higher prevalence rates for HD. These data suggest that CAG length on normal chromosomes may be a significant factor contributing to repeat instability that eventually leads to chromosomes with CAG repeat lengths in the HD range. Haplotypes on the HD chromosomes are identical to those on normal chromosomes which have CAG lengths in the high range of normal, suggesting that further expansions of this pool of chromosomes lead to chromosomes with CAG repeat sizes within the disease range, consistent with a multistep model.

  9. Genetic structure of Phytophthora infestans populations in China indicates multiple migration events.

    PubMed

    Guo, Liyun; Zhu, Xiao-Qiong; Hu, Chia-Hui; Ristaino, Jean Beagle

    2010-10-01

    One hundred isolates of Phytophthora infestans collected from 10 provinces in China between 1998 and 2004 were analyzed for mating type, metalaxyl resistance, mitochondrial DNA (mtDNA) haplotype, allozyme genotype, and restriction fragment length polymorphism (RFLP) with the RG-57 probe. In addition, herbarium samples collected in China, Russia, Australia, and other Asian countries were also typed for mtDNA haplotype. The Ia haplotype was found during the first outbreaks of the disease in China (1938 and 1940), Japan (1901, 1930, and 1931), India (1913), Peninsular Malaysia (1950), Nepal (1954), The Philippines (1910), Australia (1917), Russia (1917), and Latvia (1935). In contrast, the Ib haplotype was found after 1950 in China on both potato and tomato (1952, 1954, 1956, and 1982) and in India (1968 and 1974). Another migration of a genotype found in Siberia called SIB-1 (Glucose-6-phosphate isomerase [Gpi] 100/100, Peptidase [Pep] 100/100, IIa mtDNA haplotype) was identified using RFLP fingerprints among 72% of the isolates and was widely distributed in the north and south of China and has also been reported in Japan. A new genotype named CN-11 (Gpi 100/111, Pep 100/100, IIb mtDNA haplotype), found only in the south of China, and two additional genotypes (Gpi 100/100, Pep 100/100, Ia mtDNA haplotype) named CN-9 and CN-10 were identified. There were more diverse genotypes among isolates from Yunnan province than elsewhere. The SIB-1 (IIa) genotype is identical to those from Siberia, suggesting later migration of this genotype from either Russia or Japan into China. The widespread predominance of SIB-1 suggests that this genotype has enhanced fitness compared with other genotypes found. Movement of the pathogen into China via infected seed from several sources most likely accounts for the distribution of pathogen genotypes observed. 
    MtDNA haplotype evidence and RFLP data suggest multiple migrations of the pathogen into China after the initial introduction of the Ia haplotype in the 1930s.

  10. Geographic Patterns of Genetic Variation in a Broadly Distributed Marine Vertebrate: New Insights into Loggerhead Turtle Stock Structure from Expanded Mitochondrial DNA Sequences

    PubMed Central

    Shamblin, Brian M.; Bolten, Alan B.; Abreu-Grobois, F. Alberto; Bjorndal, Karen A.; Cardona, Luis; Carreras, Carlos; Clusa, Marcel; Monzón-Argüello, Catalina; Nairn, Campbell J.; Nielsen, Janne T.; Nel, Ronel; Soares, Luciano S.; Stewart, Kelly R.; Vilaça, Sibelle T.; Türkozan, Oguz; Yilmaz, Can; Dutton, Peter H.

    2014-01-01

    Previous genetic studies have demonstrated that natal homing shapes the stock structure of marine turtle nesting populations. However, widespread sharing of common haplotypes based on short segments of the mitochondrial control region often limits resolution of the demographic connectivity of populations. Recent studies employing longer control region sequences to resolve haplotype sharing have focused on regional assessments of genetic structure and phylogeography. Here we synthesize available control region sequences for loggerhead turtles from the Mediterranean Sea, Atlantic, and western Indian Ocean basins. These data represent six of the nine globally significant regional management units (RMUs) for the species and include novel sequence data from Brazil, Cape Verde, South Africa and Oman. Genetic tests of differentiation among 42 rookeries represented by short sequences (380 bp haplotypes from 3,486 samples) and 40 rookeries represented by long sequences (∼800 bp haplotypes from 3,434 samples) supported the distinction of the six RMUs analyzed as well as recognition of at least 18 demographically independent management units (MUs) with respect to female natal homing. A total of 59 haplotypes were resolved. These haplotypes belonged to two highly divergent global lineages, with haplogroup I represented primarily by CC-A1, CC-A4, and CC-A11 variants and haplogroup II represented by CC-A2 and derived variants. Geographic distribution patterns of haplogroup II haplotypes and the nested position of CC-A11.6 from Oman among the Atlantic haplotypes invoke recent colonization of the Indian Ocean from the Atlantic for both global lineages. The haplotypes we confirmed for western Indian Ocean RMUs allow reinterpretation of previous mixed stock analysis and further suggest that contemporary migratory connectivity between the Indian and Atlantic Oceans occurs on a broader scale than previously hypothesized. 
    This study represents a valuable model for conducting comprehensive international cooperative data management and research in marine ecology. PMID:24465810

  11. Molecular and geographic evolutionary support for the essential role of GIGANTEAa in soybean domestication of flowering time.

    PubMed

    Wang, Yan; Gu, Yongzhe; Gao, Huihui; Qiu, Lijuan; Chang, Ruzhen; Chen, Shouyi; He, Chaoying

    2016-04-12

    Flowering time is a domestication trait of Glycine max and varies in soybeans, yet a gene for flowering time variation has not been associated with soybean domestication. GIGANTEA (GI) is a major gene involved in the control of flowering time in Arabidopsis, although three GI homologs complicate this model in the soybean genome. In the present work, we revealed the geographic evolution of the GIGANTEAa (GIa) haplotypes in G. max (GmGIa) and Glycine soja (GsGIa). Three GIa haplotypes (H1, H2, and H3) were found among cultivated soybeans and their wild relatives, yet an additional 44 diverse haplotypes were observed in wild soybeans. H1 had a premature stop codon in the 10th exon, whereas the other haplotypes encoded full-length GIa protein isoforms. In both wild-type and cultivated soybeans, H2 was present in the Southern region of China, and H3 was restricted to areas near the Northeast region of China. H1 was genetically derived from H2, and it was dominant and widely distributed among cultivated soybeans, whereas in wild populations, the ortholog of this domesticated haplotype H1 was only found in the Yellow River basin with a low frequency. Moreover, this mutated GIa haplotype significantly correlated with early flowering. We further determined that the differences in gene expression of the three GmGIa haplotypes were not correlated to flowering time variations in cultivated soybeans. However, only the truncated GmGIa H1 could partially rescue gi-2 Arabidopsis from delayed flowering in transgenic plants, whereas both GmGIa H2 and H3 haplotypes could significantly repress flowering in transgenic Arabidopsis with a wild-type background. Thus, GmGIa haplotype diversification may have contributed to flowering time adaptation that facilitated the radiation of domesticated soybeans. In light of the evolution of the GIa gene, soybean domestication history for an early flowering phenotype is discussed.

  12. [The haplomatch program for comparing Y-chromosome STR-haplotypes and its application to the analysis of the origin of Don Cossacks].

    PubMed

    Chukhryaeva, M I; Ivanov, I O; Frolova, S A; Koshel, S M; Utevska, O M; Skhalyakho, R A; Agdzhoyan, A T; Bogunov, Yu V; Balanovska, E V; Balanovsky, O P

    2016-05-01

    STR haplotypes of the Y chromosome are widely used as effective genetic markers in studies of human populations and in forensic DNA analysis. The task often arises to compare the spectrum of haplotypes in individuals or entire populations. Performing this task manually is too laborious and thus unrealistic. We propose an algorithm for quantifying similarity between STR haplotypes. This algorithm is suitable for massive analyses of samples. It is implemented in the computer program Haplomatch, which makes it possible to find haplotypes that differ from the target haplotype by 0, 1, 2, 3, or more mutational steps. The program may operate in two modes: comparison of individuals and comparison of populations. Flexibility of the program (the possibility of using any external database), its usability (MS Excel spreadsheets are used), and the capability of being applied to other chromosomes and other species could make this software a new useful tool in population genetics and forensic and genealogical studies. The Haplomatch software is freely available on our website www.genofond.ru. The program is applied to studying the gene pool of Cossacks. Experimental analysis of Y-chromosomal diversity in a representative set (N = 131) of Upper Don Cossacks is performed. Analysis of the STR haplotypes detects genetic proximity of Cossacks to East Slavic populations (in particular, to Southern and Central Russians, as well as to Ukrainians), which confirms the hypothesis of the origin of the Cossacks mainly due to immigration from Russia and Ukraine. Also, a small genetic influence of Turkic-speaking Nogais is found, probably caused by their occurrence in the Don Voisko as part of the Tatar layer. No similarities between haplotype spectra of Cossacks and Caucasus populations are found. This case study demonstrates the effectiveness of the Haplomatch software in analyzing large sets of STR haplotypes.
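
    The abstract does not spell out how Haplomatch counts mutational steps, so the sketch below is an assumption rather than the program's actual algorithm: it treats each STR haplotype as a mapping from marker name to repeat count and takes the number of steps as the sum of absolute repeat-count differences over shared markers (a simple stepwise-mutation distance). The marker values, the sample identifiers, and the function names `mutational_steps` and `find_matches` are illustrative only.

```python
def mutational_steps(hap_a, hap_b):
    """Sum of absolute repeat-count differences over markers typed in both haplotypes."""
    shared = hap_a.keys() & hap_b.keys()
    return sum(abs(hap_a[locus] - hap_b[locus]) for locus in shared)


def find_matches(target, database, max_steps):
    """Return (sample_id, steps) pairs within max_steps of the target, nearest first."""
    hits = []
    for sample_id, hap in database.items():
        steps = mutational_steps(target, hap)
        if steps <= max_steps:
            hits.append((sample_id, steps))
    return sorted(hits, key=lambda hit: (hit[1], hit[0]))


# Made-up repeat counts for three Y-STR markers (not data from the study).
target = {"DYS19": 14, "DYS390": 24, "DYS391": 10}
database = {
    "s1": {"DYS19": 14, "DYS390": 24, "DYS391": 10},
    "s2": {"DYS19": 15, "DYS390": 24, "DYS391": 10},
    "s3": {"DYS19": 16, "DYS390": 22, "DYS391": 10},
}
print(find_matches(target, database, max_steps=1))
```

    Comparing populations, the second mode mentioned in the abstract, could be layered on top by running `find_matches` for every haplotype of one population against the other, though how Haplomatch itself aggregates such results is not stated.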

  13. Autosomal Dominant Retinal Dystrophies Caused by a Founder Splice Site Mutation, c.828+3A>T, in PRPH2 and Protein Haplotypes in trans as Modifiers

    PubMed Central

    Shankar, Suma P.; Hughbanks-Wheaton, Dianna K.; Birch, David G.; Sullivan, Lori S.; Conneely, Karen N.; Bowne, Sara J.; Stone, Edwin M.; Daiger, Stephen P.

    2016-01-01

    Purpose We determined the phenotypic variation, disease progression, and potential modifiers of autosomal dominant retinal dystrophies caused by a splice site founder mutation, c.828+3A>T, in the PRPH2 gene. Methods A total of 62 individuals (19 families) harboring the PRPH2 c.828+3A>T mutation, had phenotype analysis by fundus appearance, electrophysiology, and visual fields. The PRPH2 haplotypes in trans were sequenced for potential modifying variants and generalized estimating equations (GEE) used for statistical analysis. Results Several distinct phenotypes caused by the PRPH2 c.828+3A>T mutation were observed and fell into two clinical categories: Group I (N = 44) with mild pattern dystrophies (PD) and Group II (N = 18) with more severe cone-rod dystrophy (CRD), retinitis pigmentosa (RP), and central areolar chorioretinal dystrophy (CACD). The PRPH2 Gln304-Lys310-Asp338 protein haplotype in trans was found in Group I only (29.6% vs. 0%), whereas the Glu304-Lys310-Gly338 haplotype was predominant in Group II (94.4% vs. 70.4%). Generalized estimating equations analysis for PD versus the CRD/CACD/RP phenotypes in individuals over 43 years alone with the PRPH2 haplotypes in trans and age as predictors, adjusted for correlation within families, confirmed a significant effect of haplotype on severity (P = 0.03) with an estimated odds ratio of 7.16 (95% confidence interval [CI] = [2.8, 18.4]). Conclusions The PRPH2 c.828+3A>T mutation results in multiple distinct phenotypes likely modified by protein haplotypes in trans; the odds of having the CACD/RP-like phenotype (versus the PD phenotype) are 7.16 times greater with a Glu304-Lys310-Gly338 haplotype in trans. Further functional studies of the modifying haplotypes in trans and PRPH2 splice variants may offer therapeutic targets. PMID:26842753

  14. Genetic variants in a haplotype block spanning IDE are significantly associated with plasma Abeta42 levels and risk for Alzheimer disease.

    PubMed

    Ertekin-Taner, Nilüfer; Allen, Mariet; Fadale, Daniel; Scanlin, Leah; Younkin, Linda; Petersen, Ronald C; Graff-Radford, Neill; Younkin, Steven G

    2004-04-01

    Risk for late onset Alzheimer disease (LOAD) and plasma amyloid beta levels (Abeta42; encoded by APP), an intermediate phenotype for LOAD, show linkage to chromosome 10q. Several strong candidate genes (VR22, PLAU, IDE) lie within the 1-lod support interval for linkage. Others have independently identified haplotypes in the chromosome 10q region harboring IDE that show highly significant association with intermediate AD phenotypes and with risk for AD. To pursue these associations, we analyzed the same haplotypes for association with plasma Abeta42 in 24 extended LOAD families and for association with LOAD in two independent case-control series. One series (MCR, 188 age-matched case-control pairs) did not show association (p=0.64) with the six haplotypes in the 276-kb region spanning three genes (IDE, KNSL1, and HHEX) previously shown to associate with LOAD. The other series (MCJ, 109 age-matched case-control pairs) showed significant (p=0.003) association with these haplotypes. In the MCJ series, the H4 (odds ratio [OR]=5.1, p=0.003) and H2(H7) haplotypes (OR=0.60, p=0.04) had the same effects previously reported. In this series, the H8 haplotype (OR=2.7, p=0.098) also had an effect similar to that seen in one previous case-control series but not in others. In the extended families, the H8 haplotype was associated with significantly elevated plasma Abeta42 (p=0.02). In addition, the H5(H10) haplotype, which is associated with reduced risk for AD in the other study, is associated with reduced plasma Abeta42 (p=0.007) in our family series. These results provide strong evidence for pathogenic variant(s) in the 276-kb region harboring IDE that influence intermediate AD phenotypes and risk for AD. Copyright 2004 Wiley-Liss, Inc.

  15. Congruence as a measurement of extended haplotype structure across the genome

    PubMed Central

    2012-01-01

    Background Historically, extended haplotypes have been defined using only a few data points, such as alleles for several HLA genes in the MHC. High-density SNP data, and the increasing affordability of whole genome SNP typing, creates the opportunity to define higher resolution extended haplotypes. This drives the need for new tools that support quantification and visualization of extended haplotypes as defined by as many as 2000 SNPs. Confronted with high-density SNP data across the major histocompatibility complex (MHC) for 2,300 complete families, compiled by the Type 1 Diabetes Genetics Consortium (T1DGC), we developed software for studying extended haplotypes. Methods The software, called ExHap (Extended Haplotype), uses a similarity measurement we term congruence to identify and quantify long-range allele identity. Using ExHap, we analyzed congruence in both the T1DGC data and family-phased data from the International HapMap Project. Results Congruent chromosomes from the T1DGC data have between 96.5% and 99.9% allele identity over 1,818 SNPs spanning 2.64 megabases of the MHC (HLA-DRB1 to HLA-A). Thirty-three of 132 DQ-DR-B-A defined haplotype groups have > 50% congruent chromosomes in this region. For example, 92% of chromosomes within the DR3-B8-A1 haplotype are congruent from HLA-DRB1 to HLA-A (99.8% allele identity). We also applied ExHap to all 22 autosomes for both CEU and YRI cohorts from the International HapMap Project, identifying multiple candidate extended haplotypes. Conclusions Long-range congruence is not unique to the MHC region. Patterns of allele identity on phased chromosomes provide a simple, straightforward approach to visually and quantitatively inspect complex long-range structural patterns in the genome. Such patterns aid the biologist in appreciating genetic similarities and differences across cohorts, and can lead to hypothesis generation for subsequent studies. PMID:22369243
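
    The abstract describes congruence in terms of long-range allele identity between phased chromosomes but does not give ExHap's exact definition. As a minimal sketch under that limitation, the function below computes the fraction of SNP positions at which two phased chromosomes carry the same allele, skipping positions where either call is missing. The name `allele_identity`, the string encoding of chromosomes, and the `"N"` missing-call symbol are assumptions made for illustration.

```python
def allele_identity(chrom_a, chrom_b, missing="N"):
    """Fraction of jointly typed SNP positions at which two phased chromosomes agree.

    chrom_a, chrom_b: equal-length sequences of allele symbols, one per SNP.
    Positions where either chromosome has the `missing` symbol are ignored.
    """
    if len(chrom_a) != len(chrom_b):
        raise ValueError("chromosomes must cover the same SNP positions")

    compared = 0
    matches = 0
    for a, b in zip(chrom_a, chrom_b):
        if a == missing or b == missing:
            continue
        compared += 1
        if a == b:
            matches += 1
    return matches / compared if compared else 0.0


# Toy example with four SNPs; real comparisons in the abstract span 1,818 SNPs.
print(allele_identity("ACGT", "ACGA"))
```

    A pair of chromosomes could then be called congruent when this fraction exceeds a chosen cut-off; the abstract reports identities of 96.5% to 99.9% for congruent chromosomes but does not state the threshold used, so any cut-off would be the analyst's choice.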

  16. Evidence of triple mutant Pfdhps ISGNGA haplotype in Plasmodium falciparum isolates from North-east India: An analysis of sulfadoxine resistant haplotype selection.

    PubMed

    Das, Manuj K; Chetry, Sumi; Kalita, Mohan C; Dutta, Prafulla

    2016-12-01

    The North-east region of India has a consistent role in the spread of multidrug-resistant Plasmodium (P.) falciparum to other parts of Southeast Asia. After rapid clinical treatment failure of Artemisinin based combination therapy-Sulphadoxine/Pyrimethamine (ACT-SP) chemoprophylaxis, Artemether-Lumefantrine (ACT-AL) combination therapy was introduced in the year 2012 in this region for the treatment of uncomplicated P. falciparum malaria. In a DNA sequencing based polymorphism analysis, seven codons of the P. falciparum dihydropteroate synthetase (Pfdhps) gene were screened in a total of 127 P. falciparum isolates collected from Assam, Arunachal Pradesh and Tripura of North-east India during the years 2014 and 2015 to document current sulfadoxine resistant haplotypes. Sequences were analyzed to rearrange both nucleotide and protein haplotypes. Molecular diversity indices were analyzed in DNA Sequence Polymorphism software (DnaSP) on the basis of Pfdhps gene sequences. Departure from selective neutrality was assessed based on the ratio of nonsynonymous to synonymous nucleotide substitutions [dN/dS ratio]. Moreover, a two-tailed Z test was performed to assess the significance of rejecting the null hypothesis of strict neutrality [dN = dS]. Presence of mutant P. falciparum multidrug resistance protein 1 (Pfmdr1) was also checked in those isolates that carried new Pfdhps haplotypes. Phylogenetic relationship based on the Pfdhps gene was reconstructed in Molecular Evolutionary Genetics Analysis (MEGA). Among eight different sulfadoxine resistant haplotypes found, the ISGNGA haplotype was documented in a total of five isolates from Tripura in association with a new mutant M538R allele. Sequence analysis of the Pfmdr1 gene in these five isolates showed that only one isolate was mutant at codon 86 (N86Y; YYSND) in the multidrug resistance protein. Molecular diversity based on Pfdhps haplotypes revealed that P. falciparum populations in Assam and Tripura were under balancing selection for sulfadoxine resistant haplotypes but the population from Arunachal Pradesh was under positive selection with comparatively high haplotype diversity (h = 0.870). In reconstructed phylogenetic analysis, isolates having the ISGNGA haplotype were grouped into two separate sub-clusters from the other isolates based on their genetic distances and diversities. This study suggests that sulfadoxine resistant isolates are still migrating from their epicenter to the other parts of Southeast Asia and hence control and elimination of the drug resistant isolates have been impeded. Moreover, P. falciparum populations in different areas may undergo selection of particular sulfadoxine resistant haplotypes either in the presence of drug or after its removal to maintain their plasticity.

  17. The mathematics of virus shell assembly. Progress report 1995--1996

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Berger, B.

    1996-08-01

    This research focuses on applying computational and mathematical techniques to problems in biology, and more specifically to problems in protein folding. Significant progress has been made in the following areas relating to virus shell assembly: the local rules theory has been further developed; development has begun on a second-generation simulator which provides a more physically realistic model of assembly; collaborative efforts have continued with an experimental biologist to verify and inspire the local rules theory; an investigation has been initiated into the mechanics of virus shell assembly; laboratory experiments have been conducted on bacteriophage T4 which verify that the previously believed structure for the core may be incorrect.

  18. Mitochondrial DNA analyses reveal low genetic diversity in Culex quinquefasciatus from residential areas in Malaysia.

    PubMed

    Low, V L; Lim, P E; Chen, C D; Lim, Y A L; Tan, T K; Norma-Rashid, Y; Lee, H L; Sofian-Azirun, M

    2014-06-01

    The present study explored the intraspecific genetic diversity, dispersal patterns and phylogeographic relationships of Culex quinquefasciatus Say (Diptera: Culicidae) in Malaysia using reference data available in GenBank in order to reveal this species' phylogenetic relationships. A statistical parsimony network of 70 taxa aligned as 624 characters of the cytochrome c oxidase subunit I (COI) gene and 685 characters of the cytochrome c oxidase subunit II (COII) gene revealed three haplotypes (A1-A3) and four haplotypes (B1-B4), respectively. The concatenated sequences of both COI and COII genes with a total of 1309 characters revealed seven haplotypes (AB1-AB7). Analysis using tcs indicated that haplotype AB1 was the common ancestor and the most widespread haplotype in Malaysia. The genetic distance based on concatenated sequences of both COI and COII genes ranged from 0.00076 to 0.00229. Sequence alignment of Cx. quinquefasciatus from Malaysia and other countries revealed four haplotypes (AA1-AA4) by the COI gene and nine haplotypes (BB1-BB9) by the COII gene. Phylogenetic analyses demonstrated that Malaysian Cx. quinquefasciatus share the same genetic lineage as East African and Asian Cx. quinquefasciatus. This study has inferred the genetic lineages, dispersal patterns and hypothetical ancestral genotypes of Cx. quinquefasciatus. © 2013 The Royal Entomological Society.

  19. Interrelationships between Amerindian tribes of lower Amazonia as manifest by HLA haplotype disequilibria.

    PubMed

    Black, F L

    1984-11-01

    HLA B-C haplotypes exhibit common disequilibria in populations drawn from four continents, indicating that they are subject to broadly active selective forces. However, the A-B and A-C associations we have examined show no consistent disequilibrium pattern, leaving open the possibility that these disequilibria are due to descent from common progenitors. By examining HLA haplotype distributions, I have explored the implications that would follow from the hypothesis that biological selection played no role in determining A-C disequilibria in 10 diverse tribes of the lower Amazon Basin. Certain haplotypes are in strong positive disequilibria across a broad geographic area, suggesting that members of diverse tribes descend from common ancestors. On the basis of the extent of diffusion of the components of these haplotypes, one can estimate that the progenitors lived less than 6,000 years ago. One widely encountered lineage entered the area within the last 1,200 years. When haplotype frequencies are used in genetic distance measurements, they give a pattern of relationships very similar to that obtained by conventional chord measurements based on several genetic markers; but more than that, when individual haplotype disequilibria in the several tribes are compared, multiple origins of a single tribe are discernible and relationships are revealed that correlate more closely to geographic and linguistic patterns than do the genetic distance measurements.

  20. The interactive effects of child maltreatment and the FK506 binding protein 5 gene (FKBP5) on dissociative symptoms in adolescence.

    PubMed

    Yaylaci, Fatima Tuba; Cicchetti, Dante; Rogosch, Fred A; Bulut, Okan; Hetzel, Susan R

    2017-08-01

    The FK506 binding protein 5 gene (FKBP5) has been associated with susceptibility to pathogenic effects of childhood trauma including dissociative symptoms. This study examines the impact of maltreatment on dissociative tendencies in adolescence as moderated by the FKBP5 gene. Dissociative symptoms and variation within FKBP5 were assessed in a high-risk, low socioeconomic status community sample of 279 maltreated and 171 nonmaltreated adolescents. Following the assignment of haplotypes across four single nucleotide polymorphisms (rs3800373, rs9296158, rs1360780, and rs9470080), individuals with one or more copies of the CATT haplotype (N = 230) were grouped together and compared to individuals with zero copies of this haplotype (N = 185). Analyses of covariance were conducted to test hypotheses regarding the effects of developmental timing and the chronicity of maltreatment and the CATT haplotype. We found a significant interactive effect of timing/chronicity of maltreatment and the CATT haplotype on dissociative symptoms. Among adolescents who had no copies of the CATT haplotype, dissociative symptoms were higher for chronically maltreated adolescents who had an infancy onset compared to those who were not maltreated or whose maltreatment experience was either relatively less chronic or not started in infancy. The groups did not differ significantly among subjects who carry one or more copies of the CATT haplotype.

  1. Longitudinal analysis of haplotypes and polymorphisms of the APOA5 and APOC3 genes associated with variation in serum triglyceride levels: the Bogalusa Heart Study.

    PubMed

    Hallman, D Michael; Srinivasan, Sathanur R; Chen, Wei; Boerwinkle, Eric; Berenson, Gerald S

    2006-12-01

    Polymorphisms in the APOC3 and APOA5 genes, from the APOA1/APOC3/APOA4/APOA5 gene cluster on chromosome 11q23, have been associated with interindividual variation in plasma triglycerides. APOA5 polymorphisms implicated include 2 in the promoter region (-1131 T/C and -3 A/G) and 1 in exon 2 (+56 C/G). APOC3 polymorphisms implicated include 1 (SstI) in the 3' untranslated region and 1 (-2854 G/T) in the APOC3-APOA4 intergenic region. We analyzed the associations of haplotypes and multilocus genotypes of these polymorphisms on longitudinal serum triglyceride profiles in 360 African American and 823 white subjects from the Bogalusa Heart Study. Subjects were examined from 2 to 8 times (mean +/- SD, 5.4 +/- 1.3) between 1973 and 1996, at ages ranging from 4 to 38 years, with 1978 observations in African Americans and 4465 in whites. Serum triglycerides were significantly higher among whites across all ages. Allele frequencies differed significantly between African Americans and whites at all but the APOA5 +56 C/G locus. Linkage disequilibrium among the loci was higher in whites and haplotype diversity lower: 6 haplotypes had estimated frequencies of more than 1% in African Americans, 5 in whites. Individually, all polymorphisms except APOC3 -2854 G/T showed significant associations with triglyceride levels in the full sample. However, genotype models including all 5 loci showed significant triglyceride associations for only 3 (APOC3 SstI, APOA5 -1131 T/C, and APOA5 +56 C/G); significant interactions among them indicated their effects were not independent. Neither APOC3 -2854 G/T nor APOA5 -3 A/G had significant effects when the other 3 loci were in the models. The EM algorithm was used to estimate haplotype frequencies and assign haplotype probabilities to individuals, which is conditional on their genotypes; individuals' haplotype probability vectors were then used as predictors in multilevel mixed models of longitudinal triglyceride profiles. 
Of haplotypes comprising, in order, APOC3 SstI and -2854 G/T and APOA5 -1131 T/C, -3 A/G, and +56 C/G, 3 were significantly associated with higher triglycerides, even after adjusting for multiple tests: GGTAG (P = .002), GTTAG (P < .0001), and CGCGC (P = .0002). Each GGTAG haplotype carried would be expected to raise triglyceride levels (relative to those of GTTAC homozygotes) by approximately 19 mg/dL, each GTTAG haplotype by approximately 15 mg/dL, and each CGCGC haplotype by approximately 7 mg/dL. Haplotypes comprising the 3 loci implicated by genotype analyses (SstI, -1131 T/C, and +56 C/G) were also tested: haplotypes C_C_C and G_T_G significantly raised triglycerides, even after adjustment for multiple comparisons (P < .002 for both), with each copy of C_C_C expected to raise triglycerides by approximately 7 mg/dL and each copy of G_T_G by approximately 15 mg/dL. Overall, our findings support those of others in associating specific polymorphisms and haplotypes in the APOA1/C3/A4/A5 gene cluster with higher serum triglyceride levels. However, the degree to which polymorphisms in the APOC3 and APOA5 genes may be independently associated with triglyceride levels remains to be determined.
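
    The EM step mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes only two biallelic loci (the study used five), genotypes coded as copies of allele "1" per locus, and the hypothetical function name em_two_locus. With two loci, only double heterozygotes have ambiguous phase, so the E-step splits them between the two possible haplotype pairs in proportion to the current frequency estimates.

```python
from collections import Counter

HAPS = ["00", "01", "10", "11"]  # allele at locus 1, allele at locus 2


def em_two_locus(genotypes, n_iter=200, tol=1e-10):
    """Estimate two-locus haplotype frequencies from unphased genotypes.

    genotypes: list of (g1, g2), each the number of copies (0, 1 or 2)
    of allele '1' at that locus. Illustrative sketch only.
    """
    counts = Counter(genotypes)
    n_chrom = 2 * len(genotypes)
    freq = {h: 0.25 for h in HAPS}  # uniform starting point
    for _ in range(n_iter):
        expected = {h: 0.0 for h in HAPS}
        for (g1, g2), n in counts.items():
            if (g1, g2) == (1, 1):
                # E-step: phase is ambiguous, 00/11 versus 01/10
                cis = freq["00"] * freq["11"]
                trans = freq["01"] * freq["10"]
                total = cis + trans
                w_cis = cis / total if total > 0 else 0.5
                expected["00"] += n * w_cis
                expected["11"] += n * w_cis
                expected["01"] += n * (1 - w_cis)
                expected["10"] += n * (1 - w_cis)
            else:
                # Phase is determined: at least one locus is homozygous
                a1 = [0] * (2 - g1) + [1] * g1
                a2 = [0] * (2 - g2) + [1] * g2
                for x, y in zip(a1, a2):
                    expected[f"{x}{y}"] += n
        # M-step: expected haplotype counts become the new frequencies
        new_freq = {h: expected[h] / n_chrom for h in HAPS}
        converged = max(abs(new_freq[h] - freq[h]) for h in HAPS) < tol
        freq = new_freq
        if converged:
            break
    return freq
```

    The weight w_cis computed in the E-step is the kind of per-individual haplotype probability, conditional on genotype, that the abstract describes feeding into downstream models.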

  2. Effect of malaria transmission reduction by insecticide-treated bed nets (ITNs) on the genetic diversity of Plasmodium falciparum merozoite surface protein (MSP-1) and circumsporozoite (CSP) in western Kenya.

    PubMed

    Kariuki, Simon K; Njunge, James; Muia, Ann; Muluvi, Geofrey; Gatei, Wangeci; Ter Kuile, Feiko; Terlouw, Dianne J; Hawley, William A; Phillips-Howard, Penelope A; Nahlen, Bernard L; Lindblade, Kim A; Hamel, Mary J; Slutsker, Laurence; Shi, Ya Ping

    2013-08-27

    Although several studies have investigated the impact of reduced malaria transmission due to insecticide-treated bed nets (ITNs) on the patterns of morbidity and mortality, there is limited information on their effect on parasite diversity. Sequencing was used to investigate the effect of ITNs on polymorphisms in two genes encoding leading Plasmodium falciparum vaccine candidate antigens, the 19 kilodalton blood stage merozoite surface protein-1 (MSP-1(19kDa)) and the Th2R and Th3R T-cell epitopes of the pre-erythrocytic stage circumsporozoite protein (CSP) in a large community-based ITN trial site in western Kenya. The number and frequency of haplotypes as well as nucleotide and haplotype diversity were compared among parasites obtained from children <5 years old prior to the introduction of ITNs (1996) and after 5 years of high coverage ITN use (2001). A total of 12 MSP-1(19kDa) haplotypes were detected in 1996 and 2001. The Q-KSNG-L and E-KSNG-L haplotypes corresponding to the FVO and FUP strains of P. falciparum were the most prevalent (range 32-37%), with an overall haplotype diversity of > 0.7. No MSP-1(19kDa) 3D7 sequence-types were detected in 1996 and the frequency was less than 4% in 2001. The CSP Th2R and Th3R domains were highly polymorphic with a total of 26 and 14 haplotypes, respectively, detected in 1996 and 34 and 13 haplotypes in 2001, with an overall haplotype diversity of > 0.9 and 0.75, respectively. The frequency of the most predominant Th2R and Th3R haplotypes was 14 and 36%, respectively. The frequency of Th2R and Th3R haplotypes corresponding to the 3D7 parasite strain was less than 4% at both time points. There was no significant difference in nucleotide and haplotype diversity in parasite isolates collected at both time points. High diversity in these two genes has been maintained over time despite marked reductions in malaria transmission due to ITN use. The frequency of 3D7 sequence-types was very low in this area.
These findings provide information that could be useful in the design of future malaria vaccines for deployment in endemic areas with high ITN coverage and in interpretation of efficacy data for malaria vaccines based on 3D7 parasite strains.

  3. Mineralocorticoid receptor haplotype, estradiol, progesterone and emotional information processing.

    PubMed

    Hamstra, Danielle A; de Kloet, E Ronald; Quataert, Ina; Jansen, Myrthe; Van der Does, Willem

    2017-02-01

    Carriers of MR-haplotype 1 and 3 (GA/CG; rs5522 and rs2070951) are more sensitive to the influence of oral contraceptives (OC) and menstrual cycle phase on emotional information processing than MR-haplotype 2 (CA) carriers. We investigated whether this effect is associated with estradiol (E2) and/or progesterone (P4) levels. Healthy MR-genotyped premenopausal women were tested twice in a counterbalanced design. Naturally cycling (NC) women were tested in the early-follicular and mid-luteal phase and OC-users during OC-intake and in the pill-free week. At both sessions E2 and P4 were assessed in saliva. Tests included implicit and explicit positive and negative affect, attentional blink accuracy, emotional memory, emotion recognition, and risky decision-making (gambling). MR-haplotype 2 homozygotes had higher implicit happiness scores than MR-haplotype 2 heterozygotes (p=0.031) and MR-haplotype 1/3 carriers (p<0.001). MR-haplotype 2 homozygotes also had longer reaction times to happy faces in an emotion recognition test than MR-haplotype 1/3 (p=0.001). Practice effects were observed for most measures. The pattern of correlations between information processing and P4 or E2 differed between sessions, as well as the moderating effects of the MR genotype. In the first session the MR-genotype moderated the influence of P4 on implicit anxiety (sr=-0.30; p=0.005): higher P4 was associated with reduction in implicit anxiety, but only in MR-haplotype 2 homozygotes (sr=-0.61; p=0.012). In the second session the MR-genotype moderated the influence of E2 on the recognition of facial expressions of happiness (sr=-0.21; p=0.035): only in MR-haplotype 1/3 higher E2 was correlated with happiness recognition (sr=0.29; p=0.005). In the second session higher E2 and P4 were negatively correlated with accuracy in lag2 trials of the attentional blink task (p<0.001). Thus NC women, compared to OC-users, performed worse on lag 2 trials (p=0.041). 
The higher implicit happiness scores of MR-haplotype 2 homozygotes are in line with previous reports. Performance in the attentional blink task may be influenced by OC-use. The MR-genotype moderates the influence of E2 and P4 on emotional information processing. This moderating effect may depend on the novelty of the situation. Copyright © 2016 Elsevier Ltd. All rights reserved.

  4. Assignment of the SLA alleles and reproductive potential of selective breeding Duroc pig lines.

    PubMed

    Soe, Ok Kar; Ohba, Yasunori; Imaeda, Noriaki; Nishii, Naohito; Takasu, Masaki; Yoshioka, Gou; Kawata, Hisako; Shigenari, Atsuko; Uenishi, Hirohide; Inoko, Hidetoshi; Ando, Asako; Kitagawa, Hitoshi

    2008-01-01

    Pigs with defined swine leukocyte antigen (SLA) haplotypes and their detailed information are useful for transplantation and immunological studies. We developed two herds of SLA homozygous Duroc pigs with novel SLA haplotypes and characterized their reproductive potential. For selective inbreeding, a pair of Duroc pigs was chosen as initial breeders, and substantial breeding within progenies was carried out for eight generations. In the selective breeding Duroc pigs, SLA haplotypes were assigned by nucleotide sequence determination of reverse transcription polymerase chain reaction (RT-PCR) products of three SLA classical class I genes and two class II genes. Based on this sequence information, we developed a rapid and simple SLA class II DNA typing method by polymerase chain reaction-sequence specific primer (PCR-SSP) technique. As a complementary method for the characterization of the SLA haplotypes, genetic polymorphisms of 36 microsatellite (MS) markers within the SLA region were also analyzed in the selective breeding pigs with SLA homozygous/heterozygous haplotypes. Among the selective breeding pigs from the third to fifth generations, only two SLA haplotypes were identified by the RT-PCR based SLA typing method: Hp-27.30 (SLA-1*08an03, SLA-1*06an04, SLA-2*0102, SLA-3*0101, DRB1*1101 and DQB1*0503) and Hp-60.13 (SLA-1*an02, SLA-2*1002, SLA-3*0502, DRB1*0403 and DQB1*0303). In these two SLA haplotypes, two class I haplotypes, Hp-27.0 and Hp-60.0, are novel. Furthermore, two class II haplotypes, Hp-0.30 and Hp-0.13, which were previously reported in Korean native pigs and pigs of Hanford breed, respectively, were also assigned by a simple assay using a PCR-SSP technique in the entire selective breeding stock. Moreover, two haplotype specific MS patterns were observed across the entire SLA region in the selective breeding (homozygous/heterozygous) pigs. No morphological abnormalities were observed in selective breeding pigs.
    The theoretical inbreeding coefficient at the eighth generation was 78.5%. In all generations of selective breeding pigs, litter sizes were comparable, but progenies from the fifth to eighth generations had significantly lighter weaning weights (P < 0.01) than those of the non-selective breeding pigs. We established and characterized SLA homozygous Duroc herds with two kinds of haplotypes that can be used as a new resource for transplantation and other biomedical studies.
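
    The abstract does not state the mating scheme behind the 78.5% figure. As a hedged illustration only, the sketch below assumes strict full-sib mating descended from one unrelated, non-inbred founder pair, with the founders' progeny counted as generation 1, and applies the standard full-sib recurrence F_t = (1 + 2 F_(t-1) + F_(t-2)) / 4. The function name full_sib_inbreeding is hypothetical.

```python
def full_sib_inbreeding(generations):
    """Return [F_0, ..., F_generations] under repeated full-sib mating.

    Assumptions (not stated in the abstract): the founders (generation 0)
    are unrelated and non-inbred, so their progeny (generation 1) also
    have F = 0; every later generation comes from a full-sib mating.
    """
    f = [0.0, 0.0]  # F_0 (founders) and F_1 (their progeny)
    for t in range(2, generations + 1):
        # Standard recurrence for the inbreeding coefficient of full-sib lines
        f.append((1 + 2 * f[t - 1] + f[t - 2]) / 4)
    return f[: generations + 1]


if __name__ == "__main__":
    for gen, value in enumerate(full_sib_inbreeding(8)):
        print(f"generation {gen}: F = {value:.3f}")
```

    Under these assumptions the value at generation 8 is about 0.785, which is consistent with the reported theoretical coefficient, although the actual pedigree may have differed.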

  5. Development of an Italian RM Y-STR haplotype database: Results of the 2013 GEFI collaborative exercise.

    PubMed

    Robino, C; Ralf, A; Pasino, S; De Marchi, M R; Ballantyne, K N; Barbaro, A; Bini, C; Carnevali, E; Casarino, L; Di Gaetano, C; Fabbri, M; Ferri, G; Giardina, E; Gonzalez, A; Matullo, G; Nutini, A L; Onofri, V; Piccinini, A; Piglionica, M; Ponzano, E; Previderè, C; Resta, N; Scarnicci, F; Seidita, G; Sorçaburu-Cigliero, S; Turrina, S; Verzeletti, A; Kayser, M

    2015-03-01

    Recently introduced rapidly mutating Y-chromosomal short tandem repeat (RM Y-STR) loci, displaying a multiple-fold higher mutation rate relative to any other Y-STRs, including those conventionally used in forensic casework, have been demonstrated to improve the resolution of male lineage differentiation and to allow male relative separation usually impossible with standard Y-STRs. However, large and geographically-detailed frequency haplotype databases are required to estimate the statistical weight of RM Y-STR haplotype matches if observed in forensic casework. With this in mind, the Italian Working Group (GEFI) of the International Society for Forensic Genetics launched a collaborative exercise aimed at generating an Italian quality controlled forensic RM Y-STR haplotype database. Overall 1509 male individuals from 13 regional populations covering northern, central and southern areas of the Italian peninsula plus Sicily were collected, including both "rural" and "urban" samples classified according to population density in the sampling area. A subset of individuals was additionally genotyped for Y-STR loci included in the Yfiler and PowerPlex Y23 (PPY23) systems (75% and 62%, respectively), allowing the comparison of RM and conventional Y-STRs. Considering the whole set of 13 RM Y-STRs, 1501 unique haplotypes were observed among the 1509 sampled Italian men with a haplotype diversity of 0.999996, largely superior to Yfiler and PPY23 with 0.999914 and 0.999950, respectively. AMOVA indicated that 99.996% of the haplotype variation was within populations, confirming that genetic-geographic structure is almost undetected by RM Y-STRs. Haplotype sharing among regional Italian populations was not observed at all with the complete set of 13 RM Y-STRs. Haplotype sharing within Italian populations was very rare (0.27% non-unique haplotypes), and lower in urban (0.22%) than rural (0.29%) areas. 
Additionally, 422 father-son pairs were investigated, and 20.1% of them could be discriminated by the whole set of 13 RM Y-STRs, which was very close to the theoretically expected estimate of 19.5% given the mutation rates of the markers used. Results obtained from a high-coverage Italian haplotype dataset confirm on the regional scale the exceptional ability of RM Y-STRs to resolve male lineages previously observed globally, and attest the unsurpassed value of RM Y-STRs for male-relative differentiation purposes. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
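
    Haplotype diversity figures such as those quoted in this abstract are commonly computed with Nei's unbiased estimator, h = n / (n - 1) * (1 - sum of squared haplotype frequencies). The abstract does not say which estimator was used, so the sketch below is an assumption, and the haplotype labels in the usage note are made up for illustration.

```python
from collections import Counter


def haplotype_diversity(haplotypes):
    """Nei's unbiased haplotype diversity for a list of observed haplotypes.

    haplotypes: one hashable label per sampled individual, for example a
    tuple of Y-STR allele calls. Requires at least two individuals.
    """
    n = len(haplotypes)
    if n < 2:
        raise ValueError("need at least two sampled haplotypes")
    counts = Counter(haplotypes)
    # Sum of squared relative frequencies of each distinct haplotype
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1 - sum_sq)
```

    For a toy sample such as ["A", "A", "B", "C"] the estimator gives 5/6; it approaches 1 when nearly every sampled individual carries a distinct haplotype, as reported here for the 13 RM Y-STRs.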

  6. Quality data collection and management technology of aerospace complex product assembly process

    NASA Astrophysics Data System (ADS)

    Weng, Gang; Liu, Jianhua; He, Yongxi; Zhuang, Cunbo

    2017-04-01

    To address the difficult management and poor traceability of quality data in discrete assembly processes, a data collection and management method is proposed that takes the assembly process and the BOM as its core. The method comprises a data collection method based on workflow technology, a data model based on the BOM, and quality traceability of the assembly process. Finally, an assembly process quality data management system is developed, achieving effective control and management of quality information for the assembly process of complex products.

  7. Meta-analysis of haplotype-association studies: comparison of methods and empirical evaluation of the literature

    PubMed Central

    2011-01-01

    Background Meta-analysis is a popular methodology in several fields of medical research, including genetic association studies. However, the methods used for meta-analysis of association studies that report haplotypes have not been studied in detail. In this work, methods for performing meta-analysis of haplotype association studies are summarized, compared and presented in a unified framework along with an empirical evaluation of the literature. Results We present multivariate methods that use summary-based data as well as methods that use binary and count data in a generalized linear mixed model framework (logistic regression, multinomial regression and Poisson regression). The methods presented here avoid the inflation of the type I error rate that could be the result of the traditional approach of comparing a haplotype against the remaining ones, while still being fitted using standard software. Moreover, formal global tests are presented for assessing the statistical significance of the overall association. Although the methods presented here assume that the haplotypes are directly observed, they can be easily extended to allow for such an uncertainty by weighting the haplotypes by their probability. Conclusions An empirical evaluation of the published literature and a comparison against the meta-analyses that use single nucleotide polymorphisms suggest that the studies reporting meta-analysis of haplotypes contain approximately half of the included studies and produce significant results twice as often. We show that this excess of statistically significant results stems from the sub-optimal method of analysis used and, in approximately half of the cases, the statistical significance is refuted if the data are properly re-analyzed. Illustrative examples of code are given in Stata and it is anticipated that the methods developed in this work will be widely applied in the meta-analysis of haplotype association studies. PMID:21247440

  8. Haplotype-Based Genome-Wide Prediction Models Exploit Local Epistatic Interactions Among Markers

    PubMed Central

    Jiang, Yong; Schmidt, Renate H.; Reif, Jochen C.

    2018-01-01

    Genome-wide prediction approaches represent versatile tools for the analysis and prediction of complex traits. Mostly they rely on marker-based information, but scenarios have been reported in which models capitalizing on closely-linked markers that were combined into haplotypes outperformed marker-based models. Detailed comparisons were undertaken to reveal under which circumstances haplotype-based genome-wide prediction models are superior to marker-based models. Specifically, it was of interest to analyze whether and how haplotype-based models may take local epistatic effects between markers into account. Assuming that populations consisted of fully homozygous individuals, a marker-based model in which local epistatic effects inside haplotype blocks were exploited (LEGBLUP) was linearly transformable into a haplotype-based model (HGBLUP). This theoretical derivation formally revealed that haplotype-based genome-wide prediction models capitalize on local epistatic effects among markers. Simulation studies corroborated this finding. Due to its computational efficiency the HGBLUP model promises to be an interesting tool for studies in which ultra-high-density SNP data sets are studied. Applying the HGBLUP model to empirical data sets revealed higher prediction accuracies than for marker-based models for both traits studied using a mouse panel. In contrast, only a small subset of the traits analyzed in crop populations showed such a benefit. Cases in which higher prediction accuracies are observed for HGBLUP than for marker-based models are expected to be of immediate relevance for breeders: due to the tight linkage, a beneficial haplotype will be preserved for many generations. In this respect the inheritance of local epistatic effects very much resembles that of additive effects. PMID:29549092

  9. Aldehyde dehydrogenase-2 genotypes and HLA haplotypes in Japanese patients with esophageal cancer.

    PubMed

    Watanabe, Seishiro; Sasahara, Katsuyuki; Kinekawa, Fumihiko; Uchida, Naohito; Masaki, Tsutomu; Kurokohchi, Kazutaka; Murota, Masayuki; Touge, Tetsuo; Kawauchi, Kazuyoshi; Oda, Syuji; Kuriyama, Shigeki

    2002-01-01

    The aim of this study was to examine how aldehyde dehydrogenase-2 (ALDH2) genotypes and human leukocyte antigen (HLA) haplotypes contribute to the risk for esophageal cancer. We examined ALDH2 genotypes and HLA haplotypes in 29 Japanese patients with esophageal cancer. The ratio of patients who experienced current or former intense vasodilatation upon consuming alcohol (flushing type) was much higher in individuals with the inactive form of ALDH2 encoded by the ALDH2(2)/2(2) or ALDH2(1)/2(2) genotype than in those with the active form of ALDH2 encoded by the ALDH2(1)/2(1) genotype. The ratio of inactive ALDH2 was significantly higher in patients with esophageal cancer than in control normal subjects, suggesting that alcoholics with inactive ALDH2 were susceptible to esophageal cancer. HLA haplotypes A24, A26, B54, B61 and DR9 were prevalent in patients with esophageal cancer (82.8, 24.1, 34.5, 37.9 and 44.8%, respectively). HLA haplotype of A24 and inactive ALDH2 were simultaneously found in 58.6% of patients with esophageal cancer. Furthermore, we found other primary malignancies in 6 of 29 (20.7%) patients with esophageal cancer, and 4 of these 6 patients had both the inactive form of ALDH2 and the HLA A24 haplotype. The present study showed the high prevalence of the inactive form of ALDH2 and HLA haplotypes A24, A26, B54, B61 and DR9 in Japanese patients with esophageal cancer. Therefore, the examination of genotypes of ALDH2 loci and HLA haplotypes may allow the early detection of esophageal cancer in the Japanese population.

  10. Dimensional Anxiety Mediates Linkage of GABRA2 Haplotypes With Alcoholism

    PubMed Central

    Enoch, Mary-Anne; Schwartz, Lori; Albaugh, Bernard; Virkkunen, Matti; Goldman, David

    2015-01-01

    The GABAAα2 receptor gene (GABRA2) modulates anxiety and stress response. Three recent association studies implicate GABRA2 in alcoholism; however, in these papers both common, opposite-configuration haplotypes in the region distal to intron 3 predict risk. We have now replicated the GABRA2 association with alcoholism in 331 Plains Indian men and women and 461 Finnish Caucasian men. Using a dimensional measure of anxiety, harm avoidance (HA), we also found that the association with alcoholism is mediated, or moderated, by anxiety. Nine SNPs were genotyped revealing two haplotype blocks. Within the previously implicated block 2 region, we identified the two common, opposite-configuration risk haplotypes, A and B. Their frequencies differed markedly in Finns and Plains Indians. In both populations, most block 2 SNPs were significantly associated with alcoholism. The associations were due to increased frequencies of both homozygotes in alcoholics, indicating the possibility of alcoholic subtypes with opposite genotypes. Congruently, there was no significant haplotype association. Using HA as an indicator variable for anxiety, we found haplotype linkage to alcoholism with high and low dimensional anxiety, and to HA itself, in both populations. High HA alcoholics had the highest frequency of the more abundant haplotype (A in Finns, B in Plains Indians); low HA alcoholics had the highest frequency of the less abundant haplotype (B in Finns, A in Plains Indians) (Finns: P = 0.007, OR = 2.1; Plains Indians: P = 0.040, OR = 1.9). Non-alcoholics had intermediate frequencies. Our results suggest that within the distal GABRA2 region is a functional locus or loci that may differ between populations but that alters risk for alcoholism via the mediating action of anxiety. PMID:16874763

  11. A Haplotype Information Theory Method Reveals Genes of Evolutionary Interest in European vs. Asian Pigs.

    PubMed

    Hudson, Nicholas J; Naval-Sánchez, Marina; Porto-Neto, Laercio; Pérez-Enciso, Miguel; Reverter, Antonio

    2018-06-05

    Asian and European wild boars were independently domesticated ca. 10,000 years ago. Since the 17th century, Chinese breeds have been imported to Europe to improve the genetics of European animals by introgression of favourable alleles, resulting in a complex mosaic of haplotypes. To interrogate the structure of these haplotypes further, we have run a new haplotype segregation analysis based on information theory, namely compression efficiency (CE). We applied the approach to sequence data from individuals from each phylogeographic region (n = 23 from Asia and Europe) including a number of major pig breeds. Our genome-wide CE is able to discriminate the breeds in a manner reflecting phylogeography. Furthermore, 24,956 non-overlapping sliding windows (each comprising 1,000 consecutive SNPs) were quantified for extent of haplotype sharing within and between Asia and Europe. The genome-wide distribution of extent of haplotype sharing was quite different between groups. Unlike that of European pigs, haplotype sharing in Asian pigs approximates a normal distribution. In line with this, we found the European breeds possessed a number of genomic windows of dramatically higher haplotype sharing than the Asian breeds. Our CE analysis of sliding windows captures some of the genomic regions reported to contain signatures of selection in domestic pigs. Prominent among these regions, we highlight the role of a gene encoding the mitochondrial enzyme LACTB, which has been associated with obesity, and the gene encoding MYOG, a fundamental transcriptional regulator of myogenesis. The origin of these regions likely reflects either a population bottleneck in European animals, or selective targets on commercial phenotypes reducing allelic diversity in particular genes and/or regulatory regions.

  12. Rapid growth of a Eurasian haplotype of Phragmites australis in a restored brackish marsh in Louisiana, USA

    USGS Publications Warehouse

    Howard, R.J.; Travis, S.E.; Sikes, B.A.

    2008-01-01

    While numerous studies have documented patterns of invasion by non-indigenous plant species, few have considered the invasive properties of non-native genotypes of native species. Characteristics associated with specific genotypes, such as tolerance to disturbance, may mistakenly be applied to an entire species in the absence of genetic information, which consequently may affect management decisions. We report here on the incidence and growth of an introduced lineage of Phragmites australis in the Gulf of Mexico coastal zone of Louisiana. P. australis was collected from nine separate locations for inclusion in a series of growth experiments. Chloroplast DNA analysis indicated that specimens collected from four locations in the Mississippi River Delta represented the introduced Eurasian haplotype; the remainder represented the gulf coast haplotype. Three distinct genotypes, or clones, were identified within each haplotype via analysis using amplified fragment length polymorphisms, which also revealed reduced genetic diversity of the gulf coast clones compared to the Eurasian clones. Clones of each haplotype were planted along with three other native macrophytes at similar densities in a restored brackish marsh and monitored for growth. After 14 months, the Eurasian haplotype had spread vegetatively to cover about 82% of the experimental plots, more than four times the coverage (18%) of the gulf coast haplotype. Thus, the use of P. australis plantings for wetland restoration should consider the genetic lineage of plants used since our results indicate the potential of the Eurasian haplotype to grow rapidly at newly restored sites. This rapid growth may limit the establishment of more slowly growing native species. © 2007 Springer Science+Business Media B.V.

  13. IL7Rα Expression and Upregulation by IFNβ in Dendritic Cell Subsets Is Haplotype-Dependent

    PubMed Central

    McKay, Fiona C.; Hoe, Edwin; Parnell, Grant; Gatt, Prudence; Schibeci, Stephen D.; Stewart, Graeme J.; Booth, David R.

    2013-01-01

    The IL7Rα gene is unequivocally associated with susceptibility to multiple sclerosis (MS). Haplotype 2 (Hap 2) confers protection from MS, and T cells and dendritic cells (DCs) of Hap 2 exhibit reduced splicing of exon 6, resulting in production of relatively less soluble receptor, and potentially more response to ligand. We have previously shown in CD4 T cells that IL7Rα haplotypes 1 and 2, but not 4, respond to interferon beta (IFNβ), the most commonly used immunomodulatory drug in MS, and that haplotype 4 (Hap 4) homozygotes have the highest risk of developing MS. We now show that IL7R expression increases in myeloid cells in response to IFNβ, but that the response is haplotype-dependent, with cells from homozygotes for Hap 4 again showing no response. This was shown using freshly derived monocytes, in vitro cultured immature and mature monocyte-derived dendritic cells, and by comparing homozygotes for the common haplotypes, and relative expression of alleles in heterozygotes (Hap 4 vs not Hap 4). As for T cells, in all myeloid cell subsets examined, Hap 2 homozygotes showed a trend for reduced splicing of exon 6 compared to the other haplotypes, significantly so in most conditions. These data are consistent with increased signaling being protective from MS, constitutively and in response to IFNβ. We also demonstrate significant regulation of immune response, chemokine activity and cytokine biosynthesis pathways by IL7Rα signaling in IFNβ -treated myeloid subsets. IFNβ-responsive genes are over-represented amongst genes associated with MS susceptibility. IL7Rα haplotype may contribute to MS susceptibility through reduced capacity for IL7Rα signalling in myeloid cells, especially in the presence of IFNβ, and is currently under investigation as a predictor of therapeutic response. PMID:24147013

  14. The Functional SNPs in the 5’ Regulatory Region of the Porcine PPARD Gene Have Significant Association with Fat Deposition Traits

    PubMed Central

    Hu, Shanyao; Lin, Bin; Yan, Dechao; Xu, Zaiyan; Zhang, Zijun; Mao, Yuanliang; Mao, Huimin; Wang, Litong; Wang, Guoshui; Xiong, Yuanzhu; Zuo, Bo

    2015-01-01

    Peroxisome proliferator-activated receptor delta (PPARD) is a key regulator of lipid metabolism, insulin sensitivity, cell proliferation and differentiation. In this study, we identified two Single Nucleotide Polymorphisms (SNPs, g.1015 A>G and g.1018 T>C) constituting four haplotypes (GT, GC, AC and AT) in the 5’ regulatory region of the porcine PPARD gene. Functional analysis of the four haplotypes showed that the transcriptional activity of the PPARD promoter fragment carrying haplotype AC was significantly lower than that of the other haplotypes in 3T3-L1, C2C12 and PK-15 cells, and haplotype AC had the lowest binding capacity for the nuclear extracts. Transcription factor 7-like 2 (TCF7L2) enhanced the transcriptional activity of promoter fragments of the PPARD gene carrying haplotypes GT, GC and AT in C2C12 and 3T3-L1 cells, and increased the protein expression of the PPARD gene in C2C12 myoblasts. TCF7L2 differentially bound to the four haplotypes, and the binding capacity of TCF7L2 to haplotype AC was the lowest. There were significant associations between -655A/G and fat deposition traits in three pig populations: Large White × Meishan F2 pigs and French and American Large White pigs. Pigs with genotype GG had significantly higher expression of PPARD at both mRNA and protein level than those with genotype AG. These results strongly suggested that the SNPs in the 5’ regulatory region of the PPARD gene had a significant impact on pig fat deposition traits. PMID:26599230

  15. Sequence polymorphism at the human apolipoprotein AII gene ( APOA2): unexpected deficit of variation in an African-American sample.

    PubMed

    Fullerton, Stephanie M; Clark, Andrew G; Weiss, Kenneth M; Taylor, Scott L; Stengård, Jari H; Salomaa, Veikko; Boerwinkle, Eric; Nickerson, Deborah A

    2002-07-01

    A 3.3-kb region, encompassing the APOA2 gene and 2 kb of 5' and 3' flanking DNA, was re-sequenced in a "core" sample of 24 individuals, sampled without regard to health, from each of three populations: African-Americans from Jackson (Miss., USA), Europeans from North Karelia (Finland), and non-Hispanic European-Americans from Rochester (Minn., USA). Fifteen variable sites were identified (14 SNPs and one multi-allelic microsatellite, all silent), and these sites segregated as 18 sequence haplotypes (or nine, if SNPs only are considered). The haplotype distribution in the core African-American sample was unusual, with a deficit of particular haplotypes compared with those found in the other two samples, and a significantly (P<0.05) low level of nucleotide diversity relative to patterns of polymorphism and divergence at other human loci. Six of the 14 SNPs, whose variation captured the haplotype structure of the core data, were then genotyped by oligonucleotide ligation assay in an additional 2183 individuals from the same three populations (n=843, n=452, and n=888, respectively). All six sites varied in each of the larger "epidemiological" samples, and together, they defined 19 SNP haplotypes, seven with relative frequencies greater than 1% in the total sample; all of these common haplotypes had been identified earlier in the core re-sequencing survey. Here also, the African-American sample showed significantly lower SNP heterozygosity and haplotype diversity than the other two samples. The deficit of polymorphism is consistent with a population-specific non-neutral increase in the relative frequency of several haplotypes in Jackson.

  16. Three Novel Haplotypes of Theileria bicornis in Black and White Rhinoceros in Kenya.

    PubMed

    Otiende, M Y; Kivata, M W; Jowers, M J; Makumi, J N; Runo, S; Obanda, V; Gakuya, F; Mutinda, M; Kariuki, L; Alasaad, S

    2016-02-01

    Piroplasms, especially those in the genera Babesia and Theileria, have been found to naturally infect rhinoceros. Due to natural or human-induced stress factors such as capture and translocations, animals often develop clinical piroplasmosis, which is fatal if not treated. This study examines the genetic diversity and occurrence of novel Theileria species infecting both black and white rhinoceros in Kenya. Samples collected opportunistically during routine translocations and clinical interventions from 15 rhinoceros were analysed by polymerase chain reaction (PCR) using a nested amplification of the small subunit ribosomal RNA (18S rRNA) gene fragments of Babesia and Theileria. Our study revealed for the first time in Kenya the presence of Theileria bicornis in white (Ceratotherium simum simum) and black (Diceros bicornis michaeli) rhinoceros and the existence of three new haplotypes: haplotypes H1 and H3 were present in white rhinoceros, while H2 was present in black rhinoceros. No specific haplotype was correlated to any specific geographical location. The Bayesian inference 50% consensus phylogram recovered the three haplotypes monophyletically, and Theileria bicornis had very high support (BPP: 0.98). Furthermore, the genetic p-uncorrected distances and substitutions between T. bicornis and the three haplotypes were the same in all three haplotypes, indicating a very close genetic affinity. This is the first report of the occurrence of Theileria species in white and black rhinoceros from Kenya. The three new haplotypes reported here for the first time have important ecological and conservational implications, especially for population management and translocation programs and as a means of avoiding the transport of infected animals into non-affected areas. © 2014 Blackwell Verlag GmbH.

  17. Intricacies in arrangement of SNP haplotypes suggest "Great Admixture" that created modern humans.

    PubMed

    Dutta, Rajib; Mainsah, Joseph; Yatskiv, Yuriy; Chakrabortty, Sharmistha; Brennan, Patrick; Khuder, Basil; Qiu, Shuhao; Fedorova, Larisa; Fedorov, Alexei

    2017-06-05

    Inferring history from genomic sequences is challenging and problematic because chromosomes are mosaics of thousands of small identical-by-descent (IBD) fragments, each of them having their own unique story. However, the main events in recent evolution might be deciphered from comparative analysis of numerous loci. A paradox of why humans, whose effective population size is only 10^4, have nearly three million frequent SNPs is formulated and examined. We studied 5398 loci evenly covering all human autosomes. Common haplotypes built from frequent SNPs that are present in people from various populations have been examined. We demonstrated highly non-random arrangement of alleles in common haplotypes. Abundance of mutually exclusive pairs of common haplotypes that have different alleles at every polymorphic position (so-called Yin/Yang haplotypes) was found in 56% of loci. A novel widely spread category of common haplotypes named Mosaic has been described. Mosaic consists of numerous pieces of Yin/Yang haplotypes and represents an ancestral stage of one of them. Scenarios of possible appearance of large number of frequent human SNPs and their habitual arrangement in Yin/Yang common haplotypes have been evaluated with an advanced genomic simulation algorithm. Computer modeling demonstrated that the observed arrangement of 2.9 million frequent SNPs could not originate from a sole stand-alone population. A "Great Admixture" event has been proposed that can explain peculiarities with frequent SNP distributions. This Great Admixture presumably occurred 100-300 thousand years ago between two ancestral populations that had been separated from each other about a million years ago. Our programs and algorithms can be applied to other species to perform evolutionary and comparative genomics.

  18. Population structure and phylogeography of Toda buffalo in Nilgiris throw light on possible origin of aboriginal Toda tribe of South India.

    PubMed

    Kathiravan, P; Kataria, R S; Mishra, B P; Dubey, P K; Sadana, D K; Joshi, B K

    2011-08-01

    We report the genetic structure and evolutionary relationship of the endangered Toda buffalo of Nilgiris in South India with Kanarese and two other riverine buffalo breeds. The UPGMA phylogeny drawn using Nei's distance grouped South Kanara and Toda buffaloes at a single node while Marathwada and Murrah together formed a separate node. Principal component analysis was performed with pairwise interindividual chord distances which revealed clustering of Murrah and Marathwada buffaloes distinctly, while individuals of Toda and South Kanara breeds completely intermingled with each other. Furthermore, there were highly significant group variances (p < 0.01) when the breeds were grouped based on phylogeny, thus revealing the existence of cryptic genetic structure within these buffalo breeds. To know the evolutionary relationship among these breeds, 537-bp D-loop region of mitochondrial DNA was analysed. The phylogenetic analysis of mtDNA haplotypes following NJ algorithm with Chinese swamp buffalo as outgroup revealed a major cluster that included haplotypes from all the four investigated breeds and two minor clusters formed by South Kanara and Toda haplotypes. Reduced median network analysis revealed haplotypes of South Kanara and Toda to be quite distinct from the commonly found haplotypes indicating that these might have been ancestral to all the present-day haplotypes. Few mutations in two of the haplotypes of South Kanara buffalo were found to have contributed to ancestral haplotypes of Toda buffalo suggesting the possible migration of buffaloes from Kanarese region towards Nilgiris along the Western Ghats. Considering the close social, economic and cultural association of Todas with their buffaloes, the present study supports the theory of migration of Toda tribe from Kanarese/Mysore region along with their buffaloes. © 2011 Blackwell Verlag GmbH.

  19. Associations between mutations and a VNTR in the human phenylalanine hydroxylase gene

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Goltsov, A.A.; Eisensmith, R.C.; Woo, S.L.C.

    1992-09-01

    The HindIII RFLP in the human phenylalanine hydroxylase (PAH) gene is caused by the presence of an AT-rich (70%) minisatellite region. This region contains various multiples of 30-bp tandem repeats and is located 3 kb downstream of the final exon of the gene. PCR-mediated amplification of this region from haplotyped PAH chromosomes indicates that the previously reported 4.0-kb HindIII allele contains three of these repeats, while the 4.4-kb HindIII allele contains 12 of these repeats. The 4.2-kb HindIII fragment can contain six, seven, eight, or nine copies of this repeat. These variations permit more detailed analysis of mutant haplotypes 1, 5, 6, and, possibly, others. Kindred analysis in phenylketonuria families demonstrates Mendelian segregation of these VNTR alleles, as well as associations between these alleles and certain PAH mutations. The R261Q mutation, associated with haplotype 1, is associated almost exclusively with an allele containing eight repeats; the R408W mutation, when occurring on a haplotype 1 background, may also be associated with the eight-repeat VNTR allele. Other PAH mutations associated with haplotype 1, R252W and P281L, do not appear to segregate with specific VNTR alleles. The IVS-10 mutation, when associated with haplotype 6, is associated exclusively with an allele containing seven repeats. The combined use of this VNTR system and the existing RFLP haplotype system will increase the performance of prenatal diagnostic tests based on haplotype analysis. In addition, this VNTR may prove useful in studies concerning the origins and distributions of PAH mutations in different human populations. 32 refs., 3 figs., 3 tabs.

  20. Haplotype-Based Genome-Wide Prediction Models Exploit Local Epistatic Interactions Among Markers.

    PubMed

    Jiang, Yong; Schmidt, Renate H; Reif, Jochen C

    2018-05-04

    Genome-wide prediction approaches represent versatile tools for the analysis and prediction of complex traits. Mostly they rely on marker-based information, but scenarios have been reported in which models capitalizing on closely-linked markers that were combined into haplotypes outperformed marker-based models. Detailed comparisons were undertaken to reveal under which circumstances haplotype-based genome-wide prediction models are superior to marker-based models. Specifically, it was of interest to analyze whether and how haplotype-based models may take local epistatic effects between markers into account. Assuming that populations consisted of fully homozygous individuals, a marker-based model in which local epistatic effects inside haplotype blocks were exploited (LEGBLUP) was linearly transformable into a haplotype-based model (HGBLUP). This theoretical derivation formally revealed that haplotype-based genome-wide prediction models capitalize on local epistatic effects among markers. Simulation studies corroborated this finding. Due to its computational efficiency the HGBLUP model promises to be an interesting tool for studies in which ultra-high-density SNP data sets are studied. Applying the HGBLUP model to empirical data sets revealed higher prediction accuracies than for marker-based models for both traits studied using a mouse panel. In contrast, only a small subset of the traits analyzed in crop populations showed such a benefit. Cases in which higher prediction accuracies are observed for HGBLUP than for marker-based models are expected to be of immediate relevance for breeders, because tight linkage will preserve a beneficial haplotype for many generations. In this respect the inheritance of local epistatic effects very much resembles that of additive effects. Copyright © 2018 Jiang et al.

  1. Linear time algorithms to construct populations fitting multiple constraint distributions at genomic scales.

    PubMed

    Siragusa, Enrico; Haiminen, Niina; Utro, Filippo; Parida, Laxmi

    2017-10-09

    Computer simulations can be used to study population genetic methods, models and parameters, as well as to predict potential outcomes. For example, in plant populations, predicting the outcome of breeding operations can be studied using simulations. In-silico construction of populations with pre-specified characteristics is an important task in breeding optimization and other population genetic studies. We present two linear time Simulation using Best-fit Algorithms (SimBA) for two classes of problems where each co-fits two distributions: SimBA-LD fits linkage disequilibrium and minimum allele frequency distributions, while SimBA-hap fits founder-haplotype and polyploid allele dosage distributions. An incremental gap-filling version of previously introduced SimBA-LD is here demonstrated to accurately fit the target distributions, allowing efficient large scale simulations. SimBA-hap accuracy and efficiency are demonstrated by simulating tetraploid populations with varying numbers of founder haplotypes; we evaluate both a linear time greedy algorithm and an optimal solution based on mixed-integer programming. SimBA is available on http://researcher.watson.ibm.com/project/5669.

  2. In-gel multiple displacement amplification of long DNA fragments diluted to the single molecule level.

    PubMed

    Michikawa, Yuichi; Sugahara, Keisuke; Suga, Tomo; Ohtsuka, Yoshimi; Ishikawa, Kenichi; Ishikawa, Atsuko; Shiomi, Naoko; Shiomi, Tadahiro; Iwakawa, Mayumi; Imai, Takashi

    2008-12-15

    The isolation and multiple genotyping of long individual DNA fragments are needed to obtain haplotype information for diploid organisms. Limiting dilution of sample DNA followed by multiple displacement amplification is a useful technique but is restricted to short (<5 kb) DNA fragments. In the current study, a novel modification was applied to overcome these problems. A limited amount of cellular DNA was carefully released from intact cells into a mildly heated alkaline agarose solution and mixed thoroughly. The solution was then gently aliquoted and allowed to solidify while maintaining the integrity of the diluted DNA. Exogenously provided Phi29 DNA polymerase was used to perform consistent genomic amplification with random hexameric oligonucleotides within the agarose gels. Simple heat melting of the gel allowed recovery of the amplified materials in a solution of the polymerase chain reaction (PCR)-ready form. The haplotypes of seven SNPs spanning 240 kb of the DNA surrounding the human ATM gene region on chromosome 11 were determined for 10 individuals, demonstrating the feasibility of this new method.

  3. Molecular analysis and genetic diversity of Aedes albopictus (Diptera, Culicidae) from China.

    PubMed

    Ruiling, Zhang; Peien, Leng; Xuejun, Wang; Zhong, Zhang

    2018-05-01

    Aedes albopictus is one of the most invasive species, which can carry Dengue virus, Yellow fever virus and more than twenty arboviruses. Based on mitochondrial gene cytochrome c oxidase I (COI) and samples collected from 17 populations, we investigated the molecular character and genetic diversity of Ae. albopictus from China. Altogether, 25 haplotypes were detected, including 10 shared haplotypes and 15 private haplotypes. H1 was the dominant haplotype, which is widely distributed in 13 populations. Tajima's D value of most populations was significantly negative, demonstrating that populations experienced rapid range expansion recently. Most haplotypes clustered together both in phylogenetic and median-joining network analysis without clear phylogeographic patterns. However, neutrality tests revealed shallow divergences among Hainan and Guangxi with other populations (0.15599 ≤ F_ST ≤ 0.75858), which is probably due to interrupted gene flow caused by geographical isolation. In conclusion, Ae. albopictus populations showed low genetic diversity in China.

  4. HERC1 polymorphisms: population-specific variations in haplotype composition.

    PubMed

    Yuasa, Isao; Umetsu, Kazuo; Nishimukai, Hiroaki; Fukumori, Yasuo; Harihara, Shinji; Saitou, Naruya; Jin, Feng; Chattopadhyay, Prasanta K; Henke, Lotte; Henke, Jürgen

    2009-08-01

    Human HERC1 is one of six HERC proteins and may play an important role in intracellular membrane trafficking. The human HERC1 gene is suggested to have been affected by local positive selection. To assess the global frequency distributions of coding and non-coding single nucleotide polymorphisms (SNPs) in the HERC1 gene, we developed a new simultaneous genotyping method for four SNPs, and applied this method to investigate 1213 individuals from 12 global populations. The results confirmed marked differences in the allele and haplotype frequencies between East Asian and non-East Asian populations. One of the three common haplotypes observed was found to be characteristic of East Asians, who showed a relatively uniform distribution of haplotypes. Information on haplotypes would be useful for testing the function of polymorphisms in the HERC1 gene. This is the first study to investigate the distribution of HERC1 polymorphisms in various populations. (c) 2009 John Wiley & Sons, Ltd.

  5. Haplotypes identified by 10 DNA restriction fragment length polymorphisms at the human low density lipoprotein receptor gene locus.

    PubMed Central

    Kotze, M J; Langenhoven, E; Retief, A E; Seftel, H C; Henderson, H E; Weich, H F

    1989-01-01

    Ten useful two allele restriction fragment length polymorphisms of the low density lipoprotein receptor gene were used for haplotype analysis in 45 unrelated familial hypercholesterolaemic (FH) patients, 60 normal controls, and 32 FH homozygotes, all of whom were white Afrikaners. Pedigree analysis in 27 informative heterozygous FH and 23 normal families has shown the segregation of at least 17 haplotypes in the normal population (111 chromosomes) compared to a predominant association of two of these haplotypes with the disease in the FH subjects. This association was further confirmed in 32 FH homozygotes, indicating at least two 'founder' members for the disease in the Afrikaner population. Recombination events were not detected in any of the families studied and we thus conclude that the haplotypes associated with FH function as specific markers for the disease and will allow presymptomatic diagnosis in affected families. PMID:2565980

  6. Identification of a Fourth Haplotype of Bactericera cockerelli (Hemiptera: Triozidae) in the United States

    PubMed Central

    Swisher, Kylie D.; Henne, Donald C.; Crosslin, James M.

    2014-01-01

    The potato psyllid, Bactericera cockerelli (Sulc) (Hemiptera: Triozidae), is a pest of potato and other solanaceous crops in North and Central America and New Zealand. Previous genotyping studies have demonstrated the presence of three different haplotypes of B. cockerelli in the United States corresponding to three geographical regions: Central, Western, and Northwestern. These studies utilized psyllids collected in the western and central United States between 1998 and 2011. In an effort to further genotype potato psyllids collected in the 2012 growing season, a fourth B. cockerelli haplotype was discovered corresponding to the Southwestern United States geographical region. High-resolution melting analyses identified this new haplotype using an amplicon generated from a portion of the B. cockerelli mitochondrial cytochrome c oxidase subunit I gene. Sequencing of this gene, as well as use of a restriction enzyme assay, confirmed the identification of the novel B. cockerelli haplotype in the United States. PMID:25368079

  7. The Clark Phase-able Sample Size Problem: Long-Range Phasing and Loss of Heterozygosity in GWAS

    NASA Astrophysics Data System (ADS)

    Halldórsson, Bjarni V.; Aguiar, Derek; Tarpine, Ryan; Istrail, Sorin

    A phase transition is taking place today. The amount of data generated by genome resequencing technologies is so large that in some cases it is now less expensive to repeat the experiment than to store the information generated by the experiment. In the next few years it is quite possible that millions of Americans will have been genotyped. The question then arises of how to make the best use of this information and jointly estimate the haplotypes of all these individuals. The premise of the paper is that long shared genomic regions (or tracts) are unlikely unless the haplotypes are identical by descent (IBD), in contrast to short shared tracts which may be identical by state (IBS). Here we estimate for populations, using the US as a model, what sample size of genotyped individuals would be necessary to have sufficiently long shared haplotype regions (tracts) that are identical by descent (IBD), at a statistically significant level. These tracts can then be used as input for a Clark-like phasing method to obtain a complete phasing solution of the sample. We estimate in this paper that for a population like the US and about 1% of the people genotyped (approximately 2 million), tracts about 200 SNPs long are shared IBD between pairs of individuals with high probability, which assures the success of Clark-method phasing. We show on simulated data that the algorithm will get an almost perfect solution if the number of individuals being SNP arrayed is large enough and the correctness of the algorithm grows with the number of individuals being genotyped.
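
    The "Clark-like phasing method" mentioned in this abstract builds on Clark's classical inference rule. The sketch below illustrates only that textbook rule, not the IBD-tract algorithm of the paper. The genotype coding (0 = homozygous reference, 1 = heterozygous, 2 = homozygous alternate) and the function names `resolve_with` and `clark_phase` are assumptions made for illustration.

```python
def resolve_with(genotype, hap):
    """Return the haplotype that pairs with `hap` to give `genotype`, or None."""
    other = []
    for g, h in zip(genotype, hap):
        o = g - h  # allele the second haplotype would have to carry
        if o not in (0, 1):
            return None  # `hap` is incompatible with this genotype
        other.append(o)
    return tuple(other)


def clark_phase(genotypes):
    """Greedy Clark-style phasing; returns {index: (hap1, hap2)}."""
    known, phased = set(), {}
    # Seed: genotypes with at most one heterozygous site have unambiguous phase.
    for i, g in enumerate(genotypes):
        if sum(1 for x in g if x == 1) <= 1:
            h1 = tuple(1 if x == 2 else 0 for x in g)
            h2 = tuple(1 if x >= 1 else 0 for x in g)
            phased[i] = (h1, h2)
            known.update((h1, h2))
    # Inference rule: explain an ambiguous genotype with a known haplotype
    # and add the implied complementary haplotype to the known set.
    progress = True
    while progress:
        progress = False
        for i, g in enumerate(genotypes):
            if i in phased:
                continue
            for h in sorted(known):  # fixed order keeps the result deterministic
                other = resolve_with(g, h)
                if other is not None:
                    phased[i] = (h, other)
                    known.add(other)
                    progress = True
                    break
    return phased


# Hypothetical toy sample: two homozygotes seed the known set.
genotypes = [(0, 0, 0), (2, 2, 0), (1, 1, 0), (1, 1, 1)]
phased = clark_phase(genotypes)
```

    The result of this rule depends on the order in which known haplotypes are tried, and genotypes that no known haplotype explains stay unphased. Both limitations are one reason long shared tracts are attractive as input.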

  8. Mathematical model and metaheuristics for simultaneous balancing and sequencing of a robotic mixed-model assembly line

    NASA Astrophysics Data System (ADS)

    Li, Zixiang; Janardhanan, Mukund Nilakantan; Tang, Qiuhua; Nielsen, Peter

    2018-05-01

    This article presents the first method to simultaneously balance and sequence robotic mixed-model assembly lines (RMALB/S), which involves three sub-problems: task assignment, model sequencing and robot allocation. A new mixed-integer programming model is developed to minimize makespan and, using CPLEX solver, small-size problems are solved for optimality. Two metaheuristics, the restarted simulated annealing algorithm and co-evolutionary algorithm, are developed and improved to address this NP-hard problem. The restarted simulated annealing method replaces the current temperature with a new temperature to restart the search process. The co-evolutionary method uses a restart mechanism to generate a new population by modifying several vectors simultaneously. The proposed algorithms are tested on a set of benchmark problems and compared with five other high-performing metaheuristics. The proposed algorithms outperform their original editions and the benchmarked methods. The proposed algorithms are able to solve the balancing and sequencing problem of a robotic mixed-model assembly line effectively and efficiently.

  9. Chloroplast DNA variation of northern red oak

    Treesearch

    Jeanne Romero-Severson; Preston Aldrich; Yi Feng; Weilin Sun; Charles Michler

    2003-01-01

    Chloroplast DNA (cpDNA) variation was examined in 48 northern red oaks at 14 sites representing contrasting glacial histories and age structures within the state of Indiana in the United States. PCR-RFLP of three intergenic regions revealed five haplotypes. Haplotype I was common to seven sites and was the most frequent (17 trees). Haplotype II was common to five sites...

  10. [Frequency distribution of HLA antigens and haplotypes in newly arrived inhabitants of Magadan].

    PubMed

    Solovenchuk, L L; Pereverzeva, V V; Nevretdinova, Z G

    1994-09-01

    Peculiarities of the frequency distribution of antigens and haplotypes of A, B, and Cw subloci of the HLA system in 924 Slavic inhabitants of Magadan are described. Significant differences in gene and haplotype frequencies between inhabitants of Magadan and those of Moscow, Odessa, Poles'e, Latvia, and England were revealed, which could not be attributed solely to the specificity of migration processes. On the basis of an analysis of gamete associations of the A and B subloci, an attempt was made to explain the specificity of the frequency distribution of HLA system alleles and haplotypes in the investigated sample from an ecological point of view.

  11. CPm gene diversity in field isolates of Citrus tristeza virus from Colombia.

    PubMed

    Oliveros-Garay, Oscar Arturo; Martinez-Salazar, Natalhie; Torres-Ruiz, Yanneth; Acosta, Orlando

    2009-01-01

    The nucleotide sequence diversity of the CPm gene from 28 field isolates of Citrus tristeza virus (CTV) was assessed by SSCP and sequence analyses. These isolates showed two major shared haplotypes, which differed in distribution: A1 was the major haplotype in 23 isolates from different geographic regions, whereas R1 was found in isolates from a discrete region. Phylogenetic reconstruction clustered A1 within an independent group, while R1 was grouped with mild isolates T30 from Florida and T385 from Spain. Some isolates contained several minor haplotypes, which were very similar to, and associated with, the major haplotype.

  12. Fitchi: haplotype genealogy graphs based on the Fitch algorithm.

    PubMed

    Matschiner, Michael

    2016-04-15

    In population genetics and phylogeography, haplotype genealogy graphs are important tools for the visualization of population structure based on sequence data. In this type of graph, node sizes are often drawn in proportion to haplotype frequencies and edge lengths represent the minimum number of mutations separating adjacent nodes. I here present Fitchi, a new program that produces publication-ready haplotype genealogy graphs based on the Fitch algorithm. Availability: http://www.evoinformatics.eu/fitchi.htm. Contact: michaelmatschiner@mac.com. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
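
    For readers unfamiliar with the Fitch algorithm named in this abstract, the sketch below shows its bottom-up pass for a single site on a rooted binary tree. It is the textbook small-parsimony step, not Fitchi's own code. The nested-tuple tree encoding, the haplotype labels and the function name `fitch` are assumptions made for illustration.

```python
def fitch(tree, leaf_states):
    """Bottom-up Fitch pass for one site.

    tree: a leaf name (str) or a pair (left_subtree, right_subtree).
    leaf_states: dict mapping leaf name -> observed state.
    Returns (candidate state set at this node, minimum number of changes).
    """
    if isinstance(tree, str):
        return {leaf_states[tree]}, 0
    left, right = tree
    left_set, left_cost = fitch(left, leaf_states)
    right_set, right_cost = fitch(right, leaf_states)
    common = left_set & right_set
    if common:
        # Children agree on at least one state: no extra change needed.
        return common, left_cost + right_cost
    # Children disagree: take the union and count one mutation.
    return left_set | right_set, left_cost + right_cost + 1


# Hypothetical four-haplotype tree and one variable site.
tree = (("h1", "h2"), ("h3", "h4"))
site = {"h1": "A", "h2": "A", "h3": "G", "h4": "A"}
root_set, changes = fitch(tree, site)
```

    Summing the change counts over all sites gives the parsimony length of the tree. Reconstructing states at internal nodes additionally needs the top-down pass, which is omitted here.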

  13. Identification of the ancestral haplotype for apolipoprotein B suggests an African origin of Homo sapiens sapiens and traces their subsequent migration to Europe and the Pacific.

    PubMed Central

    Rapacz, J; Chen, L; Butler-Brunner, E; Wu, M J; Hasler-Rapacz, J O; Butler, R; Schumaker, V N

    1991-01-01

    The probable ancestral haplotype for human apolipoprotein B (apoB) has been identified through immunological analysis of chimpanzee and gorilla serum and sequence analysis of their DNA. Moreover, the frequency of this ancestral apoB haplotype among different human populations provides strong support for the African origin of Homo sapiens sapiens and their subsequent migration from Africa to Europe and to the Pacific. The approach used here for the identification of the ancestral human apoB haplotype is likely to be applicable to many other genes. PMID:1996341

  14. Population data for 15 Y-chromosome STRs in a population sample from Quito (Ecuador).

    PubMed

    Baeza, Carlos; Guzmán, Rodrigo; Tirado, Miriam; López-Parra, Ana María; Rodríguez, Tatiana; Mesa, María Soledad; Fernández, Eva; Arroyo-Pardo, Eduardo

    2007-12-20

    Population frequencies for the 9 Y-STR loci included in the "minimal haplotype" from Y-STR Haplotype Reference Database (YHRD), plus 6 other Y-STRs (DYS437, DYS438, DYS439, GATA A7.2, GATA H4 and GATA A10) were obtained for a sample of 120 males from Quito (Ecuador). One hundred and sixteen unique haplotypes were identified within the sample. Haplotype diversity (0.9994) was among the highest in comparison to other populations from Iberia and South-America. Genetic distances were calculated and our sample presented significant differences with all other samples, the lowest values being with a Guinean sample.
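
    The abstract does not say how haplotype diversity was computed. A common choice is Nei's unbiased estimator, H = n/(n-1) * (1 - sum of p_i^2), and the sketch below assumes it. The haplotype counts are hypothetical: the abstract gives only 120 males and 116 distinct haplotypes, so the split into 112 singletons and 4 haplotypes seen twice is one illustrative possibility, not the reported data.

```python
from collections import Counter


def haplotype_diversity(haplotypes):
    """Nei's unbiased haplotype diversity for a list of haplotype labels."""
    n = len(haplotypes)
    if n < 2:
        raise ValueError("need at least two sampled haplotypes")
    counts = Counter(haplotypes)
    sum_sq = sum((c / n) ** 2 for c in counts.values())  # sum of p_i^2
    return n / (n - 1) * (1 - sum_sq)


# Hypothetical sample: 112 singletons plus 4 haplotypes observed twice each.
sample = [f"single_{i}" for i in range(112)]
sample += [f"pair_{i}" for i in range(4) for _ in range(2)]
h = haplotype_diversity(sample)
```

    With this assumed split, `h` rounds to 0.9994. Other splits of 120 males into 116 haplotypes give slightly different values, so the match is suggestive only.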

  15. Relevance of ancestral surname identification in pedigrees of Afrikaner families with familial hypercholesterolaemia.

    PubMed

    Torrington, M; Brink, P A

    1990-03-17

    Familial hypercholesterolaemia (FH) is more prevalent among Afrikaans-speaking individuals in South Africa than elsewhere. Founder effects have been suggested as an explanation. A study was undertaken that demonstrated ancestral links for a low-density lipoprotein receptor allele, haplotype No. 2, in the two lines of descent identified and 2 other known pedigrees with the same haplotype. Probable founder members for this haplotype are identified. These differ from the founder members assumed to be responsible for a majority of FH. A minor founder effect is suggested. Explanations are given for the apparent lesser prevalence of the second haplotype associated with FH.

  16. ABO alleles are linked with haplotypes of an erythroid cell-specific regulatory element in intron 1 with a few exceptions attributable to genetic recombination.

    PubMed

    Nakajima, T; Sano, R; Takahashi, Y; Watanabe, K; Kubo, R; Kobayashi, M; Takahashi, K; Takeshita, H; Kominato, Y

    2016-01-01

    Recent investigation of transcriptional regulation of the ABO genes has identified a candidate erythroid cell-specific regulatory element, named the +5·8-kb site, in the first intron of ABO. Six haplotypes of the site have been reported previously. The present genetic population study demonstrated that each haplotype was mostly linked with specific ABO alleles with a few exceptions, possibly as a result of hybrid formation between common ABO alleles. Thus, investigation of these haplotypes could provide a clue to further elucidation of ABO alleles. © 2015 International Society of Blood Transfusion.

  17. Identification of the ancestral haplotype for apolipoprotein B suggests an African origin of Homo sapiens sapiens and traces their subsequent migration to Europe and the Pacific

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Rapacz, J.; Hasler-Rapacz, J.O.; Chen, L.

    1991-02-15

    The probable ancestral haplotype for human apolipoprotein B (apoB) has been identified through immunological analysis of chimpanzee and gorilla serum and sequence analysis of their DNA. Moreover, the frequency of this ancestral apoB haplotype among different human populations provides strong support for the African origin of Homo sapiens sapiens and their subsequent migration from Africa to Europe and to the Pacific. The approach used here for the identification of the ancestral human apoB haplotype is likely to be applicable to many other genes.

  18. Review of General Algorithmic Features for Genome Assemblers for Next Generation Sequencers

    PubMed Central

    Wajid, Bilal; Serpedin, Erchin

    2012-01-01

    In the realm of bioinformatics and computational biology, the most rudimentary data upon which all the analysis is built is the sequence data of genes, proteins and RNA. The sequence data of the entire genome is the solution to the genome assembly problem. The scope of this contribution is to provide an overview on the art of problem-solving applied within the domain of genome assembly in the next-generation sequencing (NGS) platforms. This article discusses the major genome assemblers that were proposed in the literature during the past decade by outlining their basic working principles. It is intended to act as a qualitative, not a quantitative, tutorial to all working on genome assemblers pertaining to the next generation of sequencers. We discuss the theoretical aspects of various genome assemblers, identifying their working schemes. We also briefly discuss the direction in which the area is headed, along with core issues of software simplicity. PMID:22768980

  19. Oligonucleotide-genotyping as a method of detecting the HLA-DR2 (DRw15)-Dw2, -DR2 (DRw15)-Dw12, -DR4-Dw15, and -DR4-D"KT2" haplotypes in the Japanese population.

    PubMed

    Obata, F; Ito, I; Kaneko, T; Ohkubo, M; Ishimoto, A L; Abe, A; Kashiwagi, N

    1989-05-01

    We synthesized pairs of four different oligonucleotides, F22, F29, F42, and F158, to analyse the HLA-DR2 (DRw15) and -DR4 haplotypes in the Japanese population. After enzymatically amplifying the HLA-DRB1 gene, we hybridized the oligonucleotide probes with DNA extracted from 42 donors. Hybridization was completed between F22 and the DNA of haplotype DR2 (DRw15)-Dw2, between F29 and the DNA of DR2 (DRw15)-Dw12, between F42 and the DNA of DR4-D"KT2", and between F158 and the DNA of DR4-Dw15. In keeping with the nucleotide sequences of the probes, F29 hybridized also with DNA from the DR9-Dw23 haplotype and F158 with that from some of the DRw8 haplotypes (DRw8-Dw8.3) in the Japanese population. Results of this study demonstrate that the four oligonucleotides make useful probes for detecting the haplotypes above.

  20. HLA-G, -A haplotypes in Amerindians (Ecuador): HLA-G*01:05N World distribution.

    PubMed

    Arnaiz-Villena, Antonio; Palacio-Gruber, Jose; Enriquez de Salamanca, Mercedes; Juárez, Ignacio; Campos, Cristina; Nieto, Jorge; Muñiz, Ester; Martin-Villa, Jose Manuel

    2018-02-01

    HLA-G and HLA-A frequencies have been analysed in Amerindians from Ecuador. HLA-G allele frequencies are found to be closer to those of other Amerindians (Mayas from Guatemala and Uros from Peru) and closer to European ones than to those of Far East Asian groups, particularly with regard to the HLA-G*01:04 allele. HLA-G/-A haplotypes have been calculated for the first time in Amerindians. It is remarkable that the HLA-G*01:05N "null" allele is found at a very low frequency (as in Amerindian Mayas and Uros) and is also found in haplotypes belonging to the HLA-A19 group of alleles (HLA-A*30, -A*31, -A*33). It was previously postulated that HLA-G*01:05N appeared in HLA-A*30/-B*13 haplotypes in Middle East Mediterraneans. It may be hypothesized that, in evolution, HLA-G*01:05N existed primarily in one of the extant or extinct HLA-A19 haplotypes, whether this haplotype was located in the Middle East or in other world areas, including America. However, the highest present-day HLA-G*01:05N frequencies are found in Middle East Mediterraneans. Copyright © 2017. Published by Elsevier Inc.

  1. Geographic distribution of haplotype diversity at the bovine casein locus

    PubMed Central

    Jann, Oliver C; Ibeagha-Awemu, Eveline M; Özbeyaz, Ceyhan; Zaragoza, Pilar; Williams, John L; Ajmone-Marsan, Paolo; Lenstra, Johannes A; Moazami-Goudarzi, Katy; Erhardt, Georg

    2004-01-01

    The genetic diversity of the casein locus in cattle was studied on the basis of haplotype analysis. Consideration of recently described genetic variants of the casein genes which to date have not been the subject of diversity studies, allowed the identification of new haplotypes. Genotyping of 30 cattle breeds from four continents revealed a geographically associated distribution of haplotypes, mainly defined by frequencies of alleles at CSN1S1 and CSN3. The genetic diversity within taurine breeds in Europe was found to decrease significantly from the south to the north and from the east to the west. Such geographic patterns of cattle genetic variation at the casein locus may be a result of the domestication process of modern cattle as well as geographically differentiated natural or artificial selection. The comparison of African Bos taurus and Bos indicus breeds allowed the identification of several Bos indicus specific haplotypes (CSN1S1*C-CSN2*A2-CSN3*AI/CSN3*H) that are not found in pure taurine breeds. The occurrence of such haplotypes in southern European breeds also suggests that an introgression of indicine genes into taurine breeds could have contributed to the distribution of the genetic variation observed. PMID:15040901

  2. Selective sweep at the Drosophila melanogaster Suppressor of Hairless locus and its association with the In(2L)t inversion polymorphism.

    PubMed Central

    Depaulis, F; Brazier, L; Veuille, M

    1999-01-01

    The hitchhiking model of population genetics predicts that an allele favored by Darwinian selection can replace haplotypes from the same locus previously established at a neutral mutation-drift equilibrium. This process, known as "selective sweep," was studied by comparing molecular variation between the polymorphic In(2L)t inversion and the standard chromosome. Sequence variation was recorded at the Suppressor of Hairless (Su[H]) gene in an African population of Drosophila melanogaster. We found 47 nucleotide polymorphisms among 20 sequences of 1.2 kb. Neutrality tests were nonsignificant at the nucleotide level. However, these sites were strongly associated, because 290 out of 741 observed pairwise combinations between them were in significant linkage disequilibrium. We found only seven haplotypes, two occurring in the 9 In(2L)t chromosomes, and five in the 11 standard chromosomes, with no shared haplotype. Two haplotypes, one in each chromosome arrangement, made up two-thirds of the sample. This low haplotype diversity departed from neutrality in a haplotype test. This pattern supports a selective sweep hypothesis for the Su(H) chromosome region. PMID:10388820
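
    The pair counts quoted in the abstract above can be checked with a few lines of arithmetic. This is only a sketch: the abstract does not say which sites entered the linkage disequilibrium tests, so the reading that 741 pairs correspond to a 39-site subset of the 47 polymorphisms is an inference from the numbers, and the helper name `n_site_pairs` is hypothetical.

```python
from math import comb


def n_site_pairs(k: int) -> int:
    """Number of unordered pairs that can be formed from k polymorphic sites."""
    return comb(k, 2)


# All 47 polymorphisms would give more pairs than the 741 reported.
pairs_all_sites = n_site_pairs(47)

# 741 is exactly the number of pairs among 39 sites (an inference, not a
# statement from the abstract).
pairs_subset = n_site_pairs(39)

# Share of tested pairs reported to be in significant linkage disequilibrium.
fraction_in_ld = 290 / 741

print(pairs_all_sites, pairs_subset, round(fraction_in_ld, 3))
```

    Under that reading, roughly 39% of the tested site pairs were in significant linkage disequilibrium.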

  3. MDR1 haplotypes derived from exons 21 and 26 do not affect the steady-state pharmacokinetics of tacrolimus in renal transplant patients.

    PubMed

    Mai, Ingrid; Perloff, Elke S; Bauer, Steffen; Goldammer, Mark; Johne, Andreas; Filler, Guido; Budde, Klemens; Roots, Ivar

    2004-11-01

    This retrospective study investigated the influence of MDR1 haplotypes derived from the polymorphisms 2677G > T (exon 21) and 3435C > T (exon 26) on the pharmacokinetics of the immunosuppressant drug tacrolimus in 73 renal transplant patients. Based on both variants of SNPs 2677 and 3435, four different haplotypes and eight different genotypes were identified in the study sample. Tacrolimus trough concentrations (C(0)) were compared between different SNP variants and genotypes, as well as between carriers and noncarriers of each haplotype. Additionally, CYP3A5 genotype (6956G > A) was determined. No significant differences were observed between groups. Differences in mean tacrolimus C(0) values between carriers and noncarriers of each haplotype ranged from -0.04 microg/litre (95% confidence interval: -0.53 to 0.60) to -23 microg/litre (-1.07 to 1.53). No association was found between CYP3A5*1/*3 genotype and tacrolimus C(0) concentrations. MDR1 haplotypes derived from the SNPs 2677G > T (exon 21) and 3435C > T (exon 26) do not influence the pharmacokinetics of tacrolimus in renal transplant patients.

  4. Mechanisms of haplotype divergence at the RGA08 nucleotide-binding leucine-rich repeat gene locus in wild banana (Musa balbisiana)

    PubMed Central

    2010-01-01

    Background: Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Results: Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M. balbisiana haplotypes. Conclusions: A large RGA08 gene cluster identified in wild banana corresponds to a highly variable genomic region between haplotypes surrounded by conserved flanking regions. High level of sequence identity (70 to 99%) of the genic and intergenic regions suggests a recent and rapid evolution of this cluster in M. balbisiana. PMID:20637079

  5. Genetic variation in CXCR1 haplotypes linked to severity of Streptococcus uberis infection in an experimental challenge model.

    PubMed

    Siebert, Lydia; Headrick, Susan; Lewis, Mark; Gillespie, Barbara; Young, Charlie; Wojakiewicz, Leszek; Kerro-Dego, Oudessa; Prado, Maria E; Almeida, Raul; Oliver, Stephen P; Pighetti, Gina M

    2017-08-01

    Mastitis, an inflammation of the mammary gland, costs the dairy industry billions of dollars in lost revenues annually. The prevalence and costs associated with mastitis have made genetic selection methods a target for research. Previous research has identified amino acid changes at positions 122, 207, 245, 327, and 332 in the IL8 receptor, CXCR1, that result in three dominant amino acid haplotypes: VWHKH, VWHRR, and AWQRR. We hypothesize different haplotype combinations influence a cow's resistance, strength, and duration of response to mastitis. To test this, Holstein dairy cows (n=40) were intramammarily challenged with Streptococcus uberis within 3 d post-calving. All cows developed mastitis based on isolation of S. uberis from the challenged quarter at least twice. All cows with the VWHRR x VWHRR (n=5) and AWQRR x VWHRR (n=6) haplotype combinations required antibiotic therapy due to clinical signs of mastitis and tended (P=0.08) to be different from cows with a VWHRR x VWHKH (n=6) haplotype combination where only 33.3% required antibiotic therapy. Cows with a VWHRR homozygous haplotype combination displayed significantly higher responses to challenge indicated by elevated S. uberis counts (4340±5,521.9CFU/mL; P=0.01), mammary scores (1.1±0.18; P=0.03), milk scores (0.9±0.17; P=0.002), and SCC (1,010,832±489,993cells/mL; P=0.03). Contrastingly, AWQRR x VWHRR cows had significantly lower S. uberis counts (15.3±16.46CFU/mL; P=0.01), mammary scores (0.3±0.16; P=0.03), milk scores (0±0.15; P=0.002), and SCC (239,261±92,264.3cells/mL; P=0.03). Cows of the VWHKH x VWHRR haplotype combination displayed responses to challenge statistically comparable to other haplotype combinations, but appeared to have an earlier peak in SCC in comparison to all other haplotype combinations. Haplotype combination did not influence milk yield (P=0.6). Our results suggest using combinations of the SNPs within the CXCR1 gene gives a better indication of a cow's ability to combat S. uberis mastitis and could resolve prior studies' conflicting results focusing on individual SNPs. Copyright © 2017 Elsevier B.V. All rights reserved.

  6. Research on the Development Prospect of Assembled Passive Building Based on Green Development Concept

    NASA Astrophysics Data System (ADS)

    Lixin, Zhang; Ju, Ma; Baohui, He

    2018-02-01

    In recent years, the vigorous development of the construction industry has brought about serious problems of environmental pollution and resource consumption. To reduce the negative impact of the construction industry on the natural environment, this paper studies, from the perspective of environmental protection, the pollution and high resource consumption associated with production and use in the traditional construction industry, and compares and analyzes the green and energy-saving advantages of assembled passive buildings in the construction and use phases. Taking into account China's vigorous promotion of assembled passive buildings and green development, it concludes that the assembled passive building is the new development direction of China’s construction industry.

  7. Association between β2-adrenoceptor (ADRB2) haplotypes and insulin resistance in PCOS.

    PubMed

    Tellechea, Mariana L; Muzzio, Damián O; Iglesias Molli, Andrea E; Belli, Susana H; Graffigna, Mabel N; Levalle, Oscar A; Frechtel, Gustavo D; Cerrone, Gloria E

    2013-04-01

    The aim of this study was to explore β2-adrenoceptor (ADRB2) haplotype associations with phenotypes and quantitative traits related to insulin resistance (IR) and the metabolic syndrome (MS) in a polycystic ovary syndrome (PCOS) population. A secondary purpose was to assess the association between ADRB2 haplotype and PCOS. Genetic polymorphism analysis. Cross-sectional case-control association study. Medical University Hospital and research laboratory. One hundred and sixty-five unrelated women with PCOS and 116 unrelated women without PCOS (control sample). Clinical and biochemical measurements, and ADRB2 genotyping in PCOS patients and control subjects. ADRB2 haplotypes (comprising rs1042711, rs1801704, rs1042713 and rs1042714 in that order), genotyping and statistical analysis to evaluate associations with continuous variables and traits related to IR and MS in a PCOS population. Associations between ADRB2 haplotypes and PCOS were also assessed. We observed an age-adjusted association between ADRB2 haplotype CCGG and lower insulin (P = 0·018) and HOMA (P = 0·008) in the PCOS sample. Interestingly, the expected differences in surrogate measures of IR between cases and controls were not significant in CCGG/CCGG carriers. In the case-control study, genotype CCGG/CCGG was associated with a 14% decrease in PCOS risk (P = 0·043), taking into account confounding variables. Haplotype I (CCGG) has a protective role for IR and MS in PCOS. © 2012 Blackwell Publishing Ltd.

  8. Multiple distant origins for green sea turtles aggregating off Gorgona Island in the Colombian eastern Pacific.

    PubMed

    Amorocho, Diego F; Abreu-Grobois, F Alberto; Dutton, Peter H; Reina, Richard D

    2012-01-01

    Mitochondrial DNA analyses have been useful for resolving maternal lineages and migratory behavior to foraging grounds (FG) in sea turtles. However, little is known about source rookeries and haplotype composition of foraging green turtle aggregations in the southeastern Pacific. We used mitochondrial DNA control region sequences to identify the haplotype composition of 55 green turtles, Chelonia mydas, captured in foraging grounds of Gorgona National Park in the Colombian Pacific. Amplified fragments of the control region (457 bp) revealed the presence of seven haplotypes, with haplotype (h) and nucleotide (π) diversities of h = 0.300±0.080 and π = 0.009±0.005 respectively. The most common haplotype was CMP4 observed in 83% of individuals, followed by CMP22 (5%). The genetic composition of the Gorgona foraging population primarily comprised haplotypes that have been found at eastern Pacific rookeries including Mexico and the Galapagos, as well as haplotypes of unknown stock origin that likely originated from more distant western Pacific rookeries. Mixed stock analysis suggests that the Gorgona FG population is comprised mostly of animals from the Galapagos rookery (80%). Lagrangian drifter data showed that movement of turtles along the eastern Pacific coast and eastward from distant western and central Pacific sites was possible through passive drift. Our results highlight the importance of this protected area for conservation management of green turtles recruited from distant sites along the eastern Pacific Ocean.
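
    The haplotype (h) and nucleotide (π) diversities reported in records like the one above are commonly computed with Nei's estimators. The sketch below assumes those standard formulas, h = n/(n-1) · (1 - Σ p_i²) and π as the mean number of pairwise differences per site; the abstract does not state which software or estimator the authors used. The function names and the toy data are hypothetical and are not taken from the study.

```python
from collections import Counter
from itertools import combinations


def haplotype_diversity(haplotypes):
    """Nei's unbiased haplotype diversity for a list of haplotype labels."""
    n = len(haplotypes)
    if n < 2:
        raise ValueError("need at least two sampled sequences")
    counts = Counter(haplotypes)
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1.0 - sum_sq)


def nucleotide_diversity(seqs):
    """Mean number of pairwise differences per site for aligned sequences."""
    n = len(seqs)
    length = len(seqs[0])
    if n < 2 or any(len(s) != length for s in seqs):
        raise ValueError("need two or more aligned sequences of equal length")
    total_diffs = sum(
        sum(a != b for a, b in zip(s, t)) for s, t in combinations(seqs, 2)
    )
    n_pairs = n * (n - 1) / 2
    return total_diffs / n_pairs / length


# Hypothetical toy sample: one dominant haplotype and two rare ones.
toy_labels = ["A"] * 8 + ["B"] + ["C"]
toy_h = haplotype_diversity(toy_labels)

# Hypothetical aligned sequences, four sites long.
toy_seqs = ["AAAA", "AAAT", "AATT"]
toy_pi = nucleotide_diversity(toy_seqs)
```

    A sample dominated by one haplotype gives a low h, which is the pattern the abstract describes for a foraging aggregation in which a single haplotype accounts for most individuals.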

  9. Transcriptome analysis reveals the same 17 S-locus F-box genes in two haplotypes of the self-incompatibility locus of Petunia inflata.

    PubMed

    Williams, Justin S; Der, Joshua P; dePamphilis, Claude W; Kao, Teh-Hui

    2014-07-01

    Petunia possesses self-incompatibility, by which pistils reject self-pollen but accept non-self-pollen for fertilization. Self-/non-self-recognition between pollen and pistil is regulated by the pistil-specific S-RNase gene and by multiple pollen-specific S-locus F-box (SLF) genes. To date, 10 SLF genes have been identified by various methods, and seven have been shown to be involved in pollen specificity. For a given S-haplotype, each SLF interacts with a subset of its non-self S-RNases, and an as yet unknown number of SLFs are thought to collectively mediate ubiquitination and degradation of all non-self S-RNases to allow cross-compatible pollination. To identify a complete suite of SLF genes of P. inflata, we used a de novo RNA-seq approach to analyze the pollen transcriptomes of S2-haplotype and S3-haplotype, as well as the leaf transcriptome of the S3S3 genotype. We searched for genes that fit several criteria established from the properties of the known SLF genes and identified the same seven new SLF genes in S2-haplotype and S3-haplotype, suggesting that a total of 17 SLF genes constitute pollen specificity in each S-haplotype. This finding lays the foundation for understanding how multiple SLF genes evolved and the biochemical basis for differential interactions between SLF proteins and S-RNases. © 2014 American Society of Plant Biologists. All rights reserved.

  10. Multiple Distant Origins for Green Sea Turtles Aggregating off Gorgona Island in the Colombian Eastern Pacific

    PubMed Central

    Amorocho, Diego F.; Abreu-Grobois, F. Alberto; Dutton, Peter H.; Reina, Richard D.

    2012-01-01

    Mitochondrial DNA analyses have been useful for resolving maternal lineages and migratory behavior to foraging grounds (FG) in sea turtles. However, little is known about source rookeries and haplotype composition of foraging green turtle aggregations in the southeastern Pacific. We used mitochondrial DNA control region sequences to identify the haplotype composition of 55 green turtles, Chelonia mydas, captured in foraging grounds of Gorgona National Park in the Colombian Pacific. Amplified fragments of the control region (457 bp) revealed the presence of seven haplotypes, with haplotype (h) and nucleotide (π) diversities of h = 0.300±0.080 and π = 0.009±0.005 respectively. The most common haplotype was CMP4 observed in 83% of individuals, followed by CMP22 (5%). The genetic composition of the Gorgona foraging population primarily comprised haplotypes that have been found at eastern Pacific rookeries including Mexico and the Galapagos, as well as haplotypes of unknown stock origin that likely originated from more distant western Pacific rookeries. Mixed stock analysis suggests that the Gorgona FG population is comprised mostly of animals from the Galapagos rookery (80%). Lagrangian drifter data showed that movement of turtles along the eastern Pacific coast and eastward from distant western and central Pacific sites was possible through passive drift. Our results highlight the importance of this protected area for conservation management of green turtles recruited from distant sites along the eastern Pacific Ocean. PMID:22319635

  11. Construction and forensic genetic characterization of 11 autosomal haplotypes consisting of 22 tri-allelic indels.

    PubMed

    Zhao, Xiaohong; Chen, Xiaogang; Zhao, Yuancun; Zhang, Shu; Gao, Zehua; Yang, Yiwen; Wang, Yufang; Zhang, Ji

    2018-05-01

    Insertion/deletion polymorphisms (indels), which combine the advantages of both short tandem repeats and single-nucleotide polymorphisms, are suitable for parentage testing. To overcome the limitations of the low polymorphism of di-allelic indels, we constructed a set of haplotypes with physically linked, multi-allelic indels. Candidate haplotypes were selected from the 1000 Genomes Project database, and were subject to the following criteria for inclusion: (i) each marker must have a minimum allele frequency (MAF) of ≥0.1 in the Han population of China; (ii) markers must exist in a non-coding region; (iii) the physical distance between a pair of candidate indels must be <500 bp; (iv) the allele length variation of each indel must range from 1 to 20 bp; (v) different haplotypes must be located on different chromosomes or chromosomal arms, or be more than 10 Mb apart if on the same chromosomal arm; and (vi) they must not be located across a recombination hotspot. A multiplex system with 11 haplotype markers, comprising 22 tri-allelic indel loci distributed over 10 chromosomes was developed. To validate the multiplex panel, we investigated the haplotype distribution in sets of two- and three-generation pedigrees. The results demonstrated that the haplotypes consisting of multi-allelic indel markers exhibited higher polymorphism than a single indel locus, and thus provide supplementary information for forensic kinship identification. Copyright © 2018 Elsevier B.V. All rights reserved.
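
    The six inclusion criteria listed in the abstract above can be read as a filter over candidate indel pairs. The sketch below is one way to express them, not the authors' pipeline: the `Indel` and `Candidate` records, their field names, and the use of the first indel's coordinate to measure the distance between haplotypes are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Indel:
    position: int          # bp coordinate on the chromosome (hypothetical field)
    maf: float             # allele frequency used for criterion (i)
    noncoding: bool        # criterion (ii)
    length_variation: int  # allele length variation in bp, criterion (iv)


@dataclass
class Candidate:
    chrom: str
    arm: str               # "p" or "q"
    indels: tuple          # the two physically linked indels
    spans_hotspot: bool    # criterion (vi)


def passes_marker_criteria(c: Candidate) -> bool:
    """Criteria (i)-(iv) and (vi), which apply to a single candidate haplotype."""
    a, b = c.indels
    return (
        all(i.maf >= 0.1 for i in (a, b))
        and all(i.noncoding for i in (a, b))
        and abs(a.position - b.position) < 500
        and all(1 <= i.length_variation <= 20 for i in (a, b))
        and not c.spans_hotspot
    )


def are_independent(c1: Candidate, c2: Candidate) -> bool:
    """Criterion (v): different chromosome or arm, or more than 10 Mb apart."""
    if (c1.chrom, c1.arm) != (c2.chrom, c2.arm):
        return True
    return abs(c1.indels[0].position - c2.indels[0].position) > 10_000_000


# Hypothetical candidates, for illustration only.
good = Candidate("1", "p", (Indel(1_000, 0.20, True, 3), Indel(1_300, 0.15, True, 5)), False)
too_far = Candidate("1", "p", (Indel(1_000, 0.20, True, 3), Indel(1_600, 0.15, True, 5)), False)
near = Candidate("1", "p", (Indel(5_000_000, 0.2, True, 2), Indel(5_000_100, 0.2, True, 2)), False)
distant = Candidate("1", "p", (Indel(20_000_000, 0.2, True, 2), Indel(20_000_100, 0.2, True, 2)), False)
```

    Keeping the per-candidate checks separate from the pairwise independence check mirrors the abstract's wording, where criterion (v) is the only one that relates different haplotypes to each other.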

  12. The Phylogeographical Pattern and Conservation of the Chinese Cobra (Naja atra) across Its Range Based on Mitochondrial Control Region Sequences

    PubMed Central

    Lin, Long-Hui; Hua, Lei; Qu, Yan-Fu; Gao, Jian-Fang; Ji, Xiang

    2014-01-01

    The vulnerable Chinese cobra (Naja atra) ranges from southeastern China south of the Yangtze River to northern Vietnam and Laos. Large mountain ranges and water bodies may influence the pattern of genetic diversity of this species. We sequenced the mitochondrial DNA control region (1029 bp) using 285 individuals collected from 23 localities across the species' range and obtained 18 sequences unique to Taiwan from GenBank for phylogenetic and population analysis. Two distinct clades were identified, one including haplotypes from the two westernmost localities (Hekou and Miyi) and the other including haplotypes from all sampling sites except Miyi. A strong population structure was found (Φst = 0.76, P<0.0001) with high haplotype diversity (h = 1.00) and low nucleotide diversity (π = 0.0049). The Luoxiao and Nanling Mountains act as historical geographical barriers limiting gene exchange. In the haplotype network there were two “star” clusters. Haplotypes from populations east of the Luoxiao Mountains were represented within one cluster and haplotypes from populations west of the mountain range within the other, with haplotypes from populations south of the Nanling Mountains in between. Lineage sorting between mainland and island populations is incomplete. It remains unknown as to how much adaptive differentiation there is between population groups or within each group. We caution against long-distance transfers within any group, especially when environmental differences are apparent. PMID:25184236

  13. The HLA-DRB9 gene and the origin of HLA-DR haplotypes.

    PubMed

    Gongora, R; Figueroa, F; Klein, J

    1996-11-01

    HLA-DRB9 is a gene fragment consisting of exon 2 and flanking intron sequences. It is located at the extreme end of the DRB subregion, whose other end is demarcated by the DRB1 locus. We sequenced approximately 1400 base pairs of the segment encompassing the DRB9 locus from eight human haplotypes (DR1, DR10, DR2, DR3, DR5, DR6, DR8, and DR9, the DR4 and DR7 having been sequenced by others earlier), as well as two chimpanzee, five gorilla, one orangutan and one macaque haplotypes. The analysis of these sequences indicates that the DRB9 locus, which we estimate to be more than 58 million years (my) old, has been coevolving with the DRB1 locus for the last 4.2 my. As a consequence of this coevolution, the human DRB9 alleles fall into groups that correlate with the DRB1 allelic groups and with the gene organization of the human haplotypes. This observation implies that the present-day HLA-DR haplotype groups (DR1, DR51, DR52, DR8, and DR53) were founded more than 4 my ago and have remained intact (barring minor internal rearrangements that did not recombine the DRB1 and DRB9 genes) for this period of time. The haplotypes have been transmitted during speciations from ancestral to emerging species just like allelic lineages at the DRB1 locus. Thus not only allelic but also haplotype polymorphism evolves trans-specifically.

  14. Two Orangutan Species Have Evolved Different KIR Alleles and Haplotypes

    PubMed Central

    Guethlein, Lisbeth A.; Norman, Paul J.; Heijmans, Corinne M. C.; de Groot, Natasja G.; Hilton, Hugo G.; Babrzadeh, Farbod; Abi-Rached, Laurent; Bontrop, Ronald E.; Parham, Peter

    2017-01-01

    The immune and reproductive functions of human Natural Killer (NK) cells are regulated by interactions of the C1 and C2 epitopes of HLA-C with C1-specific and C2-specific lineage III killer cell immunoglobulin-like receptors (KIR). This rapidly evolving and diverse system of ligands and receptors is restricted to humans and great apes. In this context, the orangutan has particular relevance because it represents an evolutionary intermediate, one having the C1 epitope and corresponding KIR, but lacking the C2 epitope. Through a combination of direct sequencing, KIR genotyping and data mining from the Great Ape Genome Project (GAGP) we characterized the KIR alleles and haplotypes for panels of ten Bornean orangutans and 19 Sumatran orangutans. The orangutan KIR haplotypes have between five and ten KIR genes. The seven orangutan lineage III KIR genes all locate to the centromeric region of the KIR locus, whereas their human counterparts also populate the telomeric region. One lineage III KIR gene is Bornean-specific, one is Sumatran-specific and five are shared. Of twelve KIR gene-content haplotypes five are Bornean-specific, five are Sumatran-specific and two are shared. The haplotypes have different combinations of genes encoding activating and inhibitory C1 receptors that can be of higher or lower affinity. All haplotypes encode an inhibitory C1 receptor, but only some haplotypes encode an activating C1 receptor. Of 130 KIR alleles, 55 are Bornean-specific, 65 are Sumatran-specific and ten are shared. PMID:28264973

  15. Patterns of linkage disequilibrium and haplotype distribution in disease candidate genes.

    PubMed

    Long, Ji-Rong; Zhao, Lan-Juan; Liu, Peng-Yuan; Lu, Yan; Dvornyk, Volodymyr; Shen, Hui; Liu, Yong-Jun; Zhang, Yuan-Yuan; Xiong, Dong-Hai; Xiao, Peng; Deng, Hong-Wen

    2004-05-24

    The adequacy of association studies for complex diseases depends critically on the existence of linkage disequilibrium (LD) between functional alleles and surrounding SNP markers. We examined the patterns of LD and haplotype distribution in eight candidate genes for osteoporosis and/or obesity using 31 SNPs in 1,873 subjects. These eight genes are apolipoprotein E (APOE), type I collagen alpha1 (COL1A1), estrogen receptor-alpha (ER-alpha), leptin receptor (LEPR), parathyroid hormone (PTH)/PTH-related peptide receptor type 1 (PTHR1), transforming growth factor-beta1 (TGF-beta1), uncoupling protein 3 (UCP3), and vitamin D (1,25-dihydroxyvitamin D3) receptor (VDR). Yin yang haplotypes, two high-frequency haplotypes composed of completely mismatching SNP alleles, were examined. To quantify LD patterns, two common measures of LD, D' and r2, were calculated for the SNPs within the genes. The haplotype distribution varied in the different genes. Yin yang haplotypes were observed only in PTHR1 and UCP3. D' ranged from 0.020 to 1.000 with the average of 0.475, whereas the average r2 was 0.158 (ranging from 0.000 to 0.883). A decay of LD was observed as the intermarker distance increased; however, there was a great difference in the LD characteristics of different genes, and even of different regions within a gene. The differences in haplotype distributions and LD patterns among the genes underscore the importance of characterizing genomic regions of interest prior to association studies.
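
    The two LD measures named in the abstract above, D' and r2, have standard definitions for a pair of biallelic SNPs. The sketch below assumes those textbook definitions and takes phased haplotype and allele frequencies as input; it reports |D'|, consistent with the non-negative range quoted in the abstract. The function name and the example frequencies are hypothetical and are not data from the study.

```python
def ld_measures(p_ab: float, p_a: float, p_b: float):
    """Return (|D'|, r^2) for two biallelic loci.

    p_ab: frequency of the haplotype carrying allele A and allele B
    p_a:  frequency of allele A at the first locus
    p_b:  frequency of allele B at the second locus
    """
    d = p_ab - p_a * p_b  # raw disequilibrium coefficient D
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = abs(d) / d_max if d_max > 0 else 0.0
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    r2 = d * d / denom if denom > 0 else 0.0
    return d_prime, r2


# Two equally frequent, fully complementary haplotypes (AB and ab only):
# both measures reach their maximum.
complete = ld_measures(0.5, 0.5, 0.5)

# Allele A occurs only together with B, but B is more common than A:
# |D'| is still 1 while r^2 is much lower.
partial = ld_measures(0.3, 0.3, 0.6)

# Independent loci: both measures are zero.
independent = ld_measures(0.25, 0.5, 0.5)
```

    The second example illustrates why averages of D' and r2 can differ as much as they do in the abstract: D' reaches 1 whenever at least one of the four possible haplotypes is absent, whereas r2 also depends on how well the allele frequencies at the two loci match.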

  16. Genetic Diversity of Bactrocera dorsalis (Diptera: Tephritidae) on the Hawaiian Islands: Implications for an Introduction Pathway Into California.

    PubMed

    Barr, Norman B; Ledezma, Lisa A; Leblanc, Luc; San Jose, Michael; Rubinoff, Daniel; Geib, Scott M; Fujita, Brian; Bartels, David W; Garza, Daniel; Kerr, Peter; Hauser, Martin; Gaimari, Stephen

    2014-10-01

    Population genetic diversity of the oriental fruit fly, Bactrocera dorsalis (Hendel), on the Hawaiian islands of Oahu, Maui, Kauai, and Hawaii (the Big Island) was estimated using DNA sequences of the mitochondrial cytochrome c oxidase subunit I gene. In total, 932 flies representing 36 sampled sites across the four islands were sequenced for a 1,500-bp fragment of the gene named the C1500 marker. Genetic variation was low on the Hawaiian Islands with >96% of flies having just two haplotypes: C1500-Haplotype 1 (63.2%) or C1500-Haplotype 2 (33.3%). The other 33 flies (3.5%) had haplotypes similar to the two dominant haplotypes. No population structure was detected among the islands or within islands. The two haplotypes were present at similar frequencies at each sample site, suggesting that flies on the various islands can be considered one population. Comparison of the Hawaiian data set to DNA sequences of 165 flies from outbreaks in California between 2006 and 2012 indicates that a single-source introduction pathway of Hawaiian origin cannot explain many of the flies in California. Hawaii, however, could not be excluded as a maternal source for 69 flies. There was no clear geographic association for Hawaiian or non-Hawaiian haplotypes in the Bay Area or Los Angeles Basin over time. This suggests that California experienced multiple, independent introductions from different sources. © 2014 Entomological Society of America.

  17. Unique haplotypes of cacao trees as revealed by trnH-psbA chloroplast DNA

    PubMed Central

    Gutiérrez-López, Nidia; Ovando-Medina, Isidro; Salvador-Figueroa, Miguel; Molina-Freaner, Francisco; Avendaño-Arrazate, Carlos H.

    2016-01-01

    Cacao trees have been cultivated in Mesoamerica for at least 4,000 years. In this study, we analyzed sequence variation in the chloroplast DNA trnH-psbA intergenic spacer from 28 cacao trees from different farms in the Soconusco region in southern Mexico. Genetic relationships were established by two analysis approaches based on geographic origin (five populations) and genetic origin (based on a previous study). We identified six polymorphic sites, including five insertion/deletion (indels) types and one transversion. The overall nucleotide diversity was low for both approaches (geographic = 0.0032 and genetic = 0.0038). Conversely, we obtained moderate to high haplotype diversity (0.66 and 0.80) with 10 and 12 haplotypes, respectively. The common haplotype (H1) for both networks included cacao trees from all geographic locations (geographic approach) and four genetic groups (genetic approach). This common haplotype (ancient) derived a set of intermediate haplotypes and singletons interconnected by one or two mutational steps, which suggested directional selection and event purification from the expansion of narrow populations. Cacao trees from Soconusco region were grouped into one cluster without any evidence of subclustering based on AMOVA (FST = 0) and SAMOVA (FST = 0.04393) results. One population (Mazatán) showed a high haplotype frequency; thus, this population could be considered an important reservoir of genetic material. The indels located in the trnH-psbA intergenic spacer of cacao trees could be useful as markers for the development of DNA barcoding. PMID:27076998

  18. The phylogeographical pattern and conservation of the Chinese cobra (Naja atra) across its range based on mitochondrial control region sequences.

    PubMed

    Lin, Long-Hui; Hua, Lei; Qu, Yan-Fu; Gao, Jian-Fang; Ji, Xiang

    2014-01-01

    The vulnerable Chinese cobra (Naja atra) ranges from southeastern China south of the Yangtze River to northern Vietnam and Laos. Large mountain ranges and water bodies may influence the pattern of genetic diversity of this species. We sequenced the mitochondrial DNA control region (1029 bp) using 285 individuals collected from 23 localities across the species' range and obtained 18 sequences unique to Taiwan from GenBank for phylogenetic and population analysis. Two distinct clades were identified, one including haplotypes from the two westernmost localities (Hekou and Miyi) and the other including haplotypes from all sampling sites except Miyi. A strong population structure was found (Φst = 0.76, P<0.0001) with high haplotype diversity (h = 1.00) and low nucleotide diversity (π = 0.0049). The Luoxiao and Nanling Mountains act as historical geographical barriers limiting gene exchange. In the haplotype network there were two "star" clusters. Haplotypes from populations east of the Luoxiao Mountains were represented within one cluster and haplotypes from populations west of the mountain range within the other, with haplotypes from populations south of the Nanling Mountains in between. Lineage sorting between mainland and island populations is incomplete. It remains unknown as to how much adaptive differentiation there is between population groups or within each group. We caution against long-distance transfers within any group, especially when environmental differences are apparent.

  19. MHC variability in heritage breeds of chickens.

    PubMed

    Fulton, J E; Lund, A R; McCarron, A M; Pinegar, K N; Korver, D R; Classen, H L; Aggrey, S; Utterbach, C; Anthony, N B; Berres, M E

    2016-02-01

    The chicken Major Histocompatibility Complex (MHC) is very strongly associated with disease resistance and thus is a very important region of the chicken genome. Historically, MHC (B locus) has been identified by the use of serology with haplotype specific alloantisera. These antisera can be difficult to produce and frequently cross-react with multiple haplotypes and hence their application is generally limited to inbred and MHC-defined lines. As a consequence, very little information about MHC variability in heritage chicken breeds is available. DNA-based methods are now available for examining MHC variability in these previously uncharacterized populations. A high density SNP panel consisting of 101 SNP that span a 230,000 bp region of the chicken MHC was used to examine MHC variability in 17 heritage populations of chickens from five universities from Canada and the United States. The breeds included 6 heritage broiler lines, 3 Barred Plymouth Rock, 2 New Hampshire and one each of Rhode Island Red, Light Sussex, White Leghorn, Dark Brown Leghorn, and 2 synthetic lines. These heritage breeds contained from one to 11 haplotypes per line. A total of 52 unique MHC haplotypes were found with only 10 of them identical to serologically defined haplotypes. Furthermore, nine MHC recombinants with their respective parental haplotypes were identified. This survey confirms the value of these non-commercially utilized lines in maintaining genetic diversity. The identification of multiple MHC haplotypes and novel MHC recombinants indicates that diversity is being generated and maintained within these heritage populations. © 2016 Poultry Science Association Inc.

  20. Molecular characterization of a long range haplotype affecting protein yield and mastitis susceptibility in Norwegian Red cattle.

    PubMed

    Sodeland, Marte; Grove, Harald; Kent, Matthew; Taylor, Simon; Svendsen, Morten; Hayes, Ben J; Lien, Sigbjørn

    2011-08-11

    Previous fine mapping studies in Norwegian Red cattle (NRC) in the region 86-90.4 Mb on Bos taurus chromosome 6 (BTA6) have revealed a quantitative trait locus (QTL) for protein yield (PY) around 88 Mb and a QTL for clinical mastitis (CM) around 90 Mb. The close proximity of these QTLs may partly explain the unfavorable genetic correlation between these two traits in NRC. A long range haplotype covering this region was introduced into the NRC population through the importation of a Holstein-Friesian bull (1606 Frasse) from Sweden in the 1970s. It has been suggested that this haplotype has a favorable effect on milk protein content but an unfavorable effect on mastitis susceptibility. Selective breeding for milk production traits is likely to have increased the frequency of this haplotype in the NRC population. Association mapping for PY and CM in NRC was performed using genotypes from 556 SNPs throughout the region 86-97 Mb on BTA6 and daughter-yield-deviations (DYDs) from 2601 bulls made available from the Norwegian dairy herd recording system. Highest test scores for PY were found for single-nucleotide polymorphisms (SNPs) within and surrounding the genes CSN2 and CSN1S2, coding for the β-casein and α(S2)-casein proteins. High coverage re-sequencing by high throughput sequencing technology enabled molecular characterization of a long range haplotype from 1606 Frasse encompassing these two genes. Haplotype analysis of a large number of descendants from this bull indicated that the haplotype was not markedly disrupted by recombination in this region. The haplotype was associated with both increased milk protein content and increased susceptibility to mastitis, which might explain parts of the observed genetic correlation between PY and CM in NRC. Plausible causal polymorphisms affecting PY were detected in the promoter region and in the 5'-flanking UTR of CSN1S2.
These polymorphisms could affect transcription or translation of CSN1S2 and thereby affect the amount of α(S2)-casein in milk. Highest test scores for CM were found in the region 89-91 Mb on BTA6, very close to a cluster of genes coding for CXC chemokines. Expression levels of some of these CXC chemokines have previously been shown to increase in bovine mammary gland cell lines after exposure to bacterial cell wall components. Molecular characterization of the long range haplotype from the Holstein-Friesian bull 1606 Frasse, imported into NRC in the 1970s, revealed polymorphisms that could affect transcription or translation of the casein gene CSN1S2. Sires with this haplotype had daughters with significantly elevated milk protein content and selection for milk production traits is likely to have increased the frequency of this haplotype in the NRC population. The haplotype was also associated with increased mastitis susceptibility, which might explain parts of the genetic correlation between PY and CM in NRC.

  1. An expert system executive for automated assembly of large space truss structures

    NASA Technical Reports Server (NTRS)

    Allen, Cheryl L.

    1993-01-01

    Langley Research Center developed a unique test bed for investigating the practical problems associated with the assembly of large space truss structures using robotic manipulators. The test bed is the result of an interdisciplinary effort that encompasses the full spectrum of assembly problems - from the design of mechanisms to the development of software. The automated structures assembly test bed and its operation are described, the expert system executive and its development are detailed, and the planned system evolution is discussed. Emphasis is on the expert system implementation of the program executive. The executive program must direct and reliably perform complex assembly tasks with the flexibility to recover from realistic system errors. The employment of an expert system permits information that pertains to the operation of the system to be encapsulated concisely within a knowledge base. This consolidation substantially reduced code, increased flexibility, eased software upgrades, and realized a savings in software maintenance costs.

  2. Automated Test Assembly Using lp_Solve Version 5.5 in R

    ERIC Educational Resources Information Center

    Diao, Qi; van der Linden, Wim J.

    2011-01-01

    This article reviews the use of the software program lp_solve version 5.5 for solving mixed-integer automated test assembly (ATA) problems. The program is freely available under Lesser General Public License 2 (LGPL2). It can be called from the statistical language R using the lpSolveAPI interface. Three empirical problems are presented to…

  3. Multi-Robot Assembly Strategies and Metrics.

    PubMed

    Marvel, Jeremy A; Bostelman, Roger; Falco, Joe

    2018-02-01

    We present a survey of multi-robot assembly applications and methods and describe trends and general insights into the multi-robot assembly problem for industrial applications. We focus on fixtureless assembly strategies featuring two or more robotic systems. Such robotic systems include industrial robot arms, dexterous robotic hands, and autonomous mobile platforms, such as automated guided vehicles. In this survey, we identify the types of assemblies that are enabled by utilizing multiple robots, the algorithms that synchronize the motions of the robots to complete the assembly operations, and the metrics used to assess the quality and performance of the assemblies.

  4. Multi-Robot Assembly Strategies and Metrics

    PubMed Central

    MARVEL, JEREMY A.; BOSTELMAN, ROGER; FALCO, JOE

    2018-01-01

    We present a survey of multi-robot assembly applications and methods and describe trends and general insights into the multi-robot assembly problem for industrial applications. We focus on fixtureless assembly strategies featuring two or more robotic systems. Such robotic systems include industrial robot arms, dexterous robotic hands, and autonomous mobile platforms, such as automated guided vehicles. In this survey, we identify the types of assemblies that are enabled by utilizing multiple robots, the algorithms that synchronize the motions of the robots to complete the assembly operations, and the metrics used to assess the quality and performance of the assemblies. PMID:29497234

  5. The joint evolutionary histories of Wolbachia and mitochondria in Hypolimnas bolina.

    PubMed

    Charlat, Sylvain; Duplouy, Anne; Hornett, Emily A; Dyson, Emily A; Davies, Neil; Roderick, George K; Wedell, Nina; Hurst, Gregory D D

    2009-03-24

    The interaction between the Blue Moon butterfly, Hypolimnas bolina, and Wolbachia has attracted interest because of the high prevalence of male-killing achieved within the species, the ecological consequences of this high prevalence, the intensity of selection on the host to suppress the infection, and the presence of multiple Wolbachia infections inducing different phenotypes. We examined diversity in the co-inherited marker, mtDNA, and the partitioning of this between individuals of different infection status, as a means to investigate the population biology and evolutionary history of the Wolbachia infections. Part of the mitochondrial COI gene was sequenced from 298 individuals of known infection status revealing ten different haplotypes. Despite very strong biological evidence that the sample represents a single species, the ten haplotypes did not fall within a monophyletic clade within the Hypolimnas genus, with one haplotype differing by 5% from the other nine. There were strong associations between infection status and mtDNA haplotype. The presence of wBol1 infection in association with strongly divergent haplotypes prompted closer examination of wBol1 genetic variation. This revealed the existence of two cryptic subtypes, wBol1a and wBol1b. The wBol1a infection, by far the most common, was in strict association with the single divergent mtDNA haplotype. The wBol1b infection was found with two haplotypes that were also observed in uninfected specimens. Finally, the wBol2 infection was associated with a large diversity of mtDNA haplotypes, most often shared with uninfected sympatric butterflies. This data overall supports the hypothesis that high prevalence of male-killing Wolbachia (wBol1) in H. bolina is associated with very high transmission efficiency rather than regular horizontal transmission. It also suggests this infection has undergone a recent selective sweep and was introduced in this species through introgression. 
In contrast, the sharing of haplotypes between wBol2-infected and uninfected individuals indicates that this strain is not perfectly transmitted and/or shows a significant level of horizontal transmission.

  6. The joint evolutionary histories of Wolbachia and mitochondria in Hypolimnas bolina

    PubMed Central

    Charlat, Sylvain; Duplouy, Anne; Hornett, Emily A; Dyson, Emily A; Davies, Neil; Roderick, George K; Wedell, Nina; Hurst, Gregory DD

    2009-01-01

    Background The interaction between the Blue Moon butterfly, Hypolimnas bolina, and Wolbachia has attracted interest because of the high prevalence of male-killing achieved within the species, the ecological consequences of this high prevalence, the intensity of selection on the host to suppress the infection, and the presence of multiple Wolbachia infections inducing different phenotypes. We examined diversity in the co-inherited marker, mtDNA, and the partitioning of this between individuals of different infection status, as a means to investigate the population biology and evolutionary history of the Wolbachia infections. Results Part of the mitochondrial COI gene was sequenced from 298 individuals of known infection status revealing ten different haplotypes. Despite very strong biological evidence that the sample represents a single species, the ten haplotypes did not fall within a monophyletic clade within the Hypolimnas genus, with one haplotype differing by 5% from the other nine. There were strong associations between infection status and mtDNA haplotype. The presence of wBol1 infection in association with strongly divergent haplotypes prompted closer examination of wBol1 genetic variation. This revealed the existence of two cryptic subtypes, wBol1a and wBol1b. The wBol1a infection, by far the most common, was in strict association with the single divergent mtDNA haplotype. The wBol1b infection was found with two haplotypes that were also observed in uninfected specimens. Finally, the wBol2 infection was associated with a large diversity of mtDNA haplotypes, most often shared with uninfected sympatric butterflies. Conclusion This data overall supports the hypothesis that high prevalence of male-killing Wolbachia (wBol1) in H. bolina is associated with very high transmission efficiency rather than regular horizontal transmission. 
It also suggests this infection has undergone a recent selective sweep and was introduced in this species through introgression. In contrast, the sharing of haplotypes between wBol2-infected and uninfected individuals indicates that this strain is not perfectly transmitted and/or shows a significant level of horizontal transmission. PMID:19317891

  7. Haplotype analysis of the HFE gene among populations of Northern Eurasia, in patients with metabolic disorders or stomach cancer, and in long-lived people.

    PubMed

    Mikhailova, S V; Babenko, V N; Ivanoshchuk, D E; Gubina, M A; Maksimov, V N; Solovjova, I G; Voevoda, M I

    2016-06-17

    Previously, it was shown that the HFE gene (associated with human hereditary hemochromatosis) has several haplotypes of intronic polymorphisms. Some haplotype frequencies are race specific and hence can be used in phylogenetic analysis. We assumed that analysis of Caucasoid patients, living now in Western Siberia and having diseases associated with dietary habits and metabolic rate, will allow us to understand the processes of possible selection during settling of the northern part of Asia. Haplotype analysis of Northern Eurasian native and recently settled ethnic groups was performed on polymorphisms rs1799945, rs1800730, rs1800562, rs2071303, rs1800708, rs1572982, rs2794719, rs807209, and rs2032451 of this gene. The CCA haplotype of rs2071303, rs1800708, and rs1572982 was found to be associated with HLA-A2 (39%) in Asian populations. Haplotype analysis for rs1799945, rs1800730, rs1800562, rs2071303, rs1800708, and rs1572982 was performed on Russian patients with some metabolic disorders or stomach cancer and among long-lived people. Decreased frequencies of the TTA haplotype (T in rs2071303, T in rs1800708, and A in rs1572982) were observed in the groups of patients with diseases associated with overweight (fatty liver disease, type 2 diabetes mellitus, or metabolic syndrome + arterial hypertension) as compared with the control sample. We detected significant differences in this haplotype's frequency between the patients with type 2 diabetes mellitus and Russian adolescents, elderly citizens, and long-lived people (χ² P value = 0.003, 0.010, and 0.015, respectively). No significant differences in frequencies of the alleles with mutations in coding regions of the HFE gene (C282Y, H63D, and S65C) were detected between the analyzed patients (with stomach cancer, metabolic syndrome, fatty liver disease, or type 2 diabetes mellitus) and the control Caucasoid sample. Monophyletic origin of H63D (rs1799945) was confirmed in Caucasoids and Northern Asians.
The reasons for a sharp increase in the frequency of CCA haplotype of HFE in the Asian race remain unclear.

  8. Association of interleukin-10 promoter haplotypes with disease susceptibility and IL-10 levels in Mexican patients with systemic lupus erythematosus.

    PubMed

    Palafox-Sánchez, Claudia Azucena; Oregon-Romero, Edith; Salazar-Camarena, Diana Celeste; Valle, Yeminia Maribel; Machado-Contreras, Jesús René; Cruz, Alvaro; Orozco-López, Mariana; Orozco-Barocio, Gerardo; Vázquez-Del Mercado, Mónica; Muñoz-Valle, José Francisco

    2015-11-01

    Systemic lupus erythematosus (SLE) is the prototype autoimmune rheumatic disease. The etiology of this disease is incompletely understood; however, environmental factors and genetic predisposition are involved. Cytokine-mediated immunity plays a crucial role in the pathogenesis of SLE. We investigated the association of interleukin-10 (IL-10) promoter polymorphisms and their haplotypes in SLE patients from western Mexico. One hundred and twenty-five SLE patients fulfilling the 1997 ACR criteria and 260 unrelated healthy subjects (HS), both Mexican mestizos, were genotyped for IL-10 -1082A>G, -819C>T, and -592C>A polymorphisms. Haplotypes were inferred using the expectation-maximization algorithm, then allele and haplotype distributions were compared between patients and HS, as well as between patients with different clinical variables. At -1082, -819, and -592 we identified four predominant haplotypes: ACC (43.70% in patients vs 46.55% in HS), ATA (21.45 vs 22.97%), GCC (16.28 vs 14.21%), and GTA (14.12 vs 14.12%). The ATC haplotype was more frequent in SLE patients than in HS, suggesting a risk effect (3.23 vs 1.05%; OR 3.55, CI 1.14-11.11; p = 0.0293). SLE patients carrying the -592 CC genotype, as well as carriers under the dominant model of inheritance, showed higher sIL-10 levels than carriers of the AA genotype, suggesting that the -592 C allele is associated with increased production of the cytokine (p < 0.05). Carriers of the ACC haplotype had higher IL-10 serum levels and higher values of the Mexican version of the Systemic Lupus Erythematosus Disease Activity Index compared with the other haplotype carriers; however, no association was found regarding autoantibodies. Our data suggest that the IL-10 promoter haplotypes play an important role in the risk of developing SLE and influence the production of IL-10 in the Mexican population.
Nevertheless, further studies are required to analyze the expression of mRNA as well as to investigate the interacting epigenetic factors that could help to define the true contribution of this marker in SLE pathogenesis.
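
    The record above infers haplotypes with the expectation-maximization (EM) algorithm. As a rough illustration of that idea only, the sketch below estimates haplotype frequencies for two biallelic loci from unphased genotypes. It is not the authors' software: the function name, the genotype encoding (0, 1 or 2 copies of the alternate allele per locus) and the restriction to two loci are all assumptions made for brevity, whereas the study itself worked with three promoter SNPs.

```python
def em_haplotype_frequencies(genotypes, n_iter=50):
    """Estimate two-locus haplotype frequencies from unphased genotypes.

    genotypes: list of (g1, g2), each the count (0, 1, 2) of the alternate
    allele at locus 1 and locus 2 (hypothetical encoding for this sketch).
    Returns a dict mapping haplotype (a1, a2) -> estimated frequency.
    """
    if not genotypes:
        raise ValueError("at least one genotype is required")

    haps = [(0, 0), (0, 1), (1, 0), (1, 1)]
    freq = {h: 0.25 for h in haps}  # uniform starting point

    for _ in range(n_iter):
        counts = {h: 0.0 for h in haps}
        for g1, g2 in genotypes:
            if g1 == 1 and g2 == 1:
                # E-step: the double heterozygote is the only ambiguous case.
                # It is either (1,1)+(0,0) ("cis") or (1,0)+(0,1) ("trans").
                cis = freq[(1, 1)] * freq[(0, 0)]
                trans = freq[(1, 0)] * freq[(0, 1)]
                total = cis + trans
                w_cis = cis / total if total > 0 else 0.5
                counts[(1, 1)] += w_cis
                counts[(0, 0)] += w_cis
                counts[(1, 0)] += 1.0 - w_cis
                counts[(0, 1)] += 1.0 - w_cis
            else:
                # Phase is fully determined when at most one locus is heterozygous.
                h1 = (1 if g1 >= 1 else 0, 1 if g2 >= 1 else 0)
                h2 = (1 if g1 == 2 else 0, 1 if g2 == 2 else 0)
                counts[h1] += 1.0
                counts[h2] += 1.0
        # M-step: each individual contributes two haplotypes.
        n_haps = 2 * len(genotypes)
        freq = {h: c / n_haps for h, c in counts.items()}
    return freq


if __name__ == "__main__":
    # Made-up genotypes, for illustration only.
    sample = [(2, 2)] * 3 + [(0, 0)] * 3 + [(1, 1)] * 2
    for hap, f in sorted(em_haplotype_frequencies(sample).items()):
        print(hap, round(f, 4))
```

    Real analyses of this kind typically handle more loci, missing data and convergence checks; the fixed iteration count here is a simplification.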

  9. [Analysis of HLA haplotype frequency and linkage disequilibrium in patients with acute lymphoblastic leukemia from Northern Chinese Han].

    PubMed

    Gao, Su-qing; Cheng, Liang-hong; Lu, Liang; Jing, Shi-zheng; Cheng, Xi; Zhang, Yin-ze; Zou, Hong-yan; Deng, Zhi-hui

    2009-02-01

    To analyze the difference between the frequencies of HLA-A-B, B-DRB1 and A-B-DRB1 haplotypes, as well as their linkage disequilibrium pattern, in patients with acute lymphoblastic leukemia (ALL) and healthy controls from Northern Chinese Han. The frequencies of HLA-A-B, B-DRB1, A-B-DR haplotypes and linkage disequilibrium were estimated by the Expectation Maximization method based on the genotypes of 643 patients with ALL and 20,359 unrelated healthy donors, and the statistical significance of differences between the two groups was estimated by chi-square test. Linkage disequilibrium was analyzed with population genetic methods. The most common HLA-A-B, B-DRB1, and A-B-DR haplotypes were A30-B13, A2-B46, A33-B58, B13-DR7, B46-DR9, B52-DR15, B58-DR17, A30-B13-DR7, A33-B58-DR17 and A1-B37-DR10 in both groups. The frequencies of A30-B13, A2-B46, A33-B44, B13-DR7, A30-B13-DR7 and A2-B46-DR9 haplotypes and their linkage disequilibrium values were significantly lower (P<0.05) in the patient group than in the control group. On the other hand, the frequencies of A2-B52, A31-B61, A24-B8, B60-DR9, B27-DR4, B52-DR14, B44-DR17, B27-DR12 and A11-B27-DR12 haplotypes and their linkage disequilibrium values were significantly higher (P<0.05) in the patient group than in the control group. There are some common haplotypes in positive linkage disequilibrium in both the ALL patients and the healthy donors in Northern Chinese Han. Interestingly, some haplotypes and their linkage disequilibrium patterns had significantly different distributions between the two groups. The study provided basic data for the relationship between ALL and HLA haplotypes and for finding HLA-A, B, DR matching donors.
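
    The linkage disequilibrium values referred to in the record above are commonly summarized by the coefficient D and its normalized form D'. The sketch below shows one standard way to compute them for a pair of alleles. It is not the method used in the study, which is not described in detail, and the function name and the example frequencies are hypothetical.

```python
def linkage_disequilibrium(p_ab, p_a, p_b):
    """Return (D, D') for allele A at one locus and allele B at another.

    p_ab: frequency of the haplotype carrying both A and B
    p_a, p_b: frequencies of allele A and allele B
    Sign convention assumed here: D' keeps the sign of D.
    """
    d = p_ab - p_a * p_b
    if d >= 0:
        # Largest D attainable given the allele frequencies.
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        # Largest magnitude of negative D attainable.
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = d / d_max if d_max > 0 else 0.0
    return d, d_prime


if __name__ == "__main__":
    # Illustrative frequencies only, not data from the study.
    d, d_prime = linkage_disequilibrium(p_ab=0.10, p_a=0.10, p_b=0.20)
    print(f"D = {d:.4f}, D' = {d_prime:.4f}")
```

    A positive D means the two alleles occur together on the same haplotype more often than expected under independence, which is the sense of "positive linkage disequilibrium" in the abstract.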

  10. A haplotype of polymorphisms in ASE-1, RAI and ERCC1 and the effects of tobacco smoking and alcohol consumption on risk of colorectal cancer: a Danish prospective case-cohort study.

    PubMed

    Hansen, Rikke D; Sørensen, Mette; Tjønneland, Anne; Overvad, Kim; Wallin, Håkan; Raaschou-Nielsen, Ole; Vogel, Ulla

    2008-02-20

    Single nucleotide polymorphisms (SNPs) are the most frequent type of genetic variation in the human genome, and are of interest for the study of susceptibility to and protection from diseases. The haplotype at chromosome 19q13.2-3 encompassing the three SNPs ASE-1 G-21A, RAI IVS1 A4364G and ERCC1 Asn118Asn has been associated with risk of breast cancer and lung cancer. Haplotype carriers are defined as the homozygous carriers of RAI IVS1 A4364G(A), ERCC1 Asn118Asn(T) and ASE-1 G-21A(G). We aimed to evaluate whether the three polymorphisms and the haplotype are associated with risk of colorectal cancer, and investigated gene-environment associations between the polymorphisms and the haplotype and smoking status at enrolment, smoking duration, average smoking intensity and alcohol consumption, respectively, in relation to risk of colorectal cancer. Associations between the three individual polymorphisms, the haplotype and risk of colorectal cancer were examined, as well as gene-environment interaction, in a Danish case-cohort study including 405 cases and a comparison group of 810 persons. Incidence rate ratios (IRR) were estimated by the Cox proportional hazards model stratified according to gender, and two-sided 95% confidence intervals (CI) and p-values were calculated based on robust estimates of the variance-covariance matrix and Wald's test of the Cox regression parameter. No consistent associations between the three individual polymorphisms, the haplotype and risk of colorectal cancer were found. No statistically significant interactions between the genotypes and the lifestyle exposures smoking or alcohol consumption were observed. Our results suggest that the ASE-1 G-21A, RAI IVS1 A4364G and ERCC1 Asn118Asn polymorphisms and the previously identified haplotype are not associated with risk of colorectal cancer.
We found no evidence of gene-environment interaction between the three polymorphisms and the haplotype and smoking intensity and alcohol consumption, respectively, in relation to the risk of colorectal cancer.

  11. The prognostic impact of germline 46/1 haplotype of Janus kinase 2 in cytogenetically normal acute myeloid leukemia

    PubMed Central

    Nahajevszky, Sarolta; Andrikovics, Hajnalka; Batai, Arpad; Adam, Emma; Bors, Andras; Csomor, Judit; Gopcsa, Laszlo; Koszarska, Magdalena; Kozma, Andras; Lovas, Nora; Lueff, Sandor; Matrai, Zoltan; Meggyesi, Nora; Sinko, Janos; Sipos, Andrea; Varkonyi, Andrea; Fekete, Sandor; Tordai, Attila; Masszi, Tamas

    2011-01-01

    Background Prognostic risk stratification according to acquired or inherited genetic alterations has received increasing attention in acute myeloid leukemia in recent years. A germline Janus kinase 2 haplotype designated as the 46/1 haplotype has been reported to be associated with an inherited predisposition to myeloproliferative neoplasms, and also to acute myeloid leukemia with normal karyotype. The aim of this study was to assess the prognostic impact of the 46/1 haplotype on disease characteristics and treatment outcome in acute myeloid leukemia. Design and Methods Janus kinase 2 rs12343867 single nucleotide polymorphism tagging the 46/1 haplotype was genotyped by LightCycler technology applying melting curve analysis with the hybridization probe detection format in 176 patients with acute myeloid leukemia under 60 years diagnosed consecutively and treated with curative intent. Results The morphological subtype of acute myeloid leukemia with maturation was less frequent among 46/1 carriers than among non-carriers (5.6% versus 17.2%, P=0.018, cytogenetically normal subgroup: 4.3% versus 20.6%, P=0.031), while the morphological distribution shifted towards the myelomonocytoid form in 46/1 haplotype carriers (28.1% versus 14.9%, P=0.044, cytogenetically normal subgroup: 34.0% versus 11.8%, P=0.035). In cytogenetically normal cases of acute myeloid leukemia, the 46/1 carriers had a considerably lower remission rate (78.7% versus 94.1%, P=0.064) and more deaths in remission or in aplasia caused by infections (46.8% versus 23.5%, P=0.038), resulting in the 46/1 carriers having shorter disease-free survival and overall survival compared to the 46/1 non-carriers. In multivariate analysis, the 46/1 haplotype was an independent adverse prognostic factor for disease-free survival (P=0.024) and overall survival (P=0.024) in patients with a normal karyotype. Janus kinase 2 46/1 haplotype had no impact on prognosis in the subgroup with abnormal karyotype. 
Conclusions Janus kinase 2 46/1 haplotype influences morphological distribution, increasing the predisposition towards an acute myelomonocytoid form. It may be a novel, independent unfavorable risk factor in acute myeloid leukemia with a normal karyotype. PMID:21791467

  12. High-Resolution Analyses of Human Leukocyte Antigens Allele and Haplotype Frequencies Based on 169,995 Volunteers from the China Bone Marrow Donor Registry Program

    PubMed Central

    Zhou, Xiao-Yang; Zhu, Fa-Ming; Li, Jian-Ping; Mao, Wei; Zhang, De-Mei; Liu, Meng-Li; Hei, Ai-Lian; Dai, Da-Peng; Jiang, Ping; Shan, Xiao-Yan; Zhang, Bo-Wei; Zhu, Chuan-Fu; Shen, Jie; Deng, Zhi-Hui; Wang, Zheng-Lei; Yu, Wei-Jian; Chen, Qiang; Qiao, Yan-Hui; Zhu, Xiang-Ming; Lv, Rong; Li, Guo-Ying; Li, Guo-Liang; Li, Heng-Cong; Zhang, Xu; Pei, Bin; Jiao, Li-Xin; Shen, Gang; Liu, Ying; Feng, Zhi-Hui; Su, Yu-Ping; Xu, Zhao-Xia; Di, Wen-Ying; Jiang, Yao-Qin; Fu, Hong-Lei; Liu, Xiang-Jun; Liu, Xiang; Zhou, Mei-Zhen; Du, Dan; Liu, Qi; Han, Ying; Zhang, Zhi-Xin; Cai, Jian-Ping

    2015-01-01

    Allogeneic hematopoietic stem cell transplantation is a widely used and effective therapy for hematopoietic malignant diseases and numerous other disorders. High-resolution human leukocyte antigen (HLA) haplotype frequency distributions not only facilitate individual donor searches but also determine the probability with which a particular patient can find HLA-matched donors in a registry. The frequencies of the HLA-A, -B, -C, -DRB1, and -DQB1 alleles and haplotypes were estimated among 169,995 Chinese volunteers using the sequencing-based typing (SBT) method. Totals of 191 HLA-A, 244 HLA-B, 146 HLA-C, 143 HLA-DRB1 and 47 HLA-DQB1 alleles were observed, which accounted for 6.98%, 7.06%, 6.46%, 9.11% and 7.91%, respectively, of the alleles in each locus in the world (IMGT 3.16 Release, Apr. 2014). Among the 100 most common haplotypes from the 169,995 individuals, nine distinct haplotypes displayed significant regionally specific distributions. Among these, three were predominant in the South China region (i.e., the 20th, 31st, and 81st haplotypes), another three were predominant in the Southwest China region (i.e., the 68th, 79th, and 95th haplotypes), one was predominant in the South and Southwest China regions (the 18th haplotype), one was relatively common in the Northeast and North China regions (the 94th haplotype), and one was common in the Northeast, North and Northwest China regions (the 40th haplotype). In conclusion, this is the first study to analyze high-resolution HLA diversities across the entire country of China, based on a detailed and complete data set that covered 31 provinces, autonomous regions, and municipalities. Specifically, we also evaluated the HLA matching probabilities within and between geographic regions and analyzed the regional differences in the HLA diversities in China.
We believe that the data presented in this study might be useful for unrelated HLA-matched donor searches, donor registry planning, population genetic studies, and anthropogenesis studies. PMID:26421847

  13. Sequence variations of the human MPDZ gene and association with alcoholism in subjects with European ancestry.

    PubMed

    Karpyak, Victor M; Kim, Jeong-Hyun; Biernacka, Joanna M; Wieben, Eric D; Mrazek, David A; Black, John L; Choi, Doo-Sup

    2009-04-01

    Mpdz gene variations are known contributors to acute alcohol withdrawal severity and seizures in mice. To investigate the relevance of these findings for human alcoholism, we resequenced 46 exons, exon-intron boundaries, and 2 kilobases in the 5' region of the human MPDZ gene in 61 subjects with a history of alcohol withdrawal seizures (AWS), 59 subjects with a history of alcohol withdrawal without AWS, and 64 Coriell samples from self-reported nonalcoholic subjects [all European American (EA) ancestry] and compared them with the Mpdz sequences of 3 mouse strains with different propensity to AWS. To explore potential associations of the human MPDZ gene with alcoholism and AWS, single SNP and haplotype analyses were performed using 13 common variants. Sixty-seven new, mostly rare variants were discovered in the human MPDZ gene. Sequence comparison revealed that the human gene does not have variations identical to those comprising the Mpdz gene haplotype associated with AWS in mice. We also found no significant association between MPDZ haplotypes and AWS in humans. However, a global test of haplotype association revealed a significant difference in haplotype frequencies between alcohol-dependent subjects without AWS and Coriell controls (p = 0.015), suggesting a potential role of MPDZ in alcoholism and/or related phenotypes other than AWS. Haplotype-specific tests for the most common haplotypes (frequency > 0.05) revealed a specific high-risk haplotype (p = 0.006, maximum statistic p = 0.051), containing the rs13297480G allele, which was also found to be significantly more prevalent in alcoholics without AWS compared with nonalcoholic Coriell subjects (p = 0.019). Sequencing of the MPDZ gene in individuals with EA ancestry revealed no variations in the sites identical to those associated with AWS in mice. Exploratory haplotype and single SNP association analyses suggest a possible association between the MPDZ gene and alcohol dependence but not AWS.
Further functional genomic analysis of MPDZ variants and investigation of their association with a broader array of alcoholism-related phenotypes could reveal additional genetic markers of alcoholism.

  14. When less is more: 'slicing' sequencing data improves read decoding accuracy and de novo assembly quality.

    PubMed

    Lonardi, Stefano; Mirebrahim, Hamid; Wanamaker, Steve; Alpert, Matthew; Ciardo, Gianfranco; Duma, Denisa; Close, Timothy J

    2015-09-15

    Since the invention of DNA sequencing in the 70s, computational biologists have had to deal with the problem of de novo genome assembly with limited (or insufficient) depth of sequencing. In this work, we investigate the opposite problem, that is, the challenge of dealing with excessive depth of sequencing. We explore the effect of ultra-deep sequencing data in two domains: (i) the problem of decoding reads to bacterial artificial chromosome (BAC) clones (in the context of the combinatorial pooling design we have recently proposed), and (ii) the problem of de novo assembly of BAC clones. Using real ultra-deep sequencing data, we show that when the depth of sequencing increases over a certain threshold, sequencing errors make these two problems harder and harder (instead of easier, as one would expect with error-free data), and as a consequence the quality of the solution degrades with more and more data. For the first problem, we propose an effective solution based on 'divide and conquer': we 'slice' a large dataset into smaller samples of optimal size, decode each slice independently, and then merge the results. Experimental results on over 15 000 barley BACs and over 4000 cowpea BACs demonstrate a significant improvement in the quality of the decoding and the final assembly. For the second problem, we show for the first time that modern de novo assemblers cannot take advantage of ultra-deep sequencing data. Python scripts to process slices and resolve decoding conflicts are available from http://goo.gl/YXgdHT; software Hashfilter can be downloaded from http://goo.gl/MIyZHs. Contact: stelo@cs.ucr.edu or timothy.close@ucr.edu. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
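
    The 'slice, decode independently, merge' strategy described in the record above can be pictured with the following minimal sketch. It is an illustration under stated assumptions, not the authors' scripts: the function names are hypothetical, the decoder is passed in as an arbitrary callable, and merging by majority vote is a stand-in, since the abstract does not say how decoding conflicts are actually resolved.

```python
from collections import Counter, defaultdict


def slice_reads(reads, slice_size):
    """Split a list of reads into consecutive slices of at most slice_size."""
    if slice_size <= 0:
        raise ValueError("slice_size must be positive")
    return [reads[i:i + slice_size] for i in range(0, len(reads), slice_size)]


def decode_in_slices(reads, slice_size, decode_slice):
    """Decode each slice on its own, then merge the per-slice assignments.

    decode_slice: hypothetical callable taking a list of reads and returning
    a dict {read_sequence: clone_label} for that slice.
    Conflicting labels for the same sequence are settled by majority vote,
    which is an assumption of this sketch.
    """
    votes = defaultdict(Counter)
    for chunk in slice_reads(reads, slice_size):
        for sequence, label in decode_slice(chunk).items():
            votes[sequence][label] += 1
    return {seq: counter.most_common(1)[0][0] for seq, counter in votes.items()}


if __name__ == "__main__":
    # Toy decoder: labels a read by its first base. Purely illustrative.
    def toy_decoder(chunk):
        return {read: "clone_" + read[0] for read in chunk}

    toy_reads = ["AAC", "AAC", "GTT", "AAC", "GTT"]
    print(decode_in_slices(toy_reads, slice_size=2, decode_slice=toy_decoder))
```

    Choosing the slice size is the substantive part of the published approach ("samples of optimal size"); here it is simply a parameter supplied by the caller.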

  15. The genome sequence of the outbreeding globe artichoke constructed de novo incorporating a phase-aware low-pass sequencing strategy of F1 progeny

    PubMed Central

    Scaglione, Davide; Reyes-Chin-Wo, Sebastian; Acquadro, Alberto; Froenicke, Lutz; Portis, Ezio; Beitel, Christopher; Tirone, Matteo; Mauro, Rosario; Lo Monaco, Antonino; Mauromicale, Giovanni; Faccioli, Primetta; Cattivelli, Luigi; Rieseberg, Loren; Michelmore, Richard; Lanteri, Sergio

    2016-01-01

    Globe artichoke (Cynara cardunculus var. scolymus) is an out-crossing, perennial, multi-use crop species that is grown worldwide and belongs to the Compositae, one of the most successful Angiosperm families. We describe the first genome sequence of globe artichoke. The assembly, comprising 13,588 scaffolds covering 725 Mb of the 1,084 Mb genome, was generated using ~133-fold Illumina sequencing data and encodes 26,889 predicted genes. Re-sequencing (30×) of globe artichoke and cultivated cardoon (C. cardunculus var. altilis) parental genotypes and low-coverage (0.5 to 1×) genotyping-by-sequencing of 163 F1 individuals resulted in 73% of the assembled genome being anchored in 2,178 genetic bins ordered along 17 chromosomal pseudomolecules. This was achieved using a novel pipeline, SOILoCo (Scaffold Ordering by Imputation with Low Coverage), to detect heterozygous regions and assign parental haplotypes with low sequencing read depth and of unknown phase. SOILoCo provides a powerful tool for de novo genome analysis of outcrossing species. Our data will enable genome-scale analyses of evolutionary processes among crops, weeds, and wild species within and beyond the Compositae, and will facilitate the identification of economically important genes from related species. PMID:26786968

  16. Molecular characterisation of Galba truncatula, Lymnaea neotropica and L. schirazensis from Cajamarca, Peru and their potential role in transmission of human and animal fascioliasis

    PubMed Central

    2012-01-01

    Background: Human and animal fascioliasis is emerging in many world regions, among which Andean countries constitute the largest regional hot spot and Peru the country presenting more human endemic areas. A survey was undertaken on the lymnaeid snails inhabiting the hyperendemic area of Cajamarca, where human prevalences are the highest known among the areas presenting a "valley transmission pattern", to establish which species are present, genetically characterise their populations by comparison with other human endemic areas, and discuss which ones have transmission capacity and their potential implications with human and animal infection. Methods: Therefore, ribosomal DNA ITS-2 and ITS-1, and mitochondrial DNA 16S and cox1 were sequenced by the dideoxy chain-termination method. Results: Results indicate the presence of three, morphologically similar, small lymnaeid species belonging to the Galba/Fossaria group: Galba truncatula, Lymnaea neotropica and L. schirazensis. Only one combined haplotype for each species was found. The ITS-1, 16S and cox1 haplotypes of G. truncatula are new. No new haplotypes were found in the other two species. This scenario changes previous knowledge, in which only L. viator (= L. viatrix) was mentioned. Galba truncatula appears to be the most abundant, with high population densities and evident anthropophyly including usual presence in human neighbourhood. Infection by Fasciola hepatica larval stages was molecularly confirmed in two populations of this species. The nearness between G. truncatula populations presenting liver fluke infection and both human settings and schools for children, together with the absence of populations of other lymnaeid species in the locality, suggests a direct relationship with human infection. Conclusions: The geographical overlap of three lymnaeid species poses problems for epidemiological studies and control action. 
First, a problem in classifying lymnaeid specimens in both field and laboratory activities, given their transmission capacity differences: G. truncatula mainly involved in transmission to humans, L. neotropica typically responsible for livestock infection, and L. schirazensis unable to transmit. Although several phenotypic characteristics may be helpful for a preliminary specimen classification, a definitive classification can only be obtained by marker sequencing. Additionally, L. schirazensis increases the confusion, owing to its ability to mix with other Galba/Fossaria species and distort fascioliasis data such as transmission capacity and infection susceptibility. Second, a problem for epidemiological analysis, surveillance and control by methods such as mathematical modelling and Remote Sensing - Geographical Information Systems. In Cajamarca, low resolution mapping may be insufficient, as already verified in Andean areas where different lymnaeid species overlap. PMID:22894178

  17. Molecular characterisation of Galba truncatula, Lymnaea neotropica and L. schirazensis from Cajamarca, Peru and their potential role in transmission of human and animal fascioliasis.

    PubMed

    Bargues, M Dolores; Artigas, Patricio; Khoubbane, Messaoud; Ortiz, Pedro; Naquira, Cesar; Mas-Coma, Santiago

    2012-08-15

    Human and animal fascioliasis is emerging in many world regions, among which Andean countries constitute the largest regional hot spot and Peru the country presenting more human endemic areas. A survey was undertaken on the lymnaeid snails inhabiting the hyperendemic area of Cajamarca, where human prevalences are the highest known among the areas presenting a "valley transmission pattern", to establish which species are present, genetically characterise their populations by comparison with other human endemic areas, and discuss which ones have transmission capacity and their potential implications with human and animal infection. Therefore, ribosomal DNA ITS-2 and ITS-1, and mitochondrial DNA 16S and cox1 were sequenced by the dideoxy chain-termination method. Results indicate the presence of three, morphologically similar, small lymnaeid species belonging to the Galba/Fossaria group: Galba truncatula, Lymnaea neotropica and L. schirazensis. Only one combined haplotype for each species was found. The ITS-1, 16S and cox1 haplotypes of G. truncatula are new. No new haplotypes were found in the other two species. This scenario changes previous knowledge, in which only L. viator (= L. viatrix) was mentioned. Galba truncatula appears to be the most abundant, with high population densities and evident anthropophyly including usual presence in human neighbourhood. Infection by Fasciola hepatica larval stages was molecularly confirmed in two populations of this species. The nearness between G. truncatula populations presenting liver fluke infection and both human settings and schools for children, together with the absence of populations of other lymnaeid species in the locality, suggests a direct relationship with human infection. The geographical overlap of three lymnaeid species poses problems for epidemiological studies and control action. 
First, a problem in classifying lymnaeid specimens in both field and laboratory activities, given their transmission capacity differences: G. truncatula mainly involved in transmission to humans, L. neotropica typically responsible for livestock infection, and L. schirazensis unable to transmit. Although several phenotypic characteristics may be helpful for a preliminary specimen classification, a definitive classification can only be obtained by marker sequencing. Additionally, L. schirazensis increases the confusion, owing to its ability to mix with other Galba/Fossaria species and distort fascioliasis data such as transmission capacity and infection susceptibility. Second, a problem for epidemiological analysis, surveillance and control by methods such as mathematical modelling and Remote Sensing--Geographical Information Systems. In Cajamarca, low resolution mapping may be insufficient, as already verified in Andean areas where different lymnaeid species overlap.

  18. Intelligent laser soldering inspection and process control

    NASA Technical Reports Server (NTRS)

    Vanzetti, Riccardo

    1987-01-01

    Component assembly on printed circuitry keeps making giant strides toward denser packaging and smaller dimensions. From a single layer to multilayer, from through holes to surface mounted components and tape applied bonds, unrelenting progress results in new, difficult problems in assembling, soldering, inspecting and controlling the manufacturing process of the new electronics. Among the major problems are the variables introduced by human operators. The small dimensions and the tight assembly tolerances are now successfully met by machines which are faster and more precise than the human hand. The same is true for soldering. But visual inspection of the solder joints is now so severely limited by the ever shrinking area accessible to the human eye that the inspector's diagnosis cannot be trusted any longer. Solutions to correcting these problems are discussed.

  19. [Gene and haplotype frequencies for the loci HLA-A, B and DRB1 in 11755 north Chinese Han bone marrow registry donors].

    PubMed

    Wu, Qiang-Ju; Liu, Meng-Li; Qi, Jun; Liu, Sheng; Zhang, Yan; Wei, Xiao-Qian

    2007-04-01

    This study investigated human leukocyte antigen (HLA)-A, B and DRB1 allele and haplotype frequencies and the characteristics of linkage disequilibrium in north Chinese Han bone marrow donors. HLA phenotype data of 11 755 north Chinese Han bone marrow donors were obtained by PCR-SSP and PCR-SSO. HLA-A, B and DRB1 allele and haplotype frequencies were calculated with the software Arlequin, which is based on the Expectation-Maximization (EM) algorithm. The population of 11 755 unrelated donors was tested for Hardy-Weinberg equilibrium, and 18, 42 and 15 specificities of HLA alleles were identified at the HLA-A, B and DRB1 loci respectively, including HLA-A25, B42, B53, B73 and DR3, which are rarely reported in the Han population. HLA-A36, A43, A80, B78, B82 and DR18 were not detected in this study. The most frequent alleles, with a frequency of over 0.05, were HLA-A*02, A*11, A*24, A*33, A*30, A*01, A*03, A*13, B62, B*51, B*46, B60, B61, B*35, B*44, DRB1*15, DRB1*09, DRB1*04, DRB1*07, DRB1*12, DRB1*11, DRB1*14, DRB1*08, DRB1*13. A total of 2 026 HLA-A-B-DR haplotypes (with a frequency of over 10^-6) were obtained. Each of 26 three-locus haplotypes, including HLA-A30-B13-DR7, A2-B46-DR9 and A33-B58-DR17, had a frequency higher than 0.005. A30-B13-DR7 was the most frequent haplotype in the north Chinese Han population. A total of 538 haplotypes for HLA-A-B, 227 for A-DR and 522 for B-DR were obtained, of which 409, 195 and 423 respectively had a frequency higher than 10^-6. There were 28 HLA-A-B haplotypes (including A30-B13, A2-B46 and A33-B58), 26 HLA-A-DR haplotypes (including A2-DR9, A2-DR15 and A30-DR7) and 24 HLA-B-DR haplotypes (including B13-DR7, B46-DR9 and B13-DR12) with a frequency higher than 0.01. 
296 (72%) of the HLA-A-B, 130 (67%) of the A-DR and 308 (73%) of the B-DR haplotypes were in statistically significant linkage disequilibrium. HLA-A30-B13, A33-B58, A1-B37, A30-DR7, A33-DR13, A1-DR10, B37-DR10, B8-DR17, B13-DR7 and B58-DR17 showed significant positive linkage disequilibrium. It is concluded that these HLA-A, B, DRB1 gene and haplotype frequency and linkage disequilibrium data, with the largest sample size to date, are unique for the north Chinese Han population. The study will help to find matched donors for patients and lays an important foundation for further study of transplantation immunity, HLA-related diseases and population genetics in this area.
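
    To make the Expectation-Maximization step mentioned above more concrete, here is a deliberately simplified sketch. It assumes two biallelic loci and unphased genotypes coded as allele counts (0, 1 or 2), whereas the study deals with multi-allelic HLA loci, and it is not the implementation used by the software named in the abstract. The function name and the toy genotypes are made up for illustration.

```python
def em_haplotype_frequencies(genotypes, n_iter=50):
    """Estimate two-locus haplotype frequencies from unphased genotypes.

    genotypes: list of (g1, g2), the count of allele "1" at each locus.
    Only double heterozygotes (1, 1) have ambiguous phase: they carry
    either haplotypes 00 + 11 or haplotypes 01 + 10.
    """
    haps = [(0, 0), (0, 1), (1, 0), (1, 1)]
    freqs = {h: 0.25 for h in haps}          # uniform starting point
    n_hap = 2 * len(genotypes)
    alleles = {0: (0, 0), 1: (0, 1), 2: (1, 1)}
    for _ in range(n_iter):
        counts = {h: 0.0 for h in haps}
        for g1, g2 in genotypes:
            if g1 == 1 and g2 == 1:
                # E-step: split the individual between the two phasings
                cis = freqs[(0, 0)] * freqs[(1, 1)]
                trans = freqs[(0, 1)] * freqs[(1, 0)]
                total = cis + trans
                w = cis / total if total > 0 else 0.5
                counts[(0, 0)] += w
                counts[(1, 1)] += w
                counts[(0, 1)] += 1 - w
                counts[(1, 0)] += 1 - w
            else:
                # Phase is unambiguous: read the two haplotypes directly
                a1, a2 = alleles[g1], alleles[g2]
                counts[(a1[0], a2[0])] += 1
                counts[(a1[1], a2[1])] += 1
        # M-step: expected counts become the new frequency estimates
        freqs = {h: c / n_hap for h, c in counts.items()}
    return freqs


toy = [(0, 0), (2, 2), (1, 1)]               # made-up genotypes
print(em_haplotype_frequencies(toy))
```

    In the toy data the two homozygous individuals supply haplotypes 00 and 11, so the iterations assign the double heterozygote to the 00 + 11 phasing.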

  20. Mitochondrial-DNA variation among subspecies and populations of sea otters (Enhydra lutris)

    USGS Publications Warehouse

    Cronin, Matthew A.; Bodkin, James L.; Ballachey, Brenda E.; Estes, James A.; Patton, John C.

    1996-01-01

    We used restriction-enzyme analysis of polymerase-chain reaction-amplified, mitochondrial DNA (mtDNA) to assess genetic differentiation of subspecies and populations of sea otters, Enhydra lutris, throughout the range of the species. There were several haplotypes of mtDNA in each subspecies and geographically separate populations. MtDNA sequence divergence of haplotypes of sea otters was 0.0004–0.0041 base substitutions per nucleotide. E. l. nereis appears to have monophyletic mitochondrial DNA, while E. l. lutris and E. l. kenyoni do not. Different frequencies of haplotypes of mtDNA among populations reflect current restriction of gene flow and the unique histories of different populations. There are two or three haplotypes of mtDNA and diversity of haplotypes is 0.1376–0.5854 in each population of otters. This is consistent with theoretical work, which suggests that population bottlenecks of sea otters probably did not result in major losses of genetic variation for individual populations, or the species as a whole.
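
    Haplotype diversity values such as those quoted above are commonly computed with Nei's unbiased estimator, h = n/(n-1) * (1 - sum of squared haplotype frequencies). The abstract does not say which estimator was used, so the sketch below is an assumption, and the sample labels are invented.

```python
from collections import Counter


def haplotype_diversity(haplotypes):
    """Nei's unbiased haplotype diversity for a sample of haplotype labels."""
    n = len(haplotypes)
    if n < 2:
        raise ValueError("need at least two sampled haplotypes")
    counts = Counter(haplotypes)
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1 - sum_sq)


sample = ["A", "A", "A", "B"]        # made-up sample of four individuals
print(haplotype_diversity(sample))
```

    A sample in which every individual carries the same haplotype gives 0, and a sample in which every individual differs gives 1.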

  1. Addiction Genetics and Pleiotropic Effects of Common Haplotypes that Make Polygenic Contributions to Vulnerability to Substance Dependence

    PubMed Central

    Uhl, George R.; Drgon, Tomas; Johnson, Catherine; Liu, Qing-Rong

    2016-01-01

    Abundant evidence from family, adoption, and twin studies points to large genetic contributions to individual differences in vulnerability to develop dependence on one or more addictive substances. Twin data suggest that most of this genetic vulnerability is shared by individuals who are dependent on a variety of addictive substances. Molecular genetic studies, especially genomewide and candidate gene association studies, have elucidated common haplotypes in dozens of genes that appear to make polygenic contributions to vulnerability to developing dependence. Most genes that harbor currently identified addiction-associated haplotypes are expressed in the brain. Haplotypes in many of the same genes are identified in genomewide association studies that compare allele frequencies in substance dependent vs. control individuals from European, African, and Asian racial/ethnic backgrounds. Many of these addiction-associated haplotypes display pleiotropic influences on a variety of related brain-based phenotypes that display 1) substantial heritability and 2) clinical co-occurrence with substance dependence. PMID:19152208

  2. Association analysis of APOA5 rs662799 and rs3135506 polymorphisms with obesity in Moroccan patients.

    PubMed

    Lakbakbi El Yaagoubi, F; Charoute, H; Bakhchane, A; Ajjemami, M; Benrahma, H; Errouagui, A; Kandil, M; Rouba, H; Barakat, A

    2015-12-01

    The aim of the present study is to explore the association of APOA5 polymorphisms and haplotypes with obesity in Moroccan patients. The study was performed in 459 subjects, obese (n=164) and non-obese (n=295). All subjects were genotyped for the APOA5 -1131T>C (rs662799) and c.56C>G (rs3135506) polymorphisms. The contribution of APOA5 polymorphisms and haplotypes to the increased risk of obesity was explored using logistic regression analyses. The -1131T>C and c.56C>G polymorphisms were significantly associated with obesity. Both polymorphisms were strongly associated with increased BMI. Analysis of constructed haplotypes showed a significant association between CG haplotype and susceptibility to obesity (OR [95%CI]=3.09 [1.93-4.97]; P<0.001). These results support a potential role for APOA5 common variants and related haplotypes as risk factors for obesity. Copyright © 2015 Elsevier Masson SAS. All rights reserved.
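
    For readers unfamiliar with the "OR [95%CI]" notation above, the following sketch shows how an unadjusted odds ratio and its Wald confidence interval are obtained from a 2x2 table. The study itself used logistic regression, so this is a simplification, and the counts in the example are made up rather than taken from the paper.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio and Wald confidence interval from a 2x2 table.

    a: cases carrying the haplotype      b: cases not carrying it
    c: controls carrying the haplotype   d: controls not carrying it
    z: normal quantile (1.96 gives a 95% interval)
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cells must be positive")
    odds_ratio = (a * d) / (b * c)
    se_log = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log)
    upper = math.exp(math.log(odds_ratio) + z * se_log)
    return odds_ratio, lower, upper


# Made-up counts for illustration only
or_value, ci_low, ci_high = odds_ratio_ci(10, 20, 5, 40)
print(or_value, ci_low, ci_high)
```

    An interval whose lower bound stays above 1, as in the result reported in the abstract, indicates that carriers have higher odds of the outcome than non-carriers.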

  3. Morphometric characteristics and COI haplotype diversity of Arctodiaptomus spinosus (Copedoda) populations in soda pans in Hungary.

    PubMed

    Forró, László; Nédli, Judit; Csata, Enikő; Krízsik, Virág; Balogh, Csilla; G-Tóth, László

    2017-09-01

    Arctodiaptomus spinosus (Daday, 1891) is a characteristic species of the soda pan zooplankton in the Great Hungarian Plain. The biogeographical distribution of the species is interesting, since its range expands from the Pannonian Biogeographic region to the other side of the Carpathians, occurring in saline lakes in Eastern Anatolia, Armenia, Iran and in temporary waters in Ukraine. Our investigations focused on the morphometric characteristics and the COI haplotype diversity of four Hungarian populations in the Kiskunság area. We detected substantial morphological differences between the Böddi-szék population and the rest of the sampling sites; however, considerable differences were not observable in the COI haplotypes of the populations. The 20 animals investigated for COI haplotypes belonged to the same haplotype network. Tajima's D indicated departures from the neutral Wright-Fisher population model and suggested population expansion. The genetic composition of Arctodiaptomus spinosus populations in the Kiskunság area is rather uniform.

  4. Linkage disequilibrium in HLA cannot be explained by selective recombination.

    PubMed

    Termijtelen, A; D'Amaro, J; van Rood, J J; Schreuder, G M

    1995-11-01

    Some combinations of HLA-A, -B and -DR antigens occur more frequently than would be expected from their gene frequencies in the population. This phenomenon, referred to as Linkage Disequilibrium (LD), has been the origin of many speculations. One hypothesis to explain LD is that some haplotypes are protected from recombination. A second hypothesis is that these HLA antigens preferentially recombine after cross-over to create an LD haplotype. We tested these 2 hypotheses: from a pool of over 10,000 families typed in our department, we analyzed 126 families in which HLA-A:B or B:DR recombinant offspring were documented. To overcome a possible bias in our material, we used the non-recombined haplotypes from the same 126 families as a control group. Our results show that the number of cross-overs through LD haplotypes is not significantly lower than would be expected if recombination occurred randomly. Also the number of LD haplotypes created upon recombination was not significantly increased.
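
    The notion of a combination occurring "more frequently than would be expected from the gene frequencies" is usually quantified with the coefficient D = p(AB) - p(A)p(B) and its normalised form D'. The sketch below uses invented frequencies and is not taken from the study.

```python
def linkage_disequilibrium(p_ab, p_a, p_b):
    """Return (D, D') for two alleles A and B at different loci.

    p_ab: observed frequency of haplotypes carrying both A and B
    p_a, p_b: population frequencies of allele A and allele B
    """
    d = p_ab - p_a * p_b                 # observed minus expected
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = d / d_max if d_max > 0 else 0.0
    return d, d_prime


# Made-up example: alleles at 20% and 10%, found together on 6% of haplotypes
d, d_prime = linkage_disequilibrium(0.06, 0.2, 0.1)
print(d, d_prime)
```

    Under random association the haplotype would be expected at a frequency of 0.2 * 0.1 = 0.02, so an observed 0.06 corresponds to positive disequilibrium.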

  5. Insight into the Migration Routes of Plutella xylostella in China Using mtCOI and ISSR Markers

    PubMed Central

    Tian, Lixia; Xu, Baoyun; Xie, Wen; Wang, Shaoli; Zhang, Youjun; Wang, Xiangjing; Wu, Qingjun

    2015-01-01

    The larvae of the diamondback moth, Plutella xylostella, cause major economic losses to cruciferous crops, including cabbage, which is an important vegetable crop in China. In this study, we used the mitochondrial COI gene and 11 ISSR markers to characterize the genetic structure and seasonal migration routes of 23 P. xylostella populations in China. Both the mitochondrial and nuclear markers revealed high haplotype diversity and gene flow among the populations, although some degree of genetic isolation was evident between the populations of Hainan Island and other sampling sites. The dominant haplotypes, LX1 and LX2, differed significantly from all other haplotypes both in terms of the number of individuals with those haplotypes and their distributions. Haplotypes that were shared among populations revealed that P. xylostella migrates from the lower reaches of the Yangtze River to northern China and then to northeastern China. Our results also revealed another potential migration route for P. xylostella, i.e., from southwestern China to both northwestern and southern China. PMID:26098353

  6. Global variation in CYP2C8–CYP2C9 functional haplotypes

    PubMed Central

    Speed, William C; Kang, Soonmo Peter; Tuck, David P; Harris, Lyndsay N; Kidd, Kenneth K

    2009-01-01

    We have studied the global frequency distributions of 10 single nucleotide polymorphisms (SNPs) across 132 kb of CYP2C8 and CYP2C9 in ∼2500 individuals representing 45 populations. Five of the SNPs were in noncoding sequences; the other five involved the more common missense variants (four in CYP2C8, one in CYP2C9) that change amino acids in the gene products. One haplotype containing two CYP2C8 coding variants and one CYP2C9 coding variant reaches an average frequency of 10% in Europe; a set of haplotypes with a different CYP2C8 coding variant reaches 17% in Africa. In both cases these haplotypes are found in other regions of the world at <1%. This considerable geographic variation in haplotype frequencies impacts the interpretation of CYP2C8/CYP2C9 association studies, and has pharmacogenomic implications for drug interactions. PMID:19381162

  7. Hb S [β6(A3)Glu→Val, GAG>GTG] in Mexican Mestizos: frequency and analysis of the 5' β-globin haplotype.

    PubMed

    Guzmán, Luis F; Perea, Francisco J; Magaña, María T; Morales-González, Karina R; Chávez-Velazco, M Luz; Ibarra, Bertha

    2010-01-01

    Between 1978 and 2009, we studied 1,863 Mexican Mestizo patients with clinical data compatible with a hemoglobinopathy. Of these patients, 382 had some hemoglobin (Hb) abnormality (20.5%) and 128 had a sickle cell hemoglobinopathy, representing a general frequency of 6.9%, which is similar to the percentage observed in previous studies on Mexican Mestizos. We analyzed the 5' β-globin haplotype (5'Hp) in 79 unrelated β(S) chromosomes (26 β(S)/β(S), 14 β(S)/β(Thal), nine β(S)/β(A) and four β(S)/β(D)), and four haplotypes were observed: 72.2% CAR, 24.1% Benin, 2.5% Senegal and 1.2% Cameroon; the last two are reported for the first time in Mexico. In some Latin American populations such as Brazil, the Bantu haplotype predominates, while in others such as Jamaica, the Benin haplotype is the most frequent, showing heterogeneity of African genes as a consequence of different regions involved in the slave trade.

  8. Probability distribution of haplotype frequencies under the two-locus Wright-Fisher model by diffusion approximation.

    PubMed

    Boitard, Simon; Loisel, Patrice

    2007-05-01

    The probability distribution of haplotype frequencies in a population, and the way it is influenced by genetic forces such as recombination, selection, random drift, etc., is a question of fundamental interest in population genetics. For large populations, the distribution of haplotype frequencies for two linked loci under the classical Wright-Fisher model is almost impossible to compute for numerical reasons. However, the Wright-Fisher process can in such cases be approximated by a diffusion process and the transition density can then be deduced from the Kolmogorov equations. As no exact solution has been found for these equations, we developed a numerical method based on finite differences to solve them. It applies to transient states and models including selection or mutations. We show by several tests that this method is accurate for computing the conditional joint density of haplotype frequencies given that no haplotype has been lost. We also prove that it is far less time consuming than other methods such as Monte Carlo simulations.
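
    As a point of reference for the Monte Carlo alternative mentioned above, the sketch below simulates the neutral two-locus Wright-Fisher model directly. It is not the authors' finite-difference solver; it assumes no selection or mutation, and the function names, parameter values and seed are chosen only for illustration.

```python
import random


def wright_fisher_step(freqs, n_gametes, r, rng):
    """Advance haplotype frequencies (AB, Ab, aB, ab) by one generation.

    Recombination at rate r first shifts the expected frequencies by
    r * D, where D = x_AB * x_ab - x_Ab * x_aB; drift is then modelled
    by drawing n_gametes haplotypes from those expected frequencies.
    """
    x1, x2, x3, x4 = freqs
    d = x1 * x4 - x2 * x3
    expected = [x1 - r * d, x2 + r * d, x3 + r * d, x4 - r * d]
    draws = rng.choices(range(4), weights=expected, k=n_gametes)
    return [draws.count(i) / n_gametes for i in range(4)]


def simulate(freqs, n_gametes, r, generations, seed=None):
    """Run the model for a number of generations and return the frequencies."""
    rng = random.Random(seed)
    for _ in range(generations):
        freqs = wright_fisher_step(freqs, n_gametes, r, rng)
    return freqs


final = simulate([0.25, 0.25, 0.25, 0.25], n_gametes=200, r=0.1,
                 generations=50, seed=1)
print(final)
```

    Estimating a full probability distribution this way requires many independent replicates, which is the computational cost that the abstract contrasts with its diffusion-based method.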

  9. PAVE: program for assembling and viewing ESTs.

    PubMed

    Soderlund, Carol; Johnson, Eric; Bomhoff, Matthew; Descour, Anne

    2009-08-26

    New sequencing technologies are rapidly emerging. Many laboratories are simultaneously working with the traditional Sanger ESTs and experimenting with ESTs generated by the 454 Life Science sequencers. Though Sanger ESTs have been used to generate contigs for many years, no program takes full advantage of the 5' and 3' mate-pair information, hence, many tentative transcripts are assembled into two separate contigs. The new 454 technology has the benefit of high-throughput expression profiling, but introduces time and space problems for assembling large contigs. The PAVE (Program for Assembling and Viewing ESTs) assembler takes advantage of the 5' and 3' mate-pair information by requiring that the mate-pairs be assembled into the same contig and joined by n's if the two sub-contigs do not overlap. It handles the depth of 454 data sets by "burying" similar ESTs during assembly, which retains the expression level information while circumventing time and space problems. PAVE uses MegaBLAST for the clustering step and CAP3 for assembly, however it assembles incrementally to enforce the mate-pair constraint, bury ESTs, and reduce incorrect joins and splits. The PAVE data management system uses a MySQL database to store multiple libraries of ESTs along with their metadata; the management system allows multiple assemblies with variations on libraries and parameters. Analysis routines provide standard annotation for the contigs including a measure of differentially expressed genes across the libraries. A Java viewer program is provided for display and analysis of the results. Our results clearly show the benefit of using the PAVE assembler to explicitly use mate-pair information and bury ESTs for large contigs. The PAVE assembler provides a software package for assembling Sanger and/or 454 ESTs. The assembly software, data management software, Java viewer and user's guide are freely available.

  10. A Saturated Genetic Linkage Map of Autotetraploid Alfalfa (Medicago sativa L.) Developed Using Genotyping-by-Sequencing Is Highly Syntenous with the Medicago truncatula Genome

    PubMed Central

    Li, Xuehui; Wei, Yanling; Acharya, Ananta; Jiang, Qingzhen; Kang, Junmei; Brummer, E. Charles

    2014-01-01

    A genetic linkage map is a valuable tool for quantitative trait locus mapping, map-based gene cloning, comparative mapping, and whole-genome assembly. Alfalfa, one of the most important forage crops in the world, is autotetraploid, allogamous, and highly heterozygous, characteristics that have impeded the construction of a high-density linkage map using traditional genetic marker systems. Using genotyping-by-sequencing (GBS), we constructed low-cost, reasonably high-density linkage maps for both maternal and paternal parental genomes of an autotetraploid alfalfa F1 population. The resulting maps contain 3591 single-nucleotide polymorphism markers on 64 linkage groups across both parents, with an average density of one marker per 1.5 and 1.0 cM for the maternal and paternal haplotype maps, respectively. Chromosome assignments were made based on homology of markers to the M. truncatula genome. Four linkage groups representing the four haplotypes of each alfalfa chromosome were assigned to each of the eight Medicago chromosomes in both the maternal and paternal parents. The alfalfa linkage groups were highly syntenous with M. truncatula, and clearly identified the known translocation between Chromosomes 4 and 8. In addition, a small inversion on Chromosome 1 was identified between M. truncatula and M. sativa. GBS enabled us to develop a saturated linkage map for alfalfa that greatly improved genome coverage relative to previous maps and that will facilitate investigation of genome structure. GBS could be used in breeding populations to accelerate molecular breeding in alfalfa. PMID:25147192

  11. A saturated genetic linkage map of autotetraploid alfalfa (Medicago sativa L.) developed using genotyping-by-sequencing is highly syntenous with the Medicago truncatula genome.

    PubMed

    Li, Xuehui; Wei, Yanling; Acharya, Ananta; Jiang, Qingzhen; Kang, Junmei; Brummer, E Charles

    2014-08-21

    A genetic linkage map is a valuable tool for quantitative trait locus mapping, map-based gene cloning, comparative mapping, and whole-genome assembly. Alfalfa, one of the most important forage crops in the world, is autotetraploid, allogamous, and highly heterozygous, characteristics that have impeded the construction of a high-density linkage map using traditional genetic marker systems. Using genotyping-by-sequencing (GBS), we constructed low-cost, reasonably high-density linkage maps for both maternal and paternal parental genomes of an autotetraploid alfalfa F1 population. The resulting maps contain 3591 single-nucleotide polymorphism markers on 64 linkage groups across both parents, with an average density of one marker per 1.5 and 1.0 cM for the maternal and paternal haplotype maps, respectively. Chromosome assignments were made based on homology of markers to the M. truncatula genome. Four linkage groups representing the four haplotypes of each alfalfa chromosome were assigned to each of the eight Medicago chromosomes in both the maternal and paternal parents. The alfalfa linkage groups were highly syntenous with M. truncatula, and clearly identified the known translocation between Chromosomes 4 and 8. In addition, a small inversion on Chromosome 1 was identified between M. truncatula and M. sativa. GBS enabled us to develop a saturated linkage map for alfalfa that greatly improved genome coverage relative to previous maps and that will facilitate investigation of genome structure. GBS could be used in breeding populations to accelerate molecular breeding in alfalfa. Copyright © 2014 Li et al.

  12. Local environment rather than past climate determines community composition of mountain stream macroinvertebrates across Europe.

    PubMed

    Múrria, Cesc; Bonada, Núria; Vellend, Mark; Zamora-Muñoz, Carmen; Alba-Tercedor, Javier; Sainz-Cantero, Carmen Elisa; Garrido, Josefina; Acosta, Raul; El Alami, Majida; Barquín, Jose; Derka, Tomáš; Álvarez-Cabria, Mario; Sáinz-Bariain, Marta; Filipe, Ana F; Vogler, Alfried P

    2017-11-01

    Community assembly is determined by a combination of historical events and contemporary processes that are difficult to disentangle, but eco-evolutionary mechanisms may be uncovered by the joint analysis of species and genetic diversity across multiple sites. Mountain streams across Europe harbour highly diverse macroinvertebrate communities whose composition and turnover (replacement of taxa) among sites and regions remain poorly known. We studied whole-community biodiversity within and among six mountain regions along a latitudinal transect from Morocco to Scandinavia at three levels of taxonomic hierarchy: genus, species and haplotypes. Using DNA barcoding of four insect families (>3100 individuals, 118 species) across 62 streams, we found that measures of local and regional diversity and intraregional turnover generally declined slightly towards northern latitudes. However, at all hierarchical levels we found complete (haplotype) or high (species, genus) turnover among regions (and even among sites within regions), which counters the expectations of Pleistocene postglacial northward expansion from southern refugia. Species distributions were mostly correlated with environmental conditions, suggesting a strong role of lineage- or species-specific traits in determining local and latitudinal community composition, lineage diversification and phylogenetic community structure (e.g., loss of Coleoptera, but not Ephemeroptera, at northern sites). High intraspecific genetic structure within regions, even in northernmost sites, reflects species-specific dispersal and demographic histories and indicates postglacial migration from geographically scattered refugia, rather than from only southern areas. Overall, patterns were not strongly concordant across hierarchical levels, but consistent with the overriding influence of environmental factors determining community composition at the species and genus levels. © 2017 John Wiley & Sons Ltd.

  13. Haplotype Analysis of the Melanopsin Gene in Seasonal Affective Disorder and Controls

    DTIC Science & Technology

    2007-06-19

    Cole, P. A. (2002). Serotonin n-acetyltransferase: Mechanism and inhibition. Current Medicinal Chemistry , 9(12), 1187-1199. 152 APPENDIX A STRUCTURED ...such that low light levels fall below this threshold during winter in individuals with SAD. The present study investigated the haplotype structure of...Association Studies 51 Advantages of Population-Based Case-Control Samples 52 Haplotype Structure 53 Linkage Disequilibrium: A Measure of Correlation Between

  14. HLA class I antigen and HLA-A, -B, and -C haplotype frequencies in Uruguayans.

    PubMed

    Alvarez, Ines; Bengochea, Milka; Toledo, Roberto; Carretto, Elena; Hidalgo, Pedro C

    2006-08-01

    HLA class I antigens were determined for 959 unrelated Uruguayans. The predominant HLA alleles were A2, Cw4, and B35, and the most frequently observed two-loci haplotypes were A2-B44 and B35-Cw4. The most frequent three-loci HLA haplotype was A2-Cw5-B44. We compared the Uruguayan sample with similar data from other populations.

  15. Effects of ploidy level and haplotype on variation of photosynthetic traits: Novel evidence from two Fragaria species

    PubMed Central

    Gao, Song; Yan, Qiaodi; Chen, Luxi; Song, Yaobin; Fu, Chengxin; Dong, Ming

    2017-01-01

    To reveal the effects of ploidy level and haplotype on photosynthetic traits, we chose 175 genotypes of wild strawberries belonging to two haplotypes at two types of ploidy levels (diploidy and tetraploidy) and measured photosynthetic traits. Our results revealed that ploidy significantly affected the characteristics of light-response curves, CO2-response curves, and leaf gas exchange parameters, except intercellular CO2 concentration (Ci). Tetraploid species had a lower light saturation point (LSP) and CO2 saturation point (CSP), higher light compensation point (LCP), dark respiration (Rd), and CO2 compensation point (CCP) than diploid species. Furthermore, tetraploid species have lower photosynthetic capacity than diploid species, including net photosynthetic rate (Pn), stomatal conductivity (Gs), and transpiration rate (Tr). In addition, haplotype had a significant effect on LSP, CSP, Tr, and Ci as well as a significant interactive effect between ploidy and haplotype on the maximal photosynthetic rate of the light-response curve and Rd. Most of the variance existed within haplotypes among individuals. These results suggest that polyploidization was the main driver for the evolution of photosynthesis with increasing ploidy level (i.e. from diploidy to tetraploidy in Fragaria species), while the origin of a chromosome could also affect the photosynthetic traits and the polyploidization effect on photosynthetic traits. PMID:28644876

  16. Effects of ploidy level and haplotype on variation of photosynthetic traits: Novel evidence from two Fragaria species.

    PubMed

    Gao, Song; Yan, Qiaodi; Chen, Luxi; Song, Yaobin; Li, Junmin; Fu, Chengxin; Dong, Ming

    2017-01-01

    To reveal the effects of ploidy level and haplotype on photosynthetic traits, we chose 175 genotypes of wild strawberries belonging to two haplotypes at two types of ploidy levels (diploidy and tetraploidy) and measured photosynthetic traits. Our results revealed that ploidy significantly affected the characteristics of light-response curves, CO2-response curves, and leaf gas exchange parameters, except intercellular CO2 concentration (Ci). Tetraploid species had a lower light saturation point (LSP) and CO2 saturation point (CSP), higher light compensation point (LCP), dark respiration (Rd), and CO2 compensation point (CCP) than diploid species. Furthermore, tetraploid species have lower photosynthetic capacity than diploid species, including net photosynthetic rate (Pn), stomatal conductivity (Gs), and transpiration rate (Tr). In addition, haplotype had a significant effect on LSP, CSP, Tr, and Ci as well as a significant interactive effect between ploidy and haplotype on the maximal photosynthetic rate of the light-response curve and Rd. Most of the variance existed within haplotypes among individuals. These results suggest that polyploidization was the main driver for the evolution of photosynthesis with increasing ploidy level (i.e. from diploidy to tetraploidy in Fragaria species), while the origin of a chromosome could also affect the photosynthetic traits and the polyploidization effect on photosynthetic traits.

  17. Genetic evidence for nonrandom sorting of mitochondria in the basidiomycete Agrocybe aegerita.

    PubMed Central

    Barroso, G; Labarère, J

    1997-01-01

    We studied mitochondrial transmission in the homobasidiomycete Agrocybe aegerita during plasmogamy, vegetative growth, and basidiocarp differentiation. Plasmogamy between homokaryons from progeny of three wild-type strains resulted in bidirectional nuclear migration, and the dikaryotization speed was dependent on the nuclear genotype of the recipient homokaryon. Little mitochondrial migration accompanied the nuclear migration. A total of 75% of the dikaryons from the fusion lines had both parental mitochondrial haplotypes (mixed dikaryons), and 25% had only a single haplotype (homoplasmic dikaryons); with some matings, there was a strong bias in favor of one parental haplotype. We demonstrated the heteroplasmic nature of mixed dikaryons by (i) isolating and subculturing apical cells in micromanipulation experiments and (ii) identifying recombinant mitochondrial genomes. This heteroplasmy is consistent with the previously reported suggestion that there is recombination between mitochondrial alleles in A. aegerita. Conversion of heteroplasmons into homoplasmons occurred (i) during long-term storage, (ii) in mycelia regenerated from isolated apical cells, and (iii) during basidiocarp differentiation. Homokaryons that readily accepted foreign nuclei were the most efficient homokaryons in maintaining their mitochondrial haplotype during plasmogamy, long-term storage, and basidiocarp differentiation. This suggests that the mechanism responsible for the nonrandom retention or elimination of a given haplotype may be related to the nuclear genotype or the mitochondrial haplotype or both. PMID:9406387

  18. Genetic diversity and genetic structure of the striped field mouse Apodemus agrarius coreae (Muridae, Rodentia) in Korea.

    PubMed

    Kim, Hye Ri; Park, Yung Chul

    2015-11-10

    The aim of this study was to investigate the genetic diversity and genetic structure of the striped field mouse Apodemus agrarius coreae in Korea. The Korean A. a. coreae is characterized by high levels of haplotype diversity (Hd=0.967) and low levels of nucleotide diversity (π=0.00683). Haplogroup 1 is well separated from the haplotypes of the neighboring regions of the Korean Peninsula, while the other haplogroups are closely related to those from the Russian Far East. Thus, further investigations are required to confirm the validity of the subspecies status of A. a. coreae by implementing additional morphological characters as well as genetic data from the populations present in the Korean Peninsula and its neighboring countries. Haplogroup 1 includes most Korean haplotypes and forms a star-like haplotype network structure, which reveals relatively low levels of sequence divergence and a high frequency of unique mutations (only a few mutations are shared in most of the haplotype nodes). The results indicate that the haplotypes of Haplogroup 1 might have experienced population expansion since their migration into Korea, which was further corroborated by negative results of neutrality tests for the Korean population of A. a. coreae. Copyright © 2015. Published by Elsevier B.V.
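
    The two summary statistics quoted in this record (Hd and π) can be illustrated with a minimal sketch. It assumes the commonly used definitions (haplotype diversity as n/(n-1) times one minus the sum of squared haplotype frequencies, and nucleotide diversity as the mean number of pairwise differences per site); the record does not state which estimator the authors used, and the function names and toy sequences below are hypothetical.

```python
from collections import Counter
from itertools import combinations


def haplotype_diversity(seqs):
    """Hd = n/(n-1) * (1 - sum of squared haplotype frequencies)."""
    n = len(seqs)
    freqs = [count / n for count in Counter(seqs).values()]
    return n / (n - 1) * (1 - sum(f * f for f in freqs))


def nucleotide_diversity(seqs):
    """Mean pairwise differences per site over aligned, equal-length sequences."""
    length = len(seqs[0])
    pairs = list(combinations(seqs, 2))
    diffs = sum(sum(a != b for a, b in zip(s, t)) for s, t in pairs)
    return diffs / len(pairs) / length


# Hypothetical aligned sequences, for illustration only.
toy = ["AAAA", "AAAT", "AATT", "AAAA"]
hd = haplotype_diversity(toy)
pi = nucleotide_diversity(toy)
print(round(hd, 4), round(pi, 4))
```

    A high Hd together with a low π, as reported above, corresponds to many distinct haplotypes that differ from one another at only a few sites.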

  19. The influence of casein haplotype on morphometric characteristics of fat globules and fatty acid composition of milk in Italian Holstein cows.

    PubMed

    Perna, Annamaria; Intaglietta, Immacolata; Simonetti, Amalia; Gambacorta, Emilio

    2016-04-01

    The aim of this work was to investigate the effect of casein haplotypes (αS1-, β-, and κ-caseins) on morphometric characteristics of fat globules and fatty acid composition of Italian Holstein milk. Casein haplotypes were determined by isoelectric focusing; milk fat globule size was measured by using a fluorescence microscope; and fatty acid profile was determined by gas chromatography. Casein haplotype significantly affected the fat globule size, the percentage incidence of each globule size class on total measured milk fat globules, and fatty acid composition. A higher incidence of smaller milk fat globules was associated with the BB-A(2)A(2)-BB genotype (αS1-, β-, and κ-casein haplotypes, respectively), whereas small globules were not detected in BB-A(2)A(1)-AA milk, but that milk had the highest percentage of large globules. A higher content of monounsaturated fatty acids was associated with the BB-A(2)A(2)-AB genotype, whereas higher contents of conjugated linoleic acid and docosahexaenoic acid were detected in BB-A(1)A(1)-AA milk. Our results indicate that casein haplotype could affect fat characteristics and, therefore, the nutritional and technological quality of milk. Copyright © 2016 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  20. Haplotypes in the APOA1-C3-A4-A5 gene cluster affect plasma lipids in both humans and baboons

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wang, Qian-fei; Liu, Xin; O'Connell, Jeff

    2003-09-15

    Genetic studies in non-human primates serve as a potential strategy for identifying genomic intervals where polymorphisms impact upon human disease-related phenotypes. It remains unclear, however, whether independently arising polymorphisms in orthologous regions of non-human primates lead to similar variation in a quantitative trait found in both species. To explore this paradigm, we studied a baboon apolipoprotein gene cluster (APOA1/C3/A4/A5) for which the human gene orthologs have well established roles in influencing plasma HDL-cholesterol and triglyceride concentrations. Our extensive polymorphism analysis of this 68 kb gene cluster in 96 pedigreed baboons identified several haplotype blocks each with limited diversity, consistent with haplotype findings in humans. To determine whether baboons, like humans, also have particular haplotypes associated with lipid phenotypes, we genotyped 634 well characterized baboons using 16 haplotype tagging SNPs. Genetic analysis of single SNPs, as well as haplotypes, revealed an association of APOA5 and APOC3 variants with HDL cholesterol and triglyceride concentrations, respectively. Thus, independent variation in orthologous genomic intervals does associate with similar quantitative lipid traits in both species, supporting the possibility of uncovering human QTL genes in a highly controlled non-human primate model.

  1. Integration of genotoxicity and population genetic analyses in kangaroo rats (Dipodomys merriami) exposed to radionuclide contamination at the Nevada Test Site, USA

    USGS Publications Warehouse

    Theodorakis, Christopher W.; Bickham, John W.; Lamb, Trip; Medica, Philip A.; Lyne, T. Barrett

    2001-01-01

    We examined effects of radionuclide exposure at two atomic blast sites on kangaroo rats (Dipodomys merriami) at the Nevada Test Site, Nevada, USA, using genotoxicity and population genetic analyses. We assessed chromosome damage by micronucleus and flow cytometric assays and genetic variation by randomly amplified polymorphic DNA (RAPD) and mitochondrial DNA (mtDNA) analyses. The RAPD analysis showed no population structure, but mtDNA exhibited differentiation among and within populations. Genotoxicity effects were not observed when all individuals were analyzed. However, individuals with mtDNA haplotypes unique to the contaminated sites had greater chromosomal damage than contaminated-site individuals with haplotypes shared with reference sites. When interpopulation comparisons used individuals with unique haplotypes, one contaminated site had greater levels of chromosome damage than one or both of the reference sites. We hypothesize that shared-haplotype individuals are potential migrants and that unique-haplotype individuals are potential long-term residents. A parsimony approach was used to estimate the minimum number of migration events necessary to explain the haplotype distributions on a phylogenetic tree. The observed predominance of migration events into the contaminated sites supported our migration hypothesis. We conclude the atomic blast sites are ecological sinks and that immigration masks the genotoxic effects of radiation on the resident populations.

  2. Myopia and Late-Onset Progressive Cone Dystrophy Associate to LVAVA/MVAVA Exon 3 Interchange Haplotypes of Opsin Genes on Chromosome X.

    PubMed

    Orosz, Orsolya; Rajta, István; Vajas, Attila; Takács, Lili; Csutak, Adrienne; Fodor, Mariann; Kolozsvári, Bence; Resch, Miklós; Sényi, Katalin; Lesch, Balázs; Szabó, Viktória; Berta, András; Balogh, István; Losonczy, Gergely

    2017-03-01

    Rare interchange haplotypes in exon 3 of the OPN1LW and OPN1MW opsin genes cause X-linked myopia, color vision defect, and cone dysfunction. The severity of the disease varies on a broad scale from nonsyndromic high myopia to blue cone monochromatism. Here, we describe a new genotype-phenotype correlation attributed to rare exon 3 interchange haplotypes simultaneously present in the long- and middle-wavelength sensitive opsin genes (L- and M-opsin genes). A multigenerational family with X-linked high myopia and cone dystrophy was investigated. Affected male patients had infantile onset myopia with normal visual acuity and color vision until their forties. Visual acuity decreased thereafter, along with the development of severe protan and deutan color vision defects. A mild decrease in electroretinography response of cone photoreceptors was detected in childhood, which further deteriorated in middle-aged patients. Rods were also affected, however, to a lesser extent than cones. Clinical exome sequencing identified the LVAVA and MVAVA toxic haplotypes in the OPN1LW and OPN1MW opsin genes, respectively. Here, we show that LVAVA haplotype of the OPN1LW gene and MVAVA haplotype of the OPN1MW gene cause apparently nonsyndromic high myopia in young patients but lead to progressive cone-rod dystrophy with deuteranopia and protanopia in middle-aged patients corresponding to a previously unknown disease course. To the best of our knowledge, this is the first report on the joint effect of these toxic haplotypes in the two opsin genes on chromosome X.

  3. Combined influence of LDLR and HMGCR sequence variation on lipid-lowering response to simvastatin

    PubMed Central

    Mangravite, Lara M.; Medina, Marisa Wong; Cui, Jinrui; Pressman, Sheila; Smith, Joshua D.; Rieder, Mark J.; Guo, Xiuqing; Nickerson, Deborah A.; Rotter, Jerome I.; Krauss, Ronald M.

    2010-01-01

    Objectives Although statins are efficacious for lowering LDL-cholesterol (LDLC), there is wide inter-individual variation in response. We tested the extent to which combined effects of common alleles of LDLR and HMGCR can contribute to this variability. Methods and Results Haplotypes in the LDLR 3′-untranslated region (3UTR) were tested for association with lipid-lowering response to simvastatin treatment in the Cholesterol and Pharmacogenetics (CAP) trial (335 African-Americans and 609 European-Americans). LDLR haplotype 5 (L5) was associated with smaller simvastatin-induced reductions in LDLC, total cholesterol, non-HDL cholesterol, and apolipoprotein B (P=0.0002–0.03) in African-Americans, but not European-Americans. The combined presence of L5 and previously described HMGCR haplotypes in African-Americans was associated with significantly attenuated apoB reduction (−22.4±1.5%, N=89) both compared to noncarriers (−30.6±1.5%, N=78, P=0.0001) and to carriers of either individual haplotype (−28.2±1.1%, N=158, P=0.001). We observed similar differences when measuring simvastatin-mediated induction of LDLR surface expression using lymphoblast cell lines (P=0.03). Conclusions We have identified a common LDLR 3UTR haplotype that is associated with attenuated lipid-lowering response to simvastatin treatment. Response was further reduced in individuals with both LDLR and previously described HMGCR haplotypes. Previously identified racial differences in statin efficacy were partially explained by increased prevalence of these combined haplotypes in African-Americans. PMID:20413733

  4. Hypercontrols in genotype-phenotype analysis reveal ancestral haplotypes associated with essential hypertension.

    PubMed

    Balam-Ortiz, Eros; Esquivel-Villarreal, Adolfo; Huerta-Hernandez, David; Fernandez-Lopez, Juan Carlos; Alfaro-Ruiz, Luis; Muñoz-Monroy, Omar; Gutierrez, Ruth; Figueroa-Genis, Enrique; Carrillo, Karol; Elizalde, Adela; Hidalgo, Alfredo; Rodriguez, Mauricio; Urushihara, Maki; Kobori, Hiroyuki; Jimenez-Sanchez, Gerardo

    2012-04-01

    The angiotensinogen gene locus has been associated with essential hypertension in most populations analyzed to date. Increased plasma angiotensinogen levels have been proposed as an underlying cause of essential hypertension in whites; however, differences in the genetic regulation of plasma angiotensinogen levels have also been reported for other populations. The aim of this study was to analyze the relationship between angiotensinogen gene polymorphisms and haplotypes with plasma angiotensinogen levels and the risk of essential hypertension in the Mexican population. We genotyped 9 angiotensinogen gene polymorphisms in 706 individuals. Four polymorphisms, A-6, C4072, C6309, and G12775, were associated with increased risk, and the strongest association was found for the C6309 allele (χ²=23.9; P=0.0000009), which resulted in an odds ratio of 3.0 (95% CI: 1.8-4.9; P=0.000006) in the recessive model. Two polymorphisms, A-20C (P=0.003) and C3389T (P=0.0001), were associated with increased plasma angiotensinogen levels but did not show association with essential hypertension. The haplotypes H1 (χ²=8.1; P=0.004) and H5 (χ²=5.1; P=0.02) were associated with essential hypertension. Using phylogenetic analysis, we found that haplotypes 1 and 5 are the human ancestral haplotypes. Our results suggest that the positive association between angiotensinogen gene polymorphisms and haplotypes with essential hypertension is not simply explained by an increase in plasma angiotensinogen concentration. Complex interactions between risk alleles suggest that these haplotypes act as "superalleles."

  5. Hypercontrols in Genotype-Phenotype Analysis Reveal Ancestral Haplotypes Associated With Essential Hypertension

    PubMed Central

    Balam-Ortiz, Eros; Esquivel-Villarreal, Adolfo; Huerta-Hernandez, David; Fernandez-Lopez, Juan Carlos; Alfaro-Ruiz, Luis; Muñoz-Monroy, Omar; Gutierrez, Ruth; Figueroa-Genis, Enrique; Carrillo, Karol; Elizalde, Adela; Hidalgo, Alfredo; Rodriguez, Mauricio; Urushihara, Maki; Kobori, Hiroyuki; Jimenez-Sanchez, Gerardo

    2012-01-01

    The angiotensinogen gene locus has been associated with essential hypertension in most populations analyzed to date. Increased plasma angiotensinogen levels have been proposed as an underlying cause of essential hypertension in whites; however, differences in the genetic regulation of plasma angiotensinogen levels have also been reported for other populations. The aim of this study was to analyze the relationship between angiotensinogen gene polymorphisms and haplotypes with plasma angiotensinogen levels and the risk of essential hypertension in the Mexican population. We genotyped 9 angiotensinogen gene polymorphisms in 706 individuals. Four polymorphisms, A-6, C4072, C6309, and G12775, were associated with increased risk, and the strongest association was found for the C6309 allele (χ2 = 23.9; P = 0.0000009), which resulted in an odds ratio of 3.0 (95% CI: 1.8–4.9; P = 0.000006) in the recessive model. Two polymorphisms, A-20C (P = 0.003) and C3389T (P = 0.0001), were associated with increased plasma angiotensinogen levels but did not show association with essential hypertension. The haplotypes H1 (χ2 = 8.1; P = 0.004) and H5 (χ2 = 5.1; P = 0.02) were associated with essential hypertension. Using phylogenetic analysis, we found that haplotypes 1 and 5 are the human ancestral haplotypes. Our results suggest that the positive association between angiotensinogen gene polymorphisms and haplotypes with essential hypertension is not simply explained by an increase in plasma angiotensinogen concentration. Complex interactions between risk alleles suggest that these haplotypes act as “superalleles.” PMID:22371359

  6. Mutations in the von Hippel-Lindau (VHL) tumor suppressor gene and VHL-haplotype analysis in patients with presumable congenital erythrocytosis.

    PubMed

    Cario, Holger; Schwarz, Klaus; Jorch, Norbert; Kyank, Ulrike; Petrides, Petro E; Schneider, Dominik T; Uhle, Renate; Debatin, Klaus-Michael; Kohne, Elisabeth

    2005-01-01

    Congenital erythrocytoses or polycythemias are rare and heterogeneous. A homozygous mutation (C598T->Arg200Trp) in the von Hippel-Lindau (VHL) gene was originally identified as the cause of the endemic Chuvash polycythemia. Subsequently this and other mutations in the VHL gene were also detected in several patients of different ethnic origin. Haplotype analyses of the VHL gene suggested a common origin for the Chuvash-type mutation. Thirty-four patients with presumable congenital erythrocytosis due to an unknown underlying disorder were examined for VHL gene mutations and VHL region haplotypes. Four patients were homozygous and one patient heterozygous for the Chuvash-type mutation. One additional patient presented a previously not described heterozygous mutation G311->T VHL in exon 1. The haplotype analyses were in agreement with recently published data for three of the four patients with homozygous mutations as well as for the patient with a heterozygous Chuvash-type mutation. One patient of Turkish origin with homozygous Chuvash-type mutation had a haplotype not previously found in individuals with Chuvash-type mutation. These results confirm that mutations in the VHL gene are responsible for a substantial proportion of patients with congenital erythrocytoses. Erythrocytoses due to a C598->T mutation of the VHL gene are not geographically restricted. The majority of patients with Chuvash polycythemia share a common VHL gene haplotype. The different haplotype in one of the patients with Chuvash-type mutation indicates that this mutation was not spread only from a single founder but developed independently in other individuals.

  7. Lipoprotein lipase variants associated with an endophenotype of hypertension: hypertension combined with elevated triglycerides.

    PubMed

    Chen, Pei; Jou, Yuh-Shan; Fann, Cathy S J; Chen, Jaw-Wen; Chung, Chia-Min; Lin, Chin-Yu; Wu, Sheng-Yeu; Kang, Mei-Jyh; Chen, Ying-Chuang; Jong, Yuh-Shiun; Lo, Huey-Ming; Kang, Chih-Sen; Chen, Chien-Chung; Chang, Huan-Cheng; Huang, Nai-Kuei; Wu, Yi-Lin; Pan, Wen-Harn

    2009-01-01

    Previously, we observed that young-onset hypertension was independently associated with elevated plasma triglyceride(s) (TG) levels to a greater extent than other metabolic risk factors. Thus, focusing on the endophenotype--hypertension combined with elevated TG--we designed a family-based haplotype association study to explore its genetic connection with novel genetic variants of lipoprotein lipase gene (LPL), which encodes a major lipid metabolizing enzyme. Young-onset hypertension probands and their families were recruited, numbering 1,002 individuals from 345 families. Single-nucleotide polymorphism discovery for LPL, linkage disequilibrium (LD) analysis, transmission disequilibrium tests (TDT), bin construction, haplotype TDT association and logistic regression analysis were performed. We found that the CC- haplotype (i) spanning from intron 2 to intron 4 and the ACATT haplotype (ii) spanning from intron 5 to intron 6 were significantly associated with hypertension-related phenotypes: hypertension (ii, P=0.05), elevated TG (i, P=0.01), and hypertension combined with elevated TG (i, P=0.001; ii, P<0.0001), according to TDT. The risk of this hypertension subtype increased with the number of risk haplotypes in the two loci, using logistic regression model after adjusting within-family correlation. The relationships between LPL variants and hypertension-related disorders were also confirmed by an independent association study. Finally, we showed a trend that individuals with homozygous risk haplotypes had decreased LPL expression after a fatty meal, as opposed to those with protective haplotypes. In conclusion, this study strongly suggests that two LPL intronic variants may be associated with development of the hypertension endophenotype with elevated TG. Copyright 2008 Wiley-Liss, Inc.

  8. Analysis of molecular variance inferred from metric distances among DNA haplotypes: application to human mitochondrial DNA restriction data.

    PubMed

    Excoffier, L; Smouse, P E; Quattro, J M

    1992-06-01

    We present here a framework for the study of molecular variation within a single species. Information on DNA haplotype divergence is incorporated into an analysis of variance format, derived from a matrix of squared-distances among all pairs of haplotypes. This analysis of molecular variance (AMOVA) produces estimates of variance components and F-statistic analogs, designated here as phi-statistics, reflecting the correlation of haplotypic diversity at different levels of hierarchical subdivision. The method is flexible enough to accommodate several alternative input matrices, corresponding to different types of molecular data, as well as different types of evolutionary assumptions, without modifying the basic structure of the analysis. The significance of the variance components and phi-statistics is tested using a permutational approach, eliminating the normality assumption that is conventional for analysis of variance but inappropriate for molecular data. Application of AMOVA to human mitochondrial DNA haplotype data shows that population subdivisions are better resolved when some measure of molecular differences among haplotypes is introduced into the analysis. At the intraspecific level, however, the additional information provided by knowing the exact phylogenetic relations among haplotypes or by a nonlinear translation of restriction-site change into nucleotide diversity does not significantly modify the inferred population genetic structure. Monte Carlo studies show that site sampling does not fundamentally affect the significance of the molecular variance components. The AMOVA treatment is easily extended in several different directions and it constitutes a coherent and flexible framework for the statistical analysis of molecular data.
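
    The procedure summarized in this record can be sketched for the simplest case: a single level of subdivision (populations within a total sample). This is a minimal illustration under assumed textbook formulas for a one-way AMOVA (sums of squared deviations taken from the squared-distance matrix, a variance-component estimate using the usual weighted group size n0, and a label-permutation test); it is not the authors' software, and the function names and toy haplotypes are hypothetical.

```python
import random


def phi_st(sq_dist, groups):
    """One-level AMOVA: Phi_ST from a squared-distance matrix and group labels."""
    n = len(groups)
    members = {}
    for i, g in enumerate(groups):
        members.setdefault(g, []).append(i)
    n_groups = len(members)

    # Sums of squared deviations, computed from pairwise squared distances.
    ssd_total = sum(sum(row) for row in sq_dist) / (2.0 * n)
    ssd_within = sum(
        sum(sq_dist[i][j] for i in idx for j in idx) / (2.0 * len(idx))
        for idx in members.values()
    )
    ssd_among = ssd_total - ssd_within

    # Mean squares and variance components.
    msd_among = ssd_among / (n_groups - 1)
    msd_within = ssd_within / (n - n_groups)
    n0 = (n - sum(len(idx) ** 2 for idx in members.values()) / n) / (n_groups - 1)
    var_within = msd_within
    var_among = (msd_among - msd_within) / n0

    total = var_among + var_within
    return var_among / total if total != 0 else float("nan")


def permutation_test(sq_dist, groups, n_perm=999, seed=0):
    """Permute group labels to obtain a null distribution for Phi_ST."""
    rng = random.Random(seed)
    observed = phi_st(sq_dist, groups)
    labels = list(groups)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(labels)
        if phi_st(sq_dist, labels) >= observed:
            hits += 1
    return observed, (hits + 1) / (n_perm + 1)


# Hypothetical restriction-site haplotypes (1 = site present), two populations.
haps = [[0, 0, 0, 0]] * 3 + [[1, 1, 1, 1]] * 3
groups = ["A"] * 3 + ["B"] * 3
# Squared distance taken here as the number of differing sites.
sq_dist = [[sum(a != b for a, b in zip(h1, h2)) for h2 in haps] for h1 in haps]

phi, p_value = permutation_test(sq_dist, groups)
print(phi, p_value)
```

    Because the test permutes labels rather than assuming normally distributed residuals, it matches the permutational approach the abstract describes; swapping in a different squared-distance matrix changes the input without changing the structure of the analysis.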

  9. Testing for post-copulatory selection for major histocompatibility complex genotype in a semi-free-ranging primate population.

    PubMed

    Setchell, Joanna M; Abbott, Kristin M; Gonzalez, Jean-Paul; Knapp, Leslie A

    2013-10-01

    A large body of evidence suggests that major histocompatibility complex (MHC) genotype influences mate choice. However, few studies have investigated MHC-mediated post-copulatory mate choice under natural, or even semi-natural, conditions. We set out to explore this question in a large semi-free-ranging population of mandrills (Mandrillus sphinx) using MHC-DRB genotypes for 127 parent-offspring triads. First, we showed that offspring MHC heterozygosity correlates positively with parental MHC dissimilarity suggesting that mating among MHC dissimilar mates is efficient in increasing offspring MHC diversity. Second, we compared the haplotypes of the parental dyad with those of the offspring to test whether post-copulatory sexual selection favored offspring with two different MHC haplotypes, more diverse gamete combinations, or greater within-haplotype diversity. Limited statistical power meant that we could only detect medium or large effect sizes. Nevertheless, we found no evidence for selection for heterozygous offspring when parents share a haplotype (large effect size), genetic dissimilarity between parental haplotypes (we could detect an odds ratio of ≥1.86), or within-haplotype diversity (medium-large effect). These findings suggest that comparing parental and offspring haplotypes may be a useful approach to test for post-copulatory selection when matings cannot be observed, as is the case in many study systems. However, it will be extremely difficult to determine conclusively whether post-copulatory selection mechanisms for MHC genotype exist, particularly if the effect sizes are small, due to the difficulty in obtaining a sufficiently large sample. © 2013 Wiley Periodicals, Inc.

  10. Restricted genetic variation in populations of Achatina (Lissachatina) fulica outside of East Africa and the Indian Ocean Islands points to the Indian Ocean Islands as the earliest known common source.

    PubMed

    Fontanilla, Ian Kendrich C; Sta Maria, Inna Mikaella P; Garcia, James Rainier M; Ghate, Hemant; Naggs, Fred; Wade, Christopher M

    2014-01-01

    The Giant African Land Snail, Achatina (= Lissachatina) fulica Bowdich, 1822, is a tropical crop pest species with a widespread distribution across East Africa, the Indian subcontinent, Southeast Asia, the Pacific, the Caribbean, and North and South America. Its current distribution is attributed primarily to the introduction of the snail to new areas by Man within the last 200 years. This study determined the extent of genetic diversity in global A. fulica populations using the mitochondrial 16S ribosomal RNA gene. A total of 560 individuals were evaluated from 39 global populations obtained from 26 territories. Results reveal 18 distinct A. fulica haplotypes; 14 are found in East Africa and the Indian Ocean islands, but only two haplotypes from the Indian Ocean islands emerged from this region, the C haplotype, now distributed across the tropics, and the D haplotype in Ecuador and Bolivia. Haplotype E from the Philippines, F from New Caledonia and Barbados, O from India and Q from Ecuador are variants of the emergent C haplotype. For the non-native populations, the lack of genetic variation points to founder effects due to the lack of multiple introductions from the native range. Our current data could only point with certainty to the Indian Ocean islands as the earliest known common source of A. fulica across the globe, which necessitates further sampling in East Africa to determine the source populations of the emergent haplotypes.

  11. Restricted Genetic Variation in Populations of Achatina (Lissachatina) fulica outside of East Africa and the Indian Ocean Islands Points to the Indian Ocean Islands as the Earliest Known Common Source

    PubMed Central

    Fontanilla, Ian Kendrich C.; Sta. Maria, Inna Mikaella P.; Garcia, James Rainier M.; Ghate, Hemant; Naggs, Fred; Wade, Christopher M.

    2014-01-01

    The Giant African Land Snail, Achatina (= Lissachatina) fulica Bowdich, 1822, is a tropical crop pest species with a widespread distribution across East Africa, the Indian subcontinent, Southeast Asia, the Pacific, the Caribbean, and North and South America. Its current distribution is attributed primarily to the introduction of the snail to new areas by Man within the last 200 years. This study determined the extent of genetic diversity in global A. fulica populations using the mitochondrial 16S ribosomal RNA gene. A total of 560 individuals were evaluated from 39 global populations obtained from 26 territories. Results reveal 18 distinct A. fulica haplotypes; 14 are found in East Africa and the Indian Ocean islands, but only two haplotypes from the Indian Ocean islands emerged from this region, the C haplotype, now distributed across the tropics, and the D haplotype in Ecuador and Bolivia. Haplotype E from the Philippines, F from New Caledonia and Barbados, O from India and Q from Ecuador are variants of the emergent C haplotype. For the non-native populations, the lack of genetic variation points to founder effects due to the lack of multiple introductions from the native range. Our current data could only point with certainty to the Indian Ocean islands as the earliest known common source of A. fulica across the globe, which necessitates further sampling in East Africa to determine the source populations of the emergent haplotypes. PMID:25203830

  12. Origin of Japanese White-Eyes and Brown-Eared Bulbuls on the Volcano Islands.

    PubMed

    Sugita, Norimasa; Kawakami, Kazuto; Nishiumi, Isao

    2016-04-01

    The Ogasawara Archipelago comprises two groups of oceanic islands: the Bonin Islands, formed in the Paleogene, and the Volcano Islands, formed in the Quaternary. These groups are located within a moderate distance (ca. 160-270 km) of one another; thus, most land bird species are not distinguished as different subspecies. Two land birds, however, show unusual distribution. The Japanese white-eye Zosterops japonicus originally inhabited only the Volcano Islands, but has been introduced to the Bonin Islands. The brown-eared bulbuls Hypsipetes amaurotis are distributed as a different subspecies. We investigated their genetic differences and divergences in the Ogasawara Archipelago using mitochondrial DNA. The Volcano population of white-eyes had four endemic haplotypes that were divergent from one another, except for the Bonin population, which shared three haplotypes with the Volcano, Izu, and Ryukyu Islands and did not have any endemic haplotype. This is the first genetic suggestion that the Bonin population is a hybrid of introduced populations. With respect to bulbuls, the Volcano and Bonin Islands each had a single endemic haplotype. The Volcano haplotype is closest to a haplotype shared with Izu, the Japanese mainland, Daito and Ryukyu, whereas the Bonin haplotype is closest to one endemic to the south Ryukyu Islands. This indicates that the sources of the two bulbul populations can be geologically and temporally distinguished. The populations of the two species in the Ogasawara Archipelago are irreplaceable, owing to their genetic differences and should be regarded as evolutionarily significant units. In order to prevent introgression between the two populations, we must restrict interisland transfers.

  13. Vitamin K epoxide reductase complex subunit 1 (Vkorc1) haplotype diversity in mouse priority strains

    PubMed Central

    Song, Ying; Vera, Nicole; Kohn, Michael H

    2008-01-01

    Background Polymorphisms in the vitamin K-epoxide reductase complex subunit 1 gene, Vkorc1, could affect blood coagulation and other vitamin K-dependent proteins, such as osteocalcin (bone Gla protein, BGP). Here we sequenced the Vkorc1 gene in 40 mouse priority strains. We analyzed Vkorc1 haplotypes with respect to prothrombin time (PT) and bone mineral density and composition (BMD and BMC); phenotypes expected to be vitamin K-dependent and represented by data in the Mouse Phenome Database (MPD). Findings In the commonly used laboratory strains of Mus musculus domesticus we identified only four haplotypes differing in the intron or 5' region sequence of Vkorc1. Six haplotypes differing by coding and non-coding polymorphisms were identified in the other subspecies of Mus. We detected no significant association of Vkorc1 haplotypes with PT, BMD and BMC within each subspecies of Mus. Vkorc1 haplotype sequence divergence between subspecies was associated with PT, BMD and BMC. Conclusion Phenotypic variation in PT, BMD and BMC within subspecies of Mus, while substantial, appears to be dominated by genetic variation in genes other than Vkorc1. This was particularly evident for M. m. domesticus, where a single haplotype was observed in conjunction with virtually the entire range of PT, BMD and BMC values of all 5 subspecies of Mus included in this study. Differences in these phenotypes between subspecies also should not be attributed to Vkorc1 variants, but should be viewed as a result of genome wide genetic divergence. PMID:19046458

  14. Huntington disease in the South African population occurs on diverse and ethnically distinct genetic haplotypes

    PubMed Central

    Baine, Fiona K; Kay, Chris; Ketelaar, Maria E; Collins, Jennifer A; Semaka, Alicia; Doty, Crystal N; Krause, Amanda; Jacquie Greenberg, L; Hayden, Michael R

    2013-01-01

    Huntington disease (HD) is a neurodegenerative disorder resulting from the expansion of a CAG trinucleotide repeat in the huntingtin (HTT) gene. Worldwide prevalence varies geographically with the highest figures reported in populations of European ancestry. HD in South Africa has been reported in Caucasian, black and mixed subpopulations, with similar estimated prevalence in the Caucasian and mixed groups and a lower estimate in the black subpopulation. Recent studies have associated specific HTT haplotypes with HD in distinct populations. Expanded HD alleles in Europe occur predominantly on haplogroup A (specifically high-risk variants A1/A2), whereas in East Asian populations, HD alleles are associated with haplogroup C. Whether specific HTT haplotypes associate with HD in black Africans and how these compare with haplotypes found in European and East Asian populations remains unknown. The current study genotyped the HTT region in unaffected individuals and HD patients from each of the South African subpopulations, and haplotypes were constructed. CAG repeat sizes were determined and phased to haplotype. Results indicate that HD alleles from Caucasian and mixed patients are predominantly associated with haplogroup A, signifying a similar European origin for HD. However, in black patients, HD occurs predominantly on haplogroup B, suggesting several distinct origins of the mutation in South Africa. The absence of high-risk variants (A1/A2) in the black subpopulation may also explain the reported low prevalence of HD. Identification of haplotypes associated with HD-expanded alleles is particularly relevant to the development of population-specific therapeutic targets for selective suppression of the expanded HTT transcript. PMID:23463025

  15. Hereditary tyrosinemia type I: strong association with haplotype 6 in French Canadians permits simple carrier detection and prenatal diagnosis.

    PubMed Central

    Demers, S. I.; Phaneuf, D.; Tanguay, R. M.

    1994-01-01

    Hereditary tyrosinemia type 1 (HT1), a severe inborn error of tyrosine catabolism, is caused by deficiency of the terminal enzyme, fumarylacetoacetate hydrolase (FAH). The highest reported frequency of HT1 is in the French Canadian population, especially in the Saguenay-Lac-St-Jean region. Using human FAH cDNA probes, we have identified 10 haplotypes with TaqI, KpnI, RsaI, BglII, and MspI RFLPs in 118 normal chromosomes from the French Canadian population. Interestingly, in 29 HT1 children, a prevalent haplotype, haplotype 6, was found to be strongly associated with the disease, at a frequency of 90% of alleles, as compared with approximately 18% in 35 control individuals. This increased to 96% in the 24 patients originating from Saguenay-Lac-St-Jean. These results suggest that one or only a few prevailing mutations are responsible for most of the HT1 cases in Saguenay-Lac-St-Jean. Since most patients were found to be homozygous for a specific haplotype in this population, FAH RFLPs have permitted simple carrier detection in nine different informative HT1 families, with a confidence level of 99.9%. Heterozygosity rate values obtained from 52 carriers indicated that approximately 88% of families at risk from Saguenay-Lac-St-Jean are fully or partially informative. Prenatal diagnosis was also achieved in an American family. Analysis of 24 HT1 patients from nine countries gave a frequency of approximately 52% for haplotype 6, suggesting a relatively high association, worldwide, of HT1 with this haplotype. PMID:7913582

  16. Prion gene haplotypes of U.S. cattle

    PubMed Central

    Clawson, Michael L; Heaton, Michael P; Keele, John W; Smith, Timothy PL; Harhay, Gregory P; Laegreid, William W

    2006-01-01

    Background Bovine spongiform encephalopathy (BSE) is a fatal neurological disorder characterized by abnormal deposits of a protease-resistant isoform of the prion protein. Characterizing linkage disequilibrium (LD) and haplotype networks within the bovine prion gene (PRNP) is important for 1) testing rare or common PRNP variation for an association with BSE and 2) interpreting any association of PRNP alleles with BSE susceptibility. The objective of this study was to identify polymorphisms and haplotypes within PRNP from the promoter region through the 3'UTR in a diverse sample of U.S. cattle genomes. Results A 25.2-kb genomic region containing PRNP was sequenced from 192 diverse U.S. beef and dairy cattle. Sequence analyses identified 388 total polymorphisms, of which 287 have not previously been reported. The polymorphism alleles define PRNP by regions of high and low LD. High LD is present between alleles in the promoter region through exon 2 (6.7 kb). PRNP alleles within the majority of intron 2, the entire coding sequence and the untranslated region of exon 3 are in low LD (18.0 kb). Two haplotype networks, one representing the region of high LD and the other the region of low LD yielded nineteen different combinations that represent haplotypes spanning PRNP. The haplotype combinations are tagged by 19 polymorphisms (htSNPS) which characterize variation within and across PRNP. Conclusion The number of polymorphisms in the prion gene region of U.S. cattle is nearly four times greater than previously described. These polymorphisms define PRNP haplotypes that may influence BSE susceptibility in cattle. PMID:17092337

  17. Modeling biological problems in computer science: a case study in genome assembly.

    PubMed

    Medvedev, Paul

    2018-01-30

    As computer scientists working in bioinformatics/computational biology, we often face the challenge of coming up with an algorithm to answer a biological question. This occurs in many areas, such as variant calling, alignment and assembly. In this tutorial, we use the example of the genome assembly problem to demonstrate how to go from a question in the biological realm to a solution in the computer science realm. We show the modeling process step-by-step, including all the intermediate failed attempts. Please note this is not an introduction to how genome assembly algorithms work and, if treated as such, would be incomplete and unnecessarily long-winded. © The Author(s) 2018. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  18. Comparison of Integer Programming (IP) Solvers for Automated Test Assembly (ATA). Research Report. ETS RR-15-05

    ERIC Educational Resources Information Center

    Donoghue, John R.

    2015-01-01

    At the heart of van der Linden's approach to automated test assembly (ATA) is a linear programming/integer programming (LP/IP) problem. A variety of IP solvers are available, ranging in cost from free to hundreds of thousands of dollars. In this paper, I compare several approaches to solving the underlying IP problem. These approaches range from…

  19. The MLH1 c.-27C>A and c.85G>T variants are linked to dominantly inherited MLH1 epimutation and are borne on a European ancestral haplotype.

    PubMed

    Kwok, Chau-To; Vogelaar, Ingrid P; van Zelst-Stams, Wendy A; Mensenkamp, Arjen R; Ligtenberg, Marjolijn J; Rapkins, Robert W; Ward, Robyn L; Chun, Nicolette; Ford, James M; Ladabaum, Uri; McKinnon, Wendy C; Greenblatt, Marc S; Hitchins, Megan P

    2014-05-01

    Germline mutations of the DNA mismatch repair genes MLH1, MSH2, MSH6 or PMS2, and deletions affecting the EPCAM gene adjacent to MSH2, underlie Lynch syndrome by predisposing to early-onset colorectal, endometrial and other cancers. An alternative but rare cause of Lynch syndrome is constitutional epimutation of MLH1, whereby promoter methylation and transcriptional silencing of one allele occurs throughout normal tissues. A dominantly transmitted constitutional MLH1 epimutation has been linked to an MLH1 haplotype bearing two single-nucleotide variants, NM_000249.2: c.-27C>A and c.85G>T, in a Caucasian family with Lynch syndrome from Western Australia. Subsequently, a second seemingly unrelated Caucasian Australian case with the same MLH1 haplotype and concomitant epimutation was reported. We now describe three additional, ostensibly unrelated, cancer-affected families of European heritage with this MLH1 haplotype in association with constitutional epimutation, bringing the number of index cases reported to five. Array-based genotyping in four of these families revealed shared haplotypes between individual families that extended across ≤2.6-≤6.4 megabase regions of chromosome 3p, indicating common ancestry. A minimal ≤2.6 megabase founder haplotype common to all four families was identified, which encompassed MLH1 and additional flanking genes and segregated with the MLH1 epimutation in each family. Our findings indicate that the MLH1 c.-27C>A and c.85G>T variants are borne on a European ancestral haplotype and provide conclusive evidence for its pathogenicity via a mechanism of epigenetic silencing of MLH1 within normal tissues. Additional descendants bearing this founder haplotype may exist who are also at high risk of developing Lynch syndrome-related cancers.

  20. Assessment of pfcrt 72-76 haplotypes eight years after chloroquine withdrawal in Kinshasa, Democratic Republic of Congo.

    PubMed

    Mvumbi, Dieudonné Makaba; Boreux, Raphael; Sacheli, Rosalie; Lelo, Mvumbi; Lengu, Bobanga; Nani-Tuma, Situakibanza; Melin, Pierrette; Ntumba, Kayembe; Lunganza, Kalala; DeMol, Patrick; Hayette, Marie-Pierre

    2013-12-20

    In 2001, the World Health Organization (WHO) recommended the use of artemisinin-based combination therapy (ACT) as the first-line treatment of uncomplicated malaria cases, as monotherapies had become ineffective in many parts of the world. As a result, the Democratic Republic of Congo (DRC) withdrew chloroquine (CQ) from its malaria treatment policy in 2002 and an artesunate (AS)-amodiaquine (AQ) combination became the ACT of choice in DRC in 2005. AQ-resistance (AQR) has been reported in several parts of the world and mutations in codons 72-76 of the Plasmodium falciparum chloroquine-resistance transporter (pfcrt) gene have been strongly correlated with resistance, especially mutations encoding the SVMNT haplotype. This haplotype was first identified in Southeast Asia and South America but was recently reported in two African countries neighbouring DRC. These facts raised two questions: the first about the evolution of CQ resistance (CQR) in DRC and the second about the presence of the SVMNT haplotype, which would compromise the use of AQ as a partner drug for ACT. A total of 213 thick blood films were randomly collected in 2010 from a paediatric clinic in Kinshasa, DRC. Microscopy controls and real-time polymerase chain reaction (RT-PCR) were performed for Plasmodium species identification. Haplotypes of the pfcrt gene were determined by sequencing. The K76T mutation was detected in 145 out of 198 P. falciparum-positive samples (73.2%). In these 145 resistant strains, only the CVIET haplotype was detected. This study is the first to assess the molecular markers of resistance to CQ and AQ after the introduction of ACT in DRC. The results suggest first that CQR is decreasing, as wild-type pfcrt haplotypes were found in only 26.8% of the samples and secondly that the SVMNT haplotype is not yet present in Kinshasa, suggesting that AQ remains valid as a partner drug for ACT in this region.
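
    The two percentages quoted in this abstract follow directly from the reported counts. A minimal arithmetic check is sketched below; the variable and function names are illustrative and not taken from the study.

```python
# Counts reported in the abstract above.
positive_samples = 198  # P. falciparum-positive samples
k76t_samples = 145      # samples carrying the K76T mutation


def percent(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100.0 * part / whole, 1)


mutant_pct = percent(k76t_samples, positive_samples)
wild_type_pct = percent(positive_samples - k76t_samples, positive_samples)

# The two values are complementary shares of the same 198 samples.
print(f"K76T: {mutant_pct}%  wild type: {wild_type_pct}%")
```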

  1. Identification of specific angiotensin-converting enzyme variants and haplotypes that confer risk and protection against type 2 diabetic nephropathy.

    PubMed

    Ezzidi, Intissar; Mtiraoui, Nabil; Kacem, Maha; Chaieb, Molka; Mahjoub, Touhami; Almawi, Wassim Y

    2009-11-01

    Cross-sectional and family studies identified the angiotensin-converting enzyme (ACE) gene as a risk factor for diabetic nephropathy (DN). The contribution of ACE gene variants to DN development and progression is controversial and varies among different ethnic/racial groups. We investigated the association of three ACE gene variants with DN, rs1799752 insertion/deletion (I/D), rs1800764T/C and rs12449782A/G in 917 Tunisian type 2 diabetic (T2DM) patients: 515 with (DN) and 402 without (DWN) nephropathy. ACE genotyping was done by PCR-based assays; haplotype estimation was performed using H-Plus software (χ²-test based). Genotype frequency distributions of the three studied variants were in Hardy-Weinberg equilibrium. Minor allele frequency of rs1800764 was higher in DN patients than DWN patients or healthy controls, and minor allele frequency of rs1799752 was higher in DN than DWN patients. Higher frequency of rs1799752 and rs1800764 homozygous mutant genotypes was seen in DN compared to DWN patients. Of the three variants, only rs1799752 deletion/deletion (D/D) genotype was associated with a significant increase in albumin-to-creatinine ratio levels, and D/D carriers had elevated low-density lipoprotein, total cholesterol and urea. Three locus haplotype [rs1799752(I/D)/rs1800764(T/C)/rs12449782(A/G)] analysis revealed that the frequency of DCG haplotype was higher, while those of ITG and ICA haplotypes were lower among unselected type 2 diabetic patients. Taking ITA haplotype as reference, multivariate regression analysis confirmed the negative (ITG), and positive (DCG, DTG, DCA and DTA) association of specific ACE haplotypes with DN, after adjusting for potential nephropathy-linked covariates. Our results support the involvement of specific ACE variants in DN pathogenesis and demonstrate the presence of DN-specific haplotypes at the ACE locus.
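
    The abstract reports that genotype distributions were in Hardy-Weinberg equilibrium but does not say how this was tested. One common approach is a one-degree-of-freedom chi-square goodness-of-fit test on the three genotype counts of a biallelic variant. The sketch below assumes that approach; the function name and the genotype counts are hypothetical and are not data from the study.

```python
import math


def hwe_chi_square(n_aa: int, n_ab: int, n_bb: int) -> tuple[float, float]:
    """Chi-square test of Hardy-Weinberg proportions for one biallelic locus.

    Returns (chi2, p_value) with 1 degree of freedom.
    """
    n = n_aa + n_ab + n_bb
    p = (2 * n_aa + n_ab) / (2 * n)  # frequency of allele A
    q = 1.0 - p                      # frequency of allele B
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_aa, n_ab, n_bb)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    # Survival function of the chi-square distribution with 1 df.
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


# Hypothetical counts: exactly at equilibrium, then with no heterozygotes.
print(hwe_chi_square(25, 50, 25))
print(hwe_chi_square(50, 0, 50))
```

    A large p-value gives no evidence of departure from equilibrium; a small one points to deviation, which can arise from genotyping error or population structure.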

  2. Genetic structure of Plasmodium vivax using the merozoite surface protein 1 icb5-6 fragment reveals new hybrid haplotypes in southern Mexico

    PubMed Central

    2014-01-01

    Background Plasmodium vivax is a protozoan parasite with an extensive worldwide distribution, being highly prevalent in Asia as well as in Mesoamerica and South America. In southern Mexico, P. vivax transmission has been endemic and recent studies suggest that these parasites have unique biological and genetic features. The msp1 gene has shown a high rate of nucleotide substitutions, deletions, insertions, and its mosaic structure reveals frequent events of recombination, maybe between highly divergent parasite isolates. Methods The nucleotide sequence variation in the polymorphic icb5-6 fragment of the msp1 gene of Mexican and worldwide isolates was analysed. To understand how genotype diversity arises, disperses and persists in Mexico, the genetic structure and genealogical relationships of local isolates were examined. To identify new sequence hybrids and their evolutionary relationships with other P. vivax isolates circulating worldwide, two haplotype networks were constructed to examine whether two portions of the icb5-6 have different evolutionary histories. Results Twelve new msp1 icb5-6 haplotypes of P. vivax from Mexico were identified. These nucleotide sequences show mosaic structure comprising three partially conserved and two variable subfragments and resulted in five different sequence types. The variable subfragment sV1 has undergone recombination events and resulted in hybrid sequences and the haplotype network allocated the Mexican haplotypes to three lineages, corresponding to the Sal I and Belem types, and another more divergent group. In contrast, the network from icb5-6 fragment but not sV1 revealed that the Mexican haplotypes belong to two separate lineages, none of which are closely related to Sal I or Belem sequences. Conclusions These results suggest that the new hybrid haplotypes from southern Mexico were the result of at least three different recombination events. These rearrangements likely resulted from the recombination between haplotypes of highly divergent lineages that are frequently distributed in South America and Asia and diversified rapidly. PMID:24472213

  3. Single Nucleotide Polymorphisms of the Angiotensin-Converting Enzyme (ACE) Gene Are Associated with Essential Hypertension and Increased ACE Enzyme Levels in Mexican Individuals

    PubMed Central

    Martínez-Rodríguez, Nancy; Posadas-Romero, Carlos; Villarreal-Molina, Teresa; Vallejo, Maite; Del-Valle-Mondragón, Leonardo; Ramírez-Bello, Julian; Valladares, Adan; Cruz-López, Miguel; Vargas-Alarcón, Gilberto

    2013-01-01

    Aim To explore the role of the ACE gene polymorphisms in the risk of essential hypertension in Mexican Mestizo individuals and evaluate the correlation between these polymorphisms and the serum ACE levels. Methods Nine ACE gene polymorphisms were genotyped by 5′ exonuclease TaqMan genotyping assays and polymerase chain reaction (PCR) in 239 hypertensive and 371 non-hypertensive Mexican individuals. Haplotypes were constructed after linkage disequilibrium analysis. ACE serum levels were determined in selected individuals according to different haplotypes. Results Under a dominant model, rs4291, rs4335, rs4344, rs4353, rs4362, and rs4363 polymorphisms were associated with an increased risk of hypertension after adjusting for age, gender, BMI, triglycerides, alcohol consumption, and smoking. Five polymorphisms (rs4335, rs4344, rs4353, rs4362 and rs4363) were in strong linkage disequilibrium and were included in four haplotypes: H1 (AAGCA), H2 (GGATG), H3 (AGATG), and H4 (AGACA). Haplotype H1 was associated with decreased risk of hypertension, while haplotype H2 was associated with an increased risk of hypertension (OR = 0.77, P = 0.023 and OR = 1.41, P = 0.004 respectively). According to the codominant model, the H2/H2 and H1/H2 haplotype combinations were significantly associated with risk of hypertension after adjusting for age, gender, BMI, triglycerides, alcohol consumption, and smoking (OR = 2.0; P = 0.002 and OR = 2.09; P = 0.011, respectively). Significant elevations in serum ACE concentrations were found in individuals with the H2 haplotype (H2/H2 and H2/H1) as compared to H1/H1 individuals (P = 0.0048). Conclusion The results suggest that single nucleotide polymorphisms and the “GGATG” haplotype of the ACE gene are associated with the development of hypertension and with increased ACE enzyme levels. PMID:23741507

  4. Single nucleotide polymorphisms of the angiotensin-converting enzyme (ACE) gene are associated with essential hypertension and increased ACE enzyme levels in Mexican individuals.

    PubMed

    Martínez-Rodríguez, Nancy; Posadas-Romero, Carlos; Villarreal-Molina, Teresa; Vallejo, Maite; Del-Valle-Mondragón, Leonardo; Ramírez-Bello, Julian; Valladares, Adan; Cruz-López, Miguel; Vargas-Alarcón, Gilberto

    2013-01-01

    To explore the role of the ACE gene polymorphisms in the risk of essential hypertension in Mexican Mestizo individuals and evaluate the correlation between these polymorphisms and the serum ACE levels. Nine ACE gene polymorphisms were genotyped by 5' exonuclease TaqMan genotyping assays and polymerase chain reaction (PCR) in 239 hypertensive and 371 non-hypertensive Mexican individuals. Haplotypes were constructed after linkage disequilibrium analysis. ACE serum levels were determined in selected individuals according to different haplotypes. Under a dominant model, rs4291, rs4335, rs4344, rs4353, rs4362, and rs4363 polymorphisms were associated with an increased risk of hypertension after adjusting for age, gender, BMI, triglycerides, alcohol consumption, and smoking. Five polymorphisms (rs4335, rs4344, rs4353, rs4362 and rs4363) were in strong linkage disequilibrium and were included in four haplotypes: H1 (AAGCA), H2 (GGATG), H3 (AGATG), and H4 (AGACA). Haplotype H1 was associated with decreased risk of hypertension, while haplotype H2 was associated with an increased risk of hypertension (OR = 0.77, P = 0.023 and OR = 1.41, P = 0.004 respectively). According to the codominant model, the H2/H2 and H1/H2 haplotype combinations were significantly associated with risk of hypertension after adjusting for age, gender, BMI, triglycerides, alcohol consumption, and smoking (OR = 2.0; P = 0.002 and OR = 2.09; P = 0.011, respectively). Significant elevations in serum ACE concentrations were found in individuals with the H2 haplotype (H2/H2 and H2/H1) as compared to H1/H1 individuals (P = 0.0048). The results suggest that single nucleotide polymorphisms and the "GGATG" haplotype of the ACE gene are associated with the development of hypertension and with increased ACE enzyme levels.

  5. Overview of worldwide diversity of Diaphorina citri Kuwayama mitochondrial cytochrome oxidase 1 haplotypes: two Old World lineages and a New World invasion

    PubMed Central

    Boykin, L.M.; De Barro, P.; Hall, D.G.; Hunter, W.B.; McKenzie, C.L.; Powell, C.A.; Shatters, R.G.

    2012-01-01

    Relationships among worldwide collections of Diaphorina citri (Asian citrus psyllid) were analyzed using mitochondrial cytochrome oxidase I (mtCOI) haplotypes from novel primers. Sequences were produced from PCR amplicons of an 821bp portion of the mtCOI gene using D. citri specific primers, derived from an existing EST library. An alignment was constructed using 612bps of this fragment and consisted of 212 individuals from 52 collections representing 15 countries. There were a total of eight polymorphic sites that separated the sequences into eight different haplotypes (Dcit-1 through Dcit-8). Phylogenetic network analysis using the statistical parsimony software, TCS, suggests two major haplotype groups with preliminary geographic bias between southwestern Asia (SWA) and southeastern Asia (SEA). The recent (within the last 15 to 25 years) invasion into the New World originated from only the SWA group in the northern hemisphere (USA and Mexico) and from both the SEA and SWA groups in the southern hemisphere (Brazil). In only one case, Reunion Island, did haplotypes from both the SEA and SWA group appear in the same location. In Brazil, both groups were present, but in separate locations. The Dcit-1 SWA haplotype was the most frequently encountered, including ~50% of the countries sampled and 87% of the total sequences obtained from India, Pakistan and Saudi Arabia. The second most frequently encountered haplotype, Dcit-2, the basis of the SEA group, represented ~50% of the countries and contained most of the sequences from Southeast Asia and China. Interestingly, only the Caribbean collections (Puerto Rico and Guadeloupe) represented a unique haplotype not found in other countries, indicating no relationship between the USA (Florida) and Caribbean introductions. There is no evidence for cryptic speciation for D. citri based on the COI region included in this study. PMID:22717059
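
    The abstract describes aligned mtCOI sequences being separated into haplotypes by their polymorphic sites. That step can be sketched as follows, assuming the sequences are already aligned to equal length. The function names and the toy sequences are hypothetical; this is not the TCS statistical parsimony analysis used in the study, only the grouping that precedes a network analysis.

```python
from collections import Counter


def polymorphic_sites(seqs: list[str]) -> list[int]:
    """Return the alignment columns at which more than one base occurs."""
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("sequences must be aligned to the same length")
    return [i for i in range(length) if len({s[i] for s in seqs}) > 1]


def collapse_haplotypes(seqs: list[str]) -> Counter:
    """Count distinct haplotypes, keyed by the bases at polymorphic sites."""
    sites = polymorphic_sites(seqs)
    return Counter("".join(s[i] for i in sites) for s in seqs)


# Toy alignment of five individuals (illustrative only).
toy = ["ACGTAC", "ACGTAC", "ACTTAC", "ACTTAA", "ACGTAC"]
print(polymorphic_sites(toy))    # variable columns, 0-based
print(collapse_haplotypes(toy))  # haplotype -> number of individuals
```

    Keying each haplotype by its bases at variable columns keeps the labels short, since invariant columns carry no information for telling haplotypes apart.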

  6. GNAS gene variants affect β-blocker-related survival after coronary artery bypass grafting.

    PubMed

    Frey, Ulrich H; Muehlschlegel, Jochen D; Ochterbeck, Christoph; Fox, Amanda A; Shernan, Stanton K; Collard, Charles D; Lichtner, Peter; Peters, Jürgen; Body, Simon

    2014-05-01

    Cardiac overexpression of the β-adrenoreceptor (βAR)-coupled stimulatory G-protein subunit Gαs enhances inotropic responses to adrenergic stimulation and improves survival in mice under βAR blockade. The authors recently identified three common haplotypes in the GNAS gene encoding Gαs, with the greatest Gαs protein expression and signal transduction in haplotype *3 carriers and less in haplotype *2 and *1 carriers. The authors tested the hypothesis that these GNAS variants result in altered mortality in patients after coronary artery bypass graft surgery, particularly in those receiving βAR blockade. This prospective analysis included 1,627 European ancestry patients undergoing primary coronary artery bypass graft surgery. Patients were genotyped for two GNAS haplotype tagging single-nucleotide polymorphisms defining three major haplotypes. Up to 5-yr all-cause mortality was estimated using a Cox proportional hazard model; hazard ratios and 95% CIs were calculated while adjusting for demographics, clinical covariates, and the new EuroSCORE II. Univariate analysis revealed haplotype-dependent 5-yr mortality rates (*1/*1: 18.9%, *2/*1: 13.7%, *2/*2: 9.3%, *3/*1: 10.6%, *3/*2: 9.1%, and *3/*3: 9.6%; P = 0.0006). After adjustment for other predictors of death, homozygote haplotype *1 carriers showed a doubled risk for death (hazard ratio, 2.2; 95% CI, 1.2 to 3.8; P = 0.006). Considering only patients receiving βAR blockers (n = 1,267), the adjusted risk of death even tripled (hazard ratio, 3.0; 95% CI, 1.5 to 6.1; P = 0.002). GNAS haplotypes independently associate with an increased risk of death after primary coronary artery bypass graft surgery. These results are most pronounced in patients receiving βAR blockers, strengthening the rationale for personalized treatment, to decrease medication side effects and improve outcomes.

  7. Haplotype analysis of the apolipoprotein A5 gene in obese pediatric patients.

    PubMed

    Horvatovich, Katalin; Bokor, Szilvia; Baráth, Akos; Maász, Anita; Kisfali, Péter; Járomi, Luca; Polgár, Noémi; Tóth, Dénes; Répásy, Judit; Endreffy, Emoke; Molnár, Dénes; Melegh, Béla

    2011-06-01

    Apolipoprotein A5 (APOA5) gene variants have been shown to be associated with elevated TG levels; the T-1131C (rs662799) variant has been reported to confer risk for the metabolic syndrome in adult populations. Little is known about APOA5 variants in the pediatric population, and no such information is available for pediatric obesity at all. Here we examined four haplotype-tagging polymorphisms (T-1131C, IVS3 + G476A [rs2072560], T1259C [rs2266788] and C56G [rs3135506]) and also studied the frequency of major naturally occurring haplotypes of APOA5 in obese children. The polymorphisms were analyzed in 232 obese children, and in 137 healthy, normal weight controls, using PCR-RFLP methods. In the pediatric patients we could confirm the association, already known from adult subjects, of the -1131C, IVS3 + 476A and 1259C variants with elevated triglyceride concentrations, both in obese patients and in the controls. The prevalence of the APOA5*2 haplotype (containing the minor allele of T-1131C, IVS3 + G476A and T1259C SNPs together) was 15.5% in obese children, and 5.80% in the controls (p<0.001); multiple logistic regression analysis revealed that this haplotype confers susceptibility for development of obesity (OR=2.87; 95% CI: 1.29-6.37; p≤0.01). By contrast, the APOA5*4 haplotype (with -1131C alone) did not show similar associations. Our findings also suggest that the APOA5*5 haplotype (1259C alone) can be protective against obesity (OR=0.25; 95% CI: 0.07-0.80; p<0.05). While previous studies in adults demonstrated that the APOA5 -1131C minor allele confers risk for adult metabolic syndrome, here we show that the susceptibility conferred by this SNP is restricted to the APOA5*2 haplotype in pediatric obese subjects.

  8. APC Yin-Yang haplotype associated with colorectal cancer risk

    PubMed Central

    GARRE, P.; DE LA HOYA, M.; INIESTA, P.; ROMERA, A.; LLOVET, P.; GONZALEZ, S.; PEREZ-SEGURA, P.; CAPELLA, G.; DIAZ-RUBIO, E.; CALDES, T.

    2010-01-01

    The Yin-Yang haplotype is defined as two mismatched haplotypes (Yin and Yang) representing the majority of the existing haplotypes in a particular genomic region. The human adenomatous polyposis coli (APC) gene shows a Yin-Yang haplotype pattern accounting for 84% of all of the haplotypes existing in the Spanish population. Several association studies have been published regarding APC gene variants (SNPs and haplotypes) and colorectal cancer (CRC) risk. However, no studies concerning diplotype structure and CRC risk have been conducted. The aim of the present study was to investigate whether the APC Yin-Yang homozygote diplotype is over-represented in patients with sporadic CRC when compared to its distribution in controls, and its association with CRC risk. TaqMan® assays were used to genotype three tagSNPs selected across the APC Yin-Yang region. Frequencies of the APC Yin-Yang tagSNP alleles, haplotype and diplotype of 378 CRC cases and 642 controls were compared. Two Spanish CRC group samples were included [Hospital Clínico San Carlos in Madrid (HCSC) and Instituto Catalán de Oncología in Barcelona (ICO)]. Analysis of 157 consecutive CRC patients and 405 control subjects from HCSC showed a significant effect for the risk of CRC (OR=1.93; 95% CI 1.32–2.81; P=0.001). However, this effect was not confirmed in 221 CRC patients and 237 control subjects from ICO (OR=0.89; 95% CI 0.61–1.28; P=0.521). We found a significant association between the APC homozygote Yin-Yang diplotype and the risk of colorectal cancer in the HCSC samples. However, we did not observe this association in the ICO samples. These observations suggest that a study with a larger Spanish cohort is necessary to confirm the effects of the APC Yin-Yang diplotype on the risk of CRC. PMID:22993613
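
    The odds ratios, confidence intervals and P values reported for the two cohorts can be cross-checked against one another. The sketch below assumes a Wald-type interval on the log odds ratio, which the abstract does not state, so it recovers only approximate values from the rounded figures. The function name is illustrative.

```python
import math


def z_from_odds_ratio_ci(odds_ratio: float, ci_low: float, ci_high: float,
                         z_crit: float = 1.959964) -> tuple[float, float, float]:
    """Recover (standard error, z, two-sided p) from an OR and its 95% CI.

    Assumes the interval is exp(log(OR) +/- z_crit * SE), i.e. a Wald interval.
    """
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = math.log(odds_ratio) / se
    p = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal p-value
    return se, z, p


# Figures as printed in the abstract.
se_hcsc, z_hcsc, p_hcsc = z_from_odds_ratio_ci(1.93, 1.32, 2.81)
se_ico, z_ico, p_ico = z_from_odds_ratio_ci(0.89, 0.61, 1.28)
print(f"HCSC: z={z_hcsc:.2f}, p={p_hcsc:.4f}")
print(f"ICO:  z={z_ico:.2f}, p={p_ico:.4f}")
```

    Under this assumption the HCSC estimate sits well below the 0.01 level and the ICO estimate is far from significance, in line with the reported P values. Exact agreement is not expected, because the published figures are rounded and may come from a different test.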

  9. APC Yin-Yang haplotype associated with colorectal cancer risk.

    PubMed

    Garre, P; DE LA Hoya, M; Iniesta, P; Romera, A; Llovet, P; Gonzalez, S; Perez-Segura, P; Capella, G; Diaz-Rubio, E; Caldes, T

    2010-09-01

    The Yin-Yang haplotype is defined as two mismatched haplotypes (Yin and Yang) representing the majority of the existing haplotypes in a particular genomic region. The human adenomatous polyposis coli (APC) gene shows a Yin-Yang haplotype pattern accounting for 84% of all of the haplotypes existing in the Spanish population. Several association studies have been published regarding APC gene variants (SNPs and haplotypes) and colorectal cancer (CRC) risk. However, no studies concerning diplotype structure and CRC risk have been conducted. The aim of the present study was to investigate whether the APC Yin-Yang homozygote diplotype is over-represented in patients with sporadic CRC when compared to its distribution in controls, and its association with CRC risk. TaqMan® assays were used to genotype three tagSNPs selected across the APC Yin-Yang region. Frequencies of the APC Yin-Yang tagSNP alleles, haplotype and diplotype of 378 CRC cases and 642 controls were compared. Two Spanish CRC group samples were included [Hospital Clínico San Carlos in Madrid (HCSC) and Instituto Catalán de Oncología in Barcelona (ICO)]. Analysis of 157 consecutive CRC patients and 405 control subjects from HCSC showed a significant effect for the risk of CRC (OR=1.93; 95% CI 1.32-2.81; P=0.001). However, this effect was not confirmed in 221 CRC patients and 237 control subjects from ICO (OR=0.89; 95% CI 0.61-1.28; P=0.521). We found a significant association between the APC homozygote Yin-Yang diplotype and the risk of colorectal cancer in the HCSC samples. However, we did not observe this association in the ICO samples. These observations suggest that a study with a larger Spanish cohort is necessary to confirm the effects of the APC Yin-Yang diplotype on the risk of CRC.

  10. The genetic basis of resistance and matching-allele interactions of a host-parasite system: The Daphnia magna-Pasteuria ramosa model

    PubMed Central

    Fields, Peter D.; Bourgeois, Yann; Du Pasquier, Louis; Ebert, Dieter

    2017-01-01

    Negative frequency-dependent selection (NFDS) is an evolutionary mechanism suggested to govern host-parasite coevolution and the maintenance of genetic diversity at host resistance loci, such as the vertebrate MHC and R-genes in plants. Matching-allele interactions of hosts and parasites that prevent the emergence of host and parasite genotypes that are universally resistant and infective are a genetic mechanism predicted to underpin NFDS. The underlying genetics of matching-allele interactions are unknown even in host-parasite systems with empirical support for coevolution by NFDS, as is the case for the planktonic crustacean Daphnia magna and the bacterial pathogen Pasteuria ramosa. We fine-map one locus associated with D. magna resistance to P. ramosa and genetically characterize two haplotypes of the Pasteuria resistance (PR-) locus using de novo genome and transcriptome sequencing. Sequence comparison of PR-locus haplotypes finds dramatic structural polymorphisms between PR-locus haplotypes including a large portion of each haplotype being composed of non-homologous sequences resulting in haplotypes differing in size by 66 kb. The high divergence of PR-locus haplotypes suggests a history of multiple, diverse and repeated instances of structural mutation events and restricted recombination. Annotation of the haplotypes reveals striking differences in gene content. In particular, a group of glycosyltransferase genes is present in the susceptible but absent in the resistant haplotype. Moreover, in natural populations, we find that the PR-locus polymorphism is associated with variation in resistance to different P. ramosa genotypes, pointing to the PR-locus polymorphism as being responsible for the matching-allele interactions that have been previously described for this system. Our results conclusively identify a genetic basis for the matching-allele interaction observed in a coevolving host-parasite system and provide a first insight into its molecular basis. PMID:28222092

  11. The genetic basis of resistance and matching-allele interactions of a host-parasite system: The Daphnia magna-Pasteuria ramosa model.

    PubMed

    Bento, Gilberto; Routtu, Jarkko; Fields, Peter D; Bourgeois, Yann; Du Pasquier, Louis; Ebert, Dieter

    2017-02-01

    Negative frequency-dependent selection (NFDS) is an evolutionary mechanism suggested to govern host-parasite coevolution and the maintenance of genetic diversity at host resistance loci, such as the vertebrate MHC and R-genes in plants. Matching-allele interactions of hosts and parasites that prevent the emergence of host and parasite genotypes that are universally resistant and infective are a genetic mechanism predicted to underpin NFDS. The underlying genetics of matching-allele interactions are unknown even in host-parasite systems with empirical support for coevolution by NFDS, as is the case for the planktonic crustacean Daphnia magna and the bacterial pathogen Pasteuria ramosa. We fine-map one locus associated with D. magna resistance to P. ramosa and genetically characterize two haplotypes of the Pasteuria resistance (PR-) locus using de novo genome and transcriptome sequencing. Sequence comparison of PR-locus haplotypes finds dramatic structural polymorphisms between PR-locus haplotypes including a large portion of each haplotype being composed of non-homologous sequences resulting in haplotypes differing in size by 66 kb. The high divergence of PR-locus haplotypes suggests a history of multiple, diverse and repeated instances of structural mutation events and restricted recombination. Annotation of the haplotypes reveals striking differences in gene content. In particular, a group of glycosyltransferase genes is present in the susceptible but absent in the resistant haplotype. Moreover, in natural populations, we find that the PR-locus polymorphism is associated with variation in resistance to different P. ramosa genotypes, pointing to the PR-locus polymorphism as being responsible for the matching-allele interactions that have been previously described for this system. Our results conclusively identify a genetic basis for the matching-allele interaction observed in a coevolving host-parasite system and provide a first insight into its molecular basis.

  12. Association between polymorphisms in the β2-adrenergic receptor gene with myocardial infarction and ischemic stroke in women

    PubMed Central

    Schürks, Markus; Kurth, Tobias; Ridker, Paul M; Buring, Julie E.; Zee, Robert Y. L.

    2008-01-01

    Summary Results from studies investigating the association between polymorphisms in the β2-adrenergic receptor gene (ADRB2) and cardiovascular disease (CVD) are controversial. Using haplotype-based analysis, we have previously shown a protective effect of the Gly16-Gln27-Ile164 haplotype on myocardial infarction in men. We sought to replicate these findings in women and further investigated, whether the gene variants exert differential effects on myocardial infarction and ischemic stroke. We performed a prospective study among 25,224 women, participating in the Women's Health Study and free of CVD at study entry. We had information on polymorphisms Gly16Arg, Gln27Glu, and Thr164Ile in the ADRB2. Incident CVD was self-reported and confirmed after medical record review. We used proportional hazards models to investigate the association between genotypes and haplotypes with any myocardial infarction, any ischemic stroke, and CVD death. During a mean of 11.8 years of follow-up, 274 myocardial infarctions, 299 ischemic strokes, and 159 CVD deaths occurred. Among the whole cohort genotype- and haplotype-based analyses did not show an association for any of the gene variants with any of the CVD outcomes. When we focused on Caucasian women, the haplotype-based analysis, however, suggested an inverse association of the haplotype Gly16-Gln27-Thr164 with incident myocardial infarction (multivariable-adjusted hazard ratio 0.75; 95%CI 0.58−0.97; p=0.03). We did not find associations in the haplotype-based analyses with incident ischemic stroke or CVD death. Our results suggest that the haplotype Gly16-Gln27-Thr164 is associated with reduced risk of incident myocardial infarction but not ischemic stroke in Caucasian women and suggests differential pathophysiologies for myocardial infarction and stroke. PMID:19190821

  13. Asian population frequencies and haplotype distribution of killer cell immunoglobulin-like receptor (KIR) genes among Chinese, Malay, and Indian in Singapore.

    PubMed

    Lee, Yi Chuan; Chan, Soh Ha; Ren, Ee Chee

    2008-11-01

    Killer cell immunoglobulin-like receptors (KIR) gene frequencies have been shown to be distinctly different between populations and contribute to functional variation in the immune response. We have investigated KIR gene frequencies in 370 individuals representing three Asian populations in Singapore and report here the distribution of 14 KIR genes (2DL1, 2DL2, 2DL3, 2DL4, 2DL5, 2DS1, 2DS2, 2DS3, 2DS4, 2DS5, 3DL1, 3DL2, 3DL3, 3DS1) with two pseudogenes (2DP1, 3DP1) among Singapore Chinese (n = 210); Singapore Malay (n = 80), and Singapore Indian (n = 80). Four framework genes (KIR3DL3, 3DP1, 2DL4, 3DL2) and a nonframework pseudogene 2DP1 were detected in all samples while KIR2DS2, 2DL2, 2DL5, and 2DS5 had the greatest significant variation across the three populations. Fifteen significant linkage patterns, consistent with associations between genes of A and B haplotypes, were observed. Eighty-four distinct KIR profiles were determined in our populations, 38 of which had not been described in other populations. KIR haplotype studies were performed using nine Singapore Chinese families comprising 34 individuals. All genotypes could be resolved into corresponding pairs of existing haplotypes with eight distinct KIR genotypes and eight different haplotypes. The haplotype A2 with frequency of 63.9% was dominant in Singapore Chinese, comparable to that reported in Korean and Chinese Han. The A haplotypes predominate in Singapore Chinese, with ratio of A to B haplotypes of approximately 3:1. Comparison with KIR frequencies in other populations showed that Singapore Chinese shared similar distributions with Chinese Han, Japanese, and Korean; Singapore Indian was found to be comparable with North Indian Hindus while Singapore Malay resembled the Thai.

  14. A YAC contig spanning the dominant retinitis pigmentosa locus (RP9) on chromosome 7p

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Keen, T.J.; Inglehearn, C.F.; Patel, R.J.

    1995-08-10

    The dominant retinitis pigmentosa locus RP9 has previously been localized to 7p13-p15, in the interval D7S526-D7S484. We now report refinement of the locus to the interval D7S795-D7S484 and YAC contig of approximately 4.8 Mb spanning this region and extending both distally and proximally from it. The contig was constructed by STS content mapping and physically orders 29 STSs in 28 YAC clones. The order of polymorphic markers in the contig is consistent with a genetic map that has been assembled using haplotype data from the CEPH pedigrees. This contig will provide a primary resource for the construction of a transcriptional map of this region and for the identification of the defective gene causing this form of adRP. 27 refs., 3 figs., 1 tab.

  15. Molecular genealogy tools for white-tailed deer with chronic wasting disease

    PubMed Central

    Ernest, Holly B.; Hoar, Bruce R.; Well, Jay A.; O’Rourke, Katherine I.

    2010-01-01

    Molecular genetic data provide powerful tools for genealogy reconstruction to reveal mechanisms underlying disease ecology. White-tailed deer (Odocoileus virginianus) congregate in matriarchal groups; kin-related close social spacing may be a factor in the spread of infectious diseases. Spread of chronic wasting disease (CWD), a prion disorder of deer and their cervid relatives, is presumed to be associated with direct contact between individuals and by exposure to shared food and water sources contaminated with prions shed by infected deer. Key aspects of disease ecology are yet unknown. DNA tools for pedigree reconstruction were developed to fill knowledge gaps in disease dynamics in prion-infected wild animals. Kinship indices using data from microsatellite loci and sequence haplotypes of mitochondrial DNA were employed to assemble genealogies. Molecular genealogy tools will be useful for landscape-level population genetic research and monitoring, in addition to epidemiologic studies examining transmission of CWD in captive and free-ranging cervids. PMID:20592847

  16. Association of HLA haplotype with alopecia areata in Chinese Hans.

    PubMed

    Xiao, F-L; Ye, D-Q; Yang, S; Zhou, F-S; Zhou, S-M; Zhu, Y-G; Liang, Y-H; Ren, Y-Q; Zhang, X-J

    2006-11-01

    Some studies have shown discrepancies in human leucocyte antigen (HLA) associated with alopecia areata (AA) between different ethnic populations. To investigate whether HLA-I, -DQA1 and -DQB1 alleles and the HLA haplotype are associated with AA, and the correlation between the HLA haplotype profile, age of onset and severity of AA in Chinese Hans. The polymerase chain reaction-sequence specific primer (PCR-SSP) method was used to analyse the frequencies of HLA class I, -DQA1 and -DQB1 alleles in 192 patients with AA and 252 controls in Chinese Hans. The linkage disequilibrium was calculated using the 2 x 2 table. The 24 two-locus haplotypes [including A*02-B*18, A*02-B*27, A*02-B*52, A*02-Cw*0704, A*02-DQA1*0104, A*02-DQB1*0604, A*02-DQB1*0606, B*18-Cw*0704, B*18-DQA1*0104, B*18-DQA1*0302, B*18-DQB1*0606, B*27-Cw*0704, B*27-DQA1*0104, B*27-DQA1*0302, B*52-Cw*0704, B*52-DQA1*0104, B*52-DQA1*0302, B*52-DQB1*0606, Cw*0704-DQA1*0104, Cw*0704-DQA1*0302, Cw*0704-DQB1*0606, DQA1*0104-DQB1*0604, DQA1*0104-DQB1*0606, DQA1*0302-DQB1*0606 (P<0.05)] were associated with AA, while eight extended haplotypes (A*02-B*18-DQA1*0104, A*02-B*27-DQA1*0104, A*02-B*52-DQA1*0104, A*02-B*52-DQA1*0302, A*02-B*52-DQB1*0606, B*52-Cw*0704-DQA1*0104, B*52-Cw*0704-DQA1*0302, A*02-B*52-DQA1*0302-DQB1*0606) were found to be related to AA in Chinese Hans. Through stratified analysis, we found that the extended haplotype B*52-Cw*0704-DQA1*0302 was related to early onset of AA, and no haplotype was only associated with severe AA. This is the first detailed report to elucidate HLA haplotypes associated with AA and that demonstrates the significant HLA haplotypes in Chinese Hans AA. The haplotype B*52-Cw*0704-DQA1*0302 was identified to be related to early onset of AA. Our results provide some information for future research on predisposing genes in HLA regions in Chinese Hans.
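
    The abstract above mentions computing linkage disequilibrium from a 2 x 2 table without giving the formula. The sketch below shows one common way to do this, assuming the four cells count phased haplotypes carrying both alleles, only the first, only the second, or neither. The study typed individuals rather than haplotypes, so its exact procedure may differ; the function name and arguments are hypothetical.

```python
def ld_from_2x2(n_ab, n_a_only, n_b_only, n_neither):
    """Return (D, r^2) for two alleles from a 2 x 2 table of haplotype counts.

    Assumption: each cell counts phased haplotypes, not individuals.
    """
    n = n_ab + n_a_only + n_b_only + n_neither
    if n == 0:
        raise ValueError("table is empty")
    p_ab = n_ab / n                # frequency of haplotypes carrying both alleles
    p_a = (n_ab + n_a_only) / n    # marginal frequency of the first allele
    p_b = (n_ab + n_b_only) / n    # marginal frequency of the second allele
    d = p_ab - p_a * p_b           # D: observed minus expected under independence
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    r2 = d * d / denom if denom > 0 else float("nan")
    return d, r2


# Two alleles that always occur together: complete disequilibrium.
d, r2 = ld_from_2x2(50, 0, 0, 50)
```

    A positive D means the two alleles occur together more often than their individual frequencies would predict, which is the sense in which a two-locus haplotype is "in linkage disequilibrium".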

  17. Genetic diversity and population structure in the threatened Oregon silverspot butterfly (Speyeria zerene hippolyta) in western Oregon and northwestern California— Implications for future translocations and the establishment of new populations

    USGS Publications Warehouse

    Miller, Mark P.; Mullins, Thomas D.; Haig, Susan M.

    2016-09-20

    Executive Summary We present results of population genetic analyses performed on Oregon silverspot butterflies (OSB; Speyeria zerene hippolyta) in western Oregon and northwestern California. We used DNA sequences from a 561-base pair region of the mitochondrial cytochrome oxidase subunit I (COI) gene for a dataset comprised of 112 S. z. hippolyta and 32 S. z. gloriosa individuals collected at 9 locations in western Oregon and northwestern California. The most pertinent findings thus far are summarized as follows: Among OSB populations, genetic diversity is lowest at Mount Hebo and highest at Rock Creek and Bray Point. Of the 32 haplotypes detected in OSB, only 2 were shared among populations (1 shared by Mount Hebo, Cascade Head, Bray Point, and Rock Creek, and 1 shared by Rock Creek and Lake Earl). The remaining 30 haplotypes were identified in individual populations, highlighting the strong differentiation among sites. It is unclear if the shared haplotypes represent widespread, naturally occurring genetic variation or if allele sharing among populations is due to translocation history. Using full siblings of individuals that were released at Rock Creek and Bray Point in 2012 as comparison standards, the analyses suggest that 54 percent of the sampled individuals from Bray Point were naturally recruited into the population and were not originating from the 2012 release of captive reared individuals. Likewise, 33 percent of the analyzed individuals from Rock Creek were naturally recruited. Both of these estimates may be underestimates if the shared alleles that we identified among populations are naturally occurring and not a product of the 2012 translocations. The results suggest that there are about 12–13 COI haplotypes in the Mount Hebo population. The U.S. Fish and Wildlife Service anticipates using Mount Hebo as the source of individuals when establishing new populations in the future. Nonlinear regression models based on a series of rarefaction analyses suggest that progeny from 12, 37, 109, and 326 female individuals would be required to respectively capture 25, 50, 75, and 90 percent of the allelic diversity from Mount Hebo. Phylogenetic analyses identified two different haplotype groups, but the two groups did not correspond to the different subspecies used in the analysis. One group included 22 S. z. hippolyta haplotypes and 7 haplotypes identified in S. z. gloriosa. The second group included eight haplotypes from S. z. hippolyta, three haplotypes from S. z. gloriosa, and one haplotype that was detected in both subspecies.

  18. Review of general algorithmic features for genome assemblers for next generation sequencers.

    PubMed

    Wajid, Bilal; Serpedin, Erchin

    2012-04-01

    In the realm of bioinformatics and computational biology, the most rudimentary data upon which all the analysis is built is the sequence data of genes, proteins and RNA. The sequence data of the entire genome is the solution to the genome assembly problem. The scope of this contribution is to provide an overview on the art of problem-solving applied within the domain of genome assembly in the next-generation sequencing (NGS) platforms. This article discusses the major genome assemblers that were proposed in the literature during the past decade by outlining their basic working principles. It is intended to act as a qualitative, not a quantitative, tutorial to all working on genome assemblers pertaining to the next generation of sequencers. We discuss the theoretical aspects of various genome assemblers, identifying their working schemes. We also discuss briefly the direction in which the area is headed towards along with discussing core issues on software simplicity. Copyright © 2012 Beijing Institute of Genomics, Chinese Academy of Sciences. Published by Elsevier Ltd. All rights reserved.

  19. Genetic analysis of ossification of the posterior longitudinal ligament.

    PubMed

    Matsunaga, S; Yamaguchi, M; Hayashi, K; Sakou, T

    1999-05-15

    The human leukocyte antigen (HLA) haplotypes in families of patients with known ossification of the posterior longitudinal ligament (OPLL) were reviewed. To clarify how genetic factors relate to the development of OPLL. The association between genetic factors and the development of OPLL is still unknown. The association between HLA haplotypes and OPLL was studied in families of 24 patients with OPLL. The prevalence of OPLL was higher in the siblings showing a higher share of identical HLA haplotypes: 10 (53%) of 19 with concurrence of two strands, and 5 (24%) of 21 with concurrence of one strand. Of 21 subjects who had no HLA haplotype identical with that in OPLL patients, only one showed evidence of OPLL. Genetic factors predispose toward the development of OPLL.

  20. Modeling haplotype block variation using Markov chains.

    PubMed

    Greenspan, G; Geiger, D

    2006-04-01

    Models of background variation in genomic regions form the basis of linkage disequilibrium mapping methods. In this work we analyze a background model that groups SNPs into haplotype blocks and represents the dependencies between blocks by a Markov chain. We develop an error measure to compare the performance of this model against the common model that assumes that blocks are independent. By examining data from the International Haplotype Mapping project, we show how the Markov model over haplotype blocks is most accurate when representing blocks in strong linkage disequilibrium. This contrasts with the independent model, which is rendered less accurate by linkage disequilibrium. We provide a theoretical explanation for this surprising property of the Markov model and relate its behavior to allele diversity.
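
    The contrast drawn above between an independent-blocks model and a Markov chain over blocks can be illustrated with a small sketch. It assumes each haplotype is given as a tuple whose k-th entry names the variant observed in block k, and it compares plain log-likelihoods; this is not the error measure developed in the paper, and all function names are hypothetical. No smoothing is applied, so only haplotypes whose variants and adjacent pairs occur in the training sample can be scored.

```python
import math
from collections import Counter


def fit_models(haps):
    """Count per-block variants and adjacent-block variant pairs."""
    n, m = len(haps), len(haps[0])
    marginals = [Counter(h[k] for h in haps) for k in range(m)]
    pairs = [Counter((h[k], h[k + 1]) for h in haps) for k in range(m - 1)]
    return n, marginals, pairs


def loglik_independent(h, n, marginals):
    """Log-probability when every block is drawn independently."""
    return sum(math.log(marginals[k][v] / n) for k, v in enumerate(h))


def loglik_markov(h, n, marginals, pairs):
    """Log-probability under a first-order Markov chain over blocks."""
    ll = math.log(marginals[0][h[0]] / n)
    for k in range(len(h) - 1):
        # P(block k+1 = b | block k = a) = count(a, b) / count(a)
        ll += math.log(pairs[k][(h[k], h[k + 1])] / marginals[k][h[k]])
    return ll


# Toy sample with perfect linkage disequilibrium between adjacent blocks.
haps = [("a", "x", "p")] * 5 + [("b", "y", "q")] * 5
n, marginals, pairs = fit_models(haps)
ll_indep = loglik_independent(("a", "x", "p"), n, marginals)
ll_markov = loglik_markov(("a", "x", "p"), n, marginals, pairs)
```

    In this toy sample the Markov model assigns the observed haplotype probability 1/2, whereas the independent model assigns only 1/8, because it ignores the fact that the first block fully determines the others.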

  1. Modeling Haplotype Block Variation Using Markov Chains

    PubMed Central

    Greenspan, G.; Geiger, D.

    2006-01-01

    Models of background variation in genomic regions form the basis of linkage disequilibrium mapping methods. In this work we analyze a background model that groups SNPs into haplotype blocks and represents the dependencies between blocks by a Markov chain. We develop an error measure to compare the performance of this model against the common model that assumes that blocks are independent. By examining data from the International Haplotype Mapping project, we show how the Markov model over haplotype blocks is most accurate when representing blocks in strong linkage disequilibrium. This contrasts with the independent model, which is rendered less accurate by linkage disequilibrium. We provide a theoretical explanation for this surprising property of the Markov model and relate its behavior to allele diversity. PMID:16361244

  2. Recent Advances in Experimental Whole Genome Haplotyping Methods

    PubMed Central

    Huang, Mengting; Lu, Zuhong

    2017-01-01

    Haplotype plays a vital role in diverse fields; however, the sequencing technologies cannot resolve haplotype directly. Pioneers demonstrated several approaches to resolve haplotype in the early years, which was extensively reviewed. Since then, numerous methods have been developed recently that have significantly improved phasing performance. Here, we review experimental methods that have emerged mainly over the past five years, and categorize them into five classes according to their maximum scale of contiguity: (i) encapsulation, (ii) 3D structure capture and construction, (iii) compartmentalization, (iv) fluorography, (v) long-read sequencing. Several subsections of certain methods are attached to each class as instances. We also discuss the relative advantages and disadvantages of different classes and make comparisons among representative methods of each class. PMID:28891974

  3. 17 Y-STR haplotype diversity in São Paulo state (southeast of Brazil).

    PubMed

    de Souza, Leandro Fonseca; da Motta, Carlos Henrique Ares Silveira; Moura-Neto, Rodrigo Soares

    2018-03-12

    A sample of 158 Brazilian males from São Paulo (SP), in southeastern Brazil, was typed for 17 Y-STR loci (DYS19, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS437, DYS438, DYS439, DYS448, DYS456, DYS458, DYS635, YGATA_H4.1, and DYS385ab). A total of 158 haplotypes were identified, all of which were unique. The haplotype diversity and discrimination capacity were both calculated as 1.0, and the genetic diversity was 67.4%. Pairwise haplotype distances showed that the São Paulo population is not significantly different from those of Rio de Janeiro and Portugal, but differs from African and Native American populations.
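
    As a rough check on the figures reported above, the sketch below computes haplotype diversity and discrimination capacity for a sample in which every haplotype is distinct. It assumes the commonly used Nei estimator, HD = n/(n-1) * (1 - sum of squared haplotype frequencies), and defines discrimination capacity as distinct haplotypes divided by sample size; the abstract does not state which formulas the authors used, and the haplotype labels are placeholders.

```python
from collections import Counter


def haplotype_stats(haplotypes):
    """Return (haplotype diversity, discrimination capacity) for a sample."""
    n = len(haplotypes)
    if n < 2:
        raise ValueError("need at least two haplotypes")
    counts = Counter(haplotypes)
    # Assumed estimator (Nei): n/(n-1) * (1 - sum of squared frequencies)
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    diversity = n / (n - 1) * (1 - sum_sq)
    # Discrimination capacity: number of distinct haplotypes / sample size
    capacity = len(counts) / n
    return diversity, capacity


# Placeholder labels standing in for 158 males with 158 distinct haplotypes.
sample = [f"H{i}" for i in range(158)]
hd, dc = haplotype_stats(sample)
```

    With 158 distinct haplotypes among 158 individuals, both quantities come out at 1.0 (up to floating-point rounding), which matches the values in the abstract.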

  4. Genetic Variation at Selected SNPs in the Leptin Gene and Association of Alleles with Markers of Kidney Disease in a Xhosa Population of South Africa

    PubMed Central

    Okpechi, Ikechi G.; Rayner, Brian L.; van der Merwe, Lize; Mayosi, Bongani M.; Adeyemo, Adebowale; Tiffin, Nicki; Ramesar, Rajkumar

    2010-01-01

    Background Chronic kidney disease (CKD) is a significant public health problem that leads to end-stage renal disease (ESRD) with as many as 2 million people predicted to need therapy worldwide by 2010. Obesity is a risk factor for CKD and leptin, the obesity hormone, correlates with body fat mass and markers of renal function. A number of clinical and experimental studies have suggested a link between serum leptin and kidney disease. We hypothesised that variants in the leptin gene (LEP) may be associated with markers of CKD in indigenous black Africans. Methodology/Principal Findings Black South Africans of Xhosa (distinct cultural Bantu-speaking population) descent were recruited for the study and four common polymorphisms of the LEP (rs7799039, rs791620, rs2167270 and STS-U43653 [ENSSNP5824596]) were analysed for genotype and haplotype association with urine albumin-to-creatinine ratio (UACR), estimated glomerular filtration rate (eGFR), Serum creatinine (Scr) and serum leptin level. In one of the four single nucleotide polymorphisms (SNPs) we examined, an association with the renal phenotypes was observed. Hypertensive subjects with the T allele (CT genotype) of the ENSSNP5824596 SNP had a significantly higher eGFR (p = 0.0141), and significantly lower Scr (p = 0.0137). This was confirmed by haplotype analysis. Also, the haplotype GAAC had a modest effect on urine albumin-to-creatinine ratio in normotensive subjects (p = 0.0482). Conclusions/Significance These results suggest that genetic variations of the LEP may be associated with phenotypes that are markers of CKD in black Africans. PMID:20140086

  5. Genetic Variation of the Ghrelin Signalling System in Individuals with Amphetamine Dependence

    PubMed Central

    Jayaram-Lindström, Nitya; Nilsson, Staffan; Toren, Kjell; Rosengren, Annika; Engel, Jörgen A.; Franck, Johan

    2013-01-01

    The development of amphetamine dependence largely depends on the effects of amphetamine in the brain reward systems. Ghrelin, an orexigenic peptide, activates the reward systems and is required for reward induced by alcohol, nicotine, cocaine and amphetamine in mice. Human genetic studies have shown that polymorphisms in the pre-proghrelin (GHRL) as well as GHS-R1A (GHSR) genes are associated with high alcohol consumption, increased weight and smoking in males. Since the heritability factor underlying drug dependence is shared between different drugs of abuse, we here examine the association between single nucleotide polymorphisms (SNPs) and haplotypes in the GHRL and GHSR, and amphetamine dependence. GHRL and GHSR SNPs were genotyped in Swedish amphetamine dependent individuals (n = 104) and controls from the general population (n = 310). A case-control analysis was performed and SNPs and haplotypes were additionally tested for association against Addiction Severity Interview (ASI) composite score of drug use. The minor G-allele of the GHSR SNP rs2948694, was more common among amphetamine dependent individuals when compared to controls (pc = 0.02). A significant association between the GHRL SNP rs4684677 and ASI composite score of drug use was also reported (pc = 0.03). The haplotype analysis did not add to the information given by the individual polymorphisms. Although genetic variability of the ghrelin signalling system is not a diagnostic marker for amphetamine dependence and problem severity of drug use, the present results strengthen the notion that ghrelin and its receptor may be involved in the development of addictive behaviours and may thus serve as suitable targets for new treatments of such disorders. PMID:23579732

  6. Genetic variation of the ghrelin signalling system in individuals with amphetamine dependence.

    PubMed

    Suchankova, Petra; Jerlhag, Elisabet; Jayaram-Lindström, Nitya; Nilsson, Staffan; Toren, Kjell; Rosengren, Annika; Engel, Jörgen A; Franck, Johan

    2013-01-01

    The development of amphetamine dependence largely depends on the effects of amphetamine in the brain reward systems. Ghrelin, an orexigenic peptide, activates the reward systems and is required for reward induced by alcohol, nicotine, cocaine and amphetamine in mice. Human genetic studies have shown that polymorphisms in the pre-proghrelin (GHRL) as well as GHS-R1A (GHSR) genes are associated with high alcohol consumption, increased weight and smoking in males. Since the heritability factor underlying drug dependence is shared between different drugs of abuse, we here examine the association between single nucleotide polymorphisms (SNPs) and haplotypes in the GHRL and GHSR, and amphetamine dependence. GHRL and GHSR SNPs were genotyped in Swedish amphetamine dependent individuals (n = 104) and controls from the general population (n = 310). A case-control analysis was performed and SNPs and haplotypes were additionally tested for association against Addiction Severity Interview (ASI) composite score of drug use. The minor G-allele of the GHSR SNP rs2948694, was more common among amphetamine dependent individuals when compared to controls (pc  = 0.02). A significant association between the GHRL SNP rs4684677 and ASI composite score of drug use was also reported (pc  = 0.03). The haplotype analysis did not add to the information given by the individual polymorphisms. Although genetic variability of the ghrelin signalling system is not a diagnostic marker for amphetamine dependence and problem severity of drug use, the present results strengthen the notion that ghrelin and its receptor may be involved in the development of addictive behaviours and may thus serve as suitable targets for new treatments of such disorders.

  7. The role of donor-recipient relationship in long-term outcomes of living donor renal transplantation.

    PubMed

    Miles, Clifford D; Schaubel, Douglas E; Liu, Dandan; Port, Friedrich K; Rao, Panduranga S

    2008-05-27

    Graft failure related to acute and chronic rejection remains an important problem in transplantation. An association has been reported between microchimerism and the development of tolerance. Since it has been established that cells of fetal origin can be found in maternal tissues long after parturition, and cells of maternal origin may persist for years in offspring, we hypothesized that this fetal-maternal microchimerism may confer tolerance and thus less graft loss for kidneys transplanted between mothers and their offspring. We used data from the Scientific Registry of Transplant Recipients to compare death-censored graft survival among recipients of living-related renal transplants sharing at least one human leukocyte antigen (HLA) haplotype with their donor. A total of 23,064 such transplants were reported from 1995 to 2004. A Cox proportional hazards model was constructed to compare death-censored graft survival among the following donor-recipient pairings: child-to-mother, child-to-father, mother-to-child, father-to-child, 1-haplotype matched siblings, and HLA-identical siblings. HLA-identical sibling recipients had the best survival, but results for the child-to-father group were not significantly worse (hazard ratio=1.07, P=0.47). Mother-to-child transplants had the poorest graft survival (hazard ratio=2.61, P<0.0001). We found no evidence of tolerance to kidneys transplanted between mothers and offspring. Our analysis of 1-haplotype matched living-related renal transplants argues against tolerance to organs based on fetal-maternal microchimerism. Mechanistic studies examining the relationship between chimerism and immune sensitization would be useful to explore our results, and may contribute to a better understanding of tolerance.

  8. Population genetics and ecological niche of invasive Aedes albopictus in Mexico.

    PubMed

    Pech-May, Angélica; Moo-Llanes, David A; Puerto-Avila, María Belem; Casas, Mauricio; Danis-Lozano, Rogelio; Ponce, Gustavo; Tun-Ku, Ezequiel; Pinto-Castillo, José Francisco; Villegas, Alejandro; Ibáñez-Piñon, Clemente R; González, Cassandra; Ramsey, Janine M

    2016-05-01

    The Asian tiger mosquito Aedes albopictus (Skuse), is one of the most invasive mosquito species worldwide. In Mexico it is now recorded in 12 states and represents a serious public health problem, given the recent introduction of Chikungunya on the southern border. The aim of this study was to analyze the population genetics of A. albopictus from all major recorded foci, and model its ecological niche. Niche similarity with that from its autochthonous distribution in Asia and other invaded countries were analyzed and its potential future expansion and potential human exposure in climate change scenarios measured. We analyzed 125 sequences of a 317 bp fragment of the cyt b gene from seven A. albopictus populations across Mexico. The samples belong to 25 haplotypes with moderate population structuring (Fst=0.081, p<0.02) and population expansion. The most prevalent haplotype, found in all principal sites, was shared with the USA, Brazil, France, Madagascar, and Reunion Island. The ecological niche model using Mexican occurrence records covers 79.7% of the country, and has an 83% overlap with the Asian niche projected to Mexico. Both Neotropical and Nearctic regions are included in the Mexican niche model. Currently in Mexico, 38.6 million inhabitants are exposed to A. albopictus, which is expected to increase to 45.6 million by 2070. Genetic evidence supports collection information that A. albopictus was introduced to Mexico principally by land from the USA and Central and South America. Prevalent haplotypes from Mexico are shared with most invasive regions across the world, just as there was high niche similarity with both natural and invaded regions. The important overlap with the Asian niche model suggests a high potential for the species to disperse to sylvatic regions in Mexico. Copyright © 2016 Elsevier B.V. All rights reserved.

  9. Assembly planning based on subassembly extraction

    NASA Technical Reports Server (NTRS)

    Lee, Sukhan; Shin, Yeong Gil

    1990-01-01

    A method is presented for the automatic determination of assembly partial orders from a liaison graph representation of an assembly through the extraction of preferred subassemblies. In particular, the authors show how to select a set of tentative subassemblies by decomposing a liaison graph into a set of subgraphs based on feasibility and difficulty of disassembly, how to evaluate each of the tentative subassemblies in terms of assembly cost using the subassembly selection indices, and how to construct a hierarchical partial order graph (HPOG) as an assembly plan. The method provides an approach to assembly planning by identifying spatial parallelism in assembly as a means of constructing temporal relationships among assembly operations and solves the problem of finding a cost-effective assembly plan in a flexible environment. A case study of the assembly planning of a mechanical assembly is presented.

  10. Accurate HLA type inference using a weighted similarity graph.

    PubMed

    Xie, Minzhu; Li, Jing; Jiang, Tao

    2010-12-14

    The human leukocyte antigen system (HLA) contains many highly variable genes. HLA genes play an important role in the human immune system, and HLA gene matching is crucial for the success of human organ transplantations. Numerous studies have demonstrated that variation in HLA genes is associated with many autoimmune, inflammatory and infectious diseases. However, typing HLA genes by serology or PCR is time consuming and expensive, which limits large-scale studies involving HLA genes. Since it is much easier and cheaper to obtain single nucleotide polymorphism (SNP) genotype data, accurate computational algorithms to infer HLA gene types from SNP genotype data are needed. To infer HLA types from SNP genotypes, the first step is to infer SNP haplotypes from genotypes. However, for the same SNP genotype data set, the haplotype configurations inferred by different methods are usually inconsistent, and it is often difficult to decide which one is true. In this paper, we design an accurate HLA gene type inference algorithm by utilizing SNP genotype data from pedigrees, known HLA gene types of some individuals and the relationship between inferred SNP haplotypes and HLA gene types. Given a set of haplotypes inferred from the genotypes of a population consisting of many pedigrees, the algorithm first constructs a weighted similarity graph based on a new haplotype similarity measure and derives constraint edges from known HLA gene types. Based on the principle that different HLA gene alleles should have different background haplotypes, the algorithm searches for an optimal labeling of all the haplotypes with unknown HLA gene types such that the total weight among the same HLA gene types is maximized. To deal with ambiguous haplotype solutions, we use a genetic algorithm to select haplotype configurations that tend to maximize the same optimization criterion.
Our experiments on a previously typed subset of the HapMap data show that the algorithm is highly accurate, achieving an accuracy of 96% for gene HLA-A, 95% for HLA-B, 97% for HLA-C, 84% for HLA-DRB1, 98% for HLA-DQA1 and 97% for HLA-DQB1 in a leave-one-out test. Our algorithm can infer HLA gene types from neighboring SNP genotype data accurately. Compared with a recent approach on the same input data, our algorithm achieved a higher accuracy. The code of our algorithm is available to the public for free upon request to the corresponding authors.
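
The labeling step described in this abstract can be illustrated with a much simpler stand-in. The sketch below is not the authors' algorithm: it swaps their optimal labeling and genetic-algorithm search for a greedy rule that gives each untyped haplotype the HLA label with the largest summed similarity to the typed haplotypes. The similarity function (fraction of matching SNP positions), the function names, and the toy data are all assumptions made for illustration; the abstract does not define the authors' similarity measure.

```python
def similarity(h1, h2):
    """Hypothetical similarity: fraction of positions where two
    equal-length haplotype strings carry the same allele."""
    if len(h1) != len(h2):
        raise ValueError("haplotypes must have equal length")
    return sum(a == b for a, b in zip(h1, h2)) / len(h1)


def greedy_label(haplotypes, known):
    """Assign each untyped haplotype the label with the largest total
    similarity weight to the typed haplotypes.

    haplotypes: dict mapping haplotype id -> SNP string
    known:      dict mapping haplotype id -> known HLA label (non-empty)
    """
    if not known:
        raise ValueError("at least one typed haplotype is required")
    labels = dict(known)
    for hid, seq in haplotypes.items():
        if hid in labels:
            continue
        totals = {}
        for other, allele in known.items():
            weight = similarity(seq, haplotypes[other])
            totals[allele] = totals.get(allele, 0.0) + weight
        # Iterate labels in sorted order so ties resolve deterministically.
        labels[hid] = max(sorted(totals), key=totals.get)
    return labels


# Toy data (invented): two typed and two untyped haplotypes.
toy_haplotypes = {"h1": "0011", "h2": "1100", "h3": "0010", "h4": "1101"}
toy_known = {"h1": "A*01", "h2": "A*02"}
toy_labels = greedy_label(toy_haplotypes, toy_known)
```

A greedy rule like this only looks at edges to typed haplotypes, whereas the abstract describes maximizing the total weight among all haplotypes sharing a label, so the two can disagree on real data.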

  11. Diversity and population structure of Plasmodium falciparum in Thailand based on the spatial and temporal haplotype patterns of the C-terminal 19-kDa domain of merozoite surface protein-1.

    PubMed

    Simpalipan, Phumin; Pattaradilokrat, Sittiporn; Siripoon, Napaporn; Seugorn, Aree; Kaewthamasorn, Morakot; Butcher, Robert D J; Harnyuttanakorn, Pongchai

    2014-02-12

    The 19-kDa C-terminal region of the merozoite surface protein-1 of the human malaria parasite Plasmodium falciparum (PfMSP-1₁₉) constitutes the major component on the surface of merozoites and is considered as one of the leading candidates for asexual blood stage vaccines. Because the protein exhibits a level of sequence variation that may compromise the effectiveness of a vaccine, the global sequence diversity of PfMSP-1₁₉ has been subjected to extensive research, especially in malaria endemic areas. In Thailand, PfMSP-1₁₉ sequences have been derived from a single parasite population in Tak province, located along the Thailand-Myanmar border, since 1995. However, the extent of sequence variation and the spatiotemporal patterns of the MSP-1₁₉ haplotypes along the Thai borders with Laos and Cambodia are unknown. Sixty-three isolates of P. falciparum from five geographically isolated populations along the Thai borders with Myanmar, Laos and Cambodia in three transmission seasons between 2002 and 2008 were collected and culture-adapted. The msp-1 gene block 17 was sequenced and analysed for the allelic diversity, frequency and distribution patterns of PfMSP-1₁₉ haplotypes in individual populations. The PfMSP-1₁₉ haplotype patterns were then compared between parasite populations to infer the population structure and genetic differentiation of the malaria parasite. Five conserved polymorphic positions, which accounted for five distinct haplotypes, of PfMSP-1₁₉ were identified. Differences in the prevalence of PfMSP-1₁₉ haplotypes were detected in different geographical regions, with the highest levels of genetic diversity being found in the Kanchanaburi and Ranong provinces along the Thailand-Myanmar border and Trat province located at the Thailand-Cambodia border.
Despite this variability, the distribution patterns of individual PfMSP-1₁₉ haplotypes seemed to be very similar across the country and over the three malarial transmission seasons, suggesting that gene flow may operate between parasite populations circulating in Thailand and the three neighboring countries. The major MSP-1₁₉ haplotypes of P. falciparum populations in all endemic populations during three transmission seasons in Thailand were identified, providing basic information on the common haplotypes of MSP-1₁₉ that is of use for malaria vaccine development and inferring the population structure of P. falciparum populations in Thailand.

  12. Phylogeny and Haplotype Analysis of Fungi Within the Fusarium incarnatum-equiseti Species Complex.

    PubMed

    Ramdial, H; Latchoo, R K; Hosein, F N; Rampersad, S N

    2017-01-01

    Fusarium spp. are ranked among the top 10 most economically and scientifically important plant-pathogenic fungi in the world and are associated with plant diseases that include fruit decay of a number of crops. Fusarium isolates infecting bell pepper in Trinidad were identified based on sequence comparisons of the translation elongation factor gene (EF-1a) with sequences of Fusarium incarnatum-equiseti species complex (FIESC) verified in the FUSARIUM-ID database. Eighty-two isolates were identified as belonging to one of four phylogenetic species within the subclades FIESC-1, FIESC-15, FIESC-16, and FIESC-26, with the majority of isolates belonging to FIESC-15. A comparison of the level of DNA polymorphism and phylogenetic inference for sequences of the internal transcribed spacer region (ITS1-5.8S-ITS2) and EF-1a sequences for Trinidad and FUSARIUM-ID type species was carried out. The ITS sequences were less informative, had lower haplotype diversity and restricted haplotype distribution, and resulted in poor resolution and taxa placement in the consensus maximum-likelihood tree. EF-1a sequences enabled strongly supported phylogenetic inference with highly resolved branching patterns of the 30 phylogenetic species within the FIESC and placement of representative Trinidad isolates. Therefore, global phylogeny was inferred from EF-1a sequences representing 11 countries, and separation into distinct Incarnatum and Equiseti clades was again evident. In total, 42 haplotypes were identified: 12 were shared and the remaining were unique haplotypes. The most diverse haplotype was represented by sequences from China, Indonesia, Malaysia, and Trinidad and consisted exclusively of F. incarnatum isolates. Spain had the highest haplotype diversity, perhaps because both F. equiseti and F. incarnatum sequences were represented; followed by the United States, which contributed both F. equiseti and F. incarnatum sequences to the data set; then by countries representing Southeast Asia (China, Indonesia, Malaysia, Thailand, and Philippines) and Trinidad; both of these regions were represented by only F. incarnatum sequences. Trinidad shared two haplotypes with China and one haplotype with the United States for only F. incarnatum isolates. The findings of this study are important for devising disease management strategies and for understanding the phylogenetic relationships among members of the FIESC.

  13. Different DRB1*03:01-DQB1*02:01 haplotypes confer different risk for celiac disease.

    PubMed

    Alshiekh, S; Zhao, L P; Lernmark, Å; Geraghty, D E; Naluai, Å T; Agardh, D

    2017-08-01

    Celiac disease is associated with the HLA-DR3-DQA1*05:01-DQB1*02:01 and DR4-DQA1*03:01-DQB1*03:02 haplotypes. In addition, there are currently over 40 non-HLA loci associated with celiac disease. This study extends previous analyses on different HLA haplotypes in celiac disease using next generation targeted sequencing. Included were 143 patients with celiac disease and 135 non-celiac disease controls investigated at median 9.8 years (1.4-18.3 years). PCR-based amplification of HLA and sequencing with Illumina MiSeq technology were used for extended sequencing of the HLA class II haplotypes HLA-DRB1, DRB3, DRB4, DRB5, DQA1 and DQB1, respectively. Odds ratios were computed marginally for every allele and haplotype as the ratio of the allelic frequency in patients to that in controls, that is, as a ratio of exposure rates (RR), compared against a null reference of equal exposure rates in cases and controls. Among the extended HLA haplotypes, the strongest risk haplotype for celiac disease was shown for DRB3*01:01:02 in linkage with DQA1*05:01-DQB1*02:01 (RR = 6.34; P-value < .0001). In a subpopulation analysis, DRB3*01:01:02-DQA1*05:01-DQB1*02:01 remained the most significant in patients with Scandinavian ethnicity (RR = 4.63; P < .0001) whereas DRB1*07:01:01-DRB4*01:03:01-DQA1*02:01-DQB1*02:02:01 presented the highest risk of celiac disease among non-Scandinavians (RR = 7.94; P = .011). The data also revealed 2 distinct celiac disease risk DR3-DQA1*05:01-DQB1*02:01 haplotypes distinguished by either the DRB3*01:01:02 or DRB3*02:02:01 alleles, indicating that different DRB1*03:01-DQB1*02:01 haplotypes confer different risk for celiac disease. The associated risk of celiac disease for DR3-DRB3*01:01:02-DQA1*05:01-DQB1*02:01 is predominant among patients of Scandinavian ethnicity. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
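
The ratio of exposure rates (RR) mentioned in this abstract can be read as a haplotype's frequency among patient chromosomes divided by its frequency among control chromosomes. The short sketch below shows that arithmetic only. The chromosome totals follow from the stated sample sizes (2 × 143 patients, 2 × 135 controls), but the haplotype counts are invented for illustration, and the function name is hypothetical; the study's actual counts and any statistical testing are not given in the abstract.

```python
def frequency_ratio(case_count, case_total, control_count, control_total):
    """Ratio of a haplotype's frequency in cases to its frequency in controls."""
    if case_total <= 0 or control_total <= 0:
        raise ValueError("totals must be positive")
    if control_count == 0:
        raise ValueError("ratio is undefined when the control frequency is zero")
    case_freq = case_count / case_total
    control_freq = control_count / control_total
    return case_freq / control_freq


# Chromosome totals derived from the abstract's sample sizes.
CASE_CHROMOSOMES = 2 * 143
CONTROL_CHROMOSOMES = 2 * 135

# Hypothetical haplotype counts, chosen only to show the calculation.
example_rr = frequency_ratio(60, CASE_CHROMOSOMES, 10, CONTROL_CHROMOSOMES)
```

A ratio of 1 corresponds to the null reference of equal exposure rates in cases and controls; values above 1 indicate a haplotype that is more frequent among patients.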

  14. Wolbachia association with the tsetse fly, Glossina fuscipes fuscipes, reveals high levels of genetic diversity and complex evolutionary dynamics

    PubMed Central

    2013-01-01

    Background: Wolbachia pipientis, a diverse group of α-proteobacteria, can alter arthropod host reproduction and confer a reproductive advantage to Wolbachia-infected females (cytoplasmic incompatibility (CI)). This advantage can alter host population genetics because Wolbachia-infected females produce more offspring with their own mitochondrial DNA (mtDNA) haplotypes than uninfected females. Thus, these host haplotypes become common or fixed (selective sweep). Although simulations suggest that for a CI-mediated sweep to occur, there must be a transient phase with repeated initial infections of multiple individual hosts by different Wolbachia strains, this has not been observed empirically. Wolbachia has been found in the tsetse fly, Glossina fuscipes fuscipes, but it is not limited to a single host haplotype, suggesting that CI did not impact its population structure. However, host population genetic differentiation could have been generated if multiple Wolbachia strains interacted in some populations. Here, we investigated Wolbachia genetic variation in G. f. fuscipes populations of known host genetic composition in Uganda. We tested for the presence of multiple Wolbachia strains using Multi-Locus Sequence Typing (MLST) and for an association between geographic region and host mtDNA haplotype using Wolbachia DNA sequence from a variable locus, groEL (heat shock protein 60). Results: MLST demonstrated that some G. f. fuscipes carry Wolbachia strains from two lineages. GroEL revealed high levels of sequence diversity within and between individuals (Haplotype diversity = 0.945). We found Wolbachia associated with 26 host mtDNA haplotypes, an unprecedented result. We observed a geographical association of one Wolbachia lineage with southern host mtDNA haplotypes, but it was non-significant (p = 0.16). Though most Wolbachia-infected host haplotypes were those found in the contact region between host mtDNA groups, this association was non-significant (p = 0.17).
Conclusions: High Wolbachia sequence diversity and the association of Wolbachia with multiple host haplotypes suggest that different Wolbachia strains infected G. f. fuscipes multiple times independently. We suggest that these observations reflect a transient phase in Wolbachia evolution that is influenced by the long gestation and low reproductive output of tsetse. Although G. f. fuscipes is superinfected with Wolbachia, our data do not support the hypothesis that bidirectional CI has influenced host genetic diversity in Uganda. PMID:23384159

  15. Improved nuclear fuel assembly grid spacer

    DOEpatents

    Marshall, John; Kaplan, Samuel

    1977-01-01

    An improved fuel assembly grid spacer and a method of retaining the basic fuel rod support elements in position within the fuel assembly containment channel are described. The improvement involves attaching the grids to the hexagonal channel and forming the basic fuel rod support elements into a grid structure, which yields a design that is insensitive to potential channel distortion (ballooning) at high fluence levels. In addition, the improved method eliminates problems associated with component fabrication and assembly.

  16. New and Common Haplotypes Shape Genetic Diversity in Asian Tiger Mosquito Populations from Costa Rica and Panamá.

    PubMed

    Futami, K; Valderrama, A; Baldi, M; Minakawa, N; Marín Rodríguez, R; Chaves, L F

    2015-04-01

    The Asian tiger mosquito, Aedes albopictus (Skuse) (Diptera: Culicidae), is a vector of several human pathogens. Ae. albopictus is also an invasive species that, over recent years, has expanded its range out of its native Asia. Ae. albopictus was suspected to be present in Central America since the 1990s, and its presence was confirmed by most Central American nations by 2010. Recently, this species has been regularly found, yet in low numbers, in limited areas of Panamá and Costa Rica (CR). Here, we report that short sequences (∼558 bp) of the mitochondrial cytochrome oxidase subunit 1 (COI) and NADH dehydrogenase subunit 5 genes of Ae. albopictus, had no haplotype diversity. Instead, there was a common haplotype for each gene in both CR and Panamá. In contrast, a long COI sequence (∼1,390 bp) revealed that haplotype diversity (±SD) was relatively high in CR (0.72±0.04) when compared with Panamá (0.33±0.13), below the global estimate for reported samples (0.89±0.01). The long COI sequence allowed us to identify seven (five new) haplotypes in CR and two (one new) in Panamá. A haplotype network for the long COI gene sequence showed that samples from CR and Panamá belong to a single large group. The long COI gene sequences suggest that haplotypes in Panamá and CR, although similar to each other, had a significant geographic differentiation (Kst=1.33; P<0.001). Thus, most of our results suggest a recent range expansion in CR and Panamá. © The Authors 2015. Published by Oxford University Press on behalf of Entomological Society of America. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
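
Haplotype diversity values such as those quoted in this abstract are commonly computed with Nei's unbiased estimator, H = n/(n-1) × (1 - Σ pᵢ²), where n is the number of sampled sequences and pᵢ is the sample frequency of haplotype i. The abstract does not say which estimator the authors used, so the sketch below is an assumption on that point, and the sample labels are invented for illustration.

```python
from collections import Counter


def haplotype_diversity(haplotypes):
    """Nei's unbiased haplotype diversity for a list of haplotype labels.

    H = n / (n - 1) * (1 - sum(p_i ** 2)), where p_i is the sample
    frequency of haplotype i and n is the sample size.
    """
    n = len(haplotypes)
    if n < 2:
        raise ValueError("at least two sequences are required")
    counts = Counter(haplotypes)
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1.0 - sum_sq)


# Invented samples: one with a single shared haplotype, one mixed.
single_haplotype_sample = ["H1"] * 10
mixed_sample = ["H1", "H1", "H2", "H2"]

diversity_single = haplotype_diversity(single_haplotype_sample)
diversity_mixed = haplotype_diversity(mixed_sample)
```

Under this estimator a sample in which every sequence carries the same haplotype has a diversity of zero, which matches the abstract's description of the short COI and ND5 fragments, while a sample in which every sequence is distinct has a diversity of one.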

  17. Frequency distribution of interleukin-10 haplotypes (-1082 A>G, -819 C>T, and -592 C>A) in a Mexican population.

    PubMed

    Vázquez-Villamar, M; Palafox-Sánchez, C A; Hernández-Bello, J; Muñoz-Valle, J F; Valle, Y; Cruz, A; Alatorre-Meza, A I; Oregon-Romero, E

    2016-11-03

    Interleukin 10 (IL-10) is an immunoregulatory cytokine with multiple roles in the immune system. Three single nucleotide polymorphisms at positions -1082 (A>G), -819 (C>T), and -592 (C>A) in the promoter region of the IL10 gene are believed to be associated with different inflammatory, infectious, and autoimmune diseases. These polymorphisms exhibit a strong linkage disequilibrium (LD) and form three principal haplotypes (GCC, ACC, and ATA). The GCC and ATA haplotypes have been associated with high and low levels of IL-10 production, respectively. The aim of this study was to establish the allele and haplotype frequencies of the IL10 polymorphisms in Mestizos from western Mexico. SNPs were analyzed in 340 healthy unrelated Mestizos from western Mexico by polymerase chain reaction-restriction fragment length polymorphism. The studied population differed significantly in the distribution of IL10 polymorphisms from the Asian, African, and European populations. We also observed a strong LD within -1082 A>G, -819 C>T, and -592 C>A (100% pc = 7.735 × 10⁻¹⁸). The haplotypes ACC (45.4%), ATA (22.0%), GTA (14.9%), and GCC (13.9%) were most frequently observed in this population. The haplotype frequencies, however, differed from those reported previously in Mestizos from central Mexico, Asians, Africans, and European Caucasians, suggesting a differential gene flow in the Mexican Mestizo population. This could account for the genetic variability between Mexicans and populations of other ethnicities. The study of these polymorphisms and their haplotypes could help in expanding our knowledge to design future disease-risk studies on the western Mexican population.

  18. Tracing the route of modern humans out of Africa by using 225 human genome sequences from Ethiopians and Egyptians.

    PubMed

    Pagani, Luca; Schiffels, Stephan; Gurdasani, Deepti; Danecek, Petr; Scally, Aylwyn; Chen, Yuan; Xue, Yali; Haber, Marc; Ekong, Rosemary; Oljira, Tamiru; Mekonnen, Ephrem; Luiselli, Donata; Bradman, Neil; Bekele, Endashaw; Zalloua, Pierre; Durbin, Richard; Kivisild, Toomas; Tyler-Smith, Chris

    2015-06-04

    The predominantly African origin of all modern human populations is well established, but the route taken out of Africa is still unclear. Two alternative routes, via Egypt and Sinai or across the Bab el Mandeb strait into Arabia, have traditionally been proposed as feasible gateways in light of geographic, paleoclimatic, archaeological, and genetic evidence. Distinguishing among these alternatives has been difficult. We generated 225 whole-genome sequences (225 at 8× depth, of which 8 were increased to 30×; Illumina HiSeq 2000) from six modern Northeast African populations (100 Egyptians and five Ethiopian populations each represented by 25 individuals). West Eurasian components were masked out, and the remaining African haplotypes were compared with a panel of sub-Saharan African and non-African genomes. We showed that masked Northeast African haplotypes overall were more similar to non-African haplotypes and more frequently present outside Africa than were any sets of haplotypes derived from a West African population. Furthermore, the masked Egyptian haplotypes showed these properties more markedly than the masked Ethiopian haplotypes, pointing to Egypt as the more likely gateway in the exodus to the rest of the world. Using five Ethiopian and three Egyptian high-coverage masked genomes and the multiple sequentially Markovian coalescent (MSMC) approach, we estimated the genetic split times of Egyptians and Ethiopians from non-African populations at 55,000 and 65,000 years ago, respectively, whereas that of West Africans was estimated to be 75,000 years ago. Both the haplotype and MSMC analyses thus suggest a predominant northern route out of Africa via Egypt. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.

  19. Novel pfdhps Haplotypes among Imported Cases of Plasmodium falciparum Malaria in the United Kingdom

    PubMed Central

    Sutherland, Colin J.; Fifer, Helen; Pearce, Richard J.; bin Reza, Faisal; Nicholas, Meredydd; Haustein, Thomas; Njimgye-Tekumafor, Njah E.; Doherty, Justin F.; Gothard, Philip; Polley, Spencer D.; Chiodini, Peter L.

    2009-01-01

    Treatment of acute malaria caused by Plasmodium falciparum may include long-half-life drugs, such as the antifolate combination sulfadoxine-pyrimethamine (SP), to provide posttreatment chemoprophylaxis against parasite recrudescence or delayed emergence from the liver. An unusual case of P. falciparum recrudescence in a returned British traveler who received such a regimen, as well as a series of 44 parasite isolates from the same hospital, was analyzed by PCR and direct DNA sequencing for the presence of markers of parasite resistance to chloroquine and antifolates. The index patient harbored a mixture of wild-type and resistant pfdhfr and pfdhps alleles upon initial presentation. During his second malaria episode, he harbored only resistant parasites, with the haplotypes IRNI (codons 51, 59, 108, and 164) and SGEAA (codons 436, 437, 540, 581, and 613) at these two loci, respectively. Analysis of isolates from 44 other patients showed that the pfdhfr haplotype IRNI was common (found in 81% of cases). The SGEAA haplotype of pfdhps was uncommon (found only in eight cases of East African origin [17%]). A previously undescribed mutation, I431V, was observed for seven cases of Nigerian origin, occurring as one of two haplotypes, VAGKGS or VAGKAA. The presence of this mutation was also confirmed in isolates of Nigerian origin from the United Kingdom Malaria Reference Laboratory. The presence of the pfdhps haplotype SGEAA in P. falciparum parasites of East African origin appears to compromise the efficacy of treatment regimens that include SP as a means to prevent recrudescence. Parasites with novel pfdhps haplotypes are circulating in West Africa. The response of these parasites to chemotherapy needs to be evaluated. PMID:19433569

  20. COL4A3 founder mutations in Greek-Cypriot families with thin basement membrane nephropathy and focal segmental glomerulosclerosis dating from around 18th century.

    PubMed

    Voskarides, Konstantinos; Patsias, Charalampos; Pierides, Alkis; Deltas, Constantinos

    2008-06-01

    Mutations in the COL4A3/COL4A4 genes of type IV collagen account for about 40% of cases of thin basement membrane nephropathy, a condition that is estimated to affect 1% or more of the general population. We recently described 10 Cypriot families with familial hematuria and thin basement membrane nephropathy in the presence of focal segmental glomerulosclerosis, with founder mutations in the COL4A3 gene. Seven of the families carried mutation G1334E on haplotype K, and another three carried mutation G871C on haplotype Ky. In this report we performed extension of the haplotypes with additional polymorphic markers, 12 for haplotype K and 22 for haplotype Ky, to estimate the linkage disequilibrium value between the mutation and flanking noncommon markers. Haplotype Ky extended to 13.71 Mb, but we did not attempt further analysis owing to the small number of chromosomes. Haplotype K extended to 3.83 Mb, thereby suggesting that it was a much older event compared to mutation G871C. Mutation G1334E was calculated to be about 5-10 generations old with a possible origin between 1693 and 1818 AD, during the Ottoman rule of the island. Both mutations are clustered in specific geographic regions with apparently formerly isolated populations, although mutation G1334E has been detected elsewhere on the island. The identification of founder mutations in large families with microscopic hematuria greatly facilitates presymptomatic diagnosis and provides useful information on the history of the population, while it may also assist in association studies in search for disease modifier genes.

  1. HLA-DR2-associated DRB1 and DRB5 alleles and haplotypes in Koreans.

    PubMed

    Song, E Y; Kang, S J; Lee, Y J; Park, M H

    2000-09-01

    There are considerable racial differences in the distribution of HLA-DR2-associated DRB1 and DRB5 alleles and the characteristics of linkage disequilibrium between these alleles. In this study, the frequencies of DR2-associated DRB1 and DRB5 alleles and related haplotypes were analyzed in 186 DR2-positive individuals out of 800 normal Koreans registered as unrelated bone marrow donors. HLA class I antigen typing was performed by the serological method and DRB1 and DRB5 genotyping by the PCR-single strand conformational polymorphism method. Only 3 alleles were detected for DR2-associated DRB1 and DRB5 genes, respectively: DRB1*1501 (gene frequency 8.0%), *1502 (3.2%), *1602 (0.9%); DRB5*0101 (8.0%), *0102 (3.2%), and *0202 (0.9%). DRB1-DRB5 haplotype analysis showed an exclusive association between these alleles: DRB1*1501-DRB5*0101 (haplotype frequency 8.0%), DRB1*1502-DRB5*0102 (3.2%), and DRB1*1602-DRB5*0202 (0.9%). The 5 most common DR2-associated A-B-DRB1 haplotypes occurring at frequencies of ≥ 0.5% were A24-B52-DRB1*1502 (1.8%), A2-B62-DRB1*1501, A2-B54-DRB1*1501, A26-B61-DRB1*1501, and A24-B51-DRB1*1501. The remarkable homogeneity in the haplotypic associations between DR2-associated DRB1 and DRB5 alleles in Koreans would be advantageous for organ transplantation compared with other ethnic groups showing considerable heterogeneity in the distribution of DRB1-DRB5 haplotypes.

  2. The CRHR1 Gene, Trauma Exposure, and Alcoholism Risk: A Test of G × E Effects

    PubMed Central

    Ray, Lara A.; Sehl, Mary; Bujarski, Spencer; Hutchison, Kent; Blaine, Sara; Enoch, Mary-Anne

    2014-01-01

    The corticotropin-releasing hormone type I receptor (CRHR1) gene has been implicated in the liability for neuropsychiatric disorders, particularly under conditions of stress. Based on the hypothesized effects of CRHR1 variation on stress reactivity, measures of adulthood traumatic stress exposure were analyzed for their interaction with CRHR1 haplotypes and SNPs in predicting the risk for alcoholism. Phenotypic data on 2,533 non-related Caucasian individuals (1167 alcoholics and 1366 controls) were culled from the publicly available Study of Addiction: Genetics and Environment (SAGE) genome-wide association study (GWAS). Genotypes were available for 19 tag SNPs. Logistic regression models examined the interaction between CRHR1 haplotypes / SNPs and adulthood traumatic stress exposure in predicting alcoholism risk. Two haplotype blocks spanned CRHR1. Haplotype analyses identified one haplotype in the proximal block 1 (p = 0.029) and two haplotypes in the distal block 2 (p = 0.026, 0.042) that showed nominally significant (corrected p < .025) genotype × traumatic stress interactive effects on the likelihood of developing alcoholism. The block 1 haplotype effect was driven by SNPs rs110402 (p = 0.019) and rs242924 (p = 0.019). In block 2, rs17689966 (p = 0.018) showed significant, and rs173365 (p = 0.026) showed nominally significant, gene × environment (G × E) effects on alcoholism status. This study extends the literature on the interplay between CRHR1 variation and alcoholism, in the context of exposure to traumatic stress. These findings are consistent with the hypothesized role of extrahypothalamic CRF system dysregulation in the initiation and maintenance of alcoholism. Molecular and experimental studies are needed to more fully understand the mechanisms of risk and protection conferred by genetic variation at the identified loci. PMID:23473364

  3. Two independent apolipoprotein A5 haplotypes influence human plasma triglyceride levels.

    PubMed

    Pennacchio, Len A; Olivier, Michael; Hubacek, Jaroslav A; Krauss, Ronald M; Rubin, Edward M; Cohen, Jonathan C

    2002-11-15

    The recently identified apolipoprotein A5 gene (APOA5) has been shown to play an important role in determining plasma triglyceride concentrations in humans and mice. We previously identified an APOA5 haplotype (designated APOA5*2) that is present in approximately 16% of Caucasians and is associated with increased plasma triglyceride concentrations. In this report we describe another APOA5 haplotype (APOA5*3) containing the rare allele of the single nucleotide polymorphism c.56C>G that changes serine to tryptophan at codon 19 and is independently associated with high plasma triglyceride levels in three different populations. In a sample of 264 Caucasian men and women with plasma triglyceride concentrations above the 90th percentile or below the 10th percentile, the APOA5*3 haplotype was more than three-fold more common in the group with high plasma triglyceride levels. In a second independently ascertained sample of Caucasian men and women (n=419) who were studied while consuming their self-selected diets as well as after high-carbohydrate diets and high-fat diets, the APOA5*3 haplotype was associated with increased plasma triglyceride levels on all three dietary regimens. In a third population comprising 2660 randomly selected individuals, the APOA5*3 haplotype was found in 12% of Caucasians, 14% of African-Americans and 28% of Hispanics and was associated with increased plasma triglyceride levels in both men and women in each ethnic group. These findings establish that the APOA5 locus contributes significantly to inter-individual variation in plasma triglyceride levels in humans. Together, the APOA5*2 and APOA5*3 haplotypes are found in 25-50% of African-Americans, Hispanics and Caucasians and support the contribution of common human variation to quantitative phenotypes in the general population.

  4. High-resolution HLA allele and haplotype frequencies in majority and minority populations of Costa Rica and Nicaragua: Differential admixture proportions in neighboring countries.

    PubMed

    Arrieta-Bolaños, E; Madrigal-Sánchez, J J; Stein, J E; Órlich-Pérez, P; Moreira-Espinoza, M J; Paredes-Carias, E; Vanegas-Padilla, Y; Salazar-Sánchez, L; Madrigal, J A; Marsh, S G E; Shaw, B E

    2018-06-01

    The HLA system shows the most extensive polymorphism in the human genome. Allelic and haplotypic frequencies of HLA genes vary dramatically across human populations. Due to a complex history of migration, populations in Latin America show a broad variety of admixture proportions, usually varying not only between countries, but also within countries. Knowledge of HLA allele and haplotype frequencies is essential for medical fields such as transplantation, but also serves as a means to assess genetic diversity and ancestry in human populations. Here, we have determined high-resolution HLA-A, -B, -C, and -DRB1 allele and haplotype frequencies in a sample of 713 healthy subjects from three Mestizo populations, one population of African descent, and Amerindians of five different groups from Costa Rica and Nicaragua and compared their profiles to a large set of indigenous populations from Iberia, Sub-Saharan Africa, and the Americas. Our results show a great degree of allelic and haplotypic diversity within and across these populations, with most extended haplotypes being private. Mestizo populations show alleles and haplotypes of putative European, Amerindian, and Sub-Saharan African origin, albeit with differential proportions. Despite some degree of gene flow, Amerindians and Afro-descendants show great similarity to other Amerindian and West African populations, respectively. This is the first comprehensive study reporting high-resolution HLA diversity in Central America, and its results will shed light on the genetic history of this region while also supporting the development of medical programs for organ and stem cell transplantation. © 2018 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  5. Lower frequency of the HLA-G UTR-4 haplotype in women with unexplained recurrent miscarriage.

    PubMed

    Meuleman, T; Drabbels, J; van Lith, J M M; Dekkers, O M; Rozemuller, E; Cretu-Stancu, M; Claas, F H J; Bloemenkamp, K W M; Eikmans, M

    2018-04-01

    HLA-G expressed by trophoblasts at the fetal-maternal interface and its soluble form have immunomodulatory effects. HLA-G expression depends on the combination of DNA polymorphisms. We hypothesized that combinations of specific single nucleotide polymorphisms (SNPs) in the 3' untranslated region (3'UTR) of HLA-G play a role in unexplained recurrent miscarriage. In a case-control design, 100 cases with at least three unexplained consecutive miscarriages prior to the 20th week of gestation were included. Cases were younger than 36 years at the time of the third miscarriage and had conceived all their pregnancies with the same partner. The control group included 89 women with an uneventful pregnancy. The association of HLA-G 3'UTR SNPs and specific HLA-G haplotypes with recurrent miscarriage was studied with logistic regression. Odds ratios (OR) and 95% confidence intervals (95% CI) were reported. Individual SNPs were not significantly associated with recurrent miscarriage after correction for multiple comparisons. However, the presence of the UTR-4 haplotype, which included +3003C, was significantly lower in women with recurrent miscarriage (OR 0.4, 95% CI 0.2-0.8, p = 0.015). In conclusion, this is the first study to perform a comprehensive analysis of HLA-G SNPs and HLA-G haplotypes in a well-defined group of women with recurrent miscarriage and women with uneventful pregnancy. The UTR-4 haplotype was less frequently observed in women with recurrent miscarriage, suggesting an immunoregulatory role of this haplotype for continuation of the pregnancy without complications. Thus, association of HLA-G with recurrent miscarriage is not related to single polymorphisms in the 3'UTR, but is rather dependent on haplotypes. Copyright © 2018 Elsevier B.V. All rights reserved.
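
For readers unfamiliar with how an odds ratio and its 95% confidence interval arise from case-control counts, the following is a minimal sketch. It assumes a simple 2x2 table and a Wald interval on the log scale, which is what an unadjusted logistic regression with one binary predictor yields. The function name `odds_ratio_ci` and the counts are hypothetical illustrations; they are not taken from the study, which does not report carrier counts in its abstract.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio and Wald confidence interval from a 2x2 table.

    a: carriers among cases       b: non-carriers among cases
    c: carriers among controls    d: non-carriers among controls
    z: normal quantile (1.96 gives an approximate 95% interval)
    """
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) for a 2x2 table.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts, invented purely for illustration.
or_value, ci_low, ci_high = odds_ratio_ci(a=20, b=80, c=40, d=60)
print(f"OR = {or_value:.3f}, 95% CI {ci_low:.3f} to {ci_high:.3f}")
```

An interval that lies entirely below 1, as in this invented example, corresponds to a haplotype observed less often among cases than among controls.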

  6. Haploidentical and Matched Sibling Donor Hematopoietic Cell Transplantation for Patients with HLA-Homozygous Haplotypes.

    PubMed

    Kanda, Junya; Ikegame, Kazuhiro; Fuji, Shigeo; Kurokawa, Mineo; Kanamori, Heiwa; Fukuda, Takahiro; Ohashi, Kazuteru; Ishikawa, Jun; Ogawa, Hiroyasu; Inoue, Masami; Ichinohe, Tatsuo; Atsuta, Yoshiko; Kanda, Yoshinobu

    2016-11-01

    More than 1% of the Japanese population has HLA-homozygous haplotypes. For patients with such haplotypes, HLA-haploidentical family members who have no HLA mismatch in the graft-versus-host direction are readily available donor candidates for hematopoietic cell transplantation (HCT). In this study, the outcomes of patients with homozygous HLA-A, -B, and -DRB1 antigens who received HCT without T cell depletion from a haploidentical related donor with mismatches in the host-versus-graft direction only (hetero-to-homo, n = 78) or from an HLA-matched sibling donor (MSD) (MSD-homo, n = 153) were compared with those in patients with heterozygous haplotypes who received HCT from an MSD (MSD-hetero, n = 7242). Transplant outcomes in the hetero-to-homo group were similar to those in the MSD-hetero group regarding neutrophil engraftment, grades III to IV acute graft-versus-host disease (aGVHD), nonrelapse mortality (NRM), relapse, and overall survival. On the other hand, the incidences of severe aGVHD and NRM in the MSD-homo group were significantly lower than those in the MSD-hetero group (grades III to IV aGVHD: aHR .50, P = .034; NRM: aHR .48, P = .004). In conclusion, patients with HLA-homozygous haplotypes achieved lower GVHD and NRM rates for MSD transplantation than those with HLA-heterozygous haplotypes. When an MSD or an appropriate alternative donor is not available for patients with HLA-homozygous haplotypes who need immediate transplantation, transplantation from a haploidentical donor without T cell depletion is a viable option, given the comparable transplant outcomes for hetero-to-homo HCT and MSD-hetero HCT. Copyright © 2016 The American Society for Blood and Marrow Transplantation. Published by Elsevier Inc. All rights reserved.

  7. Phylogeographic study of Chinese seabuckthorn (Hippophae rhamnoides subsp. sinensis Rousi) reveals two distinct haplotype groups and multiple microrefugia on the Qinghai-Tibet Plateau

    PubMed Central

    Wang, Hongfang; Liu, Han; Yang, Mingbo; Bao, Lei; Ge, Jianping

    2014-01-01

    Historical climate change can shape the genetic pattern of a species. Studies on this phenomenon provide great advantage in predicting the response of species to current and future global climate change. Chinese seabuckthorn (Hippophae rhamnoides subsp. sinensis) is one of the most important cultivated plants in Northwest China. However, the subspecies history and the potential genetic resources within the subspecies range remain unclear. In this study, we utilized two intergenic chloroplast regions to characterize the spatial genetic distribution of the species. We found 19 haplotypes in total, 12 of which were unique to the Chinese seabuckthorn. The populations observed on the Qinghai-Tibet Plateau (QTP) consisted of most of the haplotypes, while in the northeast of the range of the subspecies, an area not on the QTP, only four haplotypes were detected. Our study also revealed two distinct haplotype groups of the subspecies with a sharp transition region located in the south of the Zoige Basin. 89.96% of the genetic variation was located between the regions. Mismatch analysis indicated old expansions of these two haplotype groups, approximately around the early stage of the Pleistocene. Additional morphological evidence from existing studies and habitat differentiation supported a long independent colonization history between the two regions. Adaptation probably occurred, but confirming this will require more genomic and morphological data in the future. Chinese seabuckthorn has an older population expansion than the subspecies in Europe. The lack of large land ice sheets and the heterogeneous landscape of the QTP could have provided extensive microrefugia for Chinese seabuckthorn during the glaciation period. Multiple localities sustaining high-frequency private haplotypes support this hypothesis. Our study gives clear insight into the distribution of genetic resources and the evolutionary history of Chinese seabuckthorn. PMID:25540697

  8. In vitro and ex vivo analysis of CHRNA3 and CHRNA5 haplotype expression.

    PubMed

    Doyle, Glenn A; Wang, Min-Jung; Chou, Andrew D; Oleynick, John U; Arnold, Steven E; Buono, Russell J; Ferraro, Thomas N; Berrettini, Wade H

    2011-01-01

    Genome-wide association studies implicate variations in CHRNA5 and CHRNA3 as being associated with nicotine addiction (NA). Multiple common haplotypes ("risk", "mixed" and "protective") exist in Europeans; however, high linkage disequilibrium between variations in CHRNA5 and CHRNA3 makes assigning causative allele(s) for NA difficult through genotyping experiments alone. We investigated whether CHRNA5 or CHRNA3 promoter haplotypes, associated previously with NA, might influence allelic expression levels. For in vitro analyses, promoter haplotypes were sub-cloned into a luciferase reporter vector. When assessed in BE(2)-C cells, luciferase expression was equivalent among CHRNA3 haplotypes, but the combination of deletion at rs3841324 and variation at rs503464 decreased CHRNA5 promoter-derived luciferase activity, possibly due to loss of an SP-1 and other site(s). Variation within the CHRNA5 5'UTR at rs55853698 and rs55781567 also altered luciferase expression in BE(2)-C cells. Allelic expression imbalance (AEI) from the "risk" or "protective" haplotypes was assessed in post-mortem brain tissue from individuals heterozygous at coding polymorphisms in CHRNA3 (rs1051730) or CHRNA5 (rs16969968). In most cases, equivalent allelic expression was observed; however, one individual showed CHRNA5 AEI that favored the "protective" allele and that was concordant with heterozygosity at polymorphisms ∼13.5 kb upstream of the CHRNA5 transcription start site. Putative enhancer activity from these distal promoter elements was assessed using heterologous promoter constructs. We observed no differences in promoter activity from the two distal promoter haplotypes examined, but found that the distal promoter region strongly repressed transcription. We conclude that CHRNA5 promoter variants may affect relative risk for NA in some heterozygous individuals.

  9. Wide-ranging phylogeographic structure of invasive red lionfish in the Western Atlantic and Greater Caribbean

    USGS Publications Warehouse

    Butterfield, John S.; Díaz-Ferguson, Edgardo; Silliman, Brian R.; Saunders, Jonathan W.; Buddo, Dayne; Mignucci-Giannoni, Antonio A.; Searle, Linda; Allen, Aarin Conrad; Hunter, Margaret E.

    2015-01-01

    The red lionfish (Pterois volitans) is an invasive predatory marine fish that has rapidly expanded its presence in the Western Hemisphere. We collected 214 invasive red lionfish samples from nine countries and territories, including seven unpublished locations. To more comprehensively evaluate connectivity, we compiled our d-loop sequence data with 846 published sequences, resulting in 1,060 samples from 14 locations. We found low nucleotide diversity (π = 0.003) and moderate haplotype diversity (h = 0.59). Using haplotype population pairwise ΦST tests, we analyzed possible phylogeographic breaks that were previously proposed based on other reef organisms. We found support for the Bahamas/Turks/Caicos versus Caribbean break (ΦST = 0.12) but not for the Northwestern Caribbean, Eastern Caribbean, or US East Coast versus Bahamas breaks. The Northern Region had higher variation and more haplotypes, supporting introductions of at least five haplotypes to the region. Our wide-ranging samples showed that a lower-frequency haplotype in the Northern Region dominated the Southern Region and suggested multiple introductions, possibly to the south. We tested multiple scenarios of phylogeographic structure with analyses of molecular variance and found support for a Northern and Southern Region split at the Bahamas/Turks/Caicos versus Caribbean break (percentage of variation among regions = 8.49 %). We found that Puerto Rico clustered with the Southern Region more strongly than with the Northern Region, as opposed to previous reports. We also found the rare haplotype H03 for the first time in the southern Caribbean (Panama), indicating that either secondary releases occurred or that the low-frequency haplotypes have had time to disperse to extreme southern Caribbean locations.
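
The two summary statistics quoted in the abstract above, haplotype diversity (h) and nucleotide diversity (π), can be illustrated with a short sketch. It assumes Nei's standard estimators: h = n/(n-1) · (1 - Σ pᵢ²) over haplotype frequencies, and π as the mean number of pairwise differences per site over aligned sequences of equal length. The function names and the four toy sequences are hypothetical and are not lionfish data.

```python
from collections import Counter
from itertools import combinations


def haplotype_diversity(seqs):
    """Nei's haplotype diversity: n/(n-1) * (1 - sum of squared frequencies)."""
    n = len(seqs)
    counts = Counter(seqs)
    sum_sq = sum((count / n) ** 2 for count in counts.values())
    return n / (n - 1) * (1 - sum_sq)


def nucleotide_diversity(seqs):
    """Mean pairwise differences per site across all pairs of aligned sequences."""
    n = len(seqs)
    length = len(seqs[0])
    total_diffs = sum(
        sum(x != y for x, y in zip(s, t)) for s, t in combinations(seqs, 2)
    )
    n_pairs = n * (n - 1) / 2
    return total_diffs / n_pairs / length


# Toy aligned sequences, invented for illustration only.
toy = ["ACGT", "ACGT", "ACGA", "TCGA"]
h = haplotype_diversity(toy)
pi = nucleotide_diversity(toy)
print(f"h = {h:.4f}, pi = {pi:.4f}")
```

A sample can combine moderate h with low π when many haplotypes differ from one another by only one or two substitutions.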

  10. Origins of Host-Specific Populations of the Blast Pathogen Magnaporthe oryzae in Crop Domestication With Subsequent Expansion of Pandemic Clones on Rice and Weeds of Rice

    PubMed Central

    Couch, Brett C.; Fudal, Isabelle; Lebrun, Marc-Henri; Tharreau, Didier; Valent, Barbara; van Kim, Pham; Nottéghem, Jean-Loup; Kohn, Linda M.

    2005-01-01

    Rice, as a widely and intensively cultivated crop, should be a target for parasite host shifts and a source for shifts to co-occurring weeds. Magnaporthe oryzae, of the M. grisea species complex, is the most important fungal pathogen of rice, with a high degree of host specificity. On the basis of 10 loci from six of its seven linkage groups, 37 multilocus haplotypes among 497 isolates of M. oryzae from rice and other grasses were identified. Phylogenetic relationships among isolates from rice (Oryza sativa), millet (Setaria spp.), cutgrass (Leersia hexandra), and torpedo grass (Panicum repens) were predominantly tree like, consistent with a lack of recombination, but from other hosts were reticulate, consistent with recombination. The single origin of rice-infecting M. oryzae followed a host shift from a Setaria millet and was closely followed by additional shifts to weeds of rice, cutgrass, and torpedo grass. Two independent estimators of divergence time indicate that these host shifts predate the Green Revolution and could be associated with rice domestication. The rice-infecting lineage is characterized by high copy number of the transposable element MGR586 (Pot3) and, except in two haplotypes, by a loss of AVR-Co39. Both mating types have been retained in ancestral, well-distributed rice-infecting haplotypes 10 (mainly temperate) and 14 (mainly tropical), but only one mating type was recovered from several derived, geographically restricted haplotypes. There is evidence of a common origin of both ACE1 virulence genotypes in haplotype 14. Host-haplotype association is evidenced by low pathogenicity on hosts associated with other haplotypes. PMID:15802503

  11. Genetic Population Structure of Dastarcus helophoroides (Coleoptera: Bothrideridae) From Different Long-Horned Beetle Hosts Based on Complete Sequences of Mitochondrial COI.

    PubMed

    Zhang, Zhengqing; Chang, Yong; Li, Menglou

    2017-06-01

    Dastarcus helophoroides (Fairmaire) (Coleoptera: Bothrideridae) is an important natural enemy of long-horned beetles in China, Japan, and Korea. In this study, the genetic sequence of cytochrome oxidase subunit I was used to investigate the genetics and relationships within and among D. helophoroides populations collected from five different geographic locations. We used principal component analysis, heatmap, and Venn diagram results to determine the relationship between haplotypes and populations. In total, 26 haplotypes with 51 nucleotide polymorphic sites were defined, and low genetic diversity was found among the different populations. Significant genetic variations were observed mainly within populations, and no correlation was found between genetic distribution and geographical distance. Low pairwise fixation index values (-0.01424 to 0.04896) and high gene flows show that there was high gene exchange between populations. The codistributed haplotype DH01 was suggested to be the most ancestral haplotype, and other haplotypes were thought to have evolved from it through several mutations. In four of the populations, both common haplotypes (DH01, DH03, and DH22) and unique haplotypes were found. Low genetic diversity among different populations is related to a relatively high flight capacity, host movement, and human-aided dispersal of D. helophoroides. The high gene exchange and typically weak population genetic structure among five populations, especially among populations of Anoplophora glabripennis (Motschulsky), Monochamus alternatus (Hope), and Massicus raddei (Blessig), may suggest that these populations cross naturally in the field. © The Authors 2017. Published by Oxford University Press on behalf of Entomological Society of America. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  12. Y-chromosomal analysis of Greek Cypriots reveals a primarily common pre-Ottoman paternal ancestry with Turkish Cypriots

    PubMed Central

    Fernández-Domínguez, Eva; Bertoncini, Stefania; Chimonas, Marios; Christofi, Vasilis; King, Jonathan; Budowle, Bruce; Manoli, Panayiotis

    2017-01-01

    Genetics can provide invaluable information on the ancestry of the current inhabitants of Cyprus. A Y-chromosome analysis was performed to (i) determine paternal ancestry among the Greek Cypriot (GCy) community in the context of the Central and Eastern Mediterranean and the Near East; and (ii) identify genetic similarities and differences between Greek Cypriots (GCy) and Turkish Cypriots (TCy). Our haplotype-based analysis has revealed that GCy and TCy patrilineages derive primarily from a single gene pool and show very close genetic affinity (low genetic differentiation) to Calabrian Italian and Lebanese patrilineages. In terms of more recent (past millennium) ancestry, as indicated by Y-haplotype sharing, GCy and TCy share many more haplotypes with each other than with any surrounding population (7–8% of total haplotypes shared), while TCy also share around 3% of haplotypes with mainland Turks, and to a lesser extent with North Africans. In terms of Y-haplogroup frequencies, again GCy and TCy show very similar distributions, with the predominant haplogroups in both being J2a-M410, E-M78, and G2-P287. Overall, GCy also have a similar Y-haplogroup distribution to non-Turkic Anatolian and Southwest Caucasian populations, as well as Cretan Greeks. TCy show a slight shift towards Turkish populations, due to the presence of Eastern Eurasian (some of which are of possible Ottoman origin) Y-haplogroups. Overall, the Y-chromosome analysis performed, using both Y-STR haplotype and binary Y-haplogroup data, puts Cypriots in the middle of a genetic continuum stretching from the Levant to Southeast Europe and reveals that despite some differences in haplotype sharing and haplogroup structure, Greek Cypriots and Turkish Cypriots share primarily a common pre-Ottoman paternal ancestry. PMID:28622394

  13. [Association between the methylenetetrahydrofolate reductase gene polymorphisms and haplotype with toxicity response of high dose methotrexate chemotherapy].

    PubMed

    Liao, Qing-Chuan; Li, Xiao-Lei; Liu, Si-Ting; Zhang, Yong; Li, Tian-Yuan; Qiu, Jin-Chun

    2012-07-01

    To investigate the association of single nucleotide polymorphisms (SNPs) and haplotypes of the methylenetetrahydrofolate reductase (MTHFR) gene with high dose methotrexate (HDMTX)-induced toxicity in children with acute lymphoblastic leukemia (ALL). HDMTX-treated children with ALL (1.2 to 14 years old) were selected from inpatients and followed in a retrospective study. The toxicity response of HDMTX chemotherapy was evaluated using WHO common toxicity criteria. Sixty-one patients with therapy-related toxicity and 36 patients without therapy-related toxicity were genotyped for 2 SNPs (677C > T and 1298A > C) of the MTHFR gene by polymerase chain reaction-restriction fragment length polymorphism. Frequency of haplotypes and linkage disequilibrium of the MTHFR gene were analyzed with the SHEsis program. The distribution of the MTHFR gene 677C > T polymorphism did not appear to differ between groups with or without toxicity response (χ² = 4.609, P = 0.100), but the 1298A > C polymorphism was significantly different (χ² = 10.192, P = 0.006). Individuals who carried the C allele (AC + CC genotype) had a decreased risk of toxicity response compared to the AA genotype (OR = 0.245, 95%CI: 0.099 - 0.607, P = 0.002). The 677C > T and 1298A > C polymorphisms showed strong linkage disequilibrium (D' = 0.895). The CC haplotype was significantly associated with decreased risk of toxicity response (OR = 0.338, 95%CI: 0.155 - 0.738, P = 0.005), while the TA haplotype was significantly associated with increased risk of toxicity response (OR = 1.907, 95%CI: 1.045 - 3.482, P = 0.035). The MTHFR gene 1298C allele and CC haplotype might serve as protective factors, while the TA haplotype might serve as a risk factor, for susceptibility to toxicity response of HDMTX chemotherapy in children with ALL.
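
The linkage disequilibrium measure D' quoted in the abstract above can be illustrated with a minimal sketch for two biallelic SNPs. It assumes the four haplotype frequencies are already known; it does not reproduce the SHEsis program, which estimates them from genotype data. The function name `d_prime` and the frequencies are hypothetical and are not the study's values.

```python
def d_prime(f11, f12, f21, f22):
    """Normalized linkage disequilibrium |D|/Dmax for two biallelic loci.

    f11: frequency of the haplotype carrying allele 1 at both loci
    f12: allele 1 at the first locus, allele 2 at the second
    f21: allele 2 at the first locus, allele 1 at the second
    f22: allele 2 at both loci
    """
    if abs(f11 + f12 + f21 + f22 - 1.0) > 1e-9:
        raise ValueError("haplotype frequencies must sum to 1")
    p1 = f11 + f12  # frequency of allele 1 at the first locus
    q1 = f11 + f21  # frequency of allele 1 at the second locus
    d = f11 - p1 * q1  # raw disequilibrium coefficient D
    if d >= 0:
        d_max = min(p1 * (1 - q1), (1 - p1) * q1)
    else:
        d_max = min(p1 * q1, (1 - p1) * (1 - q1))
    return 0.0 if d_max == 0 else abs(d) / d_max


# Hypothetical frequencies for haplotypes C-A, C-C, T-A, T-C (677 site, 1298 site).
value = d_prime(f11=0.30, f12=0.30, f21=0.38, f22=0.02)
print(f"D' = {value:.4f}")
```

Values of D' close to 1 indicate that at least one of the four possible haplotypes is rare or absent relative to what independent assortment of the two sites would predict.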

  14. Y-chromosomal analysis of Greek Cypriots reveals a primarily common pre-Ottoman paternal ancestry with Turkish Cypriots.

    PubMed

    Heraclides, Alexandros; Bashiardes, Evy; Fernández-Domínguez, Eva; Bertoncini, Stefania; Chimonas, Marios; Christofi, Vasilis; King, Jonathan; Budowle, Bruce; Manoli, Panayiotis; Cariolou, Marios A

    2017-01-01

    Genetics can provide invaluable information on the ancestry of the current inhabitants of Cyprus. A Y-chromosome analysis was performed to (i) determine paternal ancestry among the Greek Cypriot (GCy) community in the context of the Central and Eastern Mediterranean and the Near East; and (ii) identify genetic similarities and differences between Greek Cypriots (GCy) and Turkish Cypriots (TCy). Our haplotype-based analysis has revealed that GCy and TCy patrilineages derive primarily from a single gene pool and show very close genetic affinity (low genetic differentiation) to Calabrian Italian and Lebanese patrilineages. In terms of more recent (past millennium) ancestry, as indicated by Y-haplotype sharing, GCy and TCy share many more haplotypes with each other than with any surrounding population (7-8% of total haplotypes shared), while TCy also share around 3% of haplotypes with mainland Turks, and to a lesser extent with North Africans. In terms of Y-haplogroup frequencies, again GCy and TCy show very similar distributions, with the predominant haplogroups in both being J2a-M410, E-M78, and G2-P287. Overall, GCy also have a similar Y-haplogroup distribution to non-Turkic Anatolian and Southwest Caucasian populations, as well as Cretan Greeks. TCy show a slight shift towards Turkish populations, due to the presence of Eastern Eurasian (some of which are of possible Ottoman origin) Y-haplogroups. Overall, the Y-chromosome analysis performed, using both Y-STR haplotype and binary Y-haplogroup data, puts Cypriots in the middle of a genetic continuum stretching from the Levant to Southeast Europe and reveals that despite some differences in haplotype sharing and haplogroup structure, Greek Cypriots and Turkish Cypriots share primarily a common pre-Ottoman paternal ancestry.

  15. Variants and Haplotypes in Angiotensinogen Gene Are Associated With Plasmatic Angiotensinogen Level in Mexican Population

    PubMed Central

    Balam-Ortiz, Eros; Esquivel-Villarreal, Adolfo; Alfaro-Ruiz, Luis; Carrillo, Karol; Elizalde, Adela; Gil, Trinidad; Urushihara, Maki; Kobori, Hiroyuki; Jimenez-Sanchez, Gerardo

    2011-01-01

    Introduction: The plasmatic angiotensinogen (AGT) level has been associated with essential hypertension. Linkage analysis has found a relationship between the AGT gene locus and hypertension in the Mexican-American population, but studies have failed to identify genetic variants associated with hypertension or plasma AGT levels. This study analyzes the relationship between polymorphisms in the AGT gene and plasmatic AGT levels in a Mexican population. Methods: Nine polymorphisms in the AGT gene were genotyped, and plasma AGT level was determined by enzyme-linked immunosorbent assay. Results: Differences in AGT plasma levels were associated with 2 polymorphisms: T-20G, TT = 25.3 ± 8.3 versus TG + GG = 21.6 ± 8.8 μg/mL; P = 0.008 and C3389T (T174M), CC = 25.8 ± 9.9 versus TC + TT = 20.5 ± 5.4 μg/mL; P = 0.0002. Haplotype 2 was associated with low plasma AGT (−5.1 μg/mL [95% confidence interval: −8.6 to −1.6], P = 0.004) and Haplotype 8 was associated with high plasma AGT (6.5 μg/mL [95% confidence interval: 2.5 to 10.6], P = 0.001). This association remained after adjustment for covariates. A Likelihood Ratio Test for haplotype-phenotype association adjusted for covariates resulted in χ² = 38.9, P = 0.0005. The total effect of the haplotypes on plasma AGT level variance was 19.5%. No association was identified between haplotypes and quantitative traits of blood pressure. Conclusions: Two polymorphisms (T-20G and C3389T) and 2 haplotypes (H2 and H8) showed an association with plasma AGT levels in a Mexican population. PMID:21629041

  16. Global selection on sucrose synthase haplotypes during a century of wheat breeding.

    PubMed

    Hou, Jian; Jiang, Qiyan; Hao, Chenyang; Wang, Yuquan; Zhang, Hongna; Zhang, Xueyong

    2014-04-01

    Spike number per unit area, number of grains per spike, and thousand kernel weight (TKW) are important yield components. In China, increases in wheat (Triticum aestivum) yields are mainly due to increases in grain number per spike and TKW. TKW mainly depends on starch content, as starch accounts for about 70% of the grain endosperm. Sucrose synthase catalysis is the first step in the conversion of sucrose to starch, that is, the conversion of sucrose to fructose and UDP-glucose by the wheat sucrose synthase genes (TaSus1 and TaSus2) that are located on chromosomes 7A/7B/7D and 2A/2B/2D, respectively. A total of 1,520 wheat accessions were genotyped at the six loci. Two, two, five, and two haplotypes were identified at the TaSus2-2A, TaSus2-2B, TaSus1-7A, and TaSus1-7B loci, respectively. Their main variations were detected within the introns. Significant differences between the haplotypes correlated with TKW differences among 348 modern Chinese cultivars from the core collection. Frequency changes for favored haplotypes showed gradual increases in cultivars released since the beginning of the last century in China, Europe, and North America. Geographic distributions and time changes of favored haplotypes were characterized in six major wheat production regions worldwide. Strong selection bottlenecks to haplotype variations occurred at polyploidization and domestication and during breeding of wheat. Genetic-effect differences between haplotypes at the same locus influence the selection time and intensity. This work shows that the endosperm starch synthesis pathway is a major target of indirect selection in global wheat breeding for higher yield.

  17. Detecting disease-predisposing variants: the haplotype method.

    PubMed Central

    Valdes, A M; Thomson, G

    1997-01-01

    For many HLA-associated diseases, multiple alleles (and, in some cases, multiple loci) have been suggested as the causative agents. The haplotype method for identifying disease-predisposing amino acids in a genetic region is a stratification analysis. We show that, for each haplotype combination containing all the amino acid sites involved in the disease process, the relative frequencies of amino acid variants at sites not involved in disease but in linkage disequilibrium with the disease-predisposing sites are expected to be the same in patients and controls. The haplotype method is robust to mode of inheritance and penetrance of the disease and can be used to determine unequivocally whether all amino acid sites involved in the disease have not been identified. Using a resampling technique, we developed a statistical test that takes account of the nonindependence of the sites sampled. Further, when multiple sites in the genetic region are involved in disease, the test statistic gives a closer fit to the null expectation when some (compared with none) of the true predisposing factors are included in the haplotype analysis. Although the haplotype method cannot distinguish between very highly correlated sites in one population, ethnic comparisons may help identify the true predisposing factors. The haplotype method was applied to insulin-dependent diabetes mellitus (IDDM) HLA class II DQA1-DQB1 data from Caucasian, African, and Japanese populations. Our results indicate that the combination DQA1#52 (Arg predisposing) DQB1#57 (Asp protective), which has been proposed as an important IDDM agent, does not include all the predisposing elements. With rheumatoid arthritis HLA class II DRB1 data, the results were consistent with the shared-epitope hypothesis. PMID:9042931

  18. Toll-like receptor 4 polymorphisms and their haplotypes modulate the risk of developing diabetic retinopathy in type 2 diabetes patients

    PubMed Central

    Singh, Kanhaiya; Kant, Shri; Singh, Vivek Kumar; Agrawal, Neeraj K.; Gupta, Sanjeev K.

    2014-01-01

    Purpose: Persistent inflammation and impaired neovascularization in type 2 diabetes mellitus (T2DM) patients may lead to development of macro- and microvascular complications. Diabetic retinopathy (DR) is one of the secondary microvascular complications of T2DM. Improper activation of the innate immune system may be an important contributor in the pathophysiology of DR. Toll-like receptor 4 (TLR4) is an important mediator of innate immunity, and genetic alterations in TLR4 support inflammation in the hyperglycemic condition. The present work was designed to investigate whether the TLR4 single nucleotide polymorphisms (SNPs) rs4986790, rs4986791, rs10759931, rs1927911, and rs1927914 are associated with DR in a north Indian population. Methods: The study group of 698 individuals (128 DR, 250 T2DM, 320 controls) was genotyped by PCR-RFLP. Haplotypes and linkage disequilibrium between SNPs were determined using Haploview software. Results: Combined risk genotypes of TLR4 SNPs rs10759931 (odds ratio [OR] 1.50, p = 0.05) and rs1927914 (OR 1.48, p = 0.05) were found to be significantly associated with pathogenesis of DR. A total of 14 haplotypes with frequency >1% were obtained using Haploview software. Haplotypes ACATC (37.5%) and ACATT (14.8%) were the two most common haplotypes obtained. Conclusions: Results of the present case-control study that included 698 north Indian subjects suggested that TLR4 SNPs rs10759931 and rs1927914 modulate the risk of DR in T2DM cases. Association analysis using haplotypes showed none of the haplotypes were associated with either susceptibility or resistance to DR in a north Indian population. PMID:24883015

  19. Novel full-length major histocompatibility complex class I allele discovery and haplotype definition in pig-tailed macaques.

    PubMed

    Semler, Matthew R; Wiseman, Roger W; Karl, Julie A; Graham, Michael E; Gieger, Samantha M; O'Connor, David H

    2018-06-01

    Pig-tailed macaques (Macaca nemestrina, Mane) are important models for human immunodeficiency virus (HIV) studies. Their infectability with minimally modified HIV makes them a uniquely valuable animal model to mimic human infection with HIV and progression to acquired immunodeficiency syndrome (AIDS). However, variation in the pig-tailed macaque major histocompatibility complex (MHC) and the impact of individual transcripts on the pathogenesis of HIV and other infectious diseases is understudied compared to that of rhesus and cynomolgus macaques. In this study, we used Pacific Biosciences single-molecule real-time circular consensus sequencing to describe full-length MHC class I (MHC-I) transcripts for 194 pig-tailed macaques from three breeding centers. We then used the full-length sequences to infer Mane-A and Mane-B haplotypes containing groups of MHC-I transcripts that co-segregate due to physical linkage. In total, we characterized full-length open reading frames (ORFs) for 313 Mane-A, Mane-B, and Mane-I sequences that defined 86 Mane-A and 106 Mane-B MHC-I haplotypes. Pacific Biosciences technology allows us to resolve these Mane-A and Mane-B haplotypes to the level of synonymous allelic variants. The newly defined haplotypes and transcript sequences containing full-length ORFs provide an important resource for infectious disease researchers as certain MHC haplotypes have been shown to provide exceptional control of simian immunodeficiency virus (SIV) replication and prevention of AIDS-like disease in nonhuman primates. The increased allelic resolution provided by Pacific Biosciences sequencing also benefits transplant research by allowing researchers to more specifically match haplotypes between donors and recipients to the level of nonsynonymous allelic variation, thus reducing the risk of graft-versus-host disease.

  20. An unusual haplotype structure on human chromosome 8p23 derived from the inversion polymorphism.

    PubMed

    Deng, Libin; Zhang, Yuezheng; Kang, Jian; Liu, Tao; Zhao, Hongbin; Gao, Yang; Li, Chaohua; Pan, Hao; Tang, Xiaoli; Wang, Dunmei; Niu, Tianhua; Yang, Huanming; Zeng, Changqing

    2008-10-01

    Chromosomal inversion is an important type of genomic variation involved in both evolution and disease pathogenesis. Here, we describe the refined genetic structure of a 3.8-Mb inversion polymorphism at chromosome 8p23. Using HapMap data of 1,073 SNPs generated from 209 unrelated samples from CEPH-Utah residents with ancestry from northern and western Europe (CEU); Yoruba in Ibadan, Nigeria (YRI); and Asian (ASN) samples, which comprised Han Chinese from Beijing, China (CHB) and Japanese from Tokyo, Japan (JPT), we successfully deduced the inversion orientations of all 418 of their haplotypes. In particular, distinct haplotype subgroups were identified based on principal component analysis (PCA). Such genetic substructures were consistent with clustering patterns based on neighbor-joining tree reconstruction, which revealed a total of four haplotype clades across all samples. Metaphase fluorescence in situ hybridization (FISH) in a subset of 10 HapMap samples verified their inversion orientations predicted by PCA or phylogenetic tree reconstruction. Positioning of the outgroup haplotype within one of the YRI clades suggested that Human NCBI Build 36-inverted order is most likely the ancestral orientation. Furthermore, the population differentiation test and the relative extended haplotype homozygosity (REHH) analysis in this region discovered multiple selection signals, also in a population-specific manner. A positive selection signal was detected at XKR6 in the ASN population. These results revealed the correlation of inversion polymorphisms with population-specific genetic structures, and various selection patterns as possible mechanisms for the maintenance of a large chromosomal rearrangement at the 8p23 region during evolution. In addition, our study also showed that haplotype-based clustering methods, such as PCA, can be applied in scanning for cryptic inversion polymorphisms at a genome-wide scale.
