haplotype reference database: Topics by Science.gov

Sample records for haplotype reference database

Estimating trace-suspect match probabilities for singleton Y-STR haplotypes using coalescent theory.

PubMed

Andersen, Mikkel Meyer; Caliebe, Amke; Jochens, Arne; Willuweit, Sascha; Krawczak, Michael

2013-02-01

Estimation of match probabilities for singleton haplotypes of lineage markers, i.e. for haplotypes observed only once in a reference database augmented by a suspect profile, is an important problem in forensic genetics. We compared the performance of four estimators of singleton match probabilities for Y-STRs, namely the count estimate, both with and without Brenner's so-called 'kappa correction', the surveying estimate, and a previously proposed, but rarely used, coalescent-based approach implemented in the BATWING software. Extensive simulation with BATWING of the underlying population history, haplotype evolution and subsequent database sampling revealed that the coalescent-based approach is characterized by lower bias and lower mean squared error than the uncorrected count estimator and the surveying estimator. Moreover, in contrast to the two count estimators, both the surveying and the coalescent-based approach exhibited a good correlation between the estimated and true match probabilities. However, although its overall performance is thus better than that of any other recognized method, the coalescent-based estimator is still computation-intense on the verge of general impracticability. Its application in forensic practice therefore will have to be limited to small reference databases, or to isolated cases of particular interest, until more powerful algorithms for coalescent simulation have become available. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
Population data for 15 Y-chromosome STRs in a population sample from Quito (Ecuador).

PubMed

Baeza, Carlos; Guzmán, Rodrigo; Tirado, Miriam; López-Parra, Ana María; Rodríguez, Tatiana; Mesa, María Soledad; Fernández, Eva; Arroyo-Pardo, Eduardo

2007-12-20

Population frequencies for the 9 Y-STR loci included in the "minimal haplotype" from Y-STR Haplotype Reference Database (YHRD), plus other 6 Y-STRs (DYS437, DYS438, DYS439, GATA A7.2, GATA H4 and GATA A10) were obtained for a sample of 120 males from Quito (Ecuador). One hundred and sixteen unique haplotypes were identified within the sample. Haplotype diversity (0.9994) was among the highest in comparison to other populations from Iberia and South-America. Genetic distances were calculated and our sample presented significative differences with all other samples, the lowest values being with a Guinean sample.
Variation analysis and gene annotation of eight MHC haplotypes: The MHC Haplotype Project

PubMed Central

Horton, Roger; Gibson, Richard; Coggill, Penny; Miretti, Marcos; Allcock, Richard J.; Almeida, Jeff; Forbes, Simon; Gilbert, James G. R.; Halls, Karen; Harrow, Jennifer L.; Hart, Elizabeth; Howe, Kevin; Jackson, David K.; Palmer, Sophie; Roberts, Anne N.; Sims, Sarah; Stewart, C. Andrew; Traherne, James A.; Trevanion, Steve; Wilming, Laurens; Rogers, Jane; de Jong, Pieter J.; Elliott, John F.; Sawcer, Stephen; Todd, John A.; Trowsdale, John

2008-01-01

The human major histocompatibility complex (MHC) is contained within about 4 Mb on the short arm of chromosome 6 and is recognised as the most variable region in the human genome. The primary aim of the MHC Haplotype Project was to provide a comprehensively annotated reference sequence of a single, human leukocyte antigen-homozygous MHC haplotype and to use it as a basis against which variations could be assessed from seven other similarly homozygous cell lines, representative of the most common MHC haplotypes in the European population. Comparison of the haplotype sequences, including four haplotypes not previously analysed, resulted in the identification of >44,000 variations, both substitutions and indels (insertions and deletions), which have been submitted to the dbSNP database. The gene annotation uncovered haplotype-specific differences and confirmed the presence of more than 300 loci, including over 160 protein-coding genes. Combined analysis of the variation and annotation datasets revealed 122 gene loci with coding substitutions of which 97 were non-synonymous. The haplotype (A3-B7-DR15; PGF cell line) designated as the new MHC reference sequence, has been incorporated into the human genome assembly (NCBI35 and subsequent builds), and constitutes the largest single-haplotype sequence of the human genome to date. The extensive variation and annotation data derived from the analysis of seven further haplotypes have been made publicly available and provide a framework and resource for future association studies of all MHC-associated diseases and transplant medicine. PMID:18193213
Haplotype data for 23 Y-chromosome markers in a reference sample from Bosnia and Herzegovina.

PubMed

Kovačević, Lejla; Fatur-Cerić, Vera; Hadzic, Negra; Čakar, Jasmina; Primorac, Dragan; Marjanović, Damir

2013-06-01

To detect polymorphisms of 23 Y-chromosomal short tandem repeat (STR) loci, including 6 new loci, in a reference database of male population of Bosnia and Herzegovina, as well as to assess the importance of increasing the number of Y-STR loci utilized in forensic DNA analysis. The reference sample consisted of 100 healthy, unrelated men originating from Bosnia and Herzegovina. Sample collection using buccal swabs was performed in all geographical regions of Bosnia and Herzegovina in the period from 2010 to 2011. DNA samples were typed for 23 Y STR loci, including 6 new loci: DYS576, DYS481, DYS549, DYS533, DYS570, and DYS643, which are included in the new PowerPlex® Y 23 amplification kit. The absolute frequency of generated haplotypes was calculated and results showed that 98 samples had unique Y 23 haplotypes, and that only two samples shared the same haplotype. The most polymorphic locus was DYS418, with 14 detected alleles and the least polymorphic loci were DYS389I, DYS391, DYS437, and DYS393. This study showed that by increasing the number of highly polymorphic Y STR markers, to include those tested in our analysis, leads to a reduction of repeating haplotypes, which is very important in the application of forensic DNA analysis.
Haplotype data for 23 Y-chromosome markers in a reference sample from Bosnia and Herzegovina

PubMed Central

Kovačević, Lejla; Fatur-Cerić, Vera; Hadžić, Negra; Čakar, Jasmina; Primorac, Dragan; Marjanović, Damir

2013-01-01

Aim To detect polymorphisms of 23 Y-chromosomal short tandem repeat (STR) loci, including 6 new loci, in a reference database of male population of Bosnia and Herzegovina, as well as to assess the importance of increasing the number of Y-STR loci utilized in forensic DNA analysis. Methods The reference sample consisted of 100 healthy, unrelated men originating from Bosnia and Herzegovina. Sample collection using buccal swabs was performed in all geographical regions of Bosnia and Herzegovina in the period from 2010 to 2011. DNA samples were typed for 23 Y STR loci, including 6 new loci: DYS576, DYS481, DYS549, DYS533, DYS570, and DYS643, which are included in the new PowerPlex® Y 23 amplification kit. Results The absolute frequency of generated haplotypes was calculated and results showed that 98 samples had unique Y 23 haplotypes, and that only two samples shared the same haplotype. The most polymorphic locus was DYS418, with 14 detected alleles and the least polymorphic loci were DYS389I, DYS391, DYS437, and DYS393. Conclusion This study showed that by increasing the number of highly polymorphic Y STR markers, to include those tested in our analysis, leads to a reduction of repeating haplotypes, which is very important in the application of forensic DNA analysis. PMID:23771760
Canis mtDNA HV1 database: a web-based tool for collecting and surveying Canis mtDNA HV1 haplotype in public database.

PubMed

Thai, Quan Ke; Chung, Dung Anh; Tran, Hoang-Dung

2017-06-26

Canine and wolf mitochondrial DNA haplotypes, which can be used for forensic or phylogenetic analyses, have been defined in various schemes depending on the region analyzed. In recent studies, the 582 bp fragment of the HV1 region is most commonly used. 317 different canine HV1 haplotypes have been reported in the rapidly growing public database GenBank. These reported haplotypes contain several inconsistencies in their haplotype information. To overcome this issue, we have developed a Canis mtDNA HV1 database. This database collects data on the HV1 582 bp region in dog mitochondrial DNA from the GenBank to screen and correct the inconsistencies. It also supports users in detection of new novel mutation profiles and assignment of new haplotypes. The Canis mtDNA HV1 database (CHD) contains 5567 nucleotide entries originating from 15 subspecies in the species Canis lupus. Of these entries, 3646 were haplotypes and grouped into 804 distinct sequences. 319 sequences were recognized as previously assigned haplotypes, while the remaining 485 sequences had new mutation profiles and were marked as new haplotype candidates awaiting further analysis for haplotype assignment. Of the 3646 nucleotide entries, only 414 were annotated with correct haplotype information, while 3232 had insufficient or lacked haplotype information and were corrected or modified before storing in the CHD. The CHD can be accessed at http://chd.vnbiology.com . It provides sequences, haplotype information, and a web-based tool for mtDNA HV1 haplotyping. The CHD is updated monthly and supplies all data for download. The Canis mtDNA HV1 database contains information about canine mitochondrial DNA HV1 sequences with reconciled annotation. It serves as a tool for detection of inconsistencies in GenBank and helps identifying new HV1 haplotypes. Thus, it supports the scientific community in naming new HV1 haplotypes and to reconcile existing annotation of HV1 582 bp sequences.
Interpretation guidelines of a standard Y-chromosome STR 17-plex PCR-CE assay for crime casework.

PubMed

Roewer, Lutz; Geppert, Maria

2012-01-01

Y-STR analysis is an invaluable tool to examine evidence in sexual assault cases and in other forensic casework. Unambiguous detection of the male component in DNA mixtures with a high female background is still the main field of application of forensic Y-STR haplotyping. In the last years, powerful technologies including a 17-locus multiplex PCR assay have been introduced in the forensic laboratories. At the same time, statistical methods have been developed and adapted for interpretation of a nonrecombining, linear marker as the Y-chromosome which shows a strongly clustered geographical distribution due to the linear inheritance and the patrilocality of ancestral groups. Large population databases, namely the Y-STR Haplotype Reference Database (YHRD), have been established to assess the evidentiary value of Y-STR matches by means of frequency estimation methods (counting and extrapolation).
African-American mitochondrial DNAs often match mtDNAs found in multiple African ethnic groups

PubMed Central

Ely, Bert; Wilson, Jamie Lee; Jackson, Fatimah; Jackson, Bruce A

2006-01-01

Background Mitochondrial DNA (mtDNA) haplotypes have become popular tools for tracing maternal ancestry, and several companies offer this service to the general public. Numerous studies have demonstrated that human mtDNA haplotypes can be used with confidence to identify the continent where the haplotype originated. Ideally, mtDNA haplotypes could also be used to identify a particular country or ethnic group from which the maternal ancestor emanated. However, the geographic distribution of mtDNA haplotypes is greatly influenced by the movement of both individuals and population groups. Consequently, common mtDNA haplotypes are shared among multiple ethnic groups. We have studied the distribution of mtDNA haplotypes among West African ethnic groups to determine how often mtDNA haplotypes can be used to reconnect Americans of African descent to a country or ethnic group of a maternal African ancestor. The nucleotide sequence of the mtDNA hypervariable segment I (HVS-I) usually provides sufficient information to assign a particular mtDNA to the proper haplogroup, and it contains most of the variation that is available to distinguish a particular mtDNA haplotype from closely related haplotypes. In this study, samples of general African-American and specific Gullah/Geechee HVS-I haplotypes were compared with two databases of HVS-I haplotypes from sub-Saharan Africa, and the incidence of perfect matches recorded for each sample. Results When two independent African-American samples were analyzed, more than half of the sampled HVS-I mtDNA haplotypes exactly matched common haplotypes that were shared among multiple African ethnic groups. Another 40% did not match any sequence in the database, and fewer than 10% were an exact match to a sequence from a single African ethnic group. Differences in the regional distribution of haplotypes were observed in the African database, and the African-American haplotypes were more likely to match haplotypes found in ethnic groups from West or West Central Africa than those found in eastern or southern Africa. Fewer than 14% of the African-American mtDNA sequences matched sequences from only West Africa or only West Central Africa. Conclusion Our database of sub-Saharan mtDNA sequences includes the most common haplotypes that are shared among ethnic groups from multiple regions of Africa. These common haplotypes have been found in half of all sub-Saharan Africans. More than 60% of the remaining haplotypes differ from the common haplotypes at a single nucleotide position in the HVS-I region, and they are likely to occur at varying frequencies within sub-Saharan Africa. However, the finding that 40% of the African-American mtDNAs analyzed had no match in the database indicates that only a small fraction of the total number of African haplotypes has been identified. In addition, the finding that fewer than 10% of African-American mtDNAs matched mtDNA sequences from a single African region suggests that few African Americans might be able to trace their mtDNA lineages to a particular region of Africa, and even fewer will be able to trace their mtDNA to a single ethnic group. However, no firm conclusions should be made until a much larger database is available. It is clear, however, that when identical mtDNA haplotypes are shared among many ethnic groups from different parts of Africa, it is impossible to determine which single ethnic group was the source of a particular maternal ancestor based on the mtDNA sequence. PMID:17038170
Development of forensic-quality full mtGenome haplotypes: success rates with low template specimens.

PubMed

Just, Rebecca S; Scheible, Melissa K; Fast, Spence A; Sturk-Andreaggi, Kimberly; Higginbotham, Jennifer L; Lyons, Elizabeth A; Bush, Jocelyn M; Peck, Michelle A; Ring, Joseph D; Diegoli, Toni M; Röck, Alexander W; Huber, Gabriela E; Nagl, Simone; Strobl, Christina; Zimmermann, Bettina; Parson, Walther; Irwin, Jodi A

2014-05-01

Forensic mitochondrial DNA (mtDNA) testing requires appropriate, high quality reference population data for estimating the rarity of questioned haplotypes and, in turn, the strength of the mtDNA evidence. Available reference databases (SWGDAM, EMPOP) currently include information from the mtDNA control region; however, novel methods that quickly and easily recover mtDNA coding region data are becoming increasingly available. Though these assays promise to both facilitate the acquisition of mitochondrial genome (mtGenome) data and maximize the general utility of mtDNA testing in forensics, the appropriate reference data and database tools required for their routine application in forensic casework are lacking. To address this deficiency, we have undertaken an effort to: (1) increase the large-scale availability of high-quality entire mtGenome reference population data, and (2) improve the information technology infrastructure required to access/search mtGenome data and employ them in forensic casework. Here, we describe the application of a data generation and analysis workflow to the development of more than 400 complete, forensic-quality mtGenomes from low DNA quantity blood serum specimens as part of a U.S. National Institute of Justice funded reference population databasing initiative. We discuss the minor modifications made to a published mtGenome Sanger sequencing protocol to maintain a high rate of throughput while minimizing manual reprocessing with these low template samples. The successful use of this semi-automated strategy on forensic-like samples provides practical insight into the feasibility of producing complete mtGenome data in a routine casework environment, and demonstrates that large (>2kb) mtDNA fragments can regularly be recovered from high quality but very low DNA quantity specimens. Further, the detailed empirical data we provide on the amplification success rates across a range of DNA input quantities will be useful moving forward as PCR-based strategies for mtDNA enrichment are considered for targeted next-generation sequencing workflows. Copyright © 2014 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.
Toward Male Individualization with Rapidly Mutating Y-Chromosomal Short Tandem Repeats

PubMed Central

Ballantyne, Kaye N; Ralf, Arwin; Aboukhalid, Rachid; Achakzai, Niaz M; Anjos, Maria J; Ayub, Qasim; Balažic, Jože; Ballantyne, Jack; Ballard, David J; Berger, Burkhard; Bobillo, Cecilia; Bouabdellah, Mehdi; Burri, Helen; Capal, Tomas; Caratti, Stefano; Cárdenas, Jorge; Cartault, François; Carvalho, Elizeu F; Carvalho, Monica; Cheng, Baowen; Coble, Michael D; Comas, David; Corach, Daniel; D'Amato, Maria E; Davison, Sean; de Knijff, Peter; De Ungria, Maria Corazon A; Decorte, Ronny; Dobosz, Tadeusz; Dupuy, Berit M; Elmrghni, Samir; Gliwiński, Mateusz; Gomes, Sara C; Grol, Laurens; Haas, Cordula; Hanson, Erin; Henke, Jürgen; Henke, Lotte; Herrera-Rodríguez, Fabiola; Hill, Carolyn R; Holmlund, Gunilla; Honda, Katsuya; Immel, Uta-Dorothee; Inokuchi, Shota; Jobling, Mark A; Kaddura, Mahmoud; Kim, Jong S; Kim, Soon H; Kim, Wook; King, Turi E; Klausriegler, Eva; Kling, Daniel; Kovačević, Lejla; Kovatsi, Leda; Krajewski, Paweł; Kravchenko, Sergey; Larmuseau, Maarten H D; Lee, Eun Young; Lessig, Ruediger; Livshits, Ludmila A; Marjanović, Damir; Minarik, Marek; Mizuno, Natsuko; Moreira, Helena; Morling, Niels; Mukherjee, Meeta; Munier, Patrick; Nagaraju, Javaregowda; Neuhuber, Franz; Nie, Shengjie; Nilasitsataporn, Premlaphat; Nishi, Takeki; Oh, Hye H; Olofsson, Jill; Onofri, Valerio; Palo, Jukka U; Pamjav, Horolma; Parson, Walther; Petlach, Michal; Phillips, Christopher; Ploski, Rafal; Prasad, Samayamantri P R; Primorac, Dragan; Purnomo, Gludhug A; Purps, Josephine; Rangel-Villalobos, Hector; Rębała, Krzysztof; Rerkamnuaychoke, Budsaba; Gonzalez, Danel Rey; Robino, Carlo; Roewer, Lutz; Rosa, Alexandra; Sajantila, Antti; Sala, Andrea; Salvador, Jazelyn M; Sanz, Paula; Schmitt, Cornelia; Sharma, Anil K; Silva, Dayse A; Shin, Kyoung-Jin; Sijen, Titia; Sirker, Miriam; Siváková, Daniela; Škaro, Vedrana; Solano-Matamoros, Carlos; Souto, Luis; Stenzl, Vlastimil; Sudoyo, Herawati; Syndercombe-Court, Denise; Tagliabracci, Adriano; Taylor, Duncan; Tillmar, Andreas; Tsybovsky, Iosif S; Tyler-Smith, Chris; van der Gaag, Kristiaan J; Vanek, Daniel; Völgyi, Antónia; Ward, Denise; Willemse, Patricia; Yap, Eric PH; Yong, Rita YY; Pajnič, Irena Zupanič; Kayser, Manfred

2014-01-01

Relevant for various areas of human genetics, Y-chromosomal short tandem repeats (Y-STRs) are commonly used for testing close paternal relationships among individuals and populations, and for male lineage identification. However, even the widely used 17-loci Yfiler set cannot resolve individuals and populations completely. Here, 52 centers generated quality-controlled data of 13 rapidly mutating (RM) Y-STRs in 14,644 related and unrelated males from 111 worldwide populations. Strikingly, >99% of the 12,272 unrelated males were completely individualized. Haplotype diversity was extremely high (global: 0.9999985, regional: 0.99836–0.9999988). Haplotype sharing between populations was almost absent except for six (0.05%) of the 12,156 haplotypes. Haplotype sharing within populations was generally rare (0.8% nonunique haplotypes), significantly lower in urban (0.9%) than rural (2.1%) and highest in endogamous groups (14.3%). Analysis of molecular variance revealed 99.98% of variation within populations, 0.018% among populations within groups, and 0.002% among groups. Of the 2,372 newly and 156 previously typed male relative pairs, 29% were differentiated including 27% of the 2,378 father–son pairs. Relative to Yfiler, haplotype diversity was increased in 86% of the populations tested and overall male relative differentiation was raised by 23.5%. Our study demonstrates the value of RM Y-STRs in identifying and separating unrelated and related males and provides a reference database. PMID:24917567
mtDNA sequence diversity of Hazara ethnic group from Pakistan.

PubMed

Rakha, Allah; Fatima; Peng, Min-Sheng; Adan, Atif; Bi, Rui; Yasmin, Memona; Yao, Yong-Gang

2017-09-01

The present study was undertaken to investigate mitochondrial DNA (mtDNA) control region sequences of Hazaras from Pakistan, so as to generate mtDNA reference database for forensic casework in Pakistan and to analyze phylogenetic relationship of this particular ethnic group with geographically proximal populations. Complete mtDNA control region (nt 16024-576) sequences were generated through Sanger Sequencing for 319 Hazara individuals from Quetta, Baluchistan. The population sample set showed a total of 189 distinct haplotypes, belonging mainly to West Eurasian (51.72%), East & Southeast Asian (29.78%) and South Asian (18.50%) haplogroups. Compared with other populations from Pakistan, the Hazara population had a relatively high haplotype diversity (0.9945) and a lower random match probability (0.0085). The dataset has been incorporated into EMPOP database under accession number EMP00680. The data herein comprises the largest, and likely most thoroughly examined, control region mtDNA dataset from Hazaras of Pakistan. Copyright © 2017 Elsevier B.V. All rights reserved.
The diploid genome sequence of an Asian individual

PubMed Central

Wang, Jun; Wang, Wei; Li, Ruiqiang; Li, Yingrui; Tian, Geng; Goodman, Laurie; Fan, Wei; Zhang, Junqing; Li, Jun; Zhang, Juanbin; Guo, Yiran; Feng, Binxiao; Li, Heng; Lu, Yao; Fang, Xiaodong; Liang, Huiqing; Du, Zhenglin; Li, Dong; Zhao, Yiqing; Hu, Yujie; Yang, Zhenzhen; Zheng, Hancheng; Hellmann, Ines; Inouye, Michael; Pool, John; Yi, Xin; Zhao, Jing; Duan, Jinjie; Zhou, Yan; Qin, Junjie; Ma, Lijia; Li, Guoqing; Yang, Zhentao; Zhang, Guojie; Yang, Bin; Yu, Chang; Liang, Fang; Li, Wenjie; Li, Shaochuan; Li, Dawei; Ni, Peixiang; Ruan, Jue; Li, Qibin; Zhu, Hongmei; Liu, Dongyuan; Lu, Zhike; Li, Ning; Guo, Guangwu; Zhang, Jianguo; Ye, Jia; Fang, Lin; Hao, Qin; Chen, Quan; Liang, Yu; Su, Yeyang; san, A.; Ping, Cuo; Yang, Shuang; Chen, Fang; Li, Li; Zhou, Ke; Zheng, Hongkun; Ren, Yuanyuan; Yang, Ling; Gao, Yang; Yang, Guohua; Li, Zhuo; Feng, Xiaoli; Kristiansen, Karsten; Wong, Gane Ka-Shu; Nielsen, Rasmus; Durbin, Richard; Bolund, Lars; Zhang, Xiuqing; Li, Songgang; Yang, Huanming; Wang, Jian

2009-01-01

Here we present the first diploid genome sequence of an Asian individual. The genome was sequenced to 36-fold average coverage using massively parallel sequencing technology. We aligned the short reads onto the NCBI human reference genome to 99.97% coverage, and guided by the reference genome, we used uniquely mapped reads to assemble a high-quality consensus sequence for 92% of the Asian individual's genome. We identified approximately 3 million single-nucleotide polymorphisms (SNPs) inside this region, of which 13.6% were not in the dbSNP database. Genotyping analysis showed that SNP identification had high accuracy and consistency, indicating the high sequence quality of this assembly. We also carried out heterozygote phasing and haplotype prediction against HapMap CHB and JPT haplotypes (Chinese and Japanese, respectively), sequence comparison with the two available individual genomes (J. D. Watson and J. C. Venter), and structural variation identification. These variations were considered for their potential biological impact. Our sequence data and analyses demonstrate the potential usefulness of next-generation sequencing technologies for personal genomics. PMID:18987735
Fast and accurate genotype imputation in genome-wide association studies through pre-phasing

PubMed Central

Howie, Bryan; Fuchsberger, Christian; Stephens, Matthew; Marchini, Jonathan; Abecasis, Gonçalo R.

2013-01-01

Sequencing efforts, including the 1000 Genomes Project and disease-specific efforts, are producing large collections of haplotypes that can be used for genotype imputation in genome-wide association studies (GWAS). Imputing from these reference panels can help identify new risk alleles, but the use of large panels with existing methods imposes a high computational burden. To keep imputation broadly accessible, we introduce a strategy called “pre-phasing” that maintains the accuracy of leading methods while cutting computational costs by orders of magnitude. In brief, we first statistically estimate the haplotypes for each GWAS individual (“pre-phasing”) and then impute missing genotypes into these estimated haplotypes. This reduces the computational cost because: (i) the GWAS samples must be phased only once, whereas standard methods would implicitly re-phase with each reference panel update; (ii) it is much faster to match a phased GWAS haplotype to one reference haplotype than to match unphased GWAS genotypes to a pair of reference haplotypes. This strategy will be particularly valuable for repeated imputation as reference panels evolve. PMID:22820512
Concept for estimating mitochondrial DNA haplogroups using a maximum likelihood approach (EMMA)☆

PubMed Central

Röck, Alexander W.; Dür, Arne; van Oven, Mannis; Parson, Walther

2013-01-01

The assignment of haplogroups to mitochondrial DNA haplotypes contributes substantial value for quality control, not only in forensic genetics but also in population and medical genetics. The availability of Phylotree, a widely accepted phylogenetic tree of human mitochondrial DNA lineages, led to the development of several (semi-)automated software solutions for haplogrouping. However, currently existing haplogrouping tools only make use of haplogroup-defining mutations, whereas private mutations (beyond the haplogroup level) can be additionally informative allowing for enhanced haplogroup assignment. This is especially relevant in the case of (partial) control region sequences, which are mainly used in forensics. The present study makes three major contributions toward a more reliable, semi-automated estimation of mitochondrial haplogroups. First, a quality-controlled database consisting of 14,990 full mtGenomes downloaded from GenBank was compiled. Together with Phylotree, these mtGenomes serve as a reference database for haplogroup estimates. Second, the concept of fluctuation rates, i.e. a maximum likelihood estimation of the stability of mutations based on 19,171 full control region haplotypes for which raw lane data is available, is presented. Finally, an algorithm for estimating the haplogroup of an mtDNA sequence based on the combined database of full mtGenomes and Phylotree, which also incorporates the empirically determined fluctuation rates, is brought forward. On the basis of examples from the literature and EMPOP, the algorithm is not only validated, but both the strength of this approach and its utility for quality control of mitochondrial haplotypes is also demonstrated. PMID:23948335
Novel strategies to mine alcoholism-related haplotypes and genes by combining existing knowledge framework.

PubMed

Zhang, RuiJie; Li, Xia; Jiang, YongShuai; Liu, GuiYou; Li, ChuanXing; Zhang, Fan; Xiao, Yun; Gong, BinSheng

2009-02-01

High-throughout single nucleotide polymorphism detection technology and the existing knowledge provide strong support for mining the disease-related haplotypes and genes. In this study, first, we apply four kinds of haplotype identification methods (Confidence Intervals, Four Gamete Tests, Solid Spine of LD and fusing method of haplotype block) into high-throughout SNP genotype data to identify blocks, then use cluster analysis to verify the effectiveness of the four methods, and select the alcoholism-related SNP haplotypes through risk analysis. Second, we establish a mapping from haplotypes to alcoholism-related genes. Third, we inquire NCBI SNP and gene databases to locate the blocks and identify the candidate genes. In the end, we make gene function annotation by KEGG, Biocarta, and GO database. We find 159 haplotype blocks, which relate to the alcoholism most possibly on chromosome 1 approximately 22, including 227 haplotypes, of which 102 SNP haplotypes may increase the risk of alcoholism. We get 121 alcoholism-related genes and verify their reliability by the functional annotation of biology. In a word, we not only can handle the SNP data easily, but also can locate the disease-related genes precisely by combining our novel strategies of mining alcoholism-related haplotypes and genes with existing knowledge framework.
Validation of a reaction volume reduction protocol for analysis of Y chromosome haplotypes targeting DNA databases.

PubMed

Souza, C A; Oliveira, T C; Crovella, S; Santos, S M; Rabêlo, K C N; Soriano, E P; Carvalho, M V D; Junior, A F Caldas; Porto, G G; Campello, R I C; Antunes, A A; Queiroz, R A; Souza, S M

2017-04-28

The use of Y chromosome haplotypes, important for the detection of sexual crimes in forensics, has gained prominence with the use of databases that incorporate these genetic profiles in their system. Here, we optimized and validated an amplification protocol for Y chromosome profile retrieval in reference samples using lesser materials than those in commercial kits. FTA ® cards (Flinders Technology Associates) were used to support the oral cells of male individuals, which were amplified directly using the SwabSolution reagent (Promega). First, we optimized and validated the process to define the volume and cycling conditions. Three reference samples and nineteen 1.2 mm-diameter perforated discs were used per sample. Amplification of one or two discs (samples) with the PowerPlex ® Y23 kit (Promega) was performed using 25, 26, and 27 thermal cycles. Twenty percent, 32%, and 100% reagent volumes, one disc, and 26 cycles were used for the control per sample. Thereafter, all samples (N = 270) were amplified using 27 cycles, one disc, and 32% reagents (optimized conditions). Data was analyzed using a study of equilibrium values between fluorophore colors. In the samples analyzed with 20% volume, an imbalance was observed in peak heights, both inside and in-between each dye. In samples amplified with 32% reagents, the values obtained for the intra-color and inter-color standard balance calculations for verification of the quality of the analyzed peaks were similar to those of samples amplified with 100% of the recommended volume. The quality of the profiles obtained with 32% reagents was suitable for insertion into databases.
Whole genome resequencing of a laboratory-adapted Drosophila melanogaster population sample

PubMed Central

Gilks, William P.; Pennell, Tanya M.; Flis, Ilona; Webster, Matthew T.; Morrow, Edward H.

2016-01-01

As part of a study into the molecular genetics of sexually dimorphic complex traits, we used high-throughput sequencing to obtain data on genomic variation in an outbred laboratory-adapted fruit fly ( Drosophila melanogaster) population. We successfully resequenced the whole genome of 220 hemiclonal females that were heterozygous for the same Berkeley reference line genome (BDGP6/dm6), and a unique haplotype from the outbred base population (LH M). The use of a static and known genetic background enabled us to obtain sequences from whole-genome phased haplotypes. We used a BWA-Picard-GATK pipeline for mapping sequence reads to the dm6 reference genome assembly, at a median depth-of coverage of 31X, and have made the resulting data publicly-available in the NCBI Short Read Archive (Accession number SRP058502). We used Haplotype Caller to discover and genotype 1,726,931 small genomic variants (SNPs and indels, <200bp). Additionally we detected and genotyped 167 large structural variants (1-100Kb in size) using GenomeStrip/2.0. Sequence and genotype data are publicly-available at the corresponding NCBI databases: Short Read Archive, dbSNP and dbVar (BioProject PRJNA282591). We have also released the unfiltered genotype data, and the code and logs for data processing and summary statistics ( https://zenodo.org/communities/sussex_drosophila_sequencing/). PMID:27928499
Estimating haplotype frequencies by combining data from large DNA pools with database information.

PubMed

Gasbarra, Dario; Kulathinal, Sangita; Pirinen, Matti; Sillanpää, Mikko J

2011-01-01

We assume that allele frequency data have been extracted from several large DNA pools, each containing genetic material of up to hundreds of sampled individuals. Our goal is to estimate the haplotype frequencies among the sampled individuals by combining the pooled allele frequency data with prior knowledge about the set of possible haplotypes. Such prior information can be obtained, for example, from a database such as HapMap. We present a Bayesian haplotyping method for pooled DNA based on a continuous approximation of the multinomial distribution. The proposed method is applicable when the sizes of the DNA pools and/or the number of considered loci exceed the limits of several earlier methods. In the example analyses, the proposed model clearly outperforms a deterministic greedy algorithm on real data from the HapMap database. With a small number of loci, the performance of the proposed method is similar to that of an EM-algorithm, which uses a multinormal approximation for the pooled allele frequencies, but which does not utilize prior information about the haplotypes. The method has been implemented using Matlab and the code is available upon request from the authors.
Nuclear, chloroplast, and mitochondrial data of a US cannabis DNA database.

PubMed

Houston, Rachel; Birck, Matthew; LaRue, Bobby; Hughes-Stamm, Sheree; Gangitano, David

2018-05-01

As Cannabis sativa (marijuana) is a controlled substance in many parts of the world, the ability to track biogeographical origin of cannabis could provide law enforcement with investigative leads regarding its trade and distribution. Population substructure and inbreeding may cause cannabis plants to become more genetically related. This genetic relatedness can be helpful for intelligence purposes. Analysis of autosomal, chloroplast, and mitochondrial DNA allows for not only prediction of biogeographical origin of a plant but also discrimination between individual plants. A previously validated, 13-autosomal STR multiplex was used to genotype 510 samples. Samples were analyzed from four different sites: 21 seizures at the US-Mexico border, Northeastern Brazil, hemp seeds purchased in the US, and the Araucania area of Chile. In addition, a previously reported multi-loci system was modified and optimized to genotype five chloroplast and two mitochondrial markers. For this purpose, two methods were designed: a homopolymeric STR pentaplex and a SNP triplex with one chloroplast (Cscp001) marker shared by both methods for quality control. For successful mitochondrial and chloroplast typing, a novel real-time PCR quantitation method was developed and validated to accurately estimate the quantity of the chloroplast DNA (cpDNA) using a synthetic DNA standard. Moreover, a sequenced allelic ladder was also designed for accurate genotyping of the homopolymeric STR pentaplex. For autosomal typing, 356 unique profiles were generated from the 425 samples that yielded full STR profiles and 25 identical genotypes within seizures were observed. Phylogenetic analysis and case-to-case pairwise comparisons of 21 seizures at the US-Mexico border, using the Fixation Index (F ST ) as genetic distance, revealed the genetic association of nine seizures that formed a reference population. For mitochondrial and chloroplast typing, subsampling was performed, and 134 samples were genotyped. Complete haplotypes (STRs and SNPs) were observed for 127 samples. As expected, extensive haplotype sharing was observed; five distinguishable haplotypes were detected. In the reference population, the same haplotype was observed 39 times and two unique haplotypes were also detected. Haplotype sharing was observed between the US border seizures, Brazil, and Chile, while the hemp samples generated a distinct haplotype. Phylogenetic analysis of the four populations was performed, and results revealed that both autosomal and lineage markers could discern population substructure.
Imputation of microsatellite alleles from dense SNP genotypes for parentage verification across multiple Bos taurus and Bos indicus breeds

PubMed Central

McClure, Matthew C.; Sonstegard, Tad S.; Wiggans, George R.; Van Eenennaam, Alison L.; Weber, Kristina L.; Penedo, Cecilia T.; Berry, Donagh P.; Flynn, John; Garcia, Jose F.; Carmo, Adriana S.; Regitano, Luciana C. A.; Albuquerque, Milla; Silva, Marcos V. G. B.; Machado, Marco A.; Coffey, Mike; Moore, Kirsty; Boscher, Marie-Yvonne; Genestout, Lucie; Mazza, Raffaele; Taylor, Jeremy F.; Schnabel, Robert D.; Simpson, Barry; Marques, Elisa; McEwan, John C.; Cromie, Andrew; Coutinho, Luiz L.; Kuehn, Larry A.; Keele, John W.; Piper, Emily K.; Cook, Jim; Williams, Robert; Van Tassell, Curtis P.

2013-01-01

To assist cattle producers transition from microsatellite (MS) to single nucleotide polymorphism (SNP) genotyping for parental verification we previously devised an effective and inexpensive method to impute MS alleles from SNP haplotypes. While the reported method was verified with only a limited data set (N = 479) from Brown Swiss, Guernsey, Holstein, and Jersey cattle, some of the MS-SNP haplotype associations were concordant across these phylogenetically diverse breeds. This implied that some haplotypes predate modern breed formation and remain in strong linkage disequilibrium. To expand the utility of MS allele imputation across breeds, MS and SNP data from more than 8000 animals representing 39 breeds (Bos taurus and B. indicus) were used to predict 9410 SNP haplotypes, incorporating an average of 73 SNPs per haplotype, for which alleles from 12 MS markers could be accurately be imputed. Approximately 25% of the MS-SNP haplotypes were present in multiple breeds (N = 2 to 36 breeds). These shared haplotypes allowed for MS imputation in breeds that were not represented in the reference population with only a small increase in Mendelian inheritance inconsistancies. Our reported reference haplotypes can be used for any cattle breed and the reported methods can be applied to any species to aid the transition from MS to SNP genetic markers. While ~91% of the animals with imputed alleles for 12 MS markers had ≤1 Mendelian inheritance conflicts with their parents' reported MS genotypes, this figure was 96% for our reference animals, indicating potential errors in the reported MS genotypes. The workflow we suggest autocorrects for genotyping errors and rare haplotypes, by MS genotyping animals whose imputed MS alleles fail parentage verification, and then incorporating those animals into the reference dataset. PMID:24065982

EMPOP-quality mtDNA control region sequences from Kashmiri of Azad Jammu & Kashmir, Pakistan.

PubMed

Rakha, Allah; Peng, Min-Sheng; Bi, Rui; Song, Jiao-Jiao; Salahudin, Zeenat; Adan, Atif; Israr, Muhammad; Yao, Yong-Gang

2016-11-01

The mitochondrial DNA (mtDNA) control region (nucleotide position 16024-576) sequences were generated through Sanger sequencing method for 317 self-identified Kashmiris from all districts of Azad Jammu & Kashmir Pakistan. The population sample set showed a total of 251 haplotypes, with a relatively high haplotype diversity (0.9977) and a low random match probability (0.54%). The containing matrilineal lineages belonging to three different phylogeographic origins of Western Eurasian (48.9%), South Asian (47.0%) and East Asian (4.1%). The present study was compared to previous data from Pakistan and other worldwide populations (Central Asia, Western Asia, and East & Southeast Asia). The dataset is made available through EMPOP under accession number EMP00679 and will serve as an mtDNA reference database in forensic casework in Pakistan. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
DNA Commission of the International Society for Forensic Genetics: revised and extended guidelines for mitochondrial DNA typing.

PubMed

Parson, W; Gusmão, L; Hares, D R; Irwin, J A; Mayr, W R; Morling, N; Pokorak, E; Prinz, M; Salas, A; Schneider, P M; Parsons, T J

2014-11-01

The DNA Commission of the International Society of Forensic Genetics (ISFG) regularly publishes guidelines and recommendations concerning the application of DNA polymorphisms to the question of human identification. Previous recommendations published in 2000 addressed the analysis and interpretation of mitochondrial DNA (mtDNA) in forensic casework. While the foundations set forth in the earlier recommendations still apply, new approaches to the quality control, alignment and nomenclature of mitochondrial sequences, as well as the establishment of mtDNA reference population databases, have been developed. Here, we describe these developments and discuss their application to both mtDNA casework and mtDNA reference population databasing applications. While the generation of mtDNA for forensic casework has always been guided by specific standards, it is now well-established that data of the same quality are required for the mtDNA reference population data used to assess the statistical weight of the evidence. As a result, we introduce guidelines regarding sequence generation, as well as quality control measures based on the known worldwide mtDNA phylogeny, that can be applied to ensure the highest quality population data possible. For both casework and reference population databasing applications, the alignment and nomenclature of haplotypes is revised here and the phylogenetic alignment proffered as acceptable standard. In addition, the interpretation of heteroplasmy in the forensic context is updated, and the utility of alignment-free database searches for unbiased probability estimates is highlighted. Finally, we discuss statistical issues and define minimal standards for mtDNA database searches. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Sequences of 95 human MHC haplotypes reveal extreme coding variation in genes other than highly polymorphic HLA class I and II

PubMed Central

Norman, Paul J.; Norberg, Steven J.; Guethlein, Lisbeth A.; Nemat-Gorgani, Neda; Royce, Thomas; Wroblewski, Emily E.; Dunn, Tamsen; Mann, Tobias; Alicata, Claudia; Hollenbach, Jill A.; Chang, Weihua; Shults Won, Melissa; Gunderson, Kevin L.; Abi-Rached, Laurent; Ronaghi, Mostafa; Parham, Peter

2017-01-01

The most polymorphic part of the human genome, the MHC, encodes over 160 proteins of diverse function. Half of them, including the HLA class I and II genes, are directly involved in immune responses. Consequently, the MHC region strongly associates with numerous diseases and clinical therapies. Notoriously, the MHC region has been intractable to high-throughput analysis at complete sequence resolution, and current reference haplotypes are inadequate for large-scale studies. To address these challenges, we developed a method that specifically captures and sequences the 4.8-Mbp MHC region from genomic DNA. For 95 MHC homozygous cell lines we assembled, de novo, a set of high-fidelity contigs and a sequence scaffold, representing a mean 98% of the target region. Included are six alternative MHC reference sequences of the human genome that we completed and refined. Characterization of the sequence and structural diversity of the MHC region shows the approach accurately determines the sequences of the highly polymorphic HLA class I and HLA class II genes and the complex structural diversity of complement factor C4A/C4B. It has also uncovered extensive and unexpected diversity in other MHC genes; an example is MUC22, which encodes a lung mucin and exhibits more coding sequence alleles than any HLA class I or II gene studied here. More than 60% of the coding sequence alleles analyzed were previously uncharacterized. We have created a substantial database of robust reference MHC haplotype sequences that will enable future population scale studies of this complicated and clinically important region of the human genome. PMID:28360230
Genetic diversity and host specificity varies across three genera of blood parasites in ducks of the Pacific Americas Flyway

USGS Publications Warehouse

Reeves, Andrew B.; Smith, Matthew M.; Meixell, Brandt W.; Fleskes, Joseph P.; Ramey, Andrew M.

2015-01-01

Birds of the order Anseriformes, commonly referred to as waterfowl, are frequently infected by Haemosporidia of the genera Haemoproteus, Plasmodium, and Leucocytozoon via dipteran vectors. We analyzed nucleotide sequences of the Cytochrome b (Cytb) gene from parasites of these genera detected in six species of ducks from Alaska and California, USA to characterize the genetic diversity of Haemosporidia infecting waterfowl at two ends of the Pacific Americas Flyway. In addition, parasite Cytb sequences were compared to those available on a public database to investigate specificity of genetic lineages to hosts of the order Anseriformes. Haplotype and nucleotide diversity of Haemoproteus Cytb sequences was lower than was detected for Plasmodium and Leucocytozoon parasites. Although waterfowl are presumed to be infected by only a single species of Leucocytozoon, L. simondi, diversity indices were highest for haplotypes from this genus and sequences formed five distinct clades separated by genetic distances of 4.9%–7.6%, suggesting potential cryptic speciation. All Haemoproteus andLeucocytozoon haplotypes derived from waterfowl samples formed monophyletic clades in phylogenetic analyses and were unique to the order Anseriformes with few exceptions. In contrast, waterfowl-origin Plasmodium haplotypes were identical or closely related to lineages found in other avian orders. Our results suggest a more generalist strategy for Plasmodiumparasites infecting North American waterfowl as compared to those of the generaHaemoproteus and Leucocytozoon.
Genetic Diversity and Host Specificity Varies across Three Genera of Blood Parasites in Ducks of the Pacific Americas Flyway

PubMed Central

Reeves, Andrew B.; Smith, Mathew M.; Meixell, Brandt W.; Fleskes, Joseph P; Ramey, Andrew M.

2015-01-01

Birds of the order Anseriformes, commonly referred to as waterfowl, are frequently infected by Haemosporidia of the genera Haemoproteus, Plasmodium, and Leucocytozoon via dipteran vectors. We analyzed nucleotide sequences of the Cytochrome b (Cytb) gene from parasites of these genera detected in six species of ducks from Alaska and California, USA to characterize the genetic diversity of Haemosporidia infecting waterfowl at two ends of the Pacific Americas Flyway. In addition, parasite Cytb sequences were compared to those available on a public database to investigate specificity of genetic lineages to hosts of the order Anseriformes. Haplotype and nucleotide diversity of Haemoproteus Cytb sequences was lower than was detected for Plasmodium and Leucocytozoon parasites. Although waterfowl are presumed to be infected by only a single species of Leucocytozoon, L. simondi, diversity indices were highest for haplotypes from this genus and sequences formed five distinct clades separated by genetic distances of 4.9%–7.6%, suggesting potential cryptic speciation. All Haemoproteus and Leucocytozoon haplotypes derived from waterfowl samples formed monophyletic clades in phylogenetic analyses and were unique to the order Anseriformes with few exceptions. In contrast, waterfowl-origin Plasmodium haplotypes were identical or closely related to lineages found in other avian orders. Our results suggest a more generalist strategy for Plasmodium parasites infecting North American waterfowl as compared to those of the genera Haemoproteus and Leucocytozoon. PMID:25710468
Genetic portrait of Tamil non-tribal and Irula tribal population using Y chromosome STR markers.

PubMed

Raghunath, Rajshree; Krishnamoorthy, Kamalakshi; Balasubramanian, Lakshmi; Kunka Mohanram, Ramkumar

2016-03-01

The 17 Y chromosomal short tandem repeat loci included in the AmpFlSTR® Yfiler™ PCR Amplification Kit were used to analyse the genetic diversity of 517 unrelated males representing the non-tribal and Irula tribal population of Tamil Nadu. A total of 392 unique haplotypes were identified among the 400 non-tribal samples whereas 111 were observed among the 117 Irula tribal samples. Rare alleles for the loci DYS458, DYS635 and YGATAH4.1 were also observed in both population. The haplotype diversity for the non-tribal and Irula tribal population were found to be 0.9999, and the gene diversity ranged from 0.2041 (DYS391) to 0.9612 (DYS385). Comparison of the test population with 26 national and global population using principal coordinate analysis (PCoA) and determination of the genetic distance matrix using phylogenetic molecular analysis indicate a clustering of the Tamil Nadu non-tribal and Irula tribal population away from other unrelated population and proximity towards some Indo-European (IE) and Asian population. Data are available in the Y chromosome haplotype reference database (YHRD) under accession number YA004055 for Tamil non-tribal and YA004056 for the Irula tribal group.
Integrating sequence and array data to create an improved 1000 Genomes Project haplotype reference panel.

PubMed

Delaneau, Olivier; Marchini, Jonathan

2014-06-13

A major use of the 1000 Genomes Project (1000 GP) data is genotype imputation in genome-wide association studies (GWAS). Here we develop a method to estimate haplotypes from low-coverage sequencing data that can take advantage of single-nucleotide polymorphism (SNP) microarray genotypes on the same samples. First the SNP array data are phased to build a backbone (or 'scaffold') of haplotypes across each chromosome. We then phase the sequence data 'onto' this haplotype scaffold. This approach can take advantage of relatedness between sequenced and non-sequenced samples to improve accuracy. We use this method to create a new 1000 GP haplotype reference set for use by the human genetic community. Using a set of validation genotypes at SNP and bi-allelic indels we show that these haplotypes have lower genotype discordance and improved imputation performance into downstream GWAS samples, especially at low-frequency variants.
A Unified Approach to Genotype Imputation and Haplotype-Phase Inference for Large Data Sets of Trios and Unrelated Individuals

PubMed Central

Browning, Brian L.; Browning, Sharon R.

2009-01-01

We present methods for imputing data for ungenotyped markers and for inferring haplotype phase in large data sets of unrelated individuals and parent-offspring trios. Our methods make use of known haplotype phase when it is available, and our methods are computationally efficient so that the full information in large reference panels with thousands of individuals is utilized. We demonstrate that substantial gains in imputation accuracy accrue with increasingly large reference panel sizes, particularly when imputing low-frequency variants, and that unphased reference panels can provide highly accurate genotype imputation. We place our methodology in a unified framework that enables the simultaneous use of unphased and phased data from trios and unrelated individuals in a single analysis. For unrelated individuals, our imputation methods produce well-calibrated posterior genotype probabilities and highly accurate allele-frequency estimates. For trios, our haplotype-inference method is four orders of magnitude faster than the gold-standard PHASE program and has excellent accuracy. Our methods enable genotype imputation to be performed with unphased trio or unrelated reference panels, thus accounting for haplotype-phase uncertainty in the reference panel. We present a useful measure of imputation accuracy, allelic R2, and show that this measure can be estimated accurately from posterior genotype probabilities. Our methods are implemented in version 3.0 of the BEAGLE software package. PMID:19200528
Using population mixtures to optimize the utility of genomic databases: linkage disequilibrium and association study design in India.

PubMed

Pemberton, T J; Jakobsson, M; Conrad, D F; Coop, G; Wall, J D; Pritchard, J K; Patel, P I; Rosenberg, N A

2008-07-01

When performing association studies in populations that have not been the focus of large-scale investigations of haplotype variation, it is often helpful to rely on genomic databases in other populations for study design and analysis - such as in the selection of tag SNPs and in the imputation of missing genotypes. One way of improving the use of these databases is to rely on a mixture of database samples that is similar to the population of interest, rather than using the single most similar database sample. We demonstrate the effectiveness of the mixture approach in the application of African, European, and East Asian HapMap samples for tag SNP selection in populations from India, a genetically intermediate region underrepresented in genomic studies of haplotype variation.
Investigation of extended Y chromosome STR haplotypes in Sardinia.

PubMed

Lacerenza, D; Aneli, S; Di Gaetano, C; Critelli, R; Piazza, A; Matullo, G; Culigioni, C; Robledo, R; Robino, C; Calò, C

2017-03-01

Y-chromosomal variation of selected single nucleotide polymorphisms (SNPs) and 32 short tandem repeat (STR) loci was evaluated in Sardinia in three open population groups (Northern Sardinia, n=40; Central Sardinia, n=56; Southern Sardinia, n=91) and three isolates (Desulo, n=34; Benetutti, n=45, Carloforte, n=42). The tested Y-STRs consisted of Yfiler ® Plus markers and the seven rapidly mutating (RM) loci not included in the YFiler ® Plus kit (DYF399S1, DYF403S1ab, DYF404S1, DYS526ab, DYS547, DYS612, and DYS626). As expected, inclusion of additional Y-STR loci increased haplotype diversity (h), though complete differentiation of male lineages was impossible even by means of RM Y-STRs (h=0.99997). Analysis of molecular variance indicated that the three open populations were fairly homogeneous, whereas signs of genetic heterogeneity could be detected when the three isolates were also included in the analysis. Multidimensional scaling analysis showed that, even for extended haplotypes including RM Y-STR markers, Sardinians were clearly differentiated from populations of the Italian peninsula and Sicily. The only exception was represented by the Carloforte sample that, in accordance with its peculiar population history, clustered with Northern/Central Italian populations. The introduction of extended forensic Y-STR panels, including highly variable RM Y-STR markers, is expected to reduce the impact of population structure on haplotype frequency estimations. However, our results show that the availability of geographically detailed reference databases is still important for the assessment of the evidential value of a Y-haplotype match. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Development of an Italian RM Y-STR haplotype database: Results of the 2013 GEFI collaborative exercise.

PubMed

Robino, C; Ralf, A; Pasino, S; De Marchi, M R; Ballantyne, K N; Barbaro, A; Bini, C; Carnevali, E; Casarino, L; Di Gaetano, C; Fabbri, M; Ferri, G; Giardina, E; Gonzalez, A; Matullo, G; Nutini, A L; Onofri, V; Piccinini, A; Piglionica, M; Ponzano, E; Previderè, C; Resta, N; Scarnicci, F; Seidita, G; Sorçaburu-Cigliero, S; Turrina, S; Verzeletti, A; Kayser, M

2015-03-01

Recently introduced rapidly mutating Y-chromosomal short tandem repeat (RM Y-STR) loci, displaying a multiple-fold higher mutation rate relative to any other Y-STRs, including those conventionally used in forensic casework, have been demonstrated to improve the resolution of male lineage differentiation and to allow male relative separation usually impossible with standard Y-STRs. However, large and geographically-detailed frequency haplotype databases are required to estimate the statistical weight of RM Y-STR haplotype matches if observed in forensic casework. With this in mind, the Italian Working Group (GEFI) of the International Society for Forensic Genetics launched a collaborative exercise aimed at generating an Italian quality controlled forensic RM Y-STR haplotype database. Overall 1509 male individuals from 13 regional populations covering northern, central and southern areas of the Italian peninsula plus Sicily were collected, including both "rural" and "urban" samples classified according to population density in the sampling area. A subset of individuals was additionally genotyped for Y-STR loci included in the Yfiler and PowerPlex Y23 (PPY23) systems (75% and 62%, respectively), allowing the comparison of RM and conventional Y-STRs. Considering the whole set of 13 RM Y-STRs, 1501 unique haplotypes were observed among the 1509 sampled Italian men with a haplotype diversity of 0.999996, largely superior to Yfiler and PPY23 with 0.999914 and 0.999950, respectively. AMOVA indicated that 99.996% of the haplotype variation was within populations, confirming that genetic-geographic structure is almost undetected by RM Y-STRs. Haplotype sharing among regional Italian populations was not observed at all with the complete set of 13 RM Y-STRs. Haplotype sharing within Italian populations was very rare (0.27% non-unique haplotypes), and lower in urban (0.22%) than rural (0.29%) areas. Additionally, 422 father-son pairs were investigated, and 20.1% of them could be discriminated by the whole set of 13 RM Y-STRs, which was very close to the theoretically expected estimate of 19.5% given the mutation rates of the markers used. Results obtained from a high-coverage Italian haplotype dataset confirm on the regional scale the exceptional ability of RM Y-STRs to resolve male lineages previously observed globally, and attest the unsurpassed value of RM Y-STRs for male-relative differentiation purposes. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
References for Haplotype Imputation in the Big Data Era

PubMed Central

Li, Wenzhi; Xu, Wei; Li, Qiling; Ma, Li; Song, Qing

2016-01-01

Imputation is a powerful in silico approach to fill in those missing values in the big datasets. This process requires a reference panel, which is a collection of big data from which the missing information can be extracted and imputed. Haplotype imputation requires ethnicity-matched references; a mismatched reference panel will significantly reduce the quality of imputation. However, currently existing big datasets cover only a small number of ethnicities, there is a lack of ethnicity-matched references for many ethnic populations in the world, which has hampered the data imputation of haplotypes and its downstream applications. To solve this issue, several approaches have been proposed and explored, including the mixed reference panel, the internal reference panel and genotype-converted reference panel. This review article provides the information and comparison between these approaches. Increasing evidence showed that not just one or two genetic elements dictate the gene activity and functions; instead, cis-interactions of multiple elements dictate gene activity. Cis-interactions require the interacting elements to be on the same chromosome molecule, therefore, haplotype analysis is essential for the investigation of cis-interactions among multiple genetic variants at different loci, and appears to be especially important for studying the common diseases. It will be valuable in a wide spectrum of applications from academic research, to clinical diagnosis, prevention, treatment, and pharmaceutical industry. PMID:27274952
New HLA haplotype frequency reference standards: high-resolution and large sample typing of HLA DR-DQ haplotypes in a sample of European Americans.

PubMed

Klitz, W; Maiers, M; Spellman, S; Baxter-Lowe, L A; Schmeckpeper, B; Williams, T M; Fernandez-Viña, M

2003-10-01

A collaborative study involving a large sample of European Americans was typed for the histocompatibility loci of the HLA DR-DQ region and subjected to intensive typing validation measures in order to accurately determine haplotype composition and frequency. The resulting tables have immediate application to HLA typing and allogeneic transplantation. The loci within the DR-DQ region are especially valuable for such an undertaking because of their tight linkage and high linkage disequilibrium. The 3798 haplotypes, derived from 1899 unrelated individuals, had a total of 75 distinct DRB1-DQA1-DQB1 haplotypes. The frequency distribution of the haplotypes was right skewed with haplotypes occurring at a frequency of less than 1% numbering 59 and yet constituting less than 12% of the total sample. Given DRB1 typing, it was possible to infer the exact DQA1 and DQB1 composition of a haplotype with high confidence (>90% likelihood) in 21 of the 35 high-resolution DRB1 alleles present in the sample. Of the DRB1 alleles without high reliability for DQ haplotype inference, only *0401, *0701 and *1302 were common, the remaining 11 DRB1 alleles constituting less than 5% of the total sample. This approach failed for the 13 serologically equivalent DR alleles in which only 33% of DQ haplotypes could be reliably inferred. The 36 DQA1-DQB1 haplotypes present in the total sample conformed to the known pattern of permissible heterodimers. Four DQA1-DQB1 haplotypes, all rare, are reported here for the first time. The haplotype frequency tables are suitable as a reference standard for HLA typing of the DR and DQ loci in European Americans.
De novo assembly of a haplotype-resolved human genome.

PubMed

Cao, Hongzhi; Wu, Honglong; Luo, Ruibang; Huang, Shujia; Sun, Yuhui; Tong, Xin; Xie, Yinlong; Liu, Binghang; Yang, Hailong; Zheng, Hancheng; Li, Jian; Li, Bo; Wang, Yu; Yang, Fang; Sun, Peng; Liu, Siyang; Gao, Peng; Huang, Haodong; Sun, Jing; Chen, Dan; He, Guangzhu; Huang, Weihua; Huang, Zheng; Li, Yue; Tellier, Laurent C A M; Liu, Xiao; Feng, Qiang; Xu, Xun; Zhang, Xiuqing; Bolund, Lars; Krogh, Anders; Kristiansen, Karsten; Drmanac, Radoje; Drmanac, Snezana; Nielsen, Rasmus; Li, Songgang; Wang, Jian; Yang, Huanming; Li, Yingrui; Wong, Gane Ka-Shu; Wang, Jun

2015-06-01

The human genome is diploid, and knowledge of the variants on each chromosome is important for the interpretation of genomic information. Here we report the assembly of a haplotype-resolved diploid genome without using a reference genome. Our pipeline relies on fosmid pooling together with whole-genome shotgun strategies, based solely on next-generation sequencing and hierarchical assembly methods. We applied our sequencing method to the genome of an Asian individual and generated a 5.15-Gb assembled genome with a haplotype N50 of 484 kb. Our analysis identified previously undetected indels and 7.49 Mb of novel coding sequences that could not be aligned to the human reference genome, which include at least six predicted genes. This haplotype-resolved genome represents the most complete de novo human genome assembly to date. Application of our approach to identify individual haplotype differences should aid in translating genotypes to phenotypes for the development of personalized medicine.
DNA from the Past Informs Ex Situ Conservation for the Future: An “Extinct” Species of Galápagos Tortoise Identified in Captivity

PubMed Central

Russello, Michael A.; Poulakakis, Nikos; Gibbs, James P.; Tapia, Washington; Benavides, Edgar; Powell, Jeffrey R.; Caccone, Adalgisa

2010-01-01

Background Although not unusual to find captive relicts of species lost in the wild, rarely are presumed extinct species rediscovered outside of their native range. A recent study detected living descendents of an extinct Galápagos tortoise species (Chelonoidis elephantopus) once endemic to Floreana Island on the neighboring island of Isabela. This finding adds to the growing cryptic diversity detected among these species in the wild. There also exists a large number of Galápagos tortoises in captivity of ambiguous origin. The recently accumulated population-level haplotypic and genotypic data now available for C. elephantopus add a critical reference population to the existing database of 11 extant species for investigating the origin of captive individuals of unknown ancestry. Methodology/Findings We reanalyzed mitochondrial DNA control region haplotypes and microsatellite genotypes of 156 captive individuals using an expanded reference database that included all extant Galápagos tortoise species as well as the extinct species from Floreana. Nine individuals (six females and three males) exhibited strong signatures of Floreana ancestry and a high probability of assignment to C. elephantopus as detected by Bayesian assignment and clustering analyses of empirical and simulated data. One male with high assignment probability to C. elephantopus based on microsatellite genotypic data also possessed a “Floreana-like” mitochondrial DNA haplotype. Significance Historical DNA analysis of museum specimens has provided critical spatial and temporal components to ecological, evolutionary, taxonomic and conservation-related research, but rarely has it informed ex situ species recovery efforts. Here, the availability of population-level genotypic data from the extinct C. elephantopus enabled the identification of nine Galápagos tortoise individuals of substantial conservation value that were previously misassigned to extant species of varying conservation status. As all captive individuals of C. elephantopus ancestry currently reside at a centralized breeding facility on Santa Cruz, these findings permit breeding efforts to commence in support of the reestablishment of this extinct species to its native range. PMID:20084268
Reference-based phasing using the Haplotype Reference Consortium panel.

PubMed

Loh, Po-Ru; Danecek, Petr; Palamara, Pier Francesco; Fuchsberger, Christian; A Reshef, Yakir; K Finucane, Hilary; Schoenherr, Sebastian; Forer, Lukas; McCarthy, Shane; Abecasis, Goncalo R; Durbin, Richard; L Price, Alkes

2016-11-01

Haplotype phasing is a fundamental problem in medical and population genetics. Phasing is generally performed via statistical phasing in a genotyped cohort, an approach that can yield high accuracy in very large cohorts but attains lower accuracy in smaller cohorts. Here we instead explore the paradigm of reference-based phasing. We introduce a new phasing algorithm, Eagle2, that attains high accuracy across a broad range of cohort sizes by efficiently leveraging information from large external reference panels (such as the Haplotype Reference Consortium; HRC) using a new data structure based on the positional Burrows-Wheeler transform. We demonstrate that Eagle2 attains a ∼20× speedup and ∼10% increase in accuracy compared to reference-based phasing using SHAPEIT2. On European-ancestry samples, Eagle2 with the HRC panel achieves >2× the accuracy of 1000 Genomes-based phasing. Eagle2 is open source and freely available for HRC-based phasing via the Sanger Imputation Service and the Michigan Imputation Server.
Genotype Imputation with Thousands of Genomes

PubMed Central

Howie, Bryan; Marchini, Jonathan; Stephens, Matthew

2011-01-01

Genotype imputation is a statistical technique that is often used to increase the power and resolution of genetic association studies. Imputation methods work by using haplotype patterns in a reference panel to predict unobserved genotypes in a study dataset, and a number of approaches have been proposed for choosing subsets of reference haplotypes that will maximize accuracy in a given study population. These panel selection strategies become harder to apply and interpret as sequencing efforts like the 1000 Genomes Project produce larger and more diverse reference sets, which led us to develop an alternative framework. Our approach is built around a new approximation that uses local sequence similarity to choose a custom reference panel for each study haplotype in each region of the genome. This approximation makes it computationally efficient to use all available reference haplotypes, which allows us to bypass the panel selection step and to improve accuracy at low-frequency variants by capturing unexpected allele sharing among populations. Using data from HapMap 3, we show that our framework produces accurate results in a wide range of human populations. We also use data from the Malaria Genetic Epidemiology Network (MalariaGEN) to provide recommendations for imputation-based studies in Africa. We demonstrate that our approximation improves efficiency in large, sequence-based reference panels, and we discuss general computational strategies for modern reference datasets. Genome-wide association studies will soon be able to harness the power of thousands of reference genomes, and our work provides a practical way for investigators to use this rich information. New methodology from this study is implemented in the IMPUTE2 software package. PMID:22384356
Swine Leukocyte Antigen Diversity in Canadian Specific Pathogen-Free Yorkshire and Landrace Pigs

PubMed Central

Gao, Caixia; Quan, Jinqiang; Jiang, Xinjie; Li, Changwen; Lu, Xiaoye; Chen, Hongyan

2017-01-01

The highly polymorphic swine major histocompatibility complex (MHC), termed swine leukocyte antigen (SLA), is associated with different levels of immunologic responses to infectious diseases, vaccines, and transplantation. Pig breeds with known SLA haplotypes are important genetic resources for biomedical research. Canadian Yorkshire and Landrace pigs represent the current specific pathogen-free (SPF) breeding stock maintained in the isolation environment at the Harbin Veterinary Research Institute, Chinese Academy of Agricultural Sciences. In this study, we identified 61 alleles at five polymorphic SLA loci (SLA-1, SLA-2, SLA-3, DRB1, and DQB1) representing 17 class I haplotypes and 11 class II haplotypes using reverse transcription-polymerase chain reaction (RT-PCR) sequence-based typing and PCR-sequence specific primers methods in 367 Canadian SPF Yorkshire and Landrace pigs. The official designation of the alleles has been assigned by the SLA Nomenclature Committee of the International Society for Animal Genetics and released in updated Immuno Polymorphism Database-MHC SLA sequence database [Release 2.0.0.3 (2016-11-03)]. The submissions confirmed some unassigned alleles and standardized nomenclatures of many previously unconfirmed alleles in the GenBank database. Three class I haplotypes, Hp-37.0, 63.0, and 73.0, appeared to be novel and have not previously been reported in other pig populations. One crossover within the class I region and two between class I and class II regions were observed, resulting in three new recombinant haplotypes. The presence of the duplicated SLA-1 locus was confirmed in three class I haplotypes Hp-28.0, Hp-35.0, and Hp-63.0. Furthermore, we also analyzed the functional diversities of 19 identified frequent SLA class I molecules in this study and confirmed the existence of four supertypes using the MHCcluster method. These results will be useful for studying the adaptive immune response and immunological phenotypic differences in pigs, screening potential T-cell epitopes, and further developing the more effective vaccines. PMID:28360911
High-quality mtDNA control region sequences from 680 individuals sampled across the Netherlands to establish a national forensic mtDNA reference database.

PubMed

Chaitanya, Lakshmi; van Oven, Mannis; Brauer, Silke; Zimmermann, Bettina; Huber, Gabriela; Xavier, Catarina; Parson, Walther; de Knijff, Peter; Kayser, Manfred

2016-03-01

The use of mitochondrial DNA (mtDNA) for maternal lineage identification often marks the last resort when investigating forensic and missing-person cases involving highly degraded biological materials. As with all comparative DNA testing, a match between evidence and reference sample requires a statistical interpretation, for which high-quality mtDNA population frequency data are crucial. Here, we determined, under high quality standards, the complete mtDNA control-region sequences of 680 individuals from across the Netherlands sampled at 54 sites, covering the entire country with 10 geographic sub-regions. The complete mtDNA control region (nucleotide positions 16,024-16,569 and 1-576) was amplified with two PCR primers and sequenced with ten different sequencing primers using the EMPOP protocol. Haplotype diversity of the entire sample set was very high at 99.63% and, accordingly, the random-match probability was 0.37%. No population substructure within the Netherlands was detected with our dataset. Phylogenetic analyses were performed to determine mtDNA haplogroups. Inclusion of these high-quality data in the EMPOP database (accession number: EMP00666) will improve its overall data content and geographic coverage in the interest of all EMPOP users worldwide. Moreover, this dataset will serve as (the start of) a national reference database for mtDNA applications in forensic and missing person casework in the Netherlands. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Filipino DNA variation at 12 X-chromosome short tandem repeat markers.

PubMed

Salvador, Jazelyn M; Apaga, Dame Loveliness T; Delfin, Frederick C; Calacal, Gayvelline C; Dennis, Sheila Estacio; De Ungria, Maria Corazon A

2018-06-08

Demands for solving complex kinship scenarios where only distant relatives are available for testing have risen in the past years. In these instances, other genetic markers such as X-chromosome short tandem repeat (X-STR) markers are employed to supplement autosomal and Y-chromosomal STR DNA typing. However, prior to use, the degree of STR polymorphism in the population requires evaluation through generation of an allele or haplotype frequency population database. This population database is also used for statistical evaluation of DNA typing results. Here, we report X-STR data from 143 unrelated Filipino male individuals who were genotyped via conventional polymerase chain reaction-capillary electrophoresis (PCR-CE) using the 12 X-STR loci included in the Investigator ® Argus X-12 kit (Qiagen) and via massively parallel sequencing (MPS) of seven X-STR loci included in the ForenSeq ™ DNA Signature Prep kit of the MiSeq ® FGx ™ Forensic Genomics System (Illumina). Allele calls between PCR-CE and MPS systems were consistent (100% concordance) across seven overlapping X-STRs. Allele and haplotype frequencies and other parameters of forensic interest were calculated based on length (PCR-CE, 12 X-STRs) and sequence (MPS, seven X-STRs) variations observed in the population. Results of our study indicate that the 12 X-STRs in the PCR-CE system are highly informative for the Filipino population. MPS of seven X-STR loci identified 73 X-STR alleles compared with 55 X-STR alleles that were identified solely by length via PCR-CE. Of the 73 sequence-based alleles observed, six alleles have not been reported in the literature. The population data presented here may serve as a reference Philippine frequency database of X-STRs for forensic casework applications. Copyright © 2018 Elsevier B.V. All rights reserved.

Integration of genotoxicity and population genetic analyses in kangaroo rats (Dipodomys merriami) exposed to radionuclide contamination at the Nevada Test Site, USA

USGS Publications Warehouse

Theodorakis, Christopher W.; Bickham, John W.; Lamb, Trip; Medica, Philip A.; Lyne, T. Barrett

2001-01-01

We examined effects of radionuclide exposure at two atomic blast sites on kangaroo rats (Dipodomys merriami) at the Nevada Test Site, Nevada, USA, using genotoxicity and population genetic analyses. We assessed chromosome damage by micronucleus and flow cytometric assays and genetic variation by randomly amplified polymorphic DNA (RAPD) and mitochondrial DNA (mtDNA) analyses. The RAPD analysis showed no population structure, but mtDNA exhibited differentiation among and within populations. Genotoxicity effects were not observed when all individuals were analyzed. However, individuals with mtDNA haplotypes unique to the contaminated sites had greater chromosomal damage than contaminated-site individuals with haplotypes shared with reference sites. When interpopulation comparisons used individuals with unique haplotypes, one contaminated site had greater levels of chromosome damage than one or both of the reference sites. We hypothesize that shared-haplotype individuals are potential migrants and that unique-haplotype individuals are potential long-term residents. A parsimony approach was used to estimate the minimum number of migration events necessary to explain the haplotype distributions on a phylogenetic tree. The observed predominance of migration events into the contaminated sites supported our migration hypothesis. We conclude the atomic blast sites are ecological sinks and that immigration masks the genotoxic effects of radiation on the resident populations.
A high-throughput Sanger strategy for human mitochondrial genome sequencing

PubMed Central

2013-01-01

Background A population reference database of complete human mitochondrial genome (mtGenome) sequences is needed to enable the use of mitochondrial DNA (mtDNA) coding region data in forensic casework applications. However, the development of entire mtGenome haplotypes to forensic data quality standards is difficult and laborious. A Sanger-based amplification and sequencing strategy that is designed for automated processing, yet routinely produces high quality sequences, is needed to facilitate high-volume production of these mtGenome data sets. Results We developed a robust 8-amplicon Sanger sequencing strategy that regularly produces complete, forensic-quality mtGenome haplotypes in the first pass of data generation. The protocol works equally well on samples representing diverse mtDNA haplogroups and DNA input quantities ranging from 50 pg to 1 ng, and can be applied to specimens of varying DNA quality. The complete workflow was specifically designed for implementation on robotic instrumentation, which increases throughput and reduces both the opportunities for error inherent to manual processing and the cost of generating full mtGenome sequences. Conclusions The described strategy will assist efforts to generate complete mtGenome haplotypes which meet the highest data quality expectations for forensic genetic and other applications. Additionally, high-quality data produced using this protocol can be used to assess mtDNA data developed using newer technologies and chemistries. Further, the amplification strategy can be used to enrich for mtDNA as a first step in sample preparation for targeted next-generation sequencing. PMID:24341507
Near East mtDNA haplotype variants in Roman cattle from Augusta Raurica, Switzerland, and in the Swiss Evolène breed.

PubMed

Schlumbaum, A; Turgay, M; Schibler, J

2006-08-01

Typical Near East mitochondrial haplotypes of the T2 lineage were found in one cattle metacarpus sample from the Roman period and in two present-day Evolène cattle in Switzerland. Sequences from eight additional Evolène and four Raetian Grey aligned to the European haplotype T3. Analysis of nucleotide diversity within the mitochondrial D-loop of both studied Swiss cattle breeds revealed high haplotype diversity and similar diversity to a European cattle reference group. Mitochondrial T3 haplotypes radiated star-like from two similarly frequent haplotypes, possibly indicating two different expansion routes. The breed structure of Evolène cattle can be explained either by an introduction of diverse female lineages from the domestication centre or by later admixture. The introduction of the Near East lineage to Switzerland must have happened during the Roman time or earlier.
Modeling coverage gaps in haplotype frequencies via Bayesian inference to improve stem cell donor selection.

PubMed

Louzoun, Yoram; Alter, Idan; Gragert, Loren; Albrecht, Mark; Maiers, Martin

2018-05-01

Regardless of sampling depth, accurate genotype imputation is limited in regions of high polymorphism which often have a heavy-tailed haplotype frequency distribution. Many rare haplotypes are thus unobserved. Statistical methods to improve imputation by extending reference haplotype distributions using linkage disequilibrium patterns that relate allele and haplotype frequencies have not yet been explored. In the field of unrelated stem cell transplantation, imputation of highly polymorphic human leukocyte antigen (HLA) genes has an important application in identifying the best-matched stem cell donor when searching large registries totaling over 28,000,000 donors worldwide. Despite these large registry sizes, a significant proportion of searched patients present novel HLA haplotypes. Supporting this observation, HLA population genetic models have indicated that many extant HLA haplotypes remain unobserved. The absent haplotypes are a significant cause of error in haplotype matching. We have applied a Bayesian inference methodology for extending haplotype frequency distributions, using a model where new haplotypes are created by recombination of observed alleles. Applications of this joint probability model offer significant improvement in frequency distribution estimates over the best existing alternative methods, as we illustrate using five-locus HLA frequency data from the National Marrow Donor Program registry. Transplant matching algorithms and disease association studies involving phasing and imputation of rare variants may benefit from this statistical inference framework.
A phased SNP-based classification of sickle cell anemia HBB haplotypes.

PubMed

Shaikho, Elmutaz M; Farrell, John J; Alsultan, Abdulrahman; Qutub, Hatem; Al-Ali, Amein K; Figueiredo, Maria Stella; Chui, David H K; Farrer, Lindsay A; Murphy, George J; Mostoslavsky, Gustavo; Sebastiani, Paola; Steinberg, Martin H

2017-08-11

Sickle cell anemia causes severe complications and premature death. Five common β-globin gene cluster haplotypes are each associated with characteristic fetal hemoglobin (HbF) levels. As HbF is the major modulator of disease severity, classifying patients according to haplotype is useful. The first method of haplotype classification used restriction fragment length polymorphisms (RFLPs) to detect single nucleotide polymorphisms (SNPs) in the β-globin gene cluster. This is labor intensive, and error prone. We used genome-wide SNP data imputed to the 1000 Genomes reference panel to obtain phased data distinguishing parental alleles. We successfully haplotyped 813 sickle cell anemia patients previously classified by RFLPs with a concordance >98%. Four SNPs (rs3834466, rs28440105, rs10128556, and rs968857) marking four different restriction enzyme sites unequivocally defined most haplotypes. We were able to assign a haplotype to 86% of samples that were either partially or misclassified using RFLPs. Phased data using only four SNPs allowed unequivocal assignment of a haplotype that was not always possible using a larger number of RFLPs. Given the availability of genome-wide SNP data, our method is rapid and does not require high computational resources.
Practical interpretation of CYP2D6 haplotypes: Comparison and integration of automated and expert calling.

PubMed

Ruaño, Gualberto; Kocherla, Mohan; Graydon, James S; Holford, Theodore R; Makowski, Gregory S; Goethe, John W

2016-05-01

We describe a population genetic approach to compare samples interpreted with expert calling (EC) versus automated calling (AC) for CYP2D6 haplotyping. The analysis represents 4812 haplotype calls based on signal data generated by the Luminex xMap analyzers from 2406 patients referred to a high-complexity molecular diagnostics laboratory for CYP450 testing. DNA was extracted from buccal swabs. We compared the results of expert calls (EC) and automated calls (AC) with regard to haplotype number and frequency. The ratio of EC to AC was 1:3. Haplotype frequencies from EC and AC samples were convergent across haplotypes, and their distribution was not statistically different between the groups. Most duplications required EC, as only expansions with homozygous or hemizygous haplotypes could be automatedly called. High-complexity laboratories can offer equivalent interpretation to automated calling for non-expanded CYP2D6 loci, and superior interpretation for duplications. We have validated scientific expert calling specified by scoring rules as standard operating procedure integrated with an automated calling algorithm. The integration of EC with AC is a practical strategy for CYP2D6 clinical haplotyping. Copyright © 2016 Elsevier B.V. All rights reserved.
Genetic diversity of Taenia asiatica from Thailand and other geographical locations as revealed by cytochrome c oxidase subunit 1 sequences.

PubMed

Anantaphruti, Malinee Thairungroj; Thaenkham, Urusa; Watthanakulpanich, Dorn; Phuphisut, Orawan; Maipanich, Wanna; Yoonuan, Tippayarat; Nuamtanong, Supaporn; Pubampen, Somjit; Sanguankiat, Surapol

2013-02-01

Twelve 924 bp cytochrome c oxidase subunit 1 (cox1) mitochondrial DNA sequences from Taenia asiatica isolates from Thailand were aligned and compared with multiple sequence isolates from Thailand and 6 other countries from the GenBank database. The genetic divergence of T. asiatica was also compared with Taenia saginata database sequences from 6 different countries in Asia, including Thailand, and 3 countries from other continents. The results showed that there were minor genetic variations within T. asiatica species, while high intraspecies variation was found in T. saginata. There were only 2 haplotypes and 1 polymorphic site found in T. asiatica, but 8 haplotypes and 9 polymorphic sites in T. saginata. Haplotype diversity was very low, 0.067, in T. asiatica and high, 0.700, in T. saginata. The very low genetic diversity suggested that T. asiatica may be at a risk due to the loss of potential adaptive alleles, resulting in reduced viability and decreased responses to environmental changes, which may endanger the species.
Genetic Diversity of Taenia asiatica from Thailand and Other Geographical Locations as Revealed by Cytochrome c Oxidase Subunit 1 Sequences

PubMed Central

Thaenkham, Urusa; Watthanakulpanich, Dorn; Phuphisut, Orawan; Maipanich, Wanna; Yoonuan, Tippayarat; Nuamtanong, Supaporn; Pubampen, Somjit; Sanguankiat, Surapol

2013-01-01

Twelve 924 bp cytochrome c oxidase subunit 1 (cox1) mitochondrial DNA sequences from Taenia asiatica isolates from Thailand were aligned and compared with multiple sequence isolates from Thailand and 6 other countries from the GenBank database. The genetic divergence of T. asiatica was also compared with Taenia saginata database sequences from 6 different countries in Asia, including Thailand, and 3 countries from other continents. The results showed that there were minor genetic variations within T. asiatica species, while high intraspecies variation was found in T. saginata. There were only 2 haplotypes and 1 polymorphic site found in T. asiatica, but 8 haplotypes and 9 polymorphic sites in T. saginata. Haplotype diversity was very low, 0.067, in T. asiatica and high, 0.700, in T. saginata. The very low genetic diversity suggested that T. asiatica may be at a risk due to the loss of potential adaptive alleles, resulting in reduced viability and decreased responses to environmental changes, which may endanger the species. PMID:23467439
The complete mitogenome of a 500-year-old Inca child mummy.

PubMed

Gómez-Carballa, Alberto; Catelli, Laura; Pardo-Seco, Jacobo; Martinón-Torres, Federico; Roewer, Lutz; Vullo, Carlos; Salas, Antonio

2015-11-12

In 1985, a frozen mummy was found in Cerro Aconcagua (Argentina). Archaeological studies identified the mummy as a seven-year-old Inca sacrifice victim who lived >500 years ago, at the time of the expansion of the Inca Empire towards the southern cone. The sequence of its entire mitogenome was obtained. After querying a large worldwide database of mitogenomes (>28,000) we found that the Inca haplotype belonged to a branch of haplogroup C1b (C1bi) that has not yet been identified in modern Native Americans. The expansion of C1b into the Americas, as estimated using 203 C1b mitogenomes, dates to the initial Paleoindian settlements (~18.3 thousand years ago [kya]); however, its internal variation differs between Mesoamerica and South America. By querying large databases of control region haplotypes (>150,000), we found only a few C1bi members in Peru and Bolivia (e.g. Aymaras), including one haplotype retrieved from ancient DNA of an individual belonging to the Wari Empire (Peruvian Andes). Overall, the results suggest that the profile of the mummy represents a very rare sub-clade that arose 14.3 (5-23.6) kya and could have been more frequent in the past. A Peruvian Inca origin for present-day C1bi haplotypes would satisfy both the genetic and paleo-anthropological findings.
The complete mitogenome of a 500-year-old Inca child mummy

PubMed Central

Gómez-Carballa, Alberto; Catelli, Laura; Pardo-Seco, Jacobo; Martinón-Torres, Federico; Roewer, Lutz; Vullo, Carlos; Salas, Antonio

2015-01-01

In 1985, a frozen mummy was found in Cerro Aconcagua (Argentina). Archaeological studies identified the mummy as a seven-year-old Inca sacrifice victim who lived >500 years ago, at the time of the expansion of the Inca Empire towards the southern cone. The sequence of its entire mitogenome was obtained. After querying a large worldwide database of mitogenomes (>28,000) we found that the Inca haplotype belonged to a branch of haplogroup C1b (C1bi) that has not yet been identified in modern Native Americans. The expansion of C1b into the Americas, as estimated using 203 C1b mitogenomes, dates to the initial Paleoindian settlements (~18.3 thousand years ago [kya]); however, its internal variation differs between Mesoamerica and South America. By querying large databases of control region haplotypes (>150,000), we found only a few C1bi members in Peru and Bolivia (e.g. Aymaras), including one haplotype retrieved from ancient DNA of an individual belonging to the Wari Empire (Peruvian Andes). Overall, the results suggest that the profile of the mummy represents a very rare sub-clade that arose 14.3 (5–23.6) kya and could have been more frequent in the past. A Peruvian Inca origin for present-day C1bi haplotypes would satisfy both the genetic and paleo-anthropological findings. PMID:26561991
Improved imputation of low-frequency and rare variants using the UK10K haplotype reference panel.

PubMed

Huang, Jie; Howie, Bryan; McCarthy, Shane; Memari, Yasin; Walter, Klaudia; Min, Josine L; Danecek, Petr; Malerba, Giovanni; Trabetti, Elisabetta; Zheng, Hou-Feng; Gambaro, Giovanni; Richards, J Brent; Durbin, Richard; Timpson, Nicholas J; Marchini, Jonathan; Soranzo, Nicole

2015-09-14

Imputing genotypes from reference panels created by whole-genome sequencing (WGS) provides a cost-effective strategy for augmenting the single-nucleotide polymorphism (SNP) content of genome-wide arrays. The UK10K Cohorts project has generated a data set of 3,781 whole genomes sequenced at low depth (average 7x), aiming to exhaustively characterize genetic variation down to 0.1% minor allele frequency in the British population. Here we demonstrate the value of this resource for improving imputation accuracy at rare and low-frequency variants in both a UK and an Italian population. We show that large increases in imputation accuracy can be achieved by re-phasing WGS reference panels after initial genotype calling. We also present a method for combining WGS panels to improve variant coverage and downstream imputation accuracy, which we illustrate by integrating 7,562 WGS haplotypes from the UK10K project with 2,184 haplotypes from the 1000 Genomes Project. Finally, we introduce a novel approximation that maintains speed without sacrificing imputation accuracy for rare variants.
Honey bee-inspired algorithms for SNP haplotype reconstruction problem

NASA Astrophysics Data System (ADS)

PourkamaliAnaraki, Maryam; Sadeghi, Mehdi

2016-03-01

Reconstructing haplotypes from SNP fragments is an important problem in computational biology. There have been a lot of interests in this field because haplotypes have been shown to contain promising data for disease association research. It is proved that haplotype reconstruction in Minimum Error Correction model is an NP-hard problem. Therefore, several methods such as clustering techniques, evolutionary algorithms, neural networks and swarm intelligence approaches have been proposed in order to solve this problem in appropriate time. In this paper, we have focused on various evolutionary clustering techniques and try to find an efficient technique for solving haplotype reconstruction problem. It can be referred from our experiments that the clustering methods relying on the behaviour of honey bee colony in nature, specifically bees algorithm and artificial bee colony methods, are expected to result in more efficient solutions. An application program of the methods is available at the following link. http://www.bioinf.cs.ipm.ir/software/haprs/
A Genome-Wide Scan for Breast Cancer Risk Haplotypes among African American Women

PubMed Central

Song, Chi; Chen, Gary K.; Millikan, Robert C.; Ambrosone, Christine B.; John, Esther M.; Bernstein, Leslie; Zheng, Wei; Hu, Jennifer J.; Ziegler, Regina G.; Nyante, Sarah; Bandera, Elisa V.; Ingles, Sue A.; Press, Michael F.; Deming, Sandra L.; Rodriguez-Gil, Jorge L.; Chanock, Stephen J.; Wan, Peggy; Sheng, Xin; Pooler, Loreall C.; Van Den Berg, David J.; Le Marchand, Loic; Kolonel, Laurence N.; Henderson, Brian E.; Haiman, Chris A.; Stram, Daniel O.

2013-01-01

Genome-wide association studies (GWAS) simultaneously investigating hundreds of thousands of single nucleotide polymorphisms (SNP) have become a powerful tool in the investigation of new disease susceptibility loci. Haplotypes are sometimes thought to be superior to SNPs and are promising in genetic association analyses. The application of genome-wide haplotype analysis, however, is hindered by the complexity of haplotypes themselves and sophistication in computation. We systematically analyzed the haplotype effects for breast cancer risk among 5,761 African American women (3,016 cases and 2,745 controls) using a sliding window approach on the genome-wide scale. Three regions on chromosomes 1, 4 and 18 exhibited moderate haplotype effects. Furthermore, among 21 breast cancer susceptibility loci previously established in European populations, 10p15 and 14q24 are likely to harbor novel haplotype effects. We also proposed a heuristic of determining the significance level and the effective number of independent tests by the permutation analysis on chromosome 22 data. It suggests that the effective number was approximately half of the total (7,794 out of 15,645), thus the half number could serve as a quick reference to evaluating genome-wide significance if a similar sliding window approach of haplotype analysis is adopted in similar populations using similar genotype density. PMID:23468962
Polymorphism at Expressed DQ and DR Loci in Five Common Equine MHC Haplotypes

PubMed Central

Miller, Donald; Tallmadge, Rebecca L.; Binns, Matthew; Zhu, Baoli; Mohamoud, Yasmin Ali; Ahmed, Ayeda; Brooks, Samantha A.; Antczak, Douglas F.

2016-01-01

The polymorphism of Major Histocompatibility Complex (MHC) class II DQ and DR genes in five common Equine Leukocyte Antigen (ELA) haplotypes was determined through sequencing of mRNA transcripts isolated from lymphocytes of eight ELA homozygous horses. Ten expressed MHC class II genes were detected in horses of the ELA-A3 haplotype carried by the donor horses of the equine Bacterial Artificial Chromosome (BAC) library and the reference genome sequence: four DR genes and six DQ genes. The other four ELA haplotypes contained at least eight expressed polymorphic MHC class II loci. Next Generation Sequencing (NGS) of genomic DNA of these four MHC haplotypes revealed stop codons in the DQA3 gene in the ELA-A2, ELA-A5, and ELA-A9 haplotypes. Few NGS reads were obtained for the other MHC class II genes that were not amplified in these horses. The amino acid sequences across haplotypes contained locus-specific residues, and the locus clusters produced by phylogenetic analysis were well supported. The MHC class II alleles within the five tested haplotypes were largely non-overlapping between haplotypes. The complement of equine MHC class II DQ and DR genes appears to be well conserved between haplotypes, in contrast to the recently described variation in class I gene loci between equine MHC haplotypes. The identification of allelic series of equine MHC class II loci will aid comparative studies of mammalian MHC conservation and evolution and may also help to interpret associations between the equine MHC class II region and diseases of the horse. PMID:27889800
Mendel-GPU: haplotyping and genotype imputation on graphics processing units

PubMed Central

Chen, Gary K.; Wang, Kai; Stram, Alex H.; Sobel, Eric M.; Lange, Kenneth

2012-01-01

Motivation: In modern sequencing studies, one can improve the confidence of genotype calls by phasing haplotypes using information from an external reference panel of fully typed unrelated individuals. However, the computational demands are so high that they prohibit researchers with limited computational resources from haplotyping large-scale sequence data. Results: Our graphics processing unit based software delivers haplotyping and imputation accuracies comparable to competing programs at a fraction of the computational cost and peak memory demand. Availability: Mendel-GPU, our OpenCL software, runs on Linux platforms and is portable across AMD and nVidia GPUs. Users can download both code and documentation at http://code.google.com/p/mendel-gpu/. Contact: gary.k.chen@usc.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:22954633
Northern Slavs from Serbia do not show a founder effect at autosomal and Y-chromosomal STRs and retain their paternal genetic heritage.

PubMed

Rębała, Krzysztof; Veselinović, Igor; Siváková, Daniela; Patskun, Erika; Kravchenko, Sergey; Szczerkowska, Zofia

2014-01-01

Studies on Y-chromosomal markers revealed significant genetic differentiation between Southern and Northern (Western and Eastern) Slavic populations. The northern Serbian region of Vojvodina is inhabited by Southern Slavic Serbian majority and, inter alia, Western Slavic (Slovak) and Eastern Slavic (Ruthenian) minorities. In the study, 15 autosomal STR markers were analysed in unrelated Slovaks, Ruthenians and Serbs from northern Serbia and western Slovakia. Additionally, Slovak males from Serbia were genotyped for 17 Y-chromosomal STR loci. The results were compared to data available for other Slavic populations. Genetic distances for autosomal markers revealed homogeneity between Serbs from northern Serbia and Slovaks from western Slovakia and distinctiveness of Serbian Slovaks and Ruthenians. Y-STR variation showed a clear genetic departure of the Slovaks and Ruthenians inhabiting Vojvodina from their Serbian neighbours and genetic similarity to the Northern Slavic populations of Slovakia and Ukraine. Admixture estimates revealed negligible Serbian paternal ancestry in both Northern Slavic minorities of Vojvodina, providing evidence for their genetic isolation from the Serbian majority population. No reduction of genetic diversity at autosomal and Y-chromosomal markers was found, excluding genetic drift as a reason for differences observed at autosomal STRs. Analysis of molecular variance detected significant population stratification of autosomal and Y-chromosomal microsatellites in the three Slavic populations of northern Serbia, indicating necessity for separate databases used for estimations of frequencies of autosomal and Y-chromosomal STR profiles in forensic casework. Our results demonstrate that regarding Y-STR haplotypes, Serbian Slovaks and Ruthenians fit in the Eastern European metapopulation defined in the Y chromosome haplotype reference database. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Intrahaplotypic Variants Differentiate Complex Linkage Disequilibrium within Human MHC Haplotypes

PubMed Central

Lam, Tze Hau; Tay, Matthew Zirui; Wang, Bei; Xiao, Ziwei; Ren, Ee Chee

2015-01-01

Distinct regions of long-range genetic fixation in the human MHC region, known as conserved extended haplotypes (CEHs), possess unique genomic characteristics and are strongly associated with numerous diseases. While CEHs appear to be homogeneous by SNP analysis, the nature of fine variations within their genomic structure is unknown. Using multiple, MHC-homozygous cell lines, we demonstrate extensive sequence conservation in two common Asian MHC haplotypes: A33-B58-DR3 and A2-B46-DR9. However, characterization of phase-resolved MHC haplotypes revealed unique intra-CEH patterns of variation and uncovered 127 single nucleotide variants (SNVs) which are missing from public databases. We further show that the strong linkage disequilibrium structure within the human MHC that typically confounds precise identification of genetic features can be resolved using intra-CEH variants, as evidenced by rs3129063 and rs448489, which affect expression of ZFP57, a gene important in methylation and epigenetic regulation. This study demonstrates an improved strategy that can be used towards genetic dissection of diseases. PMID:26593880
Next Generation Sequencing Plus (NGS+) with Y-chromosomal Markers for Forensic Pedigree Searches.

PubMed

Qian, Xiaoqin; Hou, Jiayi; Wang, Zheng; Ye, Yi; Lang, Min; Gao, Tianzhen; Liu, Jing; Hou, Yiping

2017-09-12

There is high demand for forensic pedigree searches with Y-chromosome short tandem repeat (Y-STR) profiling in large-scale crime investigations. However, when two Y-STR haplotypes have a few mismatched loci, it is difficult to determine if they are from the same male lineage because of the high mutation rate of Y-STRs. Here we design a new strategy to handle cases in which none of pedigree samples shares identical Y-STR haplotype. We combine next generation sequencing (NGS), capillary electrophoresis and pyrosequencing under the term 'NGS+' for typing Y-STRs and Y-chromosomal single nucleotide polymorphisms (Y-SNPs). The high-resolution Y-SNP haplogroup and Y-STR haplotype can be obtained with NGS+. We further developed a new data-driven decision rule, FSindex, for estimating the likelihood for each retrieved pedigree. Our approach enables positive identification of pedigree from mismatched Y-STR haplotypes. It is envisaged that NGS+ will revolutionize forensic pedigree searches, especially when the person of interest was not recorded in forensic DNA database.
Construction and forensic genetic characterization of 11 autosomal haplotypes consisting of 22 tri-allelic indels.

PubMed

Zhao, Xiaohong; Chen, Xiaogang; Zhao, Yuancun; Zhang, Shu; Gao, Zehua; Yang, Yiwen; Wang, Yufang; Zhang, Ji

2018-05-01

Insertion/deletion polymorphisms (indels), which combine the advantages of both short tandem repeats and single-nucleotide polymorphisms, are suitable for parentage testing. To overcome the limitations of the low polymorphism of di-allelic indels, we constructed a set of haplotypes with physically linked, multi-allelic indels. Candidate haplotypes were selected from the 1000 Genomes Project database, and were subject to the following criteria for inclusion: (i) each marker must have a minimum allele frequency (MAF) of ≥0.1 in the Han population of China; (ii) markers must exist in a non-coding region; (iii) the physical distance between a pair of candidate indels must be <500 bp; (iv) the allele length variation of each indel from 1 to 20 bp; (v) different haplotypes must be located on different chromosomes or chromosomal arms, or be more than 10 Mb apart if on the same chromosomal arm; and (vi) they must not be located across a recombination hotspot. A multiplex system with 11 haplotype markers, comprising 22 tri-allelic indel loci distributed over 10 chromosomes was developed. To validate the multiplex panel, we investigated the haplotype distribution in sets of two and three-generation pedigrees. The results demonstrated that the haplotypes consisting of multi-allelic indel markers exhibited higher polymorphism than a single indel locus, and thus provide Supplementary information for forensic kinship identification. Copyright © 2018 Elsevier B.V. All rights reserved.
Influence of promoter/enhancer region haplotypes on MGMT transcriptional regulation: a potential biomarker for human sensitivity to alkylating agents.

PubMed

Xu, Meixiang; Nekhayeva, Ilona; Cross, Courtney E; Rondelli, Catherine M; Wickliffe, Jeffrey K; Abdel-Rahman, Sherif Z

2014-03-01

The O6-methylguanine-DNA methyltransferase gene (MGMT) encodes the direct reversal DNA repair protein that removes alkyl adducts from the O6 position of guanine. Several single-nucleotide polymorphisms (SNPs) exist in the MGMT promoter/enhancer (P/E) region. However, the haplotype structure encompassing these SNPs and their functional/biological significance are currently unknown. We hypothesized that MGMT P/E haplotypes, rather than individual SNPs, alter MGMT transcription and can thus alter human sensitivity to alkylating agents. To identify the haplotype structure encompassing the MGMT P/E region SNPs, we sequenced 104 DNA samples from healthy individuals and inferred the haplotypes using the data generated. We identified eight SNPs in this region, namely T7C (rs180989103), T135G (rs1711646), G290A (rs61859810), C485A (rs1625649), C575A (rs113813075), G666A (rs34180180), C777A (rs34138162) and C1099T (rs16906252). Phylogenetics and Sequence Evolution analysis predicted 21 potential haplotypes that encompass these SNPs ranging in frequencies from 0.000048 to 0.39. Of these, 10 were identified in our study population as 20 paired haplotype combinations. To determine the functional significance of these haplotypes, luciferase reporter constructs representing these haplotypes were transfected into glioblastoma cells and their effect on MGMT promoter activity was determined. Compared with the most common (reference) haplotype 1, seven haplotypes significantly upregulated MGMT promoter activity (18-119% increase; P < 0.05), six significantly downregulated MGMT promoter activity (29-97% decrease; P < 0.05) and one haplotype had no effect. Mechanistic studies conducted support the conclusion that MGMT P/E haplotypes, rather than individual SNPs, differentially regulate MGMT transcription and could thus play a significant role in human sensitivity to environmental and therapeutic alkylating agents.

Linkage disequilibrium in HLA cannot be explained by selective recombination.

PubMed

Termijtelen, A; D'Amaro, J; van Rood, J J; Schreuder, G M

1995-11-01

Some combinations of HLA-A, -B and -DR antigens occur more frequently than would be expected from their gene frequencies in the population. This phenomenon, referred to as Linkage Disequilibrium (LD) has been the origin of many speculations. One hypothesis to explain LD is that some haplotypes are protected from recombination. A second hypothesis is that these HLA antigens preferentially recombine after cross-over to create an LD haplotype. We tested these 2 hypotheses: from a pool of over 10,000 families typed in our department, we analyzed 126 families in which HLA-A:B or B:DR recombinant offspring was documented. To overcome a possible bias in our material, we used the non-recombined haplotypes from the same 126 families as a control group. Our results show that the number of cross-overs through LD haplotypes is not significantly lower then would be expected if recombination occurred randomly. Also the number of LD haplotypes created upon recombination was not significantly increased.
Forensic timber identification: a case study of a CITES listed species, Gonystylus bancanus (Thymelaeaceae).

PubMed

Ng, Kevin Kit Siong; Lee, Soon Leong; Tnah, Lee Hong; Nurul-Farhanah, Zakaria; Ng, Chin Hong; Lee, Chai Ting; Tani, Naoki; Diway, Bibian; Lai, Pei Sing; Khoo, Eyen

2016-07-01

Illegal logging and smuggling of Gonystylus bancanus (Thymelaeaceae) poses a serious threat to this fragile valuable peat swamp timber species. Using G. bancanus as a case study, DNA markers were used to develop identification databases at the species, population and individual level. The species level database for Gonystylus comprised of an rDNA (ITS2) and two cpDNA (trnH-psbA and trnL) markers based on a 20 Gonystylus species database. When concatenated, taxonomic species recognition was achieved with a resolution of 90% (18 out of the 20 species). In addition, based on 17 natural populations of G. bancanus throughout West (Peninsular Malaysia) and East (Sabah and Sarawak) Malaysia, population and individual identification databases were developed using cpDNA and STR markers respectively. A haplotype distribution map for Malaysia was generated using six cpDNA markers, resulting in 12 unique multilocus haplotypes, from 24 informative intraspecific variable sites. These unique haplotypes suggest a clear genetic structuring of West and East regions. A simulation procedure based on the composition of the samples was used to test whether a suspected sample conformed to a given regional origin. Overall, the observed type I and II errors of the databases showed good concordance with the predicted 5% threshold which indicates that the databases were useful in revealing provenance and establishing conformity of samples from West and East Malaysia. Sixteen STRs were used to develop the DNA profiling databases for individual identification. Bayesian clustering analyses divided the 17 populations into two main genetic clusters, corresponding to the regions of West and East Malaysia. Population substructuring (K=2) was observed within each region. After removal of bias resulting from sampling effects and population subdivision, conservativeness tests showed that the West and East Malaysia databases were conservative. This suggests that both databases can be used independently for random match probability estimation within respective regions. The reliability of the databases was further determined by independent self-assignment tests based on the likelihood of each individual's multilocus genotype occurring in each identified population, genetic cluster and region with an average percentage of correctly assigned individuals of 54.80%, 99.60% and 100% respectively. Thus, after appropriate validation, the genetic identification databases developed for G. bancanus in this study could support forensic applications and help safeguard this valuable species into the future. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
16th IHIW: Global analysis of registry HLA haplotypes from 20 Million individuals: Report from the IHIW Registry Diversity Group

PubMed Central

Maiers, M; Gragert, L; Madbouly, A; Steiner, D; Marsh, S G E; Gourraud, P-A; Oudshoorn, M; Zanden, H; Schmidt, A H; Pingel, J; Hofmann, J; Müller, C; Eberhard, H-P

2013-01-01

This project has the goal to validate bioinformatics methods and tools for HLA haplotype frequency analysis specifically addressing unique issues of haematopoietic stem cell registry data sets. In addition to generating new methods and tools for the analysis of registry data sets, the intent is to produce a comprehensive analysis of HLA data from 20 million donors from the Bone Marrow Donors Worldwide (BMDW) database. This report summarizes the activity on this project as of the 16IHIW meeting in Liverpool. PMID:23280139
DR-GAS: a database of functional genetic variants and their phosphorylation states in human DNA repair systems.

PubMed

Sehgal, Manika; Singh, Tiratha Raj

2014-04-01

We present DR-GAS(1), a unique, consolidated and comprehensive DNA repair genetic association studies database of human DNA repair system. It presents information on repair genes, assorted mechanisms of DNA repair, linkage disequilibrium, haplotype blocks, nsSNPs, phosphorylation sites, associated diseases, and pathways involved in repair systems. DNA repair is an intricate process which plays an essential role in maintaining the integrity of the genome by eradicating the damaging effect of internal and external changes in the genome. Hence, it is crucial to extensively understand the intact process of DNA repair, genes involved, non-synonymous SNPs which perhaps affect the function, phosphorylated residues and other related genetic parameters. All the corresponding entries for DNA repair genes, such as proteins, OMIM IDs, literature references and pathways are cross-referenced to their respective primary databases. DNA repair genes and their associated parameters are either represented in tabular or in graphical form through images elucidated by computational and statistical analyses. It is believed that the database will assist molecular biologists, biotechnologists, therapeutic developers and other scientific community to encounter biologically meaningful information, and meticulous contribution of genetic level information towards treacherous diseases in human DNA repair systems. DR-GAS is freely available for academic and research purposes at: http://www.bioinfoindia.org/drgas. Copyright © 2014 Elsevier B.V. All rights reserved.
Huntingtin Haplotypes Provide Prioritized Target Panels for Allele-specific Silencing in Huntington Disease Patients of European Ancestry

PubMed Central

Kay, Chris; Collins, Jennifer A; Skotte, Niels H; Southwell, Amber L; Warby, Simon C; Caron, Nicholas S; Doty, Crystal N; Nguyen, Betty; Griguoli, Annamaria; Ross, Colin J; Squitieri, Ferdinando; Hayden, Michael R

2015-01-01

Huntington disease (HD) is a dominant neurodegenerative disorder caused by a CAG repeat expansion in the Huntingtin gene (HTT). Heterozygous polymorphisms in cis with the mutation allow for allele-specific suppression of the pathogenic HTT transcript as a therapeutic strategy. To prioritize target selection, precise heterozygosity estimates are needed across diverse HD patient populations. Here we present the first comprehensive investigation of all common target alleles across the HTT gene, using 738 reference haplotypes from the 1000 Genomes Project and 2364 haplotypes from HD patients and relatives in Canada, Sweden, France, and Italy. The most common HD haplotypes (A1, A2, and A3a) define mutually exclusive sets of polymorphisms for allele-specific therapy in the greatest number of patients. Across all four populations, a maximum of 80% are treatable using these three target haplotypes. We identify a novel deletion found exclusively on the A1 haplotype, enabling potent and selective silencing of mutant HTT in approximately 40% of the patients. Antisense oligonucleotides complementary to the deletion reduce mutant A1 HTT mRNA by 78% in patient cells while sparing wild-type HTT expression. By suppressing specific haplotypes on which expanded CAG occurs, we demonstrate a rational approach to the development of allele-specific therapy for a monogenic disorder. PMID:26201449
Association between cytochrome CYP17A1, CYP3A4, and CYP3A43 polymorphisms and prostate cancer risk and aggressiveness in a Korean study population

PubMed Central

Han, Jun Hyun; Lee, Yong Seong; Kim, Hae Jong; Lee, Shin Young; Myung, Soon Chul

2015-01-01

In this study, we evaluated genetic variants of the androgen metabolism genes CYP17A1, CYP3A4, and CYP3A43 to determine whether they play a role in the development of prostate cancer (PCa) in Korean men. The study population included 240 pathologically diagnosed cases of PCa and 223 age-matched controls. Among the 789 single-nucleotide polymorphism (SNP) database variants detected, 129 were reported in two Asian groups (Han Chinese and Japanese) in the HapMap database. Only 21 polymorphisms of CYP17A1, CYP3A4, and CYP3A43 were selected based on linkage disequilibrium in Asians (r2 = 1), locations (SNPs in exons were preferred), and amino acid changes and were assessed. In addition, we performed haplotype analysis for the 21 SNPs in CYP17A1, CYP3A4, and CYP3A43 genes. To determine the association between genotype and haplotype distributions of patients and controls, logistic analyses were carried out, controlling for age. Twelve sequence variants and five major haplotypes were identified in CYP17A1. Five sequence variants and two major haplotypes were identified in CYP3A4. Four sequence variants and four major haplotypes were observed in CYP3A43. CYP17A1 haplotype-2 (Ht-2) (odds ratio [OR], 1.51; 95% confidence interval [CI], 1.04–2.18) was associated with PCa susceptibility. CYP3A4 Ht-2 (OR: 1.87; 95% CI: 1.02–3.43) was associated with PCa metastatic potential according to tumor stage. rs17115149 (OR: 1.96; 95% CI: 1.04–3.68) and CYP17A1 Ht-4 (OR: 2.01; 95% CI: 1.07–4.11) showed a significant association with histologic aggressiveness according to Gleason score. Genetic variants of CYP17A1 and CYP3A4 may play a role in the development of PCa in Korean men. PMID:25337833
Association between cytochrome CYP17A1, CYP3A4, and CYP3A43 polymorphisms and prostate cancer risk and aggressiveness in a Korean study population.

PubMed

Han, Jun Hyun; Lee, Yong Seong; Kim, Hae Jong; Lee, Shin Young; Myung, Soon Chul

2015-01-01

In this study, we evaluated genetic variants of the androgen metabolism genes CYP17A1, CYP3A4, and CYP3A43 to determine whether they play a role in the development of prostate cancer (PCa) in Korean men. The study population included 240 pathologically diagnosed cases of PCa and 223 age-matched controls. Among the 789 single-nucleotide polymorphism (SNP) database variants detected, 129 were reported in two Asian groups (Han Chinese and Japanese) in the HapMap database. Only 21 polymorphisms of CYP17A1, CYP3A4, and CYP3A43 were selected based on linkage disequilibrium in Asians (r2 = 1), locations (SNPs in exons were preferred), and amino acid changes and were assessed. In addition, we performed haplotype analysis for the 21 SNPs in CYP17A1, CYP3A4, and CYP3A43 genes. To determine the association between genotype and haplotype distributions of patients and controls, logistic analyses were carried out, controlling for age. Twelve sequence variants and five major haplotypes were identified in CYP17A1. Five sequence variants and two major haplotypes were identified in CYP3A4. Four sequence variants and four major haplotypes were observed in CYP3A43. CYP17A1 haplotype-2 (Ht-2) (odds ratio [OR], 1.51; 95% confidence interval [CI], 1.04-2.18) was associated with PCa susceptibility. CYP3A4 Ht-2 (OR: 1.87; 95% CI: 1.02-3.43) was associated with PCa metastatic potential according to tumor stage. rs17115149 (OR: 1.96; 95% CI: 1.04-3.68) and CYP17A1 Ht-4 (OR: 2.01; 95% CI: 1.07-4.11) showed a significant association with histologic aggressiveness according to Gleason score. Genetic variants of CYP17A1 and CYP3A4 may play a role in the development of PCa in Korean men.
Molecular phylogenetic identification of Fasciola flukes in Nepal.

PubMed

Shoriki, Takuya; Ichikawa-Seki, Madoka; Devkota, Bhuminand; Rana, Hari B; Devkota, Shiva P; Humagain, Sudeep K; Itagaki, Tadashi

2014-12-01

Eighty-one Fasciola flukes collected from 8 districts in Nepal were analyzed for their species identification on the basis of their spermatogenic status and nuclear ribosomal internal transcribed spacer 1 (ITS1) and for their phylogenetic relation with Fasciola flukes from other Asian countries on the basis of the mitochondrial NADH dehydrogenase subunit 1 (nad1) gene. Sixty-one flukes (75.3%) were aspermic Fasciola sp., and 20 flukes (24.7%) were identified as Fasciola gigantica. All of the aspermic flukes displayed the Fh/Fg type in ITS1, which was predominant in aspermic Fasciola sp. from China, and most (60 flukes) displayed the Fsp-ND1-N1 haplotype in the nad1, which had an identical nucleotide sequence to the major haplotype (Fg-C2) of the aspermic flukes from China. These results suggest that aspermic Fasciola sp. was introduced into Nepal from China. Furthermore, the results of the diversity indices, neutrality indices, and median-joining network analysis with reference haplotypes from Asian countries suggest that aspermic Fasciola sp. rapidly expanded its distribution. In contrasts, F. gigantica displayed 10 nad1 haplotypes, which showed higher population diversity indices than the haplotypes of aspermic flukes, indicating that the F. gigantica population was clearly distributed in Nepal earlier than the aspermic Fasciola population. Although the F. gigantica haplotypes from Nepal formed a star-like phylogeny consisting of a main founder haplotype (Fg-ND1-N1), together with some F. gigantica haplotypes from Myanmar and Thailand, the Nepal population differed genetically from F. gigantica populations of neighboring countries as each country had distinct founder haplotype(s). Copyright © 2014 Elsevier Inc. All rights reserved.
Vitamin K epoxide reductase complex subunit 1 (Vkorc1) haplotype diversity in mouse priority strains

PubMed Central

Song, Ying; Vera, Nicole; Kohn, Michael H

2008-01-01

Background Polymorphisms in the vitamin K-epoxide reductase complex subunit 1 gene, Vkorc1, could affect blood coagulation and other vitamin K-dependent proteins, such as osteocalcin (bone Gla protein, BGP). Here we sequenced the Vkorc1 gene in 40 mouse priority strains. We analyzed Vkorc1 haplotypes with respect to prothrombin time (PT) and bone mineral density and composition (BMD and BMC); phenotypes expected to be vitamin K-dependent and represented by data in the Mouse Phenome Database (MPD). Findings In the commonly used laboratory strains of Mus musculus domesticus we identified only four haplotypes differing in the intron or 5' region sequence of the Vkorc1. Six haplotypes differing by coding and non-coding polymorphisms were identified in the other subspecies of Mus. We detected no significant association of Vkorc1 haplotypes with PT, BMD and BMC within each subspecies of Mus. Vkorc1 haplotype sequences divergence between subspecies was associated with PT, BMD and BMC. Conclusion Phenotypic variation in PT, BMD and BMC within subspecies of Mus, while substantial, appears to be dominated by genetic variation in genes other than the Vkorc1. This was particularly evident for M. m. domesticus, where a single haplotype was observed in conjunction with virtually the entire range of PT, BMD and BMC values of all 5 subspecies of Mus included in this study. Differences in these phenotypes between subspecies also should not be attributed to Vkorc1 variants, but should be viewed as a result of genome wide genetic divergence. PMID:19046458
A spatial haplotype copying model with applications to genotype imputation.

PubMed

Yang, Wen-Yun; Hormozdiari, Farhad; Eskin, Eleazar; Pasaniuc, Bogdan

2015-05-01

Ever since its introduction, the haplotype copy model has proven to be one of the most successful approaches for modeling genetic variation in human populations, with applications ranging from ancestry inference to genotype phasing and imputation. Motivated by coalescent theory, this approach assumes that any chromosome (haplotype) can be modeled as a mosaic of segments copied from a set of chromosomes sampled from the same population. At the core of the model is the assumption that any chromosome from the sample is equally likely to contribute a priori to the copying process. Motivated by recent works that model genetic variation in a geographic continuum, we propose a new spatial-aware haplotype copy model that jointly models geography and the haplotype copying process. We extend hidden Markov models of haplotype diversity such that at any given location, haplotypes that are closest in the genetic-geographic continuum map are a priori more likely to contribute to the copying process than distant ones. Through simulations starting from the 1000 Genomes data, we show that our model achieves superior accuracy in genotype imputation over the standard spatial-unaware haplotype copy model. In addition, we show the utility of our model in selecting a small personalized reference panel for imputation that leads to both improved accuracy as well as to a lower computational runtime than the standard approach. Finally, we show our proposed model can be used to localize individuals on the genetic-geographical map on the basis of their genotype data.
[The haplomatch program for comparing Y-chromosome STR-haplotypes and its application to the analysis of the origin of Don Cossacks].

PubMed

Chukhryaeva, M I; Ivanov, I O; Frolova, S A; Koshel, S M; Utevska, O M; Skhalyakho, R A; Agdzhoyan, A T; Bogunov, Yu V; Balanovska, E V; Balanovsky, O P

2016-05-01

STR haplotypes of the Y chromosome are widely used as effective genetic markers in studies of human populations and in forensic DNA analysis. The task often arises to compare the spectrum of haplotypes in individuals or entire populations. Performing this task manually is too laborious and thus unrealistic. We propose an algorithm for counting similarity between STR haplotypes. This algorithm is suitable for massive analyses of samples. It is implemented in the computer program Haplomatch, which makes it possible to find haplotypes that differ from the target haplotype by 0, 1, 2, 3, or more mutational steps. The program may operate in two modes: comparison of individuals and comparison of populations. Flexibility of the program (the possibility of using any external database), its usability (MS Excel spreadsheets are used), and the capability of being applied to other chromosomes and other species could make this software a new useful tool in population genetics and forensic and genealogical studies. The Haplomatch software is freely available on our website www.genofond.ru. The program is applied to studying the gene pool of Cossacks. Experimental analysis of Y-chromosomal diversity in a representative set (N = 131) of Upper Don Cossacks is performed. Analysis of the STR haplotypes detects genetic proximity of Cossacks to East Slavic populations (in particular, to Southern and Central Russians, as well as to Ukrainians), which confirms the hypothesis of the origin of the Cossacks mainly due to immigration from Russia and Ukraine. Also, a small genetic influence of Turkicspeaking Nogais is found, probably caused by their occurrence in the Don Voisko as part of the Tatar layer. No similarities between haplotype spectra of Cossacks and Caucasus populations are found. This case study demonstrates the effectiveness of the Haplomatch software in analyzing large sets of STR haplotypes.
The single-nucleotide polymorphisms in CHD5 affect the prognosis of patients with hepatocellular carcinoma

PubMed Central

Zhu, Xiao; Kong, Qingming; Xie, Liwei; Chen, Zhihong; Li, Hongmei; Zhu, Zhu; Huang, Yongmei; Lan, Feifei; Luo, Haiqing; Zhan, Jingting; Ding, Hongrong; Lei, Jinli; Xiao, Qin; Fu, Weiming; Fan, Wenguo; Zhang, Jinfang; Luo, Hui

2018-01-01

Previous studies showed that the low expressions of chromodomain-helicase-DNA-binding protein 5 (CHD5) were intensively associated with deteriorative biologic and clinical characteristics as well as outcomes in many tumors. The aim of this study is to determine whether CHD5 single nucleotide polymorphisms (SNPs) contribute to the prognosis of hepatocellular carcima (HCC). The SNPs were selected according to their linkage disequilibrium (LD) in the targeted next-generation sequencing (NGS) and then genotyped with TaqMan probers. We revealed a rare haplotype AG in CHD5 (SNPs: rs12564469-rs9434711) was markedly associated with HCC prognosis. The univariate and multivariate regression analyses revealed the patients with worse overall survival time were those with tumor metastasis and haplotype AG, as well as cirrhosis, poor differentiation and IV-TNM stage. Based on the available public databases, we discovered the significant association between haplotype AG and CHD5 mRNA expressions only existed in Chinese. These data proposed that the potentially genetic haplotype might functionally contribute to HCC prognosis and CHD5 mRNA expressions. PMID:29568352
Mitochondrial DNA analyses reveal low genetic diversity in Culex quinquefasciatus from residential areas in Malaysia.

PubMed

Low, V L; Lim, P E; Chen, C D; Lim, Y A L; Tan, T K; Norma-Rashid, Y; Lee, H L; Sofian-Azirun, M

2014-06-01

The present study explored the intraspecific genetic diversity, dispersal patterns and phylogeographic relationships of Culex quinquefasciatus Say (Diptera: Culicidae) in Malaysia using reference data available in GenBank in order to reveal this species' phylogenetic relationships. A statistical parsimony network of 70 taxa aligned as 624 characters of the cytochrome c oxidase subunit I (COI) gene and 685 characters of the cytochrome c oxidase subunit II (COII) gene revealed three haplotypes (A1-A3) and four haplotypes (B1-B4), respectively. The concatenated sequences of both COI and COII genes with a total of 1309 characters revealed seven haplotypes (AB1-AB7). Analysis using tcs indicated that haplotype AB1 was the common ancestor and the most widespread haplotype in Malaysia. The genetic distance based on concatenated sequences of both COI and COII genes ranged from 0.00076 to 0.00229. Sequence alignment of Cx. quinquefasciatus from Malaysia and other countries revealed four haplotypes (AA1-AA4) by the COI gene and nine haplotypes (BB1-BB9) by the COII gene. Phylogenetic analyses demonstrated that Malaysian Cx. quinquefasciatus share the same genetic lineage as East African and Asian Cx. quinquefasciatus. This study has inferred the genetic lineages, dispersal patterns and hypothetical ancestral genotypes of Cx. quinquefasciatus. © 2013 The Royal Entomological Society.
Recovery of Native Genetic Background in Admixed Populations Using Haplotypes, Phenotypes, and Pedigree Information – Using Cika Cattle as a Case Breed

PubMed Central

Simčič, Mojca; Smetko, Anamarija; Sölkner, Johann; Seichter, Doris; Gorjanc, Gregor; Kompan, Dragomir; Medugorac, Ivica

2015-01-01

The aim of this study was to obtain unbiased estimates of the diversity parameters, the population history, and the degree of admixture in Cika cattle which represents the local admixed breeds at risk of extinction undergoing challenging conservation programs. Genetic analyses were performed on the genome-wide Single Nucleotide Polymorphism (SNP) Illumina Bovine SNP50 array data of 76 Cika animals and 531 animals from 14 reference populations. To obtain unbiased estimates we used short haplotypes spanning four markers instead of single SNPs to avoid an ascertainment bias of the BovineSNP50 array. Genome-wide haplotypes combined with partial pedigree and type trait classification show the potential to improve identification of purebred animals with a low degree of admixture. Phylogenetic analyses demonstrated unique genetic identity of Cika animals. Genetic distance matrix presented by rooted Neighbour-Net suggested long and broad phylogenetic connection between Cika and Pinzgauer. Unsupervised clustering performed by the admixture analysis and two-dimensional presentation of the genetic distances between individuals also suggest Cika is a distinct breed despite being similar in appearance to Pinzgauer. Animals identified as the most purebred could be used as a nucleus for a recovery of the native genetic background in the current admixed population. The results show that local well-adapted strains, which have never been intensively managed and differentiated into specific breeds, exhibit large haplotype diversity. They suggest a conservation and recovery approach that does not rely exclusively on the search for the original native genetic background but rather on the identification and removal of common introgressed haplotypes would be more powerful. Successful implementation of such an approach should be based on combining phenotype, pedigree, and genome-wide haplotype data of the breed of interest and a spectrum of reference breeds which potentially have had direct or indirect historical contribution to the genetic makeup of the breed of interest. PMID:25923207
Expanded Croatian 12 X-STR loci database with an overview of anomalous profiles.

PubMed

Mršić, Gordan; Ozretić, Petar; Crnjac, Josip; Merkaš, Siniša; Sukser, Viktorija; Račić, Ivana; Rožić, Sara; Barbarić, Lucija; Popović, Maja; Korolija, Marina

2018-05-01

In order to implement X-chromosome short tandem repeat (X-STR) typing into routine forensic practice, reference database of a given population should be established. Therefore we extended already published data with additional 397 blood samples from unrelated Croatian citizens, and analyzed the total of 995 samples (549 male and 446 female) typed by Investigator ® Argus X-12 Kit. To test genetic homogeneity of consecutively processed five historic-cultural regions covering the entire national territory, we calculated pairwise Fst genetic distances between regions based on allele and full haplotype frequencies. Since the comparison did not yield any statistically significant difference, we integrated STR profile information from all regions and used the whole data set to calculate forensic parameters. The most informative marker is DXS10135 (polymorphism information content (PIC = 0.929) and the most informative linkage group (LG) is LG1 (PIC = 0.996). We confirmed linkage disequilibrium (LD) for seven marker pairs belonging to LG2, LG3 and LG4. By including LD information, we calculated cumulative power of discrimination that amounted to 0.999999999997 in females and 0.999999005 in males. We also compared Croatia with 13 European populations based on haplotype frequencies and detected no statistically significant Fst values after Bonferroni correction in any LG. Multi-dimensional scaling plot revealed tight grouping of four Croatian regions amongst populations of southern, central and northern Europe, with the exception of northern Croatia. In this study we gave the first extensive overview of aberrant profiles encountered during Investigator ® Argus X-12 typing. We found ten profiles consistent with single locus duplication followed by tetranucleotide tract length polymorphism. Locus DXS10079 is by far the most frequently affected one, presumably mutated in eight samples. We also found four profiles consistent with X-chromosome aneuploidy (three profiles with XXX pattern and one profile with XXY pattern). In conclusion, we established integral forensic Croatian X-chromosome database, proved forensic pertinence of Investigator ® Argus X-12 Kit for the entire Croatian population and identified locus DXS10079 as a potential duplication hotspot. Copyright © 2018 Elsevier B.V. All rights reserved.
Geographic origin and individual assignment of Shorea platyclados (Dipterocarpaceae) for forensic identification

PubMed Central

Diway, Bibian; Khoo, Eyen

2017-01-01

The development of timber tracking methods based on genetic markers can provide scientific evidence to verify the origin of timber products and fulfill the growing requirement for sustainable forestry practices. In this study, the origin of an important Dark Red Meranti wood, Shorea platyclados, was studied by using the combination of seven chloroplast DNA and 15 short tandem repeats (STRs) markers. A total of 27 natural populations of S. platyclados were sampled throughout Malaysia to establish population level and individual level identification databases. A haplotype map was generated from chloroplast DNA sequencing for population identification, resulting in 29 multilocus haplotypes, based on 39 informative intraspecific variable sites. Subsequently, a DNA profiling database was developed from 15 STRs allowing for individual identification in Malaysia. Cluster analysis divided the 27 populations into two genetic clusters, corresponding to the region of Eastern and Western Malaysia. The conservativeness tests showed that the Malaysia database is conservative after removal of bias from population subdivision and sampling effects. Independent self-assignment tests correctly assigned individuals to the database in an overall 60.60−94.95% of cases for identified populations, and in 98.99−99.23% of cases for identified regions. Both the chloroplast DNA database and the STRs appear to be useful for tracking timber originating in Malaysia. Hence, this DNA-based method could serve as an effective addition tool to the existing forensic timber identification system for ensuring the sustainably management of this species into the future. PMID:28430826
Evidence of a Native Northwest Atlantic COI Haplotype Clade in the Cryptogenic Colonial Ascidian Botryllus schlosseri.

PubMed

Yund, Philip O; Collins, Catherine; Johnson, Sheri L

2015-06-01

The colonial ascidian Botryllus schlosseri should be considered cryptogenic (i.e., not definitively classified as either native or introduced) in the Northwest Atlantic. Although all the evidence is quite circumstantial, over the last 15 years most research groups have accepted the scenario of human-mediated dispersal and classified B. schlosseri as introduced; others have continued to consider it native or cryptogenic. We address the invasion status of this species by adding 174 sequences to the growing worldwide database for the mitochondrial gene cytochrome c oxidase subunit I (COI) and analyzing 1077 sequences to compare genetic diversity of one clade of haplotypes in the Northwest Atlantic with two hypothesized source regions (the Northeast Atlantic and Mediterranean). Our results lead us to reject the prevailing view of the directionality of transport across the Atlantic. We argue that the genetic diversity patterns at COI are far more consistent with the existence of at least one haplotype clade in the Northwest Atlantic (and possibly a second) that substantially pre-dates human colonization from Europe, with this native North American clade subsequently introduced to three sites in Northeast Atlantic and Mediterranean waters. However, we agree with past researchers that some sites in the Northwest Atlantic have more recently been invaded by alien haplotypes, so that some populations are currently composed of a mixture of native and invader haplotypes. © 2015 Marine Biological Laboratory.
Diversidad haplotípica en el manatí Trichechus manatus en Cuba: resultados preliminares

USGS Publications Warehouse

Hernandez-Martinez, Damir; Alvarez-Aleman, Anmari; Bonde, Robert K.; Powell, James A.; Garcia-Machado, Erik

2013-01-01

The aim of this analysis was to obtain information regarding the mtDNA haplotype composition of the manatee (T. manatus) occupying the Cuban archipelago. A fragment of 410 bp of the non-coding region was analyzed for 12 individual manatees from Cuba and one from Florida, USA. Only two haplotypes were identified. Haplotype A1, found exclusively in Florida (including in the sample analyzed here) but also found in Mexico, the Dominican Republic and Puerto Rico, was the most frequent haplotype (11 of the 12 samples from Cuba) and widely distributed. The second haplotype A3, previously referred to as endemic from Belize, was identified from an individual stranded in Isabela de Sagua, north of Cuba. These preliminary results provide information about three major aspects of manatee biology: (1) the mtDNA genetic diversity of T. manatus in Cuba seems low as compared to other regions of the Caribbean; (2) the Cuban population likely belongs to the group comprising Florida and the portions of the Greater Antilles; and (3) the territories of Belize and Cuba have exchanged individuals at present or in a relatively recent past.
Genomic dissection of a ‘Fuji’ apple cultivar: re-sequencing, SNP marker development, definition of haplotypes, and QTL detection

PubMed Central

Kunihisa, Miyuki; Moriya, Shigeki; Abe, Kazuyuki; Okada, Kazuma; Haji, Takashi; Hayashi, Takeshi; Kawahara, Yoshihiro; Itoh, Ryutaro; Itoh, Takeshi; Katayose, Yuichi; Kanamori, Hiroyuki; Matsumoto, Toshimi; Mori, Satomi; Sasaki, Harumi; Matsumoto, Takashi; Nishitani, Chikako; Terakami, Shingo; Yamamoto, Toshiya

2016-01-01

‘Fuji’ is one of the most popular and highly-produced apple cultivars worldwide, and has been frequently used in breeding programs. The development of genotypic markers for the preferable phenotypes of ‘Fuji’ is required. Here, we aimed to define the haplotypes of ‘Fuji’ and find associations between haplotypes and phenotypes of five traits (harvest day, fruit weight, acidity, degree of watercore, and flesh mealiness) by using 115 accessions related to ‘Fuji’. Through the re-sequencing of ‘Fuji’ genome, total of 2,820,759 variants, including single nucleotide polymorphisms (SNPs) and insertions or deletions (indels) were detected between ‘Fuji’ and ‘Golden Delicious’ reference genome. We selected mapping-validated 1,014 SNPs, most of which were heterozygous in ‘Fuji’ and capable of distinguishing alleles inherited from the parents of ‘Fuji’ (i.e., ‘Ralls Janet’ and ‘Delicious’). We used these SNPs to define the haplotypes of ‘Fuji’ and trace their inheritance in relatives, which were shown to have an average of 27% of ‘Fuji’ genome. Analysis of variance (ANOVA) based on ‘Fuji’ haplotypes identified one quantitative trait loci (QTL) each for harvest time, acidity, degree of watercore, and mealiness. A haplotype from ‘Delicious’ chr14 was considered to dominantly cause watercore, and one from ‘Ralls Janet’ chr1 was related to low-mealiness. PMID:27795675
Construction of the third-generation Zea mays haplotype map.

PubMed

Bukowski, Robert; Guo, Xiaosen; Lu, Yanli; Zou, Cheng; He, Bing; Rong, Zhengqin; Wang, Bo; Xu, Dawen; Yang, Bicheng; Xie, Chuanxiao; Fan, Longjiang; Gao, Shibin; Xu, Xun; Zhang, Gengyun; Li, Yingrui; Jiao, Yinping; Doebley, John F; Ross-Ibarra, Jeffrey; Lorant, Anne; Buffalo, Vince; Romay, M Cinta; Buckler, Edward S; Ware, Doreen; Lai, Jinsheng; Sun, Qi; Xu, Yunbi

2018-04-01

Characterization of genetic variations in maize has been challenging, mainly due to deterioration of collinearity between individual genomes in the species. An international consortium of maize research groups combined resources to develop the maize haplotype version 3 (HapMap 3), built from whole-genome sequencing data from 1218 maize lines, covering predomestication and domesticated Zea mays varieties across the world. A new computational pipeline was set up to process more than 12 trillion bp of sequencing data, and a set of population genetics filters was applied to identify more than 83 million variant sites. We identified polymorphisms in regions where collinearity is largely preserved in the maize species. However, the fact that the B73 genome used as the reference only represents a fraction of all haplotypes is still an important limiting factor.

Towards a comprehensive barcode library for arctic life - Ephemeroptera, Plecoptera, and Trichoptera of Churchill, Manitoba, Canada

PubMed Central

2009-01-01

Background This study reports progress in assembling a DNA barcode reference library for Ephemeroptera, Plecoptera, and Trichoptera ("EPTs") from a Canadian subarctic site, which is the focus of a comprehensive biodiversity inventory using DNA barcoding. These three groups of aquatic insects exhibit a moderate level of species diversity, making them ideal for testing the feasibility of DNA barcoding for routine biotic surveys. We explore the correlation between the morphological species delineations, DNA barcode-based haplotype clusters delimited by a sequence threshold (2%), and a threshold-free approach to biodiversity quantification--phylogenetic diversity. Results A DNA barcode reference library is built for 112 EPT species for the focal region, consisting of 2277 COI sequences. Close correspondence was found between EPT morphospecies and haplotype clusters as designated using a standard threshold value. Similarly, the shapes of taxon accumulation curves based upon haplotype clusters were very similar to those generated using phylogenetic diversity accumulation curves, but were much more computationally efficient. Conclusion The results of this study will facilitate other lines of research on northern EPTs and also bode well for rapidly conducting initial biodiversity assessments in unknown EPT faunas. PMID:20003245
Detection of haplotypes associated with prenatal death in dairy cattle and identification of deleterious mutations in GART, SHBG and SLC37A2.

PubMed

Fritz, Sébastien; Capitan, Aurelien; Djari, Anis; Rodriguez, Sabrina C; Barbat, Anne; Baur, Aurélia; Grohs, Cécile; Weiss, Bernard; Boussaha, Mekki; Esquerré, Diane; Klopp, Christophe; Rocha, Dominique; Boichard, Didier

2013-01-01

The regular decrease of female fertility over time is a major concern in modern dairy cattle industry. Only half of this decrease is explained by indirect response to selection on milk production, suggesting the existence of other factors such as embryonic lethal genetic defects. Genomic regions harboring recessive deleterious mutations were detected in three dairy cattle breeds by identifying frequent haplotypes (>1%) showing a deficit in homozygotes among Illumina Bovine 50k Beadchip haplotyping data from the French genomic selection database (47,878 Holstein, 16,833 Montbéliarde, and 11,466 Normande animals). Thirty-four candidate haplotypes (p<10(-4)) including previously reported regions associated with Brachyspina, CVM, HH1, and HH3 in Holstein breed were identified. Haplotype length varied from 1 to 4.8 Mb and frequencies from 1.7 up to 9%. A significant negative effect on calving rate, consistent in heifers and in lactating cows, was observed for 9 of these haplotypes in matings between carrier bulls and daughters of carrier sires, confirming their association with embryonic lethal mutations. Eight regions were further investigated using whole genome sequencing data from heterozygous bull carriers and control animals (45 animals in total). Six strong candidate causative mutations including polymorphisms previously reported in FANCI (Brachyspina), SLC35A3 (CVM), APAF1 (HH1) and three novel mutations with very damaging effect on the protein structure, according to SIFT and Polyphen-2, were detected in GART, SHBG and SLC37A2 genes. In conclusion, this study reveals a yet hidden consequence of the important inbreeding rate observed in intensively selected and specialized cattle breeds. Counter-selection of these mutations and management of matings will have positive consequences on female fertility in dairy cattle.
SAM: String-based sequence search algorithm for mitochondrial DNA database queries

PubMed Central

Röck, Alexander; Irwin, Jodi; Dür, Arne; Parsons, Thomas; Parson, Walther

2011-01-01

The analysis of the haploid mitochondrial (mt) genome has numerous applications in forensic and population genetics, as well as in disease studies. Although mtDNA haplotypes are usually determined by sequencing, they are rarely reported as a nucleotide string. Traditionally they are presented in a difference-coded position-based format relative to the corrected version of the first sequenced mtDNA. This convention requires recommendations for standardized sequence alignment that is known to vary between scientific disciplines, even between laboratories. As a consequence, database searches that are vital for the interpretation of mtDNA data can suffer from biased results when query and database haplotypes are annotated differently. In the forensic context that would usually lead to underestimation of the absolute and relative frequencies. To address this issue we introduce SAM, a string-based search algorithm that converts query and database sequences to position-free nucleotide strings and thus eliminates the possibility that identical sequences will be missed in a database query. The mere application of a BLAST algorithm would not be a sufficient remedy as it uses a heuristic approach and does not address properties specific to mtDNA, such as phylogenetically stable but also rapidly evolving insertion and deletion events. The software presented here provides additional flexibility to incorporate phylogenetic data, site-specific mutation rates, and other biologically relevant information that would refine the interpretation of mitochondrial DNA data. The manuscript is accompanied by freeware and example data sets that can be used to evaluate the new software (http://stringvalidation.org). PMID:21056022
High-resolution HLA haplotype frequencies of stem cell donors in Germany with foreign parentage: how can they be used to improve unrelated donor searches?

PubMed

Pingel, Julia; Solloch, Ute V; Hofmann, Jan A; Lange, Vinzenz; Ehninger, Gerhard; Schmidt, Alexander H

2013-03-01

In hematopoietic stem cell transplantation, human leukocyte antigens (HLA), usually HLA loci A, B, C, DRB1 and DQB1, are required to check histocompatibility between a potential donor and the recipient suffering from a malignant or non-malignant blood disease. As databases of potential unrelated donors are very heterogeneous with respect to typing resolution and number of typed loci, donor registries make use of haplotype frequency-based algorithms to provide matching probabilities for each potentially matching recipient/donor pair. However, it is well known that HLA allele and haplotype frequencies differ significantly between populations. We estimated high-resolution HLA-A, -B, -C, -DRB1 haplotype and allele frequencies of donors within DKMS German Bone Marrow Donor Center with parentage from 17 different countries: Turkey, Poland, Italy, Russian Federation, Croatia, Greece, Austria, Kazakhstan, France, The Netherlands, Republic of China, Romania, Portugal, USA, Spain, United Kingdom and Bosnia and Herzegovina. 5-locus haplotypes including HLA-DQB1 are presented for Turkey, Poland, Italy and Russian Federation. We calculated linkage disequilibria for each sample. Genetic distances between included countries could be shown to reflect geography. We further demonstrate how genetic differences between populations are reflected in matching probabilities of recipient/donor pairs and how they influence the search for unrelated donors as well as strategic donor center typings. Copyright © 2012 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.
Population and forensic genetic analyses of mitochondrial DNA control region variation from six major provinces in the Korean population.

PubMed

Hong, Seung Beom; Kim, Ki Cheol; Kim, Wook

2015-07-01

We generated complete mitochondrial DNA (mtDNA) control region sequences from 704 unrelated individuals residing in six major provinces in Korea. In addition to our earlier survey of the distribution of mtDNA haplogroup variation, a total of 560 different haplotypes characterized by 271 polymorphic sites were identified, of which 473 haplotypes were unique. The gene diversity and random match probability were 0.9989 and 0.0025, respectively. According to the pairwise comparison of the 704 control region sequences, the mean number of pairwise differences between individuals was 13.47±6.06. Based on the result of mtDNA control region sequences, pairwise FST genetic distances revealed genetic homogeneity of the Korean provinces on a peninsular level, except in samples from Jeju Island. This result indicates there may be a need to formulate a local mtDNA database for Jeju Island, to avoid bias in forensic parameter estimates caused by genetic heterogeneity of the population. Thus, the present data may help not only in personal identification but also in determining maternal lineages to provide an expanded and reliable Korean mtDNA database. These data will be available on the EMPOP database via accession number EMP00661. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
High Frequency of Haplotype HLA-DQ7 in Celiac Disease Patients from South Italy: Retrospective Evaluation of 5,535 Subjects at Risk of Celiac Disease

PubMed Central

Tinto, Nadia; Cola, Arturo; Piscopo, Chiara; Capuano, Marina; Galatola, Martina; Greco, Luigi; Sacchetti, Lucia

2015-01-01

Background Celiac disease (CD) has a strong genetic component mainly due to HLA DQ2/DQ8 encoding genes. However, a minority of CD patients are DQ2/DQ8-negative. To address this issue, we retrospectively characterized HLA haplotypes in 5,535 subjects at risk of CD (either relatives of CD patients or subjects with CD-like symptoms) referred to our center during a 10-year period. Methods We identified loci DQA1/DQB1/DRB1 by sequence-specific oligonucleotide-PCR and sequence-specific primer-PCR; anti-transglutaminase IgA/IgG and anti-endomysium IgA by ELISA and indirect immunofluorescence, respectively. Results We diagnosed CD in 666/5,535 individuals, 4.2% of whom were DQ2/DQ8-negative. Interestingly, DQ7 was one of the most abundant haplotypes in all CD patients and significantly more frequent in DQ2/DQ8-negative (38%) than in DQ2/DQ8-positive CD patients (24%) (p<0.05). Conclusion Our data lend support to the concept that DQ7 represents an additive or independent CD risk haplotype with respect to DQ2/DQ8 haplotypes but this finding should be verified in other large CD populations. PMID:26398634
Genetics Home Reference: primary sclerosing cholangitis

MedlinePlus

... with primary sclerosing cholangitis (PSC) in a southern European population. Dig Liver Dis. 2003 Aug;35(8): ... haplotypes in primary sclerosing cholangitis patients from five European populations. Tissue Antigens. 1999 May;53(5):459- ...
Construction of the third-generation Zea mays haplotype map

PubMed Central

Bukowski, Robert; Guo, Xiaosen; Lu, Yanli; Zou, Cheng; He, Bing; Rong, Zhengqin; Wang, Bo; Xu, Dawen; Yang, Bicheng; Xie, Chuanxiao; Fan, Longjiang; Gao, Shibin; Xu, Xun; Zhang, Gengyun; Li, Yingrui; Jiao, Yinping; Doebley, John F; Ross-Ibarra, Jeffrey; Lorant, Anne; Buffalo, Vince; Romay, M Cinta; Buckler, Edward S; Ware, Doreen; Lai, Jinsheng; Sun, Qi

2017-01-01

Abstract Background Characterization of genetic variations in maize has been challenging, mainly due to deterioration of collinearity between individual genomes in the species. An international consortium of maize research groups combined resources to develop the maize haplotype version 3 (HapMap 3), built from whole-genome sequencing data from 1218 maize lines, covering predomestication and domesticated Zea mays varieties across the world. Results A new computational pipeline was set up to process more than 12 trillion bp of sequencing data, and a set of population genetics filters was applied to identify more than 83 million variant sites. Conclusions We identified polymorphisms in regions where collinearity is largely preserved in the maize species. However, the fact that the B73 genome used as the reference only represents a fraction of all haplotypes is still an important limiting factor. PMID:29300887
Mitochondrial DNA and trade data support multiple origins of Helicoverpa armigera (Lepidoptera, Noctuidae) in Brazil

PubMed Central

Tay, Wee Tek; Walsh, Thomas K.; Downes, Sharon; Anderson, Craig; Jermiin, Lars S.; Wong, Thomas K. F.; Piper, Melissa C.; Chang, Ester Silva; Macedo, Isabella Barony; Czepak, Cecilia; Behere, Gajanan T.; Silvie, Pierre; Soria, Miguel F.; Frayssinet, Marie; Gordon, Karl H. J.

2017-01-01

The Old World bollworm Helicoverpa armigera is now established in Brazil but efforts to identify incursion origin(s) and pathway(s) have met with limited success due to the patchiness of available data. Using international agricultural/horticultural commodity trade data and mitochondrial DNA (mtDNA) cytochrome oxidase I (COI) and cytochrome b (Cyt b) gene markers, we inferred the origins and incursion pathways into Brazil. We detected 20 mtDNA haplotypes from six Brazilian states, eight of which were new to our 97 global COI-Cyt b haplotype database. Direct sequence matches indicated five Brazilian haplotypes had Asian, African, and European origins. We identified 45 parsimoniously informative sites and multiple substitutions per site within the concatenated (945 bp) nucleotide dataset, implying that probabilistic phylogenetic analysis methods are needed. High diversity and signatures of uniquely shared haplotypes with diverse localities combined with the trade data suggested multiple incursions and introduction origins in Brazil. Increasing agricultural/horticultural trade activities between the Old and New Worlds represents a significant biosecurity risk factor. Identifying pest origins will enable resistance profiling that reflects countries of origin to be included when developing a resistance management strategy, while identifying incursion pathways will improve biosecurity protocols and risk analysis at biosecurity hotspots including national ports. PMID:28350004
Mitochondrial DNA and trade data support multiple origins of Helicoverpa armigera (Lepidoptera, Noctuidae) in Brazil.

PubMed

Tay, Wee Tek; Walsh, Thomas K; Downes, Sharon; Anderson, Craig; Jermiin, Lars S; Wong, Thomas K F; Piper, Melissa C; Chang, Ester Silva; Macedo, Isabella Barony; Czepak, Cecilia; Behere, Gajanan T; Silvie, Pierre; Soria, Miguel F; Frayssinet, Marie; Gordon, Karl H J

2017-03-28

The Old World bollworm Helicoverpa armigera is now established in Brazil but efforts to identify incursion origin(s) and pathway(s) have met with limited success due to the patchiness of available data. Using international agricultural/horticultural commodity trade data and mitochondrial DNA (mtDNA) cytochrome oxidase I (COI) and cytochrome b (Cyt b) gene markers, we inferred the origins and incursion pathways into Brazil. We detected 20 mtDNA haplotypes from six Brazilian states, eight of which were new to our 97 global COI-Cyt b haplotype database. Direct sequence matches indicated five Brazilian haplotypes had Asian, African, and European origins. We identified 45 parsimoniously informative sites and multiple substitutions per site within the concatenated (945 bp) nucleotide dataset, implying that probabilistic phylogenetic analysis methods are needed. High diversity and signatures of uniquely shared haplotypes with diverse localities combined with the trade data suggested multiple incursions and introduction origins in Brazil. Increasing agricultural/horticultural trade activities between the Old and New Worlds represents a significant biosecurity risk factor. Identifying pest origins will enable resistance profiling that reflects countries of origin to be included when developing a resistance management strategy, while identifying incursion pathways will improve biosecurity protocols and risk analysis at biosecurity hotspots including national ports.
Mitochondrial DNA and trade data support multiple origins of Helicoverpa armigera (Lepidoptera, Noctuidae) in Brazil

NASA Astrophysics Data System (ADS)

Tay, Wee Tek; Walsh, Thomas K.; Downes, Sharon; Anderson, Craig; Jermiin, Lars S.; Wong, Thomas K. F.; Piper, Melissa C.; Chang, Ester Silva; Macedo, Isabella Barony; Czepak, Cecilia; Behere, Gajanan T.; Silvie, Pierre; Soria, Miguel F.; Frayssinet, Marie; Gordon, Karl H. J.

2017-03-01

The Old World bollworm Helicoverpa armigera is now established in Brazil but efforts to identify incursion origin(s) and pathway(s) have met with limited success due to the patchiness of available data. Using international agricultural/horticultural commodity trade data and mitochondrial DNA (mtDNA) cytochrome oxidase I (COI) and cytochrome b (Cyt b) gene markers, we inferred the origins and incursion pathways into Brazil. We detected 20 mtDNA haplotypes from six Brazilian states, eight of which were new to our 97 global COI-Cyt b haplotype database. Direct sequence matches indicated five Brazilian haplotypes had Asian, African, and European origins. We identified 45 parsimoniously informative sites and multiple substitutions per site within the concatenated (945 bp) nucleotide dataset, implying that probabilistic phylogenetic analysis methods are needed. High diversity and signatures of uniquely shared haplotypes with diverse localities combined with the trade data suggested multiple incursions and introduction origins in Brazil. Increasing agricultural/horticultural trade activities between the Old and New Worlds represents a significant biosecurity risk factor. Identifying pest origins will enable resistance profiling that reflects countries of origin to be included when developing a resistance management strategy, while identifying incursion pathways will improve biosecurity protocols and risk analysis at biosecurity hotspots including national ports.
Phylogeny and Haplotype Analysis of Fungi Within the Fusarium incarnatum-equiseti Species Complex.

PubMed

Ramdial, H; Latchoo, R K; Hosein, F N; Rampersad, S N

2017-01-01

Fusarium spp. are ranked among the top 10 most economically and scientifically important plant-pathogenic fungi in the world and are associated with plant diseases that include fruit decay of a number of crops. Fusarium isolates infecting bell pepper in Trinidad were identified based on sequence comparisons of the translation elongation factor gene (EF-1a) with sequences of Fusarium incarnatum-equiseti species complex (FIESC) verified in the FUSARIUM-ID database. Eighty-two isolates were identified as belonging to one of four phylogenetic species within the subclades FIESC-1, FIESC-15, FIESC-16, and FIESC-26, with the majority of isolates belonging to FIESC-15. A comparison of the level of DNA polymorphism and phylogenetic inference for sequences of the internal transcribed spacer region (ITS1-5.8S-ITS2) and EF-1a sequences for Trinidad and FUSARIUM-ID type species was carried out. The ITS sequences were less informative, had lower haplotype diversity and restricted haplotype distribution, and resulted in poor resolution and taxa placement in the consensus maximum-likelihood tree. EF-1a sequences enabled strongly supported phylogenetic inference with highly resolved branching patterns of the 30 phylogenetic species within the FIESC and placement of representative Trinidad isolates. Therefore, global phylogeny was inferred from EF-1a sequences representing 11 countries, and separation into distinct Incarnatum and Equiseti clades was again evident. In total, 42 haplotypes were identified: 12 were shared and the remaining were unique haplotypes. The most diverse haplotype was represented by sequences from China, Indonesia, Malaysia, and Trinidad and consisted exclusively of F. incarnatum isolates. Spain had the highest haplotype diversity, perhaps because both F. equiseti and F. incarnatum sequences were represented; followed by the United States, which contributed both F. equiseti and F. incarnatum sequences to the data set; then by countries representing Southeast Asia (China, Indonesia, Malaysia, Thailand, and Philippines) and Trinidad; both of these regions were represented by only F. incarnatum sequences. Trinidad shared two haplotypes with China and one haplotype with the United States for only F. incarnatum isolates. The findings of this study are important for devising disease management strategies and for understanding the phylogenetic relationships among members of the FIESC.
Novel pfdhps Haplotypes among Imported Cases of Plasmodium falciparum Malaria in the United Kingdom ▿

PubMed Central

Sutherland, Colin J.; Fifer, Helen; Pearce, Richard J.; bin Reza, Faisal; Nicholas, Meredydd; Haustein, Thomas; Njimgye-Tekumafor, Njah E.; Doherty, Justin F.; Gothard, Philip; Polley, Spencer D.; Chiodini, Peter L.

2009-01-01

Treatment of acute malaria caused by Plasmodium falciparum may include long-half-life drugs, such as the antifolate combination sulfadoxine-pyrimethamine (SP), to provide posttreatment chemoprophylaxis against parasite recrudescence or delayed emergence from the liver. An unusual case of P. falciparum recrudescence in a returned British traveler who received such a regimen, as well as a series of 44 parasite isolates from the same hospital, was analyzed by PCR and direct DNA sequencing for the presence of markers of parasite resistance to chloroquine and antifolates. The index patient harbored a mixture of wild-type and resistant pfdhfr and pfdhps alleles upon initial presentation. During his second malaria episode, he harbored only resistant parasites, with the haplotypes IRNI (codons 51, 59, 108, and 164) and SGEAA (codons 436, 437, 540, 581, and 613) at these two loci, respectively. Analysis of isolates from 44 other patients showed that the pfdhfr haplotype IRNI was common (found in 81% of cases). The SGEAA haplotype of pfdhps was uncommon (found only in eight cases of East African origin [17%]). A previously undescribed mutation, I431V, was observed for seven cases of Nigerian origin, occurring as one of two haplotypes, VAGKGS or VAGKAA. The presence of this mutation was also confirmed in isolates of Nigerian origin from the United Kingdom Malaria Reference Laboratory. The presence of the pfdhps haplotype SGEAA in P. falciparum parasites of East African origin appears to compromise the efficacy of treatment regimens that include SP as a means to prevent recrudescence. Parasites with novel pfdhps haplotypes are circulating in West Africa. The response of these parasites to chemotherapy needs to be evaluated. PMID:19433569
Surveying the maize community for their diversity and pedigree visualization needs to prioritize tool development and curation

USDA-ARS?s Scientific Manuscript database

The Maize Genetics and Genomics Database (MaizeGDB) team prepared a survey to identify breeders’ needs for visualizing pedigrees, diversity data, and haplotypes in order to prioritize tool development and curation efforts at MaizeGDB. The survey was distributed to the maize research community on beh...
Identification of specific angiotensin-converting enzyme variants and haplotypes that confer risk and protection against type 2 diabetic nephropathy.

PubMed

Ezzidi, Intissar; Mtiraoui, Nabil; Kacem, Maha; Chaieb, Molka; Mahjoub, Touhami; Almawi, Wassim Y

2009-11-01

Cross-sectional and family studies identified angiotensin-converting enzyme (ACE) gene as a risk factor for diabetic nephropathy (DN). The contribution of ACE gene variants to DN development and progression is controversial and varies among different ethnic/racial groups. We investigated the association of three ACE gene variants with DN, rs1799752 insertion/deletion (I/D), rs1800764T/C and rs12449782A/G in 917 Tunisian type 2 diabetic (T2DM) patients: 515 with (DN) and 402 without (DWN) nephropathy. ACE genotyping was done by PCR-based assays; haplotype estimation was performed using H-Plus software (chi(2)-test based). Genotype frequency distributions of the three studied variants were in Hardy-Weinberg equilibrium. Minor allele frequency of rs1800764 was higher in DN patients than DWN patients or healthy controls, and minor allele frequency of rs1799752 was higher in DN than DWN patients. Higher frequency of rs1799752 and rs1800764 homozygous mutant genotypes was seen in DN compared to DWN patients. Of the three variants, only rs1799752 deletion/deletion (D/D) genotype was associated with a significant increase in albumin to creatinine ratios levels, and D/D carriers had elevated low-density lipoprotein, total cholesterol and urea. Three locus haplotype [rs1799752(I/D)/rs1800764(T/C)/rs12449782(A/G)] analysis revealed that the frequency of DCG haplotype was higher, while that of ITG and ICA haplotypes were lower among unselected type 2 diabetic patients. Taking ITA haplotype as reference, multivariate regression analysis confirmed the negative (ITG), and positive (DCG, DTG, DCA and DTA) association of specific ACE haplotypes with DN, after adjusting for potential nephropathy-linked covariates. Our results support the involvement of specific ACE variants in DN pathogenesis and demonstrate the presence of DN-specific haplotypes at the ACE locus.
Accuracy of estimation of genomic breeding values in pigs using low-density genotypes and imputation.

PubMed

Badke, Yvonne M; Bates, Ronald O; Ernst, Catherine W; Fix, Justin; Steibel, Juan P

2014-04-16

Genomic selection has the potential to increase genetic progress. Genotype imputation of high-density single-nucleotide polymorphism (SNP) genotypes can improve the cost efficiency of genomic breeding value (GEBV) prediction for pig breeding. Consequently, the objectives of this work were to: (1) estimate accuracy of genomic evaluation and GEBV for three traits in a Yorkshire population and (2) quantify the loss of accuracy of genomic evaluation and GEBV when genotypes were imputed under two scenarios: a high-cost, high-accuracy scenario in which only selection candidates were imputed from a low-density platform and a low-cost, low-accuracy scenario in which all animals were imputed using a small reference panel of haplotypes. Phenotypes and genotypes obtained with the PorcineSNP60 BeadChip were available for 983 Yorkshire boars. Genotypes of selection candidates were masked and imputed using tagSNP in the GeneSeek Genomic Profiler (10K). Imputation was performed with BEAGLE using 128 or 1800 haplotypes as reference panels. GEBV were obtained through an animal-centric ridge regression model using de-regressed breeding values as response variables. Accuracy of genomic evaluation was estimated as the correlation between estimated breeding values and GEBV in a 10-fold cross validation design. Accuracy of genomic evaluation using observed genotypes was high for all traits (0.65-0.68). Using genotypes imputed from a large reference panel (accuracy: R(2) = 0.95) for genomic evaluation did not significantly decrease accuracy, whereas a scenario with genotypes imputed from a small reference panel (R(2) = 0.88) did show a significant decrease in accuracy. Genomic evaluation based on imputed genotypes in selection candidates can be implemented at a fraction of the cost of a genomic evaluation using observed genotypes and still yield virtually the same accuracy. On the other side, using a very small reference panel of haplotypes to impute training animals and candidates for selection results in lower accuracy of genomic evaluation.
Phylogenetic relationship and species delimitation of matsutake and allied species based on multilocus phylogeny and haplotype analyses.

PubMed

Ota, Yuko; Yamanaka, Takashi; Murata, Hitoshi; Neda, Hitoshi; Ohta, Akira; Kawai, Masataka; Yamada, Akiyoshi; Konno, Miki; Tanaka, Chihiro

2012-01-01

Tricholoma matsutake (S. Ito & S. Imai) Singer and its allied species are referred to as matsutake worldwide and are the most economically important edible mushrooms in Japan. They are widely distributed in the northern hemisphere and established an ectomycorrhizal relationship with conifer and broadleaf trees. To clarify relationships among T. matsutake and its allies, and to delimit phylogenetic species, we analyzed multilocus datasets (ITS, megB1, tef, gpd) with samples that were correctly identified based on morphological characteristics. Phylogenetic analyses clearly identified four major groups: matsutake, T. bakamatsutake, T. fulvocastaneum and T. caligatum; the latter three species were outside the matsutake group. The haplotype analyses and median-joining haplotype network analyses showed that the matsutake group included four closely related but clearly distinct taxa (T. matsutake, T. anatolicum, Tricholoma sp. from Mexico and T. magnivelare) from different geographical regions; these were considered to be distinct phylogenetic species.
LDSplitDB: a database for studies of meiotic recombination hotspots in MHC using human genomic data.

PubMed

Guo, Jing; Chen, Hao; Yang, Peng; Lee, Yew Ti; Wu, Min; Przytycka, Teresa M; Kwoh, Chee Keong; Zheng, Jie

2018-04-20

Meiotic recombination happens during the process of meiosis when chromosomes inherited from two parents exchange genetic materials to generate chromosomes in the gamete cells. The recombination events tend to occur in narrow genomic regions called recombination hotspots. Its dysregulation could lead to serious human diseases such as birth defects. Although the regulatory mechanism of recombination events is still unclear, DNA sequence polymorphisms have been found to play crucial roles in the regulation of recombination hotspots. To facilitate the studies of the underlying mechanism, we developed a database named LDSplitDB which provides an integrative and interactive data mining and visualization platform for the genome-wide association studies of recombination hotspots. It contains the pre-computed association maps of the major histocompatibility complex (MHC) region in the 1000 Genomes Project and the HapMap Phase III datasets, and a genome-scale study of the European population from the HapMap Phase II dataset. Besides the recombination profiles, related data of genes, SNPs and different types of epigenetic modifications, which could be associated with meiotic recombination, are provided for comprehensive analysis. To meet the computational requirement of the rapidly increasing population genomics data, we prepared a lookup table of 400 haplotypes for recombination rate estimation using the well-known LDhat algorithm which includes all possible two-locus haplotype configurations. To the best of our knowledge, LDSplitDB is the first large-scale database for the association analysis of human recombination hotspots with DNA sequence polymorphisms. It provides valuable resources for the discovery of the mechanism of meiotic recombination hotspots. The information about MHC in this database could help understand the roles of recombination in human immune system. DATABASE URL: http://histone.scse.ntu.edu.sg/LDSplitDB.
Differential distribution of Y-chromosome haplotypes in Swiss and Southern European goat breeds.

PubMed

Vidal, Oriol; Drögemüller, Cord; Obexer-Ruff, Gabriela; Reber, Irene; Jordana, Jordi; Martínez, Amparo; Bâlteanu, Valentin Adrian; Delgado, Juan Vicente; Eghbalsaied, Shahin; Landi, Vincenzo; Goyache, Felix; Traoré, Amadou; Pazzola, Michele; Vacca, Giuseppe Massimo; Badaoui, Bouabid; Pilla, Fabio; D'Andrea, Mariasilvia; Álvarez, Isabel; Capote, Juan; Sharaf, Abdoallah; Pons, Àgueda; Amills, Marcel

2017-11-23

The analysis of Y-chromosome variation has provided valuable clues about the paternal history of domestic animal populations. The main goal of the current work was to characterize Y-chromosome diversity in 31 goat populations from Central Eastern (Switzerland and Romania) and Southern Europe (Spain and Italy) as well as in reference populations from Africa and the Near East. Towards this end, we have genotyped seven single nucleotide polymorphisms (SNPs), mapping to the SRY, ZFY, AMELY and DDX3Y Y-linked loci, in 275 bucks from 31 populations. We have observed a low level of variability in the goat Y-chromosome, with just five haplotypes segregating in the whole set of populations. We have also found that Swiss bucks carry exclusively Y1 haplotypes (Y1A: 24%, Y1B1: 15%, Y1B2: 43% and Y1C: 18%), while in Italian and Spanish bucks Y2A is the most abundant haplotype (77%). Interestingly, in Carpathian goats from Romania the Y2A haplotype is also frequent (42%). The high Y-chromosome differentiation between Swiss and Italian/Spanish breeds might be due to the post-domestication spread of two different Near Eastern genetic stocks through the Danubian and Mediterranean corridors. Historical gene flow between Southern European and Northern African goats might have also contributed to generate such pattern of genetic differentiation.
Molecular genetic identification of skeletal remains from the Second World War Konfin I mass grave in Slovenia

PubMed Central

Gornjak Pogorelc, Barbara; Balažic, Jože

2010-01-01

This paper describes molecular genetic identification of one third of the skeletal remains of 88 victims of postwar (June 1945) killings found in the Konfin I mass grave in Slovenia. Living relatives were traced for 36 victims. We analyzed 84 right femurs and compared their genetic profiles to the genetic material of living relatives. We cleaned the bones, removed surface contamination, and ground the bones into powder. Prior to DNA isolation using Biorobot EZ1 (Qiagen), the powder was decalcified. The nuclear DNA of the samples was quantified using the real-time polymerase chain reaction method. We extracted 0.8 to 100 ng DNA/g of bone powder from 82 bones. Autosomal genetic profiles and Y-chromosome haplotypes were obtained from 98% of the bones, and mitochondrial DNA (mtDNA) haplotypes from 95% of the bones for the HVI region and from 98% of the bones for the HVII region. Genetic profiles of the nuclear and mtDNA were determined for reference persons. For traceability in the event of contamination, we created an elimination database including genetic profiles of the nuclear and mtDNA of all persons that had been in contact with the skeletal remains. When comparing genetic profiles, we matched 28 of the 84 bones analyzed with living relatives (brothers, sisters, sons, daughters, nephews, or cousins). The statistical analyses showed a high confidence of correct identification for all 28 victims in the Konfin I mass grave (posterior probability ranged from 99.9% to more than 99.999999%). PMID:20217112

Molecular genetic identification of skeletal remains from the Second World War Konfin I mass grave in Slovenia.

PubMed

Zupanic Pajnic, Irena; Gornjak Pogorelc, Barbara; Balazic, Joze

2010-07-01

This paper describes molecular genetic identification of one third of the skeletal remains of 88 victims of postwar (June 1945) killings found in the Konfin I mass grave in Slovenia. Living relatives were traced for 36 victims. We analyzed 84 right femurs and compared their genetic profiles to the genetic material of living relatives. We cleaned the bones, removed surface contamination, and ground the bones into powder. Prior to DNA isolation using Biorobot EZ1 (Qiagen), the powder was decalcified. The nuclear DNA of the samples was quantified using the real-time polymerase chain reaction method. We extracted 0.8 to 100 ng DNA/g of bone powder from 82 bones. Autosomal genetic profiles and Y-chromosome haplotypes were obtained from 98% of the bones, and mitochondrial DNA (mtDNA) haplotypes from 95% of the bones for the HVI region and from 98% of the bones for the HVII region. Genetic profiles of the nuclear and mtDNA were determined for reference persons. For traceability in the event of contamination, we created an elimination database including genetic profiles of the nuclear and mtDNA of all persons that had been in contact with the skeletal remains. When comparing genetic profiles, we matched 28 of the 84 bones analyzed with living relatives (brothers, sisters, sons, daughters, nephews, or cousins). The statistical analyses showed a high confidence of correct identification for all 28 victims in the Konfin I mass grave (posterior probability ranged from 99.9% to more than 99.999999%).
Multiple SNPs in Intron 41 of Thyroglobulin Gene Are Associated with Autoimmune Thyroid Disease in the Japanese Population

PubMed Central

Ban, Yoshiyuki; Tozaki, Teruaki; Taniyama, Matsuo; Skrabanek, Luce; Nakano, Yasuko; Ban, Yoshio; Hirano, Tsutomu

2012-01-01

Background The etiology of the autoimmune thyroid diseases (AITDs), Graves' disease (GD) and Hashimoto's thyroiditis (HT), is largely unknown. However, genetic susceptibility is believed to play a major role. Two whole genome scans from Japan and from the US identified a locus on chromosome 8q24 that showed evidence for linkage with AITD and HT. Recent studies have demonstrated an association between thyroglobulin (Tg) polymorphisms and AITD in Caucasians, suggesting that Tg is a susceptibility gene on 8q24. Objectives The objective of the study was to refine Tg association with AITD, by analyzing a panel of 25 SNPs across an extended 260 kb region of the Tg. Methods We studied 458 Japanese AITD patients (287 GD and 171 HT patients) and 221 matched Japanese control subjects in association studies. Case-control association studies were performed using 25 Tg single nucleotide polymorphisms (SNPs) chosen from a database of the Single Nucleotide Polymorphism Database (dbSNP). Haplotype analysis was undertaken using the computer program SNPAlyze version 7.0. Principal Findings and Conclusions In total, 5 SNPs revealed association with GD (P<0.05), with the strongest SNP associations at rs2256366 (P = 0.002) and rs2687836 (P = 0.0077), both located in intron 41 of the Tg gene. Because of the strong LD between these two strongest associated variants, we performed the haplotype analysis, and identified a major protective haplotype for GD (P = 0.001).These results suggested that the Tg gene is involved in susceptibility for GD and AITD in the Japanese. PMID:22662162
Gene-based association study of genes linked to hippocampal sclerosis of aging neuropathology: GRN, TMEM106B, ABCC9, and KCNMB2

PubMed Central

Katsumata, Yuriko; Nelson, Peter T.; Ellingson, Sally R.; Fardo, David W.

2017-01-01

Hippocampal sclerosis of aging (HS-Aging) is a common neurodegenerative condition associated with dementia. To learn more about genetic risk of HS-Aging pathology, we tested gene-based associations of the GRN, TMEM106B, ABCC9, and KCNMB2 genes, which were reported to be associated with HS-Aging pathology in previous studies. Genetic data were obtained from the Alzheimer’s Disease Genetics Consortium (ADGC), linked to autopsy-derived neuropathological outcomes from the National Alzheimer’s Coordinating Center (NACC). Of the 3,251 subjects included in the study, 271 (8.3%) were identified as an HS-Aging case. The significant gene-based association between the ABCC9 gene and HS-Aging appeared to be driven by a region in which a significant haplotype-based association was found. We tested this haplotype as an expression Quantitative Trait Locus (eQTL) using two different public-access brain gene expression databases. The HS-Aging pathology protective ABCC9 haplotype was associated with decreased ABCC9 expression, indicating a possible toxic gain of function. PMID:28131462
Reference quality assembly of the 3.5-Gb genome of Capsicum annuum from a single linked-read library.

PubMed

Hulse-Kemp, Amanda M; Maheshwari, Shamoni; Stoffel, Kevin; Hill, Theresa A; Jaffe, David; Williams, Stephen R; Weisenfeld, Neil; Ramakrishnan, Srividya; Kumar, Vijay; Shah, Preyas; Schatz, Michael C; Church, Deanna M; Van Deynze, Allen

2018-01-01

Linked-Read sequencing technology has recently been employed successfully for de novo assembly of human genomes, however, the utility of this technology for complex plant genomes is unproven. We evaluated the technology for this purpose by sequencing the 3.5-gigabase (Gb) diploid pepper ( Capsicum annuum ) genome with a single Linked-Read library. Plant genomes, including pepper, are characterized by long, highly similar repetitive sequences. Accordingly, significant effort is used to ensure that the sequenced plant is highly homozygous and the resulting assembly is a haploid consensus. With a phased assembly approach, we targeted a heterozygous F 1 derived from a wide cross to assess the ability to derive both haplotypes and characterize a pungency gene with a large insertion/deletion. The Supernova software generated a highly ordered, more contiguous sequence assembly than all currently available C. annuum reference genomes. Over 83% of the final assembly was anchored and oriented using four publicly available de novo linkage maps. A comparison of the annotation of conserved eukaryotic genes indicated the completeness of assembly. The validity of the phased assembly is further demonstrated with the complete recovery of both 2.5-Kb insertion/deletion haplotypes of the PUN1 locus in the F 1 sample that represents pungent and nonpungent peppers, as well as nearly full recovery of the BUSCO2 gene set within each of the two haplotypes. The most contiguous pepper genome assembly to date has been generated which demonstrates that Linked-Read library technology provides a tool to de novo assemble complex highly repetitive heterozygous plant genomes. This technology can provide an opportunity to cost-effectively develop high-quality genome assemblies for other complex plants and compare structural and gene differences through accurate haplotype reconstruction.
Mapping the genetic diversity of HLA haplotypes in the Japanese populations

PubMed Central

Saw, Woei-Yuh; Liu, Xuanyao; Khor, Chiea-Chuen; Takeuchi, Fumihiko; Katsuya, Tomohiro; Kimura, Ryosuke; Nabika, Toru; Ohkubo, Takayoshi; Tabara, Yasuharu; Yamamoto, Ken; Yokota, Mitsuhiro; Akiyama, Koichi; Asano, Hiroyuki; Asayama, Kei; Haga, Toshikazu; Hara, Azusa; Hirose, Takuo; Hosaka, Miki; Ichihara, Sahoko; Imai, Yutaka; Inoue, Ryusuke; Ishiguro, Aya; Isomura, Minoru; Isono, Masato; Kamide, Kei; Kato, Norihiro; Katsuya, Tomohiro; Kikuya, Masahiro; Kohara, Katsuhiko; Matsubara, Tatsuaki; Matsuda, Ayako; Metoki, Hirohito; Miki, Tetsuro; Murakami, Keiko; Nabika, Toru; Nakatochi, Masahiro; Ogihara, Toshio; Ohnaka, Keizo; Ohkubo, Takayoshi; Rakugi, Hiromi; Satoh, Michihiro; Shiwaku, Kunihiro; Sugimoto, Ken; Tabara, Yasuharu; Takami, Yoichi; Takayanagi, Ryoichi; Takeuchi, Fumihiko; Tsubota-Utsugi, Megumi; Yamamoto, Ken; Yamamoto, Koichi; Yamasaki, Masayuki; Yasui, Daisaku; Yokota, Mitsuhiro; Teo, Yik-Ying; Kato, Norihiro

2015-01-01

Japan has often been viewed as an Asian country that possesses a genetically homogenous community. The basis for partitioning the country into prefectures has largely been geographical, although cultural and linguistic differences still exist between some of the districts/prefectures, especially between Okinawa and the mainland prefectures. The Major Histocompatibility Complex (MHC) region has consistently emerged as the most polymorphic region in the human genome, harbouring numerous biologically important variants; nevertheless the presence of population-specific long haplotypes hinders the imputation of SNPs and classical HLA alleles. Here, we examined the extent of genetic variation at the MHC between eight Japanese populations sampled from Okinawa, and six other prefectures located in or close to the mainland of Japan, specifically focusing at the haplotypes observed within each population, and what the impact of any variation has on imputation. Our results indicated that Okinawa was genetically farther to the mainland Japanese than were Gujarati Indians from Tamil Indians, while the mainland Japanese from six prefectures were more homogeneous than between northern and southern Han Chinese. The distribution of haplotypes across Japan was similar, although imputation was most accurate for Okinawa and several mainland prefectures when population-specific panels were used as reference. PMID:26648100
Detection of Haplotypes Associated with Prenatal Death in Dairy Cattle and Identification of Deleterious Mutations in GART, SHBG and SLC37A2

PubMed Central

Fritz, Sébastien; Capitan, Aurelien; Djari, Anis; Rodriguez, Sabrina C.; Barbat, Anne; Baur, Aurélia; Grohs, Cécile; Weiss, Bernard; Boussaha, Mekki; Esquerré, Diane; Klopp, Christophe; Rocha, Dominique; Boichard, Didier

2013-01-01

The regular decrease of female fertility over time is a major concern in modern dairy cattle industry. Only half of this decrease is explained by indirect response to selection on milk production, suggesting the existence of other factors such as embryonic lethal genetic defects. Genomic regions harboring recessive deleterious mutations were detected in three dairy cattle breeds by identifying frequent haplotypes (>1%) showing a deficit in homozygotes among Illumina Bovine 50k Beadchip haplotyping data from the French genomic selection database (47,878 Holstein, 16,833 Montbéliarde, and 11,466 Normande animals). Thirty-four candidate haplotypes (p<10−4) including previously reported regions associated with Brachyspina, CVM, HH1, and HH3 in Holstein breed were identified. Haplotype length varied from 1 to 4.8 Mb and frequencies from 1.7 up to 9%. A significant negative effect on calving rate, consistent in heifers and in lactating cows, was observed for 9 of these haplotypes in matings between carrier bulls and daughters of carrier sires, confirming their association with embryonic lethal mutations. Eight regions were further investigated using whole genome sequencing data from heterozygous bull carriers and control animals (45 animals in total). Six strong candidate causative mutations including polymorphisms previously reported in FANCI (Brachyspina), SLC35A3 (CVM), APAF1 (HH1) and three novel mutations with very damaging effect on the protein structure, according to SIFT and Polyphen-2, were detected in GART, SHBG and SLC37A2 genes. In conclusion, this study reveals a yet hidden consequence of the important inbreeding rate observed in intensively selected and specialized cattle breeds. Counter-selection of these mutations and management of matings will have positive consequences on female fertility in dairy cattle. PMID:23762392
Evolutionary and functional mitogenomics associated with the genetic restoration of the Florida panther

USGS Publications Warehouse

Ochoa, Alexander; Onorato, David P.; Fitak, Robert R.; Roelke-Parker, Melody; Culver, Melanie

2017-01-01

Florida panthers are endangered pumas that currently persist in reduced patches of habitat in South Florida, USA. We performed mitogenome reference-based assemblies for most parental lines of the admixed Florida panthers that resulted from the introduction of female Texas pumas into South Florida in 1995. With the addition of 2 puma mitogenomes, we characterized 174 single nucleotide polymorphisms (SNPs) across 12 individuals. We defined 5 haplotypes (Pco1–Pco5), one of which (Pco1) had a geographic origin exclusive to Costa Rica and Panama and was possibly introduced into the Everglades National Park, Florida, prior to 1995. Haplotype Pco2 was native to Florida. Haplotypes Pco3 and Pco4 were exclusive to Texas, whereas haplotype Pco5 had an undetermined geographic origin. Phylogenetic inference suggests that haplotypes Pco1–Pco4 diverged ~202000 (95% HPDI = 83000–345000) years ago and that haplotypes Pco2–Pco4 diverged ~61000 (95% HPDI = 9000–127000) years ago. These results are congruent with a south-to-north continental expansion and with a recent North American colonization by pumas. Furthermore, pumas may have migrated from Texas to Florida no earlier than ~44000 (95% HPDI = 2000–98000) years ago. Synonymous mutations presented a greater mean substitution rate than other mitochondrial functional regions: nonsynonymous mutations, tRNAs, rRNAs, and control region. Similarly, all protein-coding genes were under predominant negative selection constraints. We directly and indirectly assessed the presence of potential deleterious SNPs in the ND2 and ND5 genes in Florida panthers prior to and as a consequence of the introduction of Texas pumas. Screenings for such variants are recommended in extant Florida panthers.
Analysis of single nucleotide polymorphisms in the 3' region of the estrogen receptor 1 gene in normal and cryptorchid Miniature Dachshunds and Chihuahuas.

PubMed

Pathirana, Indunil Nishantha; Tanaka, Kakeru; Kawate, Noritoshi; Tsuji, Makoto; Kida, Kayoko; Hatoya, Shingo; Inaba, Toshio; Tamada, Hiromichi

2010-08-01

This study was performed to examine the distribution of single nucleotide polymorphisms (SNPs) and estimated haplotypes in the canine estrogen receptor (ER) alpha gene (ESR1) and the association of them with different phenotypes of cryptorchidism (CO) in Miniature Dachshunds and Chihuahuas. Forty CO and 68 normal dogs were used, and CO was classified into unilateral (UCO; n=33) and bilateral CO (BCO; n=5) or into abdominal (ACO; n=16) and inguinal CO (ICO; n=22). Thirteen DNA fragments located in the 70-kb region at the 3' end of ESR1 were amplified by PCR and sequenced to examine 13 SNPs (#1-#13) reported in a canine SNP database. Ten SNPs (#1-#4, #7, #8, #10-#13) were not polymorphic, and 5 new SNPs (#14-#18) were discovered. A common haplotype block in normal, CO and CO phenotypes was identified for an approximately 20-kb region encompassing 4 SNPs (#14-#17). Allele, genotype and haplotype frequencies in CO without classification by phenotype and also in UCO, ACO and ICO phenotypes were not statistically different from the normal group. Significant differences in genotype frequencies and homozygosity for the estimated GTTG haplotype within the block were observed in BCO compared with the normal group, although the number of BCO animals was small. Our results demonstrate that the examined SNPs and haplotypes in the 3' end of canine ESR1 are not associated with unilateral, abdominal and inguinal CO phenotypes and CO per se in Miniature Dachshunds and Chihuahuas. Further studies are necessary to suggest a clear association between the ESR1 SNPs and bilateral CO in dogs.
Inclusion of Population-specific Reference Panel from India to the 1000 Genomes Phase 3 Panel Improves Imputation Accuracy.

PubMed

Ahmad, Meraj; Sinha, Anubhav; Ghosh, Sreya; Kumar, Vikrant; Davila, Sonia; Yajnik, Chittaranjan S; Chandak, Giriraj R

2017-07-27

Imputation is a computational method based on the principle of haplotype sharing allowing enrichment of genome-wide association study datasets. It depends on the haplotype structure of the population and density of the genotype data. The 1000 Genomes Project led to the generation of imputation reference panels which have been used globally. However, recent studies have shown that population-specific panels provide better enrichment of genome-wide variants. We compared the imputation accuracy using 1000 Genomes phase 3 reference panel and a panel generated from genome-wide data on 407 individuals from Western India (WIP). The concordance of imputed variants was cross-checked with next-generation re-sequencing data on a subset of genomic regions. Further, using the genome-wide data from 1880 individuals, we demonstrate that WIP works better than the 1000 Genomes phase 3 panel and when merged with it, significantly improves the imputation accuracy throughout the minor allele frequency range. We also show that imputation using only South Asian component of the 1000 Genomes phase 3 panel works as good as the merged panel, making it computationally less intensive job. Thus, our study stresses that imputation accuracy using 1000 Genomes phase 3 panel can be further improved by including population-specific reference panels from South Asia.
Influences of APOA5 Variants on Plasma Triglyceride Levels in Uyghur Population

PubMed Central

Wang, Yi; Wu, Di; Jin, Li; Wang, Xiaofeng

2014-01-01

Objective Single nucleotide polymorphisms (SNPs) in apolipoprotein A5 (APOA5) gene are associated with triglyceride (TG) levels. However, the minor allele frequencies and linkage disequilibriums (LDs) of the SNPs in addition to their effects on TG levels vary greatly between Caucasians and East Asians. The distributions of the SNPs/haplotypes and their associations with TG levels in Uyghur population, an admixture population of Caucasians and East Asians, have not been reported to date. Here, we performed a cross-sectional study to address these. Methods Genotyping of four SNPs in APOA5 (rs662799, rs3135506, rs2075291, and rs2266788) was performed in 1174 unrelated Uyghur subjects. SNP/haplotype and TG association analyses were conducted. Results The frequencies of the SNPs in Uyghurs were in between those in Caucasians and East Asians. The LD between rs662799 and rs2266788 in Uyghurs was stronger than that in East Asians but weaker than that in Caucasians, and the four SNPs resulted in four haplotypes (TGGT, CGGC, TCGT, and CGTT arranged in the order of rs662799, rs3135506, rs2075291, and rs2266788) representing 99.2% of the population. All the four SNPs were significantly associated with TG levels. Compared with non-carriers, carriers of rs662799-C, rs3135506-C, rs2075291-T, and rs2266788-C alleles had 16.0%, 15.1%, 17.1%, and 12.4% higher TG levels, respectively. When haplotype TGGT was defined as the reference, the haplotypes CGGC, TCGT, and CGTT resulted in 16.1%, 19.0%, and 19.8% higher TG levels, respectively. The proportions of variance in TG explained by APOA5 locus were 2.5%, 0.3%, 0.4%, and 1.9% for single SNP rs662799, rs3135506, rs2075291, and rs2266788, respectively, and 3.0% for the haplotypes constructed by them. Conclusions The association profiles between the SNPs and haplotypes at APOA5 locus and TG levels in this admixture population differed from those in Caucasians and East Asians. The functions of these SNPs and haplotypes need to be elucidated comprehensively. PMID:25313938
GeneImp: Fast Imputation to Large Reference Panels Using Genotype Likelihoods from Ultralow Coverage Sequencing

PubMed Central

Spiliopoulou, Athina; Colombo, Marco; Orchard, Peter; Agakov, Felix; McKeigue, Paul

2017-01-01

We address the task of genotype imputation to a dense reference panel given genotype likelihoods computed from ultralow coverage sequencing as inputs. In this setting, the data have a high-level of missingness or uncertainty, and are thus more amenable to a probabilistic representation. Most existing imputation algorithms are not well suited for this situation, as they rely on prephasing for computational efficiency, and, without definite genotype calls, the prephasing task becomes computationally expensive. We describe GeneImp, a program for genotype imputation that does not require prephasing and is computationally tractable for whole-genome imputation. GeneImp does not explicitly model recombination, instead it capitalizes on the existence of large reference panels—comprising thousands of reference haplotypes—and assumes that the reference haplotypes can adequately represent the target haplotypes over short regions unaltered. We validate GeneImp based on data from ultralow coverage sequencing (0.5×), and compare its performance to the most recent version of BEAGLE that can perform this task. We show that GeneImp achieves imputation quality very close to that of BEAGLE, using one to two orders of magnitude less time, without an increase in memory complexity. Therefore, GeneImp is the first practical choice for whole-genome imputation to a dense reference panel when prephasing cannot be applied, for instance, in datasets produced via ultralow coverage sequencing. A related future application for GeneImp is whole-genome imputation based on the off-target reads from deep whole-exome sequencing. PMID:28348060
Comparison of phasing strategies for whole human genomes

PubMed Central

Kirkness, Ewen; Schork, Nicholas J.

2018-01-01

Humans are a diploid species that inherit one set of chromosomes paternally and one homologous set of chromosomes maternally. Unfortunately, most human sequencing initiatives ignore this fact in that they do not directly delineate the nucleotide content of the maternal and paternal copies of the 23 chromosomes individuals possess (i.e., they do not ‘phase’ the genome) often because of the costs and complexities of doing so. We compared 11 different widely-used approaches to phasing human genomes using the publicly available ‘Genome-In-A-Bottle’ (GIAB) phased version of the NA12878 genome as a gold standard. The phasing strategies we compared included laboratory-based assays that prepare DNA in unique ways to facilitate phasing as well as purely computational approaches that seek to reconstruct phase information from general sequencing reads and constructs or population-level haplotype frequency information obtained through a reference panel of haplotypes. To assess the performance of the 11 approaches, we used metrics that included, among others, switch error rates, haplotype block lengths, the proportion of fully phase-resolved genes, phasing accuracy and yield between pairs of SNVs. Our comparisons suggest that a hybrid or combined approach that leverages: 1. population-based phasing using the SHAPEIT software suite, 2. either genome-wide sequencing read data or parental genotypes, and 3. a large reference panel of variant and haplotype frequencies, provides a fast and efficient way to produce highly accurate phase-resolved individual human genomes. We found that for population-based approaches, phasing performance is enhanced with the addition of genome-wide read data; e.g., whole genome shotgun and/or RNA sequencing reads. Further, we found that the inclusion of parental genotype data within a population-based phasing strategy can provide as much as a ten-fold reduction in phasing errors. We also considered a majority voting scheme for the construction of a consensus haplotype combining multiple predictions for enhanced performance and site coverage. Finally, we also identified DNA sequence signatures associated with the genomic regions harboring phasing switch errors, which included regions of low polymorphism or SNV density. PMID:29621242
Different DRB1*03:01-DQB1*02:01 haplotypes confer different risk for celiac disease.

PubMed

Alshiekh, S; Zhao, L P; Lernmark, Å; Geraghty, D E; Naluai, Å T; Agardh, D

2017-08-01

Celiac disease is associated with the HLA-DR3-DQA1*05:01-DQB1*02:01 and DR4-DQA1*03:01-DQB1*03:02 haplotypes. In addition, there are currently over 40 non-HLA loci associated with celiac disease. This study extends previous analyses on different HLA haplotypes in celiac disease using next generation targeted sequencing. Included were 143 patients with celiac disease and 135 non-celiac disease controls investigated at median 9.8 years (1.4-18.3 years). PCR-based amplification of HLA and sequencing with Illumina MiSeq technology were used for extended sequencing of the HLA class II haplotypes HLA-DRB1, DRB3, DRB4, DRB5, DQA1 and DQB1, respectively. Odds ratios were computed marginally for every allele and haplotype as the ratio of allelic frequency in patients and controls as ratio of exposure rates (RR), when comparing a null reference with equal exposure rates in cases and controls. Among the extended HLA haplotypes, the strongest risk haplotype for celiac disease was shown for DRB3*01:01:02 in linkage with DQA1*05:01-DQB1*02:01 (RR = 6.34; P-value < .0001). In a subpopulation analysis, DRB3*01:01:02-DQA1*05:01-DQB1*02:01 remained the most significant in patients with Scandinavian ethnicity (RR = 4.63; P < .0001) whereas DRB1*07:01:01-DRB4*01:03:01-DQA1*02:01-DQB1*02:02:01 presented the highest risk of celiac disease among non-Scandinavians (RR = 7.94; P = .011). The data also revealed 2 distinct celiac disease risk DR3-DQA1*05:01-DQB*02:01 haplotypes distinguished by either the DRB3*01:01:02 or DRB3*02:02:01 alleles, indicating that different DRB1*03:01-DQB1*02:01 haplotypes confer different risk for celiac disease. The associated risk of celiac disease for DR3-DRB3*01:01:02-DQA1*05:01-DQB1*02:01 is predominant among patients of Scandinavian ethnicity. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Mitochondrial cytochrome b sequence variations and population structure of Siberian chipmunk (Tamias sibiricus) in Northeastern Asia and population substructure in South Korea.

PubMed

Lee, Mu-Yeong; Lissovsky, Andrey A; Park, Sun-Kyung; Obolenskaya, Ekaterina V; Dokuchaev, Nikolay E; Zhang, Ya-Ping; Yu, Li; Kim, Young-Jun; Voloshina, Inna; Myslenkov, Alexander; Choi, Tae-Young; Min, Mi-Sook; Lee, Hang

2008-12-31

Twenty-five chipmunk species occur in the world, of which only the Siberian chipmunk, Tamias sibiricus, inhabits Asia. To investigate mitochondrial cytochrome b sequence variations and population structure of the Siberian chipmunk in northeastern Asia, we examined mitochondrial cytochrome b sequences (1140 bp) from 3 countries. Analyses of 41 individuals from South Korea and 33 individuals from Russia and northeast China resulted in 37 haplotypes and 27 haplotypes, respectively. There were no shared haplotypes between South Korea and Russia--northeast China. Phylogenetic trees and network analysis showed 2 major maternal lineages for haplotypes, referred to as the S and R lineages. Haplotype grouping in each cluster was nearly coincident with its geographic affinity. In particular, 3 distinct groups were found that mostly clustered in the northern, central and southern parts of South Korea. Nucleotide diversity of the S lineage was twice that of lineage R. The divergence between S and R lineages was estimated to be 2.98-0.98 Myr. During the ice age, there may have been at least 2 refuges in South Korea and Russia--northeast China. The sequence variation between the S and R lineages was 11.3% (K2P), which is indicative of specific recognition in rodents. These results suggest that T. sibiricus from South Korea could be considered a separate species. However, additional information, such as details of distribution, nuclear genes data or morphology, is required to strengthen this hypothesis.
Diversity, abundance, and host relationships of avian malaria and related haemosporidians in New Mexico pine forests.

PubMed

Marroquin-Flores, Rosario A; Williamson, Jessie L; Chavez, Andrea N; Bauernfeind, Selina M; Baumann, Matthew J; Gadek, Chauncey R; Johnson, Andrew B; McCullough, Jenna M; Witt, Christopher C; Barrow, Lisa N

2017-01-01

Avian malaria and related haemosporidian parasites (genera Haemoproteus , Plasmodium , and Leucocytozoon ) affect bird demography, species range limits, and community structure, yet they remain unsurveyed in most bird communities and populations. We conducted a community-level survey of these vector-transmitted parasites in New Mexico, USA, to describe their diversity, abundance, and host associations. We focused on the breeding-bird community in the transition zone between piñon-juniper woodland and ponderosa pine forests (elevational range: 2,150-2,460 m). We screened 186 birds representing 49 species using both standard PCR and microscopy techniques to detect infections of all three avian haemosporidian genera. We detected infections in 68 out of 186 birds (36.6%), the highest proportion of which were infected with Haemoproteus (20.9%), followed by Leucocytozoon (13.4%), then Plasmodium (8.0%). We sequenced mtDNA for 77 infections representing 43 haplotypes (25 Haemoproteus , 12 Leucocytozoon , 6 Plasmodium ). When compared to all previously known haplotypes in the MalAvi and GenBank databases, 63% (27) of the haplotypes we recovered were novel. We found evidence for host specificity at the avian clade and species level, but this specificity was variable among parasite genera, in that Haemoproteus and Leucocytozoon were each restricted to three avian groups (out of six), while Plasmodium occurred in all groups except non-passerines. We found striking variation in infection rate among host species, with nearly universal infection among vireos and no infection among nuthatches. Using rarefaction and extrapolation, we estimated the total avian haemosporidian diversity to be 70 haplotypes (95% CI [43-98]); thus, we may have already sampled ∼60% of the diversity of avian haemosporidians in New Mexico pine forests. It is possible that future studies will find higher diversity in microhabitats or host species that are under-sampled or unsampled in the present study. Fortunately, this study is fully extendable via voucher specimens, frozen tissues, blood smears, parasite images, and documentation provided in open-access databases (MalAvi, GenBank, and ARCTOS).
Diversity, abundance, and host relationships of avian malaria and related haemosporidians in New Mexico pine forests

PubMed Central

Marroquin-Flores, Rosario A.; Williamson, Jessie L.; Chavez, Andrea N.; Bauernfeind, Selina M.; Baumann, Matthew J.; Gadek, Chauncey R.; Johnson, Andrew B.; McCullough, Jenna M.

2017-01-01

Avian malaria and related haemosporidian parasites (genera Haemoproteus, Plasmodium, and Leucocytozoon) affect bird demography, species range limits, and community structure, yet they remain unsurveyed in most bird communities and populations. We conducted a community-level survey of these vector-transmitted parasites in New Mexico, USA, to describe their diversity, abundance, and host associations. We focused on the breeding-bird community in the transition zone between piñon-juniper woodland and ponderosa pine forests (elevational range: 2,150–2,460 m). We screened 186 birds representing 49 species using both standard PCR and microscopy techniques to detect infections of all three avian haemosporidian genera. We detected infections in 68 out of 186 birds (36.6%), the highest proportion of which were infected with Haemoproteus (20.9%), followed by Leucocytozoon (13.4%), then Plasmodium (8.0%). We sequenced mtDNA for 77 infections representing 43 haplotypes (25 Haemoproteus, 12 Leucocytozoon, 6 Plasmodium). When compared to all previously known haplotypes in the MalAvi and GenBank databases, 63% (27) of the haplotypes we recovered were novel. We found evidence for host specificity at the avian clade and species level, but this specificity was variable among parasite genera, in that Haemoproteus and Leucocytozoon were each restricted to three avian groups (out of six), while Plasmodium occurred in all groups except non-passerines. We found striking variation in infection rate among host species, with nearly universal infection among vireos and no infection among nuthatches. Using rarefaction and extrapolation, we estimated the total avian haemosporidian diversity to be 70 haplotypes (95% CI [43–98]); thus, we may have already sampled ∼60% of the diversity of avian haemosporidians in New Mexico pine forests. It is possible that future studies will find higher diversity in microhabitats or host species that are under-sampled or unsampled in the present study. Fortunately, this study is fully extendable via voucher specimens, frozen tissues, blood smears, parasite images, and documentation provided in open-access databases (MalAvi, GenBank, and ARCTOS). PMID:28828279
Haplotag: Software for Haplotype-Based Genotyping-by-Sequencing Analysis

PubMed Central

Tinker, Nicholas A.; Bekele, Wubishet A.; Hattori, Jiro

2016-01-01

Genotyping-by-sequencing (GBS), and related methods, are based on high-throughput short-read sequencing of genomic complexity reductions followed by discovery of single nucleotide polymorphisms (SNPs) within sequence tags. This provides a powerful and economical approach to whole-genome genotyping, facilitating applications in genomics, diversity analysis, and molecular breeding. However, due to the complexity of analyzing large data sets, applications of GBS may require substantial time, expertise, and computational resources. Haplotag, the novel GBS software described here, is freely available, and operates with minimal user-investment on widely available computer platforms. Haplotag is unique in fulfilling the following set of criteria: (1) operates without a reference genome; (2) can be used in a polyploid species; (3) provides a discovery mode, and a production mode; (4) discovers polymorphisms based on a model of tag-level haplotypes within sequenced tags; (5) reports SNPs as well as haplotype-based genotypes; and (6) provides an intuitive visual “passport” for each inferred locus. Haplotag is optimized for use in a self-pollinating plant species. PMID:26818073
[Cloning and sequencing of KIR2DL1 framework gene cDNA and identification of a novel allele].

PubMed

Sun, Ge; Wang, Chang; Zhen, Jianxin; Zhang, Guobin; Xu, Yunping; Deng, Zhihui

2016-10-01

To develop an assay for cDNA cloning and haplotype sequencing of KIR2DL1 framework gene and determine the genotype of an ethnic Han from southern China. Total RNA was isolated from peripheral blood sample, and complementary DNA (cDNA) transcript was synthesized by RT-PCR. The entire coding sequence of the KIR2DL1 framework gene was amplified with a pair of KIR2DL1-specific PCR primers. The PCR products with a length of approximately 1.2 kb were then subjected to cloning and haplotype sequencing. A specific target fragment of the KIR2DL1 framework gene was obtained. Following allele separation, a wild-type KIR2DL1*00302 allele and a novel variant allele, KIR2DL1*031, were identified. Sequence alignment with KIR2DL1 alleles from the IPD-KIR Database showed that the novel allele KIR2DL1*031 has differed from the closest allele KIR2DL1*00302 by a non-synonymous mutation at CDS nt 188A>G (codon 42 GAG>GGG) in exon 4, which has caused an amino acid change Glu42Gly. The sequence of the novel allele KIR2DL1*031 was submitted to GenBank under the accession number KP025960 and to the IPD-KIR Database under the submission number IWS40001982. A name KIR2DL1*031 has been officially assigned by the World Health Organization (WHO) Nomenclature Committee. An assay for cDNA cloning and haplotype sequencing of KIR2DL1 has been established, which has a broad applications in KIR studies at allelic level.
mtDNAmanager: a Web-based tool for the management and quality analysis of mitochondrial DNA control-region sequences

PubMed Central

Lee, Hwan Young; Song, Injee; Ha, Eunho; Cho, Sung-Bae; Yang, Woo Ick; Shin, Kyoung-Jin

2008-01-01

Background For the past few years, scientific controversy has surrounded the large number of errors in forensic and literature mitochondrial DNA (mtDNA) data. However, recent research has shown that using mtDNA phylogeny and referring to known mtDNA haplotypes can be useful for checking the quality of sequence data. Results We developed a Web-based bioinformatics resource "mtDNAmanager" that offers a convenient interface supporting the management and quality analysis of mtDNA sequence data. The mtDNAmanager performs computations on mtDNA control-region sequences to estimate the most-probable mtDNA haplogroups and retrieves similar sequences from a selected database. By the phased designation of the most-probable haplogroups (both expected and estimated haplogroups), mtDNAmanager enables users to systematically detect errors whilst allowing for confirmation of the presence of clear key diagnostic mutations and accompanying mutations. The query tools of mtDNAmanager also facilitate database screening with two options of "match" and "include the queried nucleotide polymorphism". In addition, mtDNAmanager provides Web interfaces for users to manage and analyse their own data in batch mode. Conclusion The mtDNAmanager will provide systematic routines for mtDNA sequence data management and analysis via easily accessible Web interfaces, and thus should be very useful for population, medical and forensic studies that employ mtDNA analysis. mtDNAmanager can be accessed at . PMID:19014619
The Tip of the “Celiac Iceberg” in China: A Systematic Review and Meta-Analysis

PubMed Central

Yuan, Juanli; Gao, Jinyan; Li, Xin; Liu, Fahui; Wijmenga, Cisca; Chen, Hongbing; Gilissen, Luud J. W. J.

2013-01-01

Objective Until recently, celiac disease was considered to be rare in China. We aimed to estimate its true status. Methods By searching the MEDLINE database and four Chinese full-text databases (CNKI, CBM, VIP and WANFANG) (up to August 2012), as well as two HLA allele frequency net databases and the Chinese Statistics Yearbook databases, we systematically reviewed the literature on definite and suspected cases of celiac disease, the predisposing HLA allele frequencies, and on gluten exposure in China. Meta-analysis was performed by analyzing DQ2, DQ8 and DQB1*0201 gene frequencies and heterogeneity in populations from different geographic regions and ethnicities in China. Results At present, the number of reported celiac disease cases is extremely low in China. The frequencies of the HLA-DQ2.5 and HLA-DQ8 haplotypes were 3.4% (95% confidence interval 1.3–5.5%) and 2.1% (0.1–4.1%), respectively. HLA-DQ2 and HLA-DQ8 antigen frequencies were 18.4% (15.0–21.7%) and 8.0% (4.5–11.4%), respectively. The frequency of the DQB1*0201 allele was 10.5% (9.3–11.6%) and it was more common in the northern Chinese than in the southern Chinese populations. The chance of being exposed to gluten is rapidly increasing all over China nowadays. Conclusion The data on HLA haplotyping, in conjunction with increasing wheat consumption, strongly suggests that the occurrence of celiac disease is more common in China than currently reported. Coordinated measures by the Chinese government, medical and agricultural research institutions, and food industries, would be justified to create more awareness about celiac disease and to prevent it becoming a medical and societal burden. PMID:24324669

A comprehensively molecular haplotype-resolved genome of a European individual

PubMed Central

Suk, Eun-Kyung; McEwen, Gayle K.; Duitama, Jorge; Nowick, Katja; Schulz, Sabrina; Palczewski, Stefanie; Schreiber, Stefan; Holloway, Dustin T.; McLaughlin, Stephen; Peckham, Heather; Lee, Clarence; Huebsch, Thomas; Hoehe, Margret R.

2011-01-01

Independent determination of both haplotype sequences of an individual genome is essential to relate genetic variation to genome function, phenotype, and disease. To address the importance of phase, we have generated the most complete haplotype-resolved genome to date, “Max Planck One” (MP1), by fosmid pool-based next generation sequencing. Virtually all SNPs (>99%) and 80,000 indels were phased into haploid sequences of up to 6.3 Mb (N50 ∼1 Mb). The completeness of phasing allowed determination of the concrete molecular haplotype pairs for the vast majority of genes (81%) including potential regulatory sequences, of which >90% were found to be constituted by two different molecular forms. A subset of 159 genes with potentially severe mutations in either cis or trans configurations exemplified in particular the role of phase for gene function, disease, and clinical interpretation of personal genomes (e.g., BRCA1). Extended genomic regions harboring manifold combinations of physically and/or functionally related genes and regulatory elements were resolved into their underlying “haploid landscapes,” which may define the functional genome. Moreover, the majority of genes and functional sequences were found to contain individual or rare SNPs, which cannot be phased from population data alone, emphasizing the importance of molecular phasing for characterizing a genome in its molecular individuality. Our work provides the foundation to understand that the distinction of molecular haplotypes is essential to resolve the (inherently individual) biology of genes, genomes, and disease, establishing a reference point for “phase-sensitive” personal genomics. MP1's annotated haploid genomes are available as a public resource. PMID:21813624
A Mainly Circum-Mediterranean Origin for West Eurasian and North African mtDNAs in Puerto Rico with Strong Contributions from the Canary Islands and West Africa.

PubMed

Díaz-Zabala, Héctor J; Nieves-Colón, María A; Martínez-Cruzado, Juan C

2017-04-01

Maternal lineages of West Eurasian and North African origin account for 11.5% of total mitochondrial ancestry in Puerto Rico. Historical sources suggest that this ancestry arrived mostly from European migrations that took place during the four centuries of the Spanish colonization of Puerto Rico. This study analyzed 101 mitochondrial control region sequences and diagnostic coding region variants from a sample set randomly and systematically selected using a census-based sampling frame to be representative of the Puerto Rican population, with the goal of defining West Eurasian-North African maternal clades and estimating their possible geographical origin. Median-joining haplotype networks were constructed using hypervariable regions 1 and 2 sequences from various reference populations in search of shared haplotypes. A posterior probability analysis was performed to estimate the percentage of possible origins across wide geographic regions for the entire sample set and for the most common haplogroups on the island. Principal component analyses were conducted to place the Puerto Rican mtDNA set within the variation present among all reference populations. Our study shows that up to 38% of West Eurasian and North African mitochondrial ancestry in Puerto Rico most likely migrated from the Canary Islands. However, most of those haplotypes had previously migrated to the Canary Islands from elsewhere, and there are substantial contributions from various populations across the circum-Mediterranean region and from West African populations related to the modern Wolof and Serer peoples from Senegal and the nomad Fulani who extend up to Cameroon. In conclusion, the West Eurasian mitochondrial ancestry in Puerto Ricans is geographically diverse. However, haplotype diversity seems to be low, and frequencies have been shaped by population bottlenecks, migration waves, and random genetic drift. Consequently, approximately 47% of mtDNAs of West Eurasian and North African ancestry in Puerto Rico probably arrived early in its colonial history.
Functional Effects of Genetic Polymorphisms in the N-acetyltransferase 1 Coding and 3′ Untranslated Regions

PubMed Central

Zhu, Yuanqi; States, J. Christopher; Wang, Yang; Hein, David W.

2011-01-01

BACKGROUND The functional effects of N-acetyltransferase 1 (NAT1) polymorphisms and haplotypes are poorly understood, compromising the validity of associations reported with diseases including birth defects and numerous cancers. METHODS We investigated the effects of genetic polymorphisms within the NAT1 coding region and the 3′-untranslated region (3′-UTR) and their associated haplotypes on N- and O-acetyltransferase catalytic activities, and NAT1 mRNA and protein levels following recombinant expression in COS-1 cells. RESULTS 1088T>A (rs1057126; 3′-UTR) and 1095C>A (rs15561; 3′-UTR) each slightly reduced NAT1 catalytic activity and NAT1 mRNA and protein levels. A 9-base pair (TAATAATAA) deletion between nucleotides 1065-1090 (3′-UTR) reduced NAT1 catalytic activity and NAT1 mRNA and protein levels. In contrast, a 445G>A (rs4987076; V149I), 459G>A (rs4986990; T153T), 640T>G (rs4986783; S214A) coding region haplotype present in NAT1*11 increased NAT1 catalytic activity and NAT1 protein, but not NAT1 mRNA levels. A combination of the 9-base pair (TAATAATAA) deletion and the 445G>A, 459G>A, 640T>G coding region haplotypes, both present in NAT1*11, appeared to neutralize the opposing effects on NAT1 protein and catalytic activity, resulting in levels of NAT1 protein and catalytic activity that did not differ significantly from the NAT1*4 reference. CONCLUSIONS Since 1095C>A (3′-UTR) is the sole polymorphism present in NAT1*3, our data suggests that NAT1*3 is not functionally equivalent to the NAT1*4 reference. Furthermore, our findings provide biological support for reported associations of 1088T>A and 1095C>A polymorphisms with birth defects. PMID:21290563
De novo assembly and phasing of a Korean human genome.

PubMed

Seo, Jeong-Sun; Rhie, Arang; Kim, Junsoo; Lee, Sangjin; Sohn, Min-Hwan; Kim, Chang-Uk; Hastie, Alex; Cao, Han; Yun, Ji-Young; Kim, Jihye; Kuk, Junho; Park, Gun Hwa; Kim, Juhyeok; Ryu, Hanna; Kim, Jongbum; Roh, Mira; Baek, Jeonghun; Hunkapiller, Michael W; Korlach, Jonas; Shin, Jong-Yeon; Kim, Changhoon

2016-10-13

Advances in genome assembly and phasing provide an opportunity to investigate the diploid architecture of the human genome and reveal the full range of structural variation across population groups. Here we report the de novo assembly and haplotype phasing of the Korean individual AK1 (ref. 1) using single-molecule real-time sequencing, next-generation mapping, microfluidics-based linked reads, and bacterial artificial chromosome (BAC) sequencing approaches. Single-molecule sequencing coupled with next-generation mapping generated a highly contiguous assembly, with a contig N50 size of 17.9 Mb and a scaffold N50 size of 44.8 Mb, resolving 8 chromosomal arms into single scaffolds. The de novo assembly, along with local assemblies and spanning long reads, closes 105 and extends into 72 out of 190 euchromatic gaps in the reference genome, adding 1.03 Mb of previously intractable sequence. High concordance between the assembly and paired-end sequences from 62,758 BAC clones provides strong support for the robustness of the assembly. We identify 18,210 structural variants by direct comparison of the assembly with the human reference, identifying thousands of breakpoints that, to our knowledge, have not been reported before. Many of the insertions are reflected in the transcriptome and are shared across the Asian population. We performed haplotype phasing of the assembly with short reads, long reads and linked reads from whole-genome sequencing and with short reads from 31,719 BAC clones, thereby achieving phased blocks with an N50 size of 11.6 Mb. Haplotigs assembled from single-molecule real-time reads assigned to haplotypes on phased blocks covered 89% of genes. The haplotigs accurately characterized the hypervariable major histocompatability complex region as well as demonstrating allele configuration in clinically relevant genes such as CYP2D6. This work presents the most contiguous diploid human genome assembly so far, with extensive investigation of unreported and Asian-specific structural variants, and high-quality haplotyping of clinically relevant alleles for precision medicine.
Distinct genetic difference between the Duffy binding protein (PkDBPαII) of Plasmodium knowlesi clinical isolates from North Borneo and Peninsular Malaysia.

PubMed

Fong, Mun-Yik; Rashdi, Sarah A A; Yusof, Ruhani; Lau, Yee-Ling

2015-02-21

Plasmodium knowlesi is one of the monkey malaria parasites that can cause human malaria. The Duffy binding protein of P. knowlesi (PkDBPαII) is essential for the parasite's invasion into human and monkey erythrocytes. A previous study on P. knowlesi clinical isolates from Peninsular Malaysia reported high level of genetic diversity in the PkDBPαII. Furthermore, 36 amino acid haplotypes were identified and these haplotypes could be separated into allele group I and allele group II. In the present study, the PkDBPαII of clinical isolates from the Malaysian states of Sarawak and Sabah in North Borneo was investigated, and compared with the PkDBPαII of Peninsular Malaysia isolates. Blood samples from 28 knowlesi malaria patients were used. These samples were collected between 2011 and 2013 from hospitals in North Borneo. The PkDBPαII region of the isolates was amplified by PCR, cloned into Escherichia coli, and sequenced. The genetic diversity, natural selection and phylogenetics of PkDBPαII haplotypes were analysed using MEGA5 and DnaSP ver. 5.10.00 programmes. Forty-nine PkDBPαII sequences were obtained. Comparison at the nucleotide level against P. knowlesi strain H as reference sequence revealed 58 synonymous and 102 non-synonymous mutations. Analysis on these mutations showed that PkDBPαII was under purifying (negative) selection. At the amino acid level, 38 different PkDBPαII haplotypes were identified. Twelve of the 28 blood samples had mixed haplotype infections. Phylogenetic analysis revealed that all the haplotypes were in allele group I, but they formed a sub-group that was distinct from those of Peninsular Malaysia. Wright's FST fixation index indicated high genetic differentiation between the North Borneo and Peninsular Malaysia haplotypes. This study is the first to report the genetic diversity and natural selection of PkDBPαII of P. knowlesi from Borneo Island. The PkDBPαII haplotypes found in this study were distinct from those from Peninsular Malaysia. This difference may not be attributed to geographical separation because other genetic markers studied thus far such as the P. knowlesi circumsporozoite protein gene and small subunit ribosomal RNA do not display such differentiation. Immune evasion may possibly be the reason for the differentiation.
[Hereditary motor and sensory neuropathy with proximal dominant involvement (HMSN-P) is caused by a mutation in TFG].

PubMed

Ishiura, Hiroyuki; Tsuji, Shoji

2013-01-01

Hereditary motor and sensory neuropathy with proximal dominant involvement (HMSN-P) is an autosomal dominant neurodegenerative disease characterized by proximal predominant weakness and muscle atrophy accompanied by distal sensory disturbance. Linkage analysis using 4 families identified a region on chromosome 3 showing a LOD score exceeding 4. Further refinement of candidate region was performed by haplotype analysis using high-density SNP data, resulting in a minimum candidate region spanning 3.3 Mb. Exome analysis of an HMSN-P patient revealed a mutation (c.854C>T, p.Pro285Leu) in TRK-fused gene (TFG). The identical mutation was found in the four families, which cosegregated with the disease. The mutation was neither found in Japanese control subjects nor public databases. Detailed haplotype analysis suggested two independent origins of the mutation. These findings indicate that the mutation in TFG causes HMSN-P.
Database of Geoscientific References Through 2007 for Afghanistan, Version 2

USGS Publications Warehouse

Eppinger, Robert G.; Sipeki, Julianna; Scofield, M.L. Sco

2007-01-01

This report describes an accompanying database of geoscientific references for the country of Afghanistan. Included is an accompanying Microsoft? Access 2003 database of geoscientific references for the country of Afghanistan. The reference compilation is part of a larger joint study of Afghanistan's energy, mineral, and water resources, and geologic hazards, currently underway by the U.S. Geological Survey, the British Geological Survey, and the Afghanistan Geological Survey. The database includes both published (n = 2,462) and unpublished (n = 174) references compiled through September, 2007. The references comprise two separate tables in the Access database. The reference database includes a user-friendly, keyword-searchable, interface and only minimum knowledge of the use of Microsoft? Access is required.
The whole genome sequences and experimentally phased haplotypes of over 100 personal genomes.

PubMed

Mao, Qing; Ciotlos, Serban; Zhang, Rebecca Yu; Ball, Madeleine P; Chin, Robert; Carnevali, Paolo; Barua, Nina; Nguyen, Staci; Agarwal, Misha R; Clegg, Tom; Connelly, Abram; Vandewege, Ward; Zaranek, Alexander Wait; Estep, Preston W; Church, George M; Drmanac, Radoje; Peters, Brock A

2016-10-11

Since the completion of the Human Genome Project in 2003, it is estimated that more than 200,000 individual whole human genomes have been sequenced. A stunning accomplishment in such a short period of time. However, most of these were sequenced without experimental haplotype data and are therefore missing an important aspect of genome biology. In addition, much of the genomic data is not available to the public and lacks phenotypic information. As part of the Personal Genome Project, blood samples from 184 participants were collected and processed using Complete Genomics' Long Fragment Read technology. Here, we present the experimental whole genome haplotyping and sequencing of these samples to an average read coverage depth of 100X. This is approximately three-fold higher than the read coverage applied to most whole human genome assemblies and ensures the highest quality results. Currently, 114 genomes from this dataset are freely available in the GigaDB repository and are associated with rich phenotypic data; the remaining 70 should be added in the near future as they are approved through the PGP data release process. For reproducibility analyses, 20 genomes were sequenced at least twice using independent LFR barcoded libraries. Seven genomes were also sequenced using Complete Genomics' standard non-barcoded library process. In addition, we report 2.6 million high-quality, rare variants not previously identified in the Single Nucleotide Polymorphisms database or the 1000 Genomes Project Phase 3 data. These genomes represent a unique source of haplotype and phenotype data for the scientific community and should help to expand our understanding of human genome evolution and function.
Molecular analyses on host-seeking black flies (Diptera: Simuliidae) reveal a diverse assemblage of Leucocytozoon (Apicomplexa: Haemospororida) parasites in an alpine ecosystem.

PubMed

Murdock, Courtney C; Adler, Peter H; Frank, Jared; Perkins, Susan L

2015-06-25

Molecular studies have suggested that the true diversity of Leucocytozoon (Apicomplexa: Haemospororida) species well exceeds the approximately 35 currently described taxa. Further, the degree of host-specificity may vary substantially among lineages. Parasite distribution can be influenced by the ability of the parasite to infect a host, vector preferences for certain avian hosts, or other factors such as microhabitat requirements that increase the probability that vertebrate hosts and vectors are in frequent contact with each other. Whereas most studies of haemosporidians have focused on passerine hosts, sampling vectors in the same habitats may allow the detection of other lineages affecting other hosts. We sampled abundant, ornithophilic black flies (Simuliidae) across a variety of sites and habitats in the Colorado Rocky Mountains throughout the summer of 2007. Black flies were screened with PCR using Leucocytozoon-specific primers that amplify a portion of the cytochrome b gene, and the sequences were compared to the haplotypes in the MalAvi database. Infections of Leucocytozoon from birds sampled in the same area were also included. We recovered 33 unique haplotypes from the black flies in this study area, which represented a large phylogenetic diversity of Leucocytozoon parasites. However, there were no clear patterns of avian host species or geography for the distribution of Leucocytozoon haplotypes in the phylogeny. Sampling host-seeking vectors is a useful way to obtain a wide variety of avian haemosporidian haplotypes from a given area and may prove useful for understanding the global patterns of host, parasite, and vector associations of these ubiquitous and diverse parasites.
Association Between ADRB2 Genetic Polymorphisms and the Risk of Chronic Obstructive Pulmonary Disease: A Case-Control Study in a Chinese Population.

PubMed

Zhao, Hui; Wu, Xuan; Dong, Chun-Ling; Wang, Bi-Ying; Zhao, Jiao; Cao, Xian-E

2017-08-01

This study was designed to investigate the association between single nucleotide polymorphisms (SNPs) of the β2-adrenergic receptor (ADRB2) gene and the risk of chronic obstructive pulmonary disease (COPD) in a Chinese population. From January 2010 to October 2014, 261 COPD patients were selected as the case group and 239 healthy subjects were selected as the control group. Pulmonary function tests were performed to detect forced vital capacity (FVC), 1-s forced expiratory volume (FEV 1 ), and FEV 1 /FVC (%). rs1042711, rs1042714, and rs1042718 were selected as tagSNPs of the ADRB2 gene from the HapMap database in accordance with previous studies. The ADRB2 genotypes were established by real-time polymerase chain reaction assays using TaqMan-labeled probes. The relationships between the ADRB2 polymorphisms and COPD risk were estimated using logistic regression analyses. The frequency of the genotypes and alleles of rs1042711 in ADRB2 showed a significant difference between the COPD and control groups (p < 0.05); compared with the CC genotype, the non-CC genotypes showed an increased COPD risk (p = 0.002). Compared with the CC haplotype, the TG haplotype increased COPD risk, while the CG haplotype reduced COPD risk for normal individuals. Compared with the CC genotype, the TT genotype showed significantly lower FEV 1 and FEV 1 /FVC (p = 0.022, p = 0.0191, respectively). Both the TC and TG haplotypes showed lower FEV 1 and FEV 1 /FVC in comparison with the CC haplotype (both p < 0.05). The results of logistic regression analysis showed that rs1042711 of ADRB2 and smoking history were associated with COPD risk (both p < 0.05). It is indicated that the TT genotype of rs1042711 and smoking pack years are both risk factors for COPD.
The mitochondrial DNA makeup of Romanians: A forensic mtDNA control region database and phylogenetic characterization.

PubMed

Turchi, Chiara; Stanciu, Florin; Paselli, Giorgia; Buscemi, Loredana; Parson, Walther; Tagliabracci, Adriano

2016-09-01

To evaluate the pattern of Romanian population from a mitochondrial perspective and to establish an appropriate mtDNA forensic database, we generated a high-quality mtDNA control region dataset from 407 Romanian subjects belonging to four major historical regions: Moldavia, Transylvania, Wallachia and Dobruja. The entire control region (CR) was analyzed by Sanger-type sequencing assays and the resulting 306 different haplotypes were classified into haplogroups according to the most updated mtDNA phylogeny. The Romanian gene pool is mainly composed of West Eurasian lineages H (31.7%), U (12.8%), J (10.8%), R (10.1%), T (9.1%), N (8.1%), HV (5.4%),K (3.7%), HV0 (4.2%), with exceptions of East Asian haplogroup M (3.4%) and African haplogroup L (0.7%). The pattern of mtDNA variation observed in this study indicates that the mitochondrial DNA pool is geographically homogeneous across Romania and that the haplogroup composition reveals signals of admixture of populations of different origin. The PCA scatterplot supported this scenario, with Romania located in southeastern Europe area, close to Bulgaria and Hungary, and as a borderland with respect to east Mediterranean and other eastern European countries. High haplotype diversity (0.993) and nucleotide diversity indices (0.00838±0.00426), together with low random match probability (0.0087) suggest the usefulness of this control region dataset as a forensic database in routine forensic mtDNA analysis and in the investigation of maternal genetic lineages in the Romanian population. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Global diversity in the TAS2R38 bitter taste receptor: revisiting a classic evolutionary PROPosal

PubMed Central

Risso, Davide S.; Mezzavilla, Massimo; Pagani, Luca; Robino, Antonietta; Morini, Gabriella; Tofanelli, Sergio; Carrai, Maura; Campa, Daniele; Barale, Roberto; Caradonna, Fabio; Gasparini, Paolo; Luiselli, Donata; Wooding, Stephen; Drayna, Dennis

2016-01-01

The ability to taste phenylthiocarbamide (PTC) and 6-n-propylthiouracil (PROP) is a polymorphic trait mediated by the TAS2R38 bitter taste receptor gene. It has long been hypothesized that global genetic diversity at this locus evolved under pervasive pressures from balancing natural selection. However, recent high-resolution population genetic studies of TAS2Rs suggest that demographic events have played a critical role in the evolution of these genes. We here utilized the largest TAS2R38 database yet analyzed, consisting of 5,589 individuals from 105 populations, to examine natural selection, haplotype frequencies and linkage disequilibrium to estimate the effects of both selection and demography on contemporary patterns of variation at this locus. We found signs of an ancient balancing selection acting on this gene but no post Out-Of-Africa departures from neutrality, implying that the current observed patterns of variation can be predominantly explained by demographic, rather than selective events. In addition, we found signatures of ancient selective forces acting on different African TAS2R38 haplotypes. Collectively our results provide evidence for a relaxation of recent selective forces acting on this gene and a revised hypothesis for the origins of the present-day worldwide distribution of TAS2R38 haplotypes. PMID:27138342
Global diversity in the TAS2R38 bitter taste receptor: revisiting a classic evolutionary PROPosal.

PubMed

Risso, Davide S; Mezzavilla, Massimo; Pagani, Luca; Robino, Antonietta; Morini, Gabriella; Tofanelli, Sergio; Carrai, Maura; Campa, Daniele; Barale, Roberto; Caradonna, Fabio; Gasparini, Paolo; Luiselli, Donata; Wooding, Stephen; Drayna, Dennis

2016-05-03

The ability to taste phenylthiocarbamide (PTC) and 6-n-propylthiouracil (PROP) is a polymorphic trait mediated by the TAS2R38 bitter taste receptor gene. It has long been hypothesized that global genetic diversity at this locus evolved under pervasive pressures from balancing natural selection. However, recent high-resolution population genetic studies of TAS2Rs suggest that demographic events have played a critical role in the evolution of these genes. We here utilized the largest TAS2R38 database yet analyzed, consisting of 5,589 individuals from 105 populations, to examine natural selection, haplotype frequencies and linkage disequilibrium to estimate the effects of both selection and demography on contemporary patterns of variation at this locus. We found signs of an ancient balancing selection acting on this gene but no post Out-Of-Africa departures from neutrality, implying that the current observed patterns of variation can be predominantly explained by demographic, rather than selective events. In addition, we found signatures of ancient selective forces acting on different African TAS2R38 haplotypes. Collectively our results provide evidence for a relaxation of recent selective forces acting on this gene and a revised hypothesis for the origins of the present-day worldwide distribution of TAS2R38 haplotypes.
Genetic diversity of armored scales (Hemiptera: Diaspididae) and soft scales (Hemiptera: Coccidae) in Chile.

PubMed

Amouroux, P; Crochard, D; Germain, J-F; Correa, M; Ampuero, J; Groussier, G; Kreiter, P; Malausa, T; Zaviezo, T

2017-05-17

Scale insects (Sternorrhyncha: Coccoidea) are one of the most invasive and agriculturally damaging insect groups. Their management and the development of new control methods are currently jeopardized by the scarcity of identification data, in particular in regions where no large survey coupling morphological and DNA analyses have been performed. In this study, we sampled 116 populations of armored scales (Hemiptera: Diaspididae) and 112 populations of soft scales (Hemiptera: Coccidae) in Chile, over a latitudinal gradient ranging from 18°S to 41°S, on fruit crops, ornamental plants and trees. We sequenced the COI and 28S genes in each population. In total, 19 Diaspididae species and 11 Coccidae species were identified morphologically. From the 63 COI haplotypes and the 54 28S haplotypes uncovered, and using several DNA data analysis methods (Automatic Barcode Gap Discovery, K2P distance, NJ trees), up to 36 genetic clusters were detected. Morphological and DNA data were congruent, except for three species (Aspidiotus nerii, Hemiberlesia rapax and Coccus hesperidum) in which DNA data revealed highly differentiated lineages. More than 50% of the haplotypes obtained had no high-scoring matches with any of the sequences in the GenBank database. This study provides 63 COI and 54 28S barcode sequences for the identification of Coccoidea from Chile.
De Novo Assembly and Phasing of Dikaryotic Genomes from Two Isolates of Puccinia coronata f. sp. avenae, the Causal Agent of Oat Crown Rust

PubMed Central

Miller, Marisa E.; Zhang, Ying; Omidvar, Vahid; Sperschneider, Jana; Raley, Castle; Palmer, Jonathan M.; Garnica, Diana; Upadhyaya, Narayana; Rathjen, John; Taylor, Jennifer M.; Park, Robert F.; Dodds, Peter N.; Hirsch, Cory D.

2018-01-01

ABSTRACT Oat crown rust, caused by the fungus Pucinnia coronata f. sp. avenae, is a devastating disease that impacts worldwide oat production. For much of its life cycle, P. coronata f. sp. avenae is dikaryotic, with two separate haploid nuclei that may vary in virulence genotype, highlighting the importance of understanding haplotype diversity in this species. We generated highly contiguous de novo genome assemblies of two P. coronata f. sp. avenae isolates, 12SD80 and 12NC29, from long-read sequences. In total, we assembled 603 primary contigs for 12SD80, for a total assembly length of 99.16 Mbp, and 777 primary contigs for 12NC29, for a total length of 105.25 Mbp; approximately 52% of each genome was assembled into alternate haplotypes. This revealed structural variation between haplotypes in each isolate equivalent to more than 2% of the genome size, in addition to about 260,000 and 380,000 heterozygous single-nucleotide polymorphisms in 12SD80 and 12NC29, respectively. Transcript-based annotation identified 26,796 and 28,801 coding sequences for isolates 12SD80 and 12NC29, respectively, including about 7,000 allele pairs in haplotype-phased regions. Furthermore, expression profiling revealed clusters of coexpressed secreted effector candidates, and the majority of orthologous effectors between isolates showed conservation of expression patterns. However, a small subset of orthologs showed divergence in expression, which may contribute to differences in virulence between 12SD80 and 12NC29. This study provides the first haplotype-phased reference genome for a dikaryotic rust fungus as a foundation for future studies into virulence mechanisms in P. coronata f. sp. avenae. PMID:29463655
A Near-Complete Haplotype-Phased Genome of the Dikaryotic Wheat Stripe Rust Fungus Puccinia striiformis f. sp. tritici Reveals High Interhaplotype Diversity

PubMed Central

Sperschneider, Jana; Garnica, Diana P.; Miller, Marisa E.; Taylor, Jennifer M.; Dodds, Peter N.; Park, Robert F.

2018-01-01

ABSTRACT A long-standing biological question is how evolution has shaped the genomic architecture of dikaryotic fungi. To answer this, high-quality genomic resources that enable haplotype comparisons are essential. Short-read genome assemblies for dikaryotic fungi are highly fragmented and lack haplotype-specific information due to the high heterozygosity and repeat content of these genomes. Here, we present a diploid-aware assembly of the wheat stripe rust fungus Puccinia striiformis f. sp. tritici based on long reads using the FALCON-Unzip assembler. Transcriptome sequencing data sets were used to infer high-quality gene models and identify virulence genes involved in plant infection referred to as effectors. This represents the most complete Puccinia striiformis f. sp. tritici genome assembly to date (83 Mb, 156 contigs, N50 of 1.5 Mb) and provides phased haplotype information for over 92% of the genome. Comparisons of the phase blocks revealed high interhaplotype diversity of over 6%. More than 25% of all genes lack a clear allelic counterpart. When we investigated genome features that potentially promote the rapid evolution of virulence, we found that candidate effector genes are spatially associated with conserved genes commonly found in basidiomycetes. Yet, candidate effectors that lack an allelic counterpart are more distant from conserved genes than allelic candidate effectors and are less likely to be evolutionarily conserved within the P. striiformis species complex and Pucciniales. In summary, this haplotype-phased assembly enabled us to discover novel genome features of a dikaryotic plant-pathogenic fungus previously hidden in collapsed and fragmented genome assemblies. PMID:29463659
CGDSNPdb: a database resource for error-checked and imputed mouse SNPs.

PubMed

Hutchins, Lucie N; Ding, Yueming; Szatkiewicz, Jin P; Von Smith, Randy; Yang, Hyuna; de Villena, Fernando Pardo-Manuel; Churchill, Gary A; Graber, Joel H

2010-07-06

The Center for Genome Dynamics Single Nucleotide Polymorphism Database (CGDSNPdb) is an open-source value-added database with more than nine million mouse single nucleotide polymorphisms (SNPs), drawn from multiple sources, with genotypes assigned to multiple inbred strains of laboratory mice. All SNPs are checked for accuracy and annotated for properties specific to the SNP as well as those implied by changes to overlapping protein-coding genes. CGDSNPdb serves as the primary interface to two unique data sets, the 'imputed genotype resource' in which a Hidden Markov Model was used to assess local haplotypes and the most probable base assignment at several million genomic loci in tens of strains of mice, and the Affymetrix Mouse Diversity Genotyping Array, a high density microarray with over 600,000 SNPs and over 900,000 invariant genomic probes. CGDSNPdb is accessible online through either a web-based query tool or a MySQL public login. Database URL: http://cgd.jax.org/cgdsnpdb/
A -819 C/T polymorphism in the interleukin-10 promoter is associated with persistent HBV infection, but -1082 A/G and -592A/C polymorphisms are not: a meta-analysis.

PubMed

Ren, Hong; Zhang, Ting-Ting; Hu, Wen-Long

2015-03-01

Single-nucleotide polymorphisms (SNPs) in the interleukin-10 (IL10) gene promoter have been associated with persistent hepatitis B virus (HBV) infection. In particular, the -1082A/G, -819 C/T and -592 A/C polymorphisms have most often been implicated. We performed a meta-analysis of available data to determine the relative importance of these SNPs in persistent HBV infection. We searched available articles in NCBI PubMed, EMBASE, the Chinese National Knowledge Infrastructure (CNKI), and the Chinese Biomedical Literature Database (CBM) and identified 24 studies for inclusion in our meta-analysis. Our results indicated that the presence of the IL10 -819 C allele significantly increased the risk for persistent HBV infection (CC+CT vs. TT: OR = 1.283, 95 % CI 1.023-1.610, P = 0.031; C vs. T: OR = 1.183, 95 % CI 1.001-1.399, P = 0.049). Meanwhile, the -1082A/-819T/-592A haplotype (OR = 0.751, 95 % CI 0.640-0.881, P = 0.000) and the -1082A/-819C/-592C haplotype (OR = 1.568, 95 % CI 1.304-1.884, P = 0.000) were observed to be significantly associated with HBV disease progression in Asians. In contrast, the IL10 -1082A/G and -592A/C polymorphisms were not associated with an increased susceptibility to or outcome of HBV infection. Our meta-analysis supports the growing body of evidence that the presence of the IL10 -819 C/T polymorphism is associated with persistent HBV infection and that the -1082A/-819T/-592A haplotype and the -1082A/-819C/-592C haplotype are associated with HBV disease progression in Asians.
Single nucleotide polymorphism coverage and inference of N-acetyltransferase-2 acetylator phenotypes in wordwide population groups.

PubMed

Suarez-Kurtz, Guilherme; Fuchshuber-Moraes, Mateus; Struchiner, Claudio J; Parra, Esteban J

2016-08-01

Several algorithms have been proposed to reduce the genotyping effort and cost, while retaining the accuracy of N-acetyltransferase-2 (NAT2) phenotype prediction. Data from the 1000 Genomes (1KG) project and an admixed cohort of Black Brazilians were used to assess the accuracy of NAT2 phenotype prediction using algorithms based on paired single nucleotide polymorphisms (SNPs) (rs1041983 and rs1801280) or a tag SNP (rs1495741). NAT2 haplotypes comprising SNPs rs1801279, rs1041983, rs1801280, rs1799929, rs1799930, rs1208 and rs1799931 were assigned according to the arylamine N-acetyltransferases database. Contingency tables were used to visualize the agreement between the NAT2 acetylator phenotypes on the basis of these haplotypes versus phenotypes inferred by the prediction algorithms. The paired and tag SNP algorithms provided more than 96% agreement with the 7-SNP derived phenotypes in Europeans, East Asians, South Asians and Admixed Americans, but discordance of phenotype prediction occurred in 30.2 and 24.8% 1KG Africans and in 14.4 and 18.6% Black Brazilians, respectively. Paired SNP panel misclassification occurs in carriers of NATs haplotypes *13A (282T alone), *12B (282T and 803G), *6B (590A alone) and *14A (191A alone), whereas haplotype *14, defined by the 191A allele, is the major culprit of misclassification by the tag allele. Both the paired SNP and the tag SNP algorithms may be used, with economy of scale, to infer NAT2 acetylator phenotypes, including the ultra-slow phenotype, in European, East Asian, South Asian and American populations represented in the 1KG cohort. Both algorithms, however, perform poorly in populations of predominant African descent, including admixed African-Americans, African Caribbeans and Black Brazilians.
Tripping over emerging pathogens around the world: a phylogeographical approach for determining the epidemiology of Porcine circovirus-2 (PCV-2), considering global trading.

PubMed

Vidigal, Pedro M P; Mafra, Claudio L; Silva, Fernanda M F; Fietto, Juliana L R; Silva Júnior, Abelardo; Almeida, Márcia R

2012-01-01

Porcine circovirus-2 (PCV-2) is an emerging virus associated with a number of different syndromes in pigs known as Porcine Circovirus Associated Diseases (PCVAD). Since its identification and characterization in the early 1990s, PCV-2 has achieved a worldwide distribution, becoming endemic in most pig-producing countries, and is currently considered as the main cause of losses on pig farms. In this study, we analyzed the main routes of the spread of PCV-2 between pig-producing countries using phylogenetic and phylogeographical approaches. A search for PCV-2 genome sequences in GenBank was performed, and the 420 PCV-2 sequences obtained were grouped into haplotypes (group of sequences that showed 100% identity), based on the infinite sites model of genome evolution. A phylogenetic hypothesis was inferred by Bayesian Inference for the classification of viral strains and a haplotype network was constructed by Median Joining to predict the geographical distribution of and genealogical relationships between haplotypes. In order to establish an epidemiological and economic context in these analyses, we considered all information about PCV-2 sequences available in GenBank, including papers published on viral isolation, and live pig trading statistics available on the UN Comtrade database (http://comtrade.un.org/). In these analyses, we identified a strong correlation between the means of PCV-2 dispersal predicted by the haplotype network and the statistics on the international trading of live pigs. This correlation provides a new perspective on the epidemiology of PCV-2, highlighting the importance of the movement of animals around the world in the emergence of new pathogens, and showing the need for effective sanitary barriers when trading live animals. Copyright © 2011 Elsevier B.V. All rights reserved.

Genetic diversity and natural selection of Plasmodium knowlesi merozoite surface protein 1 paralog gene in Malaysia.

PubMed

Ahmed, Md Atique; Fauzi, Muh; Han, Eun-Taek

2018-03-14

Human infections due to the monkey malaria parasite Plasmodium knowlesi is on the rise in most Southeast Asian countries specifically Malaysia. The C-terminal 19 kDa domain of PvMSP1P is a potential vaccine candidate, however, no study has been conducted in the orthologous gene of P. knowlesi. This study investigates level of polymorphisms, haplotypes and natural selection of full-length pkmsp1p in clinical samples from Malaysia. A total of 36 full-length pkmsp1p sequences along with the reference H-strain and 40 C-terminal pkmsp1p sequences from clinical isolates of Malaysia were downloaded from published genomes. Genetic diversity, polymorphism, haplotype and natural selection were determined using DnaSP 5.10 and MEGA 5.0 software. Genealogical relationships were determined using haplotype network tree in NETWORK software v5.0. Population genetic differentiation index (F ST ) and population structure of parasite was determined using Arlequin v3.5 and STRUCTURE v2.3.4 software. Comparison of 36 full-length pkmsp1p sequences along with the H-strain identified 339 SNPs (175 non-synonymous and 164 synonymous substitutions). The nucleotide diversity across the full-length gene was low compared to its ortholog pvmsp1p. The nucleotide diversity was higher toward the N-terminal domains (pkmsp1p-83 and 30) compared to the C-terminal domains (pkmsp1p-38, 33 and 19). Phylogenetic analysis of full-length genes identified 2 distinct clusters of P. knowlesi from Malaysian Borneo. The 40 pkmsp1p-19 sequences showed low polymorphisms with 16 polymorphisms leading to 18 haplotypes. In total there were 10 synonymous and 6 non-synonymous substitutions and 12 cysteine residues were intact within the two EGF domains. Evidence of strong purifying selection was observed within the full-length sequences as well in all the domains. Shared haplotypes of 40 pkmsp1p-19 were identified within Malaysian Borneo haplotypes. This study is the first to report on the genetic diversity and natural selection of pkmsp1p. A low level of genetic diversity and strong evidence of negative selection was detected and observed in all the domains of pkmsp1p of P. knowlesi indicating functional constrains. Shared haplotypes were identified within pkmsp1p-19 highlighting further evaluation using larger number of clinical samples from Malaysia.
Ultra-high density intra-specific genetic linkage maps accelerate identification of functionally relevant molecular tags governing important agronomic traits in chickpea

PubMed Central

Kujur, Alice; Upadhyaya, Hari D.; Shree, Tanima; Bajaj, Deepak; Das, Shouvik; Saxena, Maneesha S.; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C. L. L.; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.

2015-01-01

We discovered 26785 and 16573 high-quality SNPs differentiating two parental genotypes of a RIL mapping population using reference desi and kabuli genome-based GBS assay. Of these, 3625 and 2177 SNPs have been integrated into eight desi and kabuli chromosomes, respectively in order to construct ultra-high density (0.20–0.37 cM) intra-specific chickpea genetic linkage maps. One of these constructed high-resolution genetic map has potential to identify 33 major genomic regions harbouring 35 robust QTLs (PVE: 17.9–39.7%) associated with three agronomic traits, which were mapped within <1 cM mean marker intervals on desi chromosomes. The extended LD (linkage disequilibrium) decay (~15 cM) in chromosomes of genetic maps have encouraged us to use a rapid integrated approach (comparative QTL mapping, QTL-region specific haplotype/LD-based trait association analysis, expression profiling and gene haplotype-based association mapping) rather than a traditional QTL map-based cloning method to narrow-down one major seed weight (SW) robust QTL region. It delineated favourable natural allelic variants and superior haplotype-containing one seed-specific candidate embryo defective gene regulating SW in chickpea. The ultra-high-resolution genetic maps, QTLs/genes and alleles/haplotypes-related genomic information generated and integrated strategy for rapid QTL/gene identification developed have potential to expedite genomics-assisted breeding applications in crop plants, including chickpea for their genetic enhancement. PMID:25942004
Haplotype diversity and linkage disequilibrium at DRD2 locus--a study on four population groups of Andhra Pradesh, India.

PubMed

Saraswathy, Kallur Nava; Mukhopadhyay, Rupak; Shukla, Deepti; Kaur, Harpreet; Sachdeva, Mohinder Pal; Rao, A P; Saksena, Deepti; Kalla, Aloke Kumar

2009-02-01

Dopamine receptor D2 (DRD2) is expressed in the central nervous system and has a high affinity for many antipsychotic drugs. Besides several epidemiological investigations on association of DRD2 locus polymorphism(s) with neuropsychiatric problems and addictive behavior, a few polymorphisms in this locus have also been used to understand genomic diversity and population migratory histories globally. The present study attempts to understand the genomic diversity/affinity among four endogamous groups of Andhra Pradesh (India) against the backdrop of diversity studies from other parts of India and the rest of the world, with special reference to DRD2 locus. The four population groups from Adilabad District of Andhra Pradesh, namely, Brahmin (n=50), Nayakpod (n=49), Thoti (n=52), and Kolam (n=53), were included in the study. The DRD2 markers typed for the present study are three biallelic restriction fragments, that is, TaqI A (rs1800497), TaqI B (rs1079597), and TaqI D (rs1800498). Scoring of DRD2 haplotypes with respect to the three TaqI sites shows that five out of eight possible haplotypes are shared by the four populations. Ancestral haplotype B2D2A1 is most frequent among Thotis (0.359). The results of the present study indicate a differential gene flow into South India followed by certain important demographic events resulting in diversified peopling of India.
Genetic diversity and natural selection of Plasmodium vivax multi-drug resistant gene (pvmdr1) in Mesoamerica.

PubMed

González-Cerón, Lilia; Montoya, Alberto; Corzo-Gómez, Josselin C; Cerritos, Rene; Santillán, Frida; Sandoval, Marco A

2017-07-01

The Plasmodium vivax multidrug resistant 1 gene (pvmdr1) codes for a transmembrane protein of the parasite's digestive vacuole. It is likely that the pvmdr1 gene mutations occur at different sites by convergent evolution. In here, the genetic variation of pvmdr1 at three sites of the Mesoamerican region was studied. Since 1950s, malarious patients of those areas have been treated only with chloroquine and primaquine. Blood samples from patients infected with P. vivax were obtained in southern Mexico (SMX), in the Northwest (NIC-NW) and in the northeast (NIC-NE) of Nicaragua. Genomic DNA was obtained and fragments of pvmdr1 were amplified and sequenced. The nucleotide and amino acid changes as well as the haplotype frequency in pvmdr1 were determined per strain and per geographic site. The sequences of pvmdr1 obtained from the studied regions were compared with homologous sequences from the GenBank database to explore the P. vivax genetic structure. In 141 parasites, eight nucleotide changes (two changes were synonymous and other six were nonsynonymous) were detected in 1536 bp. The PvMDR1 amino acid changes Y976F, F1076FL were predominant in endemic parasites from NIC-NE and outbreak parasites in NIC-NW but absent in SMX. Thirteen haplotypes were resolved, and found to be closely related, but their frequency at each geographic site was different (P = 0.0001). The pvmdr1 codons 925-1083 gene fragment showed higher genetic and haplotype diversity in parasites from NIC-NE than the other areas outside Latin America. The haplotype networks suggested local diversification of pvmdr1 and no significant departure from neutrality. The F ST values were low to moderate regionally, but high between NIC-NE or NIC-NW and other regions inside and outside Latin America. The pvmdr1 gene might have diversified recently at regional level. In the absence of significant natural, genetic drift might have caused differential pvmdr1 haplotype frequencies at different geographic sites in Mesoamerica. A very recent expansion of divergent pvmdr1 haplotypes in NIC-NE/NIC-NW produced high differentiation between these and parasites from other sites including SMX. These data are useful to set a baseline for epidemiological surveillance.
Evolutionary history of Mexican domesticated and wild Meleagris gallopavo.

PubMed

Padilla-Jacobo, Gabriela; Cano-Camacho, Horacio; López-Zavala, Rigoberto; Cornejo-Pérez, María E; Zavala-Páramo, María G

2018-04-17

The distribution of the wild turkey (Meleagris gallopavo) extends from Mexico to southeastern Canada and to the eastern and southern regions of the USA. Six subspecies have been described based on morphological characteristics and/or geographical variations in wild and domesticated populations. In this paper, based on DNA sequence data from the mitochondrial D-loop, we investigated the genetic diversity and structure, genealogical relationships, divergence time and demographic history of M. gallopavo populations including domesticated individuals. Analyses of 612 wild and domesticated turkey mitochondrial D-loop sequences, including 187 that were collected for this study and 425 from databases, revealed 64 haplotypes with few mutations, some of which are shared between domesticated and wild turkeys. We found a high level of haplotype and nucleotide diversity, which suggests that the total population of this species is large and stable with an old evolutionary history. The results of genetic differentiation, haplotype network, and genealogical relationships analyses revealed three main genetic groups within the species: mexicana as a population relict (C1), merriami (C2), and mexicana/intermedia/silvestris/osceola (C3). Haplotypes detected in domesticated turkeys belong to group C3. Estimates of divergence times agree with range expansion and diversification events of the relict population of M. gallopavo in northwestern Mexico during the Pliocene-Pleistocene and Pleistocene-Holocene boundaries. Demographic reconstruction showed that an expansion of the population occurred 110,000 to 130,000 years ago (Kya), followed by a stable period 100 Kya and finally a decline ~ 10 Kya (Pleistocene-Holocene boundary). In Mexico, the Trans-Mexican Volcanic Belt may be responsible for the range expansion of the C3 group. Two haplotypes with different divergence times, MGMDgoB/MICH1 and MICH2, are dominant in domesticated and commercial turkeys. During the Pleistocene, a large and stable population of M. gallopavo covered a wide geographic distribution from the north to the center of America (USA and Mexico). The mexicana, merriami, and mexicana/intermedia/silvestris/osceola genetic groups originated after divergence and range expansion from northwestern Mexico during the Pliocene-Pleistocene and Pleistocene-Holocene boundaries. Old and new maternal lines of the mexicana/intermedia/silvestris/osceola genetic group were distributed within the Trans-Mexican Volcanic Belt where individuals were captured for domestication. Two haplotypes are the main founder maternal lines of domesticated turkeys.
Acute systemic effects of inhaled salbutamol in asthmatic subjects expressing common homozygous beta2-adrenoceptor haplotypes at positions 16 and 27.

PubMed

Lee, Daniel K C; Bates, Caroline E; Lipworth, Brian J

2004-01-01

The relationship between beta2-adrenoceptor polymorphisms at positions 16 and 27, and the acute systemic beta2-adrenoceptor effects of inhaled salbutamol is unclear. We therefore elected to evaluate the influence of common homozygous beta2-adrenoceptor haplotypes on the acute systemic beta2-adrenoceptor effects following inhaled salbutamol in asthmatic subjects. An initial database search of 531 asthmatic subjects identified the two commonest homozygous haplotypes at positions 16 and 27 to be Arg16-Gln27 (12%) and Gly16-Glu27 (19%). After a 1-week washout period where all beta2-adrenoceptor agonists were withdrawn, 16 Caucasian subjects (Arg16-Gln27: n = 8 and Gly16-Glu27: n = 8) were given a single dose of inhaled salbutamol (1200 microg), followed by serial blood sampling for serum potassium, along with measurements of diastolic blood pressure and heart rate, at 5-min intervals for 20 min. The two groups were well matched for age, sex, FEV1, and inhaled corticosteroid dose. Baseline values for serum potassium, diastolic blood pressure and heart rate were not significantly different comparing Arg16-Gln27 vs Gly16-Glu27. The mean +/- SEM maximum serum potassium change from baseline over 20 min was significantly greater (P = 0.04) for Arg16-Gln27: -0.37 +/- 0.05 mmol l(-1) vs Gly16-Glu27: -0.23 +/- 0.04 mmol l(-1); 95% CI for difference: -0.01 to -0.28 mmol l(-1). The maximum diastolic blood pressure change from baseline over 20 min was significantly greater (P = 0.0008) for Arg16-Gln27: -13 +/- 1 mmHg vs Gly16-Glu27: -4 +/- 2 mmHg; 95% CI for difference: -5, 14 mmHg. There was no significant difference comparing the maximum heart rate change from baseline for Arg16-Gln27: 10 +/- 3 beats min(-1) vs Gly16-Glu27: 10 +/- 3 beats min(-1). Caucasian asthmatic subjects with the Arg16-Gln27 haplotype exhibited a greater systemic response to inhaled salbutamol, compared with those with the Gly16-Glu27 haplotype. The attenuated beta2-adrenoceptor response in the Gly16-Glu27 haplotype would be in keeping with increased susceptibility to prior down-regulation by endogenous catecholamines.
The GHEP–EMPOP collaboration on mtDNA population data—A new resource for forensic casework

PubMed Central

Prieto, L.; Zimmermann, B.; Goios, A.; Rodriguez-Monge, A.; Paneto, G.G.; Alves, C.; Alonso, A.; Fridman, C.; Cardoso, S.; Lima, G.; Anjos, M.J.; Whittle, M.R.; Montesino, M.; Cicarelli, R.M.B.; Rocha, A.M.; Albarrán, C.; de Pancorbo, M.M.; Pinheiro, M.F.; Carvalho, M.; Sumita, D.R.; Parson, W.

2011-01-01

Mitochondrial DNA (mtDNA) population data for forensic purposes are still scarce for some populations, which may limit the evaluation of forensic evidence especially when the rarity of a haplotype needs to be determined in a database search. In order to improve the collection of mtDNA lineages from the Iberian and South American subcontinents, we here report the results of a collaborative study involving nine laboratories from the Spanish and Portuguese Speaking Working Group of the International Society for Forensic Genetics (GHEP-ISFG) and EMPOP. The individual laboratories contributed population data that were generated throughout the past 10 years, but in the majority of cases have not been made available to the scientific community. A total of 1019 haplotypes from Iberia (Basque Country, 2 general Spanish populations, 2 North and 1 Central Portugal populations), and Latin America (3 populations from São Paulo) were collected, reviewed and harmonized according to defined EMPOP criteria. The majority of data ambiguities that were found during the reviewing process (41 in total) were transcription errors confirming that the documentation process is still the most error-prone stage in reporting mtDNA population data, especially when performed manually. This GHEP–EMPOP collaboration has significantly improved the quality of the individual mtDNA datasets and adds mtDNA population data as valuable resource to the EMPOP database (www.empop.org). PMID:21075696
Taxonomic review of Argentine mackerel Scomber japonicus (Houttuyn, 1782) by phylogenetic analysis

PubMed Central

Trucco, María Inés; Buratti, Claudio César

2017-01-01

Taxonomically, Argentine mackerels were first considered as Scomber japonicus marplatensis and later as Scomber japonicus Houttuyn 1782, although, in the last years, different studies have suggested that South Atlantic mackerel species belongs to Scomber colias Gmelin 1789. These latter results, incorporated in the main fish databases (FishBase and Catalog of Fishes), promoted a phylogenetic study using cytochrome c oxidase I (COI) gene sequences taken from the Barcode of Life (FISH-BOL) database. Thus, 76 sequences of S. japonicus, S. colias, S. australasicus and S. scombrus from different regions were used; including 3 from Sarda sarda as outgroup. Among S. japonicus selected sequences are those corresponding to the Argentine mackerels collected in 2007. Phylogenetic trees were obtained by neighbor joining and maximum likelihood methods and a network of haplotypes was reconstructed to analyze the relationship between species. The results showed the clear differentiation of S. australasicus, S. scombrus and S. japonicus from the Pacific while S. japonicus from Argentina was included in the S. colias group, with genetic differences corresponding to conspecific populations (0.1%). Four of the five Argentine specimens shared the same haplotype with S. colias, and none were shared with S. japonicus from the Pacific. These results suggest that the current specific name of Argentine mackerel S. japonicus should be changed to S. colias, in agreement with several genetic studies carried out with species of the genus Scomber. PMID:29071283
Surveying the Maize community for their diversity and pedigree visualization needs to prioritize tool development and curation

PubMed Central

Braun, Bremen L.; Schott, David A.; Portwood, II, John L.; Schaeffer, Mary L.; Harper, Lisa C.; Gardiner, Jack M.; Cannon, Ethalinda K.; Andorf, Carson M.

2017-01-01

Abstract The Maize Genetics and Genomics Database (MaizeGDB) team prepared a survey to identify breeders’ needs for visualizing pedigrees, diversity data and haplotypes in order to prioritize tool development and curation efforts at MaizeGDB. The survey was distributed to the maize research community on behalf of the Maize Genetics Executive Committee in Summer 2015. The survey garnered 48 responses from maize researchers, of which more than half were self-identified as breeders. The survey showed that the maize researchers considered their top priorities for visualization as: (i) displaying single nucleotide polymorphisms in a given region for a given list of lines, (ii) showing haplotypes for a given list of lines and (iii) presenting pedigree relationships visually. The survey also asked which populations would be most useful to display. The following two populations were on top of the list: (i) 3000 publicly available maize inbred lines used in Romay et al. (Comprehensive genotyping of the USA national maize inbred seed bank. Genome Biol, 2013;14:R55) and (ii) maize lines with expired Plant Variety Protection Act (ex-PVP) certificates. Driven by this strong stakeholder input, MaizeGDB staff are currently working in four areas to improve its interface and web-based tools: (i) presenting immediate progenies of currently available stocks at the MaizeGDB Stock pages, (ii) displaying the most recent ex-PVP lines described in the Germplasm Resources Information Network (GRIN) on the MaizeGDB Stock pages, (iii) developing network views of pedigree relationships and (iv) visualizing genotypes from SNP-based diversity datasets. These survey results can help other biological databases to direct their efforts according to user preferences as they serve similar types of data sets for their communities. Database URL: https://www.maizegdb.org PMID:28605768
A User-Friendly, Keyword-Searchable Database of Geoscientific References Through 2007 for Afghanistan

USGS Publications Warehouse

Eppinger, Robert G.; Sipeki, Julianna; Scofield, M.L. Sco

2008-01-01

This report includes a document and accompanying Microsoft Access 2003 database of geoscientific references for the country of Afghanistan. The reference compilation is part of a larger joint study of Afghanistan?s energy, mineral, and water resources, and geologic hazards currently underway by the U.S. Geological Survey, the British Geological Survey, and the Afghanistan Geological Survey. The database includes both published (n = 2,489) and unpublished (n = 176) references compiled through calendar year 2007. The references comprise two separate tables in the Access database. The reference database includes a user-friendly, keyword-searchable interface and only minimum knowledge of the use of Microsoft Access is required.
Genetic Diversity and Distribution of Blastocystis Subtype 3 in Human Populations, with Special Reference to a Rural Population in Central Mexico

PubMed Central

Serrano-Vázquez, Angélica; Pérez-Juárez, Horacio; Poot-Hernández, Augusto C.; González, Enrique; Hernández, Eric; Nieves-Ramírez, Miriam E.; Magaña, Ulises; Eguiarte, Luis E.; Piñero, Daniel

2018-01-01

Blastocystis subtype 3 (ST3) is a parasitic protist found in the digestive tract of symptomatic and asymptomatic humans around the world. While this parasite exhibits a high prevalence in the human population, its true geographic distribution and global genetic diversity are still unknown. This gap in knowledge limits the understanding of the spread mechanisms, epidemiology, and impact that this parasite has on human populations. Herein, we provided new data on the geographical distribution and genetic diversity of Blastocystis ST3 from a rural human population in Mexico. To do so, we collected and targeted the SSU-rDNA region in fecal samples from this population and further compared its genetic diversity and structure with that previously observed in populations of Blastocystis ST3 from other regions of the planet. Our analyses reveled that diversity of Blastocystis ST3 showed a high haplotype diversity and genetic structure to the world level; however, they were low in the Morelos population. The haplotype network revealed a common widespread haplotype from which the others were generated recently. Finally, our results suggested a recent expansion of the diversity of Blastocystis ST3 worldwide. PMID:29744356
Genetic Diversity and Distribution of Blastocystis Subtype 3 in Human Populations, with Special Reference to a Rural Population in Central Mexico.

PubMed

Rojas-Velázquez, Liliana; Morán, Patricia; Serrano-Vázquez, Angélica; Fernández, Leonardo D; Pérez-Juárez, Horacio; Poot-Hernández, Augusto C; Portillo, Tobías; González, Enrique; Hernández, Eric; Partida-Rodríguez, Oswaldo; Nieves-Ramírez, Miriam E; Magaña, Ulises; Torres, Javier; Eguiarte, Luis E; Piñero, Daniel; Ximénez, Cecilia

2018-01-01

Blastocystis subtype 3 (ST3) is a parasitic protist found in the digestive tract of symptomatic and asymptomatic humans around the world. While this parasite exhibits a high prevalence in the human population, its true geographic distribution and global genetic diversity are still unknown. This gap in knowledge limits the understanding of the spread mechanisms, epidemiology, and impact that this parasite has on human populations. Herein, we provided new data on the geographical distribution and genetic diversity of Blastocystis ST3 from a rural human population in Mexico. To do so, we collected and targeted the SSU-rDNA region in fecal samples from this population and further compared its genetic diversity and structure with that previously observed in populations of Blastocystis ST3 from other regions of the planet. Our analyses reveled that diversity of Blastocystis ST3 showed a high haplotype diversity and genetic structure to the world level; however, they were low in the Morelos population. The haplotype network revealed a common widespread haplotype from which the others were generated recently. Finally, our results suggested a recent expansion of the diversity of Blastocystis ST3 worldwide.
Cystic fibrosis mutations in North American populations of French ancestry: Analysis of Quebec French-Canadian and Louisiana Acadian families

PubMed Central

Rozen, Rima; Schwartz, Robert H.; Hilman, Bettina C.; Stanislovitis, Pat; Horn, Glenn T.; Klinger, Katherine; Daigneault, Jocelyne; De Braekeleer, Marc; Kerem, Bat-sheva; Tsui, Lap-Chee; Fujiwara, T. Mary; Morgan, Kenneth

1990-01-01

A 3-bp deletion (ΔF508) in the cystic fibrosis (CF) gene is the mutation on the majority of CF chromosomes. We studied 112 CF families from North American populations of French ancestry: French-Canadian families referred from hospitals in three cities in Quebec and from the Saguenay-Lac St. Jean region of northeastern Quebec and Acadian families living in Louisiana. ΔF508 was present on 71%, 55%, and 70% of the CF chromosomes from the major-urban Quebec, Saguenay-Lac St. Jean, and Louisiana Acadian families, respectively. A weighted estimate of the proportion of ΔF508 in the French-Canadian patient population of Quebec was 70%. We found that 95% of the CF chromosomes with ΔF508 had D7S23 haplotype B, the most frequent haplotype on CF chromosomes. In the Saguenay-Lac St. Jean families, 86% of the CF chromosomes without ΔF508 had the B haplotype, compared with 31% for the major-urban Quebec and Louisiana Acadian families. The incidence of CF in the Saguenay-Lac St. Jean population was 1/895 live-born infants. PMID:2220803
DNA Barcode Sequence Identification Incorporating Taxonomic Hierarchy and within Taxon Variability

PubMed Central

Little, Damon P.

2011-01-01

For DNA barcoding to succeed as a scientific endeavor an accurate and expeditious query sequence identification method is needed. Although a global multiple–sequence alignment can be generated for some barcoding markers (e.g. COI, rbcL), not all barcoding markers are as structurally conserved (e.g. matK). Thus, algorithms that depend on global multiple–sequence alignments are not universally applicable. Some sequence identification methods that use local pairwise alignments (e.g. BLAST) are unable to accurately differentiate between highly similar sequences and are not designed to cope with hierarchic phylogenetic relationships or within taxon variability. Here, I present a novel alignment–free sequence identification algorithm–BRONX–that accounts for observed within taxon variability and hierarchic relationships among taxa. BRONX identifies short variable segments and corresponding invariant flanking regions in reference sequences. These flanking regions are used to score variable regions in the query sequence without the production of a global multiple–sequence alignment. By incorporating observed within taxon variability into the scoring procedure, misidentifications arising from shared alleles/haplotypes are minimized. An explicit treatment of more inclusive terminals allows for separate identifications to be made for each taxonomic level and/or for user–defined terminals. BRONX performs better than all other methods when there is imperfect overlap between query and reference sequences (e.g. mini–barcode queries against a full–length barcode database). BRONX consistently produced better identifications at the genus–level for all query types. PMID:21857897
Mitochondrial DNA sequences of 37 collar-spined echinostomes (Digenea: Echinostomatidae) in Thailand and Lao PDR reveals presence of two species: Echinostoma revolutum and E. miyagawai.

PubMed

Nagataki, Mitsuru; Tantrawatpan, Chairat; Agatsuma, Takeshi; Sugiura, Tetsuro; Duenngai, Kunyarat; Sithithaworn, Paiboon; Andrews, Ross H; Petney, Trevor N; Saijuntha, Weerachai

2015-10-01

The "37 collar-spined" or "revolutum" group of echinostomes is recognized as a species complex. The identification of members of this complex by morphological taxonomic characters is difficult and confusing, and hence, molecular analyses are a useful alternative method for molecular systematic studies. The current study examined the genetic diversity of those 37 collar-spined echinostomes which are recognized morphologically as Echinostoma revolutum in Thailand and Lao PDR using the cytochrome c oxidase subunit 1 (CO1) and the NADH dehydrogenase subunit 1 (ND1) sequences. On the basis of molecular investigations, at least two species of 37 collar-spined echinostomes exist in Southeast Asia, namely E. revolutum and Echinostoma miyagawai. The specimens examined in this study, coming from ducks in Thailand and Lao PDR, were compared to isolates from America, Europe and Australia for which DNA sequences are available in public databases. Haplotype analysis detected 6 and 26 haplotypes when comparing the CO1 sequences of E. revolutum and E. miyagawai, respectively, from different geographical isolates from Thailand and Lao PDR. The phylogenetic trees, ND1 haplotype network and genetic differentiation (ɸST) analyses showed that E. revolutum were genetically different on a continental scale, i.e. Eurasian and American lineages. Copyright © 2015 Elsevier B.V. All rights reserved.
RTS,S/AS01 malaria vaccine mismatch observed among Plasmodium falciparum isolates from southern and central Africa and globally.

PubMed

Pringle, Julia C; Carpi, Giovanna; Almagro-Garcia, Jacob; Zhu, Sha Joe; Kobayashi, Tamaki; Mulenga, Modest; Bobanga, Thierry; Chaponda, Mike; Moss, William J; Norris, Douglas E

2018-04-26

The RTS,S/AS01 malaria vaccine encompasses the central repeats and C-terminal of Plasmodium falciparum circumsporozoite protein (PfCSP). Although no Phase II clinical trial studies observed evidence of strain-specific immunity, recent studies show a decrease in vaccine efficacy against non-vaccine strain parasites. In light of goals to reduce malaria morbidity, anticipating the effectiveness of RTS,S/AS01 is critical to planning widespread vaccine introduction. We deep sequenced C-terminal Pfcsp from 77 individuals living along the international border in Luapula Province, Zambia and Haut-Katanga Province, the Democratic Republic of the Congo (DRC) and compared translated amino acid haplotypes to the 3D7 vaccine strain. Only 5.2% of the 193 PfCSP sequences from the Zambia-DRC border region matched 3D7 at all 84 amino acids. To further contextualize the genetic diversity sampled in this study with global PfCSP diversity, we analyzed an additional 3,809 Pfcsp sequences from the Pf3k database and constructed a haplotype network representing 15 countries from Africa and Asia. The diversity observed in our samples was similar to the diversity observed in the global haplotype network. These observations underscore the need for additional research assessing genetic diversity in P. falciparum and the impact of PfCSP diversity on RTS,S/AS01 efficacy.
Chapter 4 - The LANDFIRE Prototype Project reference database

Treesearch

John F. Caratti

2006-01-01

This chapter describes the data compilation process for the Landscape Fire and Resource Management Planning Tools Prototype Project (LANDFIRE Prototype Project) reference database (LFRDB) and explains the reference data applications for LANDFIRE Prototype maps and models. The reference database formed the foundation for all LANDFIRE tasks. All products generated by the...
Nucleotide and amino acid variations of tannase gene from different Aspergillus strains.

PubMed

Borrego-Terrazas, J A; Lara-Victoriano, F; Flores-Gallegos, A C; Veana, F; Aguilar, C N; Rodríguez-Herrera, R

2014-08-01

Tannase is an enzyme that catalyses the hydrolysis of ester bonds present in tannins. Most of the scientific reports about this biocatalysis focus on aspects related to tannase production and its recovery; on the other hand, reports assessing the molecular aspects of the tannase gene or protein are scarce. In the present study, a tannase gene fragment from several Aspergillus strains isolated from the Mexican semidesert was sequenced and compared with tannase amino acid sequences reported in NCBI database using bioinformatics tools. The genetic relationship among the different tannase sequences was also determined. A conserved region of 7 amino acids was found with the conserved motif GXSXG common to esterases, in which the active-site serine residue is located. In addition, in Aspergillus niger strains GH1 and PSH, we found an extra codon in the tannase sequences encoding glycine. The tannase gene belonging to semidesert fungal strains followed a neutral evolution path with the formation of 10 haplotypes, of which A. niger GH1 and PSH haplotypes are the oldest.
Normative Databases for Imaging Instrumentation.

PubMed

Realini, Tony; Zangwill, Linda M; Flanagan, John G; Garway-Heath, David; Patella, Vincent M; Johnson, Chris A; Artes, Paul H; Gaddie, Ian B; Fingeret, Murray

2015-08-01

To describe the process by which imaging devices undergo reference database development and regulatory clearance. The limitations and potential improvements of reference (normative) data sets for ophthalmic imaging devices will be discussed. A symposium was held in July 2013 in which a series of speakers discussed issues related to the development of reference databases for imaging devices. Automated imaging has become widely accepted and used in glaucoma management. The ability of such instruments to discriminate healthy from glaucomatous optic nerves, and to detect glaucomatous progression over time is limited by the quality of reference databases associated with the available commercial devices. In the absence of standardized rules governing the development of reference databases, each manufacturer's database differs in size, eligibility criteria, and ethnic make-up, among other key features. The process for development of imaging reference databases may be improved by standardizing eligibility requirements and data collection protocols. Such standardization may also improve the degree to which results may be compared between commercial instruments.
Normative Databases for Imaging Instrumentation

PubMed Central

Realini, Tony; Zangwill, Linda; Flanagan, John; Garway-Heath, David; Patella, Vincent Michael; Johnson, Chris; Artes, Paul; Ben Gaddie, I.; Fingeret, Murray

2015-01-01

Purpose To describe the process by which imaging devices undergo reference database development and regulatory clearance. The limitations and potential improvements of reference (normative) data sets for ophthalmic imaging devices will be discussed. Methods A symposium was held in July 2013 in which a series of speakers discussed issues related to the development of reference databases for imaging devices. Results Automated imaging has become widely accepted and used in glaucoma management. The ability of such instruments to discriminate healthy from glaucomatous optic nerves, and to detect glaucomatous progression over time is limited by the quality of reference databases associated with the available commercial devices. In the absence of standardized rules governing the development of reference databases, each manufacturer’s database differs in size, eligibility criteria, and ethnic make-up, among other key features. Conclusions The process for development of imaging reference databases may be improved by standardizing eligibility requirements and data collection protocols. Such standardization may also improve the degree to which results may be compared between commercial instruments. PMID:25265003

Searching for religion and mental health studies required health, social science, and grey literature databases.

PubMed

Wright, Judy M; Cottrell, David J; Mir, Ghazala

2014-07-01

To determine the optimal databases to search for studies of faith-sensitive interventions for treating depression. We examined 23 health, social science, religious, and grey literature databases searched for an evidence synthesis. Databases were prioritized by yield of (1) search results, (2) potentially relevant references identified during screening, (3) included references contained in the synthesis, and (4) included references that were available in the database. We assessed the impact of databases beyond MEDLINE, EMBASE, and PsycINFO by their ability to supply studies identifying new themes and issues. We identified pragmatic workload factors that influence database selection. PsycINFO was the best performing database within all priority lists. ArabPsyNet, CINAHL, Dissertations and Theses, EMBASE, Global Health, Health Management Information Consortium, MEDLINE, PsycINFO, and Sociological Abstracts were essential for our searches to retrieve the included references. Citation tracking activities and the personal library of one of the research teams made significant contributions of unique, relevant references. Religion studies databases (Am Theo Lib Assoc, FRANCIS) did not provide unique, relevant references. Literature searches for reviews and evidence syntheses of religion and health studies should include social science, grey literature, non-Western databases, personal libraries, and citation tracking activities. Copyright © 2014 Elsevier Inc. All rights reserved.
Germline variations at JAK2, TERT, HBS1L-MYB and MECOM and the risk of myeloproliferative neoplasms in Taiwanese population

PubMed Central

Chiang, Yi-Hao; Chang, Yu-Cheng; Lin, Huan-Chau; Huang, Ling; Cheng, Chun-Chia; Wang, Wei-Ting; Cheng, Hung-I; Su, Nai-Wen; Chen, Caleb Gon-Shen; Lin, Johnson; Chang, Yi-Fang; Chang, Ming-Chih; Hsieh, Ruey-Kuen; Chou, Wen-Chien; Lim, Ken-Hong; Kuo, Yuan-Yeh

2017-01-01

Germline variations at JAK2, TERT, HBS1L-MYB and MECOM have been found to associate with myeloproliferative neoplasms (MPNs) in European populations. Whether these germline variations are associated with MPNs in Taiwanese population is obscure. Here we aimed to evaluate the association of five germline variations (JAK2 46/1 haplotype tagged by rs12343867, JAK2 intron 8 rs12339666, TERT rs2736100, HBS1L-MYB rs9376092 and MECOM rs2201862) and the risk of MPNs in Taiwanese population. A total of 178 MPN patients (109 essential thrombocythemia, 54 polycythemia vera and 15 primary myelofibrosis) were enrolled into this study. The information of 17033 control subjects was obtained from Taiwan Biobank database. The JAK2 46/1 haplotype, JAK2 rs12339666 and TERT rs2736100 were significantly associated with Taiwanese MPNs (P = 3.6×10-19, 1.9×10-19 and 3.1×10-6, respectively), and JAK2V617F-positive MPNs (n=121) (P = 5.6×10-21, 4.4×10-21 and 8.6×10-7, respectively). In JAK2V617F-negative cases (n=55), only the JAK2 46/1 haplotype and JAK2 rs12339666 remained statistically significant (P= 0.009 and 0.007, respectively). When stratified by disease subtypes, the JAK2 46/1 haplotype and JAK2 rs12339666 were significantly associated with all three MPN subtypes, but TERT rs2736100 was only associated with essential thrombocythemia and polycythemia vera. We did not find any association of these five SNPs with CALR mutations in our cohort. Furthermore, the risk alleles of MECOM rs2201862 and HBS1L-MYB rs9376092 were demonstrated to be negatively associated with the risk of developing polycythemia vera. In conclusion, germline variations at JAK2 (both the 46/1 haplotype and rs12339666) and TERT rs2736100 were associated with MPNs in Taiwanese population. PMID:29100304
RTEL1 tagging SNPs and haplotypes were associated with glioma development.

PubMed

Li, Gang; Jin, Tianbo; Liang, Hongjuan; Zhang, Zhiguo; He, Shiming; Tu, Yanyang; Yang, Haixia; Geng, Tingting; Cui, Guangbin; Chen, Chao; Gao, Guodong

2013-05-17

As glioma ranks as the first most prevalent solid tumors in primary central nervous system, certain single-nucleotide polymorphisms (SNPs) may be related to increased glioma risk, and have implications in carcinogenesis. The present case-control study was carried out to elucidate how common variants contribute to glioma susceptibility. Ten candidate tagging SNPs (tSNPs) were selected from seven genes whose polymorphisms have been proven by classical literatures and reliable databases to be tended to relate with gliomas, and with the minor allele frequency (MAF)>5% in the HapMap Asian population. The selected tSNPs were genotyped in 629 glioma patients and 645 controls from a Han Chinese population using the multiplexed SNP MassEXTEND assay calibrated. Two significant tSNPs in RTEL1 gene were observed to be associated with glioma risk (rs6010620, P=0.0016, OR: 1.32, 95% CI: 1.11-1.56; rs2297440, P=0.001, OR: 1.33, 95% CI: 1.12-1.58) by χ2 test. It was identified the genotype "GG" of rs6010620 acted as the protective genotype for glioma (OR, 0.46; 95% CI, 0.31-0.7; P=0.0002), while the genotype "CC" of rs2297440 as the protective genotype in glioma (OR, 0.47; 95% CI, 0.31-0.71; P=0.0003). Furthermore, haplotype "GCT" in RTEL1 gene was found to be associated with risk of glioma (OR, 0.7; 95% CI, 0.57-0.86; Fisher's P=0.0005; Pearson's P=0.0005), and haplotype "ATT" was detected to be associated with risk of glioma (OR, 1.32; 95% CI, 1.12-1.57; Fisher's P=0.0013; Pearson's P=0.0013). Two single variants, the genotypes of "GG" of rs6010620 and "CC" of rs2297440 (rs6010620 and rs2297440) in the RTEL1 gene, together with two haplotypes of GCT and ATT, were identified to be associated with glioma development. And it might be used to evaluate the glioma development risks to screen the above RTEL1 tagging SNPs and haplotypes. The virtual slides for this article can be found here: http://www.diagnosticpathology.diagnomx.eu/vs/1993021136961998.
Is There a Genetic Predisposition to Anterior Cruciate Ligament Tear? A Systematic Review.

PubMed

John, Rakesh; Dhillon, Mandeep Singh; Sharma, Siddhartha; Prabhakar, Sharad; Bhandari, Mohit

2016-12-01

Injuries to the anterior cruciate ligament (ACL) are among the most common knee ligament injuries and frequently warrant reconstruction. The etiopathogenesis of these injuries has focused mainly on mechanism of trauma, patient sex, and anatomic factors as predisposing causes. Several genetic factors that could predispose to an ACL tear have recently been reported. This systematic review summarizes the current evidence for a genetic predisposition to ACL tears. The principal research question was to identify genetic factors, based on the available literature, that could predispose an individual to an ACL tear. Systematic review. The PubMed, EMBASE, Cochrane, and HuGE databases were searched; the search was run from the period of inception until June 21, 2015. A secondary search was performed by screening the references of full-text articles obtained and by manually searching selected journals. Articles were screened with prespecified inclusion criteria. The quality of studies included in the review was assessed for risk of bias by 2 reviewers using the Newcastle-Ottawa Scale. A total of 994 records were identified by the search, out of which 17 studies (16 case-control studies and 1 cross-sectional study) were included in the final review. Two studies observed a familial predisposition to an ACL tear. Fourteen studies looked at specific gene polymorphisms in 20 genes, from which different polymorphisms in 10 genes were positively associated with an ACL tear. In addition to these polymorphisms, 8 haplotypes were associated with ACL tear. One study looked at gene expression analysis. Although specific gene polymorphisms and haplotypes have been identified, it is difficult to come to a conclusion on the basis of the existing literature. Several sources of bias have been identified in these studies, and the results cannot be extrapolated to the general population. More studies are needed in larger populations of different ethnicities. Gene-gene interactions and gene expression studies in the future may delineate the exact role of these gene polymorphisms in ACL tears. © 2016 The Author(s).
Efficient algorithms for polyploid haplotype phasing.

PubMed

He, Dan; Saha, Subrata; Finkers, Richard; Parida, Laxmi

2018-05-09

Inference of haplotypes, or the sequence of alleles along the same chromosomes, is a fundamental problem in genetics and is a key component for many analyses including admixture mapping, identifying regions of identity by descent and imputation. Haplotype phasing based on sequencing reads has attracted lots of attentions. Diploid haplotype phasing where the two haplotypes are complimentary have been studied extensively. In this work, we focused on Polyploid haplotype phasing where we aim to phase more than two haplotypes at the same time from sequencing data. The problem is much more complicated as the search space becomes much larger and the haplotypes do not need to be complimentary any more. We proposed two algorithms, (1) Poly-Harsh, a Gibbs Sampling based algorithm which alternatively samples haplotypes and the read assignments to minimize the mismatches between the reads and the phased haplotypes, (2) An efficient algorithm to concatenate haplotype blocks into contiguous haplotypes. Our experiments showed that our method is able to improve the quality of the phased haplotypes over the state-of-the-art methods. To our knowledge, our algorithm for haplotype blocks concatenation is the first algorithm that leverages the shared information across multiple individuals to construct contiguous haplotypes. Our experiments showed that it is both efficient and effective.
The haplotype-resolved genome and epigenome of the aneuploid HeLa cancer cell line.

PubMed

Adey, Andrew; Burton, Joshua N; Kitzman, Jacob O; Hiatt, Joseph B; Lewis, Alexandra P; Martin, Beth K; Qiu, Ruolan; Lee, Choli; Shendure, Jay

2013-08-08

The HeLa cell line was established in 1951 from cervical cancer cells taken from a patient, Henrietta Lacks. This was the first successful attempt to immortalize human-derived cells in vitro. The robust growth and unrestricted distribution of HeLa cells resulted in its broad adoption--both intentionally and through widespread cross-contamination--and for the past 60 years it has served a role analogous to that of a model organism. The cumulative impact of the HeLa cell line on research is demonstrated by its occurrence in more than 74,000 PubMed abstracts (approximately 0.3%). The genomic architecture of HeLa remains largely unexplored beyond its karyotype, partly because like many cancers, its extensive aneuploidy renders such analyses challenging. We carried out haplotype-resolved whole-genome sequencing of the HeLa CCL-2 strain, examined point- and indel-mutation variations, mapped copy-number variations and loss of heterozygosity regions, and phased variants across full chromosome arms. We also investigated variation and copy-number profiles for HeLa S3 and eight additional strains. We find that HeLa is relatively stable in terms of point variation, with few new mutations accumulating after early passaging. Haplotype resolution facilitated reconstruction of an amplified, highly rearranged region of chromosome 8q24.21 at which integration of the human papilloma virus type 18 (HPV-18) genome occurred and that is likely to be the event that initiated tumorigenesis. We combined these maps with RNA-seq and ENCODE Project data sets to phase the HeLa epigenome. This revealed strong, haplotype-specific activation of the proto-oncogene MYC by the integrated HPV-18 genome approximately 500 kilobases upstream, and enabled global analyses of the relationship between gene dosage and expression. These data provide an extensively phased, high-quality reference genome for past and future experiments relying on HeLa, and demonstrate the value of haplotype resolution for characterizing cancer genomes and epigenomes.
Association of polymorphisms in survivin gene with the risk of hepatocellular carcinoma in Chinese han population: a case control study

PubMed Central

2012-01-01

Background Survivin, one of the strongest apoptosis inhibitors, plays a critical role in the development and progression of hepatocellular carcinoma (HCC). By comparison, relatively little is known about the effect of survivin gene polymorphisms on HCC susceptibility. Our study aimed to investigate the association of survivin gene polymorphisms with the risk of HCC in Chinese han population. Methods A case-control study was conducted in Chinese han population consisting of 178 HCC cases and 196 cancer-free controls. Information on demographic data and related risk factors was collected for all subjects. Polymorphisms of the survivin gene, including three loci of rs8073069, rs9904341 and rs1042489, were selected and genotyped by a polymerase chain reaction- restriction fragment length polymorphism (PCR-RFLP) technique. Association analysis of genotypes/alleles and haplotypes from these loci with the risk of HCC was conducted under different genetic models. Results Using univariate analysis of rs8073069, rs9904341 and rs1042489 under different genetic models, no statistically significant difference was found in genotype or allele distribution of HCC cases relative to the controls (P > 0.05). Linkage disequilibrium (LD) analysis showed that these loci were in LD. Multivariate logistic regression indicated that with no G-C-T haplotype as reference, the haplotype of G-C-T from these loci was associated with a lower risk for HCC under the recessive model (OR = 0.46, 95% confidence interval (CI): 0.24~0.90, P = 0.023). Both HBsAg+ and the medical history of viral hepatitis type B were risk factors for HCC. However, no statistically significant haplotype-environment interaction existed. Conclusions No association between rs8073069, rs9904341 or rs1042489 in survivin gene and the risk of HCC is found in Chinese han population, but rs8073069G-rs9904341C- rs1042489T is perhaps a protective haplotype for HCC. PMID:22214342
Recent selective sweeps in North American Drosophila melanogaster show signatures of soft sweeps.

PubMed

Garud, Nandita R; Messer, Philipp W; Buzbas, Erkan O; Petrov, Dmitri A

2015-02-01

Adaptation from standing genetic variation or recurrent de novo mutation in large populations should commonly generate soft rather than hard selective sweeps. In contrast to a hard selective sweep, in which a single adaptive haplotype rises to high population frequency, in a soft selective sweep multiple adaptive haplotypes sweep through the population simultaneously, producing distinct patterns of genetic variation in the vicinity of the adaptive site. Current statistical methods were expressly designed to detect hard sweeps and most lack power to detect soft sweeps. This is particularly unfortunate for the study of adaptation in species such as Drosophila melanogaster, where all three confirmed cases of recent adaptation resulted in soft selective sweeps and where there is evidence that the effective population size relevant for recent and strong adaptation is large enough to generate soft sweeps even when adaptation requires mutation at a specific single site at a locus. Here, we develop a statistical test based on a measure of haplotype homozygosity (H12) that is capable of detecting both hard and soft sweeps with similar power. We use H12 to identify multiple genomic regions that have undergone recent and strong adaptation in a large population sample of fully sequenced Drosophila melanogaster strains from the Drosophila Genetic Reference Panel (DGRP). Visual inspection of the top 50 candidates reveals that in all cases multiple haplotypes are present at high frequencies, consistent with signatures of soft sweeps. We further develop a second haplotype homozygosity statistic (H2/H1) that, in combination with H12, is capable of differentiating hard from soft sweeps. Surprisingly, we find that the H12 and H2/H1 values for all top 50 peaks are much more easily generated by soft rather than hard sweeps. We discuss the implications of these results for the study of adaptation in Drosophila and in species with large census population sizes.
Non-additive and epistatic effects of HLA polymorphisms contributing to risk of adult glioma.

PubMed

Zhang, Chenan; de Smith, Adam J; Smirnov, Ivan V; Wiencke, John K; Wiemels, Joseph L; Witte, John S; Walsh, Kyle M

2017-11-01

Although genome-wide association studies have identified several susceptibility loci for adult glioma, little is known regarding the potential contribution of genetic variation in the human leukocyte antigen (HLA) region to glioma risk. HLA associations have been reported for various malignancies, with many studies investigating selected candidate HLA polymorphisms. However, no systematic analysis has been conducted in glioma patients, and no investigation into potential non-additive effects has been described. We conducted comprehensive genetic analyses of HLA variants among 1746 adult glioma patients and 2312 controls of European-ancestry from the GliomaScan Consortium. Genotype data were generated with the Illumina 660-Quad array, and we imputed HLA alleles using a reference panel of 5225 individuals in the Type 1 Diabetes Genetics Consortium who underwent high-resolution HLA typing via next-generation sequencing. Case-control comparisons were adjusted for population stratification using ancestry-informative principal components. Because alleles in different loci across the HLA region are linked, we created multigene haplotypes consisting of the genes DRB1, DQA1, and DQB1. Although none of the haplotypes were associated with glioma in additive models, inclusion of a dominance term significantly improved the model for multigene haplotype HLA-DRB1*1501-DQA1*0102-DQB1*0602 (P = 0.002). Heterozygous carriers of the haplotype had an increased risk of glioma [odds ratio (OR) 1.23; 95% confidence interval (CI) 1.01-1.49], while homozygous carriers were at decreased risk compared with non-carriers (OR 0.64; 95% CI 0.40-1.01). Our results suggest that the DRB1*1501-DQA1*0102-DQB1*0602 haplotype may contribute to the risk of glioma in a non-additive manner, with the positive dominance effect partly explained by an epistatic interaction with HLA-DRB1*0401-DQA1*0301-DQB1*0301.
Graphical genotyping as a method to map Ny (o,n)sto and Gpa5 using a reference panel of tetraploid potato cultivars.

PubMed

van Eck, Herman J; Vos, Peter G; Valkonen, Jari P T; Uitdewilligen, Jan G A M L; Lensing, Hellen; de Vetten, Nick; Visser, Richard G F

2017-03-01

The method of graphical genotyping is applied to a panel of tetraploid potato cultivars to visualize haplotype sharing. The method allowed to map genes involved in virus and nematode resistance. The physical coordinates of the amount of linkage drag surrounding these genes are easily interpretable. Graphical genotyping is a visually attractive and easily interpretable method to represent genetic marker data. In this paper, the method is extended from diploids to a panel of tetraploid potato cultivars. Application of filters to select a subset of SNPs allows one to visualize haplotype sharing between individuals that also share a specific locus. The method is illustrated with cultivars resistant to Potato virus Y (PVY), while simultaneously selecting for the absence of the SNPs in susceptible clones. SNP data will then merge into an image which displays the coordinates of a distal genomic region on the northern arm of chromosome 11 where a specific haplotype is introgressed from the wild potato species S. stoloniferum (CPC 2093) carrying a gene (Ny (o,n)sto ) conferring resistance to two PVY strains, PVY O and PVY NTN . Graphical genotyping was also successful in showing the haplotypes on chromosome 12 carrying Ry-f sto , another resistance gene derived from S. stoloniferum conferring broad-spectrum resistance to PVY, as well as chromosome 5 haplotypes from S. vernei, with the Gpa5 locus involved in resistance against Globodera pallida cyst nematodes. The image also shows shortening of linkage drag by meiotic recombination of the introgression segment in more recent breeding material. Identity-by-descent was found to be a requirement for using graphical genotyping, which is proposed as a non-statistical alternative method for gene discovery, as compared with genome-wide association studies. The potential and limitations of the method are discussed.
A termite symbiotic mushroom maximizing sexual activity at growing tips of vegetative hyphae.

PubMed

Hsieh, Huei-Mei; Chung, Mei-Chu; Chen, Pao-Yang; Hsu, Fei-Man; Liao, Wen-Wei; Sung, Ai-Ning; Lin, Chun-Ru; Wang, Chung-Ju Rachel; Kao, Yu-Hsin; Fang, Mei-Jane; Lai, Chi-Yung; Huang, Chieh-Chen; Chou, Jyh-Ching; Chou, Wen-Neng; Chang, Bill Chia-Han; Ju, Yu-Ming

2017-09-19

Termitomyces mushrooms are mutualistically associated with fungus-growing termites, which are widely considered to cultivate a monogenotypic Termitomyces symbiont within a colony. Termitomyces cultures isolated directly from termite colonies are heterokaryotic, likely through mating between compatible homokaryons. After pairing homokaryons carrying different haplotypes at marker gene loci MIP and RCB from a Termitomyces fruiting body associated with Odontotermes formosanus, we observed nuclear fusion and division, which greatly resembled meiosis, during each hyphal cell division and conidial formation in the resulting heterokaryons. Surprisingly, nuclei in homokaryons also behaved similarly. To confirm if meiotic-like recombination occurred within mycelia, we constructed whole-genome sequencing libraries from mycelia of two homokaryons and a heterokaryon resulting from mating of the two homokaryons. Obtained reads were aligned to the reference genome of Termitomyces sp. J132 for haplotype reconstruction. After removal of the recombinant haplotypes shared between the heterokaryon and either homokaryons, we inferred that 5.04% of the haplotypes from the heterokaryon were the recombinants resulting from homologous recombination distributed genome-wide. With RNA transcripts of four meiosis-specific genes, including SPO11, DMC1, MSH4, and MLH1, detected from a mycelial sample by real-time quantitative PCR, the nuclear behavior in mycelia was reconfirmed meiotic-like. Unlike other basidiomycetes where sex is largely restricted to basidia, Termitomyces maximizes sexuality at somatic stage, resulting in an ever-changing genotype composed of a myriad of coexisting heterogeneous nuclei in a heterokaryon. Somatic meiotic-like recombination may endow Termitomyces with agility to cope with termite consumption by maximized genetic variability.
Haplotype estimation using sequencing reads.

PubMed

Delaneau, Olivier; Howie, Bryan; Cox, Anthony J; Zagury, Jean-François; Marchini, Jonathan

2013-10-03

High-throughput sequencing technologies produce short sequence reads that can contain phase information if they span two or more heterozygote genotypes. This information is not routinely used by current methods that infer haplotypes from genotype data. We have extended the SHAPEIT2 method to use phase-informative sequencing reads to improve phasing accuracy. Our model incorporates the read information in a probabilistic model through base quality scores within each read. The method is primarily designed for high-coverage sequence data or data sets that already have genotypes called. One important application is phasing of single samples sequenced at high coverage for use in medical sequencing and studies of rare diseases. Our method can also use existing panels of reference haplotypes. We tested the method by using a mother-father-child trio sequenced at high-coverage by Illumina together with the low-coverage sequence data from the 1000 Genomes Project (1000GP). We found that use of phase-informative reads increases the mean distance between switch errors by 22% from 274.4 kb to 328.6 kb. We also used male chromosome X haplotypes from the 1000GP samples to simulate sequencing reads with varying insert size, read length, and base error rate. When using short 100 bp paired-end reads, we found that using mixtures of insert sizes produced the best results. When using longer reads with high error rates (5-20 kb read with 4%-15% error per base), phasing performance was substantially improved. Copyright © 2013 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Salt tolerance underlies the cryptic invasion of North American salt marshes by an introduced haplotype of the common reed Phragmites australis (Poaceae)

USGS Publications Warehouse

Vasquez, Edward A.; Glenn, Edward P.; Brown, J. Jed; Guntenspergen, Glenn R.; Nelson, Stephen G.

2005-01-01

A distinct, non-native haplotype of the common reed Phragmites australis has become invasive in Atlantic coastal Spartina marshes. We compared the salt tolerance and other growth characteristics of the invasive M haplotype with 2 native haplotypes (F and AC) in greenhouse experiments. The M haplotype retained 50% of its growth potential up to 0.4 M NaCl, whereas the F and AC haplotypes did not grow above 0.1 M NaCl. The M haplotype produced more shoots per gram of rhizome tissue and had higher relative growth rates than the native haplotypes on both freshwater and saline water treatments. The M haplotype also differed from the native haplotypes in shoot water content and the biometrics of shoots and rhizomes. The results offer an explanation for how the M haplotype is able to spread in coastal salt marshes and support the conclusion of DNA analyses that the M haplotype is a distinct ecotype of P. australis.
Ancestral Asian source(s) of new world Y-chromosome founder haplotypes.

PubMed Central

Karafet, T M; Zegura, S L; Posukh, O; Osipova, L; Bergen, A; Long, J; Goldman, D; Klitz, W; Harihara, S; de Knijff, P; Wiebe, V; Griffiths, R C; Templeton, A R; Hammer, M F

1999-01-01

Haplotypes constructed from Y-chromosome markers were used to trace the origins of Native Americans. Our sample consisted of 2,198 males from 60 global populations, including 19 Native American and 15 indigenous North Asian groups. A set of 12 biallelic polymorphisms gave rise to 14 unique Y-chromosome haplotypes that were unevenly distributed among the populations. Combining multiallelic variation at two Y-linked microsatellites (DYS19 and DXYS156Y) with the unique haplotypes results in a total of 95 combination haplotypes. Contra previous findings based on Y- chromosome data, our new results suggest the possibility of more than one Native American paternal founder haplotype. We postulate that, of the nine unique haplotypes found in Native Americans, haplotypes 1C and 1F are the best candidates for major New World founder haplotypes, whereas haplotypes 1B, 1I, and 1U may either be founder haplotypes and/or have arrived in the New World via recent admixture. Two of the other four haplotypes (YAP+ haplotypes 4 and 5) are probably present because of post-Columbian admixture, whereas haplotype 1G may have originated in the New World, and the Old World source of the final New World haplotype (1D) remains unresolved. The contrasting distribution patterns of the two major candidate founder haplotypes in Asia and the New World, as well as the results of a nested cladistic analysis, suggest the possibility of more than one paternal migration from the general region of Lake Baikal to the Americas. PMID:10053017
[Selected aspects of computer-assisted literature management].

PubMed

Reiss, M; Reiss, G

1998-01-01

We want to report about our own experiences with a database manager. Bibliography database managers are used to manage information resources: specifically, to maintain a database to references and create bibliographies and reference lists for written works. A database manager allows to enter summary information (record) for articles, book sections, books, dissertations, conference proceedings, and so on. Other features that may be included in a database manager include the ability to import references from different sources, such as MEDLINE. The word processing components allow to generate reference list and bibliographies in a variety of different styles, generates a reference list from a word processor manuscript. The function and the use of the software package EndNote 2 for Windows are described. Its advantages in fulfilling different requirements for the citation style and the sort order of reference lists are emphasized.
Strains of the Group I Lineage of Acidovorax citrulli, the Causal Agent of Bacterial Fruit Blotch of Cucurbitaceous Crops, are Predominant in Brazil.

PubMed

Silva, Gustavo M; Souza, Ricardo M; Yan, Lichun; Júnior, Rui S; Medeiros, Flavio H V; Walcott, Ron R

2016-12-01

Bacterial fruit blotch (BFB), caused by the seedborne bacterium Acidovorax citrulli, is an economically important threat to cucurbitaceous crops worldwide. Since the first report of BFB in Brazil in 1990, outbreaks have occurred sporadically on watermelon and, more frequently, on melon, resulting in significant yield losses. At present, the genetic diversity and the population structure of A. citrulli strains in Brazil remain unclear. A collection of 74 A. citrulli strains isolated from naturally infected tissues of different cucurbit hosts in Brazil between 2000 and 2014 and 18 A. citrulli reference strains from other countries were compared by pulsed-field gel electrophoresis (PFGE), multilocus sequence analysis (MLSA) of housekeeping and virulence-associated genes, and pathogenicity tests on seedlings of different cucurbit species. The Brazilian population comprised predominantly group I strains (98%), regardless of the year of isolation, geographical region, or host. Whole-genome restriction digestion and PFGE analysis revealed that three unique and previously unreported A. citrulli haplotypes (assigned as haplotypes B22, B23, and B24) occurred in Brazil. The greatest diversity of A. citrulli (four haplotypes) was found among strains collected from the northeastern region of Brazil, which accounts for more than 90% of the country's melon production. MLSA clearly distinguished A. citrulli strains into two well-supported clades, in agreement with observations based on PFGE analysis. Five Brazilian A. citrulli strains, representing different group I haplotypes, were moderately aggressive on watermelon seedlings compared with four group II strains that were highly aggressive. In contrast, no significant differences in BFB severity were observed between group I and II A. citrulli strains on melon and squash seedlings. Finally, we observed a differential effect of temperature on in vitro growth of representative group I and II A. citrulli haplotypes. Specifically, of 18 group II strains tested, all grew at 40 and 41°C, whereas only 3 of 15 group I strains (haplotypes B8[P], B3[K], and B15) grew at 40°C. Three strains representing haplotype B8(P) were the only group I strains that grew at 41°C. These results contribute to a better understanding of the genetic diversity of A. citrulli associated with BFB outbreaks in Brazil, and reinforce the efficiency of MLSA and PFGE analysis for assessing population structure. This study also provides the first evidence to suggest that temperature might be a driver in the ecological adaptation of A. citrulli populations.
17 to 23: A novel complementary mini Y-STR panel to extend the Y-STR databases from 17 to 23 markers for forensic purposes.

PubMed

Núñez, Carolina; Baeta, Miriam; Ibarbia, Nerea; Ortueta, Urko; Jiménez-Moreno, Susana; Blazquez-Caeiro, José Luis; Builes, Juan José; Herrera, Rene J; Martínez-Jarreta, Begoña; de Pancorbo, Marian M

2017-04-01

A Y-STR multiplex system has been developed with the purpose of complementing the widely used 17 Y-STR haplotyping (AmpFlSTR Y Filer® PCR Amplification kit) routinely employed in forensic and population genetic studies. This new multiplex system includes six additional STR loci (DYS576, DYS481, DYS549, DYS533, DYS570, and DYS643) to reach the 23 Y-STR of the PowerPlex® Y23 System. In addition, this kit includes the DYS456 and DYS385 loci for traceability purposes. Male samples from 625 individuals from ten worldwide populations were genotyped, including three sample sets from populations previously published with the 17 Y-STR system to expand their current data. Validation studies demonstrated good performance of the panel set in terms of concordance, sensitivity, and stability in the presence of inhibitors and artificially degraded DNA. The results obtained for haplotype diversity and discrimination capacity with this multiplex system were considerably high, providing further evidences of the suitability of this novel Y-STR system for forensic purposes. Thus, the use of this multiplex for samples previously genotyped with 17 Y-STRs will be an efficient and low-cost alternative to complete the set of 23 Y-STRs and improve allele databases for population and forensic purposes. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Improved imputation accuracy of rare and low-frequency variants using population-specific high-coverage WGS-based imputation reference panel.

PubMed

Mitt, Mario; Kals, Mart; Pärn, Kalle; Gabriel, Stacey B; Lander, Eric S; Palotie, Aarno; Ripatti, Samuli; Morris, Andrew P; Metspalu, Andres; Esko, Tõnu; Mägi, Reedik; Palta, Priit

2017-06-01

Genetic imputation is a cost-efficient way to improve the power and resolution of genome-wide association (GWA) studies. Current publicly accessible imputation reference panels accurately predict genotypes for common variants with minor allele frequency (MAF)≥5% and low-frequency variants (0.5≤MAF<5%) across diverse populations, but the imputation of rare variation (MAF<0.5%) is still rather limited. In the current study, we evaluate imputation accuracy achieved with reference panels from diverse populations with a population-specific high-coverage (30 ×) whole-genome sequencing (WGS) based reference panel, comprising of 2244 Estonian individuals (0.25% of adult Estonians). Although the Estonian-specific panel contains fewer haplotypes and variants, the imputation confidence and accuracy of imputed low-frequency and rare variants was significantly higher. The results indicate the utility of population-specific reference panels for human genetic studies.
Improved imputation accuracy of rare and low-frequency variants using population-specific high-coverage WGS-based imputation reference panel

PubMed Central

Mitt, Mario; Kals, Mart; Pärn, Kalle; Gabriel, Stacey B; Lander, Eric S; Palotie, Aarno; Ripatti, Samuli; Morris, Andrew P; Metspalu, Andres; Esko, Tõnu; Mägi, Reedik; Palta, Priit

2017-01-01

Genetic imputation is a cost-efficient way to improve the power and resolution of genome-wide association (GWA) studies. Current publicly accessible imputation reference panels accurately predict genotypes for common variants with minor allele frequency (MAF)≥5% and low-frequency variants (0.5≤MAF<5%) across diverse populations, but the imputation of rare variation (MAF<0.5%) is still rather limited. In the current study, we evaluate imputation accuracy achieved with reference panels from diverse populations with a population-specific high-coverage (30 ×) whole-genome sequencing (WGS) based reference panel, comprising of 2244 Estonian individuals (0.25% of adult Estonians). Although the Estonian-specific panel contains fewer haplotypes and variants, the imputation confidence and accuracy of imputed low-frequency and rare variants was significantly higher. The results indicate the utility of population-specific reference panels for human genetic studies. PMID:28401899
Variation and Evolution in the Glutamine-Rich Repeat Region of Drosophila Argonaute-2

PubMed Central

Palmer, William H.; Obbard, Darren J.

2016-01-01

RNA interference pathways mediate biological processes through Argonaute-family proteins, which bind small RNAs as guides to silence complementary target nucleic acids . In insects and crustaceans Argonaute-2 silences viral nucleic acids, and therefore acts as a primary effector of innate antiviral immunity. Although the function of the major Argonaute-2 domains, which are conserved across most Argonaute-family proteins, are known, many invertebrate Argonaute-2 homologs contain a glutamine-rich repeat (GRR) region of unknown function at the N-terminus . Here we combine long-read amplicon sequencing of Drosophila Genetic Reference Panel (DGRP) lines with publicly available sequence data from many insect species to show that this region evolves extremely rapidly and is hyper-variable within species. We identify distinct GRR haplotype groups in Drosophila melanogaster, and suggest that one of these haplotype groups has recently risen to high frequency in a North American population. Finally, we use published data from genome-wide association studies of viral resistance in D. melanogaster to test whether GRR haplotypes are associated with survival after virus challenge. We find a marginally significant association with survival after challenge with Drosophila C Virus in the DGRP, but we were unable to replicate this finding using lines from the Drosophila Synthetic Population Resource panel. PMID:27317784

Who Are the Okinawans? Ancestry, Genome Diversity, and Implications for the Genetic Study of Human Longevity From a Geographically Isolated Population

PubMed Central

Hsueh, Wen-Chi; He, Qimei; Willcox, D. Craig; Nievergelt, Caroline M.; Donlon, Timothy A.; Kwok, Pui-Yan; Suzuki, Makoto; Willcox, Bradley J.

2014-01-01

Isolated populations have advantages for genetic studies of longevity from decreased haplotype diversity and long-range linkage disequilibrium. This permits smaller sample sizes without loss of power, among other utilities. Little is known about the genome of the Okinawans, a potential population isolate, recognized for longevity. Therefore, we assessed genetic diversity, structure, and admixture in Okinawans, and compared this with Caucasians, Chinese, Japanese, and Africans from HapMap II, genotyped on the same Affymetrix GeneChip Human Mapping 500K array. Principal component analysis, haplotype coverage, and linkage disequilibrium decay revealed a distinct Okinawan genome—more homogeneity, less haplotype diversity, and longer range linkage disequilibrium. Population structure and admixture analyses utilizing 52 global reference populations from the Human Genome Diversity Cell Line Panel demonstrated that Okinawans clustered almost exclusively with East Asians. Sibling relative risk (λs) analysis revealed that siblings of Okinawan centenarians have 3.11 times (females) and 3.77 times (males) more likelihood of centenarianism. These findings suggest that Okinawans are genetically distinct and share several characteristics of a population isolate, which are prone to develop extreme phenotypes (eg, longevity) from genetic drift, natural selection, and population bottlenecks. These data support further exploration of genetic influence on longevity in the Okinawans. PMID:24444611
Association Between Chloroplast DNA and Mitochondrial DNA Haplotypes in Prunus spinosa L. (Rosaceae) Populations across Europe

PubMed Central

MOHANTY, APARAJITA; MARTÍN, JUAN PEDRO; GONZÁLEZ, LUIS MIGUEL; AGUINAGALDE, ITZIAR

2003-01-01

Chloroplast DNA (cpDNA) and mitochondrial DNA (mtDNA) were studied in 24 populations of Prunus spinosa sampled across Europe. The cpDNA and mtDNA fragments were amplified using universal primers and subsequently digested with restriction enzymes to obtain the polymorphisms. Combinations of all the polymorphisms resulted in 33 cpDNA haplotypes and two mtDNA haplotypes. Strict association between the cpDNA haplotypes and the mtDNA haplotypes was detected in most cases, indicating conjoint inheritance of the two genomes. The most frequent and abundant cpDNA haplotype (C20; frequency, 51 %) is always associated with the more frequent and abundant mtDNA haplotype (M1; frequency, 84 %). All but two of the cpDNA haplotypes associated with the less frequent mtDNA haplotype (M2) are private haplotypes. These private haplotypes are phylogenetically related but geographically unrelated. They form a separate cluster on the minimum‐length spanning tree. PMID:14534199
[Construction of haplotype and haplotype block based on tag single nucleotide polymorphisms and their applications in association studies].

PubMed

Gu, Ming-liang; Chu, Jia-you

2007-12-01

Human genome has structures of haplotype and haplotype block which provide valuable information on human evolutionary history and may lead to the development of more efficient strategies to identify genetic variants that increase susceptibility to complex diseases. Haplotype block can be divided into discrete blocks of limited haplotype diversity. In each block, a small fraction of ptag SNPsq can be used to distinguish a large fraction of the haplotypes. These tag SNPs can be potentially useful for construction of haplotype and haplotype block, and association studies in complex diseases. There are two general classes of methods to construct haplotype and haplotype blocks based on genotypes on large pedigrees and statistical algorithms respectively. The author evaluate several construction methods to assess the power of different association tests with a variety of disease models and block-partitioning criteria. The advantages, limitations and applications of each method and the application in the association studies are discussed equitably. With the completion of the HapMap and development of statistical algorithms for addressing haplotype reconstruction, ideas of construction of haplotype based on combination of mathematics, physics, and computer science etc will have profound impacts on population genetics, location and cloning for susceptible genes in complex diseases, and related domain with life science etc.
ITS-90 Thermocouple Database

National Institute of Standards and Technology Data Gateway

SRD 60 NIST ITS-90 Thermocouple Database (Web, free access) Web version of Standard Reference Database 60 and NIST Monograph 175. The database gives temperature -- electromotive force (emf) reference functions and tables for the letter-designated thermocouple types B, E, J, K, N, R, S and T. These reference functions have been adopted as standards by the American Society for Testing and Materials (ASTM) and the International Electrotechnical Commission (IEC).
Mining of haplotype-based expressed sequence tag single nucleotide polymorphisms in citrus

PubMed Central

2013-01-01

Background Single nucleotide polymorphisms (SNPs), the most abundant variations in a genome, have been widely used in various studies. Detection and characterization of citrus haplotype-based expressed sequence tag (EST) SNPs will greatly facilitate further utilization of these gene-based resources. Results In this paper, haplotype-based SNPs were mined out of publicly available citrus expressed sequence tags (ESTs) from different citrus cultivars (genotypes) individually and collectively for comparison. There were a total of 567,297 ESTs belonging to 27 cultivars in varying numbers and consequentially yielding different numbers of haplotype-based quality SNPs. Sweet orange (SO) had the most (213,830) ESTs, generating 11,182 quality SNPs in 3,327 out of 4,228 usable contigs. Summed from all the individually mining results, a total of 25,417 quality SNPs were discovered – 15,010 (59.1%) were transitions (AG and CT), 9,114 (35.9%) were transversions (AC, GT, CG, and AT), and 1,293 (5.0%) were insertion/deletions (indels). A vast majority of SNP-containing contigs consisted of only 2 haplotypes, as expected, but the percentages of 2 haplotype contigs varied widely in these citrus cultivars. BLAST of the 25,417 25-mer SNP oligos to the Clementine reference genome scaffolds revealed 2,947 SNPs had “no hits found”, 19,943 had 1 unique hit / alignment, 1,571 had one hit and 2+ alignments per hit, and 956 had 2+ hits and 1+ alignment per hit. Of the total 24,293 scaffold hits, 23,955 (98.6%) were on the main scaffolds 1 to 9, and only 338 were on 87 minor scaffolds. Most alignments had 100% (25/25) or 96% (24/25) nucleotide identities, accounting for 93% of all the alignments. Considering almost all the nucleotide discrepancies in the 24/25 alignments were at the SNP sites, it served well as in silico validation of these SNPs, in addition to and consistent with the rate (81%) validated by sequencing and SNaPshot assay. Conclusions High-quality EST-SNPs from different citrus genotypes were detected, and compared to estimate the heterozygosity of each genome. All the SNP oligo sequences were aligned with the Clementine citrus genome to determine their distribution and uniqueness and for in silico validation, in addition to SNaPshot and sequencing validation of selected SNPs. PMID:24175923
Classical sickle beta-globin haplotypes exhibit a high degree of long-range haplotype similarity in African and Afro-Caribbean populations

PubMed Central

Hanchard, Neil; Elzein, Abier; Trafford, Clare; Rockett, Kirk; Pinder, Margaret; Jallow, Muminatou; Harding, Rosalind; Kwiatkowski, Dominic; McKenzie, Colin

2007-01-01

Background The sickle (βs) mutation in the beta-globin gene (HBB) occurs on five "classical" βs haplotype backgrounds in ethnic groups of African ancestry. Strong selection in favour of the βs allele – a consequence of protection from severe malarial infection afforded by heterozygotes – has been associated with a high degree of extended haplotype similarity. The relationship between classical βs haplotypes and long-range haplotype similarity may have both anthropological and clinical implications, but to date has not been explored. Here we evaluate the haplotype similarity of classical βs haplotypes over 400 kb in population samples from Jamaica, The Gambia, and among the Yoruba of Nigeria (Hapmap YRI). Results The most common βs sub-haplotype among Jamaicans and the Yoruba was the Benin haplotype, while in The Gambia the Senegal haplotype was observed most commonly. Both subtypes exhibited a high degree of long-range haplotype similarity extending across approximately 400 kb in all three populations. This long-range similarity was significantly greater than that seen for other haplotypes sampled in these populations (P < 0.001), and was independent of marker choice and marker density. Among the Yoruba, Benin haplotypes were highly conserved, with very strong linkage disequilibrium (LD) extending a megabase across the βs mutation. Conclusion Two different classical βs haplotypes, sampled from different populations, exhibit comparable and extensive long-range haplotype similarity and strong LD. This LD extends across the adjacent recombination hotspot, and is discernable at distances in excess of 400 kb. Although the multi-centric geographic distribution of βs haplotypes indicates strong subdivision among early Holocene sub-Saharan populations, we find no evidence that selective pressures imposed by falciparum malaria varied in intensity or timing between these subpopulations. Our observations also suggest that cis-acting loci, which may influence outcomes in sickle cell disease, could lie considerable distances away from β-globin. PMID:17688704
Classical sickle beta-globin haplotypes exhibit a high degree of long-range haplotype similarity in African and Afro-Caribbean populations.

PubMed

Hanchard, Neil; Elzein, Abier; Trafford, Clare; Rockett, Kirk; Pinder, Margaret; Jallow, Muminatou; Harding, Rosalind; Kwiatkowski, Dominic; McKenzie, Colin

2007-08-10

The sickle (betas) mutation in the beta-globin gene (HBB) occurs on five "classical" betas haplotype backgrounds in ethnic groups of African ancestry. Strong selection in favour of the betas allele - a consequence of protection from severe malarial infection afforded by heterozygotes - has been associated with a high degree of extended haplotype similarity. The relationship between classical betas haplotypes and long-range haplotype similarity may have both anthropological and clinical implications, but to date has not been explored. Here we evaluate the haplotype similarity of classical betas haplotypes over 400 kb in population samples from Jamaica, The Gambia, and among the Yoruba of Nigeria (Hapmap YRI). The most common betas sub-haplotype among Jamaicans and the Yoruba was the Benin haplotype, while in The Gambia the Senegal haplotype was observed most commonly. Both subtypes exhibited a high degree of long-range haplotype similarity extending across approximately 400 kb in all three populations. This long-range similarity was significantly greater than that seen for other haplotypes sampled in these populations (P < 0.001), and was independent of marker choice and marker density. Among the Yoruba, Benin haplotypes were highly conserved, with very strong linkage disequilibrium (LD) extending a megabase across the betas mutation. Two different classical betas haplotypes, sampled from different populations, exhibit comparable and extensive long-range haplotype similarity and strong LD. This LD extends across the adjacent recombination hotspot, and is discernable at distances in excess of 400 kb. Although the multi-centric geographic distribution of betas haplotypes indicates strong subdivision among early Holocene sub-Saharan populations, we find no evidence that selective pressures imposed by falciparum malaria varied in intensity or timing between these subpopulations. Our observations also suggest that cis-acting loci, which may influence outcomes in sickle cell disease, could lie considerable distances away from beta-globin.
Analysis and comparison of NoSQL databases with an introduction to consistent references in big data storage systems

NASA Astrophysics Data System (ADS)

Dziedzic, Adam; Mulawka, Jan

2014-11-01

NoSQL is a new approach to data storage and manipulation. The aim of this paper is to gain more insight into NoSQL databases, as we are still in the early stages of understanding when to use them and how to use them in an appropriate way. In this submission descriptions of selected NoSQL databases are presented. Each of the databases is analysed with primary focus on its data model, data access, architecture and practical usage in real applications. Furthemore, the NoSQL databases are compared in fields of data references. The relational databases offer foreign keys, whereas NoSQL databases provide us with limited references. An intermediate model between graph theory and relational algebra which can address the problem should be created. Finally, the proposal of a new approach to the problem of inconsistent references in Big Data storage systems is introduced.
Interleukin-10 Promoter Gene Polymorphisms and Susceptibility to Asthma: A Meta-Analysis

PubMed Central

Hyun, Myung-Han; Lee, Chung-Ho; Kang, Min-Hyung; Park, Bong-Kyung; Lee, Young Ho

2013-01-01

Objective The aim of this study was to explore whether the interleukin (IL)-10 polymorphisms and their haplotypes contribute to asthma susceptibility. Methods MEDLINE, EMBASE and the COCHRANE library databases were utilized to identify available articles. A meta-analysis was conducted on IL-10 -1082 G/A, -819 C/T, -592 C/A polymorphisms, and their haplotypes and asthma. Results Eleven studies involving 2,215 asthma patients and 2,170 controls were considered in the meta-analysis. The meta-analysis revealed no association between asthma and the IL-10 -1082 G allele [Odds ratio (OR) = 0.87, 95% Confidence interval (CI) = 0.68–1.12, p = 0.28]. However, meta-analysis of the five studies in Hardy-Weinburg equilibrium produced the relationship between the IL-10 -1082 G allele and asthma (OR = 0.71, 95% CI = 0.60–0.83, p<0.0001). Stratification by ethnicity indicated an association between the IL-10 -1082 G allele and asthma in East Asians (OR = 0.74, 95% CI = 0.57–0.96, p = 0.02), but not in West Asians. Furthermore, stratification by age indicated an association between the IL-10 -1082 G allele and asthma in adults and mixed groups (OR = 0.77, 95% CI = 0.62–0.96, p = 0.02; OR = 0.67, 95% CI = 0.49–0.92, p = 0.01). No association was found between asthma and IL-10 -819 C/T and IL-10 -592 C/A polymorphisms and their haplotypes. Conclusion The IL-10 -1082 G/A polymorphism confers susceptibility to asthma in East Asians and in adults. However, the IL-10 -819 C/T, -592 C/A polymorphisms and their haplotypes are not associated with asthma. PMID:23335974
Genetic variability of Echinococcus granulosus complex in various geographical populations of Iran inferred by mitochondrial DNA sequences.

PubMed

Spotin, Adel; Mahami-Oskouei, Mahmoud; Harandi, Majid Fasihi; Baratchian, Mehdi; Bordbar, Ali; Ahmadpour, Ehsan; Ebrahimi, Sahar

2017-01-01

To investigate the genetic variability and population structure of Echinococcus granulosus complex, 79 isolates were sequenced from different host species covering human, dog, camel, goat, sheep and cattle as of various geographical sub-populations of Iran (Northwestern, Northern, and Southeastern). In addition, 36 sequences of other geographical populations (Western, Southeastern and Central Iran), were directly retrieved from GenBank database for the mitochondrial cytochrome c oxidase subunit 1 (cox1) gene. The confirmed isolates were grouped as G1 genotype (n=92), G6 genotype (n=14), G3 genotype (n=8) and G2 genotype (n=1). 50 unique haplotypes were identified based on the analyzed sequences of cox1. A parsimonious network of the sequence haplotypes displayed star-like features in the overall population containing IR23 (22: 19.1%) as the most common haplotype. According to the analysis of molecular variance (AMOVA) test, the high value of haplotype diversity of E. granulosus complex was shown the total genetic variability within populations while nucleotide diversity was low in all populations. Neutrality indices of the cox1 (Tajima's D and Fu's Fs tests) were shown negative values in Western-Northwestern, Northern and Southeastern populations which indicating significant divergence from neutrality and positive but not significant in Central isolates. A pairwise fixation index (Fst) as a degree of gene flow was generally low value for all populations (0.00647-0.15198). The statistically Fst values indicate that Echinococcus sensu stricto (genotype G1-G3) populations are not genetically well differentiated in various geographical regions of Iran. To appraise the hypothetical evolutionary scenario, further study is needed to analyze concatenated mitogenomes and as well a panel of single locus nuclear markers should be considered in wider areas of Iran and neighboring countries. Copyright © 2016 Elsevier B.V. All rights reserved.
Significant association of full-thickness rotator cuff tears and estrogen-related receptor-β (ESRRB).

PubMed

Teerlink, Craig C; Cannon-Albright, Lisa A; Tashjian, Robert Z

2015-02-01

The precise etiology of rotator cuff disease is unknown, but prior evidence suggests a role for genetic factors. Variants of estrogen-related receptor-β (ESRRB) have been previously associated with rotator cuff disease. The purpose of the present study was to confirm the association between multiple candidate genes, including ESRRB, and rotator cuff disease in an independent set of patients with rotator cuff tear. The Illumina 5M (Illumina Inc, San Diego, CA, USA) single nucleotide polymorphism (SNP) platform was used to genotype 175 patients with rotator cuff tear. Genotypes were used to select a set of 2595 genetically matched Caucasian controls available from the Illumina iControls database. Tests of association were performed with Genome-wide Efficient Mixed Model Association (GEMMA) software at 69 SNPs that fell within 20 kb of 6 candidate genes (DEFB1, DENND2C, ESRRB, FGF3, FGF10, and FGFR1). Tests of association revealed 1 significantly associated SNP occurring in ESRRB (rs17583842; P = 4.4E-4). Another SNP within ESRRB (rs7157192) had a nominal P value of 7.8E-3. FastPHASE software estimated 2 frequent haplotypes among 54 individuals who carried both risk alleles at these 2 SNPs. The first haplotype had a frequency of 13.9% (n = 15) in risk-allele carriers and only 2.2% in controls (odds ratio, 6.9; 95% confidence interval, 3.9-2.2). The second haplotype had a frequency of 12.9% in risk-allele carriers and only 2.7% in controls (odds ratio, 5.3; 95% confidence interval, 3.0-9.5). The significant association and the presence of high-risk haplotypes identified in the ESRRB gene confirm the association of variants in ESRRB and rotator cuff disease. Copyright © 2015 Journal of Shoulder and Elbow Surgery Board of Trustees. All rights reserved.
Age and origin of two common MLH1 mutations predisposing to hereditary colon cancer.

PubMed

Moisio, A L; Sistonen, P; Weissenbach, J; de la Chapelle, A; Peltomäki, P

1996-12-01

Two mutations in the DNA mismatch repair gene MLH1, referred to as mutations 1 and 2, are frequent among Finnish kindreds with hereditary nonpolyposis colorectal cancer (HNPCC). In order to assess the ages and origins of these mutations, we constructed a map of 15 microsatellite markers around MLH1 and used this information in haplotype analyses of 19 kindreds with mutation 1 and 6 kindreds with mutation 2. All kindreds with mutation 1 showed a single allele for the intragenic marker D3S1611 that was not observed on any unaffected chromosome. They also shared portions of a haplotype of 4-15 markers encompassing 2.0-19.0 cM around MLH1. All kindreds with mutation 2 shared another allele for D3S1611 and a conserved haplotype of 5-14 markers spanning 2.0-15.0 cM around MLH1. The degree of haplotype conservation was used to estimate the ages of these two mutations. While some recessive disease genes have been estimated to have existed and spread for as long as thousands of generations worldwide and hundreds of generations in the Finnish population, our analyses suggest that the spread of mutation 1 started 16-43 generations (400-1,075 years) ago and that of mutation 2 some 5-21 generations (125-525 years) ago. These datings are compatible with our genealogical results identifying a common ancestor born in the 16th and 18th century, respectively. Overall, our results indicate that all Finnish kindreds studied to date showing either mutation 1 or mutation 2 are due to single ancestral founding mutations relatively recent in origin in the population. Alternatively, the mutations arose elsewhere earlier and were introduced in Finland more recently.
Analysis of the mitochondrial genome of cheetahs (Acinonyx jubatus) with neurodegenerative disease.

PubMed

Burger, Pamela A; Steinborn, Ralf; Walzer, Christian; Petit, Thierry; Mueller, Mathias; Schwarzenberger, Franz

2004-08-18

The complete mitochondrial genome of Acinonyx jubatus was sequenced and mitochondrial DNA (mtDNA) regions were screened for polymorphisms as candidates for the cause of a neurodegenerative demyelinating disease affecting captive cheetahs. The mtDNA reference sequences were established on the basis of the complete sequences of two diseased and two nondiseased animals as well as partial sequences of 26 further individuals. The A. jubatus mitochondrial genome is 17,047-bp long and shows a high sequence similarity (91%) to the domestic cat. Based on single nucleotide polymorphisms (SNPs) in the control region (CR) and pedigree information, the 18 myelopathic and 12 non-myelopathic cheetahs included in this study were classified into haplotypes I, II and III. In view of the phenotypic comparability of the neurodegenerative disease observed in cheetahs and human mtDNA-associated diseases, specific coding regions including the tRNAs leucine UUR, lysine, serine UCN, and partial complex I and V sequences were screened. We identified a heteroplasmic and a homoplasmic SNP at codon 507 in the subunit 5 (MTND5) of complex I. The heteroplasmic haplotype I-specific valine to methionine substitution represents a nonconservative amino acid change and was found in 11 myelopathic and eight non-myelopathic cheetahs with levels ranging from 29% to 79%. The homoplasmic conservative amino acid substitution valine to alanine was identified in two myelopathic animals of haplotype II. In addition, a synonymous SNP in the codon 76 of the MTND4L gene was found in the single haplotype III animal. The amino acid exchanges in the MTND5 gene were not associated with the occurrence of neurodegenerative disease in captive cheetahs.
Y-Chromosome Markers for the Red Fox.

PubMed

Rando, Halie M; Stutchman, Jeremy T; Bastounes, Estelle R; Johnson, Jennifer L; Driscoll, Carlos A; Barr, Christina S; Trut, Lyudmila N; Sacks, Benjamin N; Kukekova, Anna V

2017-09-01

The de novo assembly of the red fox (Vulpes vulpes) genome has facilitated the development of genomic tools for the species. Efforts to identify the population history of red foxes in North America have previously been limited by a lack of information about the red fox Y-chromosome sequence. However, a megabase of red fox Y-chromosome sequence was recently identified over 2 scaffolds in the reference genome. Here, these scaffolds were scanned for repeated motifs, revealing 194 likely microsatellites. Twenty-three of these loci were selected for primer development and, after testing, produced a panel of 11 novel markers that were analyzed alongside 2 markers previously developed for the red fox from dog Y-chromosome sequence. The markers were genotyped in 76 male red foxes from 4 populations: 7 foxes from Newfoundland (eastern Canada), 12 from Maryland (eastern United States), and 9 from the island of Great Britain, as well as 48 foxes of known North American origin maintained on an experimental farm in Novosibirsk, Russia. The full marker panel revealed 22 haplotypes among these red foxes, whereas the 2 previously known markers alone would have identified only 10 haplotypes. The haplotypes from the 4 populations clustered primarily by continent, but unidirectional gene flow from Great Britain and farm populations may influence haplotype diversity in the Maryland population. The development of new markers has increased the resolution at which red fox Y-chromosome diversity can be analyzed and provides insight into the contribution of males to red fox population diversity and patterns of phylogeography. © The American Genetic Association 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Genetic determinants and stroke in children with sickle cell disease.

PubMed

Rodrigues, Daniela O W; Ribeiro, Luiz C; Sudário, Lysla C; Teixeira, Maria T B; Martins, Marina L; Pittella, Anuska M O L; Junior, Irtis de O Fernandes

To verify genetic determinants associated with stroke in children with sickle cell disease (SCD). Prospective cohort with 110 children submitted to neonatal screening by the Neonatal Screening Program, between 1998 and 2007, with SCD diagnosis, followed at a regional reference public service for hemoglobinopathies. The analyzed variables were type of hemoglobinopathy, gender, coexistence with alpha thalassemia (α-thal), haplotypes of the beta globin chain cluster, and stroke. The final analysis was conducted with 66 children with sickle cell anemia (SCA), using the chi-squared test in the program SPSS ® version 14.0. Among children with SCD, 60% had SCA. The prevalence of coexistence with α-thal was 30.3% and the Bantu haplotype (CAR) was identified in 89.2%. The incidence of stroke was significantly higher in those with SCA (27.3% vs. 2.3%; p=0.001) and males (24.1% vs. 9.6%; p=0.044). The presence of α-thal (p=0.196), the CAR haplotype (p=0.543), and socioeconomic factors were not statistically significant in association with the occurrence of stroke. There is a high incidence of stroke in male children and in children with SCA. Coexistence with α-thal and haplotypes of the beta globin chain cluster did not show any significant association with stroke. The heterogeneity between previously evaluated populations, the non-reproducibility between studies, and the need to identify factors associated with stroke in patients with SCA indicate the necessity of conducting further research to demonstrate the relevance of genetic factors in stroke related to SCD. Copyright © 2016 Sociedade Brasileira de Pediatria. Published by Elsevier Editora Ltda. All rights reserved.
Xeroderma pigmentosum genes and melanoma risk.

PubMed

Paszkowska-Szczur, K; Scott, R J; Serrano-Fernandez, P; Mirecka, A; Gapska, P; Górski, B; Cybulski, C; Maleszka, R; Sulikowski, M; Nagay, L; Lubinski, J; Dębniak, T

2013-09-01

Xeroderma pigmentosum is a rare autosomal recessive disease that is associated with a severe deficiency in nucleotide excision repair. The presence of a distinct the nucleotide excision repair (NER) mutation signature in melanoma suggests that perturbations in this critical repair process are likely to be involved with disease risk. We hypothesized that persons with polymorphic NER gene(s) are likely to have reduced NER activity and are consequently at an increased risk of melanoma development. We assessed the association between 94 SNPs within seven XP genes (XPA-XPG) and the melanoma risk in the Polish population. We genotyped 714 unselected melanoma patients and 1,841 healthy adults to determine if there were any polymorphisms differentially represented in the disease group. We found that a significantly decreased risk of melanoma was associated with the Xeroderma pigmentosum complementation (XPC) rs2228000_CT genotype (odds ratio [OR] = 0.15; p < 0.001) and the rs2228000_TT genotype (OR = 0.11; p < 0.001) compared to the reference genotype. Haplotype analysis within XPC revealed the rs2228001_A + G1475A_G + G2061A_A + rs2228000_T + rs3731062_C haplotype (OR = 0.26; p < 0.05) was associated with a significantly decreased disease risk. The haplotype analysis within the Xeroderma pigmentosum group D (XPD) showed a modest association between two haplotypes and a decrease in melanoma risk. There were no major differences between the prevalence of the XP polymorphisms among young or older patients with melanoma. Linkage disequilibrium of XPC: rs2228001, G1475A, G2061A, rs2228000 and rs3731062 was found. The data from our study support the notion that only XPC and XPD genes are associated with melanoma susceptibility. Copyright © 2013 UICC.
HAPRAP: a haplotype-based iterative method for statistical fine mapping using GWAS summary statistics.

PubMed

Zheng, Jie; Rodriguez, Santiago; Laurin, Charles; Baird, Denis; Trela-Larsen, Lea; Erzurumluoglu, Mesut A; Zheng, Yi; White, Jon; Giambartolomei, Claudia; Zabaneh, Delilah; Morris, Richard; Kumari, Meena; Casas, Juan P; Hingorani, Aroon D; Evans, David M; Gaunt, Tom R; Day, Ian N M

2017-01-01

Fine mapping is a widely used approach for identifying the causal variant(s) at disease-associated loci. Standard methods (e.g. multiple regression) require individual level genotypes. Recent fine mapping methods using summary-level data require the pairwise correlation coefficients ([Formula: see text]) of the variants. However, haplotypes rather than pairwise [Formula: see text], are the true biological representation of linkage disequilibrium (LD) among multiple loci. In this article, we present an empirical iterative method, HAPlotype Regional Association analysis Program (HAPRAP), that enables fine mapping using summary statistics and haplotype information from an individual-level reference panel. Simulations with individual-level genotypes show that the results of HAPRAP and multiple regression are highly consistent. In simulation with summary-level data, we demonstrate that HAPRAP is less sensitive to poor LD estimates. In a parametric simulation using Genetic Investigation of ANthropometric Traits height data, HAPRAP performs well with a small training sample size (N < 2000) while other methods become suboptimal. Moreover, HAPRAP's performance is not affected substantially by single nucleotide polymorphisms (SNPs) with low minor allele frequencies. We applied the method to existing quantitative trait and binary outcome meta-analyses (human height, QTc interval and gallbladder disease); all previous reported association signals were replicated and two additional variants were independently associated with human height. Due to the growing availability of summary level data, the value of HAPRAP is likely to increase markedly for future analyses (e.g. functional prediction and identification of instruments for Mendelian randomization). The HAPRAP package and documentation are available at http://apps.biocompute.org.uk/haprap/ CONTACT: : jie.zheng@bristol.ac.uk or tom.gaunt@bristol.ac.ukSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
Detecting local haplotype sharing and haplotype association

USDA-ARS?s Scientific Manuscript database

A novel haplotype association method is presented, and its power is demonstrated. Relying on a statistical model for linkage disequilibrium (LD), the method first infers ancestral haplotypes and their loadings at each marker for each individual. The loadings are then used to quantify local haplotype...
TUMOR HAPLOTYPE ASSEMBLY ALGORITHMS FOR CANCER GENOMICS

PubMed Central

AGUIAR, DEREK; WONG, WENDY S.W.; ISTRAIL, SORIN

2014-01-01

The growing availability of inexpensive high-throughput sequence data is enabling researchers to sequence tumor populations within a single individual at high coverage. But, cancer genome sequence evolution and mutational phenomena like driver mutations and gene fusions are difficult to investigate without first reconstructing tumor haplotype sequences. Haplotype assembly of single individual tumor populations is an exceedingly difficult task complicated by tumor haplotype heterogeneity, tumor or normal cell sequence contamination, polyploidy, and complex patterns of variation. While computational and experimental haplotype phasing of diploid genomes has seen much progress in recent years, haplotype assembly in cancer genomes remains uncharted territory. In this work, we describe HapCompass-Tumor a computational modeling and algorithmic framework for haplotype assembly of copy number variable cancer genomes containing haplotypes at different frequencies and complex variation. We extend our polyploid haplotype assembly model and present novel algorithms for (1) complex variations, including copy number changes, as varying numbers of disjoint paths in an associated graph, (2) variable haplotype frequencies and contamination, and (3) computation of tumor haplotypes using simple cycles of the compass graph which constrain the space of haplotype assembly solutions. The model and algorithm are implemented in the software package HapCompass-Tumor which is available for download from http://www.brown.edu/Research/Istrail_Lab/. PMID:24297529
CLSI-based transference of the CALIPER database of pediatric reference intervals from Abbott to Beckman, Ortho, Roche and Siemens Clinical Chemistry Assays: direct validation using reference samples from the CALIPER cohort.

PubMed

Estey, Mathew P; Cohen, Ashley H; Colantonio, David A; Chan, Man Khun; Marvasti, Tina Binesh; Randell, Edward; Delvin, Edgard; Cousineau, Jocelyne; Grey, Vijaylaxmi; Greenway, Donald; Meng, Qing H; Jung, Benjamin; Bhuiyan, Jalaluddin; Seccombe, David; Adeli, Khosrow

2013-09-01

The CALIPER program recently established a comprehensive database of age- and sex-stratified pediatric reference intervals for 40 biochemical markers. However, this database was only directly applicable for Abbott ARCHITECT assays. We therefore sought to expand the scope of this database to biochemical assays from other major manufacturers, allowing for a much wider application of the CALIPER database. Based on CLSI C28-A3 and EP9-A2 guidelines, CALIPER reference intervals were transferred (using specific statistical criteria) to assays performed on four other commonly used clinical chemistry platforms including Beckman Coulter DxC800, Ortho Vitros 5600, Roche Cobas 6000, and Siemens Vista 1500. The resulting reference intervals were subjected to a thorough validation using 100 reference specimens (healthy community children and adolescents) from the CALIPER bio-bank, and all testing centers participated in an external quality assessment (EQA) evaluation. In general, the transferred pediatric reference intervals were similar to those established in our previous study. However, assay-specific differences in reference limits were observed for many analytes, and in some instances were considerable. The results of the EQA evaluation generally mimicked the similarities and differences in reference limits among the five manufacturers' assays. In addition, the majority of transferred reference intervals were validated through the analysis of CALIPER reference samples. This study greatly extends the utility of the CALIPER reference interval database which is now directly applicable for assays performed on five major analytical platforms in clinical use, and should permit the worldwide application of CALIPER pediatric reference intervals. Copyright © 2013 The Canadian Society of Clinical Chemists. Published by Elsevier Inc. All rights reserved.

Singapore Genome Variation Project: a haplotype map of three Southeast Asian populations.

PubMed

Teo, Yik-Ying; Sim, Xueling; Ong, Rick T H; Tan, Adrian K S; Chen, Jieming; Tantoso, Erwin; Small, Kerrin S; Ku, Chee-Seng; Lee, Edmund J D; Seielstad, Mark; Chia, Kee-Seng

2009-11-01

The Singapore Genome Variation Project (SGVP) provides a publicly available resource of 1.6 million single nucleotide polymorphisms (SNPs) genotyped in 268 individuals from the Chinese, Malay, and Indian population groups in Southeast Asia. This online database catalogs information and summaries on genotype and phased haplotype data, including allele frequencies, assessment of linkage disequilibrium (LD), and recombination rates in a format similar to the International HapMap Project. Here, we introduce this resource and describe the analysis of human genomic variation upon agglomerating data from the HapMap and the Human Genome Diversity Project, providing useful insights into the population structure of the three major population groups in Asia. In addition, this resource also surveyed across the genome for variation in regional patterns of LD between the HapMap and SGVP populations, and for signatures of positive natural selection using two well-established metrics: iHS and XP-EHH. The raw and processed genetic data, together with all population genetic summaries, are publicly available for download and browsing through a web browser modeled with the Generic Genome Browser.
Singapore Genome Variation Project: A haplotype map of three Southeast Asian populations

PubMed Central

Teo, Yik-Ying; Sim, Xueling; Ong, Rick T.H.; Tan, Adrian K.S.; Chen, Jieming; Tantoso, Erwin; Small, Kerrin S.; Ku, Chee-Seng; Lee, Edmund J.D.; Seielstad, Mark; Chia, Kee-Seng

2009-01-01

The Singapore Genome Variation Project (SGVP) provides a publicly available resource of 1.6 million single nucleotide polymorphisms (SNPs) genotyped in 268 individuals from the Chinese, Malay, and Indian population groups in Southeast Asia. This online database catalogs information and summaries on genotype and phased haplotype data, including allele frequencies, assessment of linkage disequilibrium (LD), and recombination rates in a format similar to the International HapMap Project. Here, we introduce this resource and describe the analysis of human genomic variation upon agglomerating data from the HapMap and the Human Genome Diversity Project, providing useful insights into the population structure of the three major population groups in Asia. In addition, this resource also surveyed across the genome for variation in regional patterns of LD between the HapMap and SGVP populations, and for signatures of positive natural selection using two well-established metrics: iHS and XP-EHH. The raw and processed genetic data, together with all population genetic summaries, are publicly available for download and browsing through a web browser modeled with the Generic Genome Browser. PMID:19700652
Beta-globin gene cluster haplotypes of Amerindian populations from the Brazilian Amazon region.

PubMed

Guerreiro, J F; Figueiredo, M S; Zago, M A

1994-01-01

We have determined the beta-globin cluster haplotypes for 80 Indians from four Brazilian Amazon tribes: Kayapó, Wayampí, Wayana-Apalaí, and Arára. The results are analyzed together with 20 Yanomámi previously studied. From 2 to 4 different haplotypes were identified for each tribe, and 7 of the possible 32 haplotypes were found in a sample of 172 chromosomes for which the beta haplotypes were directly determined or derived from family studies. The haplotype distribution does not differ significantly among the five populations. The two most common haplotypes in all tribes were haplotypes 2 and 6, with average frequencies of 0.843 and 0.122, respectively. The genetic affinities between Brazilian Indians and other human populations were evaluated by estimates of genetic distance based on haplotype data. The lowest values were observed in relation to Asians, especially Chinese, Polynesians, and Micronesians.
Mitochondrial haplotype variation and phylogeography of Iberian brown trout populations.

PubMed

MacHordom, A; Suárez, J; Almodóvar, A; Bautista, J M

2000-09-01

The biogeographical distribution of brown trout mitochondrial DNA haplotypes throughout the Iberian Peninsula was established by polymerase chain reaction-restriction fragment polymorphism analysis. The study of 507 specimens from 58 localities representing eight widely separated Atlantic-slope (north and west Iberian coasts) and six Mediterranean drainage systems served to identify five main groups of mitochondrial haplotypes: (i) haplotypes corresponding to non-native, hatchery-reared brown trout that were widely distributed but also found in wild populations of northern Spain (Cantabrian slope); (ii) a widespread Atlantic haplotype group; (iii) a haplotype restricted to the Duero Basin; (iv) a haplotype shown by southern Iberian populations; and (v) a Mediterranean haplotype. The Iberian distribution of these haplotypes reflects both the current fishery management policy of introducing non-native brown trout, and Messinian palaeobiogeography. Our findings complement and extend previous allozyme studies on Iberian brown trout and improve present knowledge of glacial refugia and postglacial movement of brown trout lineages.
How Have Self-Incompatibility Haplotypes Diversified? Generation of New Haplotypes during the Evolution of Self-Incompatibility from Self-Compatibility.

PubMed

Sakai, Satoki

2016-08-01

I developed a gametophytic self-incompatibility (SI) model to study the conditions leading to diversification in SI haplotypes. In the model, the SI system is assumed to be incomplete, and the pollen expressing a given specificity is not fully rejected by the pistils expressing the same specificity. I also assumed that mutations can occur that enhance the rejection of pollen by pistils with the same haplotype variant and reduce rejection by pistils with other variants in the same haplotype. I found that if such mutations occur, the new haplotypes (mutant variants) can stably coexist with the ancestral haplotype in which the mutant arose. This is because pollen bearing the new haplotype is most strongly rejected by pistils bearing the same new haplotype among the pistils in the population; hence, negative frequency-dependent selection prevents their fixation. I also performed simulations and found that the nearly complete SI system evolves from completely self-compatible populations and that SI haplotypes can increase to about 40-50 within a few thousand generations. On the basis of my findings, I propose that diversification of SI haplotypes occurred during the evolution of SI from self-compatibility.
The Trichoptera barcode initiative: a strategy for generating a species-level Tree of Life.

PubMed

Zhou, Xin; Frandsen, Paul B; Holzenthal, Ralph W; Beet, Clare R; Bennett, Kristi R; Blahnik, Roger J; Bonada, Núria; Cartwright, David; Chuluunbat, Suvdtsetseg; Cocks, Graeme V; Collins, Gemma E; deWaard, Jeremy; Dean, John; Flint, Oliver S; Hausmann, Axel; Hendrich, Lars; Hess, Monika; Hogg, Ian D; Kondratieff, Boris C; Malicky, Hans; Milton, Megan A; Morinière, Jérôme; Morse, John C; Mwangi, François Ngera; Pauls, Steffen U; Gonzalez, María Razo; Rinne, Aki; Robinson, Jason L; Salokannel, Juha; Shackleton, Michael; Smith, Brian; Stamatakis, Alexandros; StClair, Ros; Thomas, Jessica A; Zamora-Muñoz, Carmen; Ziesmann, Tanja; Kjer, Karl M

2016-09-05

DNA barcoding was intended as a means to provide species-level identifications through associating DNA sequences from unknown specimens to those from curated reference specimens. Although barcodes were not designed for phylogenetics, they can be beneficial to the completion of the Tree of Life. The barcode database for Trichoptera is relatively comprehensive, with data from every family, approximately two-thirds of the genera, and one-third of the described species. Most Trichoptera, as with most of life's species, have never been subjected to any formal phylogenetic analysis. Here, we present a phylogeny with over 16 000 unique haplotypes as a working hypothesis that can be updated as our estimates improve. We suggest a strategy of implementing constrained tree searches, which allow larger datasets to dictate the backbone phylogeny, while the barcode data fill out the tips of the tree. We also discuss how this phylogeny could be used to focus taxonomic attention on ambiguous species boundaries and hidden biodiversity. We suggest that systematists continue to differentiate between 'Barcode Index Numbers' (BINs) and 'species' that have been formally described. Each has utility, but they are not synonyms. We highlight examples of integrative taxonomy, using both barcodes and morphology for species description.This article is part of the themed issue 'From DNA barcodes to biomes'. © 2016 The Authors.
Extensive geographical and social structure in the paternal lineages of Saudi Arabia revealed by analysis of 27 Y-STRs.

PubMed

Khubrani, Yahya M; Wetton, Jon H; Jobling, Mark A

2018-03-01

Saudi Arabia's indigenous population is organized into patrilineal descent groups, but to date, little has been done to characterize its population structure, in particular with respect to the male-specific region of the Y chromosome. We have used the 27-STR Yfiler ® Plus kit to generate haplotypes in 597 unrelated Saudi males, classified into five geographical regions (North, South, Central, East and West). Overall, Yfiler ® Plus provides a good discrimination capacity of 95.3%, but this is greatly reduced (74.7%) when considering the reduced Yfiler ® set of 17 Y-STRs, justifying the use of the expanded set of markers in this population. Comparison of the five geographical divisions reveals striking differences, with low diversity and similar haplotype spectra in the Central and Northern regions, and high diversity and similar haplotype spectra in the East and West. These patterns likely reflect the geographical isolation of the desert heartland of the peninsula, and the proximity to the sea of the Eastern and Western areas, and consequent historical immigration. We predicted haplogroups from Y-STR haplotypes, testing the performance of prediction by using a large independent set of Saudi Arabian Y-STR + Y-SNP data. Prediction indicated predominance (71%) of haplogroup J1, which was significantly more common in Central, Northern and Southern groups than in East and West, and formed a star-like expansion cluster in a median-joining network with an estimated age of ∼2800 years. Most of our 597 participants were sampled within Saudi Arabia itself, but ∼16% were sampled in the UK. Despite matching these two groups by home sub-region, we observed significant differences in haplotype and predicted haplogroup constitutions overall, and for most sub-regions individually. This suggests social structure influencing the probability of leaving Saudi Arabia, correlated with different Y-chromosome compositions. The UK-recruited sample is an inappropriate proxy for Saudi Arabia generally, and caution is needed when considering expatriate groups as representative of country of origin. Our study shows the importance of geographical and social structuring that may affect the utility of forensic databases and the interpretation of Y-STR profiles. Copyright © 2017 Elsevier B.V. All rights reserved.
RTEL1 tagging SNPs and haplotypes were associated with glioma development

PubMed Central

2013-01-01

Abstract As glioma ranks as the first most prevalent solid tumors in primary central nervous system, certain single-nucleotide polymorphisms (SNPs) may be related to increased glioma risk, and have implications in carcinogenesis. The present case–control study was carried out to elucidate how common variants contribute to glioma susceptibility. Ten candidate tagging SNPs (tSNPs) were selected from seven genes whose polymorphisms have been proven by classical literatures and reliable databases to be tended to relate with gliomas, and with the minor allele frequency (MAF) > 5% in the HapMap Asian population. The selected tSNPs were genotyped in 629 glioma patients and 645 controls from a Han Chinese population using the multiplexed SNP MassEXTEND assay calibrated. Two significant tSNPs in RTEL1 gene were observed to be associated with glioma risk (rs6010620, P = 0.0016, OR: 1.32, 95% CI: 1.11-1.56; rs2297440, P = 0.001, OR: 1.33, 95% CI: 1.12-1.58) by χ2 test. It was identified the genotype “GG” of rs6010620 acted as the protective genotype for glioma (OR, 0.46; 95% CI, 0.31-0.7; P = 0.0002), while the genotype “CC” of rs2297440 as the protective genotype in glioma (OR, 0.47; 95% CI, 0.31-0.71; P = 0.0003). Furthermore, haplotype “GCT” in RTEL1 gene was found to be associated with risk of glioma (OR, 0.7; 95% CI, 0.57-0.86; Fisher’s P = 0.0005; Pearson’s P = 0.0005), and haplotype “ATT” was detected to be associated with risk of glioma (OR, 1.32; 95% CI, 1.12-1.57; Fisher’s P = 0.0013; Pearson’s P = 0.0013). Two single variants, the genotypes of “GG” of rs6010620 and “CC” of rs2297440 (rs6010620 and rs2297440) in the RTEL1 gene, together with two haplotypes of GCT and ATT, were identified to be associated with glioma development. And it might be used to evaluate the glioma development risks to screen the above RTEL1 tagging SNPs and haplotypes. Virtual slides The virtual slides for this article can be found here: http://www.diagnosticpathology.diagnomx.eu/vs/1993021136961998 PMID:23683922
The effect of using genealogy-based haplotypes for genomic prediction

PubMed Central

2013-01-01

Background Genomic prediction uses two sources of information: linkage disequilibrium between markers and quantitative trait loci, and additive genetic relationships between individuals. One way to increase the accuracy of genomic prediction is to capture more linkage disequilibrium by regression on haplotypes instead of regression on individual markers. The aim of this study was to investigate the accuracy of genomic prediction using haplotypes based on local genealogy information. Methods A total of 4429 Danish Holstein bulls were genotyped with the 50K SNP chip. Haplotypes were constructed using local genealogical trees. Effects of haplotype covariates were estimated with two types of prediction models: (1) assuming that effects had the same distribution for all haplotype covariates, i.e. the GBLUP method and (2) assuming that a large proportion (π) of the haplotype covariates had zero effect, i.e. a Bayesian mixture method. Results About 7.5 times more covariate effects were estimated when fitting haplotypes based on local genealogical trees compared to fitting individuals markers. Genealogy-based haplotype clustering slightly increased the accuracy of genomic prediction and, in some cases, decreased the bias of prediction. With the Bayesian method, accuracy of prediction was less sensitive to parameter π when fitting haplotypes compared to fitting markers. Conclusions Use of haplotypes based on genealogy can slightly increase the accuracy of genomic prediction. Improved methods to cluster the haplotypes constructed from local genealogy could lead to additional gains in accuracy. PMID:23496971
The effect of using genealogy-based haplotypes for genomic prediction.

PubMed

Edriss, Vahid; Fernando, Rohan L; Su, Guosheng; Lund, Mogens S; Guldbrandtsen, Bernt

2013-03-06

Genomic prediction uses two sources of information: linkage disequilibrium between markers and quantitative trait loci, and additive genetic relationships between individuals. One way to increase the accuracy of genomic prediction is to capture more linkage disequilibrium by regression on haplotypes instead of regression on individual markers. The aim of this study was to investigate the accuracy of genomic prediction using haplotypes based on local genealogy information. A total of 4429 Danish Holstein bulls were genotyped with the 50K SNP chip. Haplotypes were constructed using local genealogical trees. Effects of haplotype covariates were estimated with two types of prediction models: (1) assuming that effects had the same distribution for all haplotype covariates, i.e. the GBLUP method and (2) assuming that a large proportion (π) of the haplotype covariates had zero effect, i.e. a Bayesian mixture method. About 7.5 times more covariate effects were estimated when fitting haplotypes based on local genealogical trees compared to fitting individuals markers. Genealogy-based haplotype clustering slightly increased the accuracy of genomic prediction and, in some cases, decreased the bias of prediction. With the Bayesian method, accuracy of prediction was less sensitive to parameter π when fitting haplotypes compared to fitting markers. Use of haplotypes based on genealogy can slightly increase the accuracy of genomic prediction. Improved methods to cluster the haplotypes constructed from local genealogy could lead to additional gains in accuracy.
Recombinant structures expand and contract inter and intragenic diversification at the KIR locus

PubMed Central

2013-01-01

Background The human KIR genes are arranged in at least six major gene-content haplotypes, all of which are combinations of four centromeric and two telomeric motifs. Several less frequent or minor haplotypes also exist, including insertions, deletions, and hybridization of KIR genes derived from the major haplotypes. These haplotype structures and their concomitant linkage disequilibrium among KIR genes suggest that more meaningful correlative data from studies of KIR genetics and complex disease may be achieved by measuring haplotypes of the KIR region in total. Results Towards that end, we developed a KIR haplotyping method that reports unambiguous combinations of KIR gene-content haplotypes, including both phase and copy number for each KIR. A total of 37 different gene content haplotypes were detected from 4,512 individuals and new sequence data was derived from haplotypes where the detailed structure was not previously available. Conclusions These new structures suggest a number of specific recombinant events during the course of KIR evolution, and add to an expanding diversity of potential new KIR haplotypes derived from gene duplication, deletion, and hybridization. PMID:23394822
User`s and reference guide to the INEL RML/analytical radiochemistry sample tracking database version 1.00

DOE Office of Scientific and Technical Information (OSTI.GOV)

Femec, D.A.

This report discusses the sample tracking database in use at the Idaho National Engineering Laboratory (INEL) by the Radiation Measurements Laboratory (RML) and Analytical Radiochemistry. The database was designed in-house to meet the specific needs of the RML and Analytical Radiochemistry. The report consists of two parts, a user`s guide and a reference guide. The user`s guide presents some of the fundamentals needed by anyone who will be using the database via its user interface. The reference guide describes the design of both the database and the user interface. Briefly mentioned in the reference guide are the code-generating tools, CREATE-SCHEMAmore » and BUILD-SCREEN, written to automatically generate code for the database and its user interface. The appendices contain the input files used by the these tools to create code for the sample tracking database. The output files generated by these tools are also included in the appendices.« less
BCL11A Enhancer Haplotypes and Fetal Hemoglobin in Sickle Cell Anemia

PubMed Central

Sebastiani, P.; Farrell, J.J.; Alsultan, A.; Wang, S.; Edward, H. L.; Shappell, H.; Bae, H.; Milton, J. N.; Baldwin, C.T.; Al-Rubaish, A.M.; Naserullah, Z.; Al-Muhanna, F.; Alsuliman, A.; Patra, P. K.; Farrer, L.A.; Ngo, D.; Vathipadiekal, V.; Chui, D.H.K.; Al-Ali, A.K.; Steinberg, M.H.

2015-01-01

Background Fetal hemoglobin (HbF) levels in sickle cell anemia patients vary. We genotyped polymorphisms in the erythroid-specific enhancer of BCL11A to see if they might account for the very high HbF associated with the Arab-Indian (AI) haplotype and Benin haplotype of sickle cell anemia. Methods and Results Six BCL112A enhancer SNPs and their haplotypes were studied in Saudi Arabs from the Eastern Province and Indian patients with AI haplotype (HbF ~20%), African Americans (HbF ~7%), and Saudi Arabs from the Southwestern Province (HbF ~12%). Four SNPs (rs1427407, rs6706648, rs6738440, and rs7606173) and their haplotypes were consistently associated with HbF levels. The distributions of haplotypes differ in the 3 cohorts but not their genetic effects: the haplotype TCAG was associated with the lowest HbF level and the haplotype GTAC was associated with the highest HbF level and differences in HbF levels between carriers of these haplotypes in all cohorts was approximately 6%. Conclusions Common HbF BCL11A enhancer haplotypes in patients with African origin and AI sickle cell anemia have similar effects on HbF but they do not explain their differences in HbF. PMID:25703683
FMR1 CGG repeat expansion mutation detection and linked haplotype analysis for reliable and accurate preimplantation genetic diagnosis of fragile X syndrome.

PubMed

Rajan-Babu, Indhu-Shree; Lian, Mulias; Cheah, Felicia S H; Chen, Min; Tan, Arnold S C; Prasath, Ethiraj B; Loh, Seong Feei; Chong, Samuel S

2017-07-19

Fragile X mental retardation 1 (FMR1) full-mutation expansion causes fragile X syndrome. Trans-generational fragile X syndrome transmission can be avoided by preimplantation genetic diagnosis (PGD). We describe a robust PGD strategy that can be applied to virtually any couple at risk of transmitting fragile X syndrome. This novel strategy utilises whole-genome amplification, followed by triplet-primed polymerase chain reaction (TP-PCR) for robust detection of expanded FMR1 alleles, in parallel with linked multi-marker haplotype analysis of 13 highly polymorphic microsatellite markers located within 1 Mb of the FMR1 CGG repeat, and the AMELX/Y dimorphism for gender identification. The assay was optimised and validated on single lymphoblasts isolated from fragile X reference cell lines, and applied to a simulated PGD case and a clinical in vitro fertilisation (IVF)-PGD case. In the simulated PGD case, definitive diagnosis of the expected results was achieved for all 'embryos'. In the clinical IVF-PGD case, delivery of a healthy baby girl was achieved after transfer of an expansion-negative blastocyst. FMR1 TP-PCR reliably detects presence of expansion mutations and obviates reliance on informative normal alleles for determining expansion status in female embryos. Together with multi-marker haplotyping and gender determination, misdiagnosis and diagnostic ambiguity due to allele dropout is minimised, and couple-specific assay customisation can be avoided.
A genome-wide SNP scan accelerates trait-regulatory genomic loci identification in chickpea

PubMed Central

Kujur, Alice; Bajaj, Deepak; Upadhyaya, Hari D.; Das, Shouvik; Ranjan, Rajeev; Shree, Tanima; Saxena, Maneesha S.; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C.L.L.; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.

2015-01-01

We identified 44844 high-quality SNPs by sequencing 92 diverse chickpea accessions belonging to a seed and pod trait-specific association panel using reference genome- and de novo-based GBS (genotyping-by-sequencing) assays. A GWAS (genome-wide association study) in an association panel of 211, including the 92 sequenced accessions, identified 22 major genomic loci showing significant association (explaining 23–47% phenotypic variation) with pod and seed number/plant and 100-seed weight. Eighteen trait-regulatory major genomic loci underlying 13 robust QTLs were validated and mapped on an intra-specific genetic linkage map by QTL mapping. A combinatorial approach of GWAS, QTL mapping and gene haplotype-specific LD mapping and transcript profiling uncovered one superior haplotype and favourable natural allelic variants in the upstream regulatory region of a CesA-type cellulose synthase (Ca_Kabuli_CesA3) gene regulating high pod and seed number/plant (explaining 47% phenotypic variation) in chickpea. The up-regulation of this superior gene haplotype correlated with increased transcript expression of Ca_Kabuli_CesA3 gene in the pollen and pod of high pod/seed number accession, resulting in higher cellulose accumulation for normal pollen and pollen tube growth. A rapid combinatorial genome-wide SNP genotyping-based approach has potential to dissect complex quantitative agronomic traits and delineate trait-regulatory genomic loci (candidate genes) for genetic enhancement in crop plants, including chickpea. PMID:26058368
The Genetic Basis of Inbreeding Avoidance in House Mice

PubMed Central

Sherborne, Amy L.; Thom, Michael D.; Paterson, Steve; Jury, Francine; Ollier, William E.R.; Stockley, Paula; Beynon, Robert J.; Hurst, Jane L.

2007-01-01

Summary Animals might be able to use highly polymorphic genetic markers to recognize very close relatives and avoid inbreeding [1, 2]. The major histocompatibility complex (MHC) is thought to provide such a marker [1, 3–6] because it influences individual scent in a broad range of vertebrates [6–10]. However, direct evidence is very limited [1, 6, 10, 11]. In house mice (Mus musculus domesticus), the major urinary protein (MUP) gene cluster provides another highly polymorphic scent signal of genetic identity [8, 12–15] that could underlie kin recognition. We demonstrate that wild mice breeding freely in seminatural enclosures show no avoidance of mates with the same MHC genotype when genome-wide similarity is controlled. Instead, inbreeding avoidance is fully explained by a strong deficit in successful matings between mice sharing both MUP haplotypes. Single haplotype sharing is not a good guide to the identification of full sibs, and there was no evidence of behavioral imprinting on maternal MHC or MUP haplotypes. This study, the first to examine wild animals with normal variation in MHC, MUP, and genetic background, demonstrates that mice use self-referent matching of a species-specific [16, 17] polymorphic signal to avoid inbreeding. Recognition of close kin as unsuitable mates might be more variable across species than a generic vertebrate-wide ability to avoid inbreeding based on MHC. PMID:17997307
A potential third Manta Ray species near the Yucatán Peninsula? Evidence for a recently diverged and novel genetic Manta group from the Gulf of Mexico.

PubMed

Hinojosa-Alvarez, Silvia; Walter, Ryan P; Diaz-Jaimes, Pindaro; Galván-Magaña, Felipe; Paig-Tran, E Misty

2016-01-01

We present genetic and morphometric support for a third, distinct, and recently diverged group of Manta ray that appears resident to the Yucatán coastal waters of the Gulf of Mexico. Individuals of the genus Manta from Isla Holbox are markedly different from the other described manta rays in their morphology, habitat preference, and genetic makeup. Herein referred to as the Yucatán Manta Ray, these individuals form two genetically distinct groups: (1) a group of mtDNA haplotypes divergent (0.78%) from the currently recognized Manta birostris and M. alfredi species, and (2) a group possessing mtDNA haplotypes of M. birostris and highly similar haplotypes. The latter suggests the potential for either introgressive hybridization between Yucatán Manta Rays and M. birostris , or the retention of ancestral M. birostris signatures among Yucatán Manta Rays. Divergence of the genetically distinct Yucatán Manta Ray from M. birostris appears quite recent (<100,000 YBP) following fit to an Isolation-with-Migration model, with additional support for asymmetrical gene flow from M. birostris into the Yucatán Manta Ray. Formal naming of the Yucatán Manta Ray cannot yet be assigned until an in-depth taxonomic study and further confirmation of the genetic identity of existing type specimens has been performed.
Sparse Tensor Decomposition for Haplotype Assembly of Diploids and Polyploids.

PubMed

Hashemi, Abolfazl; Zhu, Banghua; Vikalo, Haris

2018-03-21

Haplotype assembly is the task of reconstructing haplotypes of an individual from a mixture of sequenced chromosome fragments. Haplotype information enables studies of the effects of genetic variations on an organism's phenotype. Most of the mathematical formulations of haplotype assembly are known to be NP-hard and haplotype assembly becomes even more challenging as the sequencing technology advances and the length of the paired-end reads and inserts increases. Assembly of haplotypes polyploid organisms is considerably more difficult than in the case of diploids. Hence, scalable and accurate schemes with provable performance are desired for haplotype assembly of both diploid and polyploid organisms. We propose a framework that formulates haplotype assembly from sequencing data as a sparse tensor decomposition. We cast the problem as that of decomposing a tensor having special structural constraints and missing a large fraction of its entries into a product of two factors, U and [Formula: see text]; tensor [Formula: see text] reveals haplotype information while U is a sparse matrix encoding the origin of erroneous sequencing reads. An algorithm, AltHap, which reconstructs haplotypes of either diploid or polyploid organisms by iteratively solving this decomposition problem is proposed. The performance and convergence properties of AltHap are theoretically analyzed and, in doing so, guarantees on the achievable minimum error correction scores and correct phasing rate are established. The developed framework is applicable to diploid, biallelic and polyallelic polyploid species. The code for AltHap is freely available from https://github.com/realabolfazl/AltHap . AltHap was tested in a number of different scenarios and was shown to compare favorably to state-of-the-art methods in applications to haplotype assembly of diploids, and significantly outperforms existing techniques when applied to haplotype assembly of polyploids.
Sex differences in TTC12/ANKK1 haplotype associations with daily tobacco smoking in Black and White Americans.

PubMed

David, Sean P; Mezuk, Briana; Zandi, Peter P; Strong, David; Anthony, James C; Niaura, Raymond; Uhl, George R; Eaton, William W

2010-03-01

The 11q23.1 genomic region has been associated with nicotine dependence in Black and White Americans. By conducting linkage disequilibrium analyses of 7 informative single nucleotide polymorphisms (SNPs) within the tetratricopeptide repeat domain 12 (TTC12)/ankyrin repeat and kinase containing 1 (ANKK1)/dopamine (D2) receptor gene cluster, we identified haplotype block structures in 270 Black and 368 White (n = 638) participants, from the Baltimore Epidemiologic Catchment Area cohort study, spanning the TTC12 and ANKK1 genes consisting of three SNPs (rs2303380-rs4938015-rs11604671). Informative haplotypes were examined for sex-specific associations with daily tobacco smoking initiation and cessation using longitudinal data from 1993-1994 and 2004-2005 interviews. There was a Haplotype x Sex interaction such that Black men possessing the GTG haplotype who were smokers in 1993-2004 were more likely to have stopped smoking by 2004-2005 (55.6% GTG vs. 22.0% other haplotypes), while Black women were less likely to have quit smoking if they possessed the GTG (20.8%) versus other haplotypes (24.0%; p = .028). In Whites, the GTG haplotype (vs. other haplotypes) was associated with lifetime history of daily smoking (smoking initiation; odds ratio = 1.6; 95% CI = 1.1-2.4; p = .013). Moreover, there was a Haplotype x Sex interaction such that there was higher prevalence of smoking initiation with GTG (77.6%) versus other haplotypes (57.0%; p = .043). In 2 different ethnic American populations, we observed man-woman variation in the influence of the rs2303380-rs4938015-rs11604671 GTG haplotype on smoking initiation and cessation. These results should be replicated in larger cohorts to establish the relationship among the rs2303380-rs4938015-rs11604671 haplotype block, sex, and smoking behavior.
Reference Fluid Thermodynamic and Transport Properties Database (REFPROP)

National Institute of Standards and Technology Data Gateway

SRD 23 NIST Reference Fluid Thermodynamic and Transport Properties Database (REFPROP) (PC database for purchase) NIST 23 contains revised data in a Windows version of the database, including 105 pure fluids and allowing mixtures of up to 20 components. The fluids include the environmentally acceptable HFCs, traditional HFCs and CFCs and 'natural' refrigerants like ammonia

Kullback-Leibler divergence for detection of rare haplotype common disease association.

PubMed

Lin, Shili

2015-11-01

Rare haplotypes may tag rare causal variants of common diseases; hence, detection of such rare haplotypes may also contribute to our understanding of complex disease etiology. Because rare haplotypes frequently result from common single-nucleotide polymorphisms (SNPs), focusing on rare haplotypes is much more economical compared with using rare single-nucleotide variants (SNVs) from sequencing, as SNPs are available and 'free' from already amassed genome-wide studies. Further, associated haplotypes may shed light on the underlying disease causal mechanism, a feat unmatched by SNV-based collapsing methods. In recent years, data mining approaches have been adapted to detect rare haplotype association. However, as they rely on an assumed underlying disease model and require the specification of a null haplotype, results can be erroneous if such assumptions are violated. In this paper, we present a haplotype association method based on Kullback-Leibler divergence (hapKL) for case-control samples. The idea is to compare haplotype frequencies for the cases versus the controls by computing symmetrical divergence measures. An important property of such measures is that both the frequencies and logarithms of the frequencies contribute in parallel, thus balancing the contributions from rare and common, and accommodating both deleterious and protective, haplotypes. A simulation study under various scenarios shows that hapKL has well-controlled type I error rates and good power compared with existing data mining methods. Application of hapKL to age-related macular degeneration (AMD) shows a strong association of the complement factor H (CFH) gene with AMD, identifying several individual rare haplotypes with strong signals.
FreeSolv: A database of experimental and calculated hydration free energies, with input files

PubMed Central

Mobley, David L.; Guthrie, J. Peter

2014-01-01

This work provides a curated database of experimental and calculated hydration free energies for small neutral molecules in water, along with molecular structures, input files, references, and annotations. We call this the Free Solvation Database, or FreeSolv. Experimental values were taken from prior literature and will continue to be curated, with updated experimental references and data added as they become available. Calculated values are based on alchemical free energy calculations using molecular dynamics simulations. These used the GAFF small molecule force field in TIP3P water with AM1-BCC charges. Values were calculated with the GROMACS simulation package, with full details given in references cited within the database itself. This database builds in part on a previous, 504-molecule database containing similar information. However, additional curation of both experimental data and calculated values has been done here, and the total number of molecules is now up to 643. Additional information is now included in the database, such as SMILES strings, PubChem compound IDs, accurate reference DOIs, and others. One version of the database is provided in the Supporting Information of this article, but as ongoing updates are envisioned, the database is now versioned and hosted online. In addition to providing the database, this work describes its construction process. The database is available free-of-charge via http://www.escholarship.org/uc/item/6sd403pz. PMID:24928188
Population Structure With Localized Haplotype Clusters

PubMed Central

Browning, Sharon R.; Weir, Bruce S.

2010-01-01

We propose a multilocus version of FST and a measure of haplotype diversity using localized haplotype clusters. Specifically, we use haplotype clusters identified with BEAGLE, which is a program implementing a hidden Markov model for localized haplotype clustering and performing several functions including inference of haplotype phase. We apply this methodology to HapMap phase 3 data. With this haplotype-cluster approach, African populations have highest diversity and lowest divergence from the ancestral population, East Asian populations have lowest diversity and highest divergence, and other populations (European, Indian, and Mexican) have intermediate levels of diversity and divergence. These relationships accord with expectation based on other studies and accepted models of human history. In contrast, the population-specific FST estimates obtained directly from single-nucleotide polymorphisms (SNPs) do not reflect such expected relationships. We show that ascertainment bias of SNPs has less impact on the proposed haplotype-cluster-based FST than on the SNP-based version, which provides a potential explanation for these results. Thus, these new measures of FST and haplotype-cluster diversity provide an important new tool for population genetic analysis of high-density SNP data. PMID:20457877
An analysis of variation in the long-range genomic organization of the human major histocompatibility complex class II region by pulsed-field gel electrophoresis.

PubMed

Dunham, I; Sargent, C A; Dawkins, R L; Campbell, R D

1989-11-01

The class II region of the human major histocompatibility complex in seven common HLA haplotypes has been analyzed using pulsed-field gel electrophoresis, restriction enzymes that cut genomic DNA infrequently, and Southern blotting. This analysis has revealed that there are differences in the amount of DNA present in the DQ and DR subregions dependent on the haplotype. The class II region of the DR3 haplotype spans approximately 750 kb and has the same amount of DNA as the class II region of the DR5 and DR6 haplotypes. However, the DR2 haplotype has approximately 30 kb more DNA within the DR subregion. The DR4 haplotype has an additional approximately 110 kb of DNA within the DQ or DR subregions compared to the DR3, DR5, and DR6 haplotypes. These haplotype-specific differences could have some bearing both on the analysis of disease susceptibility and on the ability of chromosomes possessing different HLA haplotypes to recombine within the DQ/DR subregions.
IRF5 haplotypes demonstrate diverse serological associations which predict serum interferon alpha activity and explain the majority of the genetic association with systemic lupus erythematosus

PubMed Central

Niewold, Timothy B; Kelly, Jennifer A; Kariuki, Silvia N; Franek, Beverly S; Kumar, Akaash A; Kaufman, Kenneth M; Thomas, Kenaz; Walker, Daniel; Kamp, Stan; Frost, Jacqueline M; Wong, Andrew K; Merrill, Joan T; Alarcón-Riquelme, Marta E; Tikly, Mohammed; Ramsey-Goldman, Rosalind; Reveille, John D; Petri, Michelle A; Edberg, Jeffrey C; Kimberly, Robert P; Alarcón, Graciela S; Kamen, Diane L; Gilkeson, Gary S; Vyse, Timothy J; James, Judith A; Gaffney, Patrick M; Moser, Kathy L; Crow, Mary K; Harley, John B

2012-01-01

Objective High serum interferon α (IFNα) activity is a heritable risk factor for systemic lupus erythematosus (SLE). Auto-antibodies found in SLE form immune complexes which can stimulate IFNα production by activating endosomal Toll-like receptors and interferon regulatory factors (IRFs), including IRF5. Genetic variation in IRF5 is associated with SLE susceptibility; however, it is unclear how IRF5 functional genetic elements contribute to human disease. Methods 1034 patients with SLE and 989 controls of European ancestry, 555 patients with SLE and 679 controls of African–American ancestry, and 73 patients with SLE of South African ancestry were genotyped at IRF5 polymorphisms, which define major haplotypes. Serum IFNα activity was measured using a functional assay. Results In European ancestry subjects, anti-double-stranded DNA (dsDNA) and anti-Ro antibodies were each associated with different haplotypes characterised by a different combination of functional genetic elements (OR > 2.56, p >003C; 1.9×10−14 for both). These IRF5 haplotype-auto-antibody associations strongly predicted higher serum IFNα in patients with SLE and explained > 70% of the genetic risk of SLE due to IRF5. In African–American patients with SLE a similar relationship between serology and IFNα was observed, although the previously described European ancestry-risk haplotype was present at admixture proportions in African–American subjects and absent in African patients with SLE. Conclusions The authors define a novel risk haplotype of IRF5 that is associated with anti-dsDNA antibodies and show that risk of SLE due to IRF5 genotype is largely dependent upon particular auto-antibodies. This suggests that auto-antibodies are directly pathogenic in human SLE, resulting in increased IFNα in cooperation with particular combinations of IRF5 functional genetic elements. SLE is a systemic autoimmune disorder affecting multiple organ systems including the skin, musculoskeletal, renal and haematopoietic systems. Humoral autoimmunity is a hallmark of SLE, and patients frequently have circulating auto-antibodies directed against dsDNA, as well as RNA binding proteins (RBP). Anti-RBP autoantibodies include antibodies which recognize Ro, La, Smith (anti-Sm), and ribonucleoprotein (anti-nRNP), collectively referred to as anti-retinol-binding protein). Anti-retinol-binding protein and anti-dsDNA auto-antibodies are rare in the healthy population.1 These auto-antibodies can be present in sera for years preceding the onset of clinical SLE illness2 and are likely pathogenic in SLE.34 PMID:22088620
De Novo Assembly and Phasing of Dikaryotic Genomes from Two Isolates of Puccinia coronata f. sp. avenae, the Causal Agent of Oat Crown Rust.

PubMed

Miller, Marisa E; Zhang, Ying; Omidvar, Vahid; Sperschneider, Jana; Schwessinger, Benjamin; Raley, Castle; Palmer, Jonathan M; Garnica, Diana; Upadhyaya, Narayana; Rathjen, John; Taylor, Jennifer M; Park, Robert F; Dodds, Peter N; Hirsch, Cory D; Kianian, Shahryar F; Figueroa, Melania

2018-02-20

Oat crown rust, caused by the fungus Pucinnia coronata f. sp. avenae , is a devastating disease that impacts worldwide oat production. For much of its life cycle, P. coronata f. sp. avenae is dikaryotic, with two separate haploid nuclei that may vary in virulence genotype, highlighting the importance of understanding haplotype diversity in this species. We generated highly contiguous de novo genome assemblies of two P. coronata f. sp. avenae isolates, 12SD80 and 12NC29, from long-read sequences. In total, we assembled 603 primary contigs for 12SD80, for a total assembly length of 99.16 Mbp, and 777 primary contigs for 12NC29, for a total length of 105.25 Mbp; approximately 52% of each genome was assembled into alternate haplotypes. This revealed structural variation between haplotypes in each isolate equivalent to more than 2% of the genome size, in addition to about 260,000 and 380,000 heterozygous single-nucleotide polymorphisms in 12SD80 and 12NC29, respectively. Transcript-based annotation identified 26,796 and 28,801 coding sequences for isolates 12SD80 and 12NC29, respectively, including about 7,000 allele pairs in haplotype-phased regions. Furthermore, expression profiling revealed clusters of coexpressed secreted effector candidates, and the majority of orthologous effectors between isolates showed conservation of expression patterns. However, a small subset of orthologs showed divergence in expression, which may contribute to differences in virulence between 12SD80 and 12NC29. This study provides the first haplotype-phased reference genome for a dikaryotic rust fungus as a foundation for future studies into virulence mechanisms in P. coronata f. sp. avenae IMPORTANCE Disease management strategies for oat crown rust are challenged by the rapid evolution of Puccinia coronata f. sp. avenae , which renders resistance genes in oat varieties ineffective. Despite the economic importance of understanding P. coronata f. sp. avenae , resources to study the molecular mechanisms underpinning pathogenicity and the emergence of new virulence traits are lacking. Such limitations are partly due to the obligate biotrophic lifestyle of P. coronata f. sp. avenae as well as the dikaryotic nature of the genome, features that are also shared with other important rust pathogens. This study reports the first release of a haplotype-phased genome assembly for a dikaryotic fungal species and demonstrates the amenability of using emerging technologies to investigate genetic diversity in populations of P. coronata f. sp. avenae . Copyright © 2018 Miller et al.
A Sediment Testing Reference Area Database for the San Francisco Deep Ocean Disposal Site (SF-DODS)

EPA Pesticide Factsheets

EPA established and maintains a SF-DODS reference area database of previously-collected sediment test data. Several sets of sediment test data have been successfully collected from the SF-DODS reference area.
Genetic study of KIR and HLA ligands in 235 individuals from Northeastern Thailand.

PubMed

Chaisri, Suwit; Leelayuwat, Chanvit; Romphruk, Amornrat

The diversity of 17 KIR and HLA ligands (HLA-C1, C2, Bw4, A11) were investigated in two hundred and thirty-five unrelated healthy donors in Northeastern Thais (NETs) by the polymerase chain reaction with sequence-specific primer (PCR-SSP) method. The Hardy-Weinberg Equilibrium (HWE) was used to verify genotyping method for dimorphic KIR and HLA. They were in HWE (p>0.05). KIR and HLA ligands frequencies, genotypes, haplotypes and linkage disequilibrium (LD) were presented. The genetic data are available in allele Frequencies Net Database. Copyright © 2017. Published by Elsevier Inc.
NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins

PubMed Central

Pruitt, Kim D.; Tatusova, Tatiana; Maglott, Donna R.

2005-01-01

The National Center for Biotechnology Information (NCBI) Reference Sequence (RefSeq) database (http://www.ncbi.nlm.nih.gov/RefSeq/) provides a non-redundant collection of sequences representing genomic data, transcripts and proteins. Although the goal is to provide a comprehensive dataset representing the complete sequence information for any given species, the database pragmatically includes sequence data that are currently publicly available in the archival databases. The database incorporates data from over 2400 organisms and includes over one million proteins representing significant taxonomic diversity spanning prokaryotes, eukaryotes and viruses. Nucleotide and protein sequences are explicitly linked, and the sequences are linked to other resources including the NCBI Map Viewer and Gene. Sequences are annotated to include coding regions, conserved domains, variation, references, names, database cross-references, and other features using a combined approach of collaboration and other input from the scientific community, automated annotation, propagation from GenBank and curation by NCBI staff. PMID:15608248
Cis-acting mutation and duplication: History of molecular evolution in a P450 haplotype responsible for insecticide resistance in Culex quinquefasciatus.

PubMed

Itokawa, Kentaro; Komagata, Osamu; Kasai, Shinji; Masada, Masahiro; Tomita, Takashi

2011-07-01

A cytochrome P450 gene, Cyp9m10, is more than 200-fold overexpressed in a pyrethroid resistant strain of Culex quinquefasciatus, JPal-per. The haplotype of this strain contains two copies of Cyp9m10 resulted from recent tandem duplication. In this study, we discovered and isolated a Cyp9m10 haplotype closely related to this duplicated Cyp9m10 haplotype from JHB, a strain used for the recent genome project for this mosquito species. The isolated haplotype (JHB-NIID-B haplotype) shared the same insertion of a transposable element upstream of the coding region with JPal-per strain but not duplicated. The JHB-NIID-B haplotype was considered to have diverged from the JPal-per lineage just before the duplication event. Cyp9m10 was moderately overexpressed in larvae with the JHB-NIID-B haplotype. The overexpressions in JHB-NIID-B and JPal-per haplotypes were developmentally regulated in similar pattern indicating both haplotypes share a common cis-acting mutation responsible for the overexpressions. The isolated moderately overexpressed haplotype conferred resistance, however, its efficacy was relatively small. We hypothesized that the first cis-acting mutation modified the consequence of the subsequent duplication in JPal-per lineage to confer stronger phenotypic effect than that if it occurred before the first cis-acting mutation. Copyright © 2011 Elsevier Ltd. All rights reserved.
A Candidate Trans-acting Modulator of Fetal Hemoglobin Gene Expression in the Arab-Indian Haplotype of Sickle Cell Anemia

PubMed Central

Vathipadiekal, Vinod; Farrell, John J.; Wang, Shuai; Edward, Heather L.; Shappell, Heather; Al-Rubaish, A.M.; Al-Muhanna, Fahad; Naserullah, Z.; Alsuliman, A.; Qutub, Hatem Othman; Simkin, Irene; Farrer, Lindsay A.; Jiang, Zhihua; Luo, Hong-Yuan; Huang, Shengwen; Mostoslavsky, Gustavo; Murphy, George J.; Patra, Pradeep.K.; Chui, David H.K.; Alsultan, Abdulrahman; Al-Ali, Amein K.; Sebastiani, Paola.; Steinberg, Martin. H.

2016-01-01

Fetal hemoglobin (HbF) levels are higher in the Arab-Indian (AI) β-globin gene haplotype of sickle cell anemia compared with African-origin haplotypes. To study genetic elements that effect HbF expression in the AI haplotype we completed whole genome sequencing in 14 Saudi AI haplotype sickle hemoglobin homozygotes—seven selected for low HbF (8.2±1.3%) and seven selected for high HbF (23.5±.2.6%). An intronic single nucleotide polymorphism (SNP) in ANTXR1, an anthrax toxin receptor (chromosome 2p13), was associated with HbF. These results were replicated in two independent Saudi AI haplotype cohorts of 120 and 139 patients, but not in 76 Saudi Benin haplotype, 894 African origin haplotype and 44 Arab Indian haplotype patients of Indian descent, suggesting that this association is effective only in the Saudi AI haplotype background. ANTXR1 variants explained 10% of the HbF variability compared with 8% for BCL11A. These two genes had independent, additive effects on HbF and together explained about 15% of HbF variability in Saudi AI sickle cell anemia patients. ANTXR1 was expressed at mRNA and protein levels in erythroid progenitors derived from induced pluripotent stem cells (iPSCs) and CD34+ cells. As CD34+ cells matured and their HbF decreased ANTXR1 expression increased; as iPSCs differentiated and their HbF increased, ANTXR1 expression decreased. Along with elements in cis to the HbF genes, ANTXR1 contributes to the variation in HbF in Saudi AI haplotype sickle cell anemia and is the first gene in trans to HBB that is associated with HbF only in carriers of the Saudi AI haplotype. PMID:27501013
Complement factor H polymorphisms in Japanese population with age-related macular degeneration.

PubMed

Okamoto, Haru; Umeda, Shinsuke; Obazawa, Minoru; Minami, Masayoshi; Noda, Toru; Mizota, Atsushi; Honda, Miki; Tanaka, Minoru; Koyama, Risa; Takagi, Ikue; Sakamoto, Yoshihiro; Saito, Yoshihiro; Miyake, Yozo; Iwata, Takeshi

2006-03-06

To study the frequency of five haplotypes previously reported in the complement factor H (CFH) gene for Japanese patients with age-related macular degeneration (AMD). Genomic DNA was isolated from peripheral blood samples taken from 96 Japanese AMD patients and 89 age-matched controls. All patients were diagnosed as having exudative (wet-type) AMD. The amplified polymerase chain reaction (PCR) products of CFH exons 2, 9, and 13, and intron 6 were analyzed by temperature gradient capillary electrophoresis (TGCE) and by direct sequencing. The haplotypes were identified, and their frequencies were calculated and compared with reported results. Five haplotypes were identified in the Japanese population including four already reported in the American population. The frequencies of these haplotypes were significantly different between Japanese and American in both control and case groups. The haplotype containing Y402H, which was previously reported to be associated with AMD, was only 4% in the control and case population, with a p value of 0.802. However, two other haplotypes were found as risk factors, which gave an increased likelihood of AMD of 1.9 and 2.5 fold (95% CI 1.12-3.69 and 1.42-6.38). One protective haplotype that decreased the likelihood of AMD by 1.6 fold (95% CI 0.26-0.67) was identified. The frequencies for five haplotypes previously identified were analyzed in a Japanese population with AMD. Four previously found haplotypes were identified and one additional haplotype was found. The frequencies of each haplotype were significantly different from that in found Americans affected with AMD. Two of the haplotypes were identified as risk factors and one was considered protective.
HaploForge: a comprehensive pedigree drawing and haplotype visualization web application.

PubMed

Tekman, Mehmet; Medlar, Alan; Mozere, Monika; Kleta, Robert; Stanescu, Horia

2017-12-15

Haplotype reconstruction is an important tool for understanding the aetiology of human disease. Haplotyping infers the most likely phase of observed genotypes conditional on constraints imposed by the genotypes of other pedigree members. The results of haplotype reconstruction, when visualized appropriately, show which alleles are identical by descent despite the presence of untyped individuals. When used in concert with linkage analysis, haplotyping can help delineate a locus of interest and provide a succinct explanation for the transmission of the trait locus. Unfortunately, the design choices made by existing haplotype visualization programs do not scale to large numbers of markers. Indeed, following haplotypes from generation to generation requires excessive scrolling back and forth. In addition, the most widely used program for haplotype visualization produces inconsistent recombination artefacts for the X chromosome. To resolve these issues, we developed HaploForge, a novel web application for haplotype visualization and pedigree drawing. HaploForge takes advantage of HTML5 to be fast, portable and avoid the need for local installation. It can accurately visualize autosomal and X-linked haplotypes from both outbred and consanguineous pedigrees. Haplotypes are coloured based on identity by descent using a novel A* search algorithm and we provide a flexible viewing mode to aid visual inspection. HaploForge can currently process haplotype reconstruction output from Allegro, GeneHunter, Merlin and Simwalk. HaploForge is licensed under GPLv3 and is hosted and maintained via GitHub. https://github.com/mtekman/haploforge. r.kleta@ucl.ac.uk. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
A reference system for animal biometrics: application to the northern leopard frog

USGS Publications Warehouse

Petrovska-Delacretaz, D.; Edwards, A.; Chiasson, J.; Chollet, G.; Pilliod, D.S.

2014-01-01

Reference systems and public databases are available for human biometrics, but to our knowledge nothing is available for animal biometrics. This is surprising because animals are not required to give their agreement to be in a database. This paper proposes a reference system and database for the northern leopard frog (Lithobates pipiens). Both are available for reproducible experiments. Results of both open set and closed set experiments are given.
Global genetic diversity of the Plasmodium vivax transmission-blocking vaccine candidate Pvs48/45.

PubMed

Vallejo, Andres F; Martinez, Nora L; Tobon, Alejandra; Alger, Jackeline; Lacerda, Marcus V; Kajava, Andrey V; Arévalo-Herrera, Myriam; Herrera, Sócrates

2016-04-12

Plasmodium vivax 48/45 protein is expressed on the surface of gametocytes/gametes and plays a key role in gamete fusion during fertilization. This protein was recently expressed in Escherichia coli host as a recombinant product that was highly immunogenic in mice and monkeys and induced antibodies with high transmission-blocking activity, suggesting its potential as a P. vivax transmission-blocking vaccine candidate. To determine sequence polymorphism of natural parasite isolates and its potential influence on the protein structure, all pvs48/45 sequences reported in databases from around the world as well as those from low-transmission settings of Latin America were compared. Plasmodium vivax parasite isolates from malaria-endemic regions of Colombia, Brazil and Honduras (n = 60) were used to sequence the Pvs48/45 gene, and compared to those previously reported to GenBank and PlasmoDB (n = 222). Pvs48/45 gene haplotypes were analysed to determine the functional significance of genetic variation in protein structure and vaccine potential. Nine non-synonymous substitutions (E35K, Y196H, H211N, K250N, D335Y, E353Q, A376T, K390T, K418R) and three synonymous substitutions (I73, T149, C156) that define seven different haplotypes were found among the 282 isolates from nine countries when compared with the Sal I reference sequence. Nucleotide diversity (π) was 0.00173 for worldwide samples (range 0.00033-0.00216), resulting in relatively high diversity in Myanmar and Colombia, and low diversity in Mexico, Peru and South Korea. The two most frequent substitutions (E353Q: 41.9 %, K250N: 39.5 %) were predicted to be located in antigenic regions without affecting putative B cell epitopes or the tertiary protein structure. There is limited sequence polymorphism in pvs48/45 with noted geographical clustering among Asian and American isolates. The low genetic diversity of the protein does not influence the predicted antigenicity or protein structure and, therefore, supports its further development as transmission-blocking vaccine candidate.
PHYTOTOX: DATABASE DEALING WITH THE EFFECT OF ORGANIC CHEMICALS ON TERRESTRIAL VASCULAR PLANTS

EPA Science Inventory

A new database, PHYTOTOX, dealing with the direct effects of exogenously supplied organic chemicals on terrestrial vascular plants is described. The database consists of two files, a Reference File and Effects File. The Reference File is a bibliographic file of published research...
Transcriptome de novo assembly from next-generation sequencing and comparative analyses in the hexaploid salt marsh species Spartina maritima and Spartina alterniflora (Poaceae)

PubMed Central

Ferreira de Carvalho, J; Poulain, J; Da Silva, C; Wincker, P; Michon-Coudouel, S; Dheilly, A; Naquin, D; Boutte, J; Salmon, A; Ainouche, M

2013-01-01

Spartina species have a critical ecological role in salt marshes and represent an excellent system to investigate recurrent polyploid speciation. Using the 454 GS-FLX pyrosequencer, we assembled and annotated the first reference transcriptome (from roots and leaves) for two related hexaploid Spartina species that hybridize in Western Europe, the East American invasive Spartina alterniflora and the Euro-African S. maritima. The de novo read assembly generated 38 478 consensus sequences and 99% found an annotation using Poaceae databases, representing a total of 16 753 non-redundant genes. Spartina expressed sequence tags were mapped onto the Sorghum bicolor genome, where they were distributed among the subtelomeric arms of the 10 S. bicolor chromosomes, with high gene density correlation. Normalization of the complementary DNA library improved the number of annotated genes. Ecologically relevant genes were identified among GO biological function categories in salt and heavy metal stress response, C4 photosynthesis and in lignin and cellulose metabolism. Expression of some of these genes had been found to be altered by hybridization and genome duplication in a previous microarray-based study in Spartina. As these species are hexaploid, up to three duplicated homoeologs may be expected per locus. When analyzing sequence polymorphism at four different loci in S. maritima and S. alterniflora, we found up to four haplotypes per locus, suggesting the presence of two expressed homoeologous sequences with one or two allelic variants each. This reference transcriptome will allow analysis of specific Spartina genes of ecological or evolutionary interest, estimation of homoeologous gene expression variation using RNA-seq and further gene expression evolution analyses in natural populations. PMID:23149455
Genetic signature analysis of Perkinsus marinus in Mexico suggests possible translocation from the Atlantic Ocean to the Pacific coast of Mexico.

PubMed

Ek-Huchim, Juan Pablo; Aguirre-Macedo, Ma Leopoldina; Améndola-Pimenta, Monica; Vidal-Martínez, Victor Manuel; Pérez-Vega, Juan Antonio; Simá-Alvarez, Raúl; Jiménez-García, Isabel; Zamora-Bustillos, Roberto; Rodríguez-Canul, Rossanna

2017-08-02

The protozoan Perkinsus marinus (Mackin, Owen & Collier) Levine, 1978 causes perkinsosis in the American oyster Crassostrea virginica Gmelin, 1791. This pathogen is present in cultured C. virginica from the Gulf of Mexico and has been reported recently in Saccostrea palmula (Carpenter, 1857), Crassostrea corteziensis (Hertlein, 1951) and Crassostrea gigas (Thunberg, 1793) from the Mexican Pacific coast. Transportation of fresh oysters for human consumption and repopulation could be implicated in the transmission and dissemination of this parasite across the Mexican Pacific coast. The aim of this study was two-fold. First, we evaluated the P. marinus infection parameters by PCR and RFTM (Ray's fluid thioglycollate medium) in C. virginica from four major lagoons (Términos Lagoon, Campeche; Carmen-Pajonal-Machona Lagoon complex, Tabasco; Mandinga Lagoon, Veracruz; and La Pesca Lagoon, Tamaulipas) from the Gulf of Mexico. Secondly, we used DNA sequence analyses of the ribosomal non-transcribed spacer (rNTS) region of P. marinus to determine the possible translocation of this species from the Gulf of Mexico to the Mexican Pacific coast. Perkinsus marinus prevalence by PCR was 57.7% (338 out of 586 oysters) and 38.2% (224 out of 586 oysters) by RFTM. The highest prevalence was observed in the Carmen-Pajonal-Machona Lagoon complex in the state of Tabasco (73% by PCR and 58% by RFTM) and the estimated weighted prevalence (WP) was less than 1.0 in the four lagoons. Ten unique rDNA-NTS sequences of P. marinus [termed herein the "P. marinus (Pm) haplotype"] were identified in the Gulf of Mexico sample. They shared 96-100% similarity with 18 rDNA-NTS sequences from the GenBank database which were derived from 16 Mexican Pacific coast infections and two sequences from the USA. The phylogenetic tree and the haplotype network showed that the P. marinus rDNA-NTS sequences from Mexico were distant from the rDNA-NTS sequences of P. marinus reported from the USA. The ten rDNA-NTS sequences described herein were restricted to specific locations displaying different geographical connections within the Gulf of Mexico; the Carmen-Pajonal-Machona Pm1 haplotype from the state of Tabasco shared a cluster with P. marinus isolates reported from the Mexican Pacific coast. The rDNA-NTS sequences of P. marinus from the state of Tabasco shared high similarity with the reference rDNA-NTS sequences from the Mexican Pacific coast. The high similarity suggests a transfer of oysters infected with P. marinus from the Mexican part of the Gulf of Mexico into the Mexican Pacific coast.
Haplotype-Based Genotyping in Polyploids.

PubMed

Clevenger, Josh P; Korani, Walid; Ozias-Akins, Peggy; Jackson, Scott

2018-01-01

Accurate identification of polymorphisms from sequence data is crucial to unlocking the potential of high throughput sequencing for genomics. Single nucleotide polymorphisms (SNPs) are difficult to accurately identify in polyploid crops due to the duplicative nature of polyploid genomes leading to low confidence in the true alignment of short reads. Implementing a haplotype-based method in contrasting subgenome-specific sequences leads to higher accuracy of SNP identification in polyploids. To test this method, a large-scale 48K SNP array (Axiom Arachis2) was developed for Arachis hypogaea (peanut), an allotetraploid, in which 1,674 haplotype-based SNPs were included. Results of the array show that 74% of the haplotype-based SNP markers could be validated, which is considerably higher than previous methods used for peanut. The haplotype method has been implemented in a standalone program, HAPLOSWEEP, which takes as input bam files and a vcf file and identifies haplotype-based markers. Haplotype discovery can be made within single reads or span paired reads, and can leverage long read technology by targeting any length of haplotype. Haplotype-based genotyping is applicable in all allopolyploid genomes and provides confidence in marker identification and in silico-based genotyping for polyploid genomics.
Identification of parental line specific effects of MLF2 on resistance to coccidiosis in chickens

PubMed Central

2011-01-01

Background MLF2 was the candidate gene associated with coccidiosis resistance in chickens. Although single marker analysis supported the association between MLF2 and coccidiosis resistance, causative mutation relevant to coccidiosis was not identified yet. Thus, this study suggested segregation analysis of MLF2 haplotype and the association test of the other candidate genes using improved data transformation. Results A haplotype probably originated from one parental line was found out of 4 major haplotypes of MLF2. Frequency of this haplotype was 0.2 in parental chickens and its offspring in 12 families. Allele substitution effect of the MLF2 haplotype originated from a specific line was associated with increased body weight and fecal egg count explaining coccidiosis resistance. Nevertheless Box-Cox transformation was able to improve normality; association test did not produce obvious different results compared with analysis with log transformed phenotype. Conclusion Allele substitution effect analysis and classification of MLF2 haplotype identified the segregation of haplotype associated with coccidiosis resistance. The haplotype originated from a specific parental line was associated with improving disease resistance. Estimating effect of MLF2 haplotype on coccidiosis resistance will provide useful information for selecting animals or lines for future study. PMID:21645301

Haplotype diversity in 11 candidate genes across four populations.

PubMed

Beaty, T H; Fallin, M D; Hetmanski, J B; McIntosh, I; Chong, S S; Ingersoll, R; Sheng, X; Chakraborty, R; Scott, A F

2005-09-01

Analysis of haplotypes based on multiple single-nucleotide polymorphisms (SNP) is becoming common for both candidate gene and fine-mapping studies. Before embarking on studies of haplotypes from genetically distinct populations, however, it is important to consider variation both in linkage disequilibrium (LD) and in haplotype frequencies within and across populations, as both vary. Such diversity will influence the choice of "tagging" SNPs for candidate gene or whole-genome association studies because some markers will not be polymorphic in all samples and some haplotypes will be poorly represented or completely absent. Here we analyze 11 genes, originally chosen as candidate genes for oral clefts, where multiple markers were genotyped on individuals from four populations. Estimated haplotype frequencies, measures of pairwise LD, and genetic diversity were computed for 135 European-Americans, 57 Chinese-Singaporeans, 45 Malay-Singaporeans, and 46 Indian-Singaporeans. Patterns of pairwise LD were compared across these four populations and haplotype frequencies were used to assess genetic variation. Although these populations are fairly similar in allele frequencies and overall patterns of LD, both haplotype frequencies and genetic diversity varied significantly across populations. Such haplotype diversity has implications for designing studies of association involving samples from genetically distinct populations.
Online Reference Service--How to Begin: A Selected Bibliography.

ERIC Educational Resources Information Center

Shroder, Emelie J., Ed.

1982-01-01

Materials in this bibliography were selected and recommended by members of the Use of Machine-Assisted Reference in Public Libraries Committee, Reference and Adult Services Division, American Library Association. Topics include: financial aspects, equipment and communications considerations, comparing databases and database systems, advertising…
Detecting structure of haplotypes and local ancestry

USDA-ARS?s Scientific Manuscript database

We present a two-layer hidden Markov model to detect the structure of haplotypes for unrelated individuals. This allows us to model two scales of linkage disequilibrium (one within a group of haplotypes and one between groups), thereby taking advantage of rich haplotype information to infer local an...
Electronic Reference Library: Silverplatter's Database Networking Solution.

ERIC Educational Resources Information Center

Millea, Megan

Silverplatter's Electronic Reference Library (ERL) provides wide area network access to its databases using TCP/IP communications and client-server architecture. ERL has two main components: The ERL clients (retrieval interface) and the ERL server (search engines). ERL clients provide patrons with seamless access to multiple databases on multiple…
Reconstruction of Haplotype-Blocks Selected during Experimental Evolution.

PubMed

Franssen, Susanne U; Barton, Nicholas H; Schlötterer, Christian

2017-01-01

The genetic analysis of experimentally evolving populations typically relies on short reads from pooled individuals (Pool-Seq). While this method provides reliable allele frequency estimates, the underlying haplotype structure remains poorly characterized. With small population sizes and adaptive variants that start from low frequencies, the interpretation of selection signatures in most Evolve and Resequencing studies remains challenging. To facilitate the characterization of selection targets, we propose a new approach that reconstructs selected haplotypes from replicated time series, using Pool-Seq data. We identify selected haplotypes through the correlated frequencies of alleles carried by them. Computer simulations indicate that selected haplotype-blocks of several Mb can be reconstructed with high confidence and low error rates, even when allele frequencies change only by 20% across three replicates. Applying this method to real data from D. melanogaster populations adapting to a hot environment, we identify a selected haplotype-block of 6.93 Mb. We confirm the presence of this haplotype-block in evolved populations by experimental haplotyping, demonstrating the power and accuracy of our haplotype reconstruction from Pool-Seq data. We propose that the combination of allele frequency estimates with haplotype information will provide the key to understanding the dynamics of adaptive alleles. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Native and European haplotypes of Phragmites Australis (common reed) in the central Platte River, Nebraska

USGS Publications Warehouse

Larson, D.L.; Galatowitsch, S.M.; Larson, J.L.

2011-01-01

Phragmites australis (common reed) is known to have occurred along the Platte River historically, but recent rapid increases in both distribution and density have begun to impact habitat for migrating sandhill cranes and nesting piping plovers and least terns. Invasiveness in Phragmites has been associated with the incursion of a European genotype (haplotype M) in other areas; determining the genotype of Phragmites along the central Platte River has implications for proper management of the river system. In 2008 we sampled Phragmites patches along the central Platte River from Lexington to Chapman, NE, stratified by bridge segments, to determine the current distribution of haplotype E (native) and haplotype M genotypes. In addition, we did a retrospective analysis of historical Phragmites collections from the central Platte watershed (1902-2006) at the Bessey Herbarium. Fresh tissue from the 2008 survey and dried tissue from the herbarium specimens were classified as haplotype M or E using the restriction fragment length polymorphism procedure. The European haplotype was predominant in the 2008 samples: only 14 Phragmites shoots were identified as native haplotype E; 224 were non-native haplotype M. The retrospective analysis revealed primarily native haplotype individuals. Only collections made in Lancaster County, near Lincoln, NE, were haplotype M, and the earliest of these was collected in 1973. ?? 2011 Copyright by the Center for Great Plains Studies, University of Nebraska-Lincoln.
Haplotyping for disease association: a combinatorial approach.

PubMed

Lancia, Giuseppe; Ravi, R; Rizzi, Romeo

2008-01-01

We consider a combinatorial problem derived from haplotyping a population with respect to a genetic disease, either recessive or dominant. Given a set of individuals, partitioned into healthy and diseased, and the corresponding sets of genotypes, we want to infer "bad'' and "good'' haplotypes to account for these genotypes and for the disease. Assume e.g. the disease is recessive. Then, the resolving haplotypes must consist of bad and good haplotypes, so that (i) each genotype belonging to a diseased individual is explained by a pair of bad haplotypes and (ii) each genotype belonging to a healthy individual is explained by a pair of haplotypes of which at least one is good. We prove that the associated decision problem is NP-complete. However, we also prove that there is a simple solution, provided the data satisfy a very weak requirement.
A Circular Dichroism Reference Database for Membrane Proteins

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wallace,B.; Wien, F.; Stone, T.

2006-01-01

Membrane proteins are a major product of most genomes and the target of a large number of current pharmaceuticals, yet little information exists on their structures because of the difficulty of crystallising them; hence for the most part they have been excluded from structural genomics programme targets. Furthermore, even methods such as circular dichroism (CD) spectroscopy which seek to define secondary structure have not been fully exploited because of technical limitations to their interpretation for membrane embedded proteins. Empirical analyses of circular dichroism (CD) spectra are valuable for providing information on secondary structures of proteins. However, the accuracy of themore » results depends on the appropriateness of the reference databases used in the analyses. Membrane proteins have different spectral characteristics than do soluble proteins as a result of the low dielectric constants of membrane bilayers relative to those of aqueous solutions (Chen & Wallace (1997) Biophys. Chem. 65:65-74). To date, no CD reference database exists exclusively for the analysis of membrane proteins, and hence empirical analyses based on current reference databases derived from soluble proteins are not adequate for accurate analyses of membrane protein secondary structures (Wallace et al (2003) Prot. Sci. 12:875-884). We have therefore created a new reference database of CD spectra of integral membrane proteins whose crystal structures have been determined. To date it contains more than 20 proteins, and spans the range of secondary structures from mostly helical to mostly sheet proteins. This reference database should enable more accurate secondary structure determinations of membrane embedded proteins and will become one of the reference database options in the CD calculation server DICHROWEB (Whitmore & Wallace (2004) NAR 32:W668-673).« less
Haplotype assembly in polyploid genomes and identical by descent shared tracts.

PubMed

Aguiar, Derek; Istrail, Sorin

2013-07-01

Genome-wide haplotype reconstruction from sequence data, or haplotype assembly, is at the center of major challenges in molecular biology and life sciences. For complex eukaryotic organisms like humans, the genome is vast and the population samples are growing so rapidly that algorithms processing high-throughput sequencing data must scale favorably in terms of both accuracy and computational efficiency. Furthermore, current models and methodologies for haplotype assembly (i) do not consider individuals sharing haplotypes jointly, which reduces the size and accuracy of assembled haplotypes, and (ii) are unable to model genomes having more than two sets of homologous chromosomes (polyploidy). Polyploid organisms are increasingly becoming the target of many research groups interested in the genomics of disease, phylogenetics, botany and evolution but there is an absence of theory and methods for polyploid haplotype reconstruction. In this work, we present a number of results, extensions and generalizations of compass graphs and our HapCompass framework. We prove the theoretical complexity of two haplotype assembly optimizations, thereby motivating the use of heuristics. Furthermore, we present graph theory-based algorithms for the problem of haplotype assembly using our previously developed HapCompass framework for (i) novel implementations of haplotype assembly optimizations (minimum error correction), (ii) assembly of a pair of individuals sharing a haplotype tract identical by descent and (iii) assembly of polyploid genomes. We evaluate our methods on 1000 Genomes Project, Pacific Biosciences and simulated sequence data. HapCompass is available for download at http://www.brown.edu/Research/Istrail_Lab/. Supplementary data are available at Bioinformatics online.
MGMT DNA repair gene promoter/enhancer haplotypes alter transcription factor binding and gene expression.

PubMed

Xu, Meixiang; Cross, Courtney E; Speidel, Jordan T; Abdel-Rahman, Sherif Z

2016-10-01

The O 6 -methylguanine-DNA methyltransferase (MGMT) protein removes O 6 -alkyl-guanine adducts from DNA. MGMT expression can thus alter the sensitivity of cells and tissues to environmental and chemotherapeutic alkylating agents. Previously, we defined the haplotype structure encompassing single nucleotide polymorphisms (SNPs) in the MGMT promoter/enhancer (P/E) region and found that haplotypes, rather than individual SNPs, alter MGMT promoter activity. The exact mechanism(s) by which these haplotypes exert their effect on MGMT promoter activity is currently unknown, but we noted that many of the SNPs comprising the MGMT P/E haplotypes are located within or in close proximity to putative transcription factor binding sites. Thus, these haplotypes could potentially affect transcription factor binding and, subsequently, alter MGMT promoter activity. In this study, we test the hypothesis that MGMT P/E haplotypes affect MGMT promoter activity by altering transcription factor (TF) binding to the P/E region. We used a promoter binding TF profiling array and a reporter assay to evaluate the effect of different P/E haplotypes on TF binding and MGMT expression, respectively. Our data revealed a significant difference in TF binding profiles between the different haplotypes evaluated. We identified TFs that consistently showed significant haplotype-dependent binding alterations (p ≤ 0.01) and revealed their role in regulating MGMT expression using siRNAs and a dual-luciferase reporter assay system. The data generated support our hypothesis that promoter haplotypes alter the binding of TFs to the MGMT P/E and, subsequently, affect their regulatory function on MGMT promoter activity and expression level.
Phylogenetic status of brown trout Salmo trutta populations in five rivers from the southern Caspian Sea and two inland lake basins, Iran: a morphogenetic approach.

PubMed

Hashemzadeh Segherloo, I; Farahmand, H; Abdoli, A; Bernatchez, L; Primmer, C R; Swatdipong, A; Karami, M; Khalili, B

2012-10-01

Interrelationships, origin and phylogenetic affinities of brown trout Salmo trutta populations from the southern Caspian Sea basin, Orumieh and Namak Lake basins in Iran were analysed from complete mtDNA control region sequences, 12 microsatellite loci and morphological characters. Among 129 specimens from six populations, seven haplotypes were observed. Based on mtDNA haplotype data, the Orumieh and southern Caspian populations did not differ significantly, but the Namak basin-Karaj population presented a unique haplotype closely related to the haplotypes of the other populations (0·1% Kimura two-parameter, K2P divergence). All Iranian haplotypes clustered as a distinct group within the Danube phylogenetic grouping, with an average K2P distance of 0·41% relative to other Danubian haplotypes. The Karaj haplotype in the Namak basin was related to a haplotype (Da26) formerly identified in the Tigris basin in Turkey, to a Salmo trutta oxianus haplotype from the Aral Sea basin, and to haplotype Da1a with two mutational steps, as well as to other Iranian haplotypes with one to two mutational steps, which may indicate a centre of origin in the Caspian basin. In contrast to results of the mtDNA analysis, more pronounced differentiation was observed among the populations studied in the morphological and microsatellite DNA data, except for the two populations from the Orumieh basin, which were similar, possibly due to anthropogenic causes. © 2012 The Authors. Journal of Fish Biology © 2012 The Fisheries Society of the British Isles.
A Comprehensive Molecular Investigation of α-Thalassemia in an Iranian Cohort from Different Provinces of North Iran.

PubMed

Eftekhari, Hajar; Tamaddoni, Ahmad; Mahmoudi Nesheli, Hassan; Vakili, Mohsen; Sedaghat, Sadegh; Banihashemi, Ali; Azizi, Mandana; Youssefi Kamangar, Reza; Akhavan-Niaki, Haleh

2017-01-01

α-Thalassemia (α-thal) is the most common monogenic disease that is caused by the absence or reduced expression of α-globin genes. The aim of this study was to investigate common α-globin mutations and their associated haplotypes in four northern provinces of Iran (Gilan, Mazandaran, Golestan, Khorasan). One thousand, one hundred and ninety-one persons were tested for α-thal mutations by gap-polymerase chain reaction (PCR), reverse dot-blot hybridization, restriction fragment length polymorphism (RFLP) analysis and sequencing. Of the nine different mutations found, the most frequent were -α 3.7 (rightward deletion) (45.6%), polyadenylation site (α p ° lyA2 α) (α2) (AATAAA>AATGAA; HBA2: c.*92 A>G) (15.27%), - - MED (Mediterranean deletion) (6.86%), -α 4.2 (leftward deletion), (6.17%), α CS α [Hb Constant Spring (Hb CS) (HBA2: c.427 T>C)] (4.62%), -α -5 nt (HBA2: c.95+2_95+6delTGAGG) (3.70%). All chromosomes bearing an α-globin point mutation [α p ° lyA2 α, -α -5 nt α, α CS α, α p ° lyA1 α (AATAAA> AATAAG; HBA2: c.*94 A>G)] showed only one haplotype that was present in most normal chromosomes, while the -α 3.7 deletion was associated with three distinct haplotypes. Our results indicate that α-thal mutations are heterogeneous and -α 3.7 and α p ° lyA2 α are the most prevalent mutations in this region. The presence of -α 3.7 with three different haplotypes suggests an older history for this mutation. The high prevalence of α p ° lyA2 α in Mazandaran Province, Iran compared to other parts of the country and the world, suggests a founder effect. Altogether, we here provide further data confirming the heterogeneity of the northern population of Iran. These data may contribute to the establishment of a national mutation database, more accurate genetic counseling and prenatal diagnosis (PND).
Functional Characterization of the Osteoarthritis Susceptibility Mapping to CHST11—A Bioinformatics and Molecular Study

PubMed Central

Reynard, Louise N.; Ratnayake, Madhushika; Santibanez-Koref, Mauro

2016-01-01

The single nucleotide polymorphism (SNP) rs835487 is associated with hip osteoarthritis (OA) at the genome-wide significance level and is located within CHST11, which codes for carbohydrate sulfotransferase 11. This enzyme post-translationally modifies proteoglycan prior to its deposition in the cartilage extracellular matrix. Using bioinformatics and experimental analyses, our aims were to characterise the rs835487 association signal and to identify the causal functional variant/s. Database searches revealed that rs835487 resides within a linkage disequilibrium (LD) block of only 2.7 kb and is in LD (r2 ≥ 0.8) with six other SNPs. These are all located within intron 2 of CHST11, in a region that has predicted enhancer activity and which shows a high degree of conservation in primates. Luciferase reporter assays revealed that of the seven SNPs, rs835487 and rs835488, which have a pairwise r2 of 0.962, are the top functional candidates; the haplotype composed of the OA-risk conferring G allele of rs835487 and the corresponding T allele of rs835488 (the G-T haplotype) demonstrated significantly different enhancer activity relative to the haplotype composed of the non-risk A allele of rs835487 and the corresponding C allele of rs835488 (the A-C haplotype) (p < 0.001). Electrophoretic mobility shift assays and supershifts identified several transcription factors that bind more strongly to the risk-conferring G and T alleles of the two SNPs, including SP1, SP3, YY1 and SUB1. CHST11 was found to be upregulated in OA versus non-OA cartilage (p < 0.001) and was expressed dynamically during chondrogenesis. Its expression in adult cartilage did not however correlate with rs835487 genotype. Our data demonstrate that the OA susceptibility is mediated by differential protein binding to the alleles of rs835487 and rs835488, which are located within an enhancer whose target may be CHST11 during chondrogenesis or an alternative gene. PMID:27391021
Fine definition of the pedigree haplotypes of closely related rice cultivars by means of genome-wide discovery of single-nucleotide polymorphisms.

PubMed

Yamamoto, Toshio; Nagasaki, Hideki; Yonemaru, Jun-ichi; Ebana, Kaworu; Nakajima, Maiko; Shibaya, Taeko; Yano, Masahiro

2010-04-27

To create useful gene combinations in crop breeding, it is necessary to clarify the dynamics of the genome composition created by breeding practices. A large quantity of single-nucleotide polymorphism (SNP) data is required to permit discrimination of chromosome segments among modern cultivars, which are genetically related. Here, we used a high-throughput sequencer to conduct whole-genome sequencing of an elite Japanese rice cultivar, Koshihikari, which is closely related to Nipponbare, whose genome sequencing has been completed. Then we designed a high-throughput typing array based on the SNP information by comparison of the two sequences. Finally, we applied this array to analyze historical representative rice cultivars to understand the dynamics of their genome composition. The total 5.89-Gb sequence for Koshihikari, equivalent to 15.7 x the entire rice genome, was mapped using the Pseudomolecules 4.0 database for Nipponbare. The resultant Koshihikari genome sequence corresponded to 80.1% of the Nipponbare sequence and led to the identification of 67,051 SNPs. A high-throughput typing array consisting of 1917 SNP sites distributed throughout the genome was designed to genotype 151 representative Japanese cultivars that have been grown during the past 150 years. We could identify the ancestral origin of the pedigree haplotypes in 60.9% of the Koshihikari genome and 18 consensus haplotype blocks which are inherited from traditional landraces to current improved varieties. Moreover, it was predicted that modern breeding practices have generally decreased genetic diversity Detection of genome-wide SNPs by both high-throughput sequencer and typing array made it possible to evaluate genomic composition of genetically related rice varieties. With the aid of their pedigree information, we clarified the dynamics of chromosome recombination during the historical rice breeding process. We also found several genomic regions decreasing genetic diversity which might be caused by a recent human selection in rice breeding. The definition of pedigree haplotypes by means of genome-wide SNPs will facilitate next-generation breeding of rice and other crops.
Gene flow for Echinococcus granulosus metapopulations determined by mitochondrial sequences: A reliable approach for reflecting epidemiological drift of parasite among neighboring countries.

PubMed

Mahami-Oskouei, Mahmoud; Kaseb-Yazdanparast, Azam; Spotin, Adel; Shahbazi, Abbas; Adibpour, Mohammad; Ahmadpour, Ehsan; Ghabouli-Mehrabani, Nader

2016-12-01

In genetic diversity and population structure of Echinococcus granulosus, the gene flow can illustrate how the Echinococcus isolates have epidemiologically drifted among endemic neighboring countries. 51 isolates of hydatid cysts were collected from human, dog, cattle and sheep in northwest Iran, where placed co-border with Turkey. DNA samples were extracted, amplified and subjected to sequence analysis of NADH dehydrogenase subunit 1 (nad1) and cytochrome oxidase subunit 1 (cox1) genes. As well, sequences of Echinococcus at east to the southeast regions of Turkey were retrieved from GenBank database for the cox1 gene. The confirmed isolates were grouped as G1 (n = 74) and G3 (n = 6) genotypes. 31 unique haplotypes were identified inferred by the analyzed sequences of cox1 among two distinct populations. A parsimonious network of the sequence haplotypes displayed star-like features in the overall population containing TUR1, IR15 and IR22 as the most common haplotypes. According to AMOVA test, the high value of haplotype diversity (0.94758-0.98901) of E. granulosus was reflected the total genetic variability within populations while nucleotide diversity was low (0.00727-0.01046) in Iranian and Turkish metapopulations. Neutrality indices of the cox1 were shown negative values (-15.078 to -10.057) in Echinococcus populations which indicating a significant divergence from neutrality. A pairwise fixation index (Fst) as a degree of gene flow was partially high value for all populations (0.151). The statistically Fst value indicates that E. granulosus sensu stricto (G1-G3) are genetically moderate differentiated among Iranian and Turkish isolates. The occurrence of TUR1 and IR15 elucidate that there is possibly the dawn of domestication due to transfer of alleles between populations through the diffusion of stock raising or anthropogenic movements. To evaluate the hypothetical evolutionary scenario, further exploration is necessitated to analyze isolates from various host species in rest Middle East countries. Copyright Â© 2016 Elsevier Inc. All rights reserved.
A potential third Manta Ray species near the Yucatán Peninsula? Evidence for a recently diverged and novel genetic Manta group from the Gulf of Mexico

PubMed Central

Hinojosa-Alvarez, Silvia; Walter, Ryan P.; Paig-Tran, E. Misty

2016-01-01

We present genetic and morphometric support for a third, distinct, and recently diverged group of Manta ray that appears resident to the Yucatán coastal waters of the Gulf of Mexico. Individuals of the genus Manta from Isla Holbox are markedly different from the other described manta rays in their morphology, habitat preference, and genetic makeup. Herein referred to as the Yucatán Manta Ray, these individuals form two genetically distinct groups: (1) a group of mtDNA haplotypes divergent (0.78%) from the currently recognized Manta birostris and M. alfredi species, and (2) a group possessing mtDNA haplotypes of M. birostris and highly similar haplotypes. The latter suggests the potential for either introgressive hybridization between Yucatán Manta Rays and M. birostris, or the retention of ancestral M. birostris signatures among Yucatán Manta Rays. Divergence of the genetically distinct Yucatán Manta Ray from M. birostris appears quite recent (<100,000 YBP) following fit to an Isolation-with-Migration model, with additional support for asymmetrical gene flow from M. birostris into the Yucatán Manta Ray. Formal naming of the Yucatán Manta Ray cannot yet be assigned until an in-depth taxonomic study and further confirmation of the genetic identity of existing type specimens has been performed. PMID:27833795
Molecular and morphological evidence for three species of Diplostomum (Digenea: Diplostomidae), parasites of fishes and fish-eating birds in Spain.

PubMed

Pérez-del-Olmo, Ana; Georgieva, Simona; Pula, Héctor J; Kostadinova, Aneta

2014-11-12

Recent molecular studies have revealed high species diversity of Diplostomum in central and northern Europe. However, our knowledge of the distribution of Diplostomum spp. in the southern distributional range in Europe of the snail intermediate hosts (Lymnaea stagnalis and Radix spp.) is rather limited. This study aims to fill this gap in our knowledge using molecular and morphological evidence. Nineteen fish species and six fish-eating bird species were sampled opportunistically in three regions (Catalonia, Extremadura and Aragon) in Spain. All isolates of Diplostomum spp. were characterised morphologically and molecularly. Partial sequences of the barcode region of the cox1 mitochondrial gene and complete sequences of the ribosomal ITS1-5.8S-ITS2 gene cluster were used for molecular identification of the isolates. Integrated morphological and molecular analyses demonstrated the presence of three species among the larval and adult isolates of Diplostomum spp. sampled in Spain: Diplostomum spathaceum (in fish and birds), D. pseudospathaceum (in birds) and Diplostomum sp. (in fish) referred to as Clade Q sensu Georgieva et al. (Int J Parasitol, 43:57-72, 2013). We detected ten cox1 haplotypes among the isolates of D. spathaceum with only one haplotype shared with adult isolates from central and northern Europe. No specific geographic pattern of the distribution of the novel haplotypes was found. This first molecular exploration of the diversity of Diplostomum spp. in southern Europe indicates much lower species richness compared with the northern regions of Europe.
Bionomics of Asian Citrus Psyllid (Hemiptera: Liviidae) Associated with Orange Jasmine Hedges in Southeast Central Florida, with Special Reference to Biological Control by Tamarixia radiata.

PubMed

Hall, David G; Rohrig, Eric

2015-06-01

The Asian citrus psyllid, Diaphorina citri Kuwayama, is an important pest in Florida because it transmits bacteria responsible for citrus huanglongbing disease. In addition to infesting citrus, orange jasmine (Murraya exotica L.) is one of Asian citrus psyllid's preferred host plants and is widely grown as an ornamental hedge. We report on Asian citrus psyllid bionomics over three years at five urban plantings of orange jasmine and on biological control of Asian citrus psyllid by a parasitoid Tamarixia radiata (Waterston). T. radiata had been released in Florida shortly after Asian citrus psyllid was first found, and the parasitoid was known to be established at each planting. Additionally, three new T. radiata haplotypes were released every 3 wk at three plantings during the first study year (one haplotype per planting, over all releases an average of 17 parasitoids per linear meter of hedge); all three haplotypes were released at a fourth planting beginning midway through the study (over all releases, an average combined total of 202 parasitoids per linear meter of hedge). Asian citrus psyllid populations were present year-round at each planting, often at large levels. Such plantings may pose risk to commercial citrus as Asian citrus psyllid reservoirs. Releases of the new haplotypes did not cause any measurable reduction in Asian citrus psyllid population levels during the study, and ironically percentage parasitism was generally highest at a planting where no releases were made. Higher release rates might have been more effective. The probability is discussed that repetitive pruning of orange jasmine reduced the full potential of T. radiata against Asian citrus psyllid in this study. Published by Oxford University Press on behalf of Entomological Society of America 2015. This work is written by US Government employees and is in the public domain in the US.
Accurate and Practical Identification of 20 Fusarium Species by Seven-Locus Sequence Analysis and Reverse Line Blot Hybridization, and an In Vitro Antifungal Susceptibility Study▿†

PubMed Central

Wang, He; Xiao, Meng; Kong, Fanrong; Chen, Sharon; Dou, Hong-Tao; Sorrell, Tania; Li, Ruo-Yu; Xu, Ying-Chun

2011-01-01

Eleven reference and 25 clinical isolates of Fusarium were subject to multilocus DNA sequence analysis to determine the species and haplotypes of the fusarial isolates from Beijing and Shandong, China. Seven loci were analyzed: the translation elongation factor 1 alpha gene (EF-1α); the nuclear rRNA internal transcribed spacer (ITS), large subunit (LSU), and intergenic spacer (IGS) regions; the second largest subunit of the RNA polymerase gene (RPB2); the calmodulin gene (CAM); and the mitochondrial small subunit (mtSSU) rRNA gene. We also evaluated an IGS-targeted PCR/reverse line blot (RLB) assay for species/haplotype identification of Fusarium. Twenty Fusarium species and seven species complexes were identified. Of 25 clinical isolates (10 species), the Gibberella (Fusarium) fujikuroi species complex was the commonest (40%) and was followed by the Fusarium solani species complex (FSSC) (36%) and the F. incarnatum-F. equiseti species complex (12%). Six FSSC isolates were identified to the species level as FSSC-3+4, and three as FSSC-5. Twenty-nine IGS, 27 EF-1α, 26 RPB2, 24 CAM, 18 ITS, 19 LSU, and 18 mtSSU haplotypes were identified; 29 were unique, and haplotypes for 24 clinical strains were novel. By parsimony informative character analysis, the IGS locus was the most phylogenetically informative, and the rRNA gene regions were the least. Results by RLB were concordant with multilocus sequence analysis for all isolates. Amphotericin B was the most active drug against all species. Voriconazole MICs were high (>8 μg/ml) for 15 (42%) isolates, including FSSC. Analysis of larger numbers of isolates is required to determine the clinical utility of the seven-locus sequence analysis and RLB assay in species classification of fusaria. PMID:21389150
Y-SNPs haplotype diversity in four Chinese cattle breeds.

PubMed

Zhang, Runfeng; Cheng, Ming; Li, Xiaofeng; Chen, Fuying; Zheng, Jing; Wang, Xiaofei; Meng, Quanke

2013-01-01

To investigate the genetic diversity of Chinese cattle, 96 male samples of 4 Chinese native cattle breeds were investigated using 5 single nucleotide polymorphisms specific to the bovine Y chromosome. Two previously described haplotypes (taurine Y2 and indicine Y3) were detected in 74 and 22 animals, respectively. The haplotype frequencies varied amongst the four native breeds. The taurine Y2 haplotype dominated in the Qinchuan, Dabieshan, and Yunba breeds. However, the indicine Y3 haplotype occurred in high frequency in the Enshi breed. Among the four native breeds, Yunba had the highest haplotype diversity (0.4330 ± 0.0750), followed by Qinchuan (0.2899 ± 0.1028) and Enshi (0.2222 ± 0.1662), Dabieshan was the least differentiated (0.1079 ± 0.0680). Compared with some foreign cattle breeds, the low level of haplotype diversity was detected in our breeds (0.2633 ± 0.1030).

Microcomputer-Based Access to Machine-Readable Numeric Databases.

ERIC Educational Resources Information Center

Wenzel, Patrick

1988-01-01

Describes the use of microcomputers and relational database management systems to improve access to numeric databases by the Data and Program Library Service at the University of Wisconsin. The internal records management system, in-house reference tools, and plans to extend these tools to the entire campus are discussed. (3 references) (CLB)
Automated processing of shoeprint images based on the Fourier transform for use in forensic science.

PubMed

de Chazal, Philip; Flynn, John; Reilly, Richard B

2005-03-01

The development of a system for automatically sorting a database of shoeprint images based on the outsole pattern in response to a reference shoeprint image is presented. The database images are sorted so that those from the same pattern group as the reference shoeprint are likely to be at the start of the list. A database of 476 complete shoeprint images belonging to 140 pattern groups was established with each group containing two or more examples. A panel of human observers performed the grouping of the images into pattern categories. Tests of the system using the database showed that the first-ranked database image belongs to the same pattern category as the reference image 65 percent of the time and that a correct match appears within the first 5 percent of the sorted images 87 percent of the time. The system has translational and rotational invariance so that the spatial positioning of the reference shoeprint images does not have to correspond with the spatial positioning of the shoeprint images of the database. The performance of the system for matching partial-prints was also determined.
(BARS) -- Bibliographic Retrieval System Sandia Shock Compression (SSC) database Shock Physics Index (SPHINX) database. Volume 1: UNIX version query guide customized application for INGRES

DOE Office of Scientific and Technical Information (OSTI.GOV)

Herrmann, W.; von Laven, G.M.; Parker, T.

1993-09-01

The Bibliographic Retrieval System (BARS) is a data base management system specially designed to retrieve bibliographic references. Two databases are available, (i) the Sandia Shock Compression (SSC) database which contains over 5700 references to the literature related to stress waves in solids and their applications, and (ii) the Shock Physics Index (SPHINX) which includes over 8000 further references to stress waves in solids, material properties at intermediate and low rates, ballistic and hypervelocity impact, and explosive or shock fabrication methods. There is some overlap in the information in the two data bases.
Genetic variation of 'Candidatus Liberibacter solanacearum' haplotype C and identification of a novel haplotype from Trioza urticae and stinging nettle.

PubMed

Haapalainen, Minna L; Wang, Jinhui; Latvala, Satu; Lehtonen, Mikko T; Pirhonen, Minna; Nissinen, Anne I

2018-03-30

'Candidatus Liberibacter solanacearum' (CLso) haplotype C is associated with disease in carrots and transmitted by the carrot psyllid Trioza apicalis. To identify possible other sources and vectors of this pathogen in Finland, samples were taken of wild plants within and near the carrot fields, the psyllids feeding on these plants, parsnips growing next to carrots, and carrot seeds. For analyzing the genotype of the CLso positive samples, a multi-locus sequence typing (MLST) scheme was developed. CLso haplotype C was detected in 11% of the Trioza anthrisci samples, in 35% of the Anthriscus sylvestris plants with discoloration, and in parsnips showing leaf discoloration. MLST revealed that the CLso in T. anthrisci and most A. sylvestris plants represent different strains than the bacteria found in T. apicalis and the cultivated plants. CLso haplotype D was detected in two of the 34 carrot seed lots tested, but was not detected in the plants grown from these seeds. Phylogenetic analysis by UPGMA clustering suggested that the haplotype D is more closely related to the haplotype A than to C. A novel, sixth haplotype of CLso, most closely related to A and D, was found in the psyllid Trioza urticae and stinging nettle (Urtica dioica, Urticaceae), and named as haplotype U.
Discovery, evaluation and distribution of haplotypes of the wheat Ppd-D1 gene.

PubMed

Guo, Zhiai; Song, Yanxia; Zhou, Ronghua; Ren, Zhenglong; Jia, Jizeng

2010-02-01

Ppd-D1 is one of the most potent genes affecting the photoperiod response of wheat (Triticum aestivum). Only two alleles, insensitive Ppd-D1a and sensitive Ppd-D1b, were known previously, and these did not adequately explain the broad adaptation of wheat to photoperiod variation. In this study, five diagnostic molecular markers were employed to identify Ppd-D1 haplotypes in 492 wheat varieties from diverse geographic locations and 55 accessions of Aegilops tauschii, the D genome donor species of wheat. Six Ppd-D1 haplotypes, designated I-VI, were identified. Types II, V and VI were considered to be more ancient and types I, III and IV were considered to be derived from type II. The transcript abundances of the Ppd-D1 haplotypes showed continuous variation, being highest for haplotype I, lowest for haplotype III, and correlating negatively with varietal differences in heading time. These haplotypes also significantly affected other agronomic traits. The distribution frequency of Ppd-D1 haplotypes showed partial correlations with both latitudes and altitudes of wheat cultivation regions. The evolution, expression and distribution of Ppd-D1 haplotypes were consistent evidentially with each other. What was regarded as a pair of alleles in the past can now be considered a series of alleles leading to continuous variation.
Glucocorticoid Receptor Related Genes: Genotype And Brain Gene Expression Relationships To Suicide And Major Depressive Disorder

PubMed Central

Pantazatos, Spiro P.; Huang, Yung-yu; Rosoklija, Gorazd B.; Dwork, Andrew J.; Burke, Ainsley; Arango, Victoria; Oquendo, Maria A.; Mann, J. John

2016-01-01

Introduction We tested the relationship between genotype, gene expression and suicidal behavior and MDD in live subjects and postmortem samples for three genes, associated with the hypothalamic-pituitary-adrenal axis, suicidal behavior and major depressive disorder (MDD); FK506 binding protein 5 (FKBP5), Spindle and kinetochore-associated protein 2 (SKA2) and Glucocorticoid Receptor (NR3C1). Materials and Methods Single-nucleotide polymorphisms (SNPs) and haplotypes were tested for association with suicidal behavior and MDD in a live (N=277) and a postmortem sample (N=209). RNA-seq was used to examine gene and isoform-level brain expression postmortem (Brodmann Area 9) (N=59). Expression quantitative trait loci (eQTL) relationships were examined using a public database (UK Brain Expression Consortium). Results We identified a haplotype within the FKBP5 gene, present in 47% of the live subjects, that was associated with increased risk of suicide attempt (OR=1.58, t=6.03, p=0.014). Six SNPs on this gene, three SNPs on SKA2 and one near NR3C1 showed before-adjustment association with attempted suicide, and two SNPs of SKA2 with suicide death, but none stayed significant after adjustment for multiple testing. Only the SKA2 SNPs were related to expression in the prefrontal cortex. One NR3C1 transcript had lower expression in suicide relative to non-suicide sudden death cases (b=-0.48, SE=0.12, t=-4.02, adjusted p=0.004). Conclusion We have identified an association of FKBP5 haplotype with risk of suicide attempt and found an association between suicide and altered NR3C1 gene expression in the prefrontal cortex. Our findings further implicate hypothalamic pituitary axis dysfunction in suicidal behavior. PMID:27030168
GLUCOCORTICOID RECEPTOR-RELATED GENES: GENOTYPE AND BRAIN GENE EXPRESSION RELATIONSHIPS TO SUICIDE AND MAJOR DEPRESSIVE DISORDER.

PubMed

Yin, Honglei; Galfalvy, Hanga; Pantazatos, Spiro P; Huang, Yung-Yu; Rosoklija, Gorazd B; Dwork, Andrew J; Burke, Ainsley; Arango, Victoria; Oquendo, Maria A; Mann, J John

2016-06-01

We tested the relationship between genotype, gene expression and suicidal behavior and major depressive disorder (MDD) in live subjects and postmortem samples for three genes, associated with the hypothalamic-pituitary-adrenal axis, suicidal behavior, and MDD; FK506-binding protein 5 (FKBP5), Spindle and kinetochore-associated protein 2 (SKA2), and Glucocorticoid Receptor (NR3C1). Single-nucleotide polymorphisms (SNPs) and haplotypes were tested for association with suicidal behavior and MDD in a live (N = 277) and a postmortem sample (N = 209). RNA-seq was used to examine gene and isoform-level brain expression postmortem (Brodmann Area 9; N = 59). Expression quantitative trait loci (eQTL) relationships were examined using a public database (UK Brain Expression Consortium). We identified a haplotype within the FKBP5 gene, present in 47% of the live subjects, which was associated with increased risk of suicide attempt (OR = 1.58, t = 6.03, P = .014). Six SNPs on this gene, three SNPs on SKA2, and one near NR3C1 showed before-adjustment association with attempted suicide, and two SNPs of SKA2 with suicide death, but none stayed significant after adjustment for multiple testing. Only the SKA2 SNPs were related to expression in the prefrontal cortex (pFCTX). One NR3C1 transcript had lower expression in suicide relative to nonsuicide sudden death cases (b = -0.48, SE = 0.12, t = -4.02, adjusted P = .004). We have identified an association of FKBP5 haplotype with risk of suicide attempt and found an association between suicide and altered NR3C1 gene expression in the pFCTX. Our findings further implicate hypothalamic pituitary axis dysfunction in suicidal behavior. © 2016 Wiley Periodicals, Inc.
Regional differences in the distribution of the sub-Saharan, West Eurasian, and South Asian mtDNA lineages in Yemen.

PubMed

Cerný, Viktor; Mulligan, Connie J; Rídl, Jakub; Zaloudková, Martina; Edens, Christopher M; Hájek, Martin; Pereira, Luísa

2008-06-01

Despite its key location for population movements out of and back into Africa, Yemen has not yet been sampled on a regional level for an investigation of sub-Saharan, West Eurasian, and South Asian genetic contributions. In this study, we present mitochondrial DNA (mtDNA) data for regionally distinct Yemeni populations that reveal different distributions of mtDNA lineages. An extensive database of mtDNA sequences from North and East African, Middle Eastern and Indian populations was analyzed to provide a context for the regional Yemeni mtDNA datasets. The groups of western Yemen appear to be most closely related to Middle Eastern and North African populations, while the eastern Yemeni population from Hadramawt is most closely related to East Africa. Furthermore, haplotype matches with Africa are almost exclusively confined to West Eurasian R0a haplogroup in southwestern Yemen, although more sub-Saharan L-type matches appear in more northern Yemeni populations. In fact, Yemeni populations have the highest frequency of R0a haplotypes detected to date, thus Yemen or southern Arabia may be the site of the initial expansion of this haplogroup. Whereas two variants of the sub-Saharan haplogroup M1 were detected only in southwestern Yemen close to the Bab el-Mandeb Strait, different non-African M haplotypes were detected at low frequencies (approximately 2%) in western parts of the country and at a higher frequency (7.5%) in the Hadramawt. We conclude that the Yemeni gene pool is highly stratified both regionally and temporally and that it has received West Eurasian, Northeast African, and South Asian gene flow. Copyright 2008 Wiley-Liss, Inc.
Haplotype Phasing and Inheritance of Copy Number Variants in Nuclear Families

PubMed Central

Palta, Priit; Kaplinski, Lauris; Nagirnaja, Liina; Veidenberg, Andres; Möls, Märt; Nelis, Mari; Esko, Tõnu; Metspalu, Andres; Laan, Maris; Remm, Maido

2015-01-01

DNA copy number variants (CNVs) that alter the copy number of a particular DNA segment in the genome play an important role in human phenotypic variability and disease susceptibility. A number of CNVs overlapping with genes have been shown to confer risk to a variety of human diseases thus highlighting the relevance of addressing the variability of CNVs at a higher resolution. So far, it has not been possible to deterministically infer the allelic composition of different haplotypes present within the CNV regions. We have developed a novel computational method, called PiCNV, which enables to resolve the haplotype sequence composition within CNV regions in nuclear families based on SNP genotyping microarray data. The algorithm allows to i) phase normal and CNV-carrying haplotypes in the copy number variable regions, ii) resolve the allelic copies of rearranged DNA sequence within the haplotypes and iii) infer the heritability of identified haplotypes in trios or larger nuclear families. To our knowledge this is the first program available that can deterministically phase null, mono-, di-, tri- and tetraploid genotypes in CNV loci. We applied our method to study the composition and inheritance of haplotypes in CNV regions of 30 HapMap Yoruban trios and 34 Estonian families. For 93.6% of the CNV loci, PiCNV enabled to unambiguously phase normal and CNV-carrying haplotypes and follow their transmission in the corresponding families. Furthermore, allelic composition analysis identified the co-occurrence of alternative allelic copies within 66.7% of haplotypes carrying copy number gains. We also observed less frequent transmission of CNV-carrying haplotypes from parents to children compared to normal haplotypes and identified an emergence of several de novo deletions and duplications in the offspring. PMID:25853576
Haplotype phasing and inheritance of copy number variants in nuclear families.

PubMed

Palta, Priit; Kaplinski, Lauris; Nagirnaja, Liina; Veidenberg, Andres; Möls, Märt; Nelis, Mari; Esko, Tõnu; Metspalu, Andres; Laan, Maris; Remm, Maido

2015-01-01

DNA copy number variants (CNVs) that alter the copy number of a particular DNA segment in the genome play an important role in human phenotypic variability and disease susceptibility. A number of CNVs overlapping with genes have been shown to confer risk to a variety of human diseases thus highlighting the relevance of addressing the variability of CNVs at a higher resolution. So far, it has not been possible to deterministically infer the allelic composition of different haplotypes present within the CNV regions. We have developed a novel computational method, called PiCNV, which enables to resolve the haplotype sequence composition within CNV regions in nuclear families based on SNP genotyping microarray data. The algorithm allows to i) phase normal and CNV-carrying haplotypes in the copy number variable regions, ii) resolve the allelic copies of rearranged DNA sequence within the haplotypes and iii) infer the heritability of identified haplotypes in trios or larger nuclear families. To our knowledge this is the first program available that can deterministically phase null, mono-, di-, tri- and tetraploid genotypes in CNV loci. We applied our method to study the composition and inheritance of haplotypes in CNV regions of 30 HapMap Yoruban trios and 34 Estonian families. For 93.6% of the CNV loci, PiCNV enabled to unambiguously phase normal and CNV-carrying haplotypes and follow their transmission in the corresponding families. Furthermore, allelic composition analysis identified the co-occurrence of alternative allelic copies within 66.7% of haplotypes carrying copy number gains. We also observed less frequent transmission of CNV-carrying haplotypes from parents to children compared to normal haplotypes and identified an emergence of several de novo deletions and duplications in the offspring.
Global spread and genetic variants of the two CYP9M10 haplotype forms associated with insecticide resistance in Culex quinquefasciatus Say.

PubMed

Itokawa, K; Komagata, O; Kasai, S; Kawada, H; Mwatele, C; Dida, G O; Njenga, S M; Mwandawiro, C; Tomita, T

2013-09-01

Insecticide resistance develops as a genetic factor (allele) conferring lower susceptibility to insecticides proliferates within a target insect population under strong positive selection. Intriguingly, a resistance allele pre-existing in a population often bears a series of further adaptive allelic variants through new mutations. This phenomenon occasionally results in replacement of the predominating resistance allele by fitter new derivatives, and consequently, development of greater resistance at the population level. The overexpression of the cytochrome P450 gene CYP9M10 is associated with pyrethroid resistance in the southern house mosquito Culex quinquefasciatus. Previously, we have found two genealogically related overexpressing CYP9M10 haplotypes, which differ in gene copy number (duplicated and non-duplicated). The duplicated haplotype was derived from the non-duplicated overproducer probably recently. In the present study, we investigated allelic series of CYP9M10 involved in three C. quinquefasciatus laboratory colonies recently collected from three different localities. Duplicated and non-duplicated overproducing haplotypes coexisted in African and Asian colonies indicating a global distribution of both haplotype lineages. The duplicated haplotypes both in the Asian and African colonies were associated with higher expression levels and stronger resistance than non-duplicated overproducing haplotypes. There were slight variation in expression level among the non-duplicated overproducing haplotypes. The nucleotide sequences in coding and upstream regions among members of this group also showed a little diversity. Non-duplicated overproducing haplotypes with relatively higher expression were genealogically closer to the duplicated haplotypes than the other non-duplicated overproducing haplotypes, suggesting multiple cis-acting mutations before duplication.
[A total of 362 HLA different haplotypes and HLA recombination haplotypes based on analysis of their family pedigree in Chinese partial Han populations].

PubMed

Gao, Su-Qing; Cheng, Xi; Li, Qian; Li, Yu-Zhu; Deng, Zhi-Hui

2009-06-01

This study was aimed to discover the novel HLA recombination haplotypes and investigate the distribution of haplotypes in Chinese Han population. Based on the HLA-A, B, DRB1 typing results of 179 family members, 791 haplotypes were assigned by the mode of inheritance. The results showed that a total of 4 novel recombinant haplotypes in HLA-DRB1 locus region were observed in 4 families, which ratio of paternal to maternal chromosomes was 3:1. The recombination ratio between HLA-DRB1 and HLA-A or B loci was 0.92% (4/433). There were a total of 362 kinds of HLA-A, -B, -DRB1 haplotypes to be confirmed in Chinese Han partial population. A33-B58-DR17, A2-B46-DR9, A30-B13-DR7, A11-B13-DR15, A11-B75-DR12 and A2-B46-DR14 were the most common haplotypes that was consistent with the distribution of HLA alleles in unrelated donors. There were A1-B63-DR12, A29-B46-DR15, A1-B61-DR10, A34-B35-DR9, A29-B54-DR4, A23-B13-DR16 and A34-B62-DR15 haplotypes and so on, which were rare haplotypes not yet reported in Chinese. It is concluded that the HLA-A-B-DRB1 haplotypes would be confirmed by analysis of their family pedigree. The results obtained in this study are basic data for study of Chinese anthropology, organ transplantation and disease correlation analysis.
Haplotype-Based Association Analysis via Variance-Components Score Test

PubMed Central

Tzeng, Jung-Ying ; Zhang, Daowen

2007-01-01

Haplotypes provide a more informative format of polymorphisms for genetic association analysis than do individual single-nucleotide polymorphisms. However, the practical efficacy of haplotype-based association analysis is challenged by a trade-off between the benefits of modeling abundant variation and the cost of the extra degrees of freedom. To reduce the degrees of freedom, several strategies have been considered in the literature. They include (1) clustering evolutionarily close haplotypes, (2) modeling the level of haplotype sharing, and (3) smoothing haplotype effects by introducing a correlation structure for haplotype effects and studying the variance components (VC) for association. Although the first two strategies enjoy a fair extent of power gain, empirical evidence showed that VC methods may exhibit only similar or less power than the standard haplotype regression method, even in cases of many haplotypes. In this study, we report possible reasons that cause the underpowered phenomenon and show how the power of the VC strategy can be improved. We construct a score test based on the restricted maximum likelihood or the marginal likelihood function of the VC and identify its nontypical limiting distribution. Through simulation, we demonstrate the validity of the test and investigate the power performance of the VC approach and that of the standard haplotype regression approach. With suitable choices for the correlation structure, the proposed method can be directly applied to unphased genotypic data. Our method is applicable to a wide-ranging class of models and is computationally efficient and easy to implement. The broad coverage and the fast and easy implementation of this method make the VC strategy an effective tool for haplotype analysis, even in modern genomewide association studies. PMID:17924336
Mapping of HLA- DQ haplotypes in a group of Danish patients with celiac disease.

PubMed

Lund, Flemming; Hermansen, Mette N; Pedersen, Merete F; Hillig, Thore; Toft-Hansen, Henrik; Sölétormos, György

2015-10-01

A cost-effective identification of HLA- DQ risk haplotypes using the single nucleotide polymorphism (SNP) technique has recently been applied in the diagnosis of celiac disease (CD) in four European populations. The objective of the study was to map risk HLA- DQ haplotypes in a group of Danish CD patients using the SNP technique. Cohort A: Among 65 patients with gastrointestinal symptoms we compared the HLA- DQ2 and HLA- DQ8 risk haplotypes obtained by the SNP technique (method 1) with results based on a sequence specific primer amplification technique (method 2) and a technique used in an assay from BioDiagene (method 3). Cohort B: 128 patients with histologically verified CD were tested for CD risk haplotypes (method 1). Patients with negative results were further tested for sub-haplotypes of HLA- DQ2 (methods 2 and 3). Cohort A: The three applied methods provided the same HLA- DQ2 and HLA- DQ8 results among 61 patients. Four patients were negative for the HLA- DQ2 and HLA- DQ8 haplotypes (method 1) but were positive for the HLA- DQ2.5-trans and HLA- DQ2.2 haplotypes (methods 2 and 3). Cohort B: A total of 120 patients were positive for the HLA- DQ2.5-cis and HLA- DQ8 haplotypes (method 1). The remaining seven patients were positive for HLA- DQ2.5-trans or HLA- DQ2.2 haplotypes (methods 2 and 3). One patient was negative with all three HLA methods. The HLA- DQ risk haplotypes were detected in 93.8% of the CD patients using the SNP technique (method 1). The sensitivity increased to 99.2% by combining methods 1 - 3.
Haplotype-based approach to known MS-associated regions increases the amount of explained risk

PubMed Central

Khankhanian, Pouya; Gourraud, Pierre-Antoine; Lizee, Antoine; Goodin, Douglas S

2015-01-01

Genome-wide association studies (GWAS), using single nucleotide polymorphisms (SNPs), have yielded 110 non-human leucocyte antigen genomic regions that are associated with multiple sclerosis (MS). Despite this large number of associations, however, only 28% of MS-heritability can currently be explained. Here we compare the use of multi-SNP-haplotypes to the use of single-SNPs as alternative methods to describe MS genetic risk. SNP-haplotypes (of various lengths from 1 up to 15 contiguous SNPs) were constructed at each of the 110 previously identified, MS-associated, genomic regions. Even after correcting for the larger number of statistical comparisons made when using the haplotype-method, in 32 of the regions, the SNP-haplotype based model was markedly more significant than the single-SNP based model. By contrast, in no region was the single-SNP based model similarly more significant than the SNP-haplotype based model. Moreover, when we included the 932 MS-associated SNP-haplotypes (that we identified from 102 regions) as independent variables into a logistic linear model, the amount of MS-heritability, as assessed by Nagelkerke's R-squared, was 38%, which was considerably better than 29%, which was obtained by using only single-SNPs. This study demonstrates that SNP-haplotypes can be used to fine-map the genetic associations within regions of interest previously identified by single-SNP GWAS. Moreover, the amount of the MS genetic risk explained by the SNP-haplotype associations in the 110 MS-associated genomic regions was considerably greater when using SNP-haplotypes than when using single-SNPs. Also, the use of SNP-haplotypes can lead to the discovery of new regions of interest, which have not been identified by a single-SNP GWAS. PMID:26185143
β3 Integrin Haplotype Influences Gene Regulation and Plasma von Willebrand Factor Activity

PubMed Central

Payne, Katie E; Bray, Paul F; Grant, Peter J; Carter, Angela M

2008-01-01

The Leu33Pro polymorphism of the gene encoding β3 integrin (ITGB3) is associated with acute coronary syndromes and influences platelet aggregation. Three common promoter polymorphisms have also been identified. The aims of this study were to (1) investigate the influence of the ITGB3 −400C/A, −425A/C and −468G/A promoter polymorphisms on reporter gene expression and nuclear protein binding and (2) determine genotype and haplotype associations with platelet αIIbβ3 receptor density. Promoter haplotypes were introduced into an ITGB3 promoter-pGL3 construct by site directed mutagenesis and luciferase reporter gene expression analysed in HEL and HMEC-1 cells. Binding of nuclear proteins was assessed by electrophoretic mobility shift assay. The association of ITGB3 haplotype with platelet αIIbβ3 receptor density was determined in 223 subjects. Species conserved motifs were identified in the ITGB3 promoter in the vicinity of the 3 polymorphisms. The GAA, GCC, AAC, AAA and ACC constructs induced ~50% increased luciferase expression relative to the GAC construct in both cell types. Haplotype analysis including Leu33Pro indicated 5 common haplotypes; no associations between ITGB3 haplotypes and receptor density were found. However, the GCC-Pro33 haplotype was associated with significantly higher vWF activity (128.6 [112.1–145.1]%) compared with all other haplotypes (107.1 [101.2–113.0]%, p=0.02). In conclusion, the GCC-Pro33 haplotype was associated with increased vWF activity but not with platelet αIIbβ3 receptor density, which may indicate ITGB3 haplotype influences endothelial function. PMID:18045606
Single Marker and Haplotype-Based Association Analysis of Semolina and Pasta Colour in Elite Durum Wheat Breeding Lines Using a High-Density Consensus Map.

PubMed

N'Diaye, Amidou; Haile, Jemanesh K; Cory, Aron T; Clarke, Fran R; Clarke, John M; Knox, Ron E; Pozniak, Curtis J

2017-01-01

Association mapping is usually performed by testing the correlation between a single marker and phenotypes. However, because patterns of variation within genomes are inherited as blocks, clustering markers into haplotypes for genome-wide scans could be a worthwhile approach to improve statistical power to detect associations. The availability of high-density molecular data allows the possibility to assess the potential of both approaches to identify marker-trait associations in durum wheat. In the present study, we used single marker- and haplotype-based approaches to identify loci associated with semolina and pasta colour in durum wheat, the main objective being to evaluate the potential benefits of haplotype-based analysis for identifying quantitative trait loci. One hundred sixty-nine durum lines were genotyped using the Illumina 90K Infinium iSelect assay, and 12,234 polymorphic single nucleotide polymorphism (SNP) markers were generated and used to assess the population structure and the linkage disequilibrium (LD) patterns. A total of 8,581 SNPs previously localized to a high-density consensus map were clustered into 406 haplotype blocks based on the average LD distance of 5.3 cM. Combining multiple SNPs into haplotype blocks increased the average polymorphism information content (PIC) from 0.27 per SNP to 0.50 per haplotype. The haplotype-based analysis identified 12 loci associated with grain pigment colour traits, including the five loci identified by the single marker-based analysis. Furthermore, the haplotype-based analysis resulted in an increase of the phenotypic variance explained (50.4% on average) and the allelic effect (33.7% on average) when compared to single marker analysis. The presence of multiple allelic combinations within each haplotype locus offers potential for screening the most favorable haplotype series and may facilitate marker-assisted selection of grain pigment colour in durum wheat. These results suggest a benefit of haplotype-based analysis over single marker analysis to detect loci associated with colour traits in durum wheat.
Phylogeography of the Qinghai-Tibetan Plateau endemic Juniperus przewalskii (Cupressaceae) inferred from chloroplast DNA sequence variation.

PubMed

Zhang, Q; Chiang, T Y; George, M; Liu, J Q; Abbott, R J

2005-10-01

The vegetation of the northeast Qinghai-Tibetan Plateau is dominated by alpine meadow and desert-steppe with sparse forests scattered within it. To obtain a better understanding of the phylogeography of one constituent species of the forests in this region, we examined chloroplast trnT-trnF and trnS-trnG sequence variation within Juniperus przewalskii, a key endemic tree species. Sequence data were obtained from 392 trees in 20 populations covering the entire distribution range of the species. Six cpDNA haplotypes were identified. Significant population subdivision was detected (G(ST) = 0.772, N(ST) = 0.834), suggesting low levels of recurrent gene flow among populations and significant phylogeographic structure (N(ST) > G(ST), P < 0.05). Eight of the nine disjunct populations surveyed on the high-elevation northeast plateau were fixed for a single haplotype (A), while the remaining, more westerly population, contained the same haplotype at high frequency together with two low frequency haplotypes (C and F). In contrast, most populations that occurred at lower altitudes at the plateau edge were fixed or nearly fixed for one of two haplotypes, A or E. However, two plateau edge populations had haplotype compositions different from the rest. In one, four haplotypes (A, B, D and E) were present at approximately equivalent frequencies, which might reflect a larger refugium in the area of this population during the last glacial period. Phylogenetic analysis indicated that the most widely distributed haplotype A is not ancestral to other haplotypes. The contrasting phylogeographic structures of the haplotype-rich plateau edge area and the almost haplotype-uniform plateau platform region indicate that the plateau platform was recolonized by J. przewalskii during the most recent postglacial period. This is supported by the findings of a nested clade analysis, which inferred that postglacial range expansion from the plateau edge followed by recent fragmentation is largely responsible for the present-day spatial distribution of cpDNA haplotypes within the species.
The Trichoptera barcode initiative: a strategy for generating a species-level Tree of Life

PubMed Central

Frandsen, Paul B.; Holzenthal, Ralph W.; Beet, Clare R.; Bennett, Kristi R.; Blahnik, Roger J.; Bonada, Núria; Cartwright, David; Chuluunbat, Suvdtsetseg; Cocks, Graeme V.; Collins, Gemma E.; deWaard, Jeremy; Dean, John; Flint, Oliver S.; Hausmann, Axel; Hendrich, Lars; Hess, Monika; Hogg, Ian D.; Kondratieff, Boris C.; Malicky, Hans; Milton, Megan A.; Morinière, Jérôme; Morse, John C.; Mwangi, François Ngera; Pauls, Steffen U.; Gonzalez, María Razo; Rinne, Aki; Robinson, Jason L.; Salokannel, Juha; Shackleton, Michael; Smith, Brian; Stamatakis, Alexandros; StClair, Ros; Thomas, Jessica A.; Zamora-Muñoz, Carmen; Ziesmann, Tanja

2016-01-01

DNA barcoding was intended as a means to provide species-level identifications through associating DNA sequences from unknown specimens to those from curated reference specimens. Although barcodes were not designed for phylogenetics, they can be beneficial to the completion of the Tree of Life. The barcode database for Trichoptera is relatively comprehensive, with data from every family, approximately two-thirds of the genera, and one-third of the described species. Most Trichoptera, as with most of life's species, have never been subjected to any formal phylogenetic analysis. Here, we present a phylogeny with over 16 000 unique haplotypes as a working hypothesis that can be updated as our estimates improve. We suggest a strategy of implementing constrained tree searches, which allow larger datasets to dictate the backbone phylogeny, while the barcode data fill out the tips of the tree. We also discuss how this phylogeny could be used to focus taxonomic attention on ambiguous species boundaries and hidden biodiversity. We suggest that systematists continue to differentiate between ‘Barcode Index Numbers’ (BINs) and ‘species’ that have been formally described. Each has utility, but they are not synonyms. We highlight examples of integrative taxonomy, using both barcodes and morphology for species description. This article is part of the themed issue ‘From DNA barcodes to biomes’. PMID:27481793
Sodium content of foods contributing to sodium intake: A comparison between selected foods from the CDC Packaged Food Database and the USDA National Nutrient Database for Standard Reference

USDA-ARS?s Scientific Manuscript database

The sodium concentration (mg/100g) for 23 of 125 Sentinel Foods were identified in the 2009 CDC Packaged Food Database (PFD) and compared with data in the USDA’s 2013 Standard Reference 26 (SR 26) database. Sentinel Foods are foods and beverages identified by USDA to be monitored as primary indicat...

Three potato centromeres are associated with distinct haplotypes with or without megabase-sized satellite repeat arrays.

PubMed

Wang, Linsheng; Zeng, Zixian; Zhang, Wenli; Jiang, Jiming

2014-02-01

We report discoveries of different haplotypes associated with the centromeres of three potato chromosomes, including haplotypes composed of long arrays of satellite repeats and haplotypes lacking the same repeats. These results are in favor of the hypothesis that satellite repeat-based centromeres may originate from neocentromeres that lack repeats.
Blocks of limited haplotype diversity revealed by high-resolution scanning of human chromosome 21.

PubMed

Patil, N; Berno, A J; Hinds, D A; Barrett, W A; Doshi, J M; Hacker, C R; Kautzer, C R; Lee, D H; Marjoribanks, C; McDonough, D P; Nguyen, B T; Norris, M C; Sheehan, J B; Shen, N; Stern, D; Stokowski, R P; Thomas, D J; Trulson, M O; Vyas, K R; Frazer, K A; Fodor, S P; Cox, D R

2001-11-23

Global patterns of human DNA sequence variation (haplotypes) defined by common single nucleotide polymorphisms (SNPs) have important implications for identifying disease associations and human traits. We have used high-density oligonucleotide arrays, in combination with somatic cell genetics, to identify a large fraction of all common human chromosome 21 SNPs and to directly observe the haplotype structure defined by these SNPs. This structure reveals blocks of limited haplotype diversity in which more than 80% of a global human sample can typically be characterized by only three common haplotypes.
COMT haplotypes, catecholamine metabolites in plasma and clinical response in schizophrenic and bipolar patients.

PubMed

Zumárraga, Mercedes; Arrúe, Aurora; Basterreche, Nieves; Macías, Isabel; Catalán, Ana; Madrazo, Arantza; Bustamante, Sonia; Zamalloa, María I; Erkoreka, Leire; Gordo, Estibaliz; Arnaiz, Ainara; Olivas, Olga; Arroita, Ariane; Marín, Elena; González-Torres, Miguel A

2016-06-01

We examined the association of COMT haplotypes and plasma metabolites of catecholamines in relation to the clinical response to antipsychotics in schizophrenic and bipolar patients. We studied 165 patients before and after four weeks of treatment, and 163 healthy controls. We assessed four COMT haplotypes and the plasma concentrations of HVA, DOPAC and MHPG. Bipolar patients: haplotypes are associated with age at onset and clinical evolution. In schizophrenic patients, an haplotype previously associated with increased risk, is related to better response of negative symptoms. Haplotypes would be good indicators of the clinical status and the treatment response in bipolar and schizophrenic patients. Larger studies are required to elucidate the clinical usefulness of these findings.
Extended Islands of Tractability for Parsimony Haplotyping

NASA Astrophysics Data System (ADS)

Fleischer, Rudolf; Guo, Jiong; Niedermeier, Rolf; Uhlmann, Johannes; Wang, Yihui; Weller, Mathias; Wu, Xi

Parsimony haplotyping is the problem of finding a smallest size set of haplotypes that can explain a given set of genotypes. The problem is NP-hard, and many heuristic and approximation algorithms as well as polynomial-time solvable special cases have been discovered. We propose improved fixed-parameter tractability results with respect to the parameter "size of the target haplotype set" k by presenting an O *(k 4k )-time algorithm. This also applies to the practically important constrained case, where we can only use haplotypes from a given set. Furthermore, we show that the problem becomes polynomial-time solvable if the given set of genotypes is complete, i.e., contains all possible genotypes that can be explained by the set of haplotypes.
Document creation, linking, and maintenance system

DOEpatents

Claghorn, Ronald [Pasco, WA

2011-02-15

A document creation and citation system designed to maintain a database of reference documents. The content of a selected document may be automatically scanned and indexed by the system. The selected documents may also be manually indexed by a user prior to the upload. The indexed documents may be uploaded and stored within a database for later use. The system allows a user to generate new documents by selecting content within the reference documents stored within the database and inserting the selected content into a new document. The system allows the user to customize and augment the content of the new document. The system also generates citations to the selected content retrieved from the reference documents. The citations may be inserted into the new document in the appropriate location and format, as directed by the user. The new document may be uploaded into the database and included with the other reference documents. The system also maintains the database of reference documents so that when changes are made to a reference document, the author of a document referencing the changed document will be alerted to make appropriate changes to his document. The system also allows visual comparison of documents so that the user may see differences in the text of the documents.
Selecting a database for literature searches in nursing: MEDLINE or CINAHL?

PubMed

Brazier, H; Begley, C M

1996-10-01

This study compares the usefulness of the MEDLINE and CINAHL databases for students on post-registration nursing courses. We searched for nine topics, using title words only. Identical searches of the two databases retrieved 1162 references, of which 88% were in MEDLINE, 33% in CINAHL and 20% in both sources. The relevance of the references was assessed by student reviewers. The positive predictive value of CINAHL (70%) was higher than that of MEDLINE (54%), but MEDLINE produced more than twice as many relevant references as CINAHL. The sensitivity of MEDLINE was 85% (95% CI 82-88%), and that of CINAHL was 41% (95% CI 37-45%). To assess the ease of obtaining the references, we developed an index of accessibility, based on the holdings of a number of Irish and British libraries. Overall, 47% of relevant references were available in the students' own library, and 64% could be obtained within 48 hours. There was no difference between the two databases overall, but when two topics relating specifically to the organization of nursing were excluded, references found in MEDLINE were significantly more accessible. We recommend that MEDLINE should be regarded as the first choice of bibliographic database for any subject other than one related strictly to the organization of nursing.
Structured Forms Reference Set of Binary Images (SFRS)

National Institute of Standards and Technology Data Gateway

NIST Structured Forms Reference Set of Binary Images (SFRS) (Web, free access) The NIST Structured Forms Database (Special Database 2) consists of 5,590 pages of binary, black-and-white images of synthesized documents. The documents in this database are 12 different tax forms from the IRS 1040 Package X for the year 1988.
Determination of haplotypes at structurally complex regions using emulsion haplotype fusion PCR.

PubMed

Tyson, Jess; Armour, John A L

2012-12-11

Genotyping and massively-parallel sequencing projects result in a vast amount of diploid data that is only rarely resolved into its constituent haplotypes. It is nevertheless this phased information that is transmitted from one generation to the next and is most directly associated with biological function and the genetic causes of biological effects. Despite progress made in genome-wide sequencing and phasing algorithms and methods, problems assembling (and reconstructing linear haplotypes in) regions of repetitive DNA and structural variation remain. These dynamic and structurally complex regions are often poorly understood from a sequence point of view. Regions such as these that are highly similar in their sequence tend to be collapsed onto the genome assembly. This is turn means downstream determination of the true sequence haplotype in these regions poses a particular challenge. For structurally complex regions, a more focussed approach to assembling haplotypes may be required. In order to investigate reconstruction of spatial information at structurally complex regions, we have used an emulsion haplotype fusion PCR approach to reproducibly link sequences of up to 1kb in length to allow phasing of multiple variants from neighbouring loci, using allele-specific PCR and sequencing to detect the phase. By using emulsion systems linking flanking regions to amplicons within the CNV, this led to the reconstruction of a 59kb haplotype across the DEFA1A3 CNV in HapMap individuals. This study has demonstrated a novel use for emulsion haplotype fusion PCR in addressing the issue of reconstructing structural haplotypes at multiallelic copy variable regions, using the DEFA1A3 locus as an example.
FamLBL: detecting rare haplotype disease association based on common SNPs using case-parent triads.

PubMed

Wang, Meng; Lin, Shili

2014-09-15

In recent years, there has been an increasing interest in using common single-nucleotide polymorphisms (SNPs) amassed in genome-wide association studies to investigate rare haplotype effects on complex diseases. Evidence has suggested that rare haplotypes may tag rare causal single-nucleotide variants, making SNP-based rare haplotype analysis not only cost effective, but also more valuable for detecting causal variants. Although a number of methods for detecting rare haplotype association have been proposed in recent years, they are population based and thus susceptible to population stratification. We propose family-triad-based logistic Bayesian Lasso (famLBL) for estimating effects of haplotypes on complex diseases using SNP data. By choosing appropriate prior distribution, effect sizes of unassociated haplotypes can be shrunk toward zero, allowing for more precise estimation of associated haplotypes, especially those that are rare, thereby achieving greater detection power. We evaluate famLBL using simulation to gauge its type I error and power. Compared with its population counterpart, LBL, highlights famLBL's robustness property in the presence of population substructure. Further investigation by comparing famLBL with Family-Based Association Test (FBAT) reveals its advantage for detecting rare haplotype association. famLBL is implemented as an R-package available at http://www.stat.osu.edu/∼statgen/SOFTWARE/LBL/. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Frequency and origin of haplotypes associated with the beta-globin gene cluster in individuals with trait and sickle cell anemia in the Atlantic and Pacific coastal regions of Colombia

PubMed Central

Fong, Cristian; Lizarralde-Iragorri, María Alejandra; Rojas-Gallardo, Diana; Barreto, Guillermo

2013-01-01

Sickle cell anemia is a genetic disease with high prevalence in people of African descent. There are five typical haplotypes associated with this disease and the haplotypes associated with the beta-globin gene cluster have been used to establish the origin of African-descendant people in America. In this work, we determined the frequency and the origin of haplotypes associated with hemoglobin S in a sample of individuals with sickle cell anemia (HbSS) and sickle cell hemoglobin trait (HbAS) in coastal regions of Colombia. Blood samples from 71 HbAS and 79 HbSS individuals were obtained. Haplotypes were determined based on the presence of variable restriction sites within the β-globin gene cluster. On the Pacific coast of Colombia the most frequent haplotype was Benin, while on the Atlantic coast Bantu was marginally higher than Benin. Eight atypical haplotypes were observed on both coasts, being more diverse in the Atlantic than in the Pacific region. These results suggest a differential settlement of the coasts, dependent on where slaves were brought from, either from the Gulf of Guinea or from Angola, where the haplotype distributions are similar. Atypical haplotypes probably originated from point mutations that lost or gained a restriction site and/or by recombination events. PMID:24385850
Inheritance of Hetero-Diploid Pollen S-Haplotype in Self-Compatible Tetraploid Chinese Cherry (Prunus pseudocerasus Lindl)

PubMed Central

Gu, Chao; Liu, Qing-Zhong; Yang, Ya-Nan; Zhang, Shu-Jun; Khan, Muhammad Awais; Wu, Jun; Zhang, Shao-Ling

2013-01-01

The breakdown of self-incompatibility, which could result from the accumulation of non-functional S-haplotypes or competitive interaction between two different functional S-haplotypes, has been studied extensively at the molecular level in tetraploid Rosaceae species. In this study, two tetraploid Chinese cherry (Prunus pseudocerasus) cultivars and one diploid sweet cherry (Prunus avium) cultivar were used to investigate the ploidy of pollen grains and inheritance of pollen-S alleles. Genetic analysis of the S-genotypes of two intercross-pollinated progenies showed that the pollen grains derived from Chinese cherry cultivars were hetero-diploid, and that the two S-haplotypes were made up of every combination of two of the four possible S-haplotypes. Moreover, the distributions of single S-haplotypes expressed in self- and intercross-pollinated progenies were in disequilibrium. The number of individuals of the two different S-haplotypes was unequal in two self-pollinated and two intercross-pollinated progenies. Notably, the number of individuals containing two different S-haplotypes (S1- and S5-, S5- and S8-, S1- and S4-haplotype) was larger than that of other individuals in the two self-pollinated progenies, indicating that some of these hetero-diploid pollen grains may have the capability to inactivate stylar S-RNase inside the pollen tube and grow better into the ovaries. PMID:23596519
Relationship of the bovine growth hormone gene to carcass traits in Japanese black cattle.

PubMed

Tatsuda, K; Oka, A; Iwamoto, E; Kuroda, Y; Takeshita, H; Kataoka, H; Kouno, S

2008-02-01

The bovine growth hormone gene (bGH) possesses three haplotypes, A, B and C, that differ by amino acid mutations at positions 127 and 172 in the fifth exon: (leucine 127, threonine 172), (valine 127, threonine 172) and (valine 127, methionine 172) respectively. The correlation between meat quality or carcass weight and these haplotypes was investigated in Japanese black cattle. Altogether, 940 bGH haplotypes were compared with respect to six carcass traits: carcass weight, longissimus muscle area, rib thickness, subcutaneous fat thickness, beef marbling score and beef colour. The frequency of the B haplotype was higher (0.421) than that of A (0.269) and C (0.311). High carcass weight and low beef marbling were associated with haplotype A (p < 0.05 and p < 0.01 respectively), whereas beef marbling was increased by haplotype C (p < 0.05). Estimated regression coefficient of the A haplotype substitution effect for carcass weight and beef marbling score were 5.55 (13.1% of the phenotypic SD) and -0.31 (17.0%) respectively. That of the C haplotype for beef marbling score was 0.20 (11.0%). The other traits showed no relationship to the haplotypes examined. The results of this investigation suggest that information pertaining to bGH polymorphisms in Japanese black cattle could be used to improve the selection of meat traits.
Fetal hemoglobin in sickle cell anemia: The Arab-Indian haplotype and new therapeutic agents.

PubMed

Habara, Alawi H; Shaikho, Elmutaz M; Steinberg, Martin H

2017-11-01

Fetal hemoglobin (HbF) has well-known tempering effects on the symptoms of sickle cell disease and its levels vary among patients with different haplotypes of the sickle hemoglobin gene. Compared with sickle cell anemia haplotypes found in patients of African descent, HbF levels in Saudi and Indian patients with the Arab-Indian (AI) haplotype exceed that in any other haplotype by nearly twofold. Genetic association studies have identified some loci associated with high HbF in the AI haplotype but these observations require functional confirmation. Saudi patients with the Benin haplotype have HbF levels almost twice as high as African patients with this haplotype but this difference is unexplained. Hydroxyurea is still the only FDA approved drug for HbF induction in sickle cell disease. While most patients treated with hydroxyurea have an increase in HbF and some clinical improvement, 10 to 20% of adults show little response to this agent. We review the genetic basis of HbF regulation focusing on sickle cell anemia in Saudi Arabia and discuss new drugs that can induce increased levels of HbF. © 2017 Wiley Periodicals, Inc.
The effects of old and recent migration waves in the distribution of HBB*S globin gene haplotypes

PubMed Central

Lindenau, Juliana D.; Wagner, Sandrine C.; de Castro, Simone M.; Hutz, Mara H.

2016-01-01

Abstract Sickle cell hemoglobin is the result of a mutation at the sixth amino acid position of the beta (β) globin chain. The HBB*S gene is in linkage disequilibrium with five main haplotypes in the β-globin-like gene cluster named according to their ethnic and geographic origins: Bantu (CAR), Benin (BEN), Senegal (SEN), Cameroon (CAM) and Arabian-Indian (ARAB). These haplotypes demonstrated that the sickle cell mutation arose independently at least five times in human history. The distribution of βS haplotypes among Brazilian populations showed a predominance of the CAR haplotype. American populations were clustered in two groups defined by CAR or BEN haplotype frequencies. This scenario is compatible with historical records about the slave trade in the Americas. When all world populations where the sickle cell gene occurs were analyzed, three clusters were disclosed based on CAR, BEN or ARAB haplotype predominance. These patterns may change in the next decades due to recent migrations waves. Since these haplotypes show different clinical characteristics, these recent migrations events raise the necessity to develop optimized public health programs for sickle cell disease screening and management. PMID:27706371
Haplotype Reconstruction in Large Pedigrees with Many Untyped Individuals

NASA Astrophysics Data System (ADS)

Li, Xin; Li, Jing

Haplotypes, as they specify the linkage patterns between dispersed genetic variations, provide important information for understanding the genetics of human traits. However haplotypes are not directly available from current genotyping platforms, and hence there are extensive investigations of computational methods to recover such information. Two major computational challenges arising in current family-based disease studies are large family sizes and many ungenotyped family members. Traditional haplotyping methods can neither handle large families nor families with missing members. In this paper, we propose a method which addresses these issues by integrating multiple novel techniques. The method consists of three major components: pairwise identical-bydescent (IBD) inference, global IBD reconstruction and haplotype restoring. By reconstructing the global IBD of a family from pairwise IBD and then restoring the haplotypes based on the inferred IBD, this method can scale to large pedigrees, and more importantly it can handle families with missing members. Compared with existing methods, this method demonstrates much higher power to recover haplotype information, especially in families with many untyped individuals.
The influence of maternal lineages on social affiliations among humpback whales (Megaptera novaeangliae) on their feeding grounds in the southern gulf of Maine.

PubMed

Weinrich, Mason T; Rosenbaum, Howard; Scott Baker, C; Blackmer, Alexis L; Whitehead, Hal

2006-01-01

Humpback whales on their feeding grounds in the Gulf of Maine typically form fluid fission/fusion groups of two to three individuals characterized by noncompetitive and, at times, cooperative behavior. Here we test the hypothesis that, despite the apparent absence of close kinship bonds, the fluid associations between feeding whales are influenced by "maternal lineages" as represented by mtDNA haplotypes. Using skin samples collected with a biopsy dart, variation in the hypervariable segment of the mtDNA control region identified 17 unique haplotypes among 159 individually identified whales from the southern Gulf of Maine. The haplotypes of a further 143 individuals were inferred from known direct maternal (cow-calf) relationships. The frequencies of associations among these 302 individuals were calculated from 21,617 sighting records collected from 1980 to 1995, excluding associations between a cow and her dependent calf. For groups of two where the haplotypes of both individuals were known (n = 3,151), individuals with the same haplotype were together significantly more often (26%) than expected by random association (20%). To account for different group sizes and associations with individuals of unknown haplotype and sex, we used Monte Carlo simulations to test for nonrandom associations in the full data set, as well as known female-only (n = 1,512), male-only (n = 730), and mixed-sex (n = 2,745) groups. Within-haplotype associations were significantly more frequent than expected at random for all groups (P = .002) and female-only groups (P = .011) but not male-only groups, while mixed-sex groups approached significance (P = .062). A Mantel test of individual pairwise association indices and haplotype identity confirmed that within-haplotype associations were more frequent than expected for all sex combinations except male-male associations, with females forming within-haplotype associations 1.7 times more often than expected by random assortment. Partial matrix correlations and permutation analyses indicated that the skew toward within-haplotype associations could not be accounted for by short-term temporal co-occurrence or fine-scale spatial distributions of individuals with shared haplotypes. While the mechanism by which individuals with a common mtDNA haplotype assort remains unknown, our results strongly suggest an influence of maternal lineages on the social organization of humpback whales within a regional feeding ground.
Single Marker and Haplotype-Based Association Analysis of Semolina and Pasta Colour in Elite Durum Wheat Breeding Lines Using a High-Density Consensus Map

PubMed Central

Haile, Jemanesh K.; Cory, Aron T.; Clarke, Fran R.; Clarke, John M.; Knox, Ron E.; Pozniak, Curtis J.

2017-01-01

Association mapping is usually performed by testing the correlation between a single marker and phenotypes. However, because patterns of variation within genomes are inherited as blocks, clustering markers into haplotypes for genome-wide scans could be a worthwhile approach to improve statistical power to detect associations. The availability of high-density molecular data allows the possibility to assess the potential of both approaches to identify marker-trait associations in durum wheat. In the present study, we used single marker- and haplotype-based approaches to identify loci associated with semolina and pasta colour in durum wheat, the main objective being to evaluate the potential benefits of haplotype-based analysis for identifying quantitative trait loci. One hundred sixty-nine durum lines were genotyped using the Illumina 90K Infinium iSelect assay, and 12,234 polymorphic single nucleotide polymorphism (SNP) markers were generated and used to assess the population structure and the linkage disequilibrium (LD) patterns. A total of 8,581 SNPs previously localized to a high-density consensus map were clustered into 406 haplotype blocks based on the average LD distance of 5.3 cM. Combining multiple SNPs into haplotype blocks increased the average polymorphism information content (PIC) from 0.27 per SNP to 0.50 per haplotype. The haplotype-based analysis identified 12 loci associated with grain pigment colour traits, including the five loci identified by the single marker-based analysis. Furthermore, the haplotype-based analysis resulted in an increase of the phenotypic variance explained (50.4% on average) and the allelic effect (33.7% on average) when compared to single marker analysis. The presence of multiple allelic combinations within each haplotype locus offers potential for screening the most favorable haplotype series and may facilitate marker-assisted selection of grain pigment colour in durum wheat. These results suggest a benefit of haplotype-based analysis over single marker analysis to detect loci associated with colour traits in durum wheat. PMID:28135299
DLA Class II Alleles Are Associated with Risk for Canine Symmetrical Lupoid Onychodystropy (SLO)

PubMed Central

Wilbe, Maria; Ziener, Martine Lund; Aronsson, Anita; Harlos, Charlotte; Sundberg, Katarina; Norberg, Elin; Andersson, Lisa; Lindblad-Toh, Kerstin; Hedhammar, Åke; Andersson, Göran; Lingaas, Frode

2010-01-01

Symmetrical lupoid onychodystrophy (SLO) is an immune-mediated disease in dogs affecting the claws with a suggested autoimmune aethiology. Sequence-based genotyping of the polymorphic exon 2 from DLA-DRB1, -DQA1, and -DQB1 class II loci were performed in a total of 98 SLO Gordon setter cases and 98 healthy controls. A risk haplotype (DRB1*01801/DQA1*00101/DQB1*00802) was present in 53% of cases and 34% of controls and conferred an elevated risk of developing SLO with an odds ratio (OR) of 2.1. When dogs homozygous for the risk haplotype were compared to all dogs not carrying the haplotype the OR was 5.4. However, a stronger protective haplotype (DRB1*02001/DQA1*00401/DQB1*01303, OR = 0.03, 1/OR = 33) was present in 16.8% of controls, but only in a single case (0.5%). The effect of the protective haplotype was clearly stronger than the risk haplotype, since 11.2% of the controls were heterozygous for the risk and protective haplotypes, whereas this combination was absent from cases. When the dogs with the protective haplotype were excluded, an OR of 2.5 was obtained when dogs homozygous for the risk haplotype were compared to those heterozygous for the risk haplotype, suggesting a co-dominant effect of the risk haplotype. In smaller sample sizes of the bearded collie and giant schnauzer breeds we found the same or similar haplotypes, sharing the same DQA1 allele, over-represented among the cases suggesting that the risk is associated primarily with DLA-DQ. We obtained conclusive results that DLA class II is significantly associated with risk of developing SLO in Gordon setters, thus supporting that SLO is an immune-mediated disease. Further studies of SLO in dogs may provide important insight into immune privilege of the nail apparatus and also knowledge about a number of inflammatory disorders of the nail apparatus like lichen planus, psoriasis, alopecia areata and onycholysis. PMID:20808798
DNA Sequences over the Internet Provide Greater Speed and Accuracy for Health Sciences Reference Librarians.

ERIC Educational Resources Information Center

Harzbecker, Joseph, Jr.

1993-01-01

Describes the National Institute of Health's GenBank DNA sequence database and how it can be accessed through the Internet. A real reference question, which was answered successfully using the database, is reproduced to illustrate and elaborate on the potential of the Internet for information retrieval. (10 references) (KRN)
Horse Racing at the Library: How One Library System Increased the Usage of Some of Its Online Databases

ERIC Educational Resources Information Center

Kurhan, Scott H.; Griffing, Elizabeth A.

2011-01-01

Reference services in public libraries are changing dramatically. The Internet, online databases, and shrinking budgets are all making it necessary for non-traditional reference staff to become familiar with online reference tools. Recognizing the need for cross-training, Chesapeake Public Library (CPL) developed a program called the Database…

Temporal fluctuation in North East Baltic Sea region cattle population revealed by mitochondrial and Y-chromosomal DNA analyses.

PubMed

Niemi, Marianna; Bläuer, Auli; Iso-Touru, Terhi; Harjula, Janne; Nyström Edmark, Veronica; Rannamäe, Eve; Lõugas, Lembi; Sajantila, Antti; Lidén, Kerstin; Taavitsainen, Jussi-Pekka

2015-01-01

Ancient DNA analysis offers a way to detect changes in populations over time. To date, most studies of ancient cattle have focused on their domestication in prehistory, while only a limited number of studies have analysed later periods. Conversely, the genetic structure of modern cattle populations is well known given the undertaking of several molecular and population genetic studies. Bones and teeth from ancient cattle populations from the North-East Baltic Sea region dated to the Prehistoric (Late Bronze and Iron Age, 5 samples), Medieval (14), and Post-Medieval (26) periods were investigated by sequencing 667 base pairs (bp) from the mitochondrial DNA (mtDNA) and 155 bp of intron 19 in the Y-chromosomal UTY gene. Comparison of maternal (mtDNA haplotypes) genetic diversity in ancient cattle (45 samples) with modern cattle populations in Europe and Asia (2094 samples) revealed 30 ancient mtDNA haplotypes, 24 of which were shared with modern breeds, while 6 were unique to the ancient samples. Of seven Y-chromosomal sequences determined from ancient samples, six were Y2 and one Y1 haplotype. Combined data including Swedish samples from the same periods (64 samples) was compared with the occurrence of Y-chromosomal haplotypes in modern cattle (1614 samples). The diversity of haplogroups was highest in the Prehistoric samples, where many haplotypes were unique. The Medieval and Post-Medieval samples also show a high diversity with new haplotypes. Some of these haplotypes have become frequent in modern breeds in the Nordic Countries and North-Western Russia while other haplotypes have remained in only a few local breeds or seem to have been lost. A temporal shift in Y-chromosomal haplotypes from Y2 to Y1 was detected that corresponds with the appearance of new mtDNA haplotypes in the Medieval and Post-Medieval period. This suggests a replacement of the Prehistoric mtDNA and Y chromosomal haplotypes by new types of cattle.
Effects of IL-10 haplotype and atomic bomb radiation exposure on gastric cancer risk.

PubMed

Hayashi, Tomonori; Ito, Reiko; Cologne, John; Maki, Mayumi; Morishita, Yukari; Nagamura, Hiroko; Sasaki, Keiko; Hayashi, Ikue; Imai, Kazue; Yoshida, Kengo; Kajimura, Junko; Kyoizumi, Seishi; Kusunoki, Yoichiro; Ohishi, Waka; Fujiwara, Saeko; Akahoshi, Masazumi; Nakachi, Kei

2013-07-01

Gastric cancer (GC) is one of the cancers that reveal increased risk of mortality and incidence in atomic bomb survivors. The incidence of gastric cancer in the Life Span Study cohort of the Radiation Effects Research Foundation (RERF) increased with radiation dose (gender-averaged excess relative risk per Gy = 0.28) and remains high more than 65 years after exposure. To assess a possible role of gene-environment interaction, we examined the dose response for gastric cancer incidence based on immunosuppression-related IL-10 genotype, in a cohort study with 200 cancer cases (93 intestinal, 96 diffuse and 11 other types) among 4,690 atomic bomb survivors participating in an immunological substudy. Using a single haplotype block composed of four haplotype-tagging SNPs (comprising the major haplotype allele IL-10-ATTA and the minor haplotype allele IL-10-GGCG, which are categorized by IL-10 polymorphisms at -819A>G and -592T>G, +1177T>C and +1589A>G), multiplicative and additive models for joint effects of radiation and this IL-10 haplotyping were examined. The IL-10 minor haplotype allele(s) was a risk factor for intestinal type gastric cancer but not for diffuse type gastric cancer. Radiation was not associated with intestinal type gastric cancer. In diffuse type gastric cancer, the haplotype-specific excess relative risk (ERR) for radiation was statistically significant only in the major homozygote category of IL-10 (ERR = 0.46/Gy, P = 0.037), whereas estimated ERR for radiation with the minor IL-10 homozygotes was close to 0 and nonsignificant. Thus, the minor IL-10 haplotype might act to reduce the radiation related risk of diffuse-type gastric cancer. The results suggest that this IL-10 haplotyping might be involved in development of radiation-associated gastric cancer of the diffuse type, and that IL-10 haplotypes may explain individual differences in the radiation-related risk of gastric cancer. © 2013 by Radiation Research Society
Temporal Fluctuation in North East Baltic Sea Region Cattle Population Revealed by Mitochondrial and Y-Chromosomal DNA Analyses

PubMed Central

Niemi, Marianna; Bläuer, Auli; Iso-Touru, Terhi; Harjula, Janne; Nyström Edmark, Veronica; Rannamäe, Eve; Lõugas, Lembi; Sajantila, Antti; Lidén, Kerstin; Taavitsainen, Jussi-Pekka

2015-01-01

Background Ancient DNA analysis offers a way to detect changes in populations over time. To date, most studies of ancient cattle have focused on their domestication in prehistory, while only a limited number of studies have analysed later periods. Conversely, the genetic structure of modern cattle populations is well known given the undertaking of several molecular and population genetic studies. Results Bones and teeth from ancient cattle populations from the North-East Baltic Sea region dated to the Prehistoric (Late Bronze and Iron Age, 5 samples), Medieval (14), and Post-Medieval (26) periods were investigated by sequencing 667 base pairs (bp) from the mitochondrial DNA (mtDNA) and 155 bp of intron 19 in the Y-chromosomal UTY gene. Comparison of maternal (mtDNA haplotypes) genetic diversity in ancient cattle (45 samples) with modern cattle populations in Europe and Asia (2094 samples) revealed 30 ancient mtDNA haplotypes, 24 of which were shared with modern breeds, while 6 were unique to the ancient samples. Of seven Y-chromosomal sequences determined from ancient samples, six were Y2 and one Y1 haplotype. Combined data including Swedish samples from the same periods (64 samples) was compared with the occurrence of Y-chromosomal haplotypes in modern cattle (1614 samples). Conclusions The diversity of haplogroups was highest in the Prehistoric samples, where many haplotypes were unique. The Medieval and Post-Medieval samples also show a high diversity with new haplotypes. Some of these haplotypes have become frequent in modern breeds in the Nordic Countries and North-Western Russia while other haplotypes have remained in only a few local breeds or seem to have been lost. A temporal shift in Y-chromosomal haplotypes from Y2 to Y1 was detected that corresponds with the appearance of new mtDNA haplotypes in the Medieval and Post-Medieval period. This suggests a replacement of the Prehistoric mtDNA and Y chromosomal haplotypes by new types of cattle. PMID:25992976
SLC22A1-ABCB1 haplotype profiles predict imatinib pharmacokinetics in Asian patients with chronic myeloid leukemia.

PubMed

Singh, Onkar; Chan, Jason Yongsheng; Lin, Keegan; Heng, Charles Chuah Thuan; Chowbay, Balram

2012-01-01

This study aimed to explore the influence of SLC22A1, PXR, ABCG2, ABCB1 and CYP3A5 3 genetic polymorphisms on imatinib mesylate (IM) pharmacokinetics in Asian patients with chronic myeloid leukemia (CML). Healthy subjects belonging to three Asian populations (Chinese, Malay, Indian; n = 70 each) and CML patients (n = 38) were enrolled in a prospective pharmacogenetics study. Imatinib trough (C(0h)) and clearance (CL) were determined in the patients at steady state. Haplowalk method was applied to infer the haplotypes and generalized linear model (GLM) to estimate haplotypic effects on IM pharmacokinetics. Association of haplotype copy numbers with IM pharmacokinetics was defined by Mann-Whitney U test. Global haplotype score statistics revealed a SLC22A1 sub-haplotypic region encompassing three polymorphisms (rs3798168, rs628031 and IVS7+850C>T), to be significantly associated with IM clearance (p = 0.013). Haplotype-specific GLM estimated that the haplotypes AGT and CGC were both associated with 22% decrease in clearance compared to CAC [CL (10(-2) L/hr/mg): CAC vs AGT: 4.03 vs 3.16, p = 0.017; CAC vs CGC: 4.03 vs 3.15, p = 0.017]. Patients harboring 2 copies of AGT or CGC haplotypes had 33.4% lower clearance and 50% higher C(0h) than patients carrying 0 or 1 copy [CL (10(-2) L/hr/mg): 2.19 vs 3.29, p = 0.026; C(0h) (10(-6) 1/ml): 4.76 vs 3.17, p = 0.013, respectively]. Further subgroup analysis revealed SLC22A1 and ABCB1 haplotypic combinations to be significantly associated with clearance and C(0h) (p = 0.002 and 0.009, respectively). This exploratory study suggests that SLC22A1-ABCB1 haplotypes may influence IM pharmacokinetics in Asian CML patients.
Molecular identification and first report of mitochondrial COI gene haplotypes in the hawksbill turtle Eretmochelys imbricata (Testudines: Cheloniidae) in the Colombian Caribbean nesting colonies.

PubMed

Daza-Criado, L; Hernández-Fernández, J

2014-02-21

Hawksbill sea turtles Eretmochelys imbricata are found extensively around the world, including the Atlantic, Pacific, and Indian Oceans; the Persian Gulf, and the Red and Mediterranean Seas. Populations of this species are affected by international trafficking of their shields, meat, and eggs, making it a critically endangered animal. We determined the haplotypes of 17 hawksbill foraging turtles of Islas del Rosario (Bolivar) and of the nesting beach Don Diego (Magdalena) in the Colombian Caribbean based on amplification and sequencing of the mitochondrial gene cytochrome oxidase c subunit I (COI). We identified 5 haplotypes, including EI-A1 previously reported in Puerto Rico, which was similar to 10 of the study samples. To our knowledge, the remaining 4 haplotypes have not been described. Samples EICOI11 and EICOI3 showed 0.2% divergence from EI-A1, by a single nucleotide change, and were classified as the EI-A2 haplotype. EICOI6, EICOI14, and EICOI12 samples showed 0.2% divergence from EI-A1 and 0.3% divergence from EI-A2 and were classified as EI-A3 haplotype. Samples EICOI16 and EICOI15 presented 5 nucleotide changes each and were classified as 2 different haplotypes, EI-A4 and EI-A5, respectively. The last 2 haplotypes had higher nucleotide diversity (K2P=1.7%) than that by the first 3 haplotypes. EI-A1 and EI-A2 occurred in nesting individuals, and EI-A2, EI-A3, EI-A4, and EI-A5 occurred in foraging individuals. The description of the haplotypes may be associated with reproductive migrations or foraging and could support the hypothesis of natal homing. Furthermore, they can be used in phylogeographic studies.
Determination of haplotypes at structurally complex regions using emulsion haplotype fusion PCR

PubMed Central

2012-01-01

Background Genotyping and massively-parallel sequencing projects result in a vast amount of diploid data that is only rarely resolved into its constituent haplotypes. It is nevertheless this phased information that is transmitted from one generation to the next and is most directly associated with biological function and the genetic causes of biological effects. Despite progress made in genome-wide sequencing and phasing algorithms and methods, problems assembling (and reconstructing linear haplotypes in) regions of repetitive DNA and structural variation remain. These dynamic and structurally complex regions are often poorly understood from a sequence point of view. Regions such as these that are highly similar in their sequence tend to be collapsed onto the genome assembly. This is turn means downstream determination of the true sequence haplotype in these regions poses a particular challenge. For structurally complex regions, a more focussed approach to assembling haplotypes may be required. Results In order to investigate reconstruction of spatial information at structurally complex regions, we have used an emulsion haplotype fusion PCR approach to reproducibly link sequences of up to 1kb in length to allow phasing of multiple variants from neighbouring loci, using allele-specific PCR and sequencing to detect the phase. By using emulsion systems linking flanking regions to amplicons within the CNV, this led to the reconstruction of a 59kb haplotype across the DEFA1A3 CNV in HapMap individuals. Conclusion This study has demonstrated a novel use for emulsion haplotype fusion PCR in addressing the issue of reconstructing structural haplotypes at multiallelic copy variable regions, using the DEFA1A3 locus as an example. PMID:23231411
A DRD1 haplotype is associated with risk for autism spectrum disorders in male-only affected sib-pair families.

PubMed

Hettinger, Joe A; Liu, Xudong; Schwartz, Charles E; Michaelis, Ron C; Holden, Jeanette J A

2008-07-05

Individuals with autism spectrum disorders (ASDs) have impairments in executive function and social cognition, with males generally being more severely affected in these areas than females. Because the dopamine D1 receptor (encoded by DRD1) is integral to the neural circuitry mediating these processes, we examined the DRD1 gene for its role in susceptibility to ASDs by performing single marker and haplotype case-control comparisons, family-based association tests, and genotype-phenotype assessments (quantitative transmission disequilibrium tests: QTDT) using three DRD1 polymorphisms, rs265981C/T, rs4532A/G, and rs686T/C. Our previous findings suggested that the dopaminergic system may be more integrally involved in families with affected males only than in other families. We therefore restricted our study to families with two or more affected males (N = 112). There was over-transmission of rs265981-C and rs4532-A in these families (P = 0.040, P = 0.038), with haplotype TDT analysis showing over-transmission of the C-A-T haplotype (P = 0.022) from mothers to affected sons (P = 0.013). In addition, haplotype case-control comparisons revealed an increase of this putative risk haplotype in affected individuals relative to a comparison group (P = 0.004). QTDT analyses showed associations of the rs265981-C, rs4532-A, rs686-T alleles, and the C-A-T haplotype with more severe problems in social interaction, greater difficulties with nonverbal communication and increased stereotypies compared to individuals with other haplotypes. Preferential haplotype transmission of markers at the DRD1 locus and an increased frequency of a specific haplotype support the DRD1 gene as a risk gene for core symptoms of ASD in families having only affected males. Copyright 2008 Wiley-Liss, Inc.
Performance of Single Nucleotide Polymorphisms versus Haplotypes for Genome-Wide Association Analysis in Barley

PubMed Central

Jannink, Jean-Luc

2010-01-01

Genome-wide association studies (GWAS) may benefit from utilizing haplotype information for making marker-phenotype associations. Several rationales for grouping single nucleotide polymorphisms (SNPs) into haplotype blocks exist, but any advantage may depend on such factors as genetic architecture of traits, patterns of linkage disequilibrium in the study population, and marker density. The objective of this study was to explore the utility of haplotypes for GWAS in barley (Hordeum vulgare) to offer a first detailed look at this approach for identifying agronomically important genes in crops. To accomplish this, we used genotype and phenotype data from the Barley Coordinated Agricultural Project and constructed haplotypes using three different methods. Marker-trait associations were tested by the efficient mixed-model association algorithm (EMMA). When QTL were simulated using single SNPs dropped from the marker dataset, a simple sliding window performed as well or better than single SNPs or the more sophisticated methods of blocking SNPs into haplotypes. Moreover, the haplotype analyses performed better 1) when QTL were simulated as polymorphisms that arose subsequent to marker variants, and 2) in analysis of empirical heading date data. These results demonstrate that the information content of haplotypes is dependent on the particular mutational and recombinational history of the QTL and nearby markers. Analysis of the empirical data also confirmed our intuition that the distribution of QTL alleles in nature is often unlike the distribution of marker variants, and hence utilizing haplotype information could capture associations that would elude single SNPs. We recommend routine use of both single SNP and haplotype markers for GWAS to take advantage of the full information content of the genotype data. PMID:21124933
β-globin haplotypes in normal and hemoglobinopathic individuals from Reconcavo Baiano, State of Bahia, Brazil.

PubMed

Dos Santos Silva, Wellington; de Nazaré Klautau-Guimarães, Maria; Grisolia, Cesar Koppe

2010-07-01

Five restriction site polymorphisms in the β-globin gene cluster (HincII-5' ε, HindIII-(G) γ, HindIII-(A) γ, HincII- ψβ1 and HincII-3' ψβ1) were analyzed in three populations (n = 114) from Reconcavo Baiano, State of Bahia, Brazil. The groups included two urban populations from the towns of Cachoeira and Maragojipe and one rural Afro-descendant population, known as the "quilombo community", from Cachoeira municipality. The number of haplotypes found in the populations ranged from 10 to 13, which indicated higher diversity than in the parental populations. The haplotypes 2 (+ - - - -), 3 (- - - - +), 4 (- + - - +) and 6 (- + + - +) on the β(A) chromosomes were the most common, and two haplotypes, 9 (- + + + +) and 14 (+ + - - +), were found exclusively in the Maragojipe population. The other haplotypes (1, 5, 9, 11, 12, 13, 14 and 16) had lower frequencies. Restriction site analysis and the derived haplotypes indicated homogeneity among the populations. Thirty-two individuals with hemoglobinopathies (17 sickle cell disease, 12 HbSC disease and 3 HbCC disease) were also analyzed. The haplotype frequencies of these patients differed significantly from those of the general population. In the sickle cell disease subgroup, the predominant haplotypes were BEN (Benin) and CAR (Central African Republic), with frequencies of 52.9% and 32.4%, respectively. The high frequency of the BEN haplotype agreed with the historical origin of the afro-descendant population in the state of Bahia. However, this frequency differed from that of Salvador, the state capital, where the CAR and BEN haplotypes have similar frequencies, probably as a consequence of domestic slave trade and subsequent internal migrations to other regions of Brazil.
β-globin haplotypes in normal and hemoglobinopathic individuals from Reconcavo Baiano, State of Bahia, Brazil

PubMed Central

2010-01-01

Five restriction site polymorphisms in the β-globin gene cluster (HincII-5‘ ε, HindIII-G γ, HindIII-A γ, HincII- ψβ1 and HincII-3‘ ψβ1) were analyzed in three populations (n = 114) from Reconcavo Baiano, State of Bahia, Brazil. The groups included two urban populations from the towns of Cachoeira and Maragojipe and one rural Afro-descendant population, known as the “quilombo community”, from Cachoeira municipality. The number of haplotypes found in the populations ranged from 10 to 13, which indicated higher diversity than in the parental populations. The haplotypes 2 (+ - - - -), 3 (- - - - +), 4 (- + - - +) and 6 (- + + - +) on the βA chromosomes were the most common, and two haplotypes, 9 (- + + + +) and 14 (+ + - - +), were found exclusively in the Maragojipe population. The other haplotypes (1, 5, 9, 11, 12, 13, 14 and 16) had lower frequencies. Restriction site analysis and the derived haplotypes indicated homogeneity among the populations. Thirty-two individuals with hemoglobinopathies (17 sickle cell disease, 12 HbSC disease and 3 HbCC disease) were also analyzed. The haplotype frequencies of these patients differed significantly from those of the general population. In the sickle cell disease subgroup, the predominant haplotypes were BEN (Benin) and CAR (Central African Republic), with frequencies of 52.9% and 32.4%, respectively. The high frequency of the BEN haplotype agreed with the historical origin of the afro-descendant population in the state of Bahia. However, this frequency differed from that of Salvador, the state capital, where the CAR and BEN haplotypes have similar frequencies, probably as a consequence of domestic slave trade and subsequent internal migrations to other regions of Brazil. PMID:21637405
The putative oncogene Pim-1 in the mouse: its linkage and variation among t haplotypes.

PubMed

Nadeau, J H; Phillips, S J

1987-11-01

Pim-1, a putative oncogene involved in T-cell lymphomagenesis, was mapped between the pseudo-alpha globin gene Hba-4ps and the alpha-crystallin gene Crya-1 on mouse chromosome 17 and therefore within the t complex. Pim-1 restriction fragment variants were identified among t haplotypes. Analysis of restriction fragment sizes obtained with 12 endonucleases demonstrated that the Pim-1 genes in some t haplotypes were indistinguishable from the sizes for the Pim-1b allele in BALB/c inbred mice. There are now three genes, Pim-1, Crya-1 and H-2 I-E, that vary among independently derived t haplotypes and that have indistinguishable alleles in t haplotypes and inbred strains. These genes are closely linked within the distal inversion of the t complex. Because it is unlikely that these variants arose independently in t haplotypes and their wild-type homologues, we propose that an exchange of chromosomal segments, probably through double crossingover, was responsible for indistinguishable Pim-1 genes shared by certain t haplotypes and their wild-type homologues. There was, however, no apparent association between variant alleles of these three genes among t haplotypes as would be expected if a single exchange introduced these alleles into t haplotypes. If these variant alleles can be shown to be identical to the wild-type allele, then lack of association suggests that multiple exchanges have occurred during the evolution of the t complex.
High variation and strong phylogeographic pattern among cpDNA haplotypes in Taxus wallichiana (Taxaceae) in China and North Vietnam.

PubMed

Gao, L M; Möller, M; Zhang, X-M; Hollingsworth, M L; Liu, J; Mill, R R; Gibby, M; Li, D-Z

2007-11-01

We studied the phylogeography of Chinese yew (Taxus wallichiana), a tree species distributed over most of southern China and adjacent regions. A total of 1235 individuals from 50 populations from China and North Vietnam were analysed for chloroplast DNA variation using polymerase chain reaction-restriction fragment length polymorphism of the trnL-F intron-spacer region. A total of 19 different haplotypes were distinguished. We found a very high level of population differentiation and a strong phylogeographic pattern, suggesting low levels of recurrent gene flow among populations. Haplotype differentiation was most marked along the boundary between the Sino-Himalayan and Sino-Japanese Forest floristic subkingdoms, with only one haplotype being shared among these two subkingdoms. The Malesian and Sino-Himalayan Forest subkingdoms had five and 10 haplotypes, respectively, while the relatively large Sino-Japanese Forest subkingdom had only eight. The strong geography-haplotype correlation persisted at the regional floristic level, with most regions possessing a unique set of haplotypes, except for the central China region. Strong landscape effects were observed in the Hengduan and Dabashan mountains, where steep mountains and valleys might have been natural dispersal barriers. The molecular phylogenetic data, together with the geographic distribution of the haplotypes, suggest the existence of several localized refugia during the last glaciation from which the present-day distribution may be derived. The pattern of haplotype distribution across China and North Vietnam corresponded well with the current taxonomic delineation of the three intraspecific varieties of T. wallichiana.
Genetic diversity and geographical structure of the pitcher plant Nepenthes vieillardii in New Caledonia: A chloroplast DNA haplotype analysis.

PubMed

Kurata, Kaoruko; Jaffré, Tanguy; Setoguchi, Hiroaki

2008-12-01

Among the many species that grow in New Caledonia, the pitcher plant Nepenthes vieillardii (Nepenthaceae) has a high degree of morphological variation. In this study, we present the patterns of genetic differentiation of pitcher plant populations based on chloroplast DNA haplotype analysis using the sequences of five spacers. We analyzed 294 samples from 16 populations covering the entire range of the species, using 4660 bp of sequence. Our analysis identified 17 haplotypes, including one that is widely distributed across the islands, as well as regional and private haplotypes. The greatest haplotype diversity was detected on the eastern coast of the largest island and included several private haplotypes, while haplotype diversity was low in the southern plains region. The parsimony network analysis of the 17 haplotypes suggested that the genetic divergence is the result of long-term isolation of individual populations. Results from a spatial analysis of molecular variance and a cluster analysis suggest that the plants once covered the entire serpentine area of New Caledonia and that subsequent regional fragmentation resulted in the isolation of each population and significantly restricted seed flow. This isolation may have been an important factor in the development of the morphological and genetic variation among pitcher plants in New Caledonia.
β-globin gene cluster haplotypes in ethnic minority populations of southwest China

PubMed Central

Sun, Hao; Liu, Hongxian; Huang, Kai; Lin, Keqin; Huang, Xiaoqin; Chu, Jiayou; Ma, Shaohui; Yang, Zhaoqing

2017-01-01

The genetic diversity and relationships among ethnic minority populations of southwest China were investigated using seven polymorphic restriction enzyme sites in the β-globin gene cluster. The haplotypes of 1392 chromosomes from ten ethnic populations living in southwest China were determined. Linkage equilibrium and recombination hotspot were found between the 5′ sites and 3′ sites of the β-globin gene cluster. 5′ haplotypes 2 (+−−−), 6 (−++−+), 9 (−++++) and 3′ haplotype FW3 (−+) were the predominant haplotypes. Notably, haplotype 9 frequency was significantly high in the southwest populations, indicating their difference with other Chinese. The interpopulation differentiation of southwest Chinese minority populations is less than those in populations of northern China and other continents. Phylogenetic analysis shows that populations sharing same ethnic origin or language clustered to each other, indicating current β-globin cluster diversity in the Chinese populations reflects their ethnic origin and linguistic affiliations to a great extent. This study characterizes β-globin gene cluster haplotypes in southwest Chinese minorities for the first time, and reveals the genetic variability and affinity of these populations using β-globin cluster haplotype frequencies. The results suggest that ethnic origin plays an important role in shaping variations of the β-globin gene cluster in the southwestern ethnic populations of China. PMID:28205625
Ancient mitochondrial haplotypes and evidence for intragenic recombination in a gynodioecious plant.

PubMed

Städler, Thomas; Delph, Lynda F

2002-09-03

Because of their extremely low nucleotide mutation rates, plant mitochondrial genes are generally not expected to show variation within species. Remarkably, we found nine distinct cytochrome b sequence haplotypes in the gynodioecious alpine plant Silene acaulis, with two or more haplotypes coexisting locally in each of three sampled regions. Moreover, there is evidence for intragenic recombination in the history of the haplotype sample, implying at least transient heteroplasmy of mitochondrial DNA (mtDNA). Heteroplasmy might be achieved by one of two potential mechanisms, either continuous coexistence of subgenomic fragments in low stoichiometry, or occasional paternal leakage of mtDNA. On the basis of levels of synonymous nucleotide substitutions, the average divergence time between haplotypes is estimated to be at least 15 million years. Ancient coalescence of extant haplotypes is further indicated by the paucity of fixed differences in haplotypes obtained from related species, a pattern expected under trans-specific evolution. Our data are consistent with models of frequency-dependent selection on linked cytoplasmic male-sterility factors, the putative molecular basis of females in gynodioecious populations. However, associations between marker loci and the inferred male-sterility genes can be maintained only with very low rates of recombination. Heteroplasmy and recombination between divergent haplotypes imply unexplored consequences for the evolutionary dynamics of gynodioecy, a widespread plant breeding system.
Identification and genetic effect of haplotype in the bovine BMP7 gene.

PubMed

Huang, Yong-Zhen; Wang, Xin-Lei; He, Hua; Lan, Xian-Yong; Lei, Chu-Zhao; Zhang, Chun-Lei; Chen, Hong

2013-12-15

Bone morphogenetic proteins (BMPs) are peptide growth factors belonging to the transforming growth factor-beta (TGF-β) superfamily, and some members of the BMP family support white adipocyte differentiation. In this study, we focused on the BMP7 which singularly promotes the differentiation of brown preadipocytes. Haplotypes involving 5 single nucleotide polymorphism (SNP) sites in the bovine BMP7 gene were identified and their effect on body weight was analyzed. 16 haplotypes and 18 combined haplotypes were revealed and the linkage disequilibrium was assessed in the cattle population with 602 individuals representing three main cattle breeds from China. The results showed that haplotypes 3, 10 and 14 were predominant and accounted for 75.64%, 69.85%, and 83.36% in Nanyang, Qinchuan and Jiaxian cattle breeds, respectively. The statistical analyses indicated that the SNP 1, 4, and 5 are associated with the body weight, body length, and heart girth at 12 and 24 months in Nanyang cattle population (P<0.05), whereas there is no significant association between their 16 haplotypes and 18 combined haplotypes. Our results provide evidence that some SNPs and haplotypes in BMP7 are associated with growth traits, and may be utilized as a genetic marker in marker-assisted selection for beef cattle breeding programs. Copyright © 2013. Published by Elsevier B.V.
The JAK2 GGCC (46/1) Haplotype in Myeloproliferative Neoplasms: Causal or Random?

PubMed

Anelli, Luisa; Zagaria, Antonella; Specchia, Giorgina; Albano, Francesco

2018-04-11

The germline JAK2 haplotype known as "GGCC or 46/1 haplotype" (haplotype GGCC_46/1 ) consists of a combination of single nucleotide polymorphisms (SNPs) mapping in a region of about 250 kb, extending from the JAK2 intron 10 to the Insulin-like 4 ( INLS4 ) gene. Four main SNPs (rs3780367, rs10974944, rs12343867, and rs1159782) generating a "GGCC" combination are more frequently indicated to represent the JAK2 haplotype. These SNPs are inherited together and are frequently associated with the onset of myeloproliferative neoplasms (MPN) positive for both JAK2 V617 and exon 12 mutations. The association between the JAK2 haplotype GGCC_46/1 and mutations in other genes, such as thrombopoietin receptor ( MPL ) and calreticulin ( CALR ), or the association with triple negative MPN, is still controversial. This review provides an overview of the frequency and the role of the JAK2 haplotype GGCC_46/1 in the pathogenesis of different myeloid neoplasms and describes the hypothetical mechanisms at the basis of the association with JAK2 gene mutations. Moreover, possible clinical implications are discussed, as different papers reported contrasting data about the correlation between the JAK2 haplotype GGCC_46/1 and blood cell count, survival, or disease progression.
Massively parallel haplotyping on microscopic beads for the high-throughput phase analysis of single molecules.

PubMed

Boulanger, Jérôme; Muresan, Leila; Tiemann-Boege, Irene

2012-01-01

In spite of the many advances in haplotyping methods, it is still very difficult to characterize rare haplotypes in tissues and different environmental samples or to accurately assess the haplotype diversity in large mixtures. This would require a haplotyping method capable of analyzing the phase of single molecules with an unprecedented throughput. Here we describe such a haplotyping method capable of analyzing in parallel hundreds of thousands single molecules in one experiment. In this method, multiple PCR reactions amplify different polymorphic regions of a single DNA molecule on a magnetic bead compartmentalized in an emulsion drop. The allelic states of the amplified polymorphisms are identified with fluorescently labeled probes that are then decoded from images taken of the arrayed beads by a microscope. This method can evaluate the phase of up to 3 polymorphisms separated by up to 5 kilobases in hundreds of thousands single molecules. We tested the sensitivity of the method by measuring the number of mutant haplotypes synthesized by four different commercially available enzymes: Phusion, Platinum Taq, Titanium Taq, and Phire. The digital nature of the method makes it highly sensitive to detecting haplotype ratios of less than 1:10,000. We also accurately quantified chimera formation during the exponential phase of PCR by different DNA polymerases.
Characterization of swine leukocyte antigen alleles and haplotypes on a novel miniature pig line, Microminipig.

PubMed

Ando, A; Imaeda, N; Ohshima, S; Miyamoto, A; Kaneko, N; Takasu, M; Shiina, T; Kulski, J K; Inoko, H; Kitagawa, H

2014-12-01

Microminipigs are extremely small-sized, novel miniature pigs that were recently developed for medical research. The inbred Microminipigs with defined swine leukocyte antigen (SLA) haplotypes are expected to be useful for allo- and xenotransplantation studies and also for association analyses between SLA haplotypes and immunological traits. To establish SLA-defined Microminipig lines, we characterized the polymorphic SLA alleles for three class I (SLA-1, SLA-2 and SLA-3) and two class II (SLA-DRB1 and SLA-DQB1) genes of 14 parental Microminipigs using a high-resolution nucleotide sequence-based typing method. Eleven class I and II haplotypes, including three recombinant haplotypes, were found in the offspring of the parental Microminipigs. Two class I and class II haplotypes, Hp-31.0 (SLA-1*1502-SLA-3*070102-SLA-2*1601) and Hp-0.37 (SLA-DRB1*0701-SLA-DQB1*0502), are novel and have not so far been reported in other pig breeds. Crossover regions were defined by the analysis of 22 microsatellite markers within the SLA class III region of three recombinant haplotypes. The SLA allele and haplotype information of Microminipigs in this study will be useful to establish SLA homozygous lines including three recombinants for transplantation and immunological studies. © 2014 Stichting International Foundation for Animal Genetics.
Genomic evolution in domestic cattle: ancestral haplotypes and healthy beef.

PubMed

Williamson, Joseph F; Steele, Edward J; Lester, Susan; Kalai, Oscar; Millman, John A; Wolrige, Lindsay; Bayard, Dominic; McLure, Craig; Dawkins, Roger L

2011-05-01

We have identified numerous Ancestral Haplotypes encoding a 14-Mb region of Bota C19. Three are frequent in Simmental, Angus and Wagyu and have been conserved since common progenitor populations. Others are more relevant to the differences between these 3 breeds including fat content and distribution in muscle. SREBF1 and Growth Hormone, which have been implicated in the production of healthy beef, are included within these haplotypes. However, we conclude that alleles at these 2 loci are less important than other sequences within the haplotypes. Identification of breeds and hybrids is improved by using haplotypes rather than individual alleles. Copyright © 2010 Elsevier Inc. All rights reserved.

Y-STR haplotypes of Native American populations from the Brazilian Amazon region.

PubMed

Palha, Teresinha Jesus Brabo Ferreira; Rodrigues, Elzemar Martins Ribeiro; dos Santos, Sidney Emanuel Batista

2010-10-01

The allele and haplotype frequencies of nine Y-STRs (DYS19, DYS389 I, DYS389 II, DYS390, DYS391, DYS392, DYS393, DYS385 I/II) were determined in a sample of six native tribes from the Brazilian Amazon (Tiriyó, Awa-Guajá, Waiãpi, Urubu-Kaapor, Zoé and Parakanã). Forty-eight different haplotypes were identified, 28 of which unique. Five haplotypes are very frequent and were shared by over 10 individuals. The estimated haplotype diversity (0.9114) was very low compared to other geographic groups, including Africans, Europeans and Asians. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.
Planning for CD-ROM in the Reference Department.

ERIC Educational Resources Information Center

Graves, Gail T.; And Others

1987-01-01

Outlines the evaluation criteria used by the reference department at the Williams Library at the University of Mississippi in selecting databases and hardware used in CD-ROM workstations. The factors discussed include database coverage, costs, and security. (CLB)
TRENDS: A flight test relational database user's guide and reference manual

NASA Technical Reports Server (NTRS)

Bondi, M. J.; Bjorkman, W. S.; Cross, J. L.

1994-01-01

This report is designed to be a user's guide and reference manual for users intending to access rotocraft test data via TRENDS, the relational database system which was developed as a tool for the aeronautical engineer with no programming background. This report has been written to assist novice and experienced TRENDS users. TRENDS is a complete system for retrieving, searching, and analyzing both numerical and narrative data, and for displaying time history and statistical data in graphical and numerical formats. This manual provides a 'guided tour' and a 'user's guide' for the new and intermediate-skilled users. Examples for the use of each menu item within TRENDS is provided in the Menu Reference section of the manual, including full coverage for TIMEHIST, one of the key tools. This manual is written around the XV-15 Tilt Rotor database, but does include an appendix on the UH-60 Blackhawk database. This user's guide and reference manual establishes a referrable source for the research community and augments NASA TM-101025, TRENDS: The Aeronautical Post-Test, Database Management System, Jan. 1990, written by the same authors.
Cloud computing-based TagSNP selection algorithm for human genome data.

PubMed

Hung, Che-Lun; Chen, Wen-Pei; Hua, Guan-Jie; Zheng, Huiru; Tsai, Suh-Jen Jane; Lin, Yaw-Ling

2015-01-05

Single nucleotide polymorphisms (SNPs) play a fundamental role in human genetic variation and are used in medical diagnostics, phylogeny construction, and drug design. They provide the highest-resolution genetic fingerprint for identifying disease associations and human features. Haplotypes are regions of linked genetic variants that are closely spaced on the genome and tend to be inherited together. Genetics research has revealed SNPs within certain haplotype blocks that introduce few distinct common haplotypes into most of the population. Haplotype block structures are used in association-based methods to map disease genes. In this paper, we propose an efficient algorithm for identifying haplotype blocks in the genome. In chromosomal haplotype data retrieved from the HapMap project website, the proposed algorithm identified longer haplotype blocks than an existing algorithm. To enhance its performance, we extended the proposed algorithm into a parallel algorithm that copies data in parallel via the Hadoop MapReduce framework. The proposed MapReduce-paralleled combinatorial algorithm performed well on real-world data obtained from the HapMap dataset; the improvement in computational efficiency was proportional to the number of processors used.
iXora: exact haplotype inferencing and trait association.

PubMed

Utro, Filippo; Haiminen, Niina; Livingstone, Donald; Cornejo, Omar E; Royaert, Stefan; Schnell, Raymond J; Motamayor, Juan Carlos; Kuhn, David N; Parida, Laxmi

2013-06-06

We address the task of extracting accurate haplotypes from genotype data of individuals of large F1 populations for mapping studies. While methods for inferring parental haplotype assignments on large F1 populations exist in theory, these approaches do not work in practice at high levels of accuracy. We have designed iXora (Identifying crossovers and recombining alleles), a robust method for extracting reliable haplotypes of a mapping population, as well as parental haplotypes, that runs in linear time. Each allele in the progeny is assigned not just to a parent, but more precisely to a haplotype inherited from the parent. iXora shows an improvement of at least 15% in accuracy over similar systems in literature. Furthermore, iXora provides an easy-to-use, comprehensive environment for association studies and hypothesis checking in populations of related individuals. iXora provides detailed resolution in parental inheritance, along with the capability of handling very large populations, which allows for accurate haplotype extraction and trait association. iXora is available for non-commercial use from http://researcher.ibm.com/project/3430.
Cloud Computing-Based TagSNP Selection Algorithm for Human Genome Data

PubMed Central

Hung, Che-Lun; Chen, Wen-Pei; Hua, Guan-Jie; Zheng, Huiru; Tsai, Suh-Jen Jane; Lin, Yaw-Ling

2015-01-01

Single nucleotide polymorphisms (SNPs) play a fundamental role in human genetic variation and are used in medical diagnostics, phylogeny construction, and drug design. They provide the highest-resolution genetic fingerprint for identifying disease associations and human features. Haplotypes are regions of linked genetic variants that are closely spaced on the genome and tend to be inherited together. Genetics research has revealed SNPs within certain haplotype blocks that introduce few distinct common haplotypes into most of the population. Haplotype block structures are used in association-based methods to map disease genes. In this paper, we propose an efficient algorithm for identifying haplotype blocks in the genome. In chromosomal haplotype data retrieved from the HapMap project website, the proposed algorithm identified longer haplotype blocks than an existing algorithm. To enhance its performance, we extended the proposed algorithm into a parallel algorithm that copies data in parallel via the Hadoop MapReduce framework. The proposed MapReduce-paralleled combinatorial algorithm performed well on real-world data obtained from the HapMap dataset; the improvement in computational efficiency was proportional to the number of processors used. PMID:25569088
VNTR alleles associated with the {alpha}-globin locus are haplotype and population related

DOE Office of Scientific and Technical Information (OSTI.GOV)

Martinson, J.J.; Clegg, J.B.; Boyce, A.J.

1994-09-01

The human {alpha}-globin complex contains several polymorphic restriction-enzyme sites (i.e., RFLPs) linked to form haplotypes and is flanked by two hypervariable VNTR loci, the 5{prime} hypervariable region (HVR) and the more highly polymorphic 3{prime}HVR. Using a combination of RFLP analysis and PCR, the authors have characterized the 5{prime}HVR and 3{prime}HVR alleles associated with the {alpha}-globin haplotypes of 133 chromosomes, and they here show that specific {alpha}-globin haplotypes are each associated with discrete subsets of the alleles observed at these two VNTR loci. This statistically highly significant association is observed over a region spanning {approximately} 100 kb. With the exception ofmore » closely related haplotypes, different haplotypes do not share identically sized 3{prime}HVR alleles. Earlier studies have shown that {alpha}-globin haplotype distributions differ between populations; the current findings also reveal extensive population substructure in the repertoire of {alpha}-globin VNTRs. If similar features are characteristic of other VNTR loci, this will have important implications for forensic and anthropological studies. 42 refs., 5 figs., 5 tabs.« less
Structured Forms Reference Set of Binary Images II (SFRS2)

National Institute of Standards and Technology Data Gateway

NIST Structured Forms Reference Set of Binary Images II (SFRS2) (Web, free access) The second NIST database of structured forms (Special Database 6) consists of 5,595 pages of binary, black-and-white images of synthesized documents containing hand-print. The documents in this database are 12 different tax forms with the IRS 1040 Package X for the year 1988.
Genetics of Progressive Supranuclear Palsy.

PubMed

Im, Sun Young; Kim, Young Eun; Kim, Yun Joong

2015-09-01

Progressive supranuclear palsy (PSP) is a neurodegenerative syndrome that is clinically characterized by progressive postural instability, supranuclear gaze palsy, parkinsonism and cognitive decline. Pathologically, diagnosis of PSP is based on characteristic features, such as neurofibrillary tangles, neutrophil threads, tau-positive astrocytes and their processes in basal ganglia and brainstem, and the accumulation of 4 repeat tau protein. PSP is generally recognized as a sporadic disorder; however, understanding of genetic background of PSP has been expanding rapidly. Here we review relevant publications to outline the genetics of PSP. Although only small number of familial PSP cases have been reported, the recognition of familial PSP has been increasing. In some familial cases of clinically probable PSP, PSP pathologies were confirmed based on NINDS neuropathological diagnostic criteria. Several mutations in MAPT, the gene that causes a form of familial frontotemporal lobar degeneration with tauopathy, have been identified in both sporadic and familial PSP cases. The H1 haplotype of MAPT is a risk haplotype for PSP, and within H1, a sub-haplotype (H1c) is associated with PSP. A recent genome-wide association study on autopsyproven PSP revealed additional PSP risk alleles in STX6 and EIF2AK3. Several heredodegenerative parkinsonian disorders are referred to as PSP-look-alikes because their clinical phenotype, but not their pathology, mimics PSP. Due to the fast development of genomics and bioinformatics, more genetic factors related to PSP are expected to be discovered. Undoubtedly, these studies will provide a better understanding of the pathogenesis of PSP and clues for developing therapeutic strategies.
A Monomorphic Haplotype of Chromosome Ia Is Associated with Widespread Success in Clonal and Nonclonal Populations of Toxoplasma gondii

PubMed Central

Khan, Asis; Miller, Natalie; Roos, David S.; Dubey, J. P.; Ajzenberg, Daniel; Dardé, Marie Laure; Ajioka, James W.; Rosenthal, Benjamin; Sibley, L. David

2011-01-01

ABSTRACT Toxoplasma gondii is a common parasite of animals that also causes a zoonotic infection in humans. Previous studies have revealed a strongly clonal population structure that is shared between North America and Europe, while South American strains show greater genetic diversity and evidence of sexual recombination. The common inheritance of a monomorphic version of chromosome Ia (referred to as ChrIa*) among three clonal lineages from North America and Europe suggests that inheritance of this chromosome might underlie their recent clonal expansion. To further examine the diversity and distribution of ChrIa, we have analyzed additional strains with greater geographic diversity. Our findings reveal that the same haplotype of ChrIa* is found in the clonal lineages from North America and Europe and in older lineages in South America, where sexual recombination is more common. Although lineages from all three continents harbor the same conserved ChrIa* haplotype, strains from North America and Europe are genetically separate from those in South America, and these respective geographic regions show limited evidence of recent mixing. Genome-wide, array-based profiling of polymorphisms provided evidence for an ancestral flow from particular older southern lineages that gave rise to the clonal lineages now dominant in the north. Collectively, these data indicate that ChrIa* is widespread among nonclonal strains in South America and has more recently been associated with clonal expansion of specific lineages in North America and Europe. These findings have significant implications for the spread of genetic loci influencing transmission and virulence in pathogen populations. PMID:22068979
Intragenic SNP haplotypes associated with 84dup18 mutation in TNFRSF11A in four FEO pedigrees suggest three independent origins for this mutation.

PubMed

Elahi, Elahe; Shafaghati, Yousef; Asadi, Sareh; Absalan, Farnaz; Goodarzi, Hani; Gharaii, Nava; Karimi-Nejad, Mohammad Hassan; Shahram, Farhad; Hughes, Anne E

2007-01-01

Familial expansile osteolysis (FEO) is a rare disorder causing bone dysplasia. The clinical features of FEO include early-onset hearing loss, tooth destruction, and progressive lytic expansion within limb bones causing pain, fracture, and deformity. An 18-bp duplication in the first exon of the TNFRSF11A gene encoding RANK has been previously identified in four FEO pedigrees. Despite having the identical mutation, phenotypic variations among affected individuals of the same and different pedigrees were noted. Another 18-bp duplication, one base proximal to the duplication previously reported, was subsequently found in two unrelated FEO patients. Finally, mutations overlapping with the mutations found in the FEO pedigrees have been found in ESH and early-onset PDB pedigrees. An Iranian FEO pedigree that contains six affected individuals dispersed in three generations has previously been introduced; here, the clinical features of the proband are reported in greater detail, and the genetic defect of the pedigree is presented. Direct sequencing of the entire coding region and upstream and downstream noncoding regions of TNFRSF11A in her DNA revealed the same 18-bp duplication mutation as previously found in the four FEO pedigrees. Additionally, eight sequence variations as compared to the TNFRSF11A reference sequence were identified, and a haplotype linked to the mutation based on these variations was defined. Although the mutation in the Iranian and four of the previously described FEO pedigrees was the same, haplotypes based on the intragenic SNPs suggest that the mutations do not share a common descent.
ReprDB and panDB: minimalist databases with maximal microbial representation.

PubMed

Zhou, Wei; Gay, Nicole; Oh, Julia

2018-01-18

Profiling of shotgun metagenomic samples is hindered by a lack of unified microbial reference genome databases that (i) assemble genomic information from all open access microbial genomes, (ii) have relatively small sizes, and (iii) are compatible to various metagenomic read mapping tools. Moreover, computational tools to rapidly compile and update such databases to accommodate the rapid increase in new reference genomes do not exist. As a result, database-guided analyses often fail to profile a substantial fraction of metagenomic shotgun sequencing reads from complex microbiomes. We report pipelines that efficiently traverse all open access microbial genomes and assemble non-redundant genomic information. The pipelines result in two species-resolution microbial reference databases of relatively small sizes: reprDB, which assembles microbial representative or reference genomes, and panDB, for which we developed a novel iterative alignment algorithm to identify and assemble non-redundant genomic regions in multiple sequenced strains. With the databases, we managed to assign taxonomic labels and genome positions to the majority of metagenomic reads from human skin and gut microbiomes, demonstrating a significant improvement over a previous database-guided analysis on the same datasets. reprDB and panDB leverage the rapid increases in the number of open access microbial genomes to more fully profile metagenomic samples. Additionally, the databases exclude redundant sequence information to avoid inflated storage or memory space and indexing or analyzing time. Finally, the novel iterative alignment algorithm significantly increases efficiency in pan-genome identification and can be useful in comparative genomic analyses.
A novel haplotype of spinocerebellar ataxia type 6 contributes to the highest prevalence in western Japan.

PubMed

Terasawa, Hideo; Oda, Masaya; Morino, Hiroyuki; Miyachi, Takafumi; Izumi, Yuishin; Maruyama, Hirofumi; Matsumoto, Masayasu; Kawakami, Hideshi

2004-03-25

The highest prevalence rate of spinocerebellar ataxia type 6 (SCA6) in the worldwide population is in the Chugoku and Kansai areas of Western Japan, but the reason of this geographic characteristics is unclear. We investigated the predisposing haplotypes and their geographic distribution. Genotyping of five microsatellite markers and three single nucleotide polymorphisms linked to the CACNA1A gene in 150 Japanese SCA6 patients from unrelated 118 families revealed three major haplotypes, carrying a pool of one common haplotype core. A founder chromosome was thought to have historically diverged into at least three types. One of the major haplotypes newly identified showed a strong geographical cluster around the Seto Inland Sea in the Chugoku and Kansai areas of Western Japan, whereas the others were widely distributed throughout Japan. The distribution of predisposing haplotypes contributes to the geographical differences in prevalence of SCA6.
The land management and operations database (LMOD)

USDA-ARS?s Scientific Manuscript database

This paper presents the design, implementation, deployment, and application of the Land Management and Operations Database (LMOD). LMOD is the single authoritative source for reference land management and operation reference data within the USDA enterprise data warehouse. LMOD supports modeling appl...
Fire-induced water-repellent soils, an annotated bibliography

USGS Publications Warehouse

Kalendovsky, M.A.; Cannon, S.H.

1997-01-01

The development and nature of water-repellent, or hydrophobic, soils are important issues in evaluating hillslope response to fire. The following annotated bibliography was compiled to consolidate existing published research on the topic. Emphasis was placed on the types, causes, effects and measurement techniques of water repellency, particularly with respect to wildfires and prescribed burns. Each annotation includes a general summary of the respective publication, as well as highlights of interest to this focus. Although some references on the development of water repellency without fires, the chemistry of hydrophobic substances, and remediation of water-repellent conditions are included, coverage of these topics is not intended to be comprehensive. To develop this database, the GeoRef, Agricola, and Water Resources Abstracts databases were searched for appropriate references, and the bibliographies of each reference were then reviewed for additional entries. Additional references will be added to this bibliography as they become available. The annotated bibliography can be accessed on the Web at http://geohazards.cr.usgs.gov/html_files/landslides/ofr97-720/biblio.html. A database consisting of the references and keywords is available through a link at the above address. This database was compiled using EndNote2 plus software by Niles and Associates, and is necessary to search the database.
Standardized molecular diagnostic tool for the identification of cryptic species within the Bemisia tabaci complex.

PubMed

Elfekih, Samia; Tay, Wee Tek; Gordon, Karl; Court, Leon N; De Barro, Paul J

2018-01-01

The whitefly Bemisia tabaci complex harbours over 40 cryptic species that have been placed in 11 phylogenetically distinct clades based on the molecular characterization of partial mitochondrial DNA COI (mtCOI) gene region. Four cryptic species are currently within the invasive clade, i.e. MED, MEAM1, MEAM2 and IO. Correct identification of these species is a critical step towards implementing reliable measures for plant biosecurity and border protection; however, no standardized B. tabaci-specific primers are currently available which has caused inconsistencies in the species identification processes. We report three sets of polymerase chain reaction (PCR) primers developed to amplify the mtCOI region which can be used for genotyping MED, MEAM1 and IO species, and tested these primers on 91 MED, 35 MEAM1 and five IO individuals. PCR and sequencing of amplicons identified a total of 21, six and one haplotypes in MED, MEAM1 and IO respectively, of which six haplotypes were new to the B. tabaci database. These primer pairs enabled standardization and robust molecular species identification via mtCOI screening of the targeted invasive cryptic species and will improve quarantine decisions. Use of this diagnostic tool could be extended to other species within the complex. © 2017 Society of Chemical Industry. © 2017 Society of Chemical Industry.
Founder haplotype analysis of Fanconi anemia in the Korean population finds common ancestral haplotypes for a FANCG variant.

PubMed

Park, Joonhong; Kim, Myungshin; Jang, Woori; Chae, Hyojin; Kim, Yonggoo; Chung, Nack-Gyun; Lee, Jae-Wook; Cho, Bin; Jeong, Dae-Chul; Park, In Yang; Park, Mi Sun

2015-05-01

A common ancestral haplotype is strongly suggested in the Korean and Japanese patients with Fanconi anemia (FA), because common mutations have been frequently found: c.2546delC and c.3720_3724delAAACA of FANCA; c.307+1G>C, c.1066C>T, and c.1589_1591delATA of FANCG. Our aim in this study was to investigate the origin of these common mutations of FANCA and FANCG. We genotyped 13 FA patients consisting of five FA-A patients and eight FA-G patients from the Korean FA population. Microsatellite markers used for haplotype analysis included four CA repeat markers which are closely linked with FANCA and eight CA repeat markers which are contiguous with FANCG. As a result, Korean FA-A patients carrying c.2546delC or c.3720_3724delAAACA did not share the same haplotypes. However, three unique haplotypes carrying c.307+1G>C, c.1066C > T, or c.1589_1591delATA, that consisted of eight polymorphic loci covering a flanking region were strongly associated with Korean FA-G, consistent with founder haplotypes reported previously in the Japanese FA-G population. Our finding confirmed the common ancestral haplotypes on the origins of the East Asian FA-G patients, which will improve our understanding of the molecular population genetics of FA-G. To the best of our knowledge, this is the first report on the association between disease-linked mutations and common ancestral haplotypes in the Korean FA population. © 2015 John Wiley & Sons Ltd/University College London.
Mitochondrial DNA haplotype distribution patterns in Pinus ponderosa (Pinaceae): range-wide evolutionary history and implications for conservation.

PubMed

Potter, Kevin M; Hipkins, Valerie D; Mahalovich, Mary F; Means, Robert E

2013-08-01

Ponderosa pine (Pinus ponderosa Douglas ex P. Lawson & C. Lawson) exhibits complicated patterns of morphological and genetic variation across its range in western North America. This study aims to clarify P. ponderosa evolutionary history and phylogeography using a highly polymorphic mitochondrial DNA marker, with results offering insights into how geographical and climatological processes drove the modern evolutionary structure of tree species in the region. We amplified the mtDNA nad1 second intron minisatellite region for 3,100 trees representing 104 populations, and sequenced all length variants. We estimated population-level haplotypic diversity and determined diversity partitioning among varieties, races and populations. After aligning sequences of minisatellite repeat motifs, we evaluated evolutionary relationships among haplotypes. The geographical structuring of the 10 haplotypes corresponded with division between Pacific and Rocky Mountain varieties. Pacific haplotypes clustered with high bootstrap support, and appear to have descended from Rocky Mountain haplotypes. A greater proportion of diversity was partitioned between Rocky Mountain races than between Pacific races. Areas of highest haplotypic diversity were the southern Sierra Nevada mountain range in California, northwestern California, and southern Nevada. Pinus ponderosa haplotype distribution patterns suggest a complex phylogeographic history not revealed by other genetic and morphological data, or by the sparse paleoecological record. The results appear consistent with long-term divergence between the Pacific and Rocky Mountain varieties, along with more recent divergences not well-associated with race. Pleistocene refugia may have existed in areas of high haplotypic diversity, as well as the Great Basin, Southwestern United States/northern Mexico, and the High Plains.
Haplotypes composed of minor frequency single nucleotide polymorphisms of the TNF gene protect from progression into sepsis: A study using the new sepsis classification.

PubMed

Retsas, Theodoros; Huse, Klaus; Lazaridis, Lazaros-Dimitrios; Karampela, Niki; Bauer, Michael; Platzer, Matthias; Kolonia, Virginia; Papageorgiou, Eirini; Giamarellos-Bourboulis, Evangelos J; Dimopoulos, George

2018-02-01

Several articles have provided conflicting results regarding the role of single nucleotide polymorphisms (SNPs) in the promoter region of the TNF gene in susceptibility to sepsis. Former articles have been based on previous definitions of sepsis. This study investigated the influence of TNF haplotypes on the development of sepsis using the new Sepsis-3 definitions. DNA was isolated from patients suffering from infection and systemic inflammatory response syndrome. Haplotyping was performed for six SNPs of TNF. The serum levels of tumour necrosis factor alpha (TNF-α) of these patients were measured using an enzyme immunosorbent assay. Patients were classified into infection and sepsis categories using the Sepsis-3 definitions. Associations between the TNF haplotypes and the clinical characteristics and serum TNF-α levels of the patients were examined. The most common TNF haplotype h1 was composed of major alleles of the studied SNPs. Carriage of haplotypes composed of minor frequency alleles was associated with a lower risk of developing sepsis (odds ratio 0.41, 95% confidence interval 0.19-0.88, p=0.022), but this did not affect the 28-day outcome. Serum TNF-α levels were significantly higher among patients homozygous for h1 haplotypes who developed sepsis compared to infection (p=0.032); a similar result was not observed for patients carrying other haplotypes. Haplotypes containing minor frequency SNP alleles of TNF protect against the development of sepsis without affecting the outcome. Copyright © 2017 The Author(s). Published by Elsevier Ltd.. All rights reserved.
Mitochondrial Haplotype Diversity in Zambian Lions: Bridging a Gap in the Biogeography of an Iconic Species

PubMed Central

Curry, Caitlin J.; White, Paula A.; Derr, James N.

2015-01-01

Analysis of DNA sequence diversity at the 12S to 16S mitochondrial genes of 165 African lions (Panthera leo) from five main areas in Zambia has uncovered haplotypes which link Southern Africa with East Africa. Phylogenetic analysis suggests Zambia may serve as a bridge connecting the lion populations in southern Africa to eastern Africa, supporting earlier hypotheses that eastern-southern Africa may represent the evolutionary cradle for the species. Overall gene diversity throughout the Zambian lion population was 0.7319 +/- 0.0174 with eight haplotypes found; three haplotypes previously described and the remaining five novel. The addition of these five novel haplotypes, so far only found within Zambia, nearly doubles the number of haplotypes previously reported for any given geographic location of wild lions. However, based on an AMOVA analysis of these haplotypes, there is little to no matrilineal gene flow (Fst = 0.47) when the eastern and western regions of Zambia are considered as two regional sub-populations. Crossover haplotypes (H9, H11, and Z1) appear in both populations as rare in one but common in the other. This pattern is a possible result of the lion mating system in which predominately males disperse, as all individuals with crossover haplotypes were male. The determination and characterization of lion sub-populations, such as done in this study for Zambia, represent a higher-resolution of knowledge regarding both the genetic health and connectivity of lion populations, which can serve to inform conservation and management of this iconic species. PMID:26674533

The targetable A1 Huntington disease haplotype has distinct Amerindian and European origins in Latin America

PubMed Central

Kay, Chris; Tirado-Hurtado, Indira; Cornejo-Olivas, Mario; Collins, Jennifer A; Wright, Galen; Inca-Martinez, Miguel; Veliz-Otani, Diego; Ketelaar, Maria E; Slama, Ramy A; Ross, Colin J; Mazzetti, Pilar; Hayden, Michael R

2017-01-01

Huntington disease (HD) is a dominant neurodegenerative disorder caused by a CAG repeat expansion in the Huntingtin (HTT) gene. HD occurs worldwide, but the causative mutation is found on different HTT haplotypes in distinct ethnic groups. In Latin America, HD is thought to have European origins, but indigenous Amerindian ancestry has not been investigated. Here, we report dense HTT haplotypes in 62 mestizo Peruvian HD families, 17 HD families from across Latin America, and 42 controls of defined Peruvian Amerindian ethnicity to determine the origin of HD in populations of admixed Amerindian and European descent. HD in Peru occurs most frequently on the A1 HTT haplotype (73%), as in Europe, but on an unexpected indigenous variant also found in Amerindian controls. This Amerindian A1 HTT haplotype predominates over the European A1 variant among geographically disparate Latin American controls and in HD families from across Latin America, supporting an indigenous origin of the HD mutation in mestizo American populations. We also show that a proportion of HD mutations in Peru occur on a C1 HTT haplotype of putative Amerindian origin (14%). The majority of HD mutations in Latin America may therefore occur on haplotypes of Amerindian ancestry rather than on haplotypes resulting from European admixture. Despite the distinct ethnic ancestry of Amerindian and European A1 HTT, alleles on the parent A1 HTT haplotype allow for development of identical antisense molecules to selectively silence the HD mutation in the greatest proportion of patients in both Latin American and European populations. PMID:28000697
Mitochondrial Haplotype Diversity in Zambian Lions: Bridging a Gap in the Biogeography of an Iconic Species.

PubMed

Curry, Caitlin J; White, Paula A; Derr, James N

2015-01-01

Analysis of DNA sequence diversity at the 12S to 16S mitochondrial genes of 165 African lions (Panthera leo) from five main areas in Zambia has uncovered haplotypes which link Southern Africa with East Africa. Phylogenetic analysis suggests Zambia may serve as a bridge connecting the lion populations in southern Africa to eastern Africa, supporting earlier hypotheses that eastern-southern Africa may represent the evolutionary cradle for the species. Overall gene diversity throughout the Zambian lion population was 0.7319 +/- 0.0174 with eight haplotypes found; three haplotypes previously described and the remaining five novel. The addition of these five novel haplotypes, so far only found within Zambia, nearly doubles the number of haplotypes previously reported for any given geographic location of wild lions. However, based on an AMOVA analysis of these haplotypes, there is little to no matrilineal gene flow (Fst = 0.47) when the eastern and western regions of Zambia are considered as two regional sub-populations. Crossover haplotypes (H9, H11, and Z1) appear in both populations as rare in one but common in the other. This pattern is a possible result of the lion mating system in which predominately males disperse, as all individuals with crossover haplotypes were male. The determination and characterization of lion sub-populations, such as done in this study for Zambia, represent a higher-resolution of knowledge regarding both the genetic health and connectivity of lion populations, which can serve to inform conservation and management of this iconic species.
Reproductive status of overwintering potato psyllid: absence of photoperiod effects

USDA-ARS?s Scientific Manuscript database

We examined the effects of photoperiod on reproductive diapause of three haplotypes of potato psyllid, Bactericera cockerelli (Hemiptera: Triozidae), collected from three geographic locations: south Texas (Central haplotype), California (Western haplotype), and Washington State (Northwestern haploty...
MtDNA diversity among four Portuguese autochthonous dog breeds: a fine-scale characterisation

PubMed Central

van Asch, Barbara; Pereira, Luísa; Pereira, Filipe; Santa-Rita, Pedro; Lima, Manuela; Amorim, António

2005-01-01

Background The picture of dog mtDNA diversity, as obtained from geographically wide samplings but from a small number of individuals per region or breed, has revealed weak geographic correlation and high degree of haplotype sharing between very distant breeds. We aimed at a more detailed picture through extensive sampling (n = 143) of four Portuguese autochthonous breeds – Castro Laboreiro Dog, Serra da Estrela Mountain Dog, Portuguese Sheepdog and Azores Cattle Dog-and comparatively reanalysing published worldwide data. Results Fifteen haplotypes belonging to four major haplogroups were found in these breeds, of which five are newly reported. The Castro Laboreiro Dog presented a 95% frequency of a new A haplotype, while all other breeds contained a diverse pool of existing lineages. The Serra da Estrela Mountain Dog, the most heterogeneous of the four Portuguese breeds, shared haplotypes with the other mainland breeds, while Azores Cattle Dog shared no haplotypes with the other Portuguese breeds. A review of mtDNA haplotypes in dogs across the world revealed that: (a) breeds tend to display haplotypes belonging to different haplogroups; (b) haplogroup A is present in all breeds, and even uncommon haplogroups are highly dispersed among breeds and continental areas; (c) haplotype sharing between breeds of the same region is lower than between breeds of different regions and (d) genetic distances between breeds do not correlate with geography. Conclusion MtDNA haplotype sharing occurred between Serra da Estrela Mountain dogs (with putative origin in the centre of Portugal) and two breeds in the north and south of the country-with the Castro Laboreiro Dog (which behaves, at the mtDNA level, as a sub-sample of the Serra da Estrela Mountain Dog) and the southern Portuguese Sheepdog. In contrast, the Azores Cattle Dog did not share any haplotypes with the other Portuguese breeds, but with dogs sampled in Northern Europe. This suggested that the Azores Cattle Dog descended maternally from Northern European dogs rather than Portuguese mainland dogs. A review of published mtDNA haplotypes identified thirteen non-Portuguese breeds with sufficient data for comparison. Comparisons between these thirteen breeds, and the four Portuguese breeds, demonstrated widespread haplotype sharing, with the greatest diversity among Asian dogs, in accordance with the central role of Asia in canine domestication. PMID:15972107
Comparative structural analysis of Bru1 region homeologs in Saccharum spontaneum and S. officinarum

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhang, Jisen; Sharma, Anupma; Yu, Qingyi

Here, sugarcane is a major sugar and biofuel crop, but genomic research and molecular breeding have lagged behind other major crops due to the complexity of auto-allopolyploid genomes. Sugarcane cultivars are frequently aneuploid with chromosome number ranging from 100 to 130, consisting of 70-80 % S. officinarum, 10-20 % S. spontaneum, and 10 % recombinants between these two species. Analysis of a genomic region in the progenitor autoploid genomes of sugarcane hybrid cultivars will reveal the nature and divergence of homologous chromosomes. As a result, to investigate the origin and evolution of haplotypes in the Bru1 genomic regions in sugarcanemore » cultivars, we identified two BAC clones from S. spontaneum and four from S. officinarum and compared to seven haplotype sequences from sugarcane hybrid R570. The results clarified the origin of seven homologous haplotypes in R570, four haplotypes originated from S. officinarum, two from S. spontaneum and one recombinant.. Retrotransposon insertions and sequences variations among the homologous haplotypes sequence divergence ranged from 18.2 % to 60.5 % with an average of 33. 7 %. Gene content and gene structure were relatively well conserved among the homologous haplotypes. Exon splitting occurred in haplotypes of the hybrid genome but not in its progenitor genomes. Tajima's D analysis revealed that S. spontaneum hapotypes in the Bru1 genomic regions were under strong directional selection. Numerous inversions, deletions, insertions and translocations were found between haplotypes within each genome. In conclusion, this is the first comparison among haplotypes of a modern sugarcane hybrid and its two progenitors. Tajima's D results emphasized the crucial role of this fungal disease resistance gene for enhancing the fitness of this species and indicating that the brown rust resistance gene in R570 is from S. spontaneum. Species-specific InDel, sequences similarity and phylogenetic analysis of homologous genes can be used for identifying the origin of S. spontaneum and S. officinarum haplotype in Saccharum hybrids. Comparison of exon splitting among the homologous haplotypes suggested that the genome rearrangements in Saccharum hybrids S. officinarum would be sufficient for proper genome assembly of this autopolyploid genome. Retrotransposon insertions and sequences variations among the homologous haplotypes sequence divergence may allow sequencing and assembling the autopolyploid Saccharum genomes and the auto-allopolyploid hybrid genomes using whole genome shotgun sequencing.« less
Comparative structural analysis of Bru1 region homeologs in Saccharum spontaneum and S. officinarum

DOE PAGES

Zhang, Jisen; Sharma, Anupma; Yu, Qingyi; ...

2016-06-10

Here, sugarcane is a major sugar and biofuel crop, but genomic research and molecular breeding have lagged behind other major crops due to the complexity of auto-allopolyploid genomes. Sugarcane cultivars are frequently aneuploid with chromosome number ranging from 100 to 130, consisting of 70-80 % S. officinarum, 10-20 % S. spontaneum, and 10 % recombinants between these two species. Analysis of a genomic region in the progenitor autoploid genomes of sugarcane hybrid cultivars will reveal the nature and divergence of homologous chromosomes. As a result, to investigate the origin and evolution of haplotypes in the Bru1 genomic regions in sugarcanemore » cultivars, we identified two BAC clones from S. spontaneum and four from S. officinarum and compared to seven haplotype sequences from sugarcane hybrid R570. The results clarified the origin of seven homologous haplotypes in R570, four haplotypes originated from S. officinarum, two from S. spontaneum and one recombinant.. Retrotransposon insertions and sequences variations among the homologous haplotypes sequence divergence ranged from 18.2 % to 60.5 % with an average of 33. 7 %. Gene content and gene structure were relatively well conserved among the homologous haplotypes. Exon splitting occurred in haplotypes of the hybrid genome but not in its progenitor genomes. Tajima's D analysis revealed that S. spontaneum hapotypes in the Bru1 genomic regions were under strong directional selection. Numerous inversions, deletions, insertions and translocations were found between haplotypes within each genome. In conclusion, this is the first comparison among haplotypes of a modern sugarcane hybrid and its two progenitors. Tajima's D results emphasized the crucial role of this fungal disease resistance gene for enhancing the fitness of this species and indicating that the brown rust resistance gene in R570 is from S. spontaneum. Species-specific InDel, sequences similarity and phylogenetic analysis of homologous genes can be used for identifying the origin of S. spontaneum and S. officinarum haplotype in Saccharum hybrids. Comparison of exon splitting among the homologous haplotypes suggested that the genome rearrangements in Saccharum hybrids S. officinarum would be sufficient for proper genome assembly of this autopolyploid genome. Retrotransposon insertions and sequences variations among the homologous haplotypes sequence divergence may allow sequencing and assembling the autopolyploid Saccharum genomes and the auto-allopolyploid hybrid genomes using whole genome shotgun sequencing.« less
16 CFR 1102.6 - Definitions.

Code of Federal Regulations, 2011 CFR

2011-01-01

... AVAILABLE CONSUMER PRODUCT SAFETY INFORMATION DATABASE (Eff. Jan. 10, 2011) Background and Definitions... Product Safety Information Database. (2) Commission or CPSC means the Consumer Product Safety Commission... Information Database, also referred to as the Database, means the database on the safety of consumer products...
Genetic differences in the two main groups of the Japanese population based on autosomal SNPs and haplotypes.

PubMed

Yamaguchi-Kabata, Yumi; Tsunoda, Tatsuhiko; Kumasaka, Natsuhiko; Takahashi, Atsushi; Hosono, Naoya; Kubo, Michiaki; Nakamura, Yusuke; Kamatani, Naoyuki

2012-05-01

Although the Japanese population has a rather low genetic diversity, we recently confirmed the presence of two main clusters (the Hondo and Ryukyu clusters) through principal component analysis of genome-wide single-nucleotide polymorphism (SNP) genotypes. Understanding the genetic differences between the two main clusters requires further genome-wide analyses based on a dense SNP set and comparison of haplotype frequencies. In the present study, we determined haplotypes for the Hondo cluster of the Japanese population by detecting SNP homozygotes with 388,591 autosomal SNPs from 18,379 individuals and estimated the haplotype frequencies. Haplotypes for the Ryukyu cluster were inferred by a statistical approach using the genotype data from 504 individuals. We then compared the haplotype frequencies between the Hondo and Ryukyu clusters. In most genomic regions, the haplotype frequencies in the Hondo and Ryukyu clusters were very similar. However, in addition to the human leukocyte antigen region on chromosome 6, other genomic regions (chromosomes 3, 4, 5, 7, 10 and 12) showed dissimilarities in haplotype frequency. These regions were enriched for genes involved in the immune system, cell-cell adhesion and the intracellular signaling cascade. These differentiated genomic regions between the Hondo and Ryukyu clusters are of interest because they (1) should be examined carefully in association studies and (2) likely contain genes responsible for morphological or physiological differences between the two groups.
Two families from New England with usher syndrome type IC with distinct haplotypes.

PubMed

DeAngelis, M M; McGee, T L; Keats, B J; Slim, R; Berson, E L; Dryja, T P

2001-03-01

To search for patients with Usher syndrome type IC among those with Usher syndrome type I who reside in New England. Genotype analysis of microsatellite markers closely linked to the USH1C locus was done using the polymerase chain reaction. We compared the haplotype of our patients who were homozygous in the USH1C region with the haplotypes found in previously reported USH1C Acadian families who reside in southwestern Louisiana and from a single family residing in Lebanon. Of 46 unrelated cases of Usher syndrome type I residing in New England, two were homozygous at genetic markers in the USH1C region. Of these, one carried the Acadian USH1C haplotype and had Acadian ancestors (that is, from Nova Scotia) who did not participate in the 1755 migration of Acadians to Louisiana. The second family had a haplotype that proved to be the same as that of a family with USH1C residing in Lebanon. Each of the two families had haplotypes distinct from the other. This is the first report that some patients residing in New England have Usher syndrome type IC. Patients with Usher syndrome type IC can have the Acadian haplotype or the Lebanese haplotype compatible with the idea that at least two independently arising pathogenic mutations have occurred in the yet-to-be identified USH1C gene.
Hapl-o-Mat: open-source software for HLA haplotype frequency estimation from ambiguous and heterogeneous data.

PubMed

Schäfer, Christian; Schmidt, Alexander H; Sauter, Jürgen

2017-05-30

Knowledge of HLA haplotypes is helpful in many settings as disease association studies, population genetics, or hematopoietic stem cell transplantation. Regarding the recruitment of unrelated hematopoietic stem cell donors, HLA haplotype frequencies of specific populations are used to optimize both donor searches for individual patients and strategic donor registry planning. However, the estimation of haplotype frequencies from HLA genotyping data is challenged by the large amount of genotype data, the complex HLA nomenclature, and the heterogeneous and ambiguous nature of typing records. To meet these challenges, we have developed the open-source software Hapl-o-Mat. It estimates haplotype frequencies from population data including an arbitrary number of loci using an expectation-maximization algorithm. Its key features are the processing of different HLA typing resolutions within a given population sample and the handling of ambiguities recorded via multiple allele codes or genotype list strings. Implemented in C++, Hapl-o-Mat facilitates efficient haplotype frequency estimation from large amounts of genotype data. We demonstrate its accuracy and performance on the basis of artificial and real genotype data. Hapl-o-Mat is a versatile and efficient software for HLA haplotype frequency estimation. Its capability of processing various forms of HLA genotype data allows for a straightforward haplotype frequency estimation from typing records usually found in stem cell donor registries.
Factor IX gene haplotypes in Amerindians.

PubMed

Franco, R F; Araújo, A G; Zago, M A; Guerreiro, J F; Figueiredo, M S

1997-02-01

We have determined the haplotypes of the factor IX gene for 95 Indians from 5 Brazilian Amazon tribes: Wayampí, Wayana-Apalaí, Kayapó, Arára, and Yanomámi. Eight polymorphisms linked to the factor IX gene were investigated: MseI (at 5', nt -698), BamHI (at 5', nt -561), DdeI (intron 1), BamHI (intron 2), XmnI (intron 3), TaqI (intron 4), MspI (intron 4), and HhaI (at 3', approximately 8 kb). The results of the haplotype distribution and the allele frequencies for each of the factor IX gene polymorphisms in Amerindians were similar to the results reported for Asian populations but differed from results for other ethnic groups. Only five haplotypes were identified within the entire Amerindian study population, and the haplotype distribution was significantly different among the five tribes, with one (Arára) to four (Wayampí) haplotypes being found per tribe. These findings indicate a significant heterogeneity among the Indian tribes and contrast with the homogeneous distribution of the beta-globin gene cluster haplotypes but agree with our recent findings on the distribution of alpha-globin gene cluster haplotypes and the allele frequencies for six VNTRs in the same Amerindian tribes. Our data represent the first study of factor IX-associated polymorphisms in Amerindian populations and emphasizes the applicability of these genetic markers for population and human evolution studies.
Spatial and temporal distribution of the neutral polymorphisms in the last ZFX intron: analysis of the haplotype structure and genealogy.

PubMed Central

Jaruzelska, J; Zietkiewicz, E; Batzer, M; Cole, D E; Moisan, J P; Scozzari, R; Tavaré, S; Labuda, D

1999-01-01

With 10 segregating sites (simple nucleotide polymorphisms) in the last intron (1089 bp) of the ZFX gene we have observed 11 haplotypes in 336 chromosomes representing a worldwide array of 15 human populations. Two haplotypes representing 77% of all chromosomes were distributed almost evenly among four continents. Five of the remaining haplotypes were detected in Africa and 4 others were restricted to Eurasia and the Americas. Using the information about the ancestral state of the segregating positions (inferred from human-great ape comparisons), we applied coalescent analysis to estimate the age of the polymorphisms and the resulting haplotypes. The oldest haplotype, with the ancestral alleles at all the sites, was observed at low frequency only in two groups of African origin. Its estimated age of 740 to 1100 kyr corresponded to the time to the most recent common ancestor. The two most frequent worldwide distributed haplotypes were estimated at 550 to 840 and 260 to 400 kyr, respectively, while the age of the continentally restricted polymorphisms was 120 to 180 kyr and smaller. Comparison of spatial and temporal distribution of the ZFX haplotypes suggests that modern humans diverged from the common ancestral stock in the Middle Paleolithic era. Subsequent range expansion prevented substantial gene flow among continents, separating African groups from populations that colonized Eurasia and the New World. PMID:10388827
Spatial and temporal distribution of the neutral polymorphisms in the last ZFX intron: analysis of the haplotype structure and genealogy.

PubMed

Jaruzelska, J; Zietkiewicz, E; Batzer, M; Cole, D E; Moisan, J P; Scozzari, R; Tavaré, S; Labuda, D

1999-07-01

With 10 segregating sites (simple nucleotide polymorphisms) in the last intron (1089 bp) of the ZFX gene we have observed 11 haplotypes in 336 chromosomes representing a worldwide array of 15 human populations. Two haplotypes representing 77% of all chromosomes were distributed almost evenly among four continents. Five of the remaining haplotypes were detected in Africa and 4 others were restricted to Eurasia and the Americas. Using the information about the ancestral state of the segregating positions (inferred from human-great ape comparisons), we applied coalescent analysis to estimate the age of the polymorphisms and the resulting haplotypes. The oldest haplotype, with the ancestral alleles at all the sites, was observed at low frequency only in two groups of African origin. Its estimated age of 740 to 1100 kyr corresponded to the time to the most recent common ancestor. The two most frequent worldwide distributed haplotypes were estimated at 550 to 840 and 260 to 400 kyr, respectively, while the age of the continentally restricted polymorphisms was 120 to 180 kyr and smaller. Comparison of spatial and temporal distribution of the ZFX haplotypes suggests that modern humans diverged from the common ancestral stock in the Middle Paleolithic era. Subsequent range expansion prevented substantial gene flow among continents, separating African groups from populations that colonized Eurasia and the New World.
Mineralocorticoid receptor haplotype moderates the effects of oral contraceptives and menstrual cycle on emotional information processing.

PubMed

Hamstra, Danielle A; de Kloet, E Ronald; Tollenaar, Marieke; Verkuil, Bart; Manai, Meriem; Putman, Peter; Van der Does, Willem

2016-10-01

The processing of emotional information is affected by menstrual cycle phase and by the use of oral contraceptives (OCs). The stress hormone cortisol is known to affect emotional information processing via the limbic mineralocorticoid receptor (MR). We investigated in an exploratory study whether the MR-genotype moderates the effect of both OC-use and menstrual cycle phase on emotional cognition. Healthy premenopausal volunteers (n=93) of West-European descent completed a battery of emotional cognition tests. Forty-nine participants were OC users and 44 naturally cycling, 21 of whom were tested in the early follicular (EF) and 23 in the mid-luteal (ML) phase of the menstrual cycle. In MR-haplotype 1/3 carriers, ML women gambled more than EF women when their risk to lose was relatively small. In MR-haplotype 2, ML women gambled more than EF women, regardless of their odds of winning. OC-users with MR-haplotype 1/3 recognised fewer facial expressions than ML women with MR-haplotype 1/3. MR-haplotype 1/3 carriers may be more sensitive to the influence of their female hormonal status. MR-haplotype 2 carriers showed more risky decision-making. As this may reflect optimistic expectations, this finding may support previous observations in female carriers of MR-haplotype 2 in a naturalistic cohort study. © The Author(s) 2016.
Evaluation of haplotype diversity of Achatina fulica (Lissachatina) [Bowdich] from Indian sub-continent by means of 16S rDNA sequence and its phylogenetic relationships with other global populations.

PubMed

Ayyagari, Vijaya Sai; Sreerama, Krupanidhi

2017-08-01

Achatina fulica (Lissachatina fulica) is one of the most invasive species found across the globe causing a significant damage to crops, vegetables, and horticultural plants. This terrestrial snail is native to east Africa and spread to different parts of the world by introductions. India, a hot spot for biodiversity of several endemic gastropods, has witnessed an outburst of this snail population in several parts of the country posing a serious threat to crop loss and also to human health. With an objective to evaluate the genetic diversity of this snail, we have sampled this snail from different parts of India and analyzed its haplotype diversity by means of 16S rDNA sequence information. Apart from this, we have studied the phylogenetic relationships of the isolates sequenced in the present study in relation with other global populations by Bayesian and Maximum-likelihood approaches. Of the isolates sequenced, haplotype 'C' is the predominant one. A new haplotype 'S' from the state of Odisha was observed. The isolates sequenced in the present study clustered with its conspecifics from the Indian sub-continent. Haplotype network analyses were also carried out for studying the evolution of different haplotypes. It was observed that haplotype 'S' was associated with a Mauritius haplotype 'H', indicating the possibility of multiple introductions of A. fulica to India.
A TNF region haplotype offers protection from typhoid fever in Vietnamese patients

PubMed Central

2009-01-01

The genomic region surrounding the TNF locus on human chromosome 6 has previously been associated with typhoid fever in Vietnam. We used a haplotypic approach to understand this association further. Eighty single nucleotide polymorphisms (SNPs) spanning a 150 kb region were genotyped in 95 Vietnamese individuals (typhoid case/mother/father trios). A subset of data from 33 SNPs with a minor allele frequency of >4.3% was used to construct haplotypes. Fifteen SNPs, which tagged the 42 constructed haplotypes were selected. The haplotype tagging SNPs (T1-T15) were genotyped in 380 confirmed typhoid cases and 380 Vietnamese ethnically matched controls. Allelic frequencies of seven SNPs (T1, T2, T3, T5, T6, T7, T8) were significantly different between typhoid cases and controls. Logistic regression results support the hypothesis that there is just one signal associated with disease at this locus. Haplotype-based analysis of the tag SNPs provided positive evidence of association with typhoid (posterior probability 0.821). The analysis highlighted a low-risk cluster of haplotypes that each carry the minor allele of T1 or T7, but not both, and otherwise carry the combination of alleles *12122*1111 at T1-T11, further supporting the one associated signal hypothesis. Finally, individuals that carry the typhoid fever protective haplotype *12122*1111 also produce a relatively low TNF-α response to LPS. PMID:17503085
Mineralocorticoid receptor haplotypes sex-dependently moderate depression susceptibility following childhood maltreatment.

PubMed

Vinkers, Christiaan H; Joëls, Marian; Milaneschi, Yuri; Gerritsen, Lotte; Kahn, René S; Penninx, Brenda W J H; Boks, Marco P M

2015-04-01

The MR is an important regulator of the hypothalamic-pituitary-adrenal (HPA) axis and a prime target for corticosteroids. There is increasing evidence from both clinical and preclinical studies that the MR has different effects on behavior and mood in males and females. To investigate the hypothesis that the MR sex-dependently influences the relation between childhood maltreatment and depression, we investigated three common and functional MR haplotypes (GA, CA, and CG haplotype, based on rs5522 and rs2070951) in a population-based cohort (N = 665) and an independent clinical cohort from the Netherlands Study of Depression and Anxiety (NESDA) (N = 1639). The CA haplotype sex-dependently moderated the relation between childhood maltreatment and depressive symptoms both in the population-based sample (sex × maltreatment × haplotype: β = -4.07, P = 0.029) and in the clinical sample (sex × maltreatment × haplotype, β = -2.40, P = 0.011). Specifically, female individuals in the population-based sample were protected (β = -4.58, P = 2.0 e(-5)), whereas males in the clinical sample were at increased risk (β = 2.54, P = 0.0022). In line with these results, female GA haplotype carriers displayed increased vulnerability in the population-based sample (β = 4.58, P = 7.5 e(-5)) whereas male CG-carriers showed increased resilience in the clinical sample (β = -2.71, P = 0.016). Consistently, we found a decreased lifetime MDD risk for male GA haplotype carriers following childhood maltreatment but an increased risk for male CA haplotype carriers in the clinical sample. In both samples, sex-dependent effects were observed for GA-GA diplotype carriers. In summary, sex plays an important role in determining whether functional genetic variation in MR is beneficial or detrimental, with an apparent female advantage for the CA haplotype but male advantage for the GA and CG haplotype. These sex-dependent effects of MR on depression susceptibility following childhood maltreatment are relevant in light of the increased prevalence of mood disorders in women and point to a sex-specific role of MR in the etiology of depression following childhood maltreatment. Copyright © 2015 Elsevier Ltd. All rights reserved.
HLA DPA1, DPB1 alleles and haplotypes contribute to the risk associated with type 1 diabetes: analysis of the type 1 diabetes genetics consortium families.

PubMed

Varney, Michael D; Valdes, Ana Maria; Carlson, Joyce A; Noble, Janelle A; Tait, Brian D; Bonella, Persia; Lavant, Eva; Fear, Anna Lisa; Louey, Anthony; Moonsamy, Priscilla; Mychaleckyj, Josyf C; Erlich, Henry

2010-08-01

To determine the relative risk associated with DPA1 and DPB1 alleles and haplotypes in type 1 diabetes. The frequency of DPA1 and DPB1 alleles and haplotypes in type 1 diabetic patients was compared to the family based control frequency in 1,771 families directly and conditional on HLA (B)-DRB1-DQA1-DQB1 linkage disequilibrium. A relative predispositional analysis (RPA) was performed in the presence or absence of the primary HLA DR-DQ associations and the contribution of DP haplotype to individual DR-DQ haplotype risks examined. Eight DPA1 and thirty-eight DPB1 alleles forming seventy-four DPA1-DPB1 haplotypes were observed; nineteen DPB1 alleles were associated with multiple DPA1 alleles. Following both analyses, type 1 diabetes susceptibility was significantly associated with DPB1*0301 (DPA1*0103-DPB1*0301) and protection with DPB1*0402 (DPA1*0103-DPB1*0402) and DPA1*0103-DPB1*0101 but not DPA1*0201-DPB1*0101. In addition, DPB1*0202 (DPA1*0103-DPB1*0202) and DPB1*0201 (DPA1*0103-DPB1*0201) were significantly associated with susceptibility in the presence of the high risk and protective DR-DQ haplotypes. Three associations (DPB1*0301, *0402, and *0202) remained statistically significant when only the extended HLA-A1-B8-DR3 haplotype was considered, suggesting that DPB1 alone may delineate the risk associated with this otherwise conserved haplotype. HLA DP allelic and haplotypic diversity contributes significantly to the risk for type 1 diabetes; DPB1*0301 (DPA1*0103-DPB1*0301) is associated with susceptibility and DPB1*0402 (DPA1*0103-DPB1*0402) and DPA1*0103-DPB1*0101 with protection. Additional evidence is presented for the susceptibility association of DPB1*0202 (DPA1*0103-DPB1*0202) and for a contributory role of individual amino acids and DPA1 or a gene in linkage disequilibrium in DR3-DPB1*0101 positive haplotypes.
Error and Uncertainty in the Accuracy Assessment of Land Cover Maps

NASA Astrophysics Data System (ADS)

Sarmento, Pedro Alexandre Reis

Traditionally the accuracy assessment of land cover maps is performed through the comparison of these maps with a reference database, which is intended to represent the "real" land cover, being this comparison reported with the thematic accuracy measures through confusion matrixes. Although, these reference databases are also a representation of reality, containing errors due to the human uncertainty in the assignment of the land cover class that best characterizes a certain area, causing bias in the thematic accuracy measures that are reported to the end users of these maps. The main goal of this dissertation is to develop a methodology that allows the integration of human uncertainty present in reference databases in the accuracy assessment of land cover maps, and analyse the impacts that uncertainty may have in the thematic accuracy measures reported to the end users of land cover maps. The utility of the inclusion of human uncertainty in the accuracy assessment of land cover maps is investigated. Specifically we studied the utility of fuzzy sets theory, more precisely of fuzzy arithmetic, for a better understanding of human uncertainty associated to the elaboration of reference databases, and their impacts in the thematic accuracy measures that are derived from confusion matrixes. For this purpose linguistic values transformed in fuzzy intervals that address the uncertainty in the elaboration of reference databases were used to compute fuzzy confusion matrixes. The proposed methodology is illustrated using a case study in which the accuracy assessment of a land cover map for Continental Portugal derived from Medium Resolution Imaging Spectrometer (MERIS) is made. The obtained results demonstrate that the inclusion of human uncertainty in reference databases provides much more information about the quality of land cover maps, when compared with the traditional approach of accuracy assessment of land cover maps. None
Biological impact of α genes, β haplotypes, and G6PD activity in sickle cell anemia at baseline and with hydroxyurea

PubMed Central

Arnaud, Cécile; Kamdem, Annie; Hau, Isabelle; Lelong, Françoise; Epaud, Ralph; Pondarré, Corinne; Pissard, Serge

2018-01-01

Sickle cell anemia (SCA), albeit monogenic, has heterogeneous phenotypic expression, mainly related to the level of hemoglobin F (HbF). No large cohort studies have ever compared biological parameters in patients with major β-globin haplotypes; ie, Senegal (SEN), Benin (BEN), and Bantu/Central African Republic (CAR). The aim of this study was to evaluate the biological impact of α genes, β haplotypes, and glucose-6-phosphate dehydrogenase (G6PD) activity at baseline and with hydroxyurea (HU). Homozygous HbS patients from the Créteil pediatric cohort with available α-gene and β-haplotype data were included (n = 580; 301 females and 279 males) in this retrospective study. Homozygous β-haplotype patients represented 74% of cases (37.4% CAR/CAR, 24.3% BEN/BEN, and 12.1% SEN/SEN). HU was given to 168 cohort SCA children. Hematological parameters were recorded when HbF was maximal, and changes (ΔHU-T0) were calculated. At baseline, CAR-haplotype and α-gene numbers were independently and negatively correlated with Hb and positively correlated with lactate dehydrogenase. HbF was negatively correlated with CAR-haplotype numbers and positively with BEN- and SEN-haplotype numbers. The BCL11A/rs1427407 “T” allele, which is favorable for HbF expression, was positively correlated with BEN- and negatively correlated with CAR-haplotype numbers. With HU treatment, Δ and HbF values were positively correlated with the BEN-haplotype number. BEN/BEN patients had higher HbF and Hb levels than CAR/CAR and SEN/SEN patients. In conclusion, we show that BEN/BEN patients have the best response on HU and suggest that this could be related to the higher prevalence of the favorable BCL11A/rs1427407/T/allele for HbF expression in these patients. PMID:29555644

A comprehensive literature review of haplotyping software and methods for use with unrelated individuals.

PubMed

Salem, Rany M; Wessel, Jennifer; Schork, Nicholas J

2005-03-01

Interest in the assignment and frequency analysis of haplotypes in samples of unrelated individuals has increased immeasurably as a result of the emphasis placed on haplotype analyses by, for example, the International HapMap Project and related initiatives. Although there are many available computer programs for haplotype analysis applicable to samples of unrelated individuals, many of these programs have limitations and/or very specific uses. In this paper, the key features of available haplotype analysis software for use with unrelated individuals, as well as pooled DNA samples from unrelated individuals, are summarised. Programs for haplotype analysis were identified through keyword searches on PUBMED and various internet search engines, a review of citations from retrieved papers and personal communications, up to June 2004. Priority was given to functioning computer programs, rather than theoretical models and methods. The available software was considered in light of a number of factors: the algorithm(s) used, algorithm accuracy, assumptions, the accommodation of genotyping error, implementation of hypothesis testing, handling of missing data, software characteristics and web-based implementations. Review papers comparing specific methods and programs are also summarised. Forty-six haplotyping programs were identified and reviewed. The programs were divided into two groups: those designed for individual genotype data (a total of 43 programs) and those designed for use with pooled DNA samples (a total of three programs). The accuracy of programs using various criteria are assessed and the programs are categorised and discussed in light of: algorithm and method, accuracy, assumptions, genotyping error, hypothesis testing, missing data, software characteristics and web implementation. Many available programs have limitations (eg some cannot accommodate missing data) and/or are designed with specific tasks in mind (eg estimating haplotype frequencies rather than assigning most likely haplotypes to individuals). It is concluded that the selection of an appropriate haplotyping program for analysis purposes should be guided by what is known about the accuracy of estimation, as well as by the limitations and assumptions built into a program.
Phylogeography and connectivity of molluscan parasites: Perkinsus spp. in Panama and beyond.

PubMed

Pagenkopp Lohan, Katrina M; Hill-Spanik, Kristina M; Torchin, Mark E; Fleischer, Robert C; Carnegie, Ryan B; Reece, Kimberly S; Ruiz, Gregory M

2018-02-01

Panama is a major hub for commercial shipping between two oceans, making it an ideal location to examine parasite biogeography, potential invasions, and the spread of infectious agents. Our goals were to (i) characterise the diversity and genetic connectivity of Perkinsus spp. haplotypes across the Panamanian Isthmus and (ii) combine these data with sequences from around the world to evaluate the current phylogeography and genetic connectivity of these widespread molluscan parasites. We collected 752 bivalves from 12 locations along the coast of Panama including locations around the Bocas del Toro archipelago and the Caribbean and Pacific entrances to the Panama Canal, from December 2012 to February 2013. We used molecular genetic methods to screen for Perkinsus spp. and obtained internal transcribed spacer region (ITS) ribosomal DNA (rDNA) sequences for all positive samples. Our sequence data were used to evaluate regional haplotype diversity and distribution across both coasts of Panama, and were then combined with publicly available sequences to create global haplotype networks. We found 26 ITS haplotypes from four Perkinsus spp. (1-12 haplotypes per species) in Panama. Perkinsus beihaiensis haplotypes had the highest genetic diversity, were the most regionally widespread, and were associated with the greatest number of hosts. On a global scale, network analyses demonstrated that some haplotypes found in Panama were cosmopolitan (Perkinsus chesapeaki, Perkinsus marinus), while others were more geographically restricted (Perkinsus olseni, P. beihaiensis), indicating different levels of genetic connectivity and dispersal. We found some Perkinsus haplotypes were shared across the Isthmus of Panama and several regions around the world, including across ocean basins. We also found that haplotype diversity is currently underestimated and directly related to the number of sequences. Nevertheless, our results demonstrate long-range dispersal and global connectivity for many haplotypes, suggesting that dispersal through shipping probably contributes to these biogeographical patterns. Published by Elsevier Ltd.
Variation in the prion protein sequence in Dutch goat breeds.

PubMed

Windig, J J; Hoving, R A H; Priem, J; Bossers, A; van Keulen, L J M; Langeveld, J P M

2016-10-01

Scrapie is a neurodegenerative disease occurring in goats and sheep. Several haplotypes of the prion protein increase resistance to scrapie infection and may be used in selective breeding to help eradicate scrapie. In this study, frequencies of the allelic variants of the PrP gene are determined for six goat breeds in the Netherlands. Overall frequencies in Dutch goats were determined from 768 brain tissue samples in 2005, 766 in 2008 and 300 in 2012, derived from random sampling for the national scrapie surveillance without knowledge of the breed. Breed specific frequencies were determined in the winter 2013/2014 by sampling 300 breeding animals from the main breeders of the different breeds. Detailed analysis of the scrapie-resistant K222 haplotype was carried out in 2014 for 220 Dutch Toggenburger goats and in 2015 for 942 goats from the Saanen derived White Goat breed. Nine haplotypes were identified in the Dutch breeds. Frequencies for non-wild type haplotypes were generally low. Exception was the K222 haplotype in the Dutch Toggenburger (29%) and the S146 haplotype in the Nubian and Boer breeds (respectively 7 and 31%). The frequency of the K222 haplotype in the Toggenburger was higher than for any other breed reported in literature, while for the White Goat breed it was with 3.1% similar to frequencies of other Saanen or Saanen derived breeds. Further evidence was found for the existence of two M142 haplotypes, M142 /S240 and M142 /P240 . Breeds vary in haplotype frequencies but frequencies of resistant genotypes are generally low and consequently selective breeding for scrapie resistance can only be slow but will benefit from animals identified in this study. The unexpectedly high frequency of the K222 haplotype in the Dutch Toggenburger underlines the need for conservation of rare breeds in order to conserve genetic diversity rare or absent in other breeds. © 2016 Blackwell Verlag GmbH.
Contribution of HLA-A/B/C/DRB1/DQB1 common haplotypes to donor search outcome in unrelated hematopoietic stem cell transplantation.

PubMed

Pédron, Béatrice; Guérin-El Khourouj, Valérie; Dalle, Jean-Hugues; Ouachée-Chardin, Marie; Yakouben, Karima; Corroyez, France; Auvrignon, Anne; Petit, Arnaud; Landman-Parker, Judith; Leverger, Guy; Baruchel, André; Sterkers, Ghislaine

2011-11-01

In unrelated hematopoietic stem cell transplantation (HSCT), the prediction of donor search outcome at the time of search initiation is of great value for the physicians to delineate the strategy of patient care. The probability of finding an unrelated donor is high for patients who carry at least 1 of the 10 most common HLA haplotypes in Caucasians. As only 10% to 20% patients respond to this criterion, here we aimed at finding additional common haplotypes to improve the prediction of a successful search. HLA broad HLA-A/B/DRB1 haplotypes that were observed with frequencies ≥0.19% in patient families of European origin and that split into ≤2 predominant 4-digit HLA-A/B/C/DRB1/DQB1 haplotypes were considered as common. Carriage of at least 1 of those in 168 patients of various geographic areas with no family donor was confronted to the chance of finding ≥9/10 HLA-matched unrelated donors. Fifty common 4-digit haplotypes were identified. A higher (P < 5 × 10(-6)) chance of finding a suitable donor was found for 55 of 170 (32%) recipients that carried at least 1 of these common haplotypes. Up to now, estimates classified patients into ≥3 groups of probability with ≥1 intermediate group of poor utility for the clinicians. Considering carriage of these common haplotypes together with the frequencies of alleles and of B/C and DRB1/DQB1 associations, which are carried by patient HLA haplotypes, we could classify the patients into 2 groups of probability with a 98% and 26% chance of finding a donor, respectively. Prediction of search outcome could be improved by including the 50 most common HLA haplotypes in the current approaches. Copyright © 2011 American Society for Blood and Marrow Transplantation. Published by Elsevier Inc. All rights reserved.
Association between platelet P2Y12 haplotype and risk of cardiovascular events in chronic coronary disease.

PubMed

Schettert, Isolmar T; Pereira, Alexandre C; Lopes, Neuza H; Hueb, Whady A; Krieger, Jose E

2006-01-01

A positive association was recently described between P2Y12 platelet receptor H1 and H2 haplotypes and peripheral artery disease. We tested the described P2Y12 receptor haplotypes in a group of patients with coronary artery disease. The P2Y12 platelet receptor H1 and H2 haplotypes was tested in a group of 540 patients enrolled in the Medical, Angioplasty, or Surgery Study II (MASS II), a randomized trial comparing treatments for patients with coronary artery disease (CAD) and preserved left ventricular function. After a 3-year follow-up period, the incidence of the composite end point of cardiac death, myocardial infarction, and refractory angina requiring revascularization was determined in the H1/H1, H1/H2 and H2/H2 haplotype groups. We used Student's t-test and the chi-square test to analyze the differences among groups and Kaplan-Meier method to calculate survival curves. Risk was assessed with the use of a Cox proportional-hazards model. The frequency of haplotypes among studied patients were 410 (75.9%) H1/H1, 119 (22.0%) H1/H2 and 11 (2.1%) H2/H2. The baseline clinical characteristics, mean clinical follow-up time and received treatment of each genotype group were similar. We did not disclose any association between haplotype groups regarding the incidence of any of the studied cardiovascular end-points. This is the first report studying the association of P2Y12 platelet receptor H1 and H2 haplotype and cardiovascular events. Our findings do not provide evidence for a strong association between H1/H1 and H1/H2 haplotypes and a increased risk of cardiovascular events in a population with CAD. Future works should address the role of the H2/H2 haplotype as a genetic marker for cardiovascular events.
Association of KIR genotypes and haplotypes with susceptibility to chronic hepatitis B virus infection in Chinese Han population.

PubMed

Lu, Zhiming; Zhang, Bingchang; Chen, Shijun; Gai, Zhongtao; Feng, Zhaolei; Liu, Xiangdong; Liu, Yiqing; Wen, Xin; Li, Li; Jiao, Yulian; Ma, Chunyan; Shao, Song; Cui, Xiangfa; Chen, Guojian; Li, Jianfeng; Zhao, Yueran

2008-12-01

Killer immunoglobulin-like receptor (KIR) genes can regulate the activation of NK and T cells upon interaction with HLA class I molecules. Hepatitis B virus (HBV) infection has been regarded as a multi-factorial disorder disease. Previous studies revealed that KIRs were involved in HCV and HIV infection or clearance. The aim of this study was to explore the possibility of the inheritance of KIR genotypes and haplotypes as a candidate for susceptibility to persistent HBV infection or HBV clearance. The sequence specific primer polymerase chain reaction (SSP-PCR) was employed to identify the KIR genes and pseudogenes in 150 chronic hepatitis B (CHB) patients, 251 spontaneously recovered (SR) controls, and 412 healthy controls. The frequencies of genotype G, M, FZ1 increased in CHB patients compared with healthy control subjects. The frequency of genotype AH was higher in SR controls than that in both CHB patients and healthy controls. The carriage frequencies of genotype G and AH were higher; while, the frequencies of AF and AJ were lower in SR controls than those in healthy control subjects. The frequency of A haplotype was lower, whereas, the frequency of B haplotype was higher in CHB patients and SR controls than those in healthy controls. In healthy controls, haplotype 4 was found lower compared with that in CHB patients and SR controls and the frequency of haplotype 5 was higher in SR controls than that in other two groups. Based on these findings, it seems that the genotypes M and FZ1 are HBV susceptive genotypes; AH, on the other hand, may be protective genotypes that facilitate the clearance of HBV. It appears that the haplotype 4 is HBV susceptive haplotype, whereas, haplotype 5 may be the protective haplotype that facilitates the clearance of HBV.
Impacts of TNF-LTA SNPs/Haplotypes and Lifestyle Factors on Oral Carcinoma in an Indian Population.

PubMed

Bandil, Kapil; Singhal, Pallavi; Sharma, Upma; Hussain, Showket; Basu, Surojit; Parashari, Aditya; Singh, Veena; Sehgal, Ashok; Shivam, Animesh; Ahuja, Puneet; Bharadwaj, Mausumi; Banerjee, Basu Dev; Mehrotra, Ravi

2016-10-01

To investigate a potential association between single-nucleotide polymorphisms (SNPs) and haplotypes at the TNFA-LTA locus and the development of oral cancer in an Indian population. In this study, 150 oral precancer/cancer samples (50 precancer and 100 cancer), along with an equal number of control samples, were genotyped. Six SNPs at the TNF-LTA locus (i.e., -238G/A, -308G/A, -857C/T, -863C/A, -1031T/C, and +252A/G) were analyzed by use of a polymerase chain reaction-restriction fragment length polymorphism method, the assay was validated by sequencing 10 % of samples. The allelic frequencies of TNFA and LTA SNPs were found to be significantly associated with the risk of oral cancer and precancerous lesions in comparison with controls (P < 0.0003). Further haplotypic analysis showed that two haplotypes (ATCTGG and ACACGG) served as risk haplotypes for oral cancer. These haplotypes were also found to be significantly and positively associated with lifestyle habits (tobacco chewing P = 0.04, odds ratio [OR] 3.4) and socioeconomic status (P = 0.01, OR 3.4). We noticed an increased percentage of risk haplotypes correlating with the aggressiveness of oral cancer. The percentages of risk haplotypes were found to be threefold higher in precancer and fourfold higher in advanced stages of oral cancer in comparison with controls. Five SNPs at the TNF-LTA locus (i.e., -308G>A, -857C>T, -863C>A, -1031T>C, and +252A>G) were found to be associated with the development of oral cancer. Two haplotypes (ATCTGG and ACACGG) emerged as major risk haplotypes for oral carcinoma progression and were also found to be associated with lifestyle factors and clinical aggressiveness. These findings make the TNF-LTA locus a suitable candidate for a future biomarker, which may be used either for early detection or for helping to improve treatment efficacy and effectiveness.
Molecular tracing of confiscated pangolin scales for conservation and illegal trade monitoring in Southeast Asia

USGS Publications Warehouse

Zhang, Huarong; Miller, Mark P.; Yang, Feng; Chan, Hon Ki; Gaubert, Philippe; Ades, Gary; Fischer, Gunter A

2015-01-01

Despite being protected by both international and national regulations, pangolins are threatened by illegal trade. Here we report mitochondrial DNA identification and haplotype richness estimation, using 239 pangolin scale samples from two confiscations in Hong Kong. We found a total of 13 genetically distinct cytochrome c oxidase I (COI) haplotypes in two confiscations (13 and ten haplotypes respectively, with ten shared haplotypes between confiscations). These haplotypes clustered in two distinct clades with one clade representing the Sunda pangolin (Manisjavanica). The other clade did not match with any known Asian pangolin sequences, and likely represented a cryptic pangolin lineage in Asia. By fitting sample coverage and rarefaction/regression models to our sample data, we predicted that the total number of COI haplotypes in two confiscations were 14.86 and 11.06 respectively, suggesting that our sampling caught the majority of haplotypes and that we had adequately characterized each confiscation. We detected substantial sequence divergence among the seized scales, likely evidencing that the Sunda pangolins were harvested over wide geographical areas across Southeast Asia. Our study illustrates the value of applying DNA forensics for illegal wildlife trade monitoring.
Whole-loop mitochondrial DNA D-loop sequence variability in Egyptian Arabian equine matrilines

PubMed Central

Hudson, William

2017-01-01

Background Egyptian Arabian horses have been maintained in a state of genetic isolation for over a hundred years. There is only limited genetic proof that the studbook records of female lines of Egyptian Arabian pedigrees are reliable. This study characterized the mitochondrial DNA (mtDNA) signatures of 126 horses representing 14 matrilines in the Egyptian Agricultural Organization (EAO) horse-breeding program. Findings Analysis of the whole D-loop sequence yielded additional information compared to hypervariable region-1 (HVR1) analysis alone, with 42 polymorphic sites representing ten haplotypes compared to 16 polymorphic sites representing nine haplotypes, respectively. Most EAO haplotypes belonged to ancient haplogroups, suggesting origin from a wide geographical area over many thousands of years, although one haplotype was novel. Conclusions Historical families share haplotypes and some individuals from different strains belonged to the same haplogroup: the classical EAO strain designation is not equivalent to modern monophyletic matrilineal groups. Phylogenetic inference showed that the foundation mares of the historical haplotypes were highly likely to have the same haplotypes as the animals studied (p > 0.998 in all cases), confirming the reliability of EAO studbook records and providing the opportunity for breeders to confirm the ancestry of their horses. PMID:28859174
The Geographic Distribution of Human Y Chromosome Variation

PubMed Central

Hammer, M. F.; Spurdle, A. B.; Karafet, T.; Bonner, M. R.; Wood, E. T.; Novelletto, A.; Malaspina, P.; Mitchell, R. J.; Horai, S.; Jenkins, T.; Zegura, S. L.

1997-01-01

We examined variation on the nonrecombining portion of the human Y chromosome to investigate human evolution during the last 200,000 years. The Y-specific polymorphic sites included the Y Alu insertional polymorphism or ``YAP'' element (DYS287), the poly(A) tail associated with the YAP element, three point mutations in close association with the YAP insertion site, an A-G polymorphic transition (DYS271), and a tetranucleotide microsatellite (DYS19). Global variation at the five bi-allelic sites (DYS271, DYS287, and the three point mutations) gave rise to five ``YAP haplotypes'' in 60 populations from Africa, Europe, Asia, Australasia, and the New World (n = 1500). Combining the multi-allelic variation at the microsatellite loci (poly(A) tail and DYS19) with the YAP haplotypes resulted in a total of 27 ``combination haplotypes''. All five of the YAP haplotypes and 21 of the 27 combination haplotypes were found in African populations, which had greater haplotype diversity than did populations from other geographical locations. Only subsets of the five YAP haplotypes were found outside of Africa. Patterns of observed variation were compatible with a variety of hypotheses, including multiple human migrations and range expansions. PMID:9055088
MHC Class II haplotypes of Colombian Amerindian tribes

PubMed Central

Yunis, Juan J.; Yunis, Edmond J.; Yunis, Emilio

2013-01-01

We analyzed 1041 individuals belonging to 17 Amerindian tribes of Colombia, Chimila, Bari and Tunebo (Chibcha linguistic family), Embera, Waunana (Choco linguistic family), Puinave and Nukak (Maku-Puinave linguistic families), Cubeo, Guanano, Tucano, Desano and Piratapuyo (Tukano linguistic family), Guahibo and Guayabero (Guayabero Linguistic Family), Curripaco and Piapoco (Arawak linguistic family) and Yucpa (Karib linguistic family). for MHC class II haplotypes (HLA-DRB1, DQA1, DQB1). Approximately 90% of the MHC class II haplotypes found among these tribes are haplotypes frequently encountered in other Amerindian tribes. Nonetheless, striking differences were observed among Chibcha and non-Chibcha speaking tribes. The DRB1*04:04, DRB1*04:11, DRB1*09:01 carrying haplotypes were frequently found among non-Chibcha speaking tribes, while the DRB1*04:07 haplotype showed significant frequencies among Chibcha speaking tribes, and only marginal frequencies among non-Chibcha speaking tribes. Our results suggest that the differences in MHC class II haplotype frequency found among Chibcha and non-Chibcha speaking tribes could be due to genetic differentiation in Mesoamerica of the ancestral Amerindian population into Chibcha and non-Chibcha speaking populations before they entered into South America. PMID:23885196
Promoter variants of Xa23 alleles affect bacterial blight resistance and evolutionary pattern

PubMed Central

Xu, Feifei; Tang, Yongchao; Gao, Ying

2017-01-01

Bacterial blight, caused by Xanthomonas oryzae pv. oryzae (Xoo), is the most important bacterial disease in rice (Oryza sativa L.). Our previous studies have revealed that the bacterial blight resistance gene Xa23 from wild rice O. rufipogon Griff. confers the broadest-spectrum resistance against all the naturally occurring Xoo races. As a novel executor R gene, Xa23 is transcriptionally activated by the bacterial avirulence (Avr) protein AvrXa23 via binding to a 28-bp DNA element (EBEAvrXa23) in the promoter region. So far, the evolutionary mechanism of Xa23 remains to be illustrated. Here, a rice germplasm collection of 97 accessions, including 29 rice cultivars (indica and japonica) and 68 wild relatives, was used to analyze the evolution, phylogeographic relationship and association of Xa23 alleles with bacterial blight resistance. All the ~ 473 bp DNA fragments consisting of promoter and coding regions of Xa23 alleles in the germplasm accessions were PCR-amplified and sequenced, and nine single nucleotide polymorphisms (SNPs) were detected in the promoter regions (~131 bp sequence upstream from the start codon ATG) of Xa23/xa23 alleles while only two SNPs were found in the coding regions. The SNPs in the promoter regions formed 5 haplotypes (Pro-A, B, C, D, E) which showed no significant difference in geographic distribution among these 97 rice accessions. However, haplotype association analysis indicated that Pro-A is the most favored haplotype for bacterial blight resistance. Moreover, SNP changes among the 5 haplotypes mostly located in the EBE/ebe regions (EBEAvrXa23 and corresponding ebes located in promoters of xa23 alleles), confirming that the EBE region is the key factor to confer bacterial blight resistance by altering gene expression. Polymorphism analysis and neutral test implied that Xa23 had undergone a bottleneck effect, and selection process of Xa23 was not detected in cultivated rice. In addition, the Xa23 coding region was found highly conserved in the Oryza genus but absent in other plant species by searching the plant database, suggesting that Xa23 originated along with the diversification of the Oryza genus from the grass family during evolution. This research offers a potential for flexible use of novel Xa23 alleles in rice breeding programs and provide a model for evolution analysis of other executor R genes. PMID:28982185
Promoter variants of Xa23 alleles affect bacterial blight resistance and evolutionary pattern.

PubMed

Cui, Hua; Wang, Chunlian; Qin, Tengfei; Xu, Feifei; Tang, Yongchao; Gao, Ying; Zhao, Kaijun

2017-01-01

Bacterial blight, caused by Xanthomonas oryzae pv. oryzae (Xoo), is the most important bacterial disease in rice (Oryza sativa L.). Our previous studies have revealed that the bacterial blight resistance gene Xa23 from wild rice O. rufipogon Griff. confers the broadest-spectrum resistance against all the naturally occurring Xoo races. As a novel executor R gene, Xa23 is transcriptionally activated by the bacterial avirulence (Avr) protein AvrXa23 via binding to a 28-bp DNA element (EBEAvrXa23) in the promoter region. So far, the evolutionary mechanism of Xa23 remains to be illustrated. Here, a rice germplasm collection of 97 accessions, including 29 rice cultivars (indica and japonica) and 68 wild relatives, was used to analyze the evolution, phylogeographic relationship and association of Xa23 alleles with bacterial blight resistance. All the ~ 473 bp DNA fragments consisting of promoter and coding regions of Xa23 alleles in the germplasm accessions were PCR-amplified and sequenced, and nine single nucleotide polymorphisms (SNPs) were detected in the promoter regions (~131 bp sequence upstream from the start codon ATG) of Xa23/xa23 alleles while only two SNPs were found in the coding regions. The SNPs in the promoter regions formed 5 haplotypes (Pro-A, B, C, D, E) which showed no significant difference in geographic distribution among these 97 rice accessions. However, haplotype association analysis indicated that Pro-A is the most favored haplotype for bacterial blight resistance. Moreover, SNP changes among the 5 haplotypes mostly located in the EBE/ebe regions (EBEAvrXa23 and corresponding ebes located in promoters of xa23 alleles), confirming that the EBE region is the key factor to confer bacterial blight resistance by altering gene expression. Polymorphism analysis and neutral test implied that Xa23 had undergone a bottleneck effect, and selection process of Xa23 was not detected in cultivated rice. In addition, the Xa23 coding region was found highly conserved in the Oryza genus but absent in other plant species by searching the plant database, suggesting that Xa23 originated along with the diversification of the Oryza genus from the grass family during evolution. This research offers a potential for flexible use of novel Xa23 alleles in rice breeding programs and provide a model for evolution analysis of other executor R genes.
Phylogeography, genetic variability and structure of Acanthamoeba metapopulations in Iran inferred by 18S ribosomal RNA sequences: A systematic review and meta-analysis.

PubMed

Spotin, Adel; Moslemzadeh, Hamid Reza; Mahami-Oskouei, Mahmoud; Ahmadpour, Ehsan; Niyyati, Maryam; Hejazi, Seyed Hossein; Memari, Fatemeh; Noori, Jafar

2017-09-01

To verify phylogeography and genetic structure of Acanthamoeba populations among the Iranian clinical isolates and natural/artificial environments distributed in various regions of the country. We searched electronic databases including Medline, PubMed, Science Direct, Scopus and Google Scholar from 2005 to 2016. To explore the genetic variability of Acanthamoeba sp, 205 sequences were retrieved from keratitis patients, immunosuppressed cases and environmental sources as of various geographies of Iran. T4 genotype was the predominant strain in Iran, and the rare genotypes belonged to T2, T3, T5 (Acanthamoeba lenticulata), T6, T9, T11, T13 and T15 (Acanthamoeba jacobsi). A total of 47 unique haplotypes of T4 were identified. A parsimonious network of the sequence haplotypes demonstrated star-like feature containing haplogroups IR6 (34.1%) and IR7 (31.2%) as the most common haplotypes. In accordance with the analysis of molecular variance, the high value of haplotype diversity (0.612-0.848) of Acanthamoeba T4 represented genetic variability within populations. Neutrality indices of the 18S ribosomal RNA demonstrated negative values in all populations which represented a considerable divergence from neutrality. The majority of genetic diversity belonged to the infected contact lens and dust samples in immunodeficiency and ophthalmology wards, which indicated potential routes for exposure to a pathogenic Acanthamoeba sp. in at-risk individuals. A pairwise fixation index (F ST ) was from low to high values (0.02433-0.41892). The statistically F ST points out that T4 is genetically differentiated between north-west, north-south and central-south metapopulations, but not differentiated between west-central, west-south, central-south, and north-central isolates. An occurrence of IR6 and IR7 displays that possibly a gene flow of Acanthamoeba T4 occurred after the founder effect or bottleneck experience through ecological changes or host mobility. This is the first systematic review and meta-analysis providing new approaches into gene migration and transmission patterns of Acanthamoeba sp, and targeting at the high-risk individuals/sources among the various regions of Iran. Copyright © 2017 Hainan Medical University. Production and hosting by Elsevier B.V. All rights reserved.
HLA-A, -B, -C, -DRB1 and -DQB1 alleles and haplotypes in 951 Southeast Asia Malays from Peninsular Malaysia.

PubMed

Tan, Lay-Kim; Mohd-Farid, Baharin; Salsabil, Sulaiman; Heselynn, Hussein; Wahinuddin, Sulaiman; Lau, Ing-Soo; Gun, Suk-Chyn; Nor-Suhaila, Sharil; Eashwary, M; Mohd-Shahrir, Mohamed Said; Ainon, Mohd-Mokhtar; Azmillah, Rosman; Muhaini, Othman; Shahnaz, Murad; Too, Chun-Lai

2016-10-01

A total of 951 Southeast Asia Malays from Peninsular Malaysia were genotyped for HLA-A, -B, -C -DRB1, and -DQB1 loci using polymerase chain reaction sequence-specific oligonucleotide probe hybridization methods. In this report, there were significant deviation from Hardy-Weinberg proportions for the HLA-A (p<0.0001), -B (p<0.0001), -DRB1 (p<0.0001) and -DQB1 (p<0.01) loci. Minor deviations from HWEP were detected for HLA-C (p=0.01). This genotype data was available in Allele Frequencies Network Database (AFND) Gonzalez-Galarza et al. (2015). Copyright © 2016. Published by Elsevier Inc.
The distribution of HLA haplotypes in the ethnic groups that make up the Brazilian Bone Marrow Volunteer Donor Registry (REDOME).

PubMed

Halagan, Michael; Oliveira, Danielli Cristina; Maiers, Martin; Fabreti-Oliveira, Raquel A; Moraes, Maria Elisa Hue; Visentainer, Jeane Eliete Laguila; Pereira, Noemi Farah; Romero, Matilde; Cardoso, Juliana Fernandes; Porto, Luís Cristóvão

2018-04-26

The Registries of Bone Marrow Donors around the world include more than 30 million volunteer donors from 57 different countries, and were responsible for over 17,000 hematopoietic stem cell transplants in 2016. The Brazilian Bone Marrow Volunteer Donor Registry (REDOME) was established in 1993 and is the third largest registry in the world with more than 4.3 million donors. We characterized HLA allele and haplotypes frequencies from REDOME comparing them with the donor self-reported race group classification. Five-locus haplotype frequencies (A~C~B~DRB1~DQB1) were estimated for each of the six race groups, resolving phase and allelic ambiguity using the expectation-maximization (EM) algorithm. The top 100 haplotypes in the race groups were separated into eight clusters of haplotypes, based on haplotype similarity, using CLUTO. We present HLA allele and haplotype frequency data from six race groups from 2,938,259 individuals from REDOME. The most frequent haplotype was the same for all groups: A*01:01g~C*07:01g~B*08:01g~DRB1*03:01g~DQB1*02:01g. Some frequent haplotypes such as A*02:01g~C*16:01g~B*44:03~DRB1*07:01g~DQB1*02:01g was not found in people with Preta (Sub-Saharan African descent). A cluster including Branca (European) and Parda or non-informed (admixed) could be distinguished from both Preta (SubSaharan) and Indígena (Amerindian) groups, and from the Amarela (Asian) ones, which clustered with their original population. These results have implications on cross-population matching and can help in donor searches and population-based recruitment strategies.
Molecular analysis and association with clinical and laboratory manifestations in children with sickle cell anemia

PubMed Central

Camilo-Araújo, Roberta Faria; Amancio, Olga Maria Silverio; Figueiredo, Maria Stella; Cabanãs-Pedro, Ana Carolina; Braga, Josefina Aparecida Pellegrini

2014-01-01

Objectives To analyze the frequency of βS-globin haplotypes and alpha-thalassemia, and their influence on clinical manifestations and the hematological profile of children with sickle cell anemia. Method The frequency of βS-globin haplotypes and alpha-thalassemia and any association with clinical and laboratorial manifestations were determined in 117 sickle cell anemia children aged 3–71 months. The confirmation of hemoglobin SS and determination of the haplotypes were achieved by polymerase chain reaction-restriction fragment length polymorphism, and alpha-thalassemia genotyping was by multiplex polymerase chain reaction (single-tube multiplex-polymerase chain reaction). Results The genotype distribution of haplotypes was 43 (36.7%) Central African Republic/Benin, 41 (35.0%) Central African Republic/Central African Republic, 20 (17.0%) Rare/atypical, and 13 (11.1%) Benin/Benin. The frequency of the α3.7 deletion was 1.71% as homozygous (−α3.7/−α3.7) and 11.9% as heterozygous (−α3.7/αα). The only significant association in respect to haplotypes was related to the mean corpuscular volume. The presence of alpha-thalassemia was significantly associated to decreases in mean corpuscular volume, mean corpuscular hemoglobin and reticulocyte count and to an increase in the red blood cell count. There were no significant associations of βS-globin haplotypes and alpha-thalassemia with clinical manifestations. Conclusions In the study population, the frequency of alpha-thalassemia was similar to published data in Brazil with the Central African Republic haplotype being the most common, followed by the Benin haplotype. βS-globin haplotypes and interaction between alpha-thalassemia and sickle cell anemia did not influence fetal hemoglobin concentrations or the number of clinical manifestations. PMID:25305165
The "Sardinian" HLA-A30,B18,DR3,DQw2 haplotype constantly lacks the 21-OHA and C4B genes. Is it an ancestral haplotype without duplication?

PubMed

Contu, L; Carcassi, C; Dausset, J

1989-01-01

The C4 and 21-OH loci of the class III HLA have been studied by specific DNA probes and the restriction enzyme Taq 1 in 24 unrelated Sardinian individuals selected from completely HLA-typed families. All 24 individuals had the HLA extended haplotype A30,Cw5,B18, BfF1,DR3,DRw52,DQw2, named "Sardinian" in the present paper because of its frequency of 15% in the Sardinian population. Eighteen of these were homozygous for the entire haplotype, and six were heterozygous at the A locus and blank (or homozygous) at all the other loci. In all completely homozygous cells and in four heterozygous cells at the A locus, the restriction fragments of the 21-OHA (3.2 kb) and C4B (5.8 kb or 5.4 kb) genes were absent, and the fragments of the C4A (7.0 kb) and 21-OHB (3.7 kb) genes were present. It is suggested that the "Sardinian" haplotype is an ancestral haplotype without duplication of the C4 and 21-OH genes, practically always identical in its structure, also in unrelated individuals. The diversity of this haplotype in the class III region (about 30 kb less) may be at least partially responsible for its misalignment with most haplotypes, which have duplicated C4 and 21-OH genes, and therefore also for its decreased probability to recombine. This can help explain its high stability and frequency in the Sardinian population. The same conclusion can be suggested for the Caucasian extended haplotype A1,B8,DR3 that always seems to lack the C4A and 21-OHA genes.
A mixed integer linear programming model to reconstruct phylogenies from single nucleotide polymorphism haplotypes under the maximum parsimony criterion

PubMed Central

2013-01-01

Background Phylogeny estimation from aligned haplotype sequences has attracted more and more attention in the recent years due to its importance in analysis of many fine-scale genetic data. Its application fields range from medical research, to drug discovery, to epidemiology, to population dynamics. The literature on molecular phylogenetics proposes a number of criteria for selecting a phylogeny from among plausible alternatives. Usually, such criteria can be expressed by means of objective functions, and the phylogenies that optimize them are referred to as optimal. One of the most important estimation criteria is the parsimony which states that the optimal phylogeny T∗for a set H of n haplotype sequences over a common set of variable loci is the one that satisfies the following requirements: (i) it has the shortest length and (ii) it is such that, for each pair of distinct haplotypes hi,hj∈H, the sum of the edge weights belonging to the path from hi to hj in T∗ is not smaller than the observed number of changes between hi and hj. Finding the most parsimonious phylogeny for H involves solving an optimization problem, called the Most Parsimonious Phylogeny Estimation Problem (MPPEP), which is NP-hard in many of its versions. Results In this article we investigate a recent version of the MPPEP that arises when input data consist of single nucleotide polymorphism haplotypes extracted from a population of individuals on a common genomic region. Specifically, we explore the prospects for improving on the implicit enumeration strategy of implicit enumeration strategy used in previous work using a novel problem formulation and a series of strengthening valid inequalities and preliminary symmetry breaking constraints to more precisely bound the solution space and accelerate implicit enumeration of possible optimal phylogenies. We present the basic formulation and then introduce a series of provable valid constraints to reduce the solution space. We then prove that these constraints can often lead to significant reductions in the gap between the optimal solution and its non-integral linear programming bound relative to the prior art as well as often substantially faster processing of moderately hard problem instances. Conclusion We provide an indication of the conditions under which such an optimal enumeration approach is likely to be feasible, suggesting that these strategies are usable for relatively large numbers of taxa, although with stricter limits on numbers of variable sites. The work thus provides methodology suitable for provably optimal solution of some harder instances that resist all prior approaches. PMID:23343437
Construction and comparative evaluation of different activity detection methods in brain FDG-PET.

PubMed

Buchholz, Hans-Georg; Wenzel, Fabian; Gartenschläger, Martin; Thiele, Frank; Young, Stewart; Reuss, Stefan; Schreckenberger, Mathias

2015-08-18

We constructed and evaluated reference brain FDG-PET databases for usage by three software programs (Computer-aided diagnosis for dementia (CAD4D), Statistical Parametric Mapping (SPM) and NEUROSTAT), which allow a user-independent detection of dementia-related hypometabolism in patients' brain FDG-PET. Thirty-seven healthy volunteers were scanned in order to construct brain FDG reference databases, which reflect the normal, age-dependent glucose consumption in human brain, using either software. Databases were compared to each other to assess the impact of different stereotactic normalization algorithms used by either software package. In addition, performance of the new reference databases in the detection of altered glucose consumption in the brains of patients was evaluated by calculating statistical maps of regional hypometabolism in FDG-PET of 20 patients with confirmed Alzheimer's dementia (AD) and of 10 non-AD patients. Extent (hypometabolic volume referred to as cluster size) and magnitude (peak z-score) of detected hypometabolism was statistically analyzed. Differences between the reference databases built by CAD4D, SPM or NEUROSTAT were observed. Due to the different normalization methods, altered spatial FDG patterns were found. When analyzing patient data with the reference databases created using CAD4D, SPM or NEUROSTAT, similar characteristic clusters of hypometabolism in the same brain regions were found in the AD group with either software. However, larger z-scores were observed with CAD4D and NEUROSTAT than those reported by SPM. Better concordance with CAD4D and NEUROSTAT was achieved using the spatially normalized images of SPM and an independent z-score calculation. The three software packages identified the peak z-scores in the same brain region in 11 of 20 AD cases, and there was concordance between CAD4D and SPM in 16 AD subjects. The clinical evaluation of brain FDG-PET of 20 AD patients with either CAD4D-, SPM- or NEUROSTAT-generated databases from an identical reference dataset showed similar patterns of hypometabolism in the brain regions known to be involved in AD. The extent of hypometabolism and peak z-score appeared to be influenced by the calculation method used in each software package rather than by different spatial normalization parameters.

Use of dual-energy X-ray absorptiometry (DXA) for diagnosis and fracture risk assessment; WHO-criteria, T- and Z-score, and reference databases.

PubMed

Dimai, Hans P

2017-11-01

Dual-energy X-ray absorptiometry (DXA) is a two-dimensional imaging technology developed to assess bone mineral density (BMD) of the entire human skeleton and also specifically of skeletal sites known to be most vulnerable to fracture. In order to simplify interpretation of BMD measurement results and allow comparability among different DXA-devices, the T-score concept was introduced. This concept involves an individual's BMD which is then compared with the mean value of a young healthy reference population, with the difference expressed as a standard deviation (SD). Since the early nineties of the past century, the diagnostic categories "normal, osteopenia, and osteoporosis", as recommended by a WHO working Group, are based on this concept. Thus, DXA is still the globally accepted "gold-standard" method for the noninvasive diagnosis of osteoporosis. Another score obtained from DXA measurement, termed Z-score, describes the number of SDs by which the BMD in an individual differs from the mean value expected for age and sex. Although not intended for diagnosis of osteoporosis in adults, it nevertheless provides information about an individual's fracture risk compared to peers. DXA measurement can either be used as a "stand-alone" means in the assessment of an individual's fracture risk, or incorporated into one of the available fracture risk assessment tools such as FRAX® or Garvan, thus improving the predictive power of such tools. The issue which reference databases should be used by DXA-device manufacturers for T-score reference standards has been recently addressed by an expert group, who recommended use National Health and Nutrition Examination Survey III (NHANES III) databases for the hip reference standard but own databases for the lumbar spine. Furthermore, in men it is recommended use female reference databases for calculation of the T-score and use male reference databases for calculation of Z-score. Copyright © 2017 Elsevier Inc. All rights reserved.
HLA-A, B and DRB1 allele and haplotype frequencies in volunteer bone marrow donors from the north of Parana State.

PubMed

Bardi, Marlene Silva; Jarduli, Luciana Ribeiro; Jorge, Adylson Justino; Camargo, Rossana Batista Oliveira Godoy; Carneiro, Fernando Pagotto; Gelinski, Jair Roberto; Silva, Roseclei Assunção Feliciano; Lavado, Edson Lopes

2012-01-01

Knowledge of allele and haplotype frequencies of the human leukocyte antigen (HLA) system is important in the search for unrelated bone marrow donors. The Brazilian population is very heterogeneous and the HLA system is highly informative of populations because of the high level of polymorphisms. The aim of this study was to characterize the immunogenetic profile of ethnic groups (Caucasians, Afro-Brazilians and Asians) in the north of Parana State. A study was carried out of 3978 voluntary bone marrow donors registered in the Brazilian National Bone Marrow Donor Registry and typed for the HLA-A, B and DRB1 (low resolution) loci. The alleles were characterized by the polymerase chain reaction sequence-specific oligonucleotides method using the LabType SSO kit (One Lambda, CA, USA). The ARLEQUIN v.3.11 computer program was used to calculate allele and haplotype frequencies The most common alleles found in Caucasians were HLA-A*02, 24, 01; HLA-B*35, 44, 51; DRB1*11, 13, 07; for Afro-Brazilians they were HLA-A*02, 03, 30; HLA-B*35, 15, 44; DRB1*13, 11, 03; and for Asians they were: HLA-A*24, 02, 26; HLA-B*40, 51, 52; DRB1*04, 15, 09. The most common haplotype combinations were: HLA-A*01, B*08, DRB1*03 and HLA-A*29, B*44, DRB1*07 for Caucasians; HLA-A*29, B*44, DRB1*07 and HLA-A*01, B*08 and DRB1*03 for Afro-Brazilians; and HLA-A*24, B*52, DRB1*15 and HLA-A*24, B*40 and DRB1*09 for Asians. There is a need to target and expand bone marrow donor campaigns in the north of Parana State. The data of this study may be used as a reference by the Instituto Nacional de Cancer/Brazilian National Bone Marrow Donor Registry to evaluate the immunogenetic profile of populations in specific regions and in the selection of bone marrow donors.
HLA-A, B and DRB1 allele and haplotype frequencies in volunteer bone marrow donors from the north of Parana State

PubMed Central

Bardi, Marlene Silva; Jarduli, Luciana Ribeiro; Jorge, Adylson Justino; Camargo, Rossana Batista Oliveira Godoy; Carneiro, Fernando Pagotto; Gelinski, Jair Roberto; Silva, Roseclei Assunção Feliciano; Lavado, Edson Lopes

2012-01-01

Background Knowledge of allele and haplotype frequencies of the human leukocyte antigen (HLA) system is important in the search for unrelated bone marrow donors. The Brazilian population is very heterogeneous and the HLA system is highly informative of populations because of the high level of polymorphisms. Aim The aim of this study was to characterize the immunogenetic profile of ethnic groups (Caucasians, Afro-Brazilians and Asians) in the north of Parana State. Methods A study was carried out of 3978 voluntary bone marrow donors registered in the Brazilian National Bone Marrow Donor Registry and typed for the HLA-A, B and DRB1 (low resolution) loci. The alleles were characterized by the polymerase chain reaction sequence-specific oligonucleotides method using the LabType SSO kit (One Lambda, CA, USA). The ARLEQUIN v.3.11 computer program was used to calculate allele and haplotype frequencies Results The most common alleles found in Caucasians were HLA-A*02, 24, 01; HLA-B*35, 44, 51; DRB1*11, 13, 07; for Afro-Brazilians they were HLA-A*02, 03, 30; HLA-B*35, 15, 44; DRB1*13, 11, 03; and for Asians they were: HLA-A*24, 02, 26; HLA-B*40, 51, 52; DRB1*04, 15, 09. The most common haplotype combinations were: HLA-A*01, B*08, DRB1*03 and HLA-A*29, B*44, DRB1*07 for Caucasians; HLA-A*29, B*44, DRB1*07 and HLA-A*01, B*08 and DRB1*03 for Afro-Brazilians; and HLA-A*24, B*52, DRB1*15 and HLA-A*24, B*40 and DRB1*09 for Asians. Conclusion There is a need to target and expand bone marrow donor campaigns in the north of Parana State. The data of this study may be used as a reference by the Instituto Nacional de Cancer/Brazilian National Bone Marrow Donor Registry to evaluate the immunogenetic profile of populations in specific regions and in the selection of bone marrow donors PMID:23049380
International trades, local spread and viral evolution: the case of porcine circovirus type 2 (PCV2) strains heterogeneity in Italy.

PubMed

Franzo, Giovanni; Tucciarone, Claudia M; Dotto, Giorgia; Gigli, Alessandra; Ceglie, Letizia; Drigo, Michele

2015-06-01

Porcine circovirus type 2 is one of the most widespread and economically relevant infections of swine. Four genotypes have been recognized, but currently, only three (PCV2a, PCV2b and PCV2d) are effectively circulating. The widespread livestock trade and rapid viral evolution have contributed to determining the high heterogeneity of PCV2 and the dispersal of potentially more virulent strains. Italian swine farming and the related processing industry are relevant in the national economy. Despite the noteworthy losses associated with direct and control measure costs, no data are currently available on the molecular epidemiology of PCV2 in Italy. Our study, which was intended to fill this gap, considered 75 completed genome PCV2 sequences, which were obtained from samples collected from the highly densely populated area of Northern Italy between 2007 and 2014. Phylogenetic analysis and comparison with reference sequences demonstrated the co-circulation, with different prevalences, of PCV2a, PCV2b and PCV2d within the national borders, with PCV2b being the most prevalent. Recombination between different genotypes was also proven to be frequent. Phylogeographic analysis demonstrated that the marked variability of Italian PCV2 strains can be attributable to multiple introduction events. The comparison of the phylogenetic analysis results, the location of different haplotypes and the international commercial routs of live pigs allow the speculation of several links as well as the role of Italy as both an importer and exporter of PCV2 haplotypes, mainly from and to European and Asian countries. A similarly intricate contact network was demonstrated within national borders, with different haplotypes being detected in the same province and different provinces harbouring the same haplotype. Overall, this paper represents the first description of PCV2 in Italy and demonstrates that the high variability of circulating Italian strains is due to multiple introduction events, wide circulation within national boundaries and rapid viral evolution. Copyright © 2015 Elsevier B.V. All rights reserved.
A World Wide Web (WWW) server database engine for an organelle database, MitoDat.

PubMed

Lemkin, P F; Chipperfield, M; Merril, C; Zullo, S

1996-03-01

We describe a simple database search engine "dbEngine" which may be used to quickly create a searchable database on a World Wide Web (WWW) server. Data may be prepared from spreadsheet programs (such as Excel, etc.) or from tables exported from relationship database systems. This Common Gateway Interface (CGI-BIN) program is used with a WWW server such as available commercially, or from National Center for Supercomputer Algorithms (NCSA) or CERN. Its capabilities include: (i) searching records by combinations of terms connected with ANDs or ORs; (ii) returning search results as hypertext links to other WWW database servers; (iii) mapping lists of literature reference identifiers to the full references; (iv) creating bidirectional hypertext links between pictures and the database. DbEngine has been used to support the MitoDat database (Mendelian and non-Mendelian inheritance associated with the Mitochondrion) on the WWW.
Assessing transmission of ‘Candidatus Liberibacter solanacearum’ haplotypes through seed potato

USDA-ARS?s Scientific Manuscript database

Conflicting data has previously been reported concerning the impact of zebra chip disease transmission through seed tubers. These discrepancies may be due to the experimental design of each study, whereby different pathogen haplotypes, insect vector haplotypes, and potato plant varieties were used....
A new mathematical modeling for pure parsimony haplotyping problem.

PubMed

Feizabadi, R; Bagherian, M; Vaziri, H R; Salahi, M

2016-11-01

Pure parsimony haplotyping (PPH) problem is important in bioinformatics because rational haplotyping inference plays important roles in analysis of genetic data, mapping complex genetic diseases such as Alzheimer's disease, heart disorders and etc. Haplotypes and genotypes are m-length sequences. Although several integer programing models have already been presented for PPH problem, its NP-hardness characteristic resulted in ineffectiveness of those models facing the real instances especially instances with many heterozygous sites. In this paper, we assign a corresponding number to each haplotype and genotype and based on those numbers, we set a mixed integer programing model. Using numbers, instead of sequences, would lead to less complexity of the new model in comparison with previous models in a way that there are neither constraints nor variables corresponding to heterozygous nucleotide sites in it. Experimental results approve the efficiency of the new model in producing better solution in comparison to two state-of-the art haplotyping approaches. Copyright © 2016 Elsevier Inc. All rights reserved.
Mathematical properties and bounds on haplotyping populations by pure parsimony.

PubMed

Wang, I-Lin; Chang, Chia-Yuan

2011-06-01

Although the haplotype data can be used to analyze the function of DNA, due to the significant efforts required in collecting the haplotype data, usually the genotype data is collected and then the population haplotype inference (PHI) problem is solved to infer haplotype data from genotype data for a population. This paper investigates the PHI problem based on the pure parsimony criterion (HIPP), which seeks the minimum number of distinct haplotypes to infer a given genotype data. We analyze the mathematical structure and properties for the HIPP problem, propose techniques to reduce the given genotype data into an equivalent one of much smaller size, and analyze the relations of genotype data using a compatible graph. Based on the mathematical properties in the compatible graph, we propose a maximal clique heuristic to obtain an upper bound, and a new polynomial-sized integer linear programming formulation to obtain a lower bound for the HIPP problem. Copyright © 2011 Elsevier Inc. All rights reserved.
Association between endothelin type A receptor haplotypes and mortality in coronary heart disease.

PubMed

Ellis, Katrina L; Pilbrow, Anna P; Potter, Howard C; Frampton, Chris M; Doughty, Rob N; Whalley, Gillian A; Ellis, Chris J; Palmer, Barry R; Skelton, Lorraine; Yandle, Tim G; Troughton, Richard W; Richards, A Mark; A Cameron, Vicky

2012-05-01

The endothelin type A receptor, encoded by EDNRA, mediates the effects of endothelin-1 to promote vasoconstriction, vascular cell growth, adhesion, fibrosis and thrombosis. We investigated the association between EDNRA haplotype and cardiovascular outcomes in patients with coronary artery disease. Coronary disease patients (n = 1007) were genotyped for the His323His (rs5333) variant and one tag SNP from each of the major EDNRA haplotype blocks (rs6537484, rs1568136, rs5335 and rs10003447). EDNRA haplotype associations with clinical history, natriuretic peptides cardiac function and cardiovascular outcomes were tested over a median 3.8 years. Univariate analysis identified a 'low-risk' EDNRA haplotype associated with later age of Type 2 diabetes onset (p = 0.004) smaller BMI (p = 0.021), and reduced mortality (log rank p = 0.001). Cox proportional hazards analysis including established cardiovascular risk factors revealed an independent association between haplotype and mortality (p < 0.0001). These data highlight the potential importance of the endothelin system, and in particular EDNRA in coronary disease.
AN ASSESSMENT OF GROUND TRUTH VARIABILITY USING A "VIRTUAL FIELD REFERENCE DATABASE"

EPA Science Inventory

A "Virtual Field Reference Database (VFRDB)" was developed from field measurment data that included location and time, physical attributes, flora inventory, and digital imagery (camera) documentation foy 1,01I sites in the Neuse River basin, North Carolina. The sampling f...
A powerful approach reveals numerous expression quantitative trait haplotypes in multiple tissues.

PubMed

Ying, Dingge; Li, Mulin Jun; Sham, Pak Chung; Li, Miaoxin

2018-04-26

Recently many studies showed single nucleotide polymorphisms (SNPs) affect gene expression and contribute to development of complex traits/diseases in a tissue context-dependent manner. However, little is known about haplotype's influence on gene expression and complex traits, which reflects the interaction effect between SNPs. In the present study, we firstly proposed a regulatory region guided eQTL haplotype association analysis approach, and then systematically investigate the expression quantitative trait loci (eQTL) haplotypes in 20 different tissues by the approach. The approach has a powerful design of reducing computational burden by the utilization of regulatory predictions for candidate SNP selection and multiple testing corrections on non-independent haplotypes. The application results in multiple tissues showed that haplotype-based eQTLs not only increased the number of eQTL genes in a tissue specific manner, but were also enriched in loci that associated with complex traits in a tissue-matched manner. In addition, we found that tag SNPs of eQTL haplotypes from whole blood were selectively enriched in certain combination of regulatory elements (e.g. promoters and enhancers) according to predicted chromatin states. In summary, this eQTL haplotype detection approach, together with the application results, shed insights into synergistic effect of sequence variants on gene expression and their susceptibility to complex diseases. The executable application "eHaplo" is implemented in Java and is publicly available at http://grass.cgs.hku.hk/limx/ehaplo/. jonsonfox@gmail.com, limiaoxin@mail.sysu.edu.cn. Supplementary data are available at Bioinformatics online.
Linkage Disequilibrium and Haplotype Diversity in the Genes of the Renin–Angiotensin System: Findings From the Family Blood Pressure Program

PubMed Central

Zhu, Xiaofeng; Yan, Denise; Cooper, Richard S.; Luke, Amy; Ikeda, Morna A.; Chang, Yen-Pei C.; Weder, Alan; Chakravarti, Aravinda

2003-01-01

Association studies of candidate genes with complex traits have generally used one or a few single nucleotide polymorphisms (SNPs), although variation in the extent of linkage disequilibrium (LD) within genes markedly influences the sensitivity and precision of association studies. The extent of LD and the underlying haplotype structure for most candidate genes are still unavailable. We sampled 193 blacks (African-Americans) and 160 whites (European-Americans) and estimated the intragenic LD and the haplotype structure in four genes of the renin–angiotensin system. We genotyped 25 SNPs, with all but one of the pairs spaced between 1 and 20 kb, thus providing resolution at small scale. The pattern of LD within a gene was very heterogeneous. Using a robust method to define haplotype blocks, blocks of limited haplotype diversity were identified at each locus; between these blocks, LD was lost owing to the history of recombination events. As anticipated, there was less LD among blacks, the number of haplotypes was substantially larger, and shorter haplotype segments were found, compared with whites. These findings have implications for candidate-gene association studies and indicate that variation between populations of European and African origin in haplotype diversity is characteristic of most genes. [The sequence data described in this paper are available in GenBank under the following accession nos: AGT, MIM 106150; Renin, MIM 179820; ACE, MIM 106180; Angiotensin receptor I, MIM 106165. Supplementary material is available online at http://www.genome.org.] PMID:12566395
Alpha-globin gene haplotypes in South American Indians.

PubMed

Zago, M A; Melo Santos, E J; Clegg, J B; Guerreiro, J F; Martinson, J J; Norwich, J; Figueiredo, M S

1995-08-01

The haplotypes of the alpha-globin gene cluster were determined for 99 Indians from the Brazilian Amazon region who belong to 5 tribes: Wayampí, Wayana-Apalaí, Kayapó, Arára, and Yanomámi. Three predominant haplotypes were identified: Ia (present in 38.9% of chromosomes), IIIa (25.8%), and IIe (22.1%). The only alpha-globin gene rearrangement detected was alpha alpha alpha 3.7 I gene triplication associated with haplotype IIIa, found in high frequencies (5.6% and 10.6%) in two tribes and absent in the others. alpha-Globin gene deletions that cause alpha-thalassemia were not seen, supporting the argument that malaria was absent in these populations until recently. The heterogeneous distribution of alpha-globin gene haplotypes and rearrangements among the different tribes differs markedly from the homogeneous distribution of beta-globin gene cluster haplotypes and reflects the action of various genetic mechanisms (genetic drift, founder effect, consanguinity) on small isolated population groups with a complicated history of divergence-fusion events. The alpha-globin gene haplotype distribution has some similarities to distributions observed in Southeast Asian and Pacific Island populations, indicating that these populations have considerable genetic affinities. However, the absence of several features of the alpha-globin gene cluster that are consistently present among the Pacific Islanders suggests that the similarity of haplotypes between Brazilian Indians and people from Polynesia, Micronesia, and Melanesia is more likely to result of ancient common ancestry rather than the consequence of recent direct genetic contribution through immigration.
Investigating biogeographic boundaries of the Sunda shelf: A phylogenetic analysis of two island populations of Macaca fascicularis.

PubMed

Klegarth, A R; Sanders, S A; Gloss, A D; Lane-deGraaf, K E; Jones-Engel, L; Fuentes, A; Hollocher, H

2017-08-01

Cyclical submergence and re-emergence of the Sunda Shelf throughout the Pleistocene served as a dynamic biogeographic landscape, across which long-tailed macaques (Macaca fascicularis) have migrated and evolved. Here, we tested the integrity of the previously reported continental-insular haplotype divide reported among Y and mitochondrial DNA lineages across multiple studies. The continental-insular haplotype divide was tested by heavily sampling wild macaques from two important biogeographic regions within Sundaland: (1) Singapore, the southernmost tip of continental Asia and (2) Bali, Indonesia, the southeastern edge of the Indonesian archipelago, immediately west of Wallace's line. Y DNA was haplotyped for samples from Bali, deep within the Indonesian archipelago. Mitochondrial D-loop from both islands was analyzed against existing data using Maximum Likelihood and Bayesian approaches. We uncovered both "continental" and "insular" Y DNA haplotypes in Bali. Between Singapore and Bali we found 52 unique mitochondrial haplotypes, none of which had been previously described. Phylogenetic analyses confirmed a major haplogroup division within Singapore and identified five new Singapore subclades and two primary subclades in Bali. While we confirmed the continental-insular divide among mtDNA haplotypes, maintenance of both Y DNA haplotypes on Bali, deep within the Indonesian archipelago calls into question the mechanism by which Y DNA diversity has been maintained. It also suggests the continental-insular designation is less appropriate for Y DNA, leading us to propose geographically neutral Y haplotype designations. © 2017 Wiley Periodicals, Inc.
Tryptophan Hydroxylase 2 haplotype association with borderline personality disorder and aggression in a sample of patients with personality disorders and healthy controls

PubMed Central

Perez-Rodriguez, M. Mercedes; Weinstein, Shauna; New, Antonia S.; Bevilacqua, Laura; Yuan, Qiaoping; Zhou, Zhifeng; Hodgkinson, Colin; Goodman, Marianne; Koenigsberg, Harold W.; Goldman, David; Siever, Larry J.

2010-01-01

Background There is decreased serotonergic function in impulsive aggression and borderline personality disorder (BPD), and genetic association studies suggest a role of serotonergic genes in impulsive aggression and BPD. Only one study has analyzed the association between the tryptophan-hydroxylase 2 (TPH2) gene and BPD. A TPH2 “risk” haplotype has been described that is associated with anxiety, depression and suicidal behavior. Methods We assessed the relationship between the previously identified “risk” haplotype at the TPH2 locus and BPD diagnosis, impulsive aggression, affective lability, and suicidal/parasuicidal behaviors, in a well-characterized clinical sample of 103 healthy controls (HCs) and 251 patients with personality disorders (109 with BPD). A logistic regression including measures of depression, affective lability and aggression scores in predicting “risk” haplotype was conducted. Results The prevalence of the “risk” haplotype was significantly higher in patients with BPD compared to HCs. Those with the “risk” haplotype have higher aggression and affect lability scores and more suicidal/parasuicidal behaviors than those without it. In the logistic regression model, affect lability was the only significant predictor and it correctly classified 83.1% of the subjects as “risk” or “non-risk” haplotype carriers. Conclusions We found an association between the previously described TPH2 “risk” haplotype and BPD diagnosis, affective lability, suicidal/parasuicidal behavior, and aggression scores. PMID:20451217
Genetic variability of populations of Nyssomyia neivai in the Northern State of Paraná, Brazil

PubMed Central

Gasparotto, Jaqueline de Carvalho; da Costa-Ribeiro, Magda Clara Vieira; Thomaz-Soccol, Vanete; Liebel, Sandra Mara Rodrigues da Silva; Neitzke-Abreu, Herintha Coeto; Reinhold-Castro, Kárin Rosi; Cristovão, Edilson Colhera; Teodoro, Ueslei

2017-01-01

ABSTRACT The genetic study of sandfly populations needs to be further explored given the importance of these insects for public health. Were sequenced the NDH4 mitochondrial gene from populations of Nyssomyia neivai from Doutor Camargo, Lobato, Japira, and Porto Rico, municipalities in the State of Paraná, Brazil, to understand the genetic structure and gene flow. Eighty specimens of Ny. Neivai were sequenced, 20 from each municipality, and 269 base pairs were obtained. A total of 27 haplotypes and 28 polymorphic sites were found, along with a haplotypic diversity of 0.80696 and a nucleotide diversity of 0.00567. Haplotype H5, with 33 specimens, was the most common among the four populations. Only haplotypes H5 and H7 were present in all four populations. The population from Doutor Camargo showed the highest genetic diversity, and only this population shared haplotypes with those from the other municipalities. The highest number of haplotypes was sheared with Lobato which also had the highest number of unique haplotypes. This probably occurred because of constant anthropic changes that happened in the environment during the first half of the twentieth century, mainly after 1998. There was no significant correlation between genetic and geographical distances regarding these populations. However, the highest genetic and geographical distances, and the lowest gene flow were observed between Japira and Porto Rico. Geographical distance is a possible barrier between these municipalities through the blocking of haplotype sharing. PMID:28380111
Mutation Analysis in Classical Phenylketonuria Patients Followed by Detecting Haplotypes Linked to Some PAH Mutations.

PubMed

Dehghanian, Fatemeh; Silawi, Mohammad; Tabei, Seyed M B

2017-02-01

Deficiency of phenylalanine hydroxylase (PAH) enzyme and elevation of phenylalanine in body fluids cause phenylketonuria (PKU). The gold standard for confirming PKU and PAH deficiency is detecting causal mutations by direct sequencing of the coding exons and splicing involved sequences of the PAH gene. Furthermore, haplotype analysis could be considered as an auxiliary approach for detecting PKU causative mutations before direct sequencing of the PAH gene by making comparisons between prior detected mutation linked-haplotypes and new PKU case haplotypes with undetermined mutations. In this study, 13 unrelated classical PKU patients took part in the study detecting causative mutations. Mutations were identified by polymerase chain reaction (PCR) and direct sequencing in all patients. After that, haplotype analysis was performed by studying VNTR and PAHSTR markers (linked genetic markers of the PAH gene) through application of PCR and capillary electrophoresis (CE). Mutation analysis was performed successfully and the detected mutations were as follows: c.782G>A, c.754C>T, c.842C>G, c.113-115delTCT, c.688G>A, and c.696A>G. Additionally, PAHSTR/VNTR haplotypes were detected to discover haplotypes linked to each mutation. Mutation detection is the best approach for confirming PAH enzyme deficiency in PKU patients. Due to the relatively large size of the PAH gene and high cost of the direct sequencing in developing countries, haplotype analysis could be used before DNA sequencing and mutation detection for a faster and cheaper way via identifying probable mutated exons.
RefPrimeCouch—a reference gene primer CouchApp

PubMed Central

Silbermann, Jascha; Wernicke, Catrin; Pospisil, Heike; Frohme, Marcus

2013-01-01

To support a quantitative real-time polymerase chain reaction standardization project, a new reference gene database application was required. The new database application was built with the explicit goal of simplifying not only the development process but also making the user interface more responsive and intuitive. To this end, CouchDB was used as the backend with a lightweight dynamic user interface implemented client-side as a one-page web application. Data entry and curation processes were streamlined using an OpenRefine-based workflow. The new RefPrimeCouch database application provides its data online under an Open Database License. Database URL: http://hpclife.th-wildau.de:5984/rpc/_design/rpc/view.html PMID:24368831
RefPrimeCouch--a reference gene primer CouchApp.

PubMed

Silbermann, Jascha; Wernicke, Catrin; Pospisil, Heike; Frohme, Marcus

2013-01-01

To support a quantitative real-time polymerase chain reaction standardization project, a new reference gene database application was required. The new database application was built with the explicit goal of simplifying not only the development process but also making the user interface more responsive and intuitive. To this end, CouchDB was used as the backend with a lightweight dynamic user interface implemented client-side as a one-page web application. Data entry and curation processes were streamlined using an OpenRefine-based workflow. The new RefPrimeCouch database application provides its data online under an Open Database License. Database URL: http://hpclife.th-wildau.de:5984/rpc/_design/rpc/view.html.
Optics survivability support, volume 2

NASA Astrophysics Data System (ADS)

Wild, N.; Simpson, T.; Busdeker, A.; Doft, F.

1993-01-01

This volume of the Optics Survivability Support Final Report contains plots of all the data contained in the computerized Optical Glasses Database. All of these plots are accessible through the Database, but are included here as a convenient reference. The first three pages summarize the types of glass included with a description of the radiation source, test date, and the original data reference. This information is included in the database as a macro button labeled 'LLNL DATABASE'. Following this summary is an Abbe chart showing which glasses are included and where they lie as a function of nu(sub d) and n(sub d). This chart is also callable through the database as a macro button labeled 'ABBEC'.

Hawaii bibliographic database

USGS Publications Warehouse

Wright, T.L.; Takahashi, T.J.

1998-01-01

The Hawaii bibliographic database has been created to contain all of the literature, from 1779 to the present, pertinent to the volcanological history of the Hawaiian-Emperor volcanic chain. References are entered in a PC- and Macintosh-compatible EndNote Plus bibliographic database with keywords and abstracts or (if no abstract) with annotations as to content. Keywords emphasize location, discipline, process, identification of new chemical data or age determinations, and type of publication. The database is updated approximately three times a year and is available to upload from an ftp site. The bibliography contained 8460 references at the time this paper was submitted for publication. Use of the database greatly enhances the power and completeness of library searches for anyone interested in Hawaiian volcanism.
Nuclear Science References Database

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pritychenko, B., E-mail: pritychenko@bnl.gov; Běták, E.; Singh, B.

2014-06-15

The Nuclear Science References (NSR) database together with its associated Web interface, is the world's only comprehensive source of easily accessible low- and intermediate-energy nuclear physics bibliographic information for more than 210,000 articles since the beginning of nuclear science. The weekly-updated NSR database provides essential support for nuclear data evaluation, compilation and research activities. The principles of the database and Web application development and maintenance are described. Examples of nuclear structure, reaction and decay applications are specifically included. The complete NSR database is freely available at the websites of the National Nuclear Data Center (http://www.nndc.bnl.gov/nsr) and the International Atomic Energymore » Agency (http://www-nds.iaea.org/nsr)« less
APPLICATION OF A "VITURAL FIELD REFERENCE DATABASE" TO ASSESS LAND-COVER MAP ACCURACIES

EPA Science Inventory

An accuracy assessment was performed for the Neuse River Basin, NC land-cover/use
(LCLU) mapping results using a "Virtual Field Reference Database (VFRDB)". The VFRDB was developed using field measurement and digital imagery (camera) data collected at 1,409 sites over a perio...
USDA National Nutrient Database for Standard Reference, release 28

USDA-ARS?s Scientific Manuscript database

The USDA National Nutrient Database for Standard Reference, Release 28 contains data for nearly 8,800 food items for up to 150 food components. SR28 replaces the previous release, SR27, originally issued in August 2014. Data in SR28 supersede values in the printed handbooks and previous electronic...
Missing data imputation and haplotype phase inference for genome-wide association studies

PubMed Central

Browning, Sharon R.

2009-01-01

Imputation of missing data and the use of haplotype-based association tests can improve the power of genome-wide association studies (GWAS). In this article, I review methods for haplotype inference and missing data imputation, and discuss their application to GWAS. I discuss common features of the best algorithms for haplotype phase inference and missing data imputation in large-scale data sets, as well as some important differences between classes of methods, and highlight the methods that provide the highest accuracy and fastest computational performance. PMID:18850115
Thermodynamics of Enzyme-Catalyzed Reactions Database

National Institute of Standards and Technology Data Gateway

SRD 74 Thermodynamics of Enzyme-Catalyzed Reactions Database (Web, free access) The Thermodynamics of Enzyme-Catalyzed Reactions Database contains thermodynamic data on enzyme-catalyzed reactions that have been recently published in the Journal of Physical and Chemical Reference Data (JPCRD). For each reaction the following information is provided: the reference for the data, the reaction studied, the name of the enzyme used and its Enzyme Commission number, the method of measurement, the data and an evaluation thereof.
BAsE-Seq: a method for obtaining long viral haplotypes from short sequence reads.

PubMed

Hong, Lewis Z; Hong, Shuzhen; Wong, Han Teng; Aw, Pauline P K; Cheng, Yan; Wilm, Andreas; de Sessions, Paola F; Lim, Seng Gee; Nagarajan, Niranjan; Hibberd, Martin L; Quake, Stephen R; Burkholder, William F

2014-01-01

We present a method for obtaining long haplotypes, of over 3 kb in length, using a short-read sequencer, Barcode-directed Assembly for Extra-long Sequences (BAsE-Seq). BAsE-Seq relies on transposing a template-specific barcode onto random segments of the template molecule and assembling the barcoded short reads into complete haplotypes. We applied BAsE-Seq on mixed clones of hepatitis B virus and accurately identified haplotypes occurring at frequencies greater than or equal to 0.4%, with >99.9% specificity. Applying BAsE-Seq to a clinical sample, we obtained over 9,000 viral haplotypes, which provided an unprecedented view of hepatitis B virus population structure during chronic infection. BAsE-Seq is readily applicable for monitoring quasispecies evolution in viral diseases.
Short communication: casein haplotype variability in sicilian dairy goat breeds.

PubMed

Gigli, I; Maizon, D O; Riggio, V; Sardina, M T; Portolano, B

2008-09-01

In the Mediterranean region, goat milk production is an important economic activity. In the present study, 4 casein genes were genotyped in 5 Sicilian goat breeds to 1) identify casein haplotypes present in the Argentata dell'Etna, Girgentana, Messinese, Derivata di Siria, and Maltese goat breeds; and 2) describe the structure of the Sicilian goat breeds based on casein haplotypes and allele frequencies. In a sample of 540 dairy goats, 67 different haplotypes with frequency >or=0.01 and 27 with frequency >or=0.03 were observed. The most common CSN1S1-CSN2-CSN1S2-CSN3 haplotype for Derivata di Siria and Maltese was FCFB (0.17 and 0.22, respectively), whereas for Argentata dell'Etna, Girgentana and Messinese was ACAB (0.06, 0.23, and 0.10, respectively). According to the haplotype reconstruction, Argentata dell'Etna, Girgentana, and Messinese breeds presented the most favorable haplotype for cheese production, because the casein concentration in milk of these breeds might be greater than that in Derivata di Siria and Maltese breeds. Based on a cluster analysis, the breeds formed 2 main groups: Derivata di Siria, and Maltese in one group, and Argentata dell'Etna and Messinese in the other; the Girgentana breed was between these groups but closer to the latter.
The JAK2 GGCC (46/1) Haplotype in Myeloproliferative Neoplasms: Causal or Random?

PubMed Central

Anelli, Luisa; Zagaria, Antonella; Specchia, Giorgina

2018-01-01

The germline JAK2 haplotype known as “GGCC or 46/1 haplotype” (haplotypeGGCC_46/1) consists of a combination of single nucleotide polymorphisms (SNPs) mapping in a region of about 250 kb, extending from the JAK2 intron 10 to the Insulin-like 4 (INLS4) gene. Four main SNPs (rs3780367, rs10974944, rs12343867, and rs1159782) generating a “GGCC” combination are more frequently indicated to represent the JAK2 haplotype. These SNPs are inherited together and are frequently associated with the onset of myeloproliferative neoplasms (MPN) positive for both JAK2 V617 and exon 12 mutations. The association between the JAK2 haplotypeGGCC_46/1 and mutations in other genes, such as thrombopoietin receptor (MPL) and calreticulin (CALR), or the association with triple negative MPN, is still controversial. This review provides an overview of the frequency and the role of the JAK2 haplotypeGGCC_46/1 in the pathogenesis of different myeloid neoplasms and describes the hypothetical mechanisms at the basis of the association with JAK2 gene mutations. Moreover, possible clinical implications are discussed, as different papers reported contrasting data about the correlation between the JAK2 haplotypeGGCC_46/1 and blood cell count, survival, or disease progression. PMID:29641446
Genetic polymorphisms in MDR1 and CYP3A4 genes in Asians and the influence of MDR1 haplotypes on cyclosporin disposition in heart transplant recipients.

PubMed

Chowbay, Balram; Cumaraswamy, Sivathasan; Cheung, Yin Bun; Zhou, Qingyu; Lee, Edmund J D

2003-02-01

Intestinal cytochrome P450 3A4 (CYP3A4) and P-glycoprotein (P-gp) both play a vital role in the metabolism of oral cyclosporine (CsA). We investigated the genetic polymorphisms in CYP3A4(promoter region and exons 5, 7 and 9) and MDR1 (exons 12, 21 and 26) genes and the impact of these polymorphisms on the pharmacokinetics of oral CsA in stable heart transplant patients (n = 14). CYP3A4 polymorphisms were rare in the Asian population and transplant patients. Haplotype analysis revealed 12 haplotypes in the Chinese, eight in the Malays and 10 in the Indians. T-T-T was the most common haplotype in all ethnic groups. The frequency of the homozygous mutant genotype at all three loci (TT-TT-TT) was highest in the Indians (31%) compared to 19% and 15% in the Chinese and Malays, respectively. In heart transplant patients, CsA exposure (AUC(0-4 h), AUC(0-12 h) and C(max)) was high in patients with the T-T-T haplotypes compared to those with C-G-C haplotypes. These findings suggest that haplotypes rather than genotypes influence CsA disposition in transplant patients.
Haplotype frequency distribution for 7 microsatellites in chromosome 8 and 11 in relation to the metabolic syndrome in four ethnic groups: Tehran Lipid and Glucose Study.

PubMed

Daneshpour, Maryam Sadat; Hosseinzadeh, Nima; Zarkesh, Maryam; Azizi, Fereidoun

2012-03-01

Different variants of haplotype frequencies may lead to various frequencies of the same variants in individuals with drug resistance and disease susceptibility at the population level. In this study, the haplotype frequencies of 4 STR loci including the D8S1132, D8S1779, D8S514 and D8S1743, and 3 STR loci including D11S1304, D11S1998 and D11S934 were investigated in 563 individuals of four Iranian ethnic groups in the capital city of Iran, Tehran. One hundred thirty subjects had the metabolic syndrome. Haplotype frequencies of all markers were calculated. There were significant differences in the haplotype frequencies in short and long alleles between the metabolic affected subjects and controls. In addition, haplotype frequencies were significant in the four ethnic groups in both chromosomes 8 and 11. Our findings show a relation between the short allele of D8S1743 in all related haplotype frequencies of subjects with metabolic syndrome. These findings may require more studies of some candidate genes, including the lipoprotein lipase gene, in this chromosomal region. Copyright © 2011. Published by Elsevier B.V.
A Relational Database System for Student Use.

ERIC Educational Resources Information Center

Fertuck, Len

1982-01-01

Describes an APL implementation of a relational database system suitable for use in a teaching environment in which database development and database administration are studied, and discusses the functions of the user and the database administrator. An appendix illustrating system operation and an eight-item reference list are attached. (Author/JL)
Common variants in immune and DNA repair genes and risk for human papillomavirus persistence and progression to cervical cancer.

PubMed

Wang, Sophia S; Bratti, M Concepcion; Rodríguez, Ana Cecilia; Herrero, Rolando; Burk, Robert D; Porras, Carolina; González, Paula; Sherman, Mark E; Wacholder, Sholom; Lan, Z Elizabeth; Schiffman, Mark; Chanock, Stephen J; Hildesheim, Allan

2009-01-01

We examined host genetic factors to identify those more common in individuals whose human papillomavirus (HPV) infections were most likely to persist and progress to cervical intraepithelial neoplasia grade 3 (CIN3) and cancer. We genotyped 92 single-nucleotide polymorphisms (SNPs) from 49 candidate immune response and DNA repair genes obtained from 469 women with CIN3 or cancer, 390 women with persistent HPV infections (median duration, 25 months), and 452 random control subjects from the 10,049-woman Guanacaste Costa Rica Natural History Study. We calculated odds ratios and 95% confidence intervals (CIs) for the association of SNP and haplotypes in women with CIN3 or cancer and HPV persistence, compared with random control subjects. A SNP in the Fanconi anemia complementation group A gene (FANCA) (G501S) was associated with increased risk of CIN3 or cancer. The AG and GG genotypes had a 1.3-fold (95% CI, 0.95-1.8-fold) and 1.7-fold (95% CI, 1.1-2.6-fold) increased risk for CIN3 or cancer, respectively (P(trend) = .008; referent, AA). The FANCA haplotype that included G501S also conferred increased risk of CIN3 or cancer, as did a different haplotype that included 2 other FANCA SNPs (G809A and T266A). A SNP in the innate immune gene IRF3 (S427T) was associated with increased risk for HPV persistence (P(trend) = .009). Our results require replication but support the role of FANCA variants in cervical cancer susceptibility and of IRF3 in HPV persistence.
Analyzing Mosquito (Diptera: Culicidae) Diversity in Pakistan by DNA Barcoding

PubMed Central

Ashfaq, Muhammad; Hebert, Paul D. N.; Mirza, Jawwad H.; Khan, Arif M.; Zafar, Yusuf; Mirza, M. Sajjad

2014-01-01

Background Although they are important disease vectors mosquito biodiversity in Pakistan is poorly known. Recent epidemics of dengue fever have revealed the need for more detailed understanding of the diversity and distributions of mosquito species in this region. DNA barcoding improves the accuracy of mosquito inventories because morphological differences between many species are subtle, leading to misidentifications. Methodology/Principal Findings Sequence variation in the barcode region of the mitochondrial COI gene was used to identify mosquito species, reveal genetic diversity, and map the distribution of the dengue-vector species in Pakistan. Analysis of 1684 mosquitoes from 491 sites in Punjab and Khyber Pakhtunkhwa during 2010–2013 revealed 32 species with the assemblage dominated by Culex quinquefasciatus (61% of the collection). The genus Aedes (Stegomyia) comprised 15% of the specimens, and was represented by six taxa with the two dengue vector species, Ae. albopictus and Ae. aegypti, dominant and broadly distributed. Anopheles made up another 6% of the catch with An. subpictus dominating. Barcode sequence divergence in conspecific specimens ranged from 0–2.4%, while congeneric species showed from 2.3–17.8% divergence. A global haplotype analysis of disease-vectors showed the presence of multiple haplotypes, although a single haplotype of each dengue-vector species was dominant in most countries. Geographic distribution of Ae. aegypti and Ae. albopictus showed the later species was dominant and found in both rural and urban environments. Conclusions As the first DNA-based analysis of mosquitoes in Pakistan, this study has begun the construction of a barcode reference library for the mosquitoes of this region. Levels of genetic diversity varied among species. Because of its capacity to differentiate species, even those with subtle morphological differences, DNA barcoding aids accurate tracking of vector populations. PMID:24827460
Long-distance gene flow and cross-Andean dispersal of lowland rainforest bees (Apidae: Euglossini) revealed by comparative mitochondrial DNA phylogeography.

PubMed

Dick, Christopher W; Roubik, David W; Gruber, Karl F; Bermingham, Eldredge

2004-12-01

Euglossine bees (Apidae; Euglossini) exclusively pollinate hundreds of orchid species and comprise up to 25% of bee species richness in neotropical rainforests. As one of the first studies of comparative phylogeography in a neotropical insect group, we performed a mitochondrial DNA (mtDNA)-based analysis of 14 euglossine species represented by populations sampled across the Andes and/or across the Amazon basin. The mtDNA divergences within species were consistently low; across the 12 monophyletic species the mean intraspecific divergence among haplotypes was 0.9% (range of means, 0-1.9%). The cytochrome oxidase 1 (CO1) divergence among populations separated by the Andes (N = 11 species) averaged 1.1% (range 0.0-2.0%). The mtDNA CO1 data set displayed homogeneous rates of nucleotide substitution, permitting us to infer dispersal across the cordillera long after the final Andean uplift based on arthropod molecular clocks of 1.2-1.5% divergence per million years. Gene flow across the 3000-km breadth of the Amazon basin was inferred from identical cross-Amazon haplotypes found in five species. Although mtDNA haplotypes for 12 of the 14 euglossine species were monophyletic, a reticulate CO1 phylogeny was recovered in Euglossa cognata and E. mixta, suggesting large ancestral populations and recent speciation. Reference to closely related outgroups suggested recent speciation for the majority of species. Phylogeographical structure across a broad spatial scale is weaker in euglossine bees than in any neotropical group previously examined, and may derive from a combination of Quaternary speciation, population expansion and/or long-distance gene flow.
Analyzing mosquito (Diptera: culicidae) diversity in Pakistan by DNA barcoding.

PubMed

Ashfaq, Muhammad; Hebert, Paul D N; Mirza, Jawwad H; Khan, Arif M; Zafar, Yusuf; Mirza, M Sajjad

2014-01-01

Although they are important disease vectors mosquito biodiversity in Pakistan is poorly known. Recent epidemics of dengue fever have revealed the need for more detailed understanding of the diversity and distributions of mosquito species in this region. DNA barcoding improves the accuracy of mosquito inventories because morphological differences between many species are subtle, leading to misidentifications. Sequence variation in the barcode region of the mitochondrial COI gene was used to identify mosquito species, reveal genetic diversity, and map the distribution of the dengue-vector species in Pakistan. Analysis of 1684 mosquitoes from 491 sites in Punjab and Khyber Pakhtunkhwa during 2010-2013 revealed 32 species with the assemblage dominated by Culex quinquefasciatus (61% of the collection). The genus Aedes (Stegomyia) comprised 15% of the specimens, and was represented by six taxa with the two dengue vector species, Ae. albopictus and Ae. aegypti, dominant and broadly distributed. Anopheles made up another 6% of the catch with An. subpictus dominating. Barcode sequence divergence in conspecific specimens ranged from 0-2.4%, while congeneric species showed from 2.3-17.8% divergence. A global haplotype analysis of disease-vectors showed the presence of multiple haplotypes, although a single haplotype of each dengue-vector species was dominant in most countries. Geographic distribution of Ae. aegypti and Ae. albopictus showed the later species was dominant and found in both rural and urban environments. As the first DNA-based analysis of mosquitoes in Pakistan, this study has begun the construction of a barcode reference library for the mosquitoes of this region. Levels of genetic diversity varied among species. Because of its capacity to differentiate species, even those with subtle morphological differences, DNA barcoding aids accurate tracking of vector populations.
Genetic variation at the microRNA binding site of CAV1 gene is associated with lung cancer susceptibility

PubMed Central

Fang, Xue; Li, Xuelian; Yin, Zhihua; Xia, Lingzi; Quan, Xiaowei; Zhao, Yuxia; Zhou, Baosen

2017-01-01

Single nucleotide polymorphism (SNP) may influence the genesis and development of cancer in a variety of ways depending on their location. Here we conducted a study in Chinese female non-smokers to investigate the relationship between rs1049337, rs926198 and the risk or survival of lung cancer. Further, we explored whether rs1049337 could alter the binding affinity between the mRNA of CAV1 and the corresponding microRNAs. Finally, we evaluated the relationship between expression level of CAV1 and prognosis of lung cancer. The results showed that the rs1049337-C allele and rs926198-C allele were the protective alleles of lung cancer risk. Haplotype analysis indicated that the C-C haplotype (constructed by rs1049337 and rs926198) was a protective haplotype for lung cancer risk. The result of luciferase reporter assay showed that rs1049337 can affect the binding affinity of CAV1 mRNA to the corresponding microRNAs both in A549 cell line and H1299 cell line. Compared with C allele, T allele had a relatively decreased luciferase activity. Compared with paired normal adjacent tissue or normal lung tissue, lung cancer tissue showed a relatively low level of CAV1. Refer to those patients at early stage of lung cancer, the expression level of CAV1 in patients at late stage of lung cancer was relatively low. In conclusion, the results indicated that rs1049337, it's a SNP located at 3′UTR region of CAV1 may affect lung cancer risk by altering the binding affinity between the mRNA of CAV1 and the corresponding microRNAs. PMID:29190968
Phylogenetic evidence for the ancient Himalayan wolf: towards a clarification of its taxonomic status based on genetic sampling from western Nepal

PubMed Central

Kaden, Jennifer; Joshi, Jyoti; Bhattarai, Susmita; Kusi, Naresh; Sillero-Zubiri, Claudio; Macdonald, David W.

2017-01-01

Wolves in the Himalayan region form a monophyletic lineage distinct from the present-day Holarctic grey wolf Canis lupus spp. (Linnaeus 1758) found across Eurasia and North America. Here, we analyse phylogenetic relationships and the geographic distribution of mitochondrial DNA haplotypes of the contemporary Himalayan wolf (proposed in previous studies as Canis himalayensis) found in Central Asia. We combine genetic data from a living Himalayan wolf population collected in northwestern Nepal in this study with already published genetic data, and confirm the Himalayan wolf lineage based on mitochondrial genomic data (508 bp cytochrome b and 242 bp D-loop), and X- and Y-linked zinc-finger protein gene (ZFX and ZFY) sequences. We then compare the genetic profile of the Himalayan wolf lineage found in northwestern Nepal with canid reference sequences from around the globe with maximum likelihood and Bayesian phylogeny building methods to demonstrate that the Himalayan wolf forms a distinct monophyletic clade supported by posterior probabilities/bootstrap for D-loop of greater than 0.92/85 and cytochrome b greater than 0.99/93. The Himalayan wolf shows a unique Y-chromosome (ZFY) haplotype, and shares an X-chromosome haplotype (ZFX) with the newly postulated African wolf. Our results imply that the Himalayan wolf distribution range extends from the Himalayan range north across the Tibetan Plateau up to the Qinghai Lakes region in Qinghai Province in the People's Republic of China. Based on its phylogenetic distinction and its older age of divergence relative to the Holarctic grey wolf, the Himalayan wolf merits formal classification as a distinct taxon of special conservation concern. PMID:28680672
Proteomics Analysis of Bladder Cancer Exosomes*

PubMed Central

Welton, Joanne L.; Khanna, Sanjay; Giles, Peter J.; Brennan, Paul; Brewis, Ian A.; Staffurth, John; Mason, Malcolm D.; Clayton, Aled

2010-01-01

Exosomes are nanometer-sized vesicles, secreted by various cell types, present in biological fluids that are particularly rich in membrane proteins. Ex vivo analysis of exosomes may provide biomarker discovery platforms and form non-invasive tools for disease diagnosis and monitoring. These vesicles have never before been studied in the context of bladder cancer, a major malignancy of the urological tract. We present the first proteomics analysis of bladder cancer cell exosomes. Using ultracentrifugation on a sucrose cushion, exosomes were highly purified from cultured HT1376 bladder cancer cells and verified as low in contaminants by Western blotting and flow cytometry of exosome-coated beads. Solubilization in a buffer containing SDS and DTT was essential for achieving proteomics analysis using an LC-MALDI-TOF/TOF MS approach. We report 353 high quality identifications with 72 proteins not previously identified by other human exosome proteomics studies. Overrepresentation analysis to compare this data set with previous exosome proteomics studies (using the ExoCarta database) revealed that the proteome was consistent with that of various exosomes with particular overlap with exosomes of carcinoma origin. Interrogating the Gene Ontology database highlighted a strong association of this proteome with carcinoma of bladder and other sites. The data also highlighted how homology among human leukocyte antigen haplotypes may confound MASCOT designation of major histocompatability complex Class I nomenclature, requiring data from PCR-based human leukocyte antigen haplotyping to clarify anomalous identifications. Validation of 18 MS protein identifications (including basigin, galectin-3, trophoblast glycoprotein (5T4), and others) was performed by a combination of Western blotting, flotation on linear sucrose gradients, and flow cytometry, confirming their exosomal expression. Some were confirmed positive on urinary exosomes from a bladder cancer patient. In summary, the exosome proteomics data set presented is of unrivaled quality. The data will aid in the development of urine exosome-based clinical tools for monitoring disease and will inform follow-up studies into varied aspects of exosome manufacture and function. PMID:20224111
A combined reference panel from the 1000 Genomes and UK10K projects improved rare variant imputation in European and Chinese samples

PubMed Central

Chou, Wen-Chi; Zheng, Hou-Feng; Cheng, Chia-Ho; Yan, Han; Wang, Li; Han, Fang; Richards, J. Brent; Karasik, David; Kiel, Douglas P.; Hsu, Yi-Hsiang

2016-01-01

Imputation using the 1000 Genomes haplotype reference panel has been widely adapted to estimate genotypes in genome wide association studies. To evaluate imputation quality with a relatively larger reference panel and a reference panel composed of different ethnic populations, we conducted imputations in the Framingham Heart Study and the North Chinese Study using a combined reference panel from the 1000 Genomes (N = 1,092) and UK10K (N = 3,781) projects. For rare variants with 0.01% < MAF ≤ 0.5%, imputation in the Framingham Heart Study with the combined reference panel increased well-imputed genotypes (with imputation quality score ≥0.4) from 62.9% to 76.1% when compared to imputation with the 1000 Genomes. For the North Chinese samples, imputation of rare variants with 0.01% < MAF ≤ 0.5% with the combined reference panel increased well-imputed genotypes by from 49.8% to 61.8%. The predominant European ancestry of the UK10K and the combined reference panels may explain why there was less of an increase in imputation success in the North Chinese samples. Our results underscore the importance and potential of larger reference panels to impute rare variants, while recognizing that increasing ethnic specific variants in reference panels may result in better imputation for genotypes in some ethnic groups. PMID:28004816

HLA genotyping by next-generation sequencing of complementary DNA.

PubMed

Segawa, Hidenobu; Kukita, Yoji; Kato, Kikuya

2017-11-28

Genotyping of the human leucocyte antigen (HLA) is indispensable for various medical treatments. However, unambiguous genotyping is technically challenging due to high polymorphism of the corresponding genomic region. Next-generation sequencing is changing the landscape of genotyping. In addition to high throughput of data, its additional advantage is that DNA templates are derived from single molecules, which is a strong merit for the phasing problem. Although most currently developed technologies use genomic DNA, use of cDNA could enable genotyping with reduced costs in data production and analysis. We thus developed an HLA genotyping system based on next-generation sequencing of cDNA. Each HLA gene was divided into 3 or 4 target regions subjected to PCR amplification and subsequent sequencing with Ion Torrent PGM. The sequence data were then subjected to an automated analysis. The principle of the analysis was to construct candidate sequences generated from all possible combinations of variable bases and arrange them in decreasing order of the number of reads. Upon collecting candidate sequences from all target regions, 2 haplotypes were usually assigned. Cases not assigned 2 haplotypes were forwarded to 4 additional processes: selection of candidate sequences applying more stringent criteria, removal of artificial haplotypes, selection of candidate sequences with a relaxed threshold for sequence matching, and countermeasure for incomplete sequences in the HLA database. The genotyping system was evaluated using 30 samples; the overall accuracy was 97.0% at the field 3 level and 98.3% at the G group level. With one sample, genotyping of DPB1 was not completed due to short read size. We then developed a method for complete sequencing of individual molecules of the DPB1 gene, using the molecular barcode technology. The performance of the automatic genotyping system was comparable to that of systems developed in previous studies. Thus, next-generation sequencing of cDNA is a viable option for HLA genotyping.
Cytochrome P450 2E1 gene polymorphisms/haplotypes and anti-tuberculosis drug-induced hepatitis in a Chinese cohort.

PubMed

Tang, Shaowen; Lv, Xiaozhen; Zhang, Yuan; Wu, Shanshan; Yang, Zhirong; Xia, Yinyin; Tu, Dehua; Deng, Peiyuan; Ma, Yu; Chen, Dafang; Zhan, Siyan

2013-01-01

The pathogenic mechanism of anti-tuberculosis (anti-TB) drug-induced hepatitis is associated with drug metabolizing enzymes. No tagging single-nucleotide polymorphisms (tSNPs) of cytochrome P450 2E1(CYP2E1) in the risk of anti-TB drug-induced hepatitis have been reported. The present study was aimed at exploring the role of tSNPs in CYP2E1 gene in a population-based anti-TB treatment cohort. A nested case-control study was designed. Each hepatitis case was 14 matched with controls by age, gender, treatment history, disease severity and drug dosage. The tSNPs were selected by using Haploview 4.2 based on the HapMap database of Han Chinese in Beijing, and detected by using TaqMan allelic discrimination technology. Eighty-nine anti-TB drug-induced hepatitis cases and 356 controls were included in this study. 6 tSNPs (rs2031920, rs2070672, rs915908, rs8192775, rs2515641, rs2515644) were genotyped and minor allele frequencies of these tSNPs were 21.9%, 23.0%, 19.1%, 23.6%, 20.8% and 44.4% in the cases and 20.9%, 22.7%, 18.9%, 23.2%, 18.2% and 43.2% in the controls, respectively. No significant difference was observed in genotypes or allele frequencies of the 6 tSNPs between case group and control group, and neither of haplotypes in block 1 nor in block 2 was significantly associated with the development of hepatitis. Based on the Chinese anti-TB treatment cohort, we did not find a statistically significant association between genetic polymorphisms of CYP2E1 and the risk of anti-TB drug-induced hepatitis. None of the haplotypes showed a significant association with the development of hepatitis in Chinese TB population.
Increased risks between Interleukin-10 gene polymorphisms and haplotype and head and neck cancer: a meta-analysis.

PubMed

Niu, Yu-Ming; Du, Xin-Ya; Cai, Heng-Xing; Zhang, Chao; Yuan, Rui-Xia; Zeng, Xian-Tao; Luo, Jie

2015-11-27

Molecular epidemiological research suggests that interleukin-10 (IL-10) polymorphisms may be associated with an increased risk of head and neck cancer (HNC), but results remain controversial. To derive a more precise evaluation, we performed a meta-analysis focused on genetic polymorphisms of IL-10. PubMed, Embase, CNKI and Wanfang databases were searched for studies that examined the relationship between IL-10 polymorphisms or haplotypes and HNC risk. The odds ratio (OR) and 95% confidence interval (CI) were applied to assess the relationship strength. Publication bias, sensitivity and cumulative analyses were conducted to measure the robustness of our findings. Overall, nine related studies involving 2,258 patients and 2,887 control samples were analyzed. Significant associations between the IL-10-1082A > G polymorphism and HNC risk were observed (G vs. A: OR = 1.56, 95% CI = 1.27-1.92, P < 0.01, I(2) = 69.4%; AG vs. AA: OR = 1.64, 95% CI = 1.32-2.05, P < 0.01, I(2) = 55.6%; GG vs. AA: OR = 2.24, 95% CI = 1.69-2.97, P < 0.01, I(2) = 38.5%; AG + GG vs. AA: OR = 1.70, 95% CI = 1.36-2.14, P = 0.02, I(2) = 61.8%; GG vs. AA + AG: OR = 1.89, 95% CI = 1.23-2.90, P = 0.01, I(2) = 46.3%) in the total population, as well as in subgroup analysis. Moreover, increased HNC risks were also associated with the IL-10 -819T > C polymorphism and the GCC haplotype. In conclusion, our meta-analyses suggest that IL-10 polymorphisms, specifically the -1082A > G polymorphism, may be associated with increased risk of HNC development.
Analysis of four microsatellite markers on the long arm of chromosome 9 by meiotic recombination in flow-sorted single sperm

DOE Office of Scientific and Technical Information (OSTI.GOV)

Furlong, R.A.; Goudie, D.R.; Carter, N.P.

1993-06-01

Meiotic recombination in flow-sorted single sperm was used to analyze four highly polymorphic microsatellite markers on the long arm of chromosome 9. The microsatellites comprised three tightly linked markers: 9CMP1 (D9S109), 9CMP2 (D9S127), and D9S53, which map to 9q31, and a reference marker, ASS, which is located in 9q34.1. Haplotypes of single sperm were assessed by using PCR in a single-step multiplex reaction to amplify each locus. Recombinant haplotypes were identified by their relative infrequency and were analyzed using THREELOC, a maximum-likelihood-analysis program, and an adaptation of CRI-MAP. The most likely order of these markers was cen-D9S109-D9S127-D9S53-ASS-tel with D9S109, D9S127,more » and D9S53 being separated by a genetic distance of approximately 3%. The order of the latter three markers did not however achieve statistical significance using the THREELOC program. 21 refs., 2 figs., 4 tabs.« less
[Identification of Y-chromosomal Genetic Types for the Soldier's Remains from Huaihai Campaign].

PubMed

Wang, C Z; Wen, S Q; Shi, M S; Yu, X E; Wang, X J; Pan, Y L; Zhang, Y F; Li, H; Tan, J Z

2017-08-01

To identify the Y-chromosomal genetic types for the soldier's remains from Huaihai Campaign, and to offer a clue for search of their paternal relatives. DNA of the remains were extracted by the ancient DNA extraction method. Yfiler kit was used for the multiplex amplification of 17 Y-STR loci. The haplogroups of the samples were speculated. Detailed genotyping of the selected Y-SNP was performed based on the latest Y-chromosome phylogenetic tree. Haplotype-sharing analysis was done based on the data of Y-SNP and Y-STR, the closest modern individual information to the genetic relationship of remains was gained. A total of 8 Y-STR haplotypes were observed on 17 Y-STR loci of 8 male individuals. Furthermore, 6 Y-SNP haplogroups were identified, which were O2a1-M95+, O1a1-P203+, O3*-M122+/M234-, D1-M15+, C3*-ST and R1a1-M17+. Identification of Y-chromosomal genetic types for the soldier's remains from Huaihai Campaign shows a reference value on inferring the geographical origins of old materials. Copyright© by the Editorial Department of Journal of Forensic Medicine
Mechanisms of haplotype divergence at the RGA08 nucleotide-binding leucine-rich repeat gene locus in wild banana (Musa balbisiana).

PubMed

Baurens, Franc-Christophe; Bocs, Stéphanie; Rouard, Mathieu; Matsumoto, Takashi; Miller, Robert N G; Rodier-Goud, Marguerite; MBéguié-A-MBéguié, Didier; Yahiaoui, Nabila

2010-07-16

Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M. balbisiana haplotypes. A large RGA08 gene cluster identified in wild banana corresponds to a highly variable genomic region between haplotypes surrounded by conserved flanking regions. High level of sequence identity (70 to 99%) of the genic and intergenic regions suggests a recent and rapid evolution of this cluster in M. balbisiana.
Multi-allelic haplotype model based on genetic partition for genomic prediction and variance component estimation using SNP markers.

PubMed

Da, Yang

2015-12-18

The amount of functional genomic information has been growing rapidly but remains largely unused in genomic selection. Genomic prediction and estimation using haplotypes in genome regions with functional elements such as all genes of the genome can be an approach to integrate functional and structural genomic information for genomic selection. Towards this goal, this article develops a new haplotype approach for genomic prediction and estimation. A multi-allelic haplotype model treating each haplotype as an 'allele' was developed for genomic prediction and estimation based on the partition of a multi-allelic genotypic value into additive and dominance values. Each additive value is expressed as a function of h - 1 additive effects, where h = number of alleles or haplotypes, and each dominance value is expressed as a function of h(h - 1)/2 dominance effects. For a sample of q individuals, the limit number of effects is 2q - 1 for additive effects and is the number of heterozygous genotypes for dominance effects. Additive values are factorized as a product between the additive model matrix and the h - 1 additive effects, and dominance values are factorized as a product between the dominance model matrix and the h(h - 1)/2 dominance effects. Genomic additive relationship matrix is defined as a function of the haplotype model matrix for additive effects, and genomic dominance relationship matrix is defined as a function of the haplotype model matrix for dominance effects. Based on these results, a mixed model implementation for genomic prediction and variance component estimation that jointly use haplotypes and single markers is established, including two computing strategies for genomic prediction and variance component estimation with identical results. The multi-allelic genetic partition fills a theoretical gap in genetic partition by providing general formulations for partitioning multi-allelic genotypic values and provides a haplotype method based on the quantitative genetics model towards the utilization of functional and structural genomic information for genomic prediction and estimation.
Dominant Sequences of Human Major Histocompatibility Complex Conserved Extended Haplotypes from HLA-DQA2 to DAXX

PubMed Central

Larsen, Charles E.; Alford, Dennis R.; Trautwein, Michael R.; Jalloh, Yanoh K.; Tarnacki, Jennifer L.; Kunnenkeri, Sushruta K.; Fici, Dolores A.; Yunis, Edmond J.; Awdeh, Zuheir L.; Alper, Chester A.

2014-01-01

We resequenced and phased 27 kb of DNA within 580 kb of the MHC class II region in 158 population chromosomes, most of which were conserved extended haplotypes (CEHs) of European descent or contained their centromeric fragments. We determined the single nucleotide polymorphism and deletion-insertion polymorphism alleles of the dominant sequences from HLA-DQA2 to DAXX for these CEHs. Nine of 13 CEHs remained sufficiently intact to possess a dominant sequence extending at least to DAXX, 230 kb centromeric to HLA-DPB1. We identified the regions centromeric to HLA-DQB1 within which single instances of eight “common” European MHC haplotypes previously sequenced by the MHC Haplotype Project (MHP) were representative of those dominant CEH sequences. Only two MHP haplotypes had a dominant CEH sequence throughout the centromeric and extended class II region and one MHP haplotype did not represent a known European CEH anywhere in the region. We identified the centromeric recombination transition points of other MHP sequences from CEH representation to non-representation. Several CEH pairs or groups shared sequence identity in small blocks but had significantly different (although still conserved for each separate CEH) sequences in surrounding regions. These patterns partly explain strong calculated linkage disequilibrium over only short (tens to hundreds of kilobases) distances in the context of a finite number of observed megabase-length CEHs comprising half a population's haplotypes. Our results provide a clearer picture of European CEH class II allelic structure and population haplotype architecture, improved regional CEH markers, and raise questions concerning regional recombination hotspots. PMID:25299700
Neuropsychiatric systemic lupus erythematosus is associated with imbalance in interleukin 10 promoter haplotypes

PubMed Central

Rood, M; Keijsers, V; van der Linden, M W; Tong, T; Borggreve, S; Verweij, C; Breedveld, F; Huizinga, T

1999-01-01

OBJECTIVE—To investigate the association of interleukin 10 (IL10) promoter polymorphisms and neuropsychiatric manifestations of systemic lupus erythematosus (SLE). METHODS—IL10 haplotypes of 11 healthy volunteers were cloned to confirm that in the Dutch population, only the three common haplotypes (-1082/-819/-592) GCC, ACC and ATA exist. The IL10 promoter polymorphisms of 92 SLE patients and 162 healthy controls were determined. The medical records of the SLE patients were screened for the presence of neuropsychiatric involvement. RESULTS—All cloned haplotypes were either GCC, ACC or ATA. Forty two SLE patients had suffered from neuropsychiatric manifestations (NP-SLE). In NP-SLE patients, the frequency of the ATA haplotype is 30% versus 18% in the controls and 17% in the non-NP-SLE group (odds ratios 1.9, p=0.02, and 2.1, p=0.04, respectively), whereas the GCC haplotype frequency is lower in the NP-SLE group compared with controls and non-NP-SLE patients (40% versus 55% and 61%, odds ratios 0.6, p=0.02 and 0.4 p=0.006). The odds ratio for the presence of NP-SLE is inversely proportional to the number of GCC haplotypes per genotype when the NP-SLE group is compared with non-NP-SLE patients. CONCLUSIONS—The IL10 locus is associated with neuropsychiatric manifestations in SLE. This suggests that IL10 is implicated in the immunopathogenesis of neuropsychiatric manifestations in SLE.   Keywords: systemic lupus erythematosus; neuropsychiatric manifestations; genetics; interleukin 10 promoter haplotypes PMID:10343522
African gene flow to north Brazil as revealed by HBB*S gene haplotype analysis.

PubMed

Lemos Cardoso, Greice; Farias Guerreiro, João

2006-01-01

Haplotypes linked to the HBB*S gene were analyzed in a sample of 260 chromosomes of Brazilian sickle cell anemia patients from the population of Belém, state of Pará, to evaluate if the present-day haplotype frequencies correlate as well as expected with historical information on the geographic origin of African slaves sent directly to Northern Brazil. The HBB*S gene haplotype distribution (66% Bantu, 21.8% Benin, 10.9% Senegal, and 1.3% Cameroon) is in agreement with those observed for other Brazilian populations regarding the highest proportion of the Bantu type, followed by the Benin type, but it differs significantly concerning the Senegal type as this haplotype is rare or absent in samples from other Brazilian regions already studied. In addition, our results are in accordance with historical records that establish that about 90% of the slaves sent to Northern Brazil were from Angola, Congo, and Mozambique, where the Bantu haplotype predominates, in contrast to 10% of slaves from Senegambia, Guine-Bissau, and Cape Verde, where the Senegal haplotype is the most common. On the other hand, the observed frequency of the Benin haplotype in Belém was much higher than that expected by historical data. This fact corroborates the suggestion that the high prevalence of the Benin type in Belém is due to domestic slave trade and later internal migrations, mainly from the Northeast, since there are no historical records of direct slave trade from Central West Africa to North Brazil. Am. J. Hum. Biol. 18:93-98, 2006. (c) 2005 Wiley-Liss, Inc.
Cluster analysis of European Y-chromosomal STR haplotypes using the discrete Laplace method.

PubMed

Andersen, Mikkel Meyer; Eriksen, Poul Svante; Morling, Niels

2014-07-01

The European Y-chromosomal short tandem repeat (STR) haplotype distribution has previously been analysed in various ways. Here, we introduce a new way of analysing population substructure using a new method based on clustering within the discrete Laplace exponential family that models the probability distribution of the Y-STR haplotypes. Creating a consistent statistical model of the haplotypes enables us to perform a wide range of analyses. Previously, haplotype frequency estimation using the discrete Laplace method has been validated. In this paper we investigate how the discrete Laplace method can be used for cluster analysis to further validate the discrete Laplace method. A very important practical fact is that the calculations can be performed on a normal computer. We identified two sub-clusters of the Eastern and Western European Y-STR haplotypes similar to results of previous studies. We also compared pairwise distances (between geographically separated samples) with those obtained using the AMOVA method and found good agreement. Further analyses that are impossible with AMOVA were made using the discrete Laplace method: analysis of the homogeneity in two different ways and calculating marginal STR distributions. We found that the Y-STR haplotypes from e.g. Finland were relatively homogeneous as opposed to the relatively heterogeneous Y-STR haplotypes from e.g. Lublin, Eastern Poland and Berlin, Germany. We demonstrated that the observed distributions of alleles at each locus were similar to the expected ones. We also compared pairwise distances between geographically separated samples from Africa with those obtained using the AMOVA method and found good agreement. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
F8 haplotype and inhibitor risk: results from the Hemophilia Inhibitor Genetics Study (HIGS) Combined Cohort

PubMed Central

Schwarz, John; Astermark, Jan; Menius, Erika D.; Carrington, Mary; Donfield, Sharyne M.; Gomperts, Edward D.; Nelson, George W.; Oldenburg, Johannes; Pavlova, Anna; Shapiro, Amy D.; Winkler, Cheryl A.; Berntorp, Erik

2012-01-01

Background Ancestral background, specifically African descent, confers higher risk for development of inhibitory antibodies to factor VIII (FVIII) in hemophilia A. It has been suggested that differences in the distribution of factor VIII gene (F8) haplotypes, and mismatch between endogenous F8 haplotypes and those comprising products used for treatment could contribute to risk. Design and Methods Data from the HIGS Combined Cohort were used to determine the association between F8 haplotype 3 (H3) vs. haplotypes 1 and 2 (H1+H2) and inhibitor risk among individuals of genetically-determined African descent. Other variables known to affect inhibitor risk including type of F8 mutation and HLA were included in the analysis. A second research question regarding risk related to mismatch in endogenous F8 haplotype and recombinant FVIII products used for treatment was addressed. Results H3 was associated with higher inhibitor risk among those genetically-identified (N=49) as of African ancestry, but the association did not remain significant after adjustment for F8 mutation type and the HLA variables. Among subjects of all racial ancestries enrolled in HIGS who reported early use of recombinant products (N=223), mismatch in endogenous haplotype and the FVIII proteins constituting the products used did not confer greater risk for inhibitor development. Conclusion H3 was not an independent predictor of inhibitor risk. Further, our findings did not support a higher risk of inhibitors in the presence of a haplotype mismatch between the FVIII molecule infused and that of the individual. PMID:22958194
Dual African Origins of Global Aedes aegypti s.l. Populations Revealed by Mitochondrial DNA

PubMed Central

Moore, Michelle; Sylla, Massamba; Goss, Laura; Burugu, Marion Warigia; Sang, Rosemary; Kamau, Luna W.; Kenya, Eucharia Unoma; Bosio, Chris; Munoz, Maria de Lourdes; Sharakova, Maria; Black, William Cormack

2013-01-01

Background Aedes aegypti is the primary global vector to humans of yellow fever and dengue flaviviruses. Over the past 50 years, many population genetic studies have documented large genetic differences among global populations of this species. These studies initially used morphological polymorphisms, followed later by allozymes, and most recently various molecular genetic markers including microsatellites and mitochondrial markers. In particular, since 2000, fourteen publications and four unpublished datasets have used sequence data from the NADH dehydrogenase subunit 4 mitochondrial gene to compare Ae. aegypti collections and collectively 95 unique mtDNA haplotypes have been found. Phylogenetic analyses in these many studies consistently resolved two clades but no comprehensive study of mtDNA haplotypes have been made in Africa, the continent in which the species originated. Methods and Findings ND4 haplotypes were sequenced in 426 Ae. aegypti s.l. from Senegal, West Africa and Kenya, East Africa. In Senegal 15 and in Kenya 7 new haplotypes were discovered. When added to the 95 published haplotypes and including 6 African Aedes species as outgroups, phylogenetic analyses showed that all but one Senegal haplotype occurred in a basal clade while most East African haplotypes occurred in a second clade arising from the basal clade. Globally distributed haplotypes occurred in both clades demonstrating that populations outside Africa consist of mixtures of mosquitoes from both clades. Conclusions Populations of Ae. aegypti outside Africa consist of mosquitoes arising from one of two ancestral clades. One clade is basal and primarily associated with West Africa while the second arises from the first and contains primarily mosquitoes from East Africa PMID:23638196
Discovery of novel MHC-class I alleles and haplotypes in Filipino cynomolgus macaques (Macaca fascicularis) by pyrosequencing and Sanger sequencing: Mafa-class I polymorphism.

PubMed

Shiina, Takashi; Yamada, Yukiho; Aarnink, Alice; Suzuki, Shingo; Masuya, Anri; Ito, Sayaka; Ido, Daisuke; Yamanaka, Hisashi; Iwatani, Chizuru; Tsuchiya, Hideaki; Ishigaki, Hirohito; Itoh, Yasushi; Ogasawara, Kazumasa; Kulski, Jerzy K; Blancher, Antoine

2015-10-01

Although the low polymorphism of the major histocompatibility complex (MHC) transplantation genes in the Filipino cynomolgus macaque (Macaca fascicularis) is expected to have important implications in the selection and breeding of animals for medical research, detailed polymorphism information is still lacking for many of the duplicated class I genes. To better elucidate the degree and types of MHC polymorphisms and haplotypes in the Filipino macaque population, we genotyped 127 unrelated animals by the Sanger sequencing method and high-resolution pyrosequencing and identified 112 different alleles, 28 at cynomolgus macaque MHC (Mafa)-A, 54 at Mafa-B, 12 at Mafa-I, 11 at Mafa-E, and seven at Mafa-F alleles, of which 56 were newly described. Of them, the newly discovered Mafa-A8*01:01 lineage allele had low nucleotide similarities (<86%) with primate MHC class I genes, and it was also conserved in the Vietnamese and Indonesian populations. In addition, haplotype estimations revealed 17 Mafa-A, 23 Mafa-B, and 12 Mafa-E haplotypes integrated with 84 Mafa-class I haplotypes and Mafa-F alleles. Of these, the two Mafa-class I haplotypes, F/A/E/B-Hp1 and F/A/E/B-Hp2, had the highest haplotype frequencies at 10.6 and 10.2%, respectively. This suggests that large scale genetic screening of the Filipino macaque population would identify these and other high-frequency Mafa-class I haplotypes that could be used as MHC control animals for the benefit of biomedical research.
Discovery of a haplotype affecting fertility in Ayrshire dairy dattle and identification of a putative causal variant

USDA-ARS?s Scientific Manuscript database

Initial genomic test results for US Ayrshire dairy cattle became available in January of 2013. Several haplotypes that showed a deficiency of homozygotes were investigated to determine if they had an effect on fertility. A haplotype on chromosome 17 was determined to affect fertility, indicating tha...
49 CFR 630.4 - Requirements.

Code of Federal Regulations, 2012 CFR

2012-10-01

... TRANSPORTATION NATIONAL TRANSIT DATABASE § 630.4 Requirements. (a) National Transit Database Reporting System... from the National Transit Database Web site located at http://www.ntdprogram.gov. These reference... Transit Database Web site and a notice of any significant changes to the reporting requirements specified...
49 CFR 630.4 - Requirements.

Code of Federal Regulations, 2011 CFR

2011-10-01

... TRANSPORTATION NATIONAL TRANSIT DATABASE § 630.4 Requirements. (a) National Transit Database Reporting System... from the National Transit Database Web site located at http://www.ntdprogram.gov. These reference... Transit Database Web site and a notice of any significant changes to the reporting requirements specified...
49 CFR 630.4 - Requirements.

Code of Federal Regulations, 2010 CFR

2010-10-01

... TRANSPORTATION NATIONAL TRANSIT DATABASE § 630.4 Requirements. (a) National Transit Database Reporting System... from the National Transit Database Web site located at http://www.ntdprogram.gov. These reference... Transit Database Web site and a notice of any significant changes to the reporting requirements specified...
49 CFR 630.4 - Requirements.

Code of Federal Regulations, 2014 CFR

2014-10-01

... TRANSPORTATION NATIONAL TRANSIT DATABASE § 630.4 Requirements. (a) National Transit Database Reporting System... from the National Transit Database Web site located at http://www.ntdprogram.gov. These reference... Transit Database Web site and a notice of any significant changes to the reporting requirements specified...
49 CFR 630.4 - Requirements.

Code of Federal Regulations, 2013 CFR

2013-10-01

... TRANSPORTATION NATIONAL TRANSIT DATABASE § 630.4 Requirements. (a) National Transit Database Reporting System... from the National Transit Database Web site located at http://www.ntdprogram.gov. These reference... Transit Database Web site and a notice of any significant changes to the reporting requirements specified...

Submegabase Clusters of Unstable Tandem Repeats Unique to the Tla Region of Mouse T Haplotypes

PubMed Central

Uehara, H.; Ebersole, T.; Bennett, D.; Artzt, K.

1990-01-01

We describe here the identification and genomic organization of mouse t haplotype-specific elements (TSEs) 7.8 and 5.8 kb in length. The TSEs exist as submegabase-long clusters of tandem repeats localized in the Tla region of the major histocompatibility complex of all t haplotype chromosomes examined. In contrast, no such clusters were detected among 12 inbred strains of Mus musculus and other Mus species; thus, clusters of TSEs represent the first absolutely qualitative difference between t haplotypes and wild-type chromosomes. Pulsed field gel electrophoresis shows that the number of clusters, and the number of repeats in each cluster are extremely variable. Dramatic quantitative differences of TSEs uniquely distinguish every independent t haplotype from any other. The complete nucleotide sequence of one 7.8-kb TSE reveals significant homology to the ETn (a major transcript in the early embryo of the mouse), and some homologies to intracisternal A-particles and the mammary tumor virus env gene. Apart from the diagnostic relevance to t haplotypes, evolutionary and functional significances are discussed with respect to chromosome structure and genetic recombination. PMID:2076812
Phylogeography of Japanese horse chestnut (Aesculus turbinata) in the Japanese Archipelago based on chloroplast DNA haplotypes.

PubMed

Sugahara, Kanako; Kaneko, Yuko; Ito, Satoshi; Yamanaka, Keisuke; Sakio, Hitoshi; Hoshizaki, Kazuhiko; Suzuki, Wajiro; Yamanaka, Norikazu; Setoguchi, Hiroaki

2011-01-01

Japanese horse chestnut (Aesculus turbinata: Hippocastanaceae) is one of the typical woody plants that grow in temperate riparian forests in the Japanese Archipelago. To analyze the phylogeography of this plant in the Japanese Archipelago, we determined cpDNA haplotypes for 337 samples from 55 populations covering the entire distribution range. Based on 1,313 bp of two spacers, we determined ten haplotypes that are distinguished from adjacent haplotypes by one or two steps. Most of the populations had a single haplotype, suggesting low diversity. Spatial analysis of molecular variance suggested three obvious phylogeographic structures in western Japan, where Japanese horse chestnut is scattered and isolated in mountainous areas. Conversely, no clear phylogeographic structure was observed from the northern to the southern limit of this species, including eastern Japan, where this plant is more common. Rare and private haplotypes were also found in southwestern Japan, where Japanese horse chestnuts are distributed sparsely. These findings imply that western Japan might have maintained a relatively large habitat for A. turbinata during the Quaternary climatic oscillations, while northerly regions could not.
In Vivo Characterization of Human APOA5 Haplotypes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ahituv, Nadav; Akiyama, Jennifer; Chapman-Helleboid, Audrey

2006-10-01

Increased plasma triglycerides concentrations are an independent risk factor for cardiovascular disease. Numerous studies support a reproducible genetic association between two minor haplotypes in the human apolipoprotein A5 gene (APOA5) and increased plasma triglyceride concentrations. We thus sought to investigate the effect of these minor haplotypes (APOA5*2 and APOA5*3) on ApoAV plasma levels through the precise insertion of single-copy intact APOA5 haplotypes at a targeted location in the mouse genome. While we found no difference in the amount of human plasma ApoAV in mice containing the common APOA5*1 and minor APOA5*2 haplotype, the introduction of the single APOA5*3 defining allelemore » (19W) resulted in 3-fold lower ApoAV plasma levels consistent with existing genetic association studies. These results indicate that S19W polymorphism is likely to be functional and explain the strong association of this variant with plasma triglycerides supporting the value of sensitive in vivo assays to define the functional nature of human haplotypes.« less
Mineralocorticoid receptor haplotype, oral contraceptives and emotional information processing.

PubMed

Hamstra, D A; de Kloet, E R; van Hemert, A M; de Rijk, R H; Van der Does, A J W

2015-02-12

Oral contraceptives (OCs) affect mood in some women and may have more subtle effects on emotional information processing in many more users. Female carriers of mineralocorticoid receptor (MR) haplotype 2 have been shown to be more optimistic and less vulnerable to depression. To investigate the effects of oral contraceptives on emotional information processing and a possible moderating effect of MR haplotype. Cross-sectional study in 85 healthy premenopausal women of West-European descent. We found significant main effects of oral contraceptives on facial expression recognition, emotional memory and decision-making. Furthermore, carriers of MR haplotype 1 or 3 were sensitive to the impact of OCs on the recognition of sad and fearful faces and on emotional memory, whereas MR haplotype 2 carriers were not. Different compounds of OCs were included. No hormonal measures were taken. Most naturally cycling participants were assessed in the luteal phase of their menstrual cycle. Carriers of MR haplotype 2 may be less sensitive to depressogenic side-effects of OCs. Copyright © 2015 IBRO. Published by Elsevier Ltd. All rights reserved.
Multiple genetic origins of histidine-rich protein 2 gene deletion in Plasmodium falciparum parasites from Peru

PubMed Central

Akinyi, Sheila; Hayden, Tonya; Gamboa, Dionicia; Torres, Katherine; Bendezu, Jorge; Abdallah, Joseph F.; Griffing, Sean M.; Quezada, Wilmer Marquiño; Arrospide, Nancy; De Oliveira, Alexandre Macedo; Lucas, Carmen; Magill, Alan J.; Bacon, David J.; Barnwell, John W.; Udhayakumar, Venkatachalam

2013-01-01

The majority of malaria rapid diagnostic tests (RDTs) detect Plasmodium falciparum histidine-rich protein 2 (PfHRP2), encoded by the pfhrp2 gene. Recently, P. falciparum isolates from Peru were found to lack pfhrp2 leading to false-negative RDT results. We hypothesized that pfhrp2-deleted parasites in Peru derived from a single genetic event. We evaluated the parasite population structure and pfhrp2 haplotype of samples collected between 1998 and 2005 using seven neutral and seven chromosome 8 microsatellite markers, respectively. Five distinct pfhrp2 haplotypes, corresponding to five neutral microsatellite-based clonal lineages, were detected in 1998-2001; pfhrp2 deletions occurred within four haplotypes. In 2003-2005, outcrossing among the parasite lineages resulted in eight population clusters that inherited the five pfhrp2 haplotypes seen previously and a new haplotype; pfhrp2 deletions occurred within four of these haplotypes. These findings indicate that the genetic origin of pfhrp2 deletion in Peru was not a single event, but likely occurred multiple times. PMID:24077522
A parsimonious tree-grow method for haplotype inference.

PubMed

Li, Zhenping; Zhou, Wenfeng; Zhang, Xiang-Sun; Chen, Luonan

2005-09-01

Haplotype information has become increasingly important in analyzing fine-scale molecular genetics data, such as disease genes mapping and drug design. Parsimony haplotyping is one of haplotyping problems belonging to NP-hard class. In this paper, we aim to develop a novel algorithm for the haplotype inference problem with the parsimony criterion, based on a parsimonious tree-grow method (PTG). PTG is a heuristic algorithm that can find the minimum number of distinct haplotypes based on the criterion of keeping all genotypes resolved during tree-grow process. In addition, a block-partitioning method is also proposed to improve the computational efficiency. We show that the proposed approach is not only effective with a high accuracy, but also very efficient with the computational complexity in the order of O(m2n) time for n single nucleotide polymorphism sites in m individual genotypes. The software is available upon request from the authors, or from http://zhangroup.aporc.org/bioinfo/ptg/ chen@elec.osaka-sandai.ac.jp Supporting materials is available from http://zhangroup.aporc.org/bioinfo/ptg/bti572supplementary.pdf
Gene Flow Patterns of the Mayfly Fallceon quilleri in San Diego County, California.

NASA Astrophysics Data System (ADS)

Zickovich, J.; Bohonak, A. J.

2005-05-01

Management decisions and conservation strategies for freshwater invertebrates critically depend on an understanding of gene flow and genetic structure. We collected the mayfly Fallceon quilleri (Ephemeroptera: Baetidae) from 15 streams across three geographically distinct watersheds in San Diego County, California (San Dieguito, Santa Margarita, and Tijuana) and one site in Anza-Borrego desert. We sequenced a 667 base pair region of the mitochondrial DNA (COI) to assess genetic structure and gene flow. We found eight haplotypes across all populations. San Dieguito and Santa Margarita each contained six haplotypes. Tijuana and Anza Borrego each contained four haplotypes. The expected heterozygosity for San Dieguito, Santa Margarita, Tijuana, and Anza Borrego was 0.81, 0.83, 0.75, and 1.0, respectively. A hierarchical AMOVA analysis indicated restricted gene flow and a pairwise comparison indicated that Tijuana watershed differs significantly from San Dieguito and Anza Borrego. A haplotype cladogram revealed two internal ancestral haplotypes and six derived tip haplotypes that are unique to particular watersheds. These results suggest that Tijuana (the southernmost and the most impacted watershed) is more genetically distinct and isolated than the other watersheds sampled.
Interactions Between Serotonin Transporter Gene Haplotypes and Quality of Mothers’ Parenting Predict the Development of Children’s Noncompliance

PubMed Central

Sulik, Michael J.; Eisenberg, Nancy; Lemery-Chalfant, Kathryn; Spinrad, Tracy L.; Silva, Kassondra M.; Eggum, Natalie D.; Betkowski, Jennifer A.; Kupfer, Anne; Smith, Cynthia L.; Gaertner, Bridget; Stover, Daryn A.; Verrelli, Brian C.

2012-01-01

The LPR and STin2 polymorphisms of the serotonin transporter gene (SLC6A4) were combined into haplotypes that, together with quality of maternal parenting, were used to predict initial levels and linear change in children’s (N = 138) noncompliance and aggression from age 18 –54 months. Quality of mothers’ parenting behavior was observed when children were 18 months old, and nonparental caregivers’ reports of noncompliance and aggression were collected annually from 18 to 54 months of age. Quality of early parenting was negatively related to the slope of noncompliance only for children with the LPR-S/STin2-10 haplotype and to 18-month noncompliance only for children with haplotypes that did not include LPR-S. The findings support the notion that SLC6A4 haplotypes index differential susceptibility to variability in parenting quality, with certain haplotypes showing greater reactivity to both supportive and unsupportive environments. These different genetic backgrounds likely reflect an evolutionary response to variation in the parenting environment. PMID:22059451
An Online Resource for Flight Test Safety Planning

NASA Technical Reports Server (NTRS)

Lewis, Greg

2007-01-01

A viewgraph presentation describing an online database for flight test safety techniques is shown. The topics include: 1) Goal; 2) Test Hazard Analyses; 3) Online Database Background; 4) Data Gathering; 5) NTPS Role; 6) Organizations; 7) Hazard Titles; 8) FAR Paragraphs; 9) Maneuver Name; 10) Identified Hazard; 11) Matured Hazard Titles; 12) Loss of Control Causes; 13) Mitigations; 14) Database Now Open to the Public; 15) FAR Reference Search; 16) Record Field Search; 17) Keyword Search; and 18) Results of FAR Reference Search.
Phylogenetic analysis of mtDNA lineages in South American mummies.

PubMed

Monsalve, M V; Cardenas, F; Guhl, F; Delaney, A D; Devine, D V

1996-07-01

Some studies of mtDNA propose that contemporary Amerindians have descended from four haplotype groups, each defined by specific sets of polymorphisms. One recent study also found evidence of other potential founder haplotypes. We wanted to determine whether the four haplotypes in modern populations were also present in ancient South American aboriginals. We subjected mtDNA from Colombian mummies (470 to 1849 AD) to PCR amplification and restriction endonuclease analysis. The mtDNA D-loop region was surveyed for sequence variation by restriction analysis and a segment of this region was sequenced for each mummy to characterize the haplotypes. Our mummies exhibited three of the four major characteristic haplotypes of Amerindian populations defined by four markers. With sequence data obtained in the ancient samples and published data on contemporary Amerindians it was possible to infer the origin of these six mummies.
The effect of missing data on linkage disequilibrium mapping and haplotype association analysis in the GAW14 simulated datasets

PubMed Central

McCaskie, Pamela A; Carter, Kim W; McCaskie, Simon R; Palmer, Lyle J

2005-01-01

We used our newly developed linkage disequilibrium (LD) plotting software, JLIN, to plot linkage disequilibrium between pairs of single-nucleotide polymorphisms (SNPs) for three chromosomes of the Genetic Analysis Workshop 14 Aipotu simulated population to assess the effect of missing data on LD calculations. Our haplotype analysis program, SIMHAP, was used to assess the effect of missing data on haplotype-phenotype association. Genotype data was removed at random, at levels of 1%, 5%, and 10%, and the LD calculations and haplotype association results for these levels of missingness were compared to those for the complete dataset. It was concluded that ignoring individuals with missing data substantially affects the number of regions of LD detected which, in turn, could affect tagging SNPs chosen to generate haplotypes. PMID:16451612
Haplotype Frequency Distribution in Northeastern European Saduria entomon (Crustacea: Isopoda) Populations. A Phylogeographic Approach

NASA Astrophysics Data System (ADS)

Sell, Jerzy

2003-11-01

The distribution pattern of mtDNA haplotypes in distinct populations of the glacial relict crustacean Saduria entomon was examined to assess phylogeographic relationships among them. Populations from the Baltic, the White Sea and the Barents Sea were screened for mtDNA variation using PCR-based RFLP analysis of a 1150 bp fragment containing part of the CO I and CO II genes. Five mtDNA haplotypes were recorded. An analysis of geographical heterogeneity in haplotype frequency distributions revealed significant differences among populations. The isolated populations of S. entomon have diverged since the retreat of the last glaciation. The geographical pattern of variation is most likely the result of stochastic (founder effect, genetic drift) mechanisms and suggests that the haplotype differentiation observed is probably older than the isolation of the Baltic and Arctic seas.
DHLAS: A web-based information system for statistical genetic analysis of HLA population data.

PubMed

Thriskos, P; Zintzaras, E; Germenis, A

2007-03-01

DHLAS (database HLA system) is a user-friendly, web-based information system for the analysis of human leukocyte antigens (HLA) data from population studies. DHLAS has been developed using JAVA and the R system, it runs on a Java Virtual Machine and its user-interface is web-based powered by the servlet engine TOMCAT. It utilizes STRUTS, a Model-View-Controller framework and uses several GNU packages to perform several of its tasks. The database engine it relies upon for fast access is MySQL, but others can be used a well. The system estimates metrics, performs statistical testing and produces graphs required for HLA population studies: (i) Hardy-Weinberg equilibrium (calculated using both asymptotic and exact tests), (ii) genetics distances (Euclidian or Nei), (iii) phylogenetic trees using the unweighted pair group method with averages and neigbor-joining method, (iv) linkage disequilibrium (pairwise and overall, including variance estimations), (v) haplotype frequencies (estimate using the expectation-maximization algorithm) and (vi) discriminant analysis. The main merit of DHLAS is the incorporation of a database, thus, the data can be stored and manipulated along with integrated genetic data analysis procedures. In addition, it has an open architecture allowing the inclusion of other functions and procedures.
Profiling Developmental Toxicity of 387 Environmental Chemicals using EPA’s Toxicity Reference Database (ToxRefDB)

EPA Science Inventory

EPA's Toxicity Reference Databases (ToxRefDB) was developed by the National Center for Computational Toxicology in partnership with EPA's Office of Pesticide Programs, to store data derived from in vivo animal toxicity studies [www.epa.gov/ncct/toxrefdb/]. The initial build of To...
USDA National Nutrient Database for Standard Reference, Release 25

USDA-ARS?s Scientific Manuscript database

The USDA National Nutrient Database for Standard Reference, Release 25(SR25)contains data for over 8,100 food items for up to 146 food components. It replaces the previous release, SR24, issued in September 2011. Data in SR25 supersede values in the printed handbooks and previous electronic releas...
USDA National Nutrient Database for Standard Reference, Release 24

USDA-ARS?s Scientific Manuscript database

The USDA Nutrient Database for Standard Reference, Release 24 contains data for over 7,900 food items for up to 146 food components. It replaces the previous release, SR23, issued in September 2010. Data in SR24 supersede values in the printed Handbooks and previous electronic releases of the databa...
Design of a diagnostic encyclopaedia using AIDA.

PubMed

van Ginneken, A M; Smeulders, A W; Jansen, W

1987-01-01

Diagnostic Encyclopaedia Workstation (DEW) is the name of a digital encyclopaedia constructed to contain reference knowledge with respect to the pathology of the ovary. Comparing DEW with the common sources of reference knowledge (i.e. books) leads to the following advantages of DEW: it contains more verbal knowledge, pictures and case histories, and it offers information adjusted to the needs of the user. Based on an analysis of the structure of this reference knowledge we have chosen AIDA to develop a relational database and we use a video-disc player to contain the pictorial part of the database. The system consists of a database input version and a read-only run version. The design of the database input version is discussed. Reference knowledge for ovary pathology requires 1-3 Mbytes of memory. At present 15% of this amount is available. The design of the run version is based on an analysis of which information must necessarily be specified to the system by the user to access a desired item of information. Finally, the use of AIDA in constructing DEW is evaluated.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Andrew, S.E.; Goldberg, Y.P.; Squitieri, F.

Huntington disease (HD) is one of 7 disorders now known to be caused by expansion of a trinucleotide repeat. The HD mutation is a polymorphic trinucleotide (CAG) repeat in the 5{prime} region of a novel gene that expands beyond the normal range of 10-35 repeats in persons destined to develop the disease. Haplotype analysis of other dynamic mutation disorders such as myotonic dystrophy and Fragil X have suggested that a rare ancestral expansion event on a normal chromosome is followed by subsequent expansion events, resulting in a pool of chromosomes in the premutation range, which is inherently unstable and pronemore » to further multiple expansion events leading to disease range chromosomes. Haplotype analysis of 67 HD and 84 control chromosomes using 5 polymorphic markers, both intragenic and 5{prime} to the disease mutation, demonstrate that multiple haplotypes underlie HD. However, 94% of the chromosomes can be grouped under two major haplotypes. These two haplotypes are also present in the normal population. A third major haplotype is seen on 38% of normal chromosomes but rarely on HD chromosomes (6%). CAG lengths on the normal chromosomes with the two haplotypes seen in the HD population are higher than those seen on the normal chromosomes with the haplotype rarely seen on HD chromosomes. Furthermore, in populations with a diminished frequency of HD, CAG length on normal chromosomes is significantly less than other populations with higher prevalence rates for HD. These data suggest that CAG length on normal chromosomes may be a significant factor contributing to repeat instability that eventually leads to chromosomes with CAG repeat lengths in the HD range. Haplotypes on the HD chromosomes are identical to those normal chromosomes which have CAG lengths in the high range of normal, suggesting that further expansions of this pool of chromosomes leads to chromosomes with CAG repeat sizes within the disease range, consistent with a multistep model.« less
Inferring mechanisms of copy number change from haplotype structures at the human DEFA1A3 locus.

PubMed

Black, Holly A; Khan, Fayeza F; Tyson, Jess; Al Armour, John

2014-07-21

The determination of structural haplotypes at copy number variable regions can indicate the mechanisms responsible for changes in copy number, as well as explain the relationship between gene copy number and expression. However, obtaining spatial information at regions displaying extensive copy number variation, such as the DEFA1A3 locus, is complex, because of the difficulty in the phasing and assembly of these regions. The DEFA1A3 locus is intriguing in that it falls within a region of high linkage disequilibrium, despite its high variability in copy number (n = 3-16); hence, the mechanisms responsible for changes in copy number at this locus are unclear. In this study, a region flanking the DEFA1A3 locus was sequenced across 120 independent haplotypes with European ancestry, identifying five common classes of DEFA1A3 haplotype. Assigning DEFA1A3 class to haplotypes within the 1000 Genomes project highlights a significant difference in DEFA1A3 class frequencies between populations with different ancestry. The features of each DEFA1A3 class, for example, the associated DEFA1A3 copy numbers, were initially assessed in a European cohort (n = 599) and replicated in the 1000 Genomes samples, showing within-class similarity, but between-class and between-population differences in the features of the DEFA1A3 locus. Emulsion haplotype fusion-PCR was used to generate 61 structural haplotypes at the DEFA1A3 locus, showing a high within-class similarity in structure. Structural haplotypes across the DEFA1A3 locus indicate that intra-allelic rearrangement is the predominant mechanism responsible for changes in DEFA1A3 copy number, explaining the conservation of linkage disequilibrium across the locus. The identification of common structural haplotypes at the DEFA1A3 locus could aid studies into how DEFA1A3 copy number influences expression, which is currently unclear.
Genetic structure of Phytophthora infestans populations in China indicates multiple migration events.

PubMed

Guo, Liyun; Zhu, Xiao-Qiong; Hu, Chia-Hui; Ristaino, Jean Beagle

2010-10-01

One hundred isolates of Phytophthora infestans collected from 10 provinces in China between 1998 and 2004 were analyzed for mating type, metalaxyl resistance, mitochondrial DNA (mtDNA) haplotype, allozyme genotype, and restriction fragment length polymorphism (RFLP) with the RG-57 probe. In addition, herbarium samples collected in China, Russia, Australia, and other Asian countries were also typed for mtDNA haplotype. The Ia haplotype was found during the first outbreaks of the disease in China (1938 and 1940), Japan (1901, 1930, and 1931), India (1913), Peninsular Malaysia (1950), Nepal (1954), The Philippines (1910), Australia (1917), Russia (1917), and Latvia (1935). In contrast, the Ib haplotype was found after 1950 in China on both potato and tomato (1952, 1954, 1956, and 1982) and in India (1968 and 1974). Another migration of a genotype found in Siberia called SIB-1 (Glucose-6-phosphate isomerase [Gpi] 100/100, Peptidase [Pep] 100/100, IIa mtDNA haplotype) was identified using RFLP fingerprints among 72% of the isolates and was widely distributed in the north and south of China and has also been reported in Japan. A new genotype named CN-11 (Gpi 100/111, Pep 100/100, IIb mtDNA haplotype), found only in the south of China, and two additional genotypes (Gpi 100/100, Pep 100/100, Ia mtDNA haplotype) named CN-9 and CN-10 were identified. There were more diverse genotypes among isolates from Yunnan province than elsewhere. The SIB-1 (IIa) genotype is identical to those from Siberia, suggesting later migration of this genotype from either Russia or Japan into China. The widespread predominance of SIB-1 suggests that this genotype has enhanced fitness compared with other genotypes found. Movement of the pathogen into China via infected seed from several sources most likely accounts for the distribution of pathogen genotypes observed. MtDNA haplotype evidence and RFLP data suggest multiple migrations of the pathogen into China after the initial introduction of the Ia haplotype in the 1930s.

Geographic Patterns of Genetic Variation in a Broadly Distributed Marine Vertebrate: New Insights into Loggerhead Turtle Stock Structure from Expanded Mitochondrial DNA Sequences

PubMed Central

Shamblin, Brian M.; Bolten, Alan B.; Abreu-Grobois, F. Alberto; Bjorndal, Karen A.; Cardona, Luis; Carreras, Carlos; Clusa, Marcel; Monzón-Argüello, Catalina; Nairn, Campbell J.; Nielsen, Janne T.; Nel, Ronel; Soares, Luciano S.; Stewart, Kelly R.; Vilaça, Sibelle T.; Türkozan, Oguz; Yilmaz, Can; Dutton, Peter H.

2014-01-01

Previous genetic studies have demonstrated that natal homing shapes the stock structure of marine turtle nesting populations. However, widespread sharing of common haplotypes based on short segments of the mitochondrial control region often limits resolution of the demographic connectivity of populations. Recent studies employing longer control region sequences to resolve haplotype sharing have focused on regional assessments of genetic structure and phylogeography. Here we synthesize available control region sequences for loggerhead turtles from the Mediterranean Sea, Atlantic, and western Indian Ocean basins. These data represent six of the nine globally significant regional management units (RMUs) for the species and include novel sequence data from Brazil, Cape Verde, South Africa and Oman. Genetic tests of differentiation among 42 rookeries represented by short sequences (380 bp haplotypes from 3,486 samples) and 40 rookeries represented by long sequences (∼800 bp haplotypes from 3,434 samples) supported the distinction of the six RMUs analyzed as well as recognition of at least 18 demographically independent management units (MUs) with respect to female natal homing. A total of 59 haplotypes were resolved. These haplotypes belonged to two highly divergent global lineages, with haplogroup I represented primarily by CC-A1, CC-A4, and CC-A11 variants and haplogroup II represented by CC-A2 and derived variants. Geographic distribution patterns of haplogroup II haplotypes and the nested position of CC-A11.6 from Oman among the Atlantic haplotypes invoke recent colonization of the Indian Ocean from the Atlantic for both global lineages. The haplotypes we confirmed for western Indian Ocean RMUs allow reinterpretation of previous mixed stock analysis and further suggest that contemporary migratory connectivity between the Indian and Atlantic Oceans occurs on a broader scale than previously hypothesized. This study represents a valuable model for conducting comprehensive international cooperative data management and research in marine ecology. PMID:24465810
Molecular and geographic evolutionary support for the essential role of GIGANTEAa in soybean domestication of flowering time.

PubMed

Wang, Yan; Gu, Yongzhe; Gao, Huihui; Qiu, Lijuan; Chang, Ruzhen; Chen, Shouyi; He, Chaoying

2016-04-12

Flowering time is a domestication trait of Glycine max and varies in soybeans, yet, a gene for flowering time variation has not been associated with soybean domestication. GIGANTEA (GI) is a major gene involved in the control of flowering time in Arabidopsis, although three GI homologs complicate this model in the soybean genome. In the present work, we revealed that the geographic evolution of the GIGANTEAa (GIa) haplotypes in G. max (GmGIa) and Glycine soja (GsGIa). Three GIa haplotypes (H1, H2, and H3) were found among cultivated soybeans and their wild relatives, yet an additional 44 diverse haplotypes were observed in wild soybeans. H1 had a premature stop codon in the 10(th) exon, whereas the other haplotypes encoded full-length GIa protein isoforms. In both wild-type and cultivated soybeans, H2 was present in the Southern region of China, and H3 was restricted to areas near the Northeast region of China. H1 was genetically derived from H2, and it was dominant and widely distributed among cultivated soybeans, whereas in wild populations, the ortholog of this domesticated haplotype H1 was only found in Yellow River basin with a low frequency. Moreover, this mutated GIa haplotype significantly correlated with early flowering. We further determined that the differences in gene expression of the three GmGIa haplotypes were not correlated to flowering time variations in cultivated soybeans. However, only the truncated GmGIa H1 could partially rescue gi-2 Arabidopsis from delayed flowering in transgenic plants, whereas both GmGIa H2 and H3 haplotypes could significantly repress flowering in transgenic Arabidopsis with a wild-type background. Thus, GmGIa haplotype diversification may have contributed to flowering time adaptation that facilitated the radiation of domesticated soybeans. In light of the evolution of the GIa gene, soybean domestication history for an early flowering phenotype is discussed.
Autosomal Dominant Retinal Dystrophies Caused by a Founder Splice Site Mutation, c.828+3A>T, in PRPH2 and Protein Haplotypes in trans as Modifiers

PubMed Central

Shankar, Suma P.; Hughbanks-Wheaton, Dianna K.; Birch, David G.; Sullivan, Lori S.; Conneely, Karen N.; Bowne, Sara J.; Stone, Edwin M.; Daiger, Stephen P.

2016-01-01

Purpose We determined the phenotypic variation, disease progression, and potential modifiers of autosomal dominant retinal dystrophies caused by a splice site founder mutation, c.828+3A>T, in the PRPH2 gene. Methods A total of 62 individuals (19 families) harboring the PRPH2 c.828+3A>T mutation, had phenotype analysis by fundus appearance, electrophysiology, and visual fields. The PRPH2 haplotypes in trans were sequenced for potential modifying variants and generalized estimating equations (GEE) used for statistical analysis. Results Several distinct phenotypes caused by the PRPH2 c.828+3A>T mutation were observed and fell into two clinical categories: Group I (N = 44) with mild pattern dystrophies (PD) and Group II (N = 18) with more severe cone-rod dystrophy (CRD), retinitis pigmentosa (RP), and central areolar chorioretinal dystrophy (CACD). The PRPH2 Gln304-Lys310-Asp338 protein haplotype in trans was found in Group I only (29.6% vs. 0%), whereas the Glu304-Lys310-Gly338 haplotype was predominant in Group II (94.4% vs. 70.4%). Generalized estimating equations analysis for PD versus the CRD/CACD/RP phenotypes in individuals over 43 years alone with the PRPH2 haplotypes in trans and age as predictors, adjusted for correlation within families, confirmed a significant effect of haplotype on severity (P = 0.03) with an estimated odds ratio of 7.16 (95% confidence interval [CI] = [2.8, 18.4]). Conclusions The PRPH2 c.828+3A>T mutation results in multiple distinct phenotypes likely modified by protein haplotypes in trans; the odds of having the CACD/RP-like phenotype (versus the PD phenotype) are 7.16 times greater with a Glu304-Lys310-Gly338 haplotype in trans. Further functional studies of the modifying haplotypes in trans and PRPH2 splice variants may offer therapeutic targets. PMID:26842753
Genetic variants in a haplotype block spanning IDE are significantly associated with plasma Abeta42 levels and risk for Alzheimer disease.

PubMed

Ertekin-Taner, Nilüfer; Allen, Mariet; Fadale, Daniel; Scanlin, Leah; Younkin, Linda; Petersen, Ronald C; Graff-Radford, Neill; Younkin, Steven G

2004-04-01

Risk for late onset Alzheimer disease (LOAD) and plasma amyloid beta levels (Abeta42; encoded by APP), an intermediate phenotype for LOAD, show linkage to chromosome 10q. Several strong candidate genes (VR22, PLAU, IDE) lie within the 1-lod support interval for linkage. Others have independently identified haplotypes in the chromosome 10q region harboring IDE that show highly significant association with intermediate AD phenotypes and with risk for AD. To pursue these associations, we analyzed the same haplotypes for association with plasma Abeta42 in 24 extended LOAD families and for association with LOAD in two independent case-control series. One series (MCR, 188 age-matched case-control pairs) did not show association (p=0.64) with the six haplotypes in the 276-kb region spanning three genes (IDE, KNSL1, and HHEX) previously shown to associate with LOAD. The other series (MCJ, 109 age-matched case-control pairs) showed significant (p=0.003) association with these haplotypes. In the MCJ series, the H4 (odds ratio [OR]=5.1, p=0.003) and H2(H7) haplotypes (OR=0.60, p=0.04) had the same effects previously reported. In this series, the H8 haplotype (OR=2.7, p=0.098) also had an effect similar as in one previous case control series but not in others. In the extended families, the H8 haplotype was associated with significantly elevated plasma Abeta42 (p=0.02). In addition, the H5(H10) haplotype, which is associated with reduced risk for AD in the other study is associated with reduced plasma Abeta42 (p=0.007) in our family series. These results provide strong evidence for pathogenic variant(s) in the 276-kb region harboring IDE that influence intermediate AD phenotypes and risk for AD. Copyright 2004 Wiley-Liss, Inc.
Congruence as a measurement of extended haplotype structure across the genome

PubMed Central

2012-01-01

Background Historically, extended haplotypes have been defined using only a few data points, such as alleles for several HLA genes in the MHC. High-density SNP data, and the increasing affordability of whole genome SNP typing, creates the opportunity to define higher resolution extended haplotypes. This drives the need for new tools that support quantification and visualization of extended haplotypes as defined by as many as 2000 SNPs. Confronted with high-density SNP data across the major histocompatibility complex (MHC) for 2,300 complete families, compiled by the Type 1 Diabetes Genetics Consortium (T1DGC), we developed software for studying extended haplotypes. Methods The software, called ExHap (Extended Haplotype), uses a similarity measurement we term congruence to identify and quantify long-range allele identity. Using ExHap, we analyzed congruence in both the T1DGC data and family-phased data from the International HapMap Project. Results Congruent chromosomes from the T1DGC data have between 96.5% and 99.9% allele identity over 1,818 SNPs spanning 2.64 megabases of the MHC (HLA-DRB1 to HLA-A). Thirty-three of 132 DQ-DR-B-A defined haplotype groups have > 50% congruent chromosomes in this region. For example, 92% of chromosomes within the DR3-B8-A1 haplotype are congruent from HLA-DRB1 to HLA-A (99.8% allele identity). We also applied ExHap to all 22 autosomes for both CEU and YRI cohorts from the International HapMap Project, identifying multiple candidate extended haplotypes. Conclusions Long-range congruence is not unique to the MHC region. Patterns of allele identity on phased chromosomes provide a simple, straightforward approach to visually and quantitatively inspect complex long-range structural patterns in the genome. Such patterns aid the biologist in appreciating genetic similarities and differences across cohorts, and can lead to hypothesis generation for subsequent studies. PMID:22369243
GMOMETHODS: the European Union database of reference methods for GMO analysis.

PubMed

Bonfini, Laura; Van den Bulcke, Marc H; Mazzara, Marco; Ben, Enrico; Patak, Alexandre

2012-01-01

In order to provide reliable and harmonized information on methods for GMO (genetically modified organism) analysis we have published a database called "GMOMETHODS" that supplies information on PCR assays validated according to the principles and requirements of ISO 5725 and/or the International Union of Pure and Applied Chemistry protocol. In addition, the database contains methods that have been verified by the European Union Reference Laboratory for Genetically Modified Food and Feed in the context of compliance with an European Union legislative act. The web application provides search capabilities to retrieve primers and probes sequence information on the available methods. It further supplies core data required by analytical labs to carry out GM tests and comprises information on the applied reference material and plasmid standards. The GMOMETHODS database currently contains 118 different PCR methods allowing identification of 51 single GM events and 18 taxon-specific genes in a sample. It also provides screening assays for detection of eight different genetic elements commonly used for the development of GMOs. The application is referred to by the Biosafety Clearing House, a global mechanism set up by the Cartagena Protocol on Biosafety to facilitate the exchange of information on Living Modified Organisms. The publication of the GMOMETHODS database can be considered an important step toward worldwide standardization and harmonization in GMO analysis.
Evidence of triple mutant Pfdhps ISGNGA haplotype in Plasmodium falciparum isolates from North-east India: An analysis of sulfadoxine resistant haplotype selection.

PubMed

Das, Manuj K; Chetry, Sumi; Kalita, Mohan C; Dutta, Prafulla

2016-12-01

North-east region of India has consistent role in the spread of multi drug resistant Plasmodium (P.) falciparum to other parts of Southeast Asia. After rapid clinical treatment failure of Artemisinin based combination therapy-Sulphadoxine/Pyrimethamine (ACT-SP) chemoprophylaxis, Artemether-Lumefantrine (ACT-AL) combination therapy was introduced in the year 2012 in this region for the treatment of uncomplicated P. falciparum malaria. In a DNA sequencing based polymorphism analysis, seven codons of P. falciparum dihydropteroate synthetase ( Pf dhps) gene were screened in a total of 127 P. falciparum isolates collected from Assam, Arunachal Pradesh and Tripura of North-east India during the year 2014 and 2015 to document current sulfadoxine resistant haplotypes. Sequences were analyzed to rearrange both nucleotide and protein haplotypes. Molecular diversity indices were analyzed in DNA Sequence Polymorphism software (DnaSP) on the basis of Pf dhps gene sequences. Disappearance from selective neutrality was assessed based on the ratio of non-synonomous to synonomous nucleotide substitutions [dN/dS ratio]. Moreover, two-tailed Z test was performed in search of the significance for probability of rejecting null hypothesis of strict neutrality [dN = dS]. Presence of mutant P. falciparum multidrug resistance protein1 ( Pf mdr1) was also checked in those isolates that were present with new Pf dhps haplotypes. Phylogenetic relationship based on Pf dhps gene was reconstructed in Molecular Evolutionary Genetics Analysis (MEGA). Among eight different sulfadoxine resistant haplotypes found, IS GNG A haplotype was documented in a total of five isolates from Tripura with association of a new mutant M538 R allele. Sequence analysis of Pf mdr1 gene in these five isolates came to notice that not all but only one isolate was mutant at codon 86 (N86 Y ; Y YSND) in the multidrug resistance protein. Molecular diversity based on Pf dhps haplotypes revealed that P. falciparum populations in Assam and Tripura were under balancing selection for sulfadoxine resistant haplotypes but population from Arunachal Pradesh was under positive selection with comparatively high haplotype diversity ( h = 0.870). In reconstructed phylogenetic analysis, isolates having IS GNG A haplotype were grouped into two separate sub-clusters from the other isolates based on their genetic distances and diversities. This study suggests that sulfadoxine resistant isolates are still migrating from its epicenter to the other parts of Southeast Asia and hence control and elimination of the drug resistant isolates have become impedimental. Moreover, P. falciparum populations in different areas may undergo selection of particular sulfadoxine resistant haplotypes either in the presence of drug or after its removal to maintain their plasticity.
23 CFR 972.204 - Management systems requirements.

Code of Federal Regulations, 2012 CFR

2012-04-01

... to operate and maintain the management systems and their associated databases; and (5) A process for... systems will use databases with a geographical reference system that can be used to geolocate all database...
23 CFR 972.204 - Management systems requirements.

Code of Federal Regulations, 2011 CFR

2011-04-01

... to operate and maintain the management systems and their associated databases; and (5) A process for... systems will use databases with a geographical reference system that can be used to geolocate all database...
23 CFR 972.204 - Management systems requirements.

Code of Federal Regulations, 2010 CFR

2010-04-01

... to operate and maintain the management systems and their associated databases; and (5) A process for... systems will use databases with a geographical reference system that can be used to geolocate all database...
23 CFR 972.204 - Management systems requirements.

Code of Federal Regulations, 2013 CFR

2013-04-01

... to operate and maintain the management systems and their associated databases; and (5) A process for... systems will use databases with a geographical reference system that can be used to geolocate all database...
Interrelationships between Amerindian tribes of lower Amazonia as manifest by HLA haplotype disequilibria.

PubMed

Black, F L

1984-11-01

HLA B-C haplotypes exhibit common disequilibria in populations drawn from four continents, indicating that they are subject to broadly active selective forces. However, the A-B and A-C associations we have examined show no consistent disequilibrium pattern, leaving open the possibility that these disequilibria are due to descent from common progenitors. By examining HLA haplotype distributions, I have explored the implications that would follow from the hypothesis that biological selection played no role in determining A-C disequilibria in 10 diverse tribes of the lower Amazon Basin. Certain haplotypes are in strong positive disequilibria across a broad geographic area, suggesting that members of diverse tribes descend from common ancestors. On the basis of the extent of diffusion of the components of these haplotypes, one can estimate that the progenitors lived less than 6,000 years ago. One widely encountered lineage entered the area within the last 1,200 years. When haplotype frequencies are used in genetic distance measurements, they give a pattern of relationships very similar to that obtained by conventional chord measurements based on several genetic markers; but more than that, when individual haplotype disequilibria in the several tribes are compared, multiple origins of a single tribe are discernible and relationships are revealed that correlate more closely to geographic and linguistic patterns than do the genetic distance measurements.
The interactive effects of child maltreatment and the FK506 binding protein 5 gene (FKBP5) on dissociative symptoms in adolescence.

PubMed

Yaylaci, Fatima Tuba; Cicchetti, Dante; Rogosch, Fred A; Bulut, Okan; Hetzel, Susan R

2017-08-01

The FK506 binding protein 5 gene (FKBP5) has been associated with susceptibility to pathogenic effects of childhood trauma including dissociative symptoms. This study examines the impact of maltreatment on dissociative tendencies in adolescence as moderated by the FKBP5 gene. Dissociative symptoms and variation within FKBP5 were assessed in a high-risk, low socioeconomic status community sample of 279 maltreated and 171 nonmaltreated adolescents. Following the assignment of haplotypes across four single nucleotide polymorphisms (rs3800373, rs9296158, rs1360780, and rs9470080), individuals with one or more copies of the CATT haplotype (N = 230) were grouped together and compared to individuals with zero copies of this haplotype (N = 185). Analyses of covariance were conducted to test hypotheses regarding the effects of developmental timing and the chronicity of maltreatment and the CATT haplotype. We found a significant interactive effect of timing/chronicity of maltreatment and the CATT haplotype on dissociative symptoms. Among adolescents who had no copies of the CATT haplotype, dissociative symptoms were higher for chronically maltreated adolescents who had an infancy onset compared to those who were not maltreated or whose maltreatment experience was either relatively less chronic or not started in infancy. The groups did not differ significantly among subjects who carry one or more copies of the CATT haplotype.
[Developing forensic reference database by 18 autosomal STR for DNA identification in Republic of Belarus].

PubMed

Tsybovskii, I S; Veremeichik, V M; Kotova, S A; Kritskaya, S V; Evmenenko, S A; Udina, I G

2017-02-01

For the Republic of Belarus, development of a forensic reference database on the basis of 18 autosomal microsatellites (STR) using a population dataset (N = 1040), “familial” genotypic dataset (N = 2550) obtained from expertise performance of paternity testing, and a dataset of genotypes from a criminal registration database (N = 8756) is described. Population samples studied consist of 80% ethnic Belarusians and 20% individuals of other nationality or of mixed origin (by questionnaire data). Genotypes of 12346 inhabitants of the Republic of Belarus from 118 regional samples studied by 18 autosomal microsatellites are included in the sample: 16 tetranucleotide STR (D2S1338, TPOX, D3S1358, CSF1PO, D5S818, D8S1179, D7S820, THO1, vWA, D13S317, D16S539, D18S51, D19S433, D21S11, F13B, and FGA) and two pentanucleotide STR (Penta D and Penta E). The samples studied are in Hardy–Weinberg equilibrium according to distribution of genotypes by 18 STR. Significant differences were not detected between discrete populations or between samples from various historical ethnographic regions of the Republic of Belarus (Western and Eastern Polesie, Podneprovye, Ponemanye, Poozerye, and Center), which indicates the absence of prominent genetic differentiation. Statistically significant differences between the studied genotypic datasets also were not detected, which made it possible to combine the datasets and consider the total sample as a unified forensic reference database for 18 “criminalistic” STR loci. Differences between reference database of the Republic of Belarus and Russians and Ukrainians by the distribution of the range of autosomal STR also were not detected, corresponding to a close genetic relationship of the three Eastern Slavic nations mediated by common origin and intense mutual migrations. Significant differences by separate STR loci between the reference database of Republic of Belarus and populations of Southern and Western Slavs were observed. The necessity of using original reference database for support of forensic expertise practice in the Republic of Belarus was demonstrated.
Spectrum of sequence variations in the FANCA gene: an International Fanconi Anemia Registry (IFAR) study.

PubMed

Levran, Orna; Diotti, Raffaella; Pujara, Kanan; Batish, Sat D; Hanenberg, Helmut; Auerbach, Arleen D

2005-02-01

Fanconi anemia (FA) is an autosomal recessive disorder that is defined by cellular hypersensitivity to DNA cross-linking agents, and is characterized clinically by developmental abnormalities, progressive bone-marrow failure, and predisposition to leukemia and solid tumors. There is extensive genetic heterogeneity, with at least 11 different FA complementation groups. FA-A is the most common group, accounting for approximately 65% of all affected individuals. The mutation spectrum of the FANCA gene, located on chromosome 16q24.3, is highly heterogeneous. Here we summarize all sequence variations (mutations and polymorphisms) in FANCA described in the literature and listed in the Fanconi Anemia Mutation Database as of March 2004, and report 61 novel FANCA mutations identified in FA patients registered in the International Fanconi Anemia Registry (IFAR). Thirty-eight novel SNPs, previously unreported in the literature or in dbSNP, were also identified. We studied the segregation of common FANCA SNPs in FA families to generate haplotypes. We found that FANCA SNP data are highly useful for carrier testing, prenatal diagnosis, and preimplantation genetic diagnosis, particularly when the disease-causing mutations are unknown. Twenty-two large genomic deletions were identified by detection of apparent homozygosity for rare SNPs. In addition, a conserved SNP haplotype block spanning at least 60 kb of the FANCA gene was identified in individuals from various ethnic groups. (c) 2005 Wiley-Liss, Inc.
Resurrection of New Caledonian maskray Neotrygon trigonoides (Myliobatoidei: Dasyatidae) from synonymy with N. kuhlii, based on cytochrome-oxidase I gene sequences and spotting patterns.

PubMed

Borsa, Philippe; Arlyza, Irma S; Chen, Wei-Jen; Durand, Jean-Dominique; Meekan, Mark G; Shen, Kang-Ning

2013-04-01

The maskray from New Caledonia, Neotrygon trigonoides Castelnau, 1873, has been recently synonymized with the blue-spotted maskray, N. kuhlii (Müller and Henle, 1841), a species with wide Indo-West Pacific distribution, but the reasons for this are unclear. Blue-spotted maskray specimens were collected from the Indian Ocean (Tanzania, Sumatra) and the Coral Triangle (Indonesia, Taiwan, and West Papua), and N. trigonoides specimens were collected from New Caledonia (Coral-Sea). Their partial COI gene sequences were generated to expand the available DNA-barcode database on this species, which currently comprises homologous sequences from Ningaloo Reef, the Coral Triangle and the Great Barrier Reef (Coral-Sea). Spotting patterns were also compared across regions. Haplotypes from the Coral-Sea formed a haplogroup phylogenetically distinct from all other haplotypes sampled in the Indo-West Pacific. No clear-cut geographic composition relative to DNA-barcodes or spotting patterns was apparent in N. kuhlii samples across the Indian Ocean and the Coral Triangle. The New Caledonian maskray had spotting patterns markedly different from all the other samples. This, added to a substantial level of net nucleotide divergence (2.6%) with typical N. kuhlii justifies considering the New Caledonian maskray as a separate species, for which we propose to resurrect the name Neotrygon trigonoides. Copyright © 2013. Published by Elsevier SAS.
Longitudinal analysis of haplotypes and polymorphisms of the APOA5 and APOC3 genes associated with variation in serum triglyceride levels: the Bogalusa Heart Study.

PubMed

Hallman, D Michael; Srinivasan, Sathanur R; Chen, Wei; Boerwinkle, Eric; Berenson, Gerald S

2006-12-01

Polymorphisms in the APOC3 and APOA5 genes, from the APOA1/APOC3/APOA4/APOA5 gene cluster on chromosome 11q23, have been associated with interindividual variation in plasma triglycerides. APOA5 polymorphisms implicated include 2 in the promoter region (-1131 T/C and -3 A/G) and 1 in exon 2 (+56 C/G). APOC3 polymorphisms implicated include 1 (SstI) in the 3' untranslated region and 1 (-2854 G/T) in the APOC3-APOA4 intergenic region. We analyzed the associations of haplotypes and multilocus genotypes of these polymorphisms on longitudinal serum triglyceride profiles in 360 African American and 823 white subjects from the Bogalusa Heart Study. Subjects were examined from 2 to 8 times (mean +/- SD, 5.4 +/- 1.3) between 1973 and 1996, at ages ranging from 4 to 38 years, with 1978 observations in African Americans and 4465 in whites. Serum triglycerides were significantly higher among whites across all ages. Allele frequencies differed significantly between African Americans and whites at all but the APOA5 +56 C/G locus. Linkage disequilibrium among the loci was higher in whites and haplotype diversity lower: 6 haplotypes had estimated frequencies of more than 1% in African Americans, 5 in whites. Individually, all polymorphisms except APOC3 -2854 G/T showed significant associations with triglyceride levels in the full sample. However, genotype models including all 5 loci showed significant triglyceride associations for only 3 (APOC3 SstI, APOA5 -1131 T/C, and APOA5 +56 C/G); significant interactions among them indicated their effects were not independent. Neither APOC3 -2854 G/T nor APOA5 -3 A/G had significant effects when the other 3 loci were in the models. The EM algorithm was used to estimate haplotype frequencies and assign haplotype probabilities to individuals, which is conditional on their genotypes; individuals' haplotype probability vectors were then used as predictors in multilevel mixed models of longitudinal triglyceride profiles. Of haplotypes comprising, in order, APOC3 SstI and -2854 G/T and APOA5 -1131 T/C, -3 A/G, and +56 C/G, 3 were significantly associated with higher triglycerides, even after adjusting for multiple tests: GGTAG (P = .002), GTTAG (P < .0001), and CGCGC (P = .0002). Each GGTAG haplotype carried would be expected to raise triglyceride levels (relative to those of GTTAC homozygotes) by approximately 19 mg/dL, each GTTAG haplotype by approximately 15 mg/dL, and each CGCGC haplotype by approximately 7 mg/dL. Haplotypes comprising the 3 loci implicated by genotype analyses (SstI, -1131 T/C, and +56 C/G) were also tested: haplotypes C_C_C and G_T_G significantly raised triglycerides, even after adjustment for multiple comparisons (P < .002 for both), with each copy of C_C_C expected to raise triglycerides by approximately 7 mg/dL and each copy of G_T_G by approximately 15 mg/dL. Overall, our findings support those of others in associating specific polymorphisms and haplotypes in the APOA1/C3/A4/A5 gene cluster with higher serum triglyceride levels. However, the degree to which polymorphisms in the APOC3 and APOA5 genes may be independently associated with triglyceride levels remains to be determined.
Effect of malaria transmission reduction by insecticide-treated bed nets (ITNs) on the genetic diversity of Plasmodium falciparum merozoite surface protein (MSP-1) and circumsporozoite (CSP) in western Kenya.

PubMed

Kariuki, Simon K; Njunge, James; Muia, Ann; Muluvi, Geofrey; Gatei, Wangeci; Ter Kuile, Feiko; Terlouw, Dianne J; Hawley, William A; Phillips-Howard, Penelope A; Nahlen, Bernard L; Lindblade, Kim A; Hamel, Mary J; Slutsker, Laurence; Shi, Ya Ping

2013-08-27

Although several studies have investigated the impact of reduced malaria transmission due to insecticide-treated bed nets (ITNs) on the patterns of morbidity and mortality, there is limited information on their effect on parasite diversity. Sequencing was used to investigate the effect of ITNs on polymorphisms in two genes encoding leading Plasmodium falciparum vaccine candidate antigens, the 19 kilodalton blood stage merozoite surface protein-1 (MSP-1(19kDa)) and the Th2R and Th3R T-cell epitopes of the pre-erythrocytic stage circumsporozoite protein (CSP) in a large community-based ITN trial site in western Kenya. The number and frequency of haplotypes as well as nucleotide and haplotype diversity were compared among parasites obtained from children <5 years old prior to the introduction of ITNs (1996) and after 5 years of high coverage ITN use (2001). A total of 12 MSP-1(19kDa) haplotypes were detected in 1996 and 2001. The Q-KSNG-L and E-KSNG-L haplotypes corresponding to the FVO and FUP strains of P. falciparum were the most prevalent (range 32-37%), with an overall haplotype diversity of > 0.7. No MSP-1(19kDa) 3D7 sequence-types were detected in 1996 and the frequency was less than 4% in 2001. The CSP Th2R and Th3R domains were highly polymorphic with a total of 26 and 14 haplotypes, respectively detected in 1996 and 34 and 13 haplotypes in 2001, with an overall haplotype diversity of > 0.9 and 0.75 respectively. The frequency of the most predominant Th2R and Th3R haplotypes was 14 and 36%, respectively. The frequency of Th2R and Th3R haplotypes corresponding to the 3D7 parasite strain was less than 4% at both time points. There was no significant difference in nucleotide and haplotype diversity in parasite isolates collected at both time points. High diversity in these two genes has been maintained overtime despite marked reductions in malaria transmission due to ITNs use. The frequency of 3D7 sequence-types was very low in this area. These findings provide information that could be useful in the design of future malaria vaccines for deployment in endemic areas with high ITN coverage and in interpretation of efficacy data for malaria vaccines based on 3D7 parasite strains.
Mineralocorticoid receptor haplotype, estradiol, progesterone and emotional information processing.

PubMed

Hamstra, Danielle A; de Kloet, E Ronald; Quataert, Ina; Jansen, Myrthe; Van der Does, Willem

2017-02-01

Carriers of MR-haplotype 1 and 3 (GA/CG; rs5522 and rs2070951) are more sensitive to the influence of oral contraceptives (OC) and menstrual cycle phase on emotional information processing than MR-haplotype 2 (CA) carriers. We investigated whether this effect is associated with estradiol (E2) and/or progesterone (P4) levels. Healthy MR-genotyped premenopausal women were tested twice in a counterbalanced design. Naturally cycling (NC) women were tested in the early-follicular and mid-luteal phase and OC-users during OC-intake and in the pill-free week. At both sessions E2 and P4 were assessed in saliva. Tests included implicit and explicit positive and negative affect, attentional blink accuracy, emotional memory, emotion recognition, and risky decision-making (gambling). MR-haplotype 2 homozygotes had higher implicit happiness scores than MR-haplotype 2 heterozygotes (p=0.031) and MR-haplotype 1/3 carriers (p<0.001). MR-haplotype 2 homozygotes also had longer reaction times to happy faces in an emotion recognition test than MR-haplotype 1/3 (p=0.001). Practice effects were observed for most measures. The pattern of correlations between information processing and P4 or E2 differed between sessions, as well as the moderating effects of the MR genotype. In the first session the MR-genotype moderated the influence of P4 on implicit anxiety (sr=-0.30; p=0.005): higher P4 was associated with reduction in implicit anxiety, but only in MR-haplotype 2 homozygotes (sr=-0.61; p=0.012). In the second session the MR-genotype moderated the influence of E2 on the recognition of facial expressions of happiness (sr=-0.21; p=0.035): only in MR-haplotype 1/3 higher E2 was correlated with happiness recognition (sr=0.29; p=0.005). In the second session higher E2 and P4 were negatively correlated with accuracy in lag2 trials of the attentional blink task (p<0.001). Thus NC women, compared to OC-users, performed worse on lag 2 trials (p=0.041). The higher implicit happiness scores of MR-haplotype 2 homozygotes are in line with previous reports. Performance in the attentional blink task may be influenced by OC-use. The MR-genotype moderates the influence of E2 and P4 on emotional information processing. This moderating effect may depend on the novelty of the situation. Copyright © 2016 Elsevier Ltd. All rights reserved.
Assignment of the SLA alleles and reproductive potential of selective breeding Duroc pig lines.

PubMed

Soe, Ok Kar; Ohba, Yasunori; Imaeda, Noriaki; Nishii, Naohito; Takasu, Masaki; Yoshioka, Gou; Kawata, Hisako; Shigenari, Atsuko; Uenishi, Hirohide; Inoko, Hidetoshi; Ando, Asako; Kitagawa, Hitoshi

2008-01-01

Pigs with defined swine leukocyte antigen (SLA) haplotypes and their detailed information are useful for transplantation and immunological studies. We developed two herds of SLA homozygous Duroc pigs with novel SLA haplotypes and characterized their reproductive potential. For selective inbreeding, a pair of Duroc pigs was chosen as initial breeders, and substantial breeding within progenies was carried out for eight generations. In the selective breeding Duroc pigs, SLA haplotypes were assigned by nucleotide sequence determination of reverse transcription polymerase chain reaction (RT-PCR) products of three SLA classical class I genes and two class II genes. Based on this sequence information, we developed a rapid and simple SLA class II DNA typing method by polymerase chain reaction-sequence specific primer (PCR-SSP) technique. As a complementary method for the characterization of the SLA haplotypes, genetic polymorphisms of 36 microsatellite (MS) markers within the SLA region were also analyzed in the selective breeding pigs with SLA homozygous/heterozygous haplotypes. Among the selective breeding pigs from the third to fifth generations, only two SLA haplotypes were identified by the RT-PCR based SLA typing method; Hp-27.30 (SLA-1*08an03, SLA-1*06an04, SLA-2*0102, SLA-3*0101 DRB1*1101 and DQB1*0503) and Hp-60.13 (SLA-1*an02, SLA-2*1002, SLA-3*0502, DRB1*0403 and DQB1*0303). In these two SLA haplotypes, two class I haplotypes, Hp-27.0 and Hp-60.0, are novel. Furthermore, two class II haplotypes, Hp-0.30 and Hp-0.13, which were previously reported in Korean native pigs and pigs of Hanford breed, respectively, were also assigned by a simple assay using a PCR-SSP technique in the entire selective breeding stock. Moreover, two haplotype specific MS patterns were observed across the entire SLA region in the selective breeding (homozygous/heterozygous) pigs. No morphological abnormalities were observed in selective breeding pigs. The theoretical inbreeding coefficient at the eighth generation was 78.5%. In all generations of selective breeding pigs, litter sizes were comparable and weaning weights from the fifth to eighth generation produced progenies significantly lighter (P < 0.01) than those in the non-selective breeding pigs. We established and characterized SLA homozygous Duroc herds with two kinds of haplotypes that can be used as a new resource for transplantation and other biomedical studies.

Meta-analysis of haplotype-association studies: comparison of methods and empirical evaluation of the literature

PubMed Central

2011-01-01

Background Meta-analysis is a popular methodology in several fields of medical research, including genetic association studies. However, the methods used for meta-analysis of association studies that report haplotypes have not been studied in detail. In this work, methods for performing meta-analysis of haplotype association studies are summarized, compared and presented in a unified framework along with an empirical evaluation of the literature. Results We present multivariate methods that use summary-based data as well as methods that use binary and count data in a generalized linear mixed model framework (logistic regression, multinomial regression and Poisson regression). The methods presented here avoid the inflation of the type I error rate that could be the result of the traditional approach of comparing a haplotype against the remaining ones, whereas, they can be fitted using standard software. Moreover, formal global tests are presented for assessing the statistical significance of the overall association. Although the methods presented here assume that the haplotypes are directly observed, they can be easily extended to allow for such an uncertainty by weighting the haplotypes by their probability. Conclusions An empirical evaluation of the published literature and a comparison against the meta-analyses that use single nucleotide polymorphisms, suggests that the studies reporting meta-analysis of haplotypes contain approximately half of the included studies and produce significant results twice more often. We show that this excess of statistically significant results, stems from the sub-optimal method of analysis used and, in approximately half of the cases, the statistical significance is refuted if the data are properly re-analyzed. Illustrative examples of code are given in Stata and it is anticipated that the methods developed in this work will be widely applied in the meta-analysis of haplotype association studies. PMID:21247440
Haplotype-Based Genome-Wide Prediction Models Exploit Local Epistatic Interactions Among Markers

PubMed Central

Jiang, Yong; Schmidt, Renate H.; Reif, Jochen C.

2018-01-01

Genome-wide prediction approaches represent versatile tools for the analysis and prediction of complex traits. Mostly they rely on marker-based information, but scenarios have been reported in which models capitalizing on closely-linked markers that were combined into haplotypes outperformed marker-based models. Detailed comparisons were undertaken to reveal under which circumstances haplotype-based genome-wide prediction models are superior to marker-based models. Specifically, it was of interest to analyze whether and how haplotype-based models may take local epistatic effects between markers into account. Assuming that populations consisted of fully homozygous individuals, a marker-based model in which local epistatic effects inside haplotype blocks were exploited (LEGBLUP) was linearly transformable into a haplotype-based model (HGBLUP). This theoretical derivation formally revealed that haplotype-based genome-wide prediction models capitalize on local epistatic effects among markers. Simulation studies corroborated this finding. Due to its computational efficiency the HGBLUP model promises to be an interesting tool for studies in which ultra-high-density SNP data sets are studied. Applying the HGBLUP model to empirical data sets revealed higher prediction accuracies than for marker-based models for both traits studied using a mouse panel. In contrast, only a small subset of the traits analyzed in crop populations showed such a benefit. Cases in which higher prediction accuracies are observed for HGBLUP than for marker-based models are expected to be of immediate relevance for breeders, due to the tight linkage a beneficial haplotype will be preserved for many generations. In this respect the inheritance of local epistatic effects very much resembles the one of additive effects. PMID:29549092
Aldehyde dehydrogenase-2 genotypes and HLA haplotypes in Japanese patients with esophageal cancer.

PubMed

Watanabe, Seishiro; Sasahara, Katsuyuki; Kinekawa, Fumihiko; Uchida, Naohito; Masaki, Tsutomu; Kurokohchi, Kazutaka; Murota, Masayuki; Touge, Tetsuo; Kawauchi, Kazuyoshi; Oda, Syuji; Kuriyama, Shigeki

2002-01-01

The aim of this study was to examine how aldehyde dehydrogenase-2 (ALDH2) genotypes and human leukocyte antigen (HLA) haplotypes contribute to the risk for esophageal cancer. We examined ALDH2 genotypes and HLA haplotypes in 29 Japanese patients with esophageal cancer. The ratio of patients who experienced current or former intense vasodilatation upon consuming alcohol (flushing type) was much higher in individuals with the inactive form of ALDH2 encoded by the ALDH2(2)/2(2) or ALDH2(1)/2(2) genotype than in those with the active form of ALDH2 encoded by the ALDH2(1)/2(1) genotype. The ratio of inactive ALDH2 was significantly higher in patients with esophageal cancer than in control normal subjects, suggesting that alcoholics with inactive ALDH2 were susceptible to esophageal cancer. HLA haplotypes A24, A26, B54, B61 and DR9 were prevalent in patients with esophageal cancer (82.8, 24.1, 34.5, 37.9 and 44.8%, respectively). HLA haplotype of A24 and inactive ALDH2 were simultaneously found in 58.6% of patients with esophageal cancer. Furthermore, we found other primary malignancies in 6 of 29 (20.7%) patients with esophageal cancer, and 4 of these 6 patients had both the inactive form of ALDH2 and the HLA A24 haplotype. The present study showed the high prevalence of the inactive form of ALDH2 and HLA haplotypes A24, A26, B54, B61 and DR9 in Japanese patients with esophageal cancer. Therefore, the examination of genotypes of ALDH2 loci and HLA haplotypes may allow the early detection of esophageal cancer in the Japanese population.
Dimensional Anxiety Mediates Linkage of GABRA2 Haplotypes With Alcoholism

PubMed Central

Enoch, Mary-Anne; Schwartz, Lori; Albaugh, Bernard; Virkkunen, Matti; Goldman, David

2015-01-01

The GABAAα2 receptor gene (GABRA2) modulates anxiety and stress response. Three recent association studies implicate GABRA2 in alcoholism, however in these papers both common, opposite-configuration haplotypes in the region distal to intron3 predict risk. We have now replicated the GABRA2 association with alcoholism in 331 Plains Indian men and women and 461 Finnish Caucasian men. Using a dimensional measure of anxiety, harm avoidance (HA), we also found that the association with alcoholism is mediated, or moderated, by anxiety. Nine SNPs were genotyped revealing two haplotype blocks. Within the previously implicated block 2 region, we identified the two common, opposite-configuration risk haplotypes, A and B. Their frequencies differed markedly in Finns and Plains Indians. In both populations, most block 2 SNPs were significantly associated with alcoholism. The associations were due to increased frequencies of both homozygotes in alcoholics, indicating the possibility of alcoholic subtypes with opposite genotypes. Congruently, there was no significant haplotype association. Using HA as an indicator variable for anxiety, we found haplotype linkage to alcoholism with high and low dimensional anxiety, and to HA itself, in both populations. High HA alcoholics had the highest frequency of the more abundant haplotype (A in Finns, B in Plains Indians); low HA alcoholics had the highest frequency of the less abundant haplotype (B in Finns, A in Plains Indians) (Finns: P α0.007, OR α2.1, Plains Indians: P α0.040, OR α1.9). Non-alcoholics had intermediate frequencies. Our results suggest that within the distal GABRA2 region is a functional locus or loci that may differ between populations but that alters risk for alcoholism via the mediating action of anxiety. PMID:16874763
A Haplotype Information Theory Method Reveals Genes of Evolutionary Interest in European vs. Asian Pigs.

PubMed

Hudson, Nicholas J; Naval-Sánchez, Marina; Porto-Neto, Laercio; Pérez-Enciso, Miguel; Reverter, Antonio

2018-06-05

Asian and European wild boars were independently domesticated ca. 10,000 years ago. Since the 17th century, Chinese breeds have been imported to Europe to improve the genetics of European animals by introgression of favourable alleles, resulting in a complex mosaic of haplotypes. To interrogate the structure of these haplotypes further, we have run a new haplotype segregation analysis based on information theory, namely compression efficiency (CE). We applied the approach to sequence data from individuals from each phylogeographic region (n = 23 from Asia and Europe) including a number of major pig breeds. Our genome-wide CE is able to discriminate the breeds in a manner reflecting phylogeography. Furthermore, 24,956 non-overlapping sliding windows (each comprising 1,000 consecutive SNP) were quantified for extent of haplotype sharing within and between Asia and Europe. The genome-wide distribution of extent of haplotype sharing was quite different between groups. Unlike European pigs, Asian pigs haplotype sharing approximates a normal distribution. In line with this, we found the European breeds possessed a number of genomic windows of dramatically higher haplotype sharing than the Asian breeds. Our CE analysis of sliding windows capture some of the genomic regions reported to contain signatures of selection in domestic pigs. Prominent among these regions, we highlight the role of a gene encoding the mitochondrial enzyme LACTB which has been associated with obesity, and the gene encoding MYOG a fundamental transcriptional regulator of myogenesis. The origin of these regions likely reflects either a population bottleneck in European animals, or selective targets on commercial phenotypes reducing allelic diversity in particular genes and/or regulatory regions.
Rapid growth of a Eurasian haplotype of Phragmites australis in a restored brackish marsh in Louisiana, USA

USGS Publications Warehouse

Howard, R.J.; Travis, S.E.; Sikes, B.A.

2008-01-01

While numerous studies have documented patterns of invasion by non-indigenous plant species, few have considered the invasive properties of non-native genotypes of native species. Characteristics associated with specific genotypes, such as tolerance to disturbance, may mistakenly be applied to an entire species in the absence of genetic information, which consequently may affect management decisions. We report here on the incidence and growth of an introduced lineage of Phragmites australis in the Gulf of Mexico coastal zone of Louisiana. P. australis was collected from nine separate locations for inclusion in a series of growth experiments. Chloroplast DNA analysis indicated that specimens collected from four locations in the Mississippi River Delta represented the introduced Eurasian haplotype; the remainder represented the gulf coast haplotype. Three distinct genotypes, or clones, were identified within each haplotype via analysis using amplified fragment length polymorphisms, which also revealed reduced genetic diversity of the gulf coast clones compared to the Eurasian clones. Clones of each haplotype were planted along with three other native macrophytes at similar densities in a restored brackish marsh and monitored for growth. After 14 months, the Eurasian haplotype had spread vegetatively to cover about 82% of the experimental plots, more than four times the coverage (18%) of the gulf coast haplotype. Thus, the use of P. australis plantings for wetland restoration should consider the genetic lineage of plants used since our results indicate the potential of the Eurasian haplotype to grow rapidly at newly restored sites. This rapid growth may limit the establishment of more slowly growing native species. ?? 2007 Springer Science+Business Media B.V.
IL7Rα Expression and Upregulation by IFNβ in Dendritic Cell Subsets Is Haplotype-Dependent

PubMed Central

McKay, Fiona C.; Hoe, Edwin; Parnell, Grant; Gatt, Prudence; Schibeci, Stephen D.; Stewart, Graeme J.; Booth, David R.

2013-01-01

The IL7Rα gene is unequivocally associated with susceptibility to multiple sclerosis (MS). Haplotype 2 (Hap 2) confers protection from MS, and T cells and dendritic cells (DCs) of Hap 2 exhibit reduced splicing of exon 6, resulting in production of relatively less soluble receptor, and potentially more response to ligand. We have previously shown in CD4 T cells that IL7Rα haplotypes 1 and 2, but not 4, respond to interferon beta (IFNβ), the most commonly used immunomodulatory drug in MS, and that haplotype 4 (Hap 4) homozygotes have the highest risk of developing MS. We now show that IL7R expression increases in myeloid cells in response to IFNβ, but that the response is haplotype-dependent, with cells from homozygotes for Hap 4 again showing no response. This was shown using freshly derived monocytes, in vitro cultured immature and mature monocyte-derived dendritic cells, and by comparing homozygotes for the common haplotypes, and relative expression of alleles in heterozygotes (Hap 4 vs not Hap 4). As for T cells, in all myeloid cell subsets examined, Hap 2 homozygotes showed a trend for reduced splicing of exon 6 compared to the other haplotypes, significantly so in most conditions. These data are consistent with increased signaling being protective from MS, constitutively and in response to IFNβ. We also demonstrate significant regulation of immune response, chemokine activity and cytokine biosynthesis pathways by IL7Rα signaling in IFNβ -treated myeloid subsets. IFNβ-responsive genes are over-represented amongst genes associated with MS susceptibility. IL7Rα haplotype may contribute to MS susceptibility through reduced capacity for IL7Rα signalling in myeloid cells, especially in the presence of IFNβ, and is currently under investigation as a predictor of therapeutic response. PMID:24147013
The Functional SNPs in the 5’ Regulatory Region of the Porcine PPARD Gene Have Significant Association with Fat Deposition Traits

PubMed Central

Hu, Shanyao; Lin, Bin; Yan, Dechao; Xu, Zaiyan; Zhang, Zijun; Mao, Yuanliang; Mao, Huimin; Wang, Litong; Wang, Guoshui; Xiong, Yuanzhu; Zuo, Bo

2015-01-01

Peroxisome proliferator-activated receptor delta (PPARD) is a key regulator of lipid metabolism, insulin sensitivity, cell proliferation and differentiation. In this study, we identified two Single Nucleotide Polymorphisms (SNPs, g.1015 A>G and g.1018 T>C) constituting four haplotypes (GT, GC, AC and AT) in the 5’ regulatory region of porcine PPARD gene. Functional analysis of the four haplotypes showed that the transcriptional activity of the PPARD promoter fragment carrying haplotype AC was significantly lower than that of the other haplotypes in 3T3-L1, C2C12 and PK-15 cells, and haplotype AC had the lowest binding capacities to the nuclear extracts. Transcription factor 7-like 2 (TCF7L2) enhanced the transcription activities of promoter fragments of PPARD gene carrying haplotypes GT, GC and AT in C2C12 and 3T3-L1 cells, and increased the protein expression of PPARD gene in C2C12 myoblasts. TCF7L2 differentially bound to the four haplotypes, and the binding capacity of TCF7L2 to haplotype AC was the lowest. There were significant associations between -655A/G and fat deposition traits in three pig populations including the Large White × Meishan F2 pigs, France and American Large White pigs. Pigs with genotype GG had significantly higher expression of PPARD at both mRNA and protein level than those with genotype AG. These results strongly suggested that the SNPs in 5’ regulatory region of PPARD genes had significant impact on pig fat deposition traits. PMID:26599230
Sequence polymorphism at the human apolipoprotein AII gene ( APOA2): unexpected deficit of variation in an African-American sample.

PubMed

Fullerton, Stephanie M; Clark, Andrew G; Weiss, Kenneth M; Taylor, Scott L; Stengård, Jari H; Salomaa, Veikko; Boerwinkle, Eric; Nickerson, Deborah A

2002-07-01

A 3.3-kb region, encompassing the APOA2 gene and 2 kb of 5' and 3' flanking DNA, was re-sequenced in a "core" sample of 24 individuals, sampled without regard to the health from each of three populations: African-Americans from Jackson (Miss., USA), Europeans from North Karelia (Finland), and non-Hispanic European-Americans from Rochester, (Minn., USA). Fifteen variable sites were identified (14 SNPs and one multi-allelic microsatellite, all silent), and these sites segregated as 18 sequence haplotypes (or nine, if SNPs only are considered). The haplotype distribution in the core African-American sample was unusual, with a deficit of particular haplotypes compared with those found in the other two samples, and a significantly (P<0.05) low level of nucleotide diversity relative to patterns of polymorphism and divergence at other human loci. Six of the 14 SNPs, whose variation captured the haplotype structure of the core data, were then genotyped by oligonucleotide ligation assay in an additional 2183 individuals from the same three populations (n=843, n=452, and n=888, respectively). All six sites varied in each of the larger "epidemiological" samples, and together, they defined 19 SNP haplotypes, seven with relative frequencies greater than 1% in the total sample; all of these common haplotypes had been identified earlier in the core re-sequencing survey. Here also, the African-American sample showed significantly lower SNP heterozygosity and haplotype diversity than the other two samples. The deficit of polymorphism is consistent with a population-specific non-neutral increase in the relative frequency of several haplotypes in Jackson.
Three Novel Haplotypes of Theileria bicornis in Black and White Rhinoceros in Kenya.

PubMed

Otiende, M Y; Kivata, M W; Jowers, M J; Makumi, J N; Runo, S; Obanda, V; Gakuya, F; Mutinda, M; Kariuki, L; Alasaad, S

2016-02-01

Piroplasms, especially those in the genera Babesia and Theileria, have been found to naturally infect rhinoceros. Due to natural or human-induced stress factors such as capture and translocations, animals often develop fatal clinical piroplasmosis, which causes death if not treated. This study examines the genetic diversity and occurrence of novel Theileria species infecting both black and white rhinoceros in Kenya. Samples collected opportunistically during routine translocations and clinical interventions from 15 rhinoceros were analysed by polymerase chain reaction (PCR) using a nested amplification of the small subunit ribosomal RNA (18S rRNA) gene fragments of Babesia and Theileria. Our study revealed for the first time in Kenya the presence of Theileria bicornis in white (Ceratotherium simum simum) and black (Diceros bicornis michaeli) rhinoceros and the existence of three new haplotypes: haplotypes H1 and H3 were present in white rhinoceros, while H2 was present in black rhinoceros. No specific haplotype was correlated to any specific geographical location. The Bayesian inference 50% consensus phylogram recovered the three haplotypes monophyleticly, and Theileria bicornis had very high support (BPP: 0.98). Furthermore, the genetic p-uncorrected distances and substitutions between T. bicornis and the three haplotypes were the same in all three haplotypes, indicating a very close genetic affinity. This is the first report of the occurrence of Theileria species in white and black rhinoceros from Kenya. The three new haplotypes reported here for the first time have important ecological and conservational implications, especially for population management and translocation programs and as a means of avoiding the transport of infected animals into non-affected areas. © 2014 Blackwell Verlag GmbH.
Intricacies in arrangement of SNP haplotypes suggest "Great Admixture" that created modern humans.

PubMed

Dutta, Rajib; Mainsah, Joseph; Yatskiv, Yuriy; Chakrabortty, Sharmistha; Brennan, Patrick; Khuder, Basil; Qiu, Shuhao; Fedorova, Larisa; Fedorov, Alexei

2017-06-05

Inferring history from genomic sequences is challenging and problematic because chromosomes are mosaics of thousands of small Identicalby-descent (IBD) fragments, each of them having their own unique story. However, the main events in recent evolution might be deciphered from comparative analysis of numerous loci. A paradox of why humans, whose effective population size is only 10 4 , have nearly three million frequent SNPs is formulated and examined. We studied 5398 loci evenly covering all human autosomes. Common haplotypes built from frequent SNPs that are present in people from various populations have been examined. We demonstrated highly non-random arrangement of alleles in common haplotypes. Abundance of mutually exclusive pairs of common haplotypes that have different alleles at every polymorphic position (so-called Yin/Yang haplotypes) was found in 56% of loci. A novel widely spread category of common haplotypes named Mosaic has been described. Mosaic consists of numerous pieces of Yin/Yang haplotypes and represents an ancestral stage of one of them. Scenarios of possible appearance of large number of frequent human SNPs and their habitual arrangement in Yin/Yang common haplotypes have been evaluated with an advanced genomic simulation algorithm. Computer modeling demonstrated that the observed arrangement of 2.9 million frequent SNPs could not originate from a sole stand-alone population. A "Great Admixture" event has been proposed that can explain peculiarities with frequent SNP distributions. This Great Admixture presumably occurred 100-300 thousand years ago between two ancestral populations that had been separated from each other about a million years ago. Our programs and algorithms can be applied to other species to perform evolutionary and comparative genomics.
Ultraaccurate genome sequencing and haplotyping of single human cells.

PubMed

Chu, Wai Keung; Edge, Peter; Lee, Ho Suk; Bansal, Vikas; Bafna, Vineet; Huang, Xiaohua; Zhang, Kun

2017-11-21

Accurate detection of variants and long-range haplotypes in genomes of single human cells remains very challenging. Common approaches require extensive in vitro amplification of genomes of individual cells using DNA polymerases and high-throughput short-read DNA sequencing. These approaches have two notable drawbacks. First, polymerase replication errors could generate tens of thousands of false-positive calls per genome. Second, relatively short sequence reads contain little to no haplotype information. Here we report a method, which is dubbed SISSOR (single-stranded sequencing using microfluidic reactors), for accurate single-cell genome sequencing and haplotyping. A microfluidic processor is used to separate the Watson and Crick strands of the double-stranded chromosomal DNA in a single cell and to randomly partition megabase-size DNA strands into multiple nanoliter compartments for amplification and construction of barcoded libraries for sequencing. The separation and partitioning of large single-stranded DNA fragments of the homologous chromosome pairs allows for the independent sequencing of each of the complementary and homologous strands. This enables the assembly of long haplotypes and reduction of sequence errors by using the redundant sequence information and haplotype-based error removal. We demonstrated the ability to sequence single-cell genomes with error rates as low as 10 -8 and average 500-kb-long DNA fragments that can be assembled into haplotype contigs with N50 greater than 7 Mb. The performance could be further improved with more uniform amplification and more accurate sequence alignment. The ability to obtain accurate genome sequences and haplotype information from single cells will enable applications of genome sequencing for diverse clinical needs. Copyright © 2017 the Author(s). Published by PNAS.
Population structure and phylogeography of Toda buffalo in Nilgiris throw light on possible origin of aboriginal Toda tribe of South India.

PubMed

Kathiravan, P; Kataria, R S; Mishra, B P; Dubey, P K; Sadana, D K; Joshi, B K

2011-08-01

We report the genetic structure and evolutionary relationship of the endangered Toda buffalo of Nilgiris in South India with Kanarese and two other riverine buffalo breeds. The upgma phylogeny drawn using Nei's distance grouped South Kanara and Toda buffaloes at a single node while Marathwada and Murrah together formed a separate node. Principal component analysis was performed with pairwise interindividual chord distances which revealed clustering of Murrah and Marathwada buffaloes distinctly, while individuals of Toda and South Kanara breeds completely intermingled with each other. Furthermore, there were highly significant group variances (p < 0.01) when the breeds were grouped based on phylogeny, thus revealing the existence of cryptic genetic structure within these buffalo breeds. To know the evolutionary relationship among these breeds, 537-bp D-loop region of mitochondrial DNA was analysed. The phylogenetic analysis of mtDNA haplotypes following NJ algorithm with Chinese swamp buffalo as outgroup revealed a major cluster that included haplotypes from all the four investigated breeds and two minor clusters formed by South Kanara and Toda haplotypes. Reduced median network analysis revealed haplotypes of South Kanara and Toda to be quite distinct from the commonly found haplotypes indicating that these might have been ancestral to all the present-day haplotypes. Few mutations in two of the haplotypes of South Kanara buffalo were found to have contributed to ancestral haplotypes of Toda buffalo suggesting the possible migration of buffaloes from Kanarese region towards Nilgiris along the Western Ghats. Considering the close social, economic and cultural association of Todas with their buffaloes, the present study supports the theory of migration of Toda tribe from Kanarese/Mysore region along with their buffaloes. © 2011 Blackwell Verlag GmbH.
Associations between mutations and a VNTR in the human phenylalanine hydroxylase gene

DOE Office of Scientific and Technical Information (OSTI.GOV)

Goltsov, A.A.; Eisensmith, R.C.; Woo, S.L.C.

1992-09-01

The HindIII RFLP in the human phenylalanine hydroxylase (PAH) gene is caused by the presence of an AT-rich (70%) minisatellite region. This region contains various multiples of 30-bp tandem repeats and is located 3 kb downstream of the final exon of the gene. PCR-mediated amplification of this region from haplotyped PAH chromosomes indicates that the previously reported 4.0-kb HindIII allele contains three of these repeats, while the 4.4-kb HindIII allele contains 12 of these repeats. The 4.2-kb HindIII fragment can contain six, seven, eight, or nine copies of this repeat. These variations permit more detailed analysis of mutant haplotypes 1,more » 5, 6, and, possibly, others. Kindred analysis in phenylketonuria families demonstrates Mendelian segregation of these VNTR alleles, as well as associations between theses alleles and certain PAH mutations. The R261Q mutation, associated with haplotype 1, is associated almost exclusively with an allele containing eight repeats; the R408W mutation, when occurring on a haplotype 1 background, may also be associated with the eight-repeat VNTR allele. Other PAH mutations associated with haplotype 1, R252W and P281L, do not appear to segregate with specific VNTR alleles. The IVS-10 mutation, when associated with haplotype 6, is associated exclusively with an allele containing seven repeats. The combined use of this VNTR system and the existing RFLP haplotype system will increase the performance of prenatal diagnostic tests based on haplotype analysis. In addition, this VNTR may prove useful in studies concerning the origins and distributions of PAH mutations in different human populations. 32 refs., 3 figs., 3 tabs.« less
Haplotype-Based Genome-Wide Prediction Models Exploit Local Epistatic Interactions Among Markers.

PubMed

Jiang, Yong; Schmidt, Renate H; Reif, Jochen C

2018-05-04

Genome-wide prediction approaches represent versatile tools for the analysis and prediction of complex traits. Mostly they rely on marker-based information, but scenarios have been reported in which models capitalizing on closely-linked markers that were combined into haplotypes outperformed marker-based models. Detailed comparisons were undertaken to reveal under which circumstances haplotype-based genome-wide prediction models are superior to marker-based models. Specifically, it was of interest to analyze whether and how haplotype-based models may take local epistatic effects between markers into account. Assuming that populations consisted of fully homozygous individuals, a marker-based model in which local epistatic effects inside haplotype blocks were exploited (LEGBLUP) was linearly transformable into a haplotype-based model (HGBLUP). This theoretical derivation formally revealed that haplotype-based genome-wide prediction models capitalize on local epistatic effects among markers. Simulation studies corroborated this finding. Due to its computational efficiency the HGBLUP model promises to be an interesting tool for studies in which ultra-high-density SNP data sets are studied. Applying the HGBLUP model to empirical data sets revealed higher prediction accuracies than for marker-based models for both traits studied using a mouse panel. In contrast, only a small subset of the traits analyzed in crop populations showed such a benefit. Cases in which higher prediction accuracies are observed for HGBLUP than for marker-based models are expected to be of immediate relevance for breeders, due to the tight linkage a beneficial haplotype will be preserved for many generations. In this respect the inheritance of local epistatic effects very much resembles the one of additive effects. Copyright © 2018 Jiang et al.
Diet History Questionnaire: Database Revision History

Cancer.gov

The following details all additions and revisions made to the DHQ nutrient and food database. This revision history is provided as a reference for investigators who may have performed analyses with a previous release of the database.
A Partnership for Public Health: USDA Branded Food Products Database

USDA-ARS?s Scientific Manuscript database

The importance of comprehensive food composition databases is more critical than ever in helping to address global food security. The USDA National Nutrient Database for Standard Reference is the “gold standard” for food composition databases. The presentation will include new developments in stren...
Molecular analysis and genetic diversity of Aedes albopictus (Diptera, Culicidae) from China.

PubMed

Ruiling, Zhang; Peien, Leng; Xuejun, Wang; Zhong, Zhang

2018-05-01

Aedes albopictus is one of the most invasive species, which can carry Dengue virus, Yellow fever virus and more than twenty arboviruses. Based on mitochondrial gene cytochrome c oxidase I (COI) and samples collected from 17 populations, we investigated the molecular character and genetic diversity of Ae. albopictus from China. Altogether, 25 haplotypes were detected, including 10 shared haplotypes and 15 private haplotypes. H1 was the dominant haplotype, which is widely distributed in 13 populations. Tajima'D value of most populations was significantly negative, demonstrating that populations experienced rapid range expansion recently. Most haplotypes clustered together both in phylogenetic and median-joining network analysis without clear phylogeographic patterns. However, neutrality tests revealed shallow divergences among Hainan and Guangxi with other populations (0.15599 ≤ F ST ≤ 0.75858), which probably due to interrupted gene flow, caused by geographical isolations. In conclusion, Ae. albopictus populations showed low genetic diversity in China.
HERC1 polymorphisms: population-specific variations in haplotype composition.

PubMed

Yuasa, Isao; Umetsu, Kazuo; Nishimukai, Hiroaki; Fukumori, Yasuo; Harihara, Shinji; Saitou, Naruya; Jin, Feng; Chattopadhyay, Prasanta K; Henke, Lotte; Henke, Jürgen

2009-08-01

Human HERC1 is one of six HERC proteins and may play an important role in intracellular membrane trafficking. The human HERC1 gene is suggested to have been affected by local positive selection. To assess the global frequency distributions of coding and non-coding single nucleotide polymorphisms (SNPs) in the HERC1 gene, we developed a new simultaneous genotyping method for four SNPs, and applied this method to investigate 1213 individuals from 12 global populations. The results confirmed remarked differences in the allele and haplotype frequencies between East Asian and non-East Asian populations. One of the three common haplotypes observed was found to be characteristic of East Asians, who showed a relatively uniform distribution of haplotypes. Information on haplotypes would be useful for testing the function of polymorphisms in the HERC1 gene. This is the first study to investigate the distribution of HERC1 polymorphisms in various populations. (c) 2009 John Wiley & Sons, Ltd.
Haplotypes identified by 10 DNA restriction fragment length polymorphisms at the human low density lipoprotein receptor gene locus.

PubMed Central

Kotze, M J; Langenhoven, E; Retief, A E; Seftel, H C; Henderson, H E; Weich, H F

1989-01-01

Ten useful two allele restriction fragment length polymorphisms of the low density lipoprotein receptor gene were used for haplotype analysis in 45 unrelated familial hypercholesterolaemic (FH) patients, 60 normal controls, and 32 FH homozygotes, all of whom were white Afrikaners. Pedigree analysis in 27 informative heterozygous FH and 23 normal families has shown the segregation of at least 17 haplotypes in the normal population (111 chromosomes) compared to a predominant association of two of these haplotypes with the disease in the FH subjects. This association was further confirmed in 32 FH homozygotes, indicating at least two 'founder' members for the disease in the Afrikaner population. Recombination events were not detected in any of the families studied and we thus conclude that the haplotypes associated with FH function as specific markers for the disease and will allow presymptomatic diagnosis in affected families. PMID:2565980

Some links on this page may take you to non-federal websites. Their policies may differ from this site.