Robino, C; Ralf, A; Pasino, S; De Marchi, M R; Ballantyne, K N; Barbaro, A; Bini, C; Carnevali, E; Casarino, L; Di Gaetano, C; Fabbri, M; Ferri, G; Giardina, E; Gonzalez, A; Matullo, G; Nutini, A L; Onofri, V; Piccinini, A; Piglionica, M; Ponzano, E; Previderè, C; Resta, N; Scarnicci, F; Seidita, G; Sorçaburu-Cigliero, S; Turrina, S; Verzeletti, A; Kayser, M
2015-03-01
Recently introduced rapidly mutating Y-chromosomal short tandem repeat (RM Y-STR) loci, displaying a multiple-fold higher mutation rate relative to any other Y-STRs, including those conventionally used in forensic casework, have been demonstrated to improve the resolution of male lineage differentiation and to allow male relative separation usually impossible with standard Y-STRs. However, large and geographically-detailed frequency haplotype databases are required to estimate the statistical weight of RM Y-STR haplotype matches if observed in forensic casework. With this in mind, the Italian Working Group (GEFI) of the International Society for Forensic Genetics launched a collaborative exercise aimed at generating an Italian quality controlled forensic RM Y-STR haplotype database. Overall 1509 male individuals from 13 regional populations covering northern, central and southern areas of the Italian peninsula plus Sicily were collected, including both "rural" and "urban" samples classified according to population density in the sampling area. A subset of individuals was additionally genotyped for Y-STR loci included in the Yfiler and PowerPlex Y23 (PPY23) systems (75% and 62%, respectively), allowing the comparison of RM and conventional Y-STRs. Considering the whole set of 13 RM Y-STRs, 1501 unique haplotypes were observed among the 1509 sampled Italian men with a haplotype diversity of 0.999996, largely superior to Yfiler and PPY23 with 0.999914 and 0.999950, respectively. AMOVA indicated that 99.996% of the haplotype variation was within populations, confirming that genetic-geographic structure is almost undetected by RM Y-STRs. Haplotype sharing among regional Italian populations was not observed at all with the complete set of 13 RM Y-STRs. Haplotype sharing within Italian populations was very rare (0.27% non-unique haplotypes), and lower in urban (0.22%) than rural (0.29%) areas. Additionally, 422 father-son pairs were investigated, and 20.1% of them could
Investigation of extended Y chromosome STR haplotypes in Sardinia.
Lacerenza, D; Aneli, S; Di Gaetano, C; Critelli, R; Piazza, A; Matullo, G; Culigioni, C; Robledo, R; Robino, C; Calò, C
2017-03-01
Y-chromosomal variation of selected single nucleotide polymorphisms (SNPs) and 32 short tandem repeat (STR) loci was evaluated in Sardinia in three open population groups (Northern Sardinia, n=40; Central Sardinia, n=56; Southern Sardinia, n=91) and three isolates (Desulo, n=34; Benetutti, n=45, Carloforte, n=42). The tested Y-STRs consisted of Yfiler ® Plus markers and the seven rapidly mutating (RM) loci not included in the YFiler ® Plus kit (DYF399S1, DYF403S1ab, DYF404S1, DYS526ab, DYS547, DYS612, and DYS626). As expected, inclusion of additional Y-STR loci increased haplotype diversity (h), though complete differentiation of male lineages was impossible even by means of RM Y-STRs (h=0.99997). Analysis of molecular variance indicated that the three open populations were fairly homogeneous, whereas signs of genetic heterogeneity could be detected when the three isolates were also included in the analysis. Multidimensional scaling analysis showed that, even for extended haplotypes including RM Y-STR markers, Sardinians were clearly differentiated from populations of the Italian peninsula and Sicily. The only exception was represented by the Carloforte sample that, in accordance with its peculiar population history, clustered with Northern/Central Italian populations. The introduction of extended forensic Y-STR panels, including highly variable RM Y-STR markers, is expected to reduce the impact of population structure on haplotype frequency estimations. However, our results show that the availability of geographically detailed reference databases is still important for the assessment of the evidential value of a Y-haplotype match. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Haplotype analysis of the polymorphic 40 Y-STR markers in Chinese populations.
Ou, Xueling; Wang, Ying; Liu, Chao; Yang, Donggui; Zhang, Chuchu; Deng, Shujiao; Sun, Hongyu
2015-11-01
Forty Y-STR loci were analyzed in 1128 males from the following six Chinese ethnic populations: Han (n=300), Hui (n=244), Korean (n=100), Mongolian (n=100), Uighur (n=284) and Tibetan (n=100), utilizing two new generation multiplex Y-STR systems, AGCU Y24 STR and GFS Y24 STR genotyping kits, which allow for the genotyping of 24 loci from a single amplification reaction in each system. The lowest estimates of genetic diversity (below 0.5) correspond to markers DYS391 (0.441658) and DYS437 (0.496977), and the greatest diversity corresponds to markers DYS385a/b (0.969919) and DYS527a/b (0.94676). A considerable number of duplicate and off-ladder alleles were also revealed. Additionally, there were 1111 different haplotypes identified from the total 1128 samples, of which 1095 were unique. Notably, no shared haplotypes between populations were observed. The estimated overall haplotype diversity (HD) was 0.999085, and its discrimination capacity (DC) was 0.970745. An MDS plot based on the genetic distances between populations showed the genetic similarity of the southern Han population to the Northern populations of Hui, Korean, Mongolian and Uighur and a clear genetic departure of the Tibetan population from other populations. For the Y STR markers, population substructure correction was considered when calculating the rarity of the Y STR profile. However, because the haplotype based Fst values are extremely small within the present data (0.000153 with 40 Y-STRs), no substructure correction is required to estimate the rarity of a haplotype comprising 40 markers. In summary, the results of our study indicate that the 40 Y-STRs have a high level of polymorphism in Chinese ethnic groups and could therefore be a powerful tool for forensic applications and population genetic studies. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Cluster analysis of European Y-chromosomal STR haplotypes using the discrete Laplace method.
Andersen, Mikkel Meyer; Eriksen, Poul Svante; Morling, Niels
2014-07-01
The European Y-chromosomal short tandem repeat (STR) haplotype distribution has previously been analysed in various ways. Here, we introduce a new way of analysing population substructure using a new method based on clustering within the discrete Laplace exponential family that models the probability distribution of the Y-STR haplotypes. Creating a consistent statistical model of the haplotypes enables us to perform a wide range of analyses. Previously, haplotype frequency estimation using the discrete Laplace method has been validated. In this paper we investigate how the discrete Laplace method can be used for cluster analysis to further validate the discrete Laplace method. A very important practical fact is that the calculations can be performed on a normal computer. We identified two sub-clusters of the Eastern and Western European Y-STR haplotypes similar to results of previous studies. We also compared pairwise distances (between geographically separated samples) with those obtained using the AMOVA method and found good agreement. Further analyses that are impossible with AMOVA were made using the discrete Laplace method: analysis of the homogeneity in two different ways and calculating marginal STR distributions. We found that the Y-STR haplotypes from e.g. Finland were relatively homogeneous as opposed to the relatively heterogeneous Y-STR haplotypes from e.g. Lublin, Eastern Poland and Berlin, Germany. We demonstrated that the observed distributions of alleles at each locus were similar to the expected ones. We also compared pairwise distances between geographically separated samples from Africa with those obtained using the AMOVA method and found good agreement. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Núñez, Carolina; Baeta, Miriam; Ibarbia, Nerea; Ortueta, Urko; Jiménez-Moreno, Susana; Blazquez-Caeiro, José Luis; Builes, Juan José; Herrera, Rene J; Martínez-Jarreta, Begoña; de Pancorbo, Marian M
2017-04-01
A Y-STR multiplex system has been developed with the purpose of complementing the widely used 17 Y-STR haplotyping (AmpFlSTR Y Filer® PCR Amplification kit) routinely employed in forensic and population genetic studies. This new multiplex system includes six additional STR loci (DYS576, DYS481, DYS549, DYS533, DYS570, and DYS643) to reach the 23 Y-STR of the PowerPlex® Y23 System. In addition, this kit includes the DYS456 and DYS385 loci for traceability purposes. Male samples from 625 individuals from ten worldwide populations were genotyped, including three sample sets from populations previously published with the 17 Y-STR system to expand their current data. Validation studies demonstrated good performance of the panel set in terms of concordance, sensitivity, and stability in the presence of inhibitors and artificially degraded DNA. The results obtained for haplotype diversity and discrimination capacity with this multiplex system were considerably high, providing further evidences of the suitability of this novel Y-STR system for forensic purposes. Thus, the use of this multiplex for samples previously genotyped with 17 Y-STRs will be an efficient and low-cost alternative to complete the set of 23 Y-STRs and improve allele databases for population and forensic purposes. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Chukhryaeva, M I; Ivanov, I O; Frolova, S A; Koshel, S M; Utevska, O M; Skhalyakho, R A; Agdzhoyan, A T; Bogunov, Yu V; Balanovska, E V; Balanovsky, O P
2016-05-01
STR haplotypes of the Y chromosome are widely used as effective genetic markers in studies of human populations and in forensic DNA analysis. The task often arises to compare the spectrum of haplotypes in individuals or entire populations. Performing this task manually is too laborious and thus unrealistic. We propose an algorithm for counting similarity between STR haplotypes. This algorithm is suitable for massive analyses of samples. It is implemented in the computer program Haplomatch, which makes it possible to find haplotypes that differ from the target haplotype by 0, 1, 2, 3, or more mutational steps. The program may operate in two modes: comparison of individuals and comparison of populations. Flexibility of the program (the possibility of using any external database), its usability (MS Excel spreadsheets are used), and the capability of being applied to other chromosomes and other species could make this software a new useful tool in population genetics and forensic and genealogical studies. The Haplomatch software is freely available on our website www.genofond.ru. The program is applied to studying the gene pool of Cossacks. Experimental analysis of Y-chromosomal diversity in a representative set (N = 131) of Upper Don Cossacks is performed. Analysis of the STR haplotypes detects genetic proximity of Cossacks to East Slavic populations (in particular, to Southern and Central Russians, as well as to Ukrainians), which confirms the hypothesis of the origin of the Cossacks mainly due to immigration from Russia and Ukraine. Also, a small genetic influence of Turkicspeaking Nogais is found, probably caused by their occurrence in the Don Voisko as part of the Tatar layer. No similarities between haplotype spectra of Cossacks and Caucasus populations are found. This case study demonstrates the effectiveness of the Haplomatch software in analyzing large sets of STR haplotypes.
Interpretation guidelines of a standard Y-chromosome STR 17-plex PCR-CE assay for crime casework.
Roewer, Lutz; Geppert, Maria
2012-01-01
Y-STR analysis is an invaluable tool to examine evidence in sexual assault cases and in other forensic casework. Unambiguous detection of the male component in DNA mixtures with a high female background is still the main field of application of forensic Y-STR haplotyping. In the last years, powerful technologies including a 17-locus multiplex PCR assay have been introduced in the forensic laboratories. At the same time, statistical methods have been developed and adapted for interpretation of a nonrecombining, linear marker as the Y-chromosome which shows a strongly clustered geographical distribution due to the linear inheritance and the patrilocality of ancestral groups. Large population databases, namely the Y-STR Haplotype Reference Database (YHRD), have been established to assess the evidentiary value of Y-STR matches by means of frequency estimation methods (counting and extrapolation).
Y chromosome STR typing in crime casework.
Roewer, Lutz
2009-01-01
Since the beginning of the nineties the field of forensic Y chromosome analysis has been successfully developed to become commonplace in laboratories working in crime casework all over the world. The ability to identify male-specific DNA renders highly variable Y-chromosomal polymorphisms, the STR sequences, an invaluable addition to the standard panel of autosomal loci used in forensic genetics. The male-specificity makes the Y chromosome especially useful in cases of male/female cell admixture, namely in sexual assault cases. On the other hand, the haploidy and patrilineal inheritance complicates the interpretation of a Y-STR match, because male relatives share for several generations an identical Y-STR profile. Since paternal relatives tend to live in the geographic and cultural territory of their ancestors, the Y chromosome analysis has a potential to make inferences on the population of origin of a given DNA profile. This review addresses the fields of application of Y chromosome haplotyping, the interpretation of results, databasing efforts and population genetics aspects.
17 Y-STR haplotype diversity in São Paulo state (southeast of Brazil).
de Souza, Leandro Fonseca; da Motta, Carlos Henrique Ares Silveira; Moura-Neto, Rodrigo Soares
2018-03-12
A sample of 158 Brazilian males from São Paulo (SP), Brazilian southeast, was typed for 17 Y-STR loci (DYS19, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS437, DYS438, DYS439, DYS448, DYS456, DYS458, DYS635, YGATA_H4.1, and DYS385ab). A total of 158 haplotypes were identified, of which all were unique. The haplotype diversity and discrimination capacity were calculated in 1.0 and the genetic diversity was 67.4%. Pairwise haplotype distances showed that the São Paulo population is not significantly different from Rio de Janeiro and Portugal, but is different from African and Native American.
[Development of Chinese forensic Y-STR DNA database].
Ge, Jian-Ye; Yan, Jiang-Wei; Xie, Qun; Sun, Hong-Yu; Zhou, Huai-Gu; Li, Bin
2013-06-01
Y chromosome is a male-specific paternal inherited chromosome. The STR markers on Y chromosome have been widely used in forensic practices. This article summarizes the characteristics of Y-STR and some factors are considered of selecting appropriate Y-STR markers for Chinese population. The prospects of existing and potential forensic applications of Y-STR profiles are discussed including familial excluding, familial searching, crowd source deducing, mixture sample testing, and kinship identifying. The research, development, verification of Y-STR kit, Y-STR mutation rate, and search software are explored and some suggestions are given.
Haplotype data for 23 Y-chromosome markers in a reference sample from Bosnia and Herzegovina
Kovačević, Lejla; Fatur-Cerić, Vera; Hadžić, Negra; Čakar, Jasmina; Primorac, Dragan; Marjanović, Damir
2013-01-01
Aim To detect polymorphisms of 23 Y-chromosomal short tandem repeat (STR) loci, including 6 new loci, in a reference database of male population of Bosnia and Herzegovina, as well as to assess the importance of increasing the number of Y-STR loci utilized in forensic DNA analysis. Methods The reference sample consisted of 100 healthy, unrelated men originating from Bosnia and Herzegovina. Sample collection using buccal swabs was performed in all geographical regions of Bosnia and Herzegovina in the period from 2010 to 2011. DNA samples were typed for 23 Y STR loci, including 6 new loci: DYS576, DYS481, DYS549, DYS533, DYS570, and DYS643, which are included in the new PowerPlex® Y 23 amplification kit. Results The absolute frequency of generated haplotypes was calculated and results showed that 98 samples had unique Y 23 haplotypes, and that only two samples shared the same haplotype. The most polymorphic locus was DYS418, with 14 detected alleles and the least polymorphic loci were DYS389I, DYS391, DYS437, and DYS393. Conclusion This study showed that by increasing the number of highly polymorphic Y STR markers, to include those tested in our analysis, leads to a reduction of repeating haplotypes, which is very important in the application of forensic DNA analysis. PMID:23771760
Haplotype data for 23 Y-chromosome markers in a reference sample from Bosnia and Herzegovina.
Kovačević, Lejla; Fatur-Cerić, Vera; Hadzic, Negra; Čakar, Jasmina; Primorac, Dragan; Marjanović, Damir
2013-06-01
To detect polymorphisms of 23 Y-chromosomal short tandem repeat (STR) loci, including 6 new loci, in a reference database of male population of Bosnia and Herzegovina, as well as to assess the importance of increasing the number of Y-STR loci utilized in forensic DNA analysis. The reference sample consisted of 100 healthy, unrelated men originating from Bosnia and Herzegovina. Sample collection using buccal swabs was performed in all geographical regions of Bosnia and Herzegovina in the period from 2010 to 2011. DNA samples were typed for 23 Y STR loci, including 6 new loci: DYS576, DYS481, DYS549, DYS533, DYS570, and DYS643, which are included in the new PowerPlex® Y 23 amplification kit. The absolute frequency of generated haplotypes was calculated and results showed that 98 samples had unique Y 23 haplotypes, and that only two samples shared the same haplotype. The most polymorphic locus was DYS418, with 14 detected alleles and the least polymorphic loci were DYS389I, DYS391, DYS437, and DYS393. This study showed that by increasing the number of highly polymorphic Y STR markers, to include those tested in our analysis, leads to a reduction of repeating haplotypes, which is very important in the application of forensic DNA analysis.
A global analysis of Y-chromosomal haplotype diversity for 23 STR loci
Purps, Josephine; Siegert, Sabine; Willuweit, Sascha; Nagy, Marion; Alves, Cíntia; Salazar, Renato; Angustia, Sheila M.T.; Santos, Lorna H.; Anslinger, Katja; Bayer, Birgit; Ayub, Qasim; Wei, Wei; Xue, Yali; Tyler-Smith, Chris; Bafalluy, Miriam Baeta; Martínez-Jarreta, Begoña; Egyed, Balazs; Balitzki, Beate; Tschumi, Sibylle; Ballard, David; Court, Denise Syndercombe; Barrantes, Xinia; Bäßler, Gerhard; Wiest, Tina; Berger, Burkhard; Niederstätter, Harald; Parson, Walther; Davis, Carey; Budowle, Bruce; Burri, Helen; Borer, Urs; Koller, Christoph; Carvalho, Elizeu F.; Domingues, Patricia M.; Chamoun, Wafaa Takash; Coble, Michael D.; Hill, Carolyn R.; Corach, Daniel; Caputo, Mariela; D’Amato, Maria E.; Davison, Sean; Decorte, Ronny; Larmuseau, Maarten H.D.; Ottoni, Claudio; Rickards, Olga; Lu, Di; Jiang, Chengtao; Dobosz, Tadeusz; Jonkisz, Anna; Frank, William E.; Furac, Ivana; Gehrig, Christian; Castella, Vincent; Grskovic, Branka; Haas, Cordula; Wobst, Jana; Hadzic, Gavrilo; Drobnic, Katja; Honda, Katsuya; Hou, Yiping; Zhou, Di; Li, Yan; Hu, Shengping; Chen, Shenglan; Immel, Uta-Dorothee; Lessig, Rüdiger; Jakovski, Zlatko; Ilievska, Tanja; Klann, Anja E.; García, Cristina Cano; de Knijff, Peter; Kraaijenbrink, Thirsa; Kondili, Aikaterini; Miniati, Penelope; Vouropoulou, Maria; Kovacevic, Lejla; Marjanovic, Damir; Lindner, Iris; Mansour, Issam; Al-Azem, Mouayyad; Andari, Ansar El; Marino, Miguel; Furfuro, Sandra; Locarno, Laura; Martín, Pablo; Luque, Gracia M.; Alonso, Antonio; Miranda, Luís Souto; Moreira, Helena; Mizuno, Natsuko; Iwashima, Yasuki; Neto, Rodrigo S. Moura; Nogueira, Tatiana L.S.; Silva, Rosane; Nastainczyk-Wulf, Marina; Edelmann, Jeanett; Kohl, Michael; Nie, Shengjie; Wang, Xianping; Cheng, Baowen; Núñez, Carolina; Pancorbo, Marian Martínez de; Olofsson, Jill K.; Morling, Niels; Onofri, Valerio; Tagliabracci, Adriano; Pamjav, Horolma; Volgyi, Antonia; Barany, Gusztav; Pawlowski, Ryszard; Maciejewska, Agnieszka; Pelotti, Susi; Pepinski, Witold; Abreu-Glowacka, Monica; Phillips, Christopher; Cárdenas, Jorge; Rey-Gonzalez, Danel; Salas, Antonio; Brisighelli, Francesca; Capelli, Cristian; Toscanini, Ulises; Piccinini, Andrea; Piglionica, Marilidia; Baldassarra, Stefania L.; Ploski, Rafal; Konarzewska, Magdalena; Jastrzebska, Emila; Robino, Carlo; Sajantila, Antti; Palo, Jukka U.; Guevara, Evelyn; Salvador, Jazelyn; Ungria, Maria Corazon De; Rodriguez, Jae Joseph Russell; Schmidt, Ulrike; Schlauderer, Nicola; Saukko, Pekka; Schneider, Peter M.; Sirker, Miriam; Shin, Kyoung-Jin; Oh, Yu Na; Skitsa, Iulia; Ampati, Alexandra; Smith, Tobi-Gail; Calvit, Lina Solis de; Stenzl, Vlastimil; Capal, Thomas; Tillmar, Andreas; Nilsson, Helena; Turrina, Stefania; De Leo, Domenico; Verzeletti, Andrea; Cortellini, Venusia; Wetton, Jon H.; Gwynne, Gareth M.; Jobling, Mark A.; Whittle, Martin R.; Sumita, Denilce R.; Wolańska-Nowak, Paulina; Yong, Rita Y.Y.; Krawczak, Michael; Nothnagel, Michael; Roewer, Lutz
2014-01-01
In a worldwide collaborative effort, 19,630 Y-chromosomes were sampled from 129 different populations in 51 countries. These chromosomes were typed for 23 short-tandem repeat (STR) loci (DYS19, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS385ab, DYS437, DYS438, DYS439, DYS448, DYS456, DYS458, DYS635, GATAH4, DYS481, DYS533, DYS549, DYS570, DYS576, and DYS643) and using the PowerPlex Y23 System (PPY23, Promega Corporation, Madison, WI). Locus-specific allelic spectra of these markers were determined and a consistently high level of allelic diversity was observed. A considerable number of null, duplicate and off-ladder alleles were revealed. Standard single-locus and haplotype-based parameters were calculated and compared between subsets of Y-STR markers established for forensic casework. The PPY23 marker set provides substantially stronger discriminatory power than other available kits but at the same time reveals the same general patterns of population structure as other marker sets. A strong correlation was observed between the number of Y-STRs included in a marker set and some of the forensic parameters under study. Interestingly a weak but consistent trend toward smaller genetic distances resulting from larger numbers of markers became apparent. PMID:24854874
A global analysis of Y-chromosomal haplotype diversity for 23 STR loci.
Purps, Josephine; Siegert, Sabine; Willuweit, Sascha; Nagy, Marion; Alves, Cíntia; Salazar, Renato; Angustia, Sheila M T; Santos, Lorna H; Anslinger, Katja; Bayer, Birgit; Ayub, Qasim; Wei, Wei; Xue, Yali; Tyler-Smith, Chris; Bafalluy, Miriam Baeta; Martínez-Jarreta, Begoña; Egyed, Balazs; Balitzki, Beate; Tschumi, Sibylle; Ballard, David; Court, Denise Syndercombe; Barrantes, Xinia; Bäßler, Gerhard; Wiest, Tina; Berger, Burkhard; Niederstätter, Harald; Parson, Walther; Davis, Carey; Budowle, Bruce; Burri, Helen; Borer, Urs; Koller, Christoph; Carvalho, Elizeu F; Domingues, Patricia M; Chamoun, Wafaa Takash; Coble, Michael D; Hill, Carolyn R; Corach, Daniel; Caputo, Mariela; D'Amato, Maria E; Davison, Sean; Decorte, Ronny; Larmuseau, Maarten H D; Ottoni, Claudio; Rickards, Olga; Lu, Di; Jiang, Chengtao; Dobosz, Tadeusz; Jonkisz, Anna; Frank, William E; Furac, Ivana; Gehrig, Christian; Castella, Vincent; Grskovic, Branka; Haas, Cordula; Wobst, Jana; Hadzic, Gavrilo; Drobnic, Katja; Honda, Katsuya; Hou, Yiping; Zhou, Di; Li, Yan; Hu, Shengping; Chen, Shenglan; Immel, Uta-Dorothee; Lessig, Rüdiger; Jakovski, Zlatko; Ilievska, Tanja; Klann, Anja E; García, Cristina Cano; de Knijff, Peter; Kraaijenbrink, Thirsa; Kondili, Aikaterini; Miniati, Penelope; Vouropoulou, Maria; Kovacevic, Lejla; Marjanovic, Damir; Lindner, Iris; Mansour, Issam; Al-Azem, Mouayyad; Andari, Ansar El; Marino, Miguel; Furfuro, Sandra; Locarno, Laura; Martín, Pablo; Luque, Gracia M; Alonso, Antonio; Miranda, Luís Souto; Moreira, Helena; Mizuno, Natsuko; Iwashima, Yasuki; Neto, Rodrigo S Moura; Nogueira, Tatiana L S; Silva, Rosane; Nastainczyk-Wulf, Marina; Edelmann, Jeanett; Kohl, Michael; Nie, Shengjie; Wang, Xianping; Cheng, Baowen; Núñez, Carolina; Pancorbo, Marian Martínez de; Olofsson, Jill K; Morling, Niels; Onofri, Valerio; Tagliabracci, Adriano; Pamjav, Horolma; Volgyi, Antonia; Barany, Gusztav; Pawlowski, Ryszard; Maciejewska, Agnieszka; Pelotti, Susi; Pepinski, Witold; Abreu-Glowacka, Monica; Phillips, Christopher; Cárdenas, Jorge; Rey-Gonzalez, Danel; Salas, Antonio; Brisighelli, Francesca; Capelli, Cristian; Toscanini, Ulises; Piccinini, Andrea; Piglionica, Marilidia; Baldassarra, Stefania L; Ploski, Rafal; Konarzewska, Magdalena; Jastrzebska, Emila; Robino, Carlo; Sajantila, Antti; Palo, Jukka U; Guevara, Evelyn; Salvador, Jazelyn; Ungria, Maria Corazon De; Rodriguez, Jae Joseph Russell; Schmidt, Ulrike; Schlauderer, Nicola; Saukko, Pekka; Schneider, Peter M; Sirker, Miriam; Shin, Kyoung-Jin; Oh, Yu Na; Skitsa, Iulia; Ampati, Alexandra; Smith, Tobi-Gail; Calvit, Lina Solis de; Stenzl, Vlastimil; Capal, Thomas; Tillmar, Andreas; Nilsson, Helena; Turrina, Stefania; De Leo, Domenico; Verzeletti, Andrea; Cortellini, Venusia; Wetton, Jon H; Gwynne, Gareth M; Jobling, Mark A; Whittle, Martin R; Sumita, Denilce R; Wolańska-Nowak, Paulina; Yong, Rita Y Y; Krawczak, Michael; Nothnagel, Michael; Roewer, Lutz
2014-09-01
In a worldwide collaborative effort, 19,630 Y-chromosomes were sampled from 129 different populations in 51 countries. These chromosomes were typed for 23 short-tandem repeat (STR) loci (DYS19, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS385ab, DYS437, DYS438, DYS439, DYS448, DYS456, DYS458, DYS635, GATAH4, DYS481, DYS533, DYS549, DYS570, DYS576, and DYS643) and using the PowerPlex Y23 System (PPY23, Promega Corporation, Madison, WI). Locus-specific allelic spectra of these markers were determined and a consistently high level of allelic diversity was observed. A considerable number of null, duplicate and off-ladder alleles were revealed. Standard single-locus and haplotype-based parameters were calculated and compared between subsets of Y-STR markers established for forensic casework. The PPY23 marker set provides substantially stronger discriminatory power than other available kits but at the same time reveals the same general patterns of population structure as other marker sets. A strong correlation was observed between the number of Y-STRs included in a marker set and some of the forensic parameters under study. Interestingly a weak but consistent trend toward smaller genetic distances resulting from larger numbers of markers became apparent. Copyright © 2014 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.
Using probabilistic theory to develop interpretation guidelines for Y-STR profiles.
Taylor, Duncan; Bright, Jo-Anne; Buckleton, John
2016-03-01
Y-STR profiling makes up a small but important proportion of forensic DNA casework. Often Y-STR profiles are used when autosomal profiling has failed to yield an informative result. Consequently Y-STR profiles are often from the most challenging samples. In addition to these points, Y-STR loci are linked, meaning that evaluation of haplotype probabilities are either based on overly simplified counting methods or computationally costly genetic models, neither of which extend well to the evaluation of mixed Y-STR data. For all of these reasons Y-STR data analysis has not seen the same advances as autosomal STR data. We present here a probabilistic model for the interpretation of Y-STR data. Due to the fact that probabilistic systems for Y-STR data are still some way from reaching active casework, we also describe how data can be analysed in a continuous way to generate interpretational thresholds and guidelines. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
A comprehensive Y-STR portrait of Yousafzai's population.
Tabassum, Sadia; Ilyas, Muhammad; Ullah, Inam; Israr, Muhammad; Ahmad, Habib
2017-09-01
In the current study, 17 Y-Chromosomal short tandem repeats (Y-STRs) included in theAmpFlSTR Y-Filer amplification kit (Applied Biosystems, Foster City, USA) were investigated in 146 unrelated Yousafzai males residing in the Khyber Pakhtunkhwa Province of Pakistan. A total of 94 (89.52%) unique haplotypes were observed. Discrimination capacity was 71.92%. Haplotype diversity ranged from 0.354 (DYS456) to 0.663 (DYS458). Both Rst pairwise analysis and multidimensional scaling plot showed that the genetic structure of the Yousafzais is significantly different from neighbouring populations.
Estimating trace-suspect match probabilities for singleton Y-STR haplotypes using coalescent theory.
Andersen, Mikkel Meyer; Caliebe, Amke; Jochens, Arne; Willuweit, Sascha; Krawczak, Michael
2013-02-01
Estimation of match probabilities for singleton haplotypes of lineage markers, i.e. for haplotypes observed only once in a reference database augmented by a suspect profile, is an important problem in forensic genetics. We compared the performance of four estimators of singleton match probabilities for Y-STRs, namely the count estimate, both with and without Brenner's so-called 'kappa correction', the surveying estimate, and a previously proposed, but rarely used, coalescent-based approach implemented in the BATWING software. Extensive simulation with BATWING of the underlying population history, haplotype evolution and subsequent database sampling revealed that the coalescent-based approach is characterized by lower bias and lower mean squared error than the uncorrected count estimator and the surveying estimator. Moreover, in contrast to the two count estimators, both the surveying and the coalescent-based approach exhibited a good correlation between the estimated and true match probabilities. However, although its overall performance is thus better than that of any other recognized method, the coalescent-based estimator is still computation-intense on the verge of general impracticability. Its application in forensic practice therefore will have to be limited to small reference databases, or to isolated cases of particular interest, until more powerful algorithms for coalescent simulation have become available. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
Identifying the most likely contributors to a Y-STR mixture using the discrete Laplace method.
Andersen, Mikkel Meyer; Eriksen, Poul Svante; Mogensen, Helle Smidt; Morling, Niels
2015-03-01
In some crime cases, the male part of the DNA in a stain can only be analysed using Y chromosomal markers, e.g. Y-STRs. This may be the case in e.g. rape cases, where the male components can only be detected as Y-STR profiles, because the fraction of male DNA is much smaller than that of female DNA, which can mask the male results when autosomal STRs are investigated. Sometimes, mixtures of Y-STRs are observed, e.g. in rape cases with multiple offenders. In such cases, Y-STR mixture analysis is required, e.g. by mixture deconvolution, to deduce the most likely DNA profiles from the contributors. We demonstrate how the discrete Laplace method can be used to separate a two person Y-STR mixture, where the Y-STR profiles of the true contributors are not present in the reference dataset, which is often the case for Y-STR profiles in real case work. We also briefly discuss how to calculate the weight of the evidence using the likelihood ratio principle when a suspect's Y-STR profile fits into a two person mixture. We used three datasets with between 7 and 21 Y-STR loci: Denmark (n=181), Somalia (n=201) and Germany (n=3443). The Danish dataset with 21 loci was truncated to 15 and 10 loci to examine the effect of the number of loci. For each of these datasets, an out of sample simulation study was performed: A total of 550 mixtures were composed by randomly sampling two haplotypes, h1 and h2, from the dataset. We then used the discrete Laplace method on the remaining data (excluding h1 and h2) to rank the contributor pairs by the product of the contributors' estimated haplotype frequencies. Successful separation of mixtures (defined by the observation that the true contributor pair was among the 10 most likely contributor pairs) was found in 42-52% of the cases for 21 loci, 69-75% for 15 loci and 92-99% for 10 loci or less depending on the dataset and how the discrete Laplace model was chosen. Y-STR mixtures with many loci are difficult to separate, but even haplotypes
A Y-chromosome STR marker should be added to commercial multiplex STR kits.
Oz, Carla; Zaken, Neomi; Amiel, Merav; Zamir, Ashira
2008-07-01
Autosomal short tandem repeat (STR) analysis has become highly relevant in the identification of victims from mass disasters and terrorist attacks. In such events, gender misidentification can be of grave consequences, yet the list reporting amelogenin amplification failure using STR multiplex kits continues to grow. Presented here are three such examples. In the first case, we present two male suspects who demonstrated amelogenin Y-deficient results using two commercial kit procedures. The presence of their Y chromosomes was proven by obtaining a Y-haplotype. The second case demonstrated a profile from a third male suspect where only the Y homolog of the XY pair was amplified. In events such as mass disasters or terrorist attacks, timely and reliable high throughput DNA typing results are essential. As the number of reported cases of amplification failure at the amelogenin gene continues to grow, we suggest that the incorporation of a better gender identification tool in commercial kits is crucial.
Yang, Chun; Zhang, Jianqiu
2017-01-01
In this study, we analyzed the genetic polymorphisms of 23 Y-STR loci from PowerPlex® Y23 system in 916 unrelated healthy male individuals from Chinese Jiangsu Han, and observed 912 different haplotypes including 908 unique haplotypes and 4 duplicate haplotypes. The haplotype diversity reached 0.99999 and the discrimination capacity and match probability were 0.9956 and 0.0011, respectively. The gene diversity values ranged from 0.3942 at DYS438 to 0.9607 at DYS385a/b. Population differentiation within 10 Jiangsu Han subpopulations were evaluated by RST values and visualized in Neighbor-Joining trees and Multi-Dimensional Scaling plots as well as population relationships between the Jiangsu Han population and other 18 Eastern Asian populations. Such results indicated that the 23 Y-STR loci were highly polymorphic in Jiangsu Han population and played crucial roles in forensic application as well as population genetics. For the first time, we reported the genetic diversity of male lineages in Jiangsu Han population at a high-resolution level of 23 Y-STR set and consequently contributed to familial searching, offender tracking, and anthropology analysis of Jiangsu Han population. PMID:28704439
Next Generation Sequencing Plus (NGS+) with Y-chromosomal Markers for Forensic Pedigree Searches.
Qian, Xiaoqin; Hou, Jiayi; Wang, Zheng; Ye, Yi; Lang, Min; Gao, Tianzhen; Liu, Jing; Hou, Yiping
2017-09-12
There is high demand for forensic pedigree searches with Y-chromosome short tandem repeat (Y-STR) profiling in large-scale crime investigations. However, when two Y-STR haplotypes have a few mismatched loci, it is difficult to determine if they are from the same male lineage because of the high mutation rate of Y-STRs. Here we design a new strategy to handle cases in which none of pedigree samples shares identical Y-STR haplotype. We combine next generation sequencing (NGS), capillary electrophoresis and pyrosequencing under the term 'NGS+' for typing Y-STRs and Y-chromosomal single nucleotide polymorphisms (Y-SNPs). The high-resolution Y-SNP haplogroup and Y-STR haplotype can be obtained with NGS+. We further developed a new data-driven decision rule, FSindex, for estimating the likelihood for each retrieved pedigree. Our approach enables positive identification of pedigree from mismatched Y-STR haplotypes. It is envisaged that NGS+ will revolutionize forensic pedigree searches, especially when the person of interest was not recorded in forensic DNA database.
[Genetic Polymorphisms of 26 Y-STR Loci in Fujian She Nationality and Its Forensic Application].
Bian, Ying-nan; Siyit, Tele T; Zhu, Ru-xin; Zhao, Qi; Zhang Su-hua
2015-08-01
To study the forensic application of Goldeneye DNA ID 26Y Kit in the She nationality. Through capillary electrophoresis, the genotype of 26 Y-STR loci were analyzed in 53 unrelated male individuals from Fujian She nationality. The population genetics parameters such as allele frequency and haplotype diversity were calculated. The comparisons among the She nationality and the other nationalities were analyzed. A total of 126 alleles were observed on the 26 Y-STR loci of 53 unrelated male individuals. The allele frequencies and GD value ranged from 0.010 1 to 0.886 8 and 0.211 2 to 0.846 2, respectively. The GD value was greater than 0.5 in the 19 loci. A total of 47 haplotypes were observed. Based on R(ST), multidimensional scaling plot indicated that the genetic relationship among Fujian She nationality and Minnan Han nationality was closest, followed by Southern China Han nationality and Northern China nationality. Goldeneye™ DNA ID 26Y Kit including 26 Y-STR loci has good polymorphism in the She nationality. As an additional system, it has forensic application value in some special cases.
Genetic analysis of 15 autosomal and 12 Y-STR loci in the Espirito Santo State population, Brazil.
Wolfgramm, Eldamária de Vargas; Silva, Beatriz Candida; Aguiar, Vitor Resende da Costa; Malta, Frederico Scott Varela; de Castro, Amanda Mafia; Ferreira, Alessandro Clayton de Souza; Prezoti, Alessandra Nunes Loureiro; de Paula, Flavia; Louro, Iúri Drumond
2011-06-01
This study provides population genetic data for individuals of Vitoria, Espirito Santo, Brazil, a location not yet characterized for STR frequencies used for genetic identification studies. Allelic frequencies and other population data analysis are reported for the 15 autosomal-STR loci included in the PowerPlex(®)16 kit (CSF1PO, D13S317, D16S539, D18S51, D21S11, D3S1358, D5S818, D7S820, D8S1179, FGA, Penta D, Penta E, TPOX, TH01 and vWA). Allele and haplotype frequencies, gene diversity and discrimination capacity were also estimated for the PowerPlex(®) Y System (DYS19, DYS385, DYS389I/II, DYS390, DYS391, DYS392, DYS393, DYS437, DYS438 and DYS439). Blood samples were obtained from 226 unrelated volunteers (135 males and 91 females) residents in the city of Vitoria, representing a typical sample of the mixed ethnicity present in the Espirito Santo State, Brazil. Within the tested population, the total number of individuals typed for specific markers is: 226 for D13S317, D21S11, D3S1358, D7S820, D8S1179 and FGA; 225 for D16S539 and D5S818; 224 for D18S51; 223 for CSF1PO; 222 for Penta D and vWA; 220 for Penta E; 207 for TPOX and 142 for TH01. Y-STR haplotypes were analyzed for 102 unrelated males, being 71 of them present in the 135 autosomal-STR sample, and 31 new males tested only for Y-STR markers. All autosomal markers were in Hardy-Weinberg Equilibrium. Y-STR analysis identified 101 haplotypes, being 100 of them unique. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.
Y-STR haplotypes of Native American populations from the Brazilian Amazon region.
Palha, Teresinha Jesus Brabo Ferreira; Rodrigues, Elzemar Martins Ribeiro; dos Santos, Sidney Emanuel Batista
2010-10-01
The allele and haplotype frequencies of nine Y-STRs (DYS19, DYS389 I, DYS389 II, DYS390, DYS391, DYS392, DYS393, DYS385 I/II) were determined in a sample of six native tribes from the Brazilian Amazon (Tiriyó, Awa-Guajá, Waiãpi, Urubu-Kaapor, Zoé and Parakanã). Forty-eight different haplotypes were identified, 28 of which unique. Five haplotypes are very frequent and were shared by over 10 individuals. The estimated haplotype diversity (0.9114) was very low compared to other geographic groups, including Africans, Europeans and Asians. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.
Comprehensive annotated STR physical map of the human Y chromosome: Forensic implications.
Hanson, Erin K; Ballantyne, Jack
2006-03-01
A plethora of Y-STR markers from diverse sources have been deposited in public databases and represent potential candidates for incorporation into the next generation of Y-STR multiplexes for forensic use. Here, based upon all of the Y-STR loci that have been deposited in the human genome database (>400), we have sequentially positioned each one along the Y chromosome using the most current human genome sequencing data (NCBI Build 35). The information derived from this work defines the number and relative position of all potentially forensically relevant Y-STR loci, their location within the physical linkage map of the Y chromosome and their relationship to structural genes. We conclude that there exists at present at least 417 separate Y-STR markers available for potential forensic use, although many of these will be found to be unsuitable for other reasons. However, from this data, we were able to identify 28 pairs of duplicated loci that were given separate DYS designations and four pairs of loci with overlapping flanking regions. Removing one locus from each set of duplicates reduced the number of potentially useful loci from 417 to 389. The derived information should be useful for workers who are designing novel Y-STR multiplexes to ensure the presence of non-synonymous loci and, if so desired, to avoid loci that lie within structural genes. It may also be useful for forensic casework practitioners (or molecular anthropologists) to aid in distinguishing between chromosomal rearrangements (such as duplications and deletions) and bona fide DNA admixtures or null alleles caused by primer binding site mutations. We illustrate the practical usefulness of the chromosomal positioning data in the design of eight multiplex systems using 94 Y-STR loci.
Analysis of genetic admixture in Uyghur using the 26 Y-STR loci system
Bian, Yingnan; Zhang, Suhua; Zhou, Wei; Zhao, Qi; Siqintuya; Zhu, Ruxin; Wang, Zheng; Gao, Yuzhen; Hong, Jie; Lu, Daru; Li, Chengtao
2016-01-01
The Uyghur population has experienced extensive interaction with European and Eastern Asian populations historically. A set of high-resolution genetic markers could be useful to infer the genetic relationships between the Uyghur population and European and Asian populations. In this study we typed 100 unrelated Uyghur males living in southern Xinjiang at 26 Y-STR loci. Using the high-resolution 26 Y-STR loci system, we investigated genetic and phylogenetic relationship between the Uyghur population and 23 reference European or Asian populations. We found that the Uyghur population exhibited a genetic admixture of Eastern Asian and European populations, and had a slightly closer relationship with the selected European populations than the Eastern Asian populations. We also demonstrated that the 26 Y-STR loci system was potentially useful in forensic sciences because it has a large power of discrimination and rarely exhibits common haplotypes. However, ancestry inference of Uyghur samples could be challenging due to the admixed nature of the population. PMID:26842947
Analysis of genetic admixture in Uyghur using the 26 Y-STR loci system.
Bian, Yingnan; Zhang, Suhua; Zhou, Wei; Zhao, Qi; Siqintuya; Zhu, Ruxin; Wang, Zheng; Gao, Yuzhen; Hong, Jie; Lu, Daru; Li, Chengtao
2016-02-04
The Uyghur population has experienced extensive interaction with European and Eastern Asian populations historically. A set of high-resolution genetic markers could be useful to infer the genetic relationships between the Uyghur population and European and Asian populations. In this study we typed 100 unrelated Uyghur males living in southern Xinjiang at 26 Y-STR loci. Using the high-resolution 26 Y-STR loci system, we investigated genetic and phylogenetic relationship between the Uyghur population and 23 reference European or Asian populations. We found that the Uyghur population exhibited a genetic admixture of Eastern Asian and European populations, and had a slightly closer relationship with the selected European populations than the Eastern Asian populations. We also demonstrated that the 26 Y-STR loci system was potentially useful in forensic sciences because it has a large power of discrimination and rarely exhibits common haplotypes. However, ancestry inference of Uyghur samples could be challenging due to the admixed nature of the population.
Population data for 15 Y-chromosome STRs in a population sample from Quito (Ecuador).
Baeza, Carlos; Guzmán, Rodrigo; Tirado, Miriam; López-Parra, Ana María; Rodríguez, Tatiana; Mesa, María Soledad; Fernández, Eva; Arroyo-Pardo, Eduardo
2007-12-20
Population frequencies for the 9 Y-STR loci included in the "minimal haplotype" from Y-STR Haplotype Reference Database (YHRD), plus other 6 Y-STRs (DYS437, DYS438, DYS439, GATA A7.2, GATA H4 and GATA A10) were obtained for a sample of 120 males from Quito (Ecuador). One hundred and sixteen unique haplotypes were identified within the sample. Haplotype diversity (0.9994) was among the highest in comparison to other populations from Iberia and South-America. Genetic distances were calculated and our sample presented significative differences with all other samples, the lowest values being with a Guinean sample.
UK and Irish Y-STR population data-A catalogue of variant alleles.
Aliferi, Anastasia; Thomson, Jim; McDonald, Andrew; Paynter, Vanessa Molin; Ferguson, Steven; Vanhinsbergh, Des; Syndercombe Court, Denise; Ballard, David
2018-05-01
A total of 3128 Y-STR profiles from three UK and one Irish population have been analysed with the PowerPlex Y23 system and are reported here. Instances of haplotype sharing between apparently unrelated individuals were identified and further investigated with the use of the 5 additional markers within the Yfiler Plus kit, resulting in a reduction by 76% in the number of shared haplotypes. Furthermore, Yfiler Plus was also employed to verify locus deletions and duplications observed in Y23 genotypes while inconsistencies between the two kits were sequenced, revealing underlying Y23 primer binding site mutations in loci DYS392 and DYS576. Finally, the mechanism behind a previously reported population specific peak shift observed in DYS481 in South Asian samples has been evaluated and further investigated in a novel case of this phenomenon seen in a Black British individual featuring a different flanking region mutation. Copyright © 2018 Elsevier B.V. All rights reserved.
Chang, Yuet Meng; Perumal, Revathi; Keat, Phoon Yoong; Kuehn, Daniel L C
2007-03-22
We have analyzed 16 Y-STR loci (DYS456, DYS389I, DYS390, DYS389II, DYS458, DYS19, DYS385a/b, DYS393, DYS391, DYS439, DYS635 or Y-GATA C4, DYS392, Y-GATA H4, DYS437, DYS438 and DYS448) from the non-recombining region of the human Y-chromosome in 980 male individuals from three main ethnic populations in Malaysia (Malay, Chinese, Indian) using the AmpFlSTR((R)) Y-filertrade mark (Applied Biosystems, Foster City, CA). The observed 17-loci haplotypes and the individual allele frequencies for each locus were estimated, whilst the locus diversity, haplotype diversity and discrimination capacity were calculated in the three ethnic populations. Analysis of molecular variance indicated that 88.7% of the haplotypic variation is found within population and 11.3% is between populations (fixation index F(ST)=0.113, p=0.000). This study has revealed Y-chromosomes with null alleles at several Y-loci, namely DYS458, DYS392, DYS389I, DYS389II, DYS439, DYS448 and Y-GATA H4; and several occurrences of duplications at the highly polymorphic DYS385 loci. Some of these deleted loci were in regions of the Y(q) arm that have been implicated in the occurrence of male infertility.
Ehrenreich, Liezle; Benjeddou, Mongi; Davison, Sean; D'Amato, Maria; Leat, Neil
2008-07-01
Samples were collected from 108 Afrikaner males and 114 males of mixed ancestry. The term mixed ancestry is being used to denote a complex community which was established with contributions from Asians, Caucasians and Indigenous populations and constitutes a significant proportion of the Cape Town metropolitan population. Allele and haplotype frequencies were determined for nine Y-STR loci (DYS19, DYS389-I, DYS389-II, DYS390, DYS391, DYS392, DYS393 and the duplicated locus DYS385). Unique haplotypes were obtained for 64 Afrikaner males and 90 males of mixed ancestry. Both population groups shared the same most common haplotype.
Iacovacci, Giuseppe; D'Atanasio, Eugenia; Marini, Ornella; Coppa, Alfredo; Sellitto, Daniele; Trombetta, Beniamino; Berti, Andrea; Cruciani, Fulvio
2017-03-01
By using the recently introduced 6-dye Yfiler ® Plus multiplex, we analyzed 462 males belonging to 20 ethnic groups from four eastern African countries (Eritrea, Ethiopia, Djibouti and Kenya). Through a Y-STR sequence analysis, combined with 62 SNP-based haplogroup information, we were able to classify observed microvariant alleles at four Y-STR loci as either monophyletic (DYF387S1 and DYS458) or recurrent (DYS449 and DYS627). We found evidence of non-allelic gene conversion among paralogous STRs of the two-copy locus DYF387S1. Twenty-two diallelic and triallelic patterns observed at 13 different loci were found to be significantly over-represented (p<10 -6 ) among profiles obtained from cell lines compared to those from blood and saliva. Most of the diallelic/triallelic patterns from cell lines involved recurrent mutations at rapidly mutating loci (RM Y-STRs) included in the multiplex (p<10 -2 ). At haplotype level, intra-population diversity indices were found to be among the lowest so far reported for the Yfiler ® Plus, while statistically significant differences among countries and ethnic groups were detected when considering haplotype frequencies alone (F ST ) or by using molecular distances among haplotypes (Φ ST ). The strong population subdivision observed is probably the consequence of the patrilineal social organization of most eastern African ethnic groups, and suggests caution in the use of country-based haplotype frequency distributions for forensic inferences in this region. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Palha, Teresinha; Ribeiro-Rodrigues, Elzemar; Ribeiro-dos-Santos, Andrea; Santos, Sidney
2012-05-01
Fourteen Y-STR loci (DYS458, DYS439, Y-GATA H4, DYS576, DYS447, DYS460, DYS456, YGATA A10, DYS437, DYS449, DYS570, DYS635 or Y-GATA C4, DYS448 and DYS438) were analysed in 873 males from eight northern Brazil populations: Belém (N=400), Santarém (N=69), Manaus (N=75), Macapá (N=65), Palmas (N=30), Rio Branco (N=32), Porto Velho (N=135) and Boa Vista (N=67). A total of 871 different haplotypes were identified, of which 869 were unique. The panel's estimated total haplotype diversity (HD) is 0.9988, and its discrimination capacity (DC) is 0.9980. The lowest estimates of genetic diversity correspond to markers Y-GATA H4 (0.550) and DYS460 (0.581), and the greatest (above 0.700) to markers DYS458, DYS576, DYS447, YS449, DYS570 and DYS635. The genetic parameters obtained were higher for the 14-Y-STR panel than that for the minimum haplotype set (HD=0.9969; DC=0.76) and the parameters were similar to those obtained with the panel of 17 YSTR of YHRD (HD=0.9987; DC=0. 9870). The analysis of molecular variance (AMOVA) indicated that most of the genetic variance is found within populations and a smaller, but significant part, is found among populations (R(ST)=0.027, p value=0.009). The data when compared with those from African, Amerindian and European populations have shown no significant genetic distance between northern Brazil populations and Europeans, but there is a significant genetic distance when compared to Africans and Amerindians. The discrimination capacity of the markers shows a high potential for forensic analysis. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
Technical note: developmental validation of a novel 6-dye typing system with 36 Y-STR loci.
Du, Weian; Feng, Peipei; Huang, Hongyan; Wu, Weibin; Zhang, Lei; Guo, Yulin; Liu, Changhui; Liu, Hong; Liu, Chao; Chen, Ling
2018-05-30
Y-chromosomal short tandem repeats (Y-STRs) have proven to be very useful in investigating sexual assault cases and in paternity lineage differentiation. However, currently available commercial Y-STR multiplex amplification systems bear the limitations in the identification of related males from the same paternal lineage due to there being an insufficient number of loci in any single amplification kit. The aim of this study was to establish and validate a novel 6-dye, 36-plex Y-STR multiplex amplification system that incorporated all of the loci present in the Yfiler™ Plus kit (DYS19, DYS385a/b, DYF387S1, DYS389I/II, DYS390, DYS391, DYS392, DYS393, DYS437, DYS438, DYS439, DYS448, DYS449, DYS456, DYS458, DYS460, DYS481, DYS518, DYS533, DYS570, DYS576, DYS627, DYS635, Y_GATA_H4) as well as a further nine highly polymorphic Y-STR loci (DYS388, DYS444, DYS447, DYS522, DYS527a/b, DYS549, DYS596, DYS643). The novel system was optimized and validated by a series of studies that tested the effect of different PCR-based conditions as well as the species specificity, sensitivity, stability, stutter precision, suitability for use on DNA mixtures, reproducibility, and parallel testing of the system, as well as its performance on casework samples and population analysis, according to the SWGDAM developmental validation guidelines. A total of 246 haplotypes were found for the 36 Y-STRs among 247 Guangdong Han unrelated males. Collectively, the results demonstrate that the developed 36-plex Y-STR system is sensitive, robust, reliable, and highly informative for use in forensic genetics.
Analysis of 16 autosomal STRs and 17 Y-STRs in an indigenous Maya population from Guatemala.
Cardoso, Sergio; Sevillano, Rubén; Illescas, María J; de Pancorbo, Marian Martínez
2016-03-01
The aim of this study was to contribute new data on autosomal STR and Y-STR markers of the Mayas from Guatemala in order to improve available databases of forensic interest. We analyzed 16 autosomal STR markers in a population sample of 155 indigenous Maya and 17 Y-chromosomal STR markers in the 100 males of the sample. Deviations from Hardy-Weinberg equilibrium and linkage disequilibrium between autosomal STR markers were not observed at any loci. The combined power of exclusion was estimated as 99.9991% and the combined power of discrimination was >99.999999999999%. Haplotype diversity of Y-STRs was calculated as 0.9984 ± 0.0018 and analysis of pairwise genetic distances (Rst) supported the Native American background of the population.
The discrete Laplace exponential family and estimation of Y-STR haplotype frequencies.
Andersen, Mikkel Meyer; Eriksen, Poul Svante; Morling, Niels
2013-07-21
Estimating haplotype frequencies is important in e.g. forensic genetics, where the frequencies are needed to calculate the likelihood ratio for the evidential weight of a DNA profile found at a crime scene. Estimation is naturally based on a population model, motivating the investigation of the Fisher-Wright model of evolution for haploid lineage DNA markers. An exponential family (a class of probability distributions that is well understood in probability theory such that inference is easily made by using existing software) called the 'discrete Laplace distribution' is described. We illustrate how well the discrete Laplace distribution approximates a more complicated distribution that arises by investigating the well-known population genetic Fisher-Wright model of evolution by a single-step mutation process. It was shown how the discrete Laplace distribution can be used to estimate haplotype frequencies for haploid lineage DNA markers (such as Y-chromosomal short tandem repeats), which in turn can be used to assess the evidential weight of a DNA profile found at a crime scene. This was done by making inference in a mixture of multivariate, marginally independent, discrete Laplace distributions using the EM algorithm to estimate the probabilities of membership of a set of unobserved subpopulations. The discrete Laplace distribution can be used to estimate haplotype frequencies with lower prediction error than other existing estimators. Furthermore, the calculations could be performed on a normal computer. This method was implemented in the freely available open source software R that is supported on Linux, MacOS and MS Windows. Copyright © 2013 Elsevier Ltd. All rights reserved.
Huszar, Tunde I; Jobling, Mark A; Wetton, Jon H
2018-04-12
Short tandem repeats on the male-specific region of the Y chromosome (Y-STRs) are permanently linked as haplotypes, and therefore Y-STR sequence diversity can be considered within the robust framework of a phylogeny of haplogroups defined by single nucleotide polymorphisms (SNPs). Here we use massively parallel sequencing (MPS) to analyse the 23 Y-STRs in Promega's prototype PowerSeq™ Auto/Mito/Y System kit (containing the markers of the PowerPlex® Y23 [PPY23] System) in a set of 100 diverse Y chromosomes whose phylogenetic relationships are known from previous megabase-scale resequencing. Including allele duplications and alleles resulting from likely somatic mutation, we characterised 2311 alleles, demonstrating 99.83% concordance with capillary electrophoresis (CE) data on the same sample set. The set contains 267 distinct sequence-based alleles (an increase of 58% compared to the 169 detectable by CE), including 60 novel Y-STR variants phased with their flanking sequences which have not been reported previously to our knowledge. Variation includes 46 distinct alleles containing non-reference variants of SNPs/indels in both repeat and flanking regions, and 145 distinct alleles containing repeat pattern variants (RPV). For DYS385a,b, DYS481 and DYS390 we observed repeat count variation in short flanking segments previously considered invariable, and suggest new MPS-based structural designations based on these. We considered the observed variation in the context of the Y phylogeny: several specific haplogroup associations were observed for SNPs and indels, reflecting the low mutation rates of such variant types; however, RPVs showed less phylogenetic coherence and more recurrence, reflecting their relatively high mutation rates. In conclusion, our study reveals considerable additional diversity at the Y-STRs of the PPY23 set via MPS analysis, demonstrates high concordance with CE data, facilitates nomenclature standardisation, and places Y-STR sequence variants
Rębała, Krzysztof; Veselinović, Igor; Siváková, Daniela; Patskun, Erika; Kravchenko, Sergey; Szczerkowska, Zofia
2014-01-01
Studies on Y-chromosomal markers revealed significant genetic differentiation between Southern and Northern (Western and Eastern) Slavic populations. The northern Serbian region of Vojvodina is inhabited by Southern Slavic Serbian majority and, inter alia, Western Slavic (Slovak) and Eastern Slavic (Ruthenian) minorities. In the study, 15 autosomal STR markers were analysed in unrelated Slovaks, Ruthenians and Serbs from northern Serbia and western Slovakia. Additionally, Slovak males from Serbia were genotyped for 17 Y-chromosomal STR loci. The results were compared to data available for other Slavic populations. Genetic distances for autosomal markers revealed homogeneity between Serbs from northern Serbia and Slovaks from western Slovakia and distinctiveness of Serbian Slovaks and Ruthenians. Y-STR variation showed a clear genetic departure of the Slovaks and Ruthenians inhabiting Vojvodina from their Serbian neighbours and genetic similarity to the Northern Slavic populations of Slovakia and Ukraine. Admixture estimates revealed negligible Serbian paternal ancestry in both Northern Slavic minorities of Vojvodina, providing evidence for their genetic isolation from the Serbian majority population. No reduction of genetic diversity at autosomal and Y-chromosomal markers was found, excluding genetic drift as a reason for differences observed at autosomal STRs. Analysis of molecular variance detected significant population stratification of autosomal and Y-chromosomal microsatellites in the three Slavic populations of northern Serbia, indicating necessity for separate databases used for estimations of frequencies of autosomal and Y-chromosomal STR profiles in forensic casework. Our results demonstrate that regarding Y-STR haplotypes, Serbian Slovaks and Ruthenians fit in the Eastern European metapopulation defined in the Y chromosome haplotype reference database. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Claerhout, Sofie; Vandenbosch, Michiel; Nivelle, Kelly; Gruyters, Leen; Peeters, Anke; Larmuseau, Maarten H D; Decorte, Ronny
2018-05-01
Knowledge of Y-chromosomal short tandem repeat (Y-STR) mutation rates is essential to determine the most recent common ancestor (MRCA) in familial searching or genealogy research. Up to now, locus-specific mutation rates have been extensively examined especially for commercially available forensic Y-STRs, while haplogroup specific mutation rates have not yet been investigated in detail. Through 450 patrilineally related namesakes distributed over 212 deep-rooting genealogies, the individual mutation rates of 42 Y-STR loci were determined, including 27 forensic Y-STR loci from the Yfiler ® Plus kit and 15 additional Y-STR loci (DYS388, DYS426, DYS442, DYS447, DYS454, DYS455, DYS459a/b, DYS549, DYS607, DYS643, DYS724a/b and YCAIIa/b). At least 726 mutations were observed over 148,596 meiosis and individual Y-STR mutation rates varied from 2.83 × 10 -4 to 1.86 × 10 -2 . The mutation rate was significantly correlated with the average allele size, the complexity of the repeat motif sequence and the age of the father. Significant differences in average Y-STR mutations rates were observed when haplogroup 'I & J' (4.03 × 10 -3 mutations/generation) was compared to 'R1b' (5.35 × 10 -3 mutations/generation) and to the overall mutation rate (5.03 × 10 -3 mutations/generation). A difference in allele size distribution was identified as the only cause for these haplogroup specific mutation rates. The haplogroup specific mutation rates were also present within the commercially available Y-STR kits (Yfiler ® , PowerPlex ® Y23 System and Yfiler ® Plus). This observation has consequences for applications where an average Y-STR mutation rate is used, e.g. tMRCA estimations in familial searching and genealogy research. Copyright © 2018 Elsevier B.V. All rights reserved.
Yao, Jun; Wang, Bao-jie
2016-01-01
In the present study, we investigated the genetic characteristics of 25 Y-chromosomal and 15 autosomal short tandem repeat (STR) loci in 305 unrelated Han Chinese male individuals from Liaoning Province using AmpFISTR® Yfiler® Plus and IdentifilerTM PCR amplification kits. Population comparison was performed between Liaoning Han population and different ethnic groups to better understand the genetic background of the Liaoning Han population. For Y-STR loci, the overall haplotype diversity was 0.9997 and the discrimination capacity was 0.9607. Gene diversity values ranged from 0.4525 (DYS391) to 0.9617 (DYS385). Rst and two multi-dimensional scaling plots showed that minor differences were observed when the Liaoning Han population was compared to the Jilin Han Chinese, Beijing Han Chinese, Liaoning Manchu, Liaoning Mongolian, Liaoning Xibe, Shandong Han Chinese, Jiangsu Han Chinese, Anhui Han Chinese, Guizhou Han Chinese and Liaoning Hui populations; by contrast, major differences were observed when the Shanxi Han Chinese, Yunnan Bai, Jiangxi Han Chinese, Guangdong Han Chinese, Liaoning Korean, Hunan Tujia, Guangxi Zhuang, Gansu Tibetan, Xishuangbanna Dai, South Korean, Japanese and Hunan Miao populations. For autosomal STR loci, DP ranged from 0.9621 (D2S1338) to 0.8177 (TPOX), with PE distributing from 0.7521 (D18S51) to 0.2988 (TH01). A population comparison was performed and no statistically significant differences were detected at any STR loci between Liaoning Han, China Dong, and Shaanxi Han populations. The results showed that the 25 Y-STR and 15 autosomal STR loci in the Liaoning Han population were valuable for forensic applications and human genetics, and Liaoning Han was an independent endogenous ethnicity with a unique subpopulation structure. PMID:27483472
Makki-Rmida, Faten; Kammoun, Arwa; Mahfoudh, Nadia; Ayadi, Adnene; Gibriel, Abdullah Ahmed; Mallek, Bakhta; Maalej, Leila; Hammami, Zouheir; Maatoug, Samir; Makni, Hafedh; Masmoudi, Saber
2015-12-01
Y chromosome STRs (Y-STRs) are being used frequently in forensic laboratories. Previous studies of Y-STR polymorphisms in different groups of the Tunisian population identified low levels of diversity and discrimination capacity (DC) using various commercial marker sets. This definitely limits the use of such systems for Y-STRs genotyping in Tunisia. In our investigation on South Tunisia, 200 unrelated males were typed for the 12 conventional Y-STRs included in the PowerPlex® Y System. Additional set of nine noncore Y-STRs including DYS446, DYS456, DYS458, DYS388, DYS444, DYS445, DYS449, DYS710, and DYS464 markers were genotyped and evaluated for their potential in improving DC. Allele frequency, gene diversity, haplotype diversity (HD), and DC calculation revealed that DYS464 was the most diverse marker followed by DYS710 and DYS449 markers. The standard panel of 12 Y-STRs (DC = 80.5%) and the nine markers were combined to obtain DC of 99%. Among the 198 different haplotypes observed, 196 haplotypes were unique (HD = 99.999). Out of the nine noncore set, six Y-STRs (DYS458, DYS456, DYS449, DYS710, DYS444, and DYS464) had the greatest impact on enhancing DC. Our data provided putative Y-STRs combination to be used for genetic and forensic applications. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Expanded Croatian 12 X-STR loci database with an overview of anomalous profiles.
Mršić, Gordan; Ozretić, Petar; Crnjac, Josip; Merkaš, Siniša; Sukser, Viktorija; Račić, Ivana; Rožić, Sara; Barbarić, Lucija; Popović, Maja; Korolija, Marina
2018-05-01
In order to implement X-chromosome short tandem repeat (X-STR) typing into routine forensic practice, reference database of a given population should be established. Therefore we extended already published data with additional 397 blood samples from unrelated Croatian citizens, and analyzed the total of 995 samples (549 male and 446 female) typed by Investigator ® Argus X-12 Kit. To test genetic homogeneity of consecutively processed five historic-cultural regions covering the entire national territory, we calculated pairwise Fst genetic distances between regions based on allele and full haplotype frequencies. Since the comparison did not yield any statistically significant difference, we integrated STR profile information from all regions and used the whole data set to calculate forensic parameters. The most informative marker is DXS10135 (polymorphism information content (PIC = 0.929) and the most informative linkage group (LG) is LG1 (PIC = 0.996). We confirmed linkage disequilibrium (LD) for seven marker pairs belonging to LG2, LG3 and LG4. By including LD information, we calculated cumulative power of discrimination that amounted to 0.999999999997 in females and 0.999999005 in males. We also compared Croatia with 13 European populations based on haplotype frequencies and detected no statistically significant Fst values after Bonferroni correction in any LG. Multi-dimensional scaling plot revealed tight grouping of four Croatian regions amongst populations of southern, central and northern Europe, with the exception of northern Croatia. In this study we gave the first extensive overview of aberrant profiles encountered during Investigator ® Argus X-12 typing. We found ten profiles consistent with single locus duplication followed by tetranucleotide tract length polymorphism. Locus DXS10079 is by far the most frequently affected one, presumably mutated in eight samples. We also found four profiles consistent with X-chromosome aneuploidy (three profiles
Y-SNPs haplotype diversity in four Chinese cattle breeds.
Zhang, Runfeng; Cheng, Ming; Li, Xiaofeng; Chen, Fuying; Zheng, Jing; Wang, Xiaofei; Meng, Quanke
2013-01-01
To investigate the genetic diversity of Chinese cattle, 96 male samples of 4 Chinese native cattle breeds were investigated using 5 single nucleotide polymorphisms specific to the bovine Y chromosome. Two previously described haplotypes (taurine Y2 and indicine Y3) were detected in 74 and 22 animals, respectively. The haplotype frequencies varied amongst the four native breeds. The taurine Y2 haplotype dominated in the Qinchuan, Dabieshan, and Yunba breeds. However, the indicine Y3 haplotype occurred in high frequency in the Enshi breed. Among the four native breeds, Yunba had the highest haplotype diversity (0.4330 ± 0.0750), followed by Qinchuan (0.2899 ± 0.1028) and Enshi (0.2222 ± 0.1662), Dabieshan was the least differentiated (0.1079 ± 0.0680). Compared with some foreign cattle breeds, the low level of haplotype diversity was detected in our breeds (0.2633 ± 0.1030).
Extended Y chromosome haplotypes resolve multiple and unique lineages of the Jewish priesthood.
Hammer, Michael F; Behar, Doron M; Karafet, Tatiana M; Mendez, Fernando L; Hallmark, Brian; Erez, Tamar; Zhivotovsky, Lev A; Rosset, Saharon; Skorecki, Karl
2009-11-01
It has been known for over a decade that a majority of men who self report as members of the Jewish priesthood (Cohanim) carry a characteristic Y chromosome haplotype termed the Cohen Modal Haplotype (CMH). The CMH has since been used to trace putative Jewish ancestral origins of various populations. However, the limited number of binary and STR Y chromosome markers used previously did not provide the phylogenetic resolution needed to infer the number of independent paternal lineages that are encompassed within the Cohanim or their coalescence times. Accordingly, we have genotyped 75 binary markers and 12 Y-STRs in a sample of 215 Cohanim from diverse Jewish communities, 1,575 Jewish men from across the range of the Jewish Diaspora, and 2,099 non-Jewish men from the Near East, Europe, Central Asia, and India. While Cohanim from diverse backgrounds carry a total of 21 Y chromosome haplogroups, 5 haplogroups account for 79.5% of Cohanim Y chromosomes. The most frequent Cohanim lineage (46.1%) is marked by the recently reported P58 T->C mutation, which is prevalent in the Near East. Based on genotypes at 12 Y-STRs, we identify an extended CMH on the J-P58* background that predominates in both Ashkenazi and non-Ashkenazi Cohanim and is remarkably absent in non-Jews. The estimated divergence time of this lineage based on 17 STRs is 3,190 +/- 1,090 years. Notably, the second most frequent Cohanim lineage (J-M410*, 14.4%) contains an extended modal haplotype that is also limited to Ashkenazi and non-Ashkenazi Cohanim and is estimated to be 4.2 +/- 1.3 ky old. These results support the hypothesis of a common origin of the CMH in the Near East well before the dispersion of the Jewish people into separate communities, and indicate that the majority of contemporary Jewish priests descend from a limited number of paternal lineages.
Population-Scale Sequencing Data Enable Precise Estimates of Y-STR Mutation Rates
Willems, Thomas; Gymrek, Melissa; Poznik, G. David; Tyler-Smith, Chris; Erlich, Yaniv
2016-01-01
Short tandem repeats (STRs) are mutation-prone loci that span nearly 1% of the human genome. Previous studies have estimated the mutation rates of highly polymorphic STRs by using capillary electrophoresis and pedigree-based designs. Although this work has provided insights into the mutational dynamics of highly mutable STRs, the mutation rates of most others remain unknown. Here, we harnessed whole-genome sequencing data to estimate the mutation rates of Y chromosome STRs (Y-STRs) with 2–6 bp repeat units that are accessible to Illumina sequencing. We genotyped 4,500 Y-STRs by using data from the 1000 Genomes Project and the Simons Genome Diversity Project. Next, we developed MUTEA, an algorithm that infers STR mutation rates from population-scale data by using a high-resolution SNP-based phylogeny. After extensive intrinsic and extrinsic validations, we harnessed MUTEA to derive mutation-rate estimates for 702 polymorphic STRs by tracing each locus over 222,000 meioses, resulting in the largest collection of Y-STR mutation rates to date. Using our estimates, we identified determinants of STR mutation rates and built a model to predict rates for STRs across the genome. These predictions indicate that the load of de novo STR mutations is at least 75 mutations per generation, rivaling the load of all other known variant types. Finally, we identified Y-STRs with potential applications in forensics and genetic genealogy, assessed the ability to differentiate between the Y chromosomes of father-son pairs, and imputed Y-STR genotypes. PMID:27126583
Chang, Yuet Meng; Swaran, Yuvaneswari; Phoon, Yoong Keat; Sothirasan, Kavin; Sim, Hang Thiew; Lim, Kong Boon; Kuehn, Daniel
2009-06-01
17 Y-STRs (DYS456, DYS389I, DYS390, DYS389II, DYS458, DYS19, DYS385a/b, DYS393, DYS391, DYS439, DYS635 or Y-GATA C4, DYS392, Y-GATA H4, DYS437, DYS438 and DYS448) have been analyzed in 320 male individuals from Sarawak, an eastern state of Malaysia on the Borneo island using the AmpFlSTR Y-filer (Applied Biosystems, Foster City, CA). These individuals were from three indigenous ethnic groups in Sarawak comprising of 103 Ibans, 113 Bidayuhs and 104 Melanaus. The observed 17-loci haplotypes and the individual allele frequencies for each locus were estimated, whilst the locus diversity, haplotype diversity and discrimination capacity were calculated in the three groups. Analysis of molecular variance (AMOVA) indicated that 87.6% of the haplotypic variation was found within population and 12.4% between populations (fixation index F(ST)=0.124, p=0.000). This study has revealed that the indigenous populations in Sarawak are distinctly different to each other, and to the three major ethnic groups in Malaysia (Malays, Chinese and Indians), with the Melanaus having a strikingly high degree of shared haplotypes within. There are rare unusual variants and microvariants that were not present in Malaysian Malay, Chinese or Indian groups. In addition, occurrences of DYS385 duplications which were only noticeably present in Chinese group previously was also observed in the Iban group whilst null alleles were detected at several Y-loci (namely DYS19, DYS392, DYS389II and DYS448) in the Iban and Melanau groups.
Zubizarreta, Josu; Davis, Michael C; Hampikian, Greg
2011-12-01
Fifty unrelated Basque males from southwest Idaho were typed for the 17 Y-STR loci in the Yfiler multiplex kit (DYS19, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS437, DYS438, DYS439, DYS448, DYS456, DYS458, DYS635, YGATA H4.1 and DYS385a/b). In total, 42 haplotypes were identified, with no more than two individuals sharing a single haplotype. The haplotype diversity (HD) was 0.9935, and gene diversity (D) over loci was 0.457 ± 0.137. The Idaho Basque population was compared to the source population from the Basque autonomous region of Northern Spain and Southern France, as well as a United States Caucasian population. The haplotype diversity for the immigrant Basque sample is within 0.4% of the haplotype diversity of the European Basques (0.9903); thus the power of discrimination is similar for each population. The Idaho Basque population has less diversity in 9 out of 16 loci (considering DYS385a/b together) and 3% less diversity across all loci, compared to the European Basque population. A multidimensional scaling analysis (MDS) was created using pairwise R(ST) values to compare the Idaho Basques to other populations. Based upon R(ST) and F(ST) measures, no significant differentiation was found between the Idaho and source European Basque population.
Souza, C A; Oliveira, T C; Crovella, S; Santos, S M; Rabêlo, K C N; Soriano, E P; Carvalho, M V D; Junior, A F Caldas; Porto, G G; Campello, R I C; Antunes, A A; Queiroz, R A; Souza, S M
2017-04-28
The use of Y chromosome haplotypes, important for the detection of sexual crimes in forensics, has gained prominence with the use of databases that incorporate these genetic profiles in their system. Here, we optimized and validated an amplification protocol for Y chromosome profile retrieval in reference samples using lesser materials than those in commercial kits. FTA ® cards (Flinders Technology Associates) were used to support the oral cells of male individuals, which were amplified directly using the SwabSolution reagent (Promega). First, we optimized and validated the process to define the volume and cycling conditions. Three reference samples and nineteen 1.2 mm-diameter perforated discs were used per sample. Amplification of one or two discs (samples) with the PowerPlex ® Y23 kit (Promega) was performed using 25, 26, and 27 thermal cycles. Twenty percent, 32%, and 100% reagent volumes, one disc, and 26 cycles were used for the control per sample. Thereafter, all samples (N = 270) were amplified using 27 cycles, one disc, and 32% reagents (optimized conditions). Data was analyzed using a study of equilibrium values between fluorophore colors. In the samples analyzed with 20% volume, an imbalance was observed in peak heights, both inside and in-between each dye. In samples amplified with 32% reagents, the values obtained for the intra-color and inter-color standard balance calculations for verification of the quality of the analyzed peaks were similar to those of samples amplified with 100% of the recommended volume. The quality of the profiles obtained with 32% reagents was suitable for insertion into databases.
[Identification of Y-chromosomal Genetic Types for the Soldier's Remains from Huaihai Campaign].
Wang, C Z; Wen, S Q; Shi, M S; Yu, X E; Wang, X J; Pan, Y L; Zhang, Y F; Li, H; Tan, J Z
2017-08-01
To identify the Y-chromosomal genetic types for the soldier's remains from Huaihai Campaign, and to offer a clue for search of their paternal relatives. DNA of the remains were extracted by the ancient DNA extraction method. Yfiler kit was used for the multiplex amplification of 17 Y-STR loci. The haplogroups of the samples were speculated. Detailed genotyping of the selected Y-SNP was performed based on the latest Y-chromosome phylogenetic tree. Haplotype-sharing analysis was done based on the data of Y-SNP and Y-STR, the closest modern individual information to the genetic relationship of remains was gained. A total of 8 Y-STR haplotypes were observed on 17 Y-STR loci of 8 male individuals. Furthermore, 6 Y-SNP haplogroups were identified, which were O2a1-M95+, O1a1-P203+, O3*-M122+/M234-, D1-M15+, C3*-ST and R1a1-M17+. Identification of Y-chromosomal genetic types for the soldier's remains from Huaihai Campaign shows a reference value on inferring the geographical origins of old materials. Copyright© by the Editorial Department of Journal of Forensic Medicine
Tsybovskii, I S; Veremeichik, V M; Kotova, S A; Kritskaya, S V; Evmenenko, S A; Udina, I G
2017-02-01
For the Republic of Belarus, development of a forensic reference database on the basis of 18 autosomal microsatellites (STR) using a population dataset (N = 1040), “familial” genotypic dataset (N = 2550) obtained from expertise performance of paternity testing, and a dataset of genotypes from a criminal registration database (N = 8756) is described. Population samples studied consist of 80% ethnic Belarusians and 20% individuals of other nationality or of mixed origin (by questionnaire data). Genotypes of 12346 inhabitants of the Republic of Belarus from 118 regional samples studied by 18 autosomal microsatellites are included in the sample: 16 tetranucleotide STR (D2S1338, TPOX, D3S1358, CSF1PO, D5S818, D8S1179, D7S820, THO1, vWA, D13S317, D16S539, D18S51, D19S433, D21S11, F13B, and FGA) and two pentanucleotide STR (Penta D and Penta E). The samples studied are in Hardy–Weinberg equilibrium according to distribution of genotypes by 18 STR. Significant differences were not detected between discrete populations or between samples from various historical ethnographic regions of the Republic of Belarus (Western and Eastern Polesie, Podneprovye, Ponemanye, Poozerye, and Center), which indicates the absence of prominent genetic differentiation. Statistically significant differences between the studied genotypic datasets also were not detected, which made it possible to combine the datasets and consider the total sample as a unified forensic reference database for 18 “criminalistic” STR loci. Differences between reference database of the Republic of Belarus and Russians and Ukrainians by the distribution of the range of autosomal STR also were not detected, corresponding to a close genetic relationship of the three Eastern Slavic nations mediated by common origin and intense mutual migrations. Significant differences by separate STR loci between the reference database of Republic of Belarus and populations of Southern and Western Slavs were observed. The
Ancestral Asian source(s) of new world Y-chromosome founder haplotypes.
Karafet, T M; Zegura, S L; Posukh, O; Osipova, L; Bergen, A; Long, J; Goldman, D; Klitz, W; Harihara, S; de Knijff, P; Wiebe, V; Griffiths, R C; Templeton, A R; Hammer, M F
1999-01-01
Haplotypes constructed from Y-chromosome markers were used to trace the origins of Native Americans. Our sample consisted of 2,198 males from 60 global populations, including 19 Native American and 15 indigenous North Asian groups. A set of 12 biallelic polymorphisms gave rise to 14 unique Y-chromosome haplotypes that were unevenly distributed among the populations. Combining multiallelic variation at two Y-linked microsatellites (DYS19 and DXYS156Y) with the unique haplotypes results in a total of 95 combination haplotypes. Contra previous findings based on Y- chromosome data, our new results suggest the possibility of more than one Native American paternal founder haplotype. We postulate that, of the nine unique haplotypes found in Native Americans, haplotypes 1C and 1F are the best candidates for major New World founder haplotypes, whereas haplotypes 1B, 1I, and 1U may either be founder haplotypes and/or have arrived in the New World via recent admixture. Two of the other four haplotypes (YAP+ haplotypes 4 and 5) are probably present because of post-Columbian admixture, whereas haplotype 1G may have originated in the New World, and the Old World source of the final New World haplotype (1D) remains unresolved. The contrasting distribution patterns of the two major candidate founder haplotypes in Asia and the New World, as well as the results of a nested cladistic analysis, suggest the possibility of more than one paternal migration from the general region of Lake Baikal to the Americas. PMID:10053017
Similarities and distinctions in Y chromosome gene pool of Western Slavs.
Woźniak, Marcin; Malyarchuk, Boris; Derenko, Miroslava; Vanecek, Tomas; Lazur, Jan; Gomolcak, Pavol; Grzybowski, Tomasz
2010-08-01
Analysis of Y chromosome Y-STRs has proven to be a useful tool in the field of population genetics, especially in the case of closely related populations. We collected DNA samples from 169 males of Czech origin, 80 males of Slovakian origin, and 142 males dwelling Northern Poland. We performed Y-STR analysis of 12 loci in the samples collected (PowerPlex Y system from Promega) and compared the Y chromosome haplotype frequencies between the populations investigated. Also, we used Y-STR data available from the literature for comparison purposes. We observed significant differences between Y chromosome pools of Czechs and Slovaks compared to other Slavic and European populations. At the same time we were able to point to a specific group of Y-STR haplotypes belonging to an R1a haplogroup that seems to be shared by Slavic populations dwelling in Central Europe. The observed Y chromosome diversity may be explained by taking into consideration archeological and historical data regarding early Slav migrations. Copyright 2010 Wiley-Liss, Inc.
Marjanović, Damir; Durmić-Pašić, Adaleta; Kovačević, Lejla; Avdić, Jasna; Džehverović, Mirela; Haverić, Sanin; Ramić, Jasmin; Kalamujić, Belma; Bilela, Lada Lukić; Škaro, Vedrana; Projić, Petar; Bajrović, Kasim; Drobnič, Katja; Davoren, Jon; Primorac, Dragan
2009-01-01
Aim To report on the use of STR, Y-STRs, and miniSTRs typing methods in the identification of victims of revolutionary violence and crimes against humanity committed by the Communist Armed Forces during and after World War II in which bodies were exhumed from mass and individual graves in Slovenia. Methods Bone fragments and teeth were removed from human remains found in several small and closely located hidden mass graves in the Škofja Loka area (Lovrenska Grapa and Žolšče) and 2 individual graves in the Ljubljana area (Podlipoglav), Slovenia. DNA was isolated using the Qiagen DNA extraction procedure optimized for bone and teeth. Some DNA extracts required additional purification, such as N-buthanol treatment. The QuantifilerTM Human DNA Quantification Kit was used for DNA quantification. Initially, PowerPlex 16 kit was used to simultaneously analyze 15 short tandem repeat (STR) loci. The PowerPlex S5 miniSTR kit and AmpFℓSTR® MiniFiler PCR Amplification Kit was used for additional analysis if preliminary analysis yielded weak partial or no profiles at all. In 2 cases, when the PowerPlex 16 profiles indicated possible relatedness of the remains with reference samples, but there were insufficient probabilities to call the match to possible male paternal relatives, we resorted to an additional analysis of Y-STR markers. PowerPlex® Y System was used to simultaneously amplify 12 Y-STR loci. Fragment analysis was performed on an ABI PRISM 310 genetic analyzer. Matching probabilities were estimated using the DNA-View software. Results Following the Y-STR analysis, 1 of the “weak matches” previously obtained based on autosomal loci, was confirmed while the other 1 was not. Combined standard STR and miniSTR approach applied to bone samples from 2 individual graves resulted in positive identifications. Finally, using the same approach on 11 bone samples from hidden mass grave Žološče, we were able to obtain 6 useful DNA profiles. Conclusion The results of
Khubrani, Yahya M; Wetton, Jon H; Jobling, Mark A
2018-03-01
Saudi Arabia's indigenous population is organized into patrilineal descent groups, but to date, little has been done to characterize its population structure, in particular with respect to the male-specific region of the Y chromosome. We have used the 27-STR Yfiler ® Plus kit to generate haplotypes in 597 unrelated Saudi males, classified into five geographical regions (North, South, Central, East and West). Overall, Yfiler ® Plus provides a good discrimination capacity of 95.3%, but this is greatly reduced (74.7%) when considering the reduced Yfiler ® set of 17 Y-STRs, justifying the use of the expanded set of markers in this population. Comparison of the five geographical divisions reveals striking differences, with low diversity and similar haplotype spectra in the Central and Northern regions, and high diversity and similar haplotype spectra in the East and West. These patterns likely reflect the geographical isolation of the desert heartland of the peninsula, and the proximity to the sea of the Eastern and Western areas, and consequent historical immigration. We predicted haplogroups from Y-STR haplotypes, testing the performance of prediction by using a large independent set of Saudi Arabian Y-STR + Y-SNP data. Prediction indicated predominance (71%) of haplogroup J1, which was significantly more common in Central, Northern and Southern groups than in East and West, and formed a star-like expansion cluster in a median-joining network with an estimated age of ∼2800 years. Most of our 597 participants were sampled within Saudi Arabia itself, but ∼16% were sampled in the UK. Despite matching these two groups by home sub-region, we observed significant differences in haplotype and predicted haplogroup constitutions overall, and for most sub-regions individually. This suggests social structure influencing the probability of leaving Saudi Arabia, correlated with different Y-chromosome compositions. The UK-recruited sample is an inappropriate proxy for
Y-STR variation among Slavs: evidence for the Slavic homeland in the middle Dnieper basin.
Rebała, Krzysztof; Mikulich, Alexei I; Tsybovsky, Iosif S; Siváková, Daniela; Dzupinková, Zuzana; Szczerkowska-Dobosz, Aneta; Szczerkowska, Zofia
2007-01-01
A set of 18 Y-chromosomal microsatellite loci was analysed in 568 males from Poland, Slovakia and three regions of Belarus. The results were compared to data available for 2,937 Y chromosome samples from 20 other Slavic populations. Lack of relationship between linguistic, geographic and historical relations between Slavic populations and Y-short tandem repeat (STR) haplotype distribution was observed. Two genetically distant groups of Slavic populations were revealed: one encompassing all Western-Slavic, Eastern-Slavic, and two Southern-Slavic populations, and one encompassing all remaining Southern Slavs. An analysis of molecular variance (AMOVA) based on Y-chromosomal STRs showed that the variation observed between the two population groups was 4.3%, and was higher than the level of genetic variance among populations within the groups (1.2%). Homogeneity of northern Slavic paternal lineages in Europe was shown to stretch from the Alps to the upper Volga and involve ethnicities speaking completely different branches of Slavic languages. The central position of the population of Ukraine in the network of insignificant AMOVA comparisons, and the lack of traces of significant contribution of ancient tribes inhabiting present-day Poland to the gene pool of Eastern and Southern Slavs, support hypothesis placing the earliest known homeland of Slavs in the middle Dnieper basin.
Haplotype data for 23 Y-chromosome markers in four U.S. population groups.
Coble, Michael D; Hill, Carolyn R; Butler, John M
2013-05-01
The PowerPlex Y23 kit contains 23 Y-chromosomal loci including all 17 of the markers in the Yfiler Y-STR kit plus six additional markers: DYS481, DYS533, DYS549, DYS570, DYS576, and DYS643. We have typed 1032 unrelated population samples from four self-declared US groups: African Americans, Asians, Hispanics, and Western European Caucasians. An analysis of the population genetic parameters and the improvement of adding additional Y-STR markers to the dataset are described. Published by Elsevier Ireland Ltd.
Ambers, Angie; Votrubova, Jitka; Vanek, Daniel; Sajantila, Antti; Budowle, Bruce
2018-02-23
Bones are a valuable source of DNA in forensic, anthropological, and archaeological investigations. There are a number of scenarios in which the only samples available for testing are highly degraded and/or skeletonized. Often it is necessary to perform more than one type of marker analysis on such samples in order to compile sufficient data for identification. Lineage markers, such as Y-STRs and mitochondrial DNA (mtDNA), represent important systems to complement autosomal DNA markers and anthropological metadata in making associations between unidentified remains and living relatives or for characterization of the remains for historical and archaeological studies. In this comparative study, Y-STR typing with both Yfiler™ and Yfiler™ Plus (Thermo Fisher Scientific, Waltham, MA, USA) was performed on a variety of human skeletal remains, including samples from the American Civil War (1861-1865), the late nineteenth century gold rush era in Deadwood, SD, USA (1874-1877), the Seven Years' War (1756-1763), a seventeenth-century archaeological site in Raspenava, Bohemia (Czech Republic), and World War II (1939-1945). The skeletal remains used for this study were recovered from a wide range of environmental conditions and were extracted using several common methods. Regardless of the DNA extraction method used and the age/condition of the remains, 22 out of 24 bone samples yielded a greater number of alleles using the Yfiler™ Plus kit compared to the Yfiler™ kit using the same quantity of input DNA. There was no discernable correlation with the degradation index values for these samples. Overall, the efficacy of the Yfiler™ Plus assay was demonstrated on degraded DNA from skeletal remains. Yfiler™ Plus increases the discriminatory power over the previous generation multiplex due to the larger set of Y-STR markers available for analysis and buffer modifications with the newer version kit. Increased haplotype resolution is provided to infer or refute putative
Genetic portrait of Tamil non-tribal and Irula tribal population using Y chromosome STR markers.
Raghunath, Rajshree; Krishnamoorthy, Kamalakshi; Balasubramanian, Lakshmi; Kunka Mohanram, Ramkumar
2016-03-01
The 17 Y chromosomal short tandem repeat loci included in the AmpFlSTR® Yfiler™ PCR Amplification Kit were used to analyse the genetic diversity of 517 unrelated males representing the non-tribal and Irula tribal population of Tamil Nadu. A total of 392 unique haplotypes were identified among the 400 non-tribal samples whereas 111 were observed among the 117 Irula tribal samples. Rare alleles for the loci DYS458, DYS635 and YGATAH4.1 were also observed in both population. The haplotype diversity for the non-tribal and Irula tribal population were found to be 0.9999, and the gene diversity ranged from 0.2041 (DYS391) to 0.9612 (DYS385). Comparison of the test population with 26 national and global population using principal coordinate analysis (PCoA) and determination of the genetic distance matrix using phylogenetic molecular analysis indicate a clustering of the Tamil Nadu non-tribal and Irula tribal population away from other unrelated population and proximity towards some Indo-European (IE) and Asian population. Data are available in the Y chromosome haplotype reference database (YHRD) under accession number YA004055 for Tamil non-tribal and YA004056 for the Irula tribal group.
No shortcut solution to the problem of Y-STR match probability calculation.
Caliebe, Amke; Jochens, Arne; Willuweit, Sascha; Roewer, Lutz; Krawczak, Michael
2015-03-01
Match probability calculation is deemed much more intricate for lineage genetic markers, including Y-chromosomal short tandem repeats (Y-STRs), than for autosomal markers. This is because, owing to the lack of recombination, strong interdependence between markers is likely, which implies that haplotype frequency estimates cannot simply be obtained through the multiplication of allele frequency estimates. As yet, however, the practical relevance of this problem has not been studied in much detail using real data. In fact, such scrutiny appears well warranted because the high mutation rates of Y-STRs and the possibility of backward mutation should have worked against the statistical association of Y-STRs. We examined haplotype data of 21 markers included in the PowerPlex(®)Y23 set (PPY23, Promega Corporation, Madison, WI) originating from six different populations (four European and two Asian). Assessing the conditional entropies of the markers, given different subsets of markers from the same panel, we demonstrate that the PowerPlex(®)Y23 set cannot be decomposed into smaller marker subsets that would be (conditionally) independent. Nevertheless, in all six populations, >94% of the joint entropy of the 21 markers is explained by the seven most rapidly mutating markers. Although this result might render a reduction in marker number a sensible option for practical casework, the partial haplotypes would still be almost as diverse as the full haplotypes. Therefore, match probability calculation remains difficult and calls for the improvement of currently available methods of haplotype frequency estimation. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Y-STR INRA189 polymorphisms in Chinese yak breeds.
Ma, Z J; Chen, S M; Sun, Y G; Xi, Y L; Li, R Z; Xu, J T; Lei, C Z
2015-12-29
To further explore Y-STR INRA189 polymorphisms in the yak, and to determine the genetic differences among yak breeds, genotyping analysis of INRA189 in 102 male yak individuals from three yak breeds in Qinghai Province of China was performed. Genotyping revealed the presence of four alleles, with sizes of 149, 155, 157, and 159 bp, respectively. Of these, the 157-bp allele, which was found with the highest frequency in the three yak breeds, was the dominant allele. Interestingly, the 149-bp allele was only detected in the Gaoyuan breed, and the 159-bp allele was only found in the Huanhu and Datong breeds. Only the 157- and 155-bp alleles were found in all three yak breeds. Taking the three yak breeds as a single population, the frequency of these four alleles was 0.0294, 0.0686, 0.8628, and 0.0392, respectively. The average polymorphism information content in the three yak breeds was 0.2379, indicating that the INRA189 was a low polymorphic Y-STR marker in yak.
Developmental Validation of a novel 5 dye Y-STR System comprising the 27 YfilerPlus loci
Bai, Rufeng; Liu, Yaju; Li, Zheng; Jin, Haiying; Tian, Qinghua; Shi, Meisen; Ma, Shuhua
2016-01-01
In this study, a new STRtyper-27 system, including the same Yfiler Plus loci (DYS19, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS385a/b, DYS437, DYS438, DYS439, DYS448, DYS456, DYS458, DYS635, Y-GATA H4, DYS449, DYS460, DYS481, DYS518, DYS533, DYS570, DYS576, DYS627 and DYF387S1a/b), was established using a set of 5 fluorescent dye labels. Primers, internal size standard, allelic ladders and matrix standard set were designed and created in-house for this multiplex system. This paper describes the validation studies conducted with the STRtyper-27Y system using a 3130XL genetic analyzer for fragment length detection that included the analysis of the following parameters and aspects: sensitivity, species specificity, inhibition, haplotype concordance, precision, stutter, DNA mixtures, and stability studies with crime scene samples. The studies demonstrated, that the STRtyper-27Y system provided equivalent overall performance comparable to the latest Yfiler Plus kit, but with enhanced compatibility in terms of instrument platforms and software allowing forensic laboratories to conduct its forensic application and evaluate its performance, all in their own 5 dye Y-STR chemistry system /environment without software or instrument upgrades. PMID:27406339
Direct Y-STR amplification of body fluids deposited on commonly found crime scene substrates.
Dargay, Amanda; Roy, Reena
2016-04-01
Body fluids detected on commonly found crime scene substrates require extraction, purification and quantitation of DNA prior to amplification and generation of short tandem repeat (STR) DNA profiles. In this research Y-STR profiles were generated via direct amplification of blood and saliva deposited on 12 different substrates. These included cigarette butts, straws, grass, leaves, woodchips and seven different types of fabric. After depositing either 0.1 μL of blood or 0.5 μL of saliva, each substrate containing the dry body fluid stain was punched using a Harris 1.2 mm micro-punch. Each of these punched substrates, a total of 720 samples, containing minute amount of blood or saliva was either amplified directly without any pre-treatment, or was treated with one of the four washing reagents or buffer. In each of these five experimental groups the substrates containing the body fluid remained in the amplification reagent during the thermal cycling process. Each sample was amplified with the three direct Y-STR amplification kits; AmpFℓSTR(®) Yfiler(®) Direct, Yfiler(®) Plus Amplification Kits and the PowerPlex(®) Y23 System. Complete and concordant Y-STR profiles were successfully obtained from most of these 12 challenging crime scene objects when the stains were analyzed by at least one of the five experimental groups. The reagents and buffer were interchangeable among the three amplification kits, however, pre-treatment with these solutions did not appear to enhance the quality or the number of the full profiles generated with direct amplification. This study demonstrates that blood and saliva deposited on these simulated crime scene objects can be amplified directly. Copyright © 2016 Elsevier Ltd and Faculty of Forensic and Legal Medicine. All rights reserved.
Purps, Josephine; Geppert, Maria; Nagy, Marion; Roewer, Lutz
2015-11-01
DNA testing is an established part of the investigation and prosecution of sexual assault. The primary purpose of DNA evidence is to identify a suspect and/or to demonstrate sexual contact. However, due to highly uneven proportions of female and male DNA in typical stains, routine autosomal analysis often fails to detect the DNA of the assailant. To evaluate the forensic efficiency of the combined application of autosomal and Y-chromosomal short tandem repeat (STR) markers, we present a large retrospective casework study of probative evidence collected in sexual-assault cases. We investigated up to 39 STR markers by testing combinations of the 16-locus NGMSElect kit with both the 23-locus PowerPlex Y23 and the 17-locus Yfiler kit. Using this dual approach we analyzed DNA extracts from 2077 biological stains collected in 287 cases over 30 months. To assess the outcome of the combined approach in comparison to stand-alone autosomal analysis we evaluated informative DNA profiles. Our investigation revealed that Y-STR analysis added up to 21% additional, highly informative (complete, single-source) profiles to the set of reportable autosomal STR profiles for typical stains collected in sexual-assault cases. Detection of multiple male contributors was approximately three times more likely with Y-chromosomal profiling than with autosomal STR profiling. In summary, 1/10 cases would have remained inconclusive (and could have been dismissed) if Y-STR analysis had been omitted from DNA profiling in sexual-assault cases. Copyright © 2015 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.
The association of 22 Y chromosome short tandem repeat loci with initiative-aggressive behavior.
Yang, Chun; Ba, Huajie; Zhang, Wei; Zhang, Shuyou; Zhao, Hanqing; Yu, Haiying; Gao, Zhiqin; Wang, Binbin
2018-05-15
Aggressive behavior represents an important public concern and a clinical challenge to behaviorists and psychiatrists. Aggression in humans is known to have an important genetic basis, so to investigate the association of Y chromosome short tandem repeat (Y-STR) loci with initiative-aggressive behavior, we compared allelic and haplotypic distributions of 22 Y-STRs in a group of Chinese males convicted of premeditated extremely violent crimes (n = 271) with a normal control group (n = 492). Allelic distributions of DYS533 and DYS437 loci differed significantly between the two groups (P < 0.05). The case group had higher frequencies of DYS533 allele 14, DYS437 allele 14, and haplotypes 11-14 of DYS533-DYS437 compared with the control group. Additionally, the DYS437 allele 15 frequency was significantly lower in cases than controls. No frequency differences were observed in the other 20 Y-STR loci between these two groups. Our results indicate a genetic role for Y-STR loci in the development of initiative aggression in non-psychiatric subjects. Copyright © 2018 Elsevier B.V. All rights reserved.
Thai, Quan Ke; Chung, Dung Anh; Tran, Hoang-Dung
2017-06-26
Canine and wolf mitochondrial DNA haplotypes, which can be used for forensic or phylogenetic analyses, have been defined in various schemes depending on the region analyzed. In recent studies, the 582 bp fragment of the HV1 region is most commonly used. 317 different canine HV1 haplotypes have been reported in the rapidly growing public database GenBank. These reported haplotypes contain several inconsistencies in their haplotype information. To overcome this issue, we have developed a Canis mtDNA HV1 database. This database collects data on the HV1 582 bp region in dog mitochondrial DNA from the GenBank to screen and correct the inconsistencies. It also supports users in detection of new novel mutation profiles and assignment of new haplotypes. The Canis mtDNA HV1 database (CHD) contains 5567 nucleotide entries originating from 15 subspecies in the species Canis lupus. Of these entries, 3646 were haplotypes and grouped into 804 distinct sequences. 319 sequences were recognized as previously assigned haplotypes, while the remaining 485 sequences had new mutation profiles and were marked as new haplotype candidates awaiting further analysis for haplotype assignment. Of the 3646 nucleotide entries, only 414 were annotated with correct haplotype information, while 3232 had insufficient or lacked haplotype information and were corrected or modified before storing in the CHD. The CHD can be accessed at http://chd.vnbiology.com . It provides sequences, haplotype information, and a web-based tool for mtDNA HV1 haplotyping. The CHD is updated monthly and supplies all data for download. The Canis mtDNA HV1 database contains information about canine mitochondrial DNA HV1 sequences with reconciled annotation. It serves as a tool for detection of inconsistencies in GenBank and helps identifying new HV1 haplotypes. Thus, it supports the scientific community in naming new HV1 haplotypes and to reconcile existing annotation of HV1 582 bp sequences.
Assessment of a subset of Slowly Mutating Y-STRs for forensic and evolutionary studies.
Baeta, Miriam; Núñez, Carolina; Villaescusa, Patricia; Ortueta, Urko; Ibarbia, Nerea; Herrera, Rene J; Blazquez-Caeiro, José Luis; Builes, Juan José; Jiménez-Moreno, Susana; Martínez-Jarreta, Begoña; de Pancorbo, Marian M
2018-05-01
Y-specific short tandem repeat (Y-STR) loci display different mutation rates and consequently are suitable for forensic, genealogical, and evolutionary studies that require different levels of timelines and resolution. Recent efforts have focused on implementing Rapidly Mutating (RM) Y-STRs to assess male specific profiles. However, due to their high mutation rate their use in kinship testing or in phylogenetic studies may be less reliable. In the present study, a novel Slowly Mutating Y-STR (SM) panel, including DYS388, DYS426, DYS461 (Y-GATA-A7.2), DYS485, DYS525, and DYS561, has been developed and evaluated in a sample set of 628 unrelated males from different worldwide populations. This panel is reproducible, sensitive, and robust for forensic applications and may be useful in conjunction with the common multiplexes, particularly in exclusion of kinship cases where minimal discrimination is reported employing the rapidly mutating Y-STR systems. Furthermore, SM Y-STR data may be of value in evolutionary studies to optimize the resolution of phylogenetic relationships generated with current Y-STR panel sets. In this study, we provide an extensive Y-STR allele and haplotype reference dataset for future applications. Copyright © 2018 Elsevier B.V. All rights reserved.
Differential distribution of Y-chromosome haplotypes in Swiss and Southern European goat breeds.
Vidal, Oriol; Drögemüller, Cord; Obexer-Ruff, Gabriela; Reber, Irene; Jordana, Jordi; Martínez, Amparo; Bâlteanu, Valentin Adrian; Delgado, Juan Vicente; Eghbalsaied, Shahin; Landi, Vincenzo; Goyache, Felix; Traoré, Amadou; Pazzola, Michele; Vacca, Giuseppe Massimo; Badaoui, Bouabid; Pilla, Fabio; D'Andrea, Mariasilvia; Álvarez, Isabel; Capote, Juan; Sharaf, Abdoallah; Pons, Àgueda; Amills, Marcel
2017-11-23
The analysis of Y-chromosome variation has provided valuable clues about the paternal history of domestic animal populations. The main goal of the current work was to characterize Y-chromosome diversity in 31 goat populations from Central Eastern (Switzerland and Romania) and Southern Europe (Spain and Italy) as well as in reference populations from Africa and the Near East. Towards this end, we have genotyped seven single nucleotide polymorphisms (SNPs), mapping to the SRY, ZFY, AMELY and DDX3Y Y-linked loci, in 275 bucks from 31 populations. We have observed a low level of variability in the goat Y-chromosome, with just five haplotypes segregating in the whole set of populations. We have also found that Swiss bucks carry exclusively Y1 haplotypes (Y1A: 24%, Y1B1: 15%, Y1B2: 43% and Y1C: 18%), while in Italian and Spanish bucks Y2A is the most abundant haplotype (77%). Interestingly, in Carpathian goats from Romania the Y2A haplotype is also frequent (42%). The high Y-chromosome differentiation between Swiss and Italian/Spanish breeds might be due to the post-domestication spread of two different Near Eastern genetic stocks through the Danubian and Mediterranean corridors. Historical gene flow between Southern European and Northern African goats might have also contributed to generate such pattern of genetic differentiation.
Hussing, C; Bytyci, R; Huber, C; Morling, N; Børsting, C
2018-05-24
Some STR loci have internal sequence variations, which are not revealed by the standard STR typing methods used in forensic genetics (PCR and fragment length analysis by capillary electrophoresis (CE)). Typing of STRs with next-generation sequencing (NGS) uncovers the sequence variation in the repeat region and in the flanking regions. In this study, 363 Danish individuals were typed for 56 STRs (26 autosomal STRs, 24 Y-STRs, and 6 X-STRs) using the ForenSeq™ DNA Signature Prep Kit to establish a Danish STR sequence database. Increased allelic diversity was observed in 34 STRs by the PCR-NGS assay. The largest increases were found in DYS389II and D12S391, where the numbers of sequenced alleles were around four times larger than the numbers of alleles determined by repeat length alone. Thirteen SNPs and one InDel were identified in the flanking regions of 12 STRs. Furthermore, 36 single positions and five longer stretches in the STR flanking regions were found to have dubious genotyping quality. The combined match probability of the 26 autosomal STRs was 10,000 times larger using the PCR-NGS assay than by using PCR-CE. The typical paternity indices for trios and duos were 500 and 100 times larger, respectively, than those obtained with PCR-CE. The assay also amplified 94 SNPs selected for human identification. Eleven of these loci were not in Hardy-Weinberg equilibrium in the Danish population, most likely because the minimum threshold for allele calling (30 reads) in the ForenSeq™ Universal Analysis Software was too low and frequent allele dropouts were not detected.
Hanson, Erin K; Ballantyne, Jack
2016-01-01
In some cases of sexual assault the victim may not report the assault for several days after the incident due to various factors. The ability to obtain an autosomal STR profile of the semen donor from a living victim rapidly diminishes as the post-coital interval is extended due to the presence of only a small amount of male DNA amidst an overwhelming amount of female DNA. Previously, we have utilized various technological tools to overcome the limitations of male DNA profiling in extended interval post-coital samples including the use of Y-chromosome STR profiling, cervical sample, and post-PCR purification permitting the recovery of Y-STR profiles of the male DNA from samples collected 5-6 days after intercourse. Despite this success, the reproductive biology literature reports the presence of spermatozoa in the human cervix up to 7-10 days post-coitus. Therefore, novel and improved methods for recovery of male profiles in extended interval post-coital samples were required. Here, we describe enhanced strategies, including Y-chromosome-targeted pre-amplification and next generation Y-STR amplification kits, that have resulted in the ability to obtain probative male profiles from samples collected 6-9 days after intercourse.
Adnan, Atif; Ralf, Arwin; Rakha, Allah; Kousouri, Nefeli; Kayser, Manfred
2016-11-01
Y-chromosomal short tandem repeat (Y-STR) markers are commonly used in forensic genetics. Male-specific haplotypes provided by commercial Y-STR kits allow discriminating between many - but not all - unrelated men, while they mostly fail to separate related ones. Aiming to improve male relative and paternal lineage differentiation, a set of 13 rapidly-mutating (RM) Y-STRs was previously identified and introduced to forensic Y-chromosome analysis. Recently, their value was highlighted by separating 99% of over 12,200 unrelated men from 111 global populations, as well as 29% of over 2500 male relative pairs, the vast majority were father-sons. Here, we provide improved empirical evidence on differentiating closely related men with RM Y-STRs, most notably beyond father-sons, where previous data were limited. After careful quality control including genetic relationship testing, we used 572 Pakistani men belonging to 99 2-4 generation pedigrees covering 1568 pairs of men related by 1-6 meioses. Of those, 45% were differentiated by one or more of the 13 RM Y-STR markers. In contrast, only 14.7% of a subset of 1484 pairs from 94 pedigrees were separated by the commercial AmpFlSTR Y-filer kit. Combining previously published and new data, an overall differentiation rate of 35.3% was revealed for the RM Y-STR set based on 4096 pairs of men related by 1-20 meioses, compared to 9.6% with Y-filer based on 3645 pairs. Using father-son pair data from the present and previous studies, we provide updated RM Y-STR mutation rates. Locus-specific mutation rates ranged from 2.0×10 -3 (7.0×10 -4 -4.3×10 -3 ) to 6.9×10 -2 (6.1×10 -2 -7.9×10 -2 ) based on 2741-3143 meioses, with an average rate across all 13 RM Y-STR markers of 1.8×10 -2 (1.7×10 -2 -1.9×10 -2 ) based on 800 mutations from 44,922 meioses. The high haplotype diversity (h=0.9996) we observed among the unrelated men (N=105) underlines the value of this RM Y-STR set to differentiate paternal lineages even from
Dogan, Serkan; Primorac, Dragan; Marjanović, Damir
2014-01-01
Aim To explore the distribution and polymorphisms of 23 short tandem repeat (STR) loci on the Y chromosome in the Turkish population recently settled in Sarajevo, Bosnia and Herzegovina and to investigate its genetic relationships with the homeland Turkish population and neighboring populations. Methods This study included 100 healthy unrelated male individuals from the Turkish population living in Sarajevo. Buccal swab samples were collected as a DNA source. Genomic DNA was extracted using the salting out method and amplification was performed using PowerPlex Y 23 amplification kit. The studied population was compared to other populations using pairwise genetic distances, which were represented with a multi-dimensional scaling plot. Results Haplotype and allele frequencies of the sample population were calculated and the results showed that all 100 samples had unique haplotypes. The most polymorphic locus was DYS458, and the least polymorphic DYS391. The observed haplotype diversity was 1.0000 ± 0.0014, with a discrimination capacity of 1.00 and the match probability of 0.01. Rst values showed that our sample population was closely related in both dimensions to the Lebanese and Iraqi populations, while it was more distant from Bosnian, Croatian, and Macedonian populations. Conclusion Turkish population residing in Sarajevo could be observed as a representative Turkish population, since our results were consistent with those previously published for the homeland Turkish population. Also, this study once again proved that geographically close populations were genetically more related to each other. PMID:25358886
Dogan, Serkan; Primorac, Dragan; Marjanović, Damir
2014-10-01
To explore the distribution and polymorphisms of 23 short tandem repeat (STR) loci on the Y chromosome in the Turkish population recently settled in Sarajevo, Bosnia and Herzegovina and to investigate its genetic relationships with the homeland Turkish population and neighboring populations. This study included 100 healthy unrelated male individuals from the Turkish population living in Sarajevo. Buccal swab samples were collected as a DNA source. Genomic DNA was extracted using the salting out method and amplification was performed using PowerPlex Y 23 amplification kit. The studied population was compared to other populations using pairwise genetic distances, which were represented with a multi-dimensional scaling plot. Haplotype and allele frequencies of the sample population were calculated and the results showed that all 100 samples had unique haplotypes. The most polymorphic locus was DYS458, and the least polymorphic DYS391. The observed haplotype diversity was 1.0000 ± 0.0014, with a discrimination capacity of 1.00 and the match probability of 0.01. Rst values showed that our sample population was closely related in both dimensions to the Lebanese and Iraqi populations, while it was more distant from Bosnian, Croatian, and Macedonian populations. Turkish population residing in Sarajevo could be observed as a representative Turkish population, since our results were consistent with those previously published for the homeland Turkish population. Also, this study once again proved that geographically close populations were genetically more related to each other.
Zhivotovsky, Lev A; Malyarchuk, Boris A; Derenko, Miroslava V; Wozniak, Marcin; Grzybowski, Tomasz
2009-09-01
Developing a forensic DNA database on a population that consists of local ethnic groups separated by physical and cultural barriers is questionable as it can be genetically subdivided. On the other side, small sizes of ethnic groups, especially in alpine regions where they are sub-structured further into small villages, prevent collecting a large sample from each ethnic group. For such situations, we suggest to obtain both a total population database on allele frequencies across ethnic groups and a list of theta-values between the groups and the total data. We have genotyped 558 individuals from the native population of South Siberia, consisting of nine ethnic groups, at 17 autosomal STR loci of the kit packages AmpFlSTR SGM Plus i, Cyrillic AmpFlSTR Profiler Plus. The groups differentiate from each other with average theta-values of around 1.1%, and some reach up to three to four percent at certain loci. There exists between-village differentiation as well. Therefore, a database for the population of South Siberia is composed of data on allele frequencies in the pool of ethnic groups and data on theta-values that indicate variation in allele frequencies across the groups. Comparison to additional data on northeastern Asia (the Chukchi and Koryak) shows that differentiation in allele frequencies among small groups that are separated by large geographic distance can be even greater. In contrast, populations of Russians that live in large cities of the European part of Russia are homogeneous in allele frequencies, despite large geographic distance between them, and thus can be described by a database on allele frequencies alone, without any specific information on theta-values.
Gómez, Alberto; Avila, Sandra J; Briceño, Ignacio
2008-09-01
In Colombia, surnames are characters usually passed to the children by the father, and they have been compared to neutral alleles associated with the Y-chromosome. Population frequencies were determined for 17 short tandem repeats (STR) DNA markers on the Y-chromosome to compare the two identity codes and define the correlation between haplotypes and surnames in each individual. DNA was extracted from blood samples from 308 male individuals in provinces of Valle del Cauca, Cauca and Nariño, all in southwestern Colombia. Sample DNA was analyzed with the commercial kit AmpFLSTR Yfiler (Applied Biosystems) and examined for the following 17 Y-chromosome STR markers: DYS19, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS385a/b, DYS437, DYS438, DYS439, DYS448, DYS456, DYS458, DYS635 and Y-GATA-H4. The frequencies of molecular haplotypes were associated with the surname reported by each individual, and a correlation table was constructed. Amerindian and European surnames were associated with the presence of allele DYS19/13, a characteristic of Amerindian populations. Allele frequencies were reported for each of the 17 STR markers in the southwestern region of Colombia-high genetic and haplotypic diversities were obtained. Approximately 40% of lineage inconsistencies were found when the molecular genotype was compared with the European or Amerindian surnames. Surnames must be used as population markers with reservation. The genetic evidence indicates that traditional genealogies based on surnames with or without documental support, may be inconsistant with their biological provenance.
Inferring population structure and demographic history using Y-STR data from worldwide populations.
Xu, Hongyang; Wang, Chuan-Chao; Shrestha, Rukesh; Wang, Ling-Xiang; Zhang, Manfei; He, Yungang; Kidd, Judith R; Kidd, Kenneth K; Jin, Li; Li, Hui
2015-02-01
The Y chromosome is one of the best genetic materials to explore the evolutionary history of human populations. Global analyses of Y chromosomal short tandem repeats (STRs) data can reveal very interesting world population structures and histories. However, previous Y-STR works tended to focus on small geographical ranges or only included limited sample sizes. In this study, we have investigated population structure and demographic history using 17 Y chromosomal STRs data of 979 males from 44 worldwide populations. The largest genetic distances have been observed between pairs of African and non-African populations. American populations with the lowest genetic diversities also showed large genetic distances and coancestry coefficients with other populations, whereas Eurasian populations displayed close genetic affinities. African populations tend to have the oldest time to the most recent common ancestors (TMRCAs), the largest effective population sizes and the earliest expansion times, whereas the American, Siberian, Melanesian, and isolated Atayal populations have the most recent TMRCAs and expansion times, and the smallest effective population sizes. This clear geographic pattern is well consistent with serial founder model for the origin of populations outside Africa. The Y-STR dataset presented here provides the most detailed view of worldwide population structure and human male demographic history, and additionally will be of great benefit to future forensic applications and population genetic studies.
Constructing STR multiplex assays.
Butler, John M
2005-01-01
Multiplex polymerase chain reaction (PCR) refers to the simultaneous amplification of multiple regions of deoxyribonucleic acid (DNA) using PCR. Commercial short tandem repeat (STR) assays that can coamplify as many as 16 different loci have become widely used in forensic DNA typing. This chapter will focus on some of the aspects of constructing robust STR multiplex assays, including careful design and quality control of PCR primers. Examples from the development of a cat STR 12plex and a human Y chromosome STR 20plex are used to illustrate the importance of various parts of the protocol. Primer design parameters and Internet-accessible resources are discussed, as are solutions to problems with residual dye artifacts that result from impure primers.
Rapid microfluidic analysis of a Y-STR multiplex for screening of forensic samples.
Gibson-Daw, Georgiana; Albani, Patricia; Gassmann, Marcus; McCord, Bruce
2017-02-01
In this paper, we demonstrate a rapid analysis procedure for use with a small set of rapidly mutating Y chromosomal short tandem repeat (Y-STR) loci that combines both rapid polymerase chain reaction (PCR) and microfluidic separation elements. The procedure involves a high-speed polymerase and a rapid cycling protocol to permit PCR amplification in 16 min. The resultant amplified sample is next analysed using a short 1.8-cm microfluidic electrophoresis system that permits a four-locus Y-STR genotype to be produced in 80 s. The entire procedure takes less than 25 min from sample collection to result. This paper describes the rapid amplification protocol as well as studies of the reproducibility and sensitivity of the procedure and its optimisation. The amplification process utilises a small high-speed thermocycler, microfluidic device and compact laptop, making it portable and potentially useful for rapid, inexpensive on-site genotyping. The four loci used for the multiplex were selected due to their rapid mutation rates and should proved useful in preliminary screening of samples and suspects. Overall, this technique provides a method for rapid sample screening of suspect and crime scene samples in forensic casework. Graphical abstract ᅟ.
Comparison of Y-STR polymorphisms in three different Slovak population groups.
Petrejcíková, Eva; Siváková, Daniela; Soták, Miroslav; Bernasovská, Jarmila; Bernasovský, Ivan; Rebała, Krzysztof; Boronová, Iveta; Bôziková, Alexandra; Sovicová, Adriana; Gabriková, Dana; Maceková, Sona; Svícková, Petra; Carnogurská, Jana
2010-01-01
Eleven Y-chromosomal microsatellite loci included in the Powerplex Y multiplex kit were analyzed in different Slovak population samples: Habans (n = 39), Romanies (n = 100) and Slovak Caucasian (n = 148) individuals, respectively, from different regions of Slovakia. The analysis of molecular variance between populations indicated that 89.27% of the haplotypic variations were found within populations and only 10.72% between populations (Fst = 0.1027; p = 0.0000). The haplotype diversities were ranging from 0.9258 to 0.9978, and indicated a high potential for differentiating between male individuals. The study reports differences in allele frequencies between the Romanies, Habans and Slovak Caucasian men. Selected loci showed that both the Romany and Haban population belonged to endogamous and relatively small founder population groups, which developed in relatively reproductive isolated groups surrounded by the Slovak Caucasian population.
Globally dispersed Y chromosomal haplotypes in wild and domestic sheep.
Meadows, J R S; Hanotte, O; Drögemüller, C; Calvo, J; Godfrey, R; Coltman, D; Maddox, J F; Marzanov, N; Kantanen, J; Kijas, J W
2006-10-01
To date, investigations of genetic diversity and the origins of domestication in sheep have utilised autosomal microsatellites and variation in the mitochondrial genome. We present the first analysis of both domestic and wild sheep using genetic markers residing on the ovine Y chromosome. Analysis of a single nucleotide polymorphism (oY1) in the SRY promoter region revealed that allele A-oY1 was present in all wild bighorn sheep (Ovis canadensis), two subspecies of thinhorn sheep (Ovis dalli), European Mouflon (Ovis musimon) and the Barbary (Ammontragis lervia). A-oY1 also had the highest frequency (71.4%) within 458 domestic sheep drawn from 65 breeds sampled from Africa, Asia, Australia, the Caribbean, Europe, the Middle East and Central Asia. Sequence analysis of a second locus, microsatellite SRYM18, revealed a compound repeat array displaying fixed differences, which identified bighorn and thinhorn sheep as distinct from the European Mouflon and domestic animals. Combined genotypic data identified 11 male-specific haplotypes that represented at least two separate lineages. Investigation of the geographical distribution of each haplotype revealed that one (H6) was both very common and widespread in the global sample of domestic breeds. The remaining haplotypes each displayed more restricted and informative distributions. For example, H5 was likely founded following the domestication of European breeds and was used to trace the recent transportation of animals to both the Caribbean and Australia. A high rate of Y chromosomal dispersal appears to have taken place during the development of domestic sheep as only 12.9% of the total observed variation was partitioned between major geographical regions.
Estimating haplotype frequencies by combining data from large DNA pools with database information.
Gasbarra, Dario; Kulathinal, Sangita; Pirinen, Matti; Sillanpää, Mikko J
2011-01-01
We assume that allele frequency data have been extracted from several large DNA pools, each containing genetic material of up to hundreds of sampled individuals. Our goal is to estimate the haplotype frequencies among the sampled individuals by combining the pooled allele frequency data with prior knowledge about the set of possible haplotypes. Such prior information can be obtained, for example, from a database such as HapMap. We present a Bayesian haplotyping method for pooled DNA based on a continuous approximation of the multinomial distribution. The proposed method is applicable when the sizes of the DNA pools and/or the number of considered loci exceed the limits of several earlier methods. In the example analyses, the proposed model clearly outperforms a deterministic greedy algorithm on real data from the HapMap database. With a small number of loci, the performance of the proposed method is similar to that of an EM-algorithm, which uses a multinormal approximation for the pooled allele frequencies, but which does not utilize prior information about the haplotypes. The method has been implemented using Matlab and the code is available upon request from the authors.
Woźniak, Marcin; Grzybowski, Tomasz; Starzyński, Jarosław; Marciniak, Tomasz
2007-06-01
The Polish population is reported to be very homogenous as far as Y chromosome polymorphism is concerned. One of the hypotheses that explains this phenomenon is based on the assumption that massive migrations that took place in Poland after the Second World War might have evoked such an effect. Thus, knowledge of the pre-war frequencies of Y chromosome haplotypes in different parts of the country would be a useful tool in testing such a hypothesis. We have collected 226 DNA samples, together with family history data, from males living in the rural area of Małopolska, Polish Southern border region. Based on donors' family histories we were able to reconstruct an 'ancestral' subpopulation of 108 males whose ancestors had inhabited the area before both World Wars. We have analyzed 12 Y-STR loci: DYS19, DYS385, DYS389I&II, DYS390, DYS391, DYS392, DYS393, DYS437, DYS438 and DYS439 in all the collected samples. Comparisons of our contemporary and 'ancestral' population samples with other Polish and Central European populations showed that the population of Southern Małopolska is very closely related to other Polish and Slavic populations. The above-mentioned observations suggest that the population of Southern Poland could have been highly homogenous even before the Second World War.
Mayans: a Y chromosome perspective
Perez-Benedico, David; La Salvia, Joel; Zeng, Zhaoshu; Herrera, Giselle A; Garcia-Bertrand, Ralph; Herrera, Rene J
2016-01-01
In spite of the wealth of available cultural and archeological information as well as general interest in the Mayans, little is known about their genetics. In this study, for the first time, we attempt to alleviate this lacuna of knowledge by comprehensively investigating the Y chromosome composition of contemporary Mayan populations throughout their domain. To accomplish this, five geographically targeted and ethnically distinct Mayan populations are investigated using Y-SNP and Y-STR markers. Findings: overall, the Mayan populations as a group are highly homogeneous, basically made up of only two autochthonous haplogroups, Q1a2a1a1*-M3 and Q1a2a1*-L54. Although the Y-STR data illustrates diversity, this diversity, for the most part, is uniformly distributed among geographically distant Mayan populations. Similar haplotypes among populations, abundance of singletons and absence of population partitioning within networks among Mayan populations suggest recent population expansion and substantial gene flow within the Mayan dominion, possibly due to the development of agriculture, the establishment of interacting City–State systems and commerce. PMID:26956252
Y chromosome haplotype diversity of domestic sheep (Ovis aries) in northern Eurasia.
Zhang, Min; Peng, Wei-Feng; Yang, Guang-Li; Lv, Feng-Hua; Liu, Ming-Jun; Li, Wen-Rong; Liu, Yong-Gang; Li, Jin-Quan; Wang, Feng; Shen, Zhi-Qiang; Zhao, Sheng-Guo; Hehua, Eer; Marzanov, Nurbiy; Murawski, Maziek; Kantanen, Juha; Li, Meng-Hua
2014-12-01
Variation in two SNPs and one microsatellite on the Y chromosome was analyzed in a total of 663 rams representing 59 breeds from a large geographic range in northern Eurasia. SNPA-oY1 showed the highest allele frequency (91.55%) across the breeds, whereas SNPG-oY1 was present in only 56 samples. Combined genotypes established seven haplotypes (H4, H5, H6, H7, H8, H12 and H19). H6 dominated in northern Eurasia, and H8 showed the second-highest frequency. H4, which had been earlier reported to be absent in European breeds, was detected in one European breed (Swiniarka), whereas H7, which had been previously identified to be unique to European breeds, was present in two Chinese breeds (Ninglang Black and Large-tailed Han), one Buryatian (Transbaikal Finewool) and two Russian breeds (North Caucasus Mutton-Wool and Kuibyshev). H12, which had been detected only in Turkish breeds, was also found in Chinese breeds in this work. An overall low level of haplotype diversity (median h = 0.1288) was observed across the breeds with relatively higher median values in breeds from the regions neighboring the Near Eastern domestication center of sheep. H6 is the dominant haplotype in northwestern and eastern China, in which the haplotype distribution could be explained by the historical translocations of the H4 and H8 Y chromosomes to China via the Mongol invasions followed by expansions to northwestern and eastern China. Our findings extend previous results of sheep Y chromosomal genetic variability and indicate probably recent paternal gene flows between sheep breeds from distinct major geographic regions. © 2014 Stichting International Foundation for Animal Genetics.
Musanovic, Jasmin; Filipovska-Musanovic, Marijana; Kovacevic, Lejla; Buljugic, Dzenisa; Dzehverovic, Mirela; Avdic, Jasna; Marjanovic, Damir
2012-05-01
In our previous population studies of Bosnia and Herzegovina human population, we have used autosomal STR, Y-STR, and X-STR loci, as well as Y-chromosome NRY biallelic markers. All obtained results were included in Bosnian referent database. In order of future development of applied population molecular genetics researches of Bosnia and Herzegovina human population, we have examined the effectiveness of 15 STR loci system in determination of sibship by using 15 STR loci and calculating different cut-off points of combined sibship indices (CSI) and distribution of sharing alleles. From the perspective of its application, it is very difficult and complicated to establish strict CSI cut-off values for determination of the doubtless sibship. High statistically significant difference between the means of CSI values and in distribution of alleles sharing in siblings and non-siblings was noticed (P < 0.0001). After constructing the "gray zone", only one false positive result was found in three CSI cut-off levels with the highest percent of determined sibship/non-sibship at the CSI = 0.067, confirming its practical benefit. Concerning the distribution of sharing alleles, it is recommended as an informative estimator for its usage within Bosnia and Herzegovina human population.
Genetic analysis of 19 X chromosome STR loci for forensic purposes in four Chinese ethnic groups
Yang, Xingyi; Zhang, Xiaofang; Zhu, Junyong; Chen, Linli; Liu, Changhui; Feng, Xingling; Chen, Ling; Wang, Huijun; Liu, Chao
2017-01-01
A new 19 X- short tandem repeat (STR) multiplex PCR system has recently been developed, though its applicability in forensic studies has not been thoroughly assessed. In this study, 932 unrelated individuals from four Chinese ethnic groups (Han, Tibet, Uighur and Hui) were successfully genotyped using this new multiplex PCR system. Our results showed significant linkage disequilibrium between markers DXS10103 and DXS10101 in all four ethnic groups; markers DXS10159 and DXS10162, DXS6809 and DXS6789, and HPRTB and DXS10101 in Tibetan populations; and markers DXS10074 and DXS10075 in Uighur populations. The combined powers of discrimination in males and females were calculated according to haplotype frequencies from allele distributions rather than haplotype counts in the relevant population and were high in four ethnic groups. The cumulative powers of discrimination of the tested X-STR loci were 1.000000000000000 and 0.999999999997940 in females and males, respectively. All 19 X-STR loci are highly polymorphic. The highest Reynolds genetic distances were observed for the Tibet-Uighur pairwise comparisons. This study represents an extensive report on X-STR marker variation in minor Chinese populations and a comprehensive analysis of the diversity of these 19 X STR markers in four Chinese ethnic groups. PMID:28211539
Y-Chromosome Haplogroups in the Bosnian-Herzegovinian Population Based on 23 Y-STR Loci.
Doğan, Serkan; Ašić, Adna; Doğan, Gulsen; Besic, Larisa; Marjanovic, Damir
2016-07-01
In a study of the Bosnian-Herzegovinian (B&H) population, Y-chromosome marker frequencies for 100 individuals, generated using the PowerPlex Y23 kit, were used to perform Y-chromosome haplogroup assignment via Whit Athey's Haplogroup Predictor. This algorithm determines Y-chromosome haplogroups from Y-chromosome short tandem repeat (Y-STR) data using a Bayesian probability-based approach. The most frequent haplogroup appeared to be I2a, with a prevalence of 49%, followed by R1a and E1b1b, each accounting for 17% of all haplogroups within the population. Remaining haplogroups were J2a (5%), I1 (4%), R1b (4%), J2b (2%), G2a (1%), and N (1%). These results confirm previously published preliminary B&H population data published over 10 years ago, especially the prediction about the B&H population being a part of the Western Balkan area, which served as the Last Glacial Maximum refuge for the Paleolithic human European population. Furthermore, the results corroborate the hypothesis that this area was a significant stopping point on the "Middle East-Europe highway" during the Neolithic farmer migrations. Finally, since these results are almost completely in accordance with previously published data on B&H and neighboring populations generated by Y-chromosome single nucleotide polymorphism analysis, it can be concluded that in silico analysis of Y-STRs is a reliable method for approximation of the Y-chromosome haplogroup diversity of an examined population.
Dogan, S; Babic, N; Gurkan, C; Goksu, A; Marjanovic, D; Hadziavdic, V
2016-12-01
Y-chromosomal haplogroups are sets of ancestrally related paternal lineages, traditionally assigned by the use of Y-chromosomal single nucleotide polymorphism (Y-SNP) markers. An increasingly popular and a less labor-intensive alternative approach has been Y-chromosomal haplogroup assignment based on already available Y-STR data using a variety of different algorithms. In the present study, such in silico haplogroup assignments were made based on 23-loci Y-STR data for 100 unrelated male individuals from the Tuzla Canton, Bosnia and Herzegovina (B&H) using the following four different algorithms: Whit Athey's Haplogroup Predictor, Jim Cullen's World Haplogroup & Haplogroup-I Subclade Predictor, Vadim Urasin's YPredictor and the NevGen Y-DNA Haplogroup Predictor. Prior in-house assessment of these four different algorithms using a previously published dataset (n=132) from B&H with both Y-STR (12-loci) and Y-SNP data suggested haplogroup misassignment rates between 0.76% and 3.02%. Subsequent analyses with the Tuzla Canton population sample revealed only a few differences in the individual haplogroup assignments when using different algorithms. Nevertheless, the resultant Y-chromosomal haplogroup distribution by each method was very similar, where the most prevalent haplogroups observed were I, R and E with their sublineages I2a, R1a and E1b1b, respectively, which is also in accordance with the previously published Y-SNP data for the B&H population. In conclusion, results presented herein not only constitute a concordance study on the four most popular haplogroup assignment algorithms, but they also give a deeper insight into the inter-population differentiation in B&H on the basis of Y haplogroups for the first time. Copyright © 2016 Elsevier GmbH. All rights reserved.
Population data on 11 Y-chromosome STRs from Guiné-Bissau.
Rosa, Alexandra; Ornelas, Carolina; Brehm, António; Villems, Richard
2006-03-10
The forensic value of Y-STR markers in Guiné-Bissau was accessed by typing of 215 males. Allele and haplotype frequencies, determined for loci DYS19, DYS389-I, DYS389-II, DYS390, DYS391, DYS392, DYS393, DYS437, DYS438, DYS439 and the duplicated locus DYS385, are within the limits of variation found in other populations south of the Sahara. The level of discrimination achieved is Guineans is higher than for European or other African populations with comparable data. The haplotype diversity of 0.9995 is reduced to 0.9981 when the minimal haplotype is considered thus revealing the importance of increasing the number of typed loci.
Lee, Eun Young; Lee, Hwan Young; Kwon, So Yeun; Oh, Yu Na; Yang, Woo Ick; Shin, Kyoung-Jin
2017-01-01
In forensic science and human genetics, Y-chromosomal short tandem repeats (Y-STRs) have been used as very useful markers. Recently, more Y-STR markers have been analyzed to enhance the resolution power in haplotype analysis, and 13 rapidly mutating (RM) Y-STRs have been suggested as revolutionary tools that can widen Y-chromosomal application from paternal lineage differentiation to male individualization. We have constructed two multiplex PCR sets for the amplification of 13 RM Y-STRs, which yield small-sized amplicons (<400bp) and a more balanced PCR efficiency with minimum PCR cycling. In particular, with the developed multiplex PCR system, we could separate three copies of DYF403S1a into two copies of DYF403S1a and one of DYF403S1b1. This is because DYF403S1b1 possesses distinguishable sequences from DYF403S1a at both the front and rear flanking regions of the repeat motif; therefore, the locus could be separately amplified using sequence-specific primers. In addition, the other copy, defined as DYF403S1b by Ballantyne et al., was renamed DYF403S1b2 because of its similar flanking region sequence to DYF403S1b1. By redefining DYF403S1 with the developed multiplex system, all genotypes of four copies could be successfully typed and more diverse haplotypes were obtained. We analyzed haplotype distributions in 705 Korean males based on four different Y-STR subsets: Yfiler, PowerPlex Y23, Yfiler Plus, and RM Y-STRs. All haplotypes obtained from RM Y-STRs were the most diverse and showed strong discriminatory power in Korean population. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Schettert, Isolmar T; Pereira, Alexandre C; Lopes, Neuza H; Hueb, Whady A; Krieger, Jose E
2006-01-01
A positive association was recently described between P2Y12 platelet receptor H1 and H2 haplotypes and peripheral artery disease. We tested the described P2Y12 receptor haplotypes in a group of patients with coronary artery disease. The P2Y12 platelet receptor H1 and H2 haplotypes was tested in a group of 540 patients enrolled in the Medical, Angioplasty, or Surgery Study II (MASS II), a randomized trial comparing treatments for patients with coronary artery disease (CAD) and preserved left ventricular function. After a 3-year follow-up period, the incidence of the composite end point of cardiac death, myocardial infarction, and refractory angina requiring revascularization was determined in the H1/H1, H1/H2 and H2/H2 haplotype groups. We used Student's t-test and the chi-square test to analyze the differences among groups and Kaplan-Meier method to calculate survival curves. Risk was assessed with the use of a Cox proportional-hazards model. The frequency of haplotypes among studied patients were 410 (75.9%) H1/H1, 119 (22.0%) H1/H2 and 11 (2.1%) H2/H2. The baseline clinical characteristics, mean clinical follow-up time and received treatment of each genotype group were similar. We did not disclose any association between haplotype groups regarding the incidence of any of the studied cardiovascular end-points. This is the first report studying the association of P2Y12 platelet receptor H1 and H2 haplotype and cardiovascular events. Our findings do not provide evidence for a strong association between H1/H1 and H1/H2 haplotypes and a increased risk of cardiovascular events in a population with CAD. Future works should address the role of the H2/H2 haplotype as a genetic marker for cardiovascular events.
Mitochondrial and Y-chromosomal profile of the Kazakh population from East Kazakhstan
Tarlykov, Pavel V.; Zholdybayeva, Elena V.; Akilzhanova, Ainur R.; Nurkina, Zhannur M.; Sabitov, Zhaxylyk M.; Rakhypbekov, Tolebay K.; Ramanculov, Erlan M.
2013-01-01
Aim To study the genetic relationship of Kazakhs from East Kazakhstan to other Eurasian populations by examining paternal and maternal DNA lineages. Methods Whole blood samples were collected in 2010 from 160 unrelated healthy Kazakhs residing in East Kazakhstan. Genomic DNA was extracted with Wizard® genomic DNA Purification Kit. Nucleotide sequence of hypervariable segment I of mitochondrial DNA (mtDNA) was determined and analyzed. Seventeen Y-short tandem repeat (STR) loci were studied in 67 samples with the AmpFiSTR Y-filer PCR Amplification Kit. In addition, mtDNA data for 2701 individuals and Y-STR data for 677 individuals were retrieved from the literature for comparison. Results There was a high degree of genetic differentiation on the level of mitochondrial DNA. The majority of maternal lineages belonged to haplogroups common in Central Asia. In contrast, Y-STR data showed very low genetic diversity, with the relative frequency of the predominant haplotype of 0.612. Conclusion The results revealed different migration patterns in the population sample, showing there had been more migration among women. mtDNA genetic diversity in this population was equivalent to that in other Central Asian populations. Genetic evidence suggests the existence of a single paternal founder lineage in the population of East Kazakhstan, which is consistent with verbal genealogical data of the local tribes. PMID:23444242
Filipino DNA variation at 12 X-chromosome short tandem repeat markers.
Salvador, Jazelyn M; Apaga, Dame Loveliness T; Delfin, Frederick C; Calacal, Gayvelline C; Dennis, Sheila Estacio; De Ungria, Maria Corazon A
2018-06-08
Demands for solving complex kinship scenarios where only distant relatives are available for testing have risen in the past years. In these instances, other genetic markers such as X-chromosome short tandem repeat (X-STR) markers are employed to supplement autosomal and Y-chromosomal STR DNA typing. However, prior to use, the degree of STR polymorphism in the population requires evaluation through generation of an allele or haplotype frequency population database. This population database is also used for statistical evaluation of DNA typing results. Here, we report X-STR data from 143 unrelated Filipino male individuals who were genotyped via conventional polymerase chain reaction-capillary electrophoresis (PCR-CE) using the 12 X-STR loci included in the Investigator ® Argus X-12 kit (Qiagen) and via massively parallel sequencing (MPS) of seven X-STR loci included in the ForenSeq ™ DNA Signature Prep kit of the MiSeq ® FGx ™ Forensic Genomics System (Illumina). Allele calls between PCR-CE and MPS systems were consistent (100% concordance) across seven overlapping X-STRs. Allele and haplotype frequencies and other parameters of forensic interest were calculated based on length (PCR-CE, 12 X-STRs) and sequence (MPS, seven X-STRs) variations observed in the population. Results of our study indicate that the 12 X-STRs in the PCR-CE system are highly informative for the Filipino population. MPS of seven X-STR loci identified 73 X-STR alleles compared with 55 X-STR alleles that were identified solely by length via PCR-CE. Of the 73 sequence-based alleles observed, six alleles have not been reported in the literature. The population data presented here may serve as a reference Philippine frequency database of X-STRs for forensic casework applications. Copyright © 2018 Elsevier B.V. All rights reserved.
First Polish DNA "manhunt"--an application of Y-chromosome STRs.
Dettlaff-Kakol, A; Pawlowski, R
2002-10-01
This study presents the application of Y-chromosomal STR polymorphisms to male identification in the case of a serial rapist and woman murderer in Poland. Since August 1996 a rapist from Swinoujscie (northwest Poland) committed at least 14 rapes. In the year 2000 he brutally raped 8 young girls and murdered a 22-year-old girl. DNA profiles obtained from semen stains left at the scenes of crime gave information that one and the same man had committed all the rapes. The Y-chromosome haplotype (9 loci) obtained was used for the elimination process of 421 suspects. One man was found who had an identical DNA profile in all Y-chromosome STR loci analysed and possessed common alleles in 9 out of 10 autosomal loci, strongly suggesting that the real rapist and the typed man were closely related males. Analysis of reference DNA obtained from the man's brother revealed an identical DNA STR profile to that identified at the crime scenes. To the best of our knowledge this is the first case in Poland and probably in Eastern Europe where DNA typing of a large population was used to identify the offender.
Characterization of genetic sequence variation of 58 STR loci in four major population groups.
Novroski, Nicole M M; King, Jonathan L; Churchill, Jennifer D; Seah, Lay Hong; Budowle, Bruce
2016-11-01
Massively parallel sequencing (MPS) can identify sequence variation within short tandem repeat (STR) alleles as well as their nominal allele lengths that traditionally have been obtained by capillary electrophoresis. Using the MiSeq FGx Forensic Genomics System (Illumina), STRait Razor, and in-house excel workbooks, genetic variation was characterized within STR repeat and flanking regions of 27 autosomal, 7 X-chromosome and 24 Y-chromosome STR markers in 777 unrelated individuals from four population groups. Seven hundred and forty six autosomal, 227 X-chromosome, and 324 Y-chromosome STR alleles were identified by sequence compared with 357 autosomal, 107 X-chromosome, and 189 Y-chromosome STR alleles that were identified by length. Within the observed sequence variation, 227 autosomal, 156 X-chromosome, and 112 Y-chromosome novel alleles were identified and described. One hundred and seventy six autosomal, 123 X-chromosome, and 93 Y-chromosome sequence variants resided within STR repeat regions, and 86 autosomal, 39 X-chromosome, and 20 Y-chromosome variants were located in STR flanking regions. Three markers, D18S51, DXS10135, and DYS385a-b had 1, 4, and 1 alleles, respectively, which contained both a novel repeat region variant and a flanking sequence variant in the same nucleotide sequence. There were 50 markers that demonstrated a relative increase in diversity with the variant sequence alleles compared with those of traditional nominal length alleles. These population data illustrate the genetic variation that exists in the commonly used STR markers in the selected population samples and provide allele frequencies for statistical calculations related to STR profiling with MPS data. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Ehler, Edvard; Vaněk, Daniel; Stenzl, Vlastimil; Vančata, Václav
2011-01-01
Aim To evaluate Y-chromosomal diversity of the Moravian Valachs of the Czech Republic and compare them with a Czech population sample and other samples from Central and South-Eastern Europe, and to evaluate the effects of genetic isolation and sampling. Methods The first sample set of the Valachs consisted of 94 unrelated male donors from the Valach region in northeastern Czech Republic border-area. The second sample set of the Valachs consisted of 79 men who originated from 7 paternal lineages defined by surname. No close relatives were sampled. The third sample set consisted of 273 unrelated men from the whole of the Czech Republic and was used for comparison, as well as published data for other 27 populations. The total number of samples was 3244. Y-short tandem repeat (STR) markers were typed by standard methods using PowerPlex® Y System (Promega) and Yfiler® Amplification Kit (Applied Biosystems) kits. Y-chromosomal haplogroups were estimated from the haplotype information. Haplotype diversity and other intra- and inter-population statistics were computed. Results The Moravian Valachs showed a lower genetic variability of Y-STR markers than other Central European populations, resembling more to the isolated Balkan populations (Aromuns, Csango, Bulgarian, and Macedonian Roma) than the surrounding populations (Czechs, Slovaks, Poles, Saxons). We illustrated the effect of sampling on Valach paternal lineages, which includes reduction of discrimination capacity and variability inside Y-chromosomal haplogroups. Valach modal haplotype belongs to R1a haplogroup and it was not detected in the Czech population. Conclusion The Moravian Valachs display strong substructure and isolation in their Y chromosomal markers. They represent a unique Central European population model for population genetics. PMID:21674832
Goedbloed, Miriam; Vermeulen, Mark; Fang, Rixun N; Lembring, Maria; Wollstein, Andreas; Ballantyne, Kaye; Lao, Oscar; Brauer, Silke; Krüger, Carmen; Roewer, Lutz; Lessig, Rüdiger; Ploski, Rafal; Dobosz, Tadeusz; Henke, Lotte; Henke, Jürgen; Furtado, Manohar R; Kayser, Manfred
2009-11-01
The Y-chromosomal short tandem repeat (Y-STR) polymorphisms included in the AmpFlSTR Yfiler polymerase chain reaction amplification kit have become widely used for forensic and evolutionary applications where a reliable knowledge on mutation properties is necessary for correct data interpretation. Therefore, we investigated the 17 Yfiler Y-STRs in 1,730-1,764 DNA-confirmed father-son pairs per locus and found 84 sequence-confirmed mutations among the 29,792 meiotic transfers covered. Of the 84 mutations, 83 (98.8%) were single-repeat changes and one (1.2%) was a double-repeat change (ratio, 1:0.01), as well as 43 (51.2%) were repeat gains and 41 (48.8%) repeat losses (ratio, 1:0.95). Medians from Bayesian estimation of locus-specific mutation rates ranged from 0.0003 for DYS448 to 0.0074 for DYS458, with a median rate across all 17 Y-STRs of 0.0025. The mean age (at the time of son's birth) of fathers with mutations was with 34.40 (+/-11.63) years higher than that of fathers without ones at 30.32 (+/-10.22) years, a difference that is highly statistically significant (p < 0.001). A Poisson-based modeling revealed that the Y-STR mutation rate increased with increasing father's age on a statistically significant level (alpha = 0.0294, 2.5% quantile = 0.0001). From combining our data with those previously published, considering all together 135,212 meiotic events and 331 mutations, we conclude for the Yfiler Y-STRs that (1) none had a mutation rate of >1%, 12 had mutation rates of >0.1% and four of <0.1%, (2) single-repeat changes were strongly favored over multiple-repeat ones for all loci but 1 and (3) considerable variation existed among loci in the ratio of repeat gains versus losses. Our finding of three Y-STR mutations in one father-son pair (and two pairs with two mutations each) has consequences for determining the threshold of allelic differences to conclude exclusion constellations in future applications of Y-STRs in paternity testing and pedigree analyses.
Y chromosomal haplotype characteristics of domestic sheep (Ovis aries) in China.
Wang, Yutao; Xu, Lei; Yan, Wei; Li, Shaobin; Wang, Jiqing; Liu, Xiu; Hu, Jiang; Luo, Yuzhu
2015-07-10
Investigations on the variation present at the male-specific Y chromosome region provide strong information to understand the origin and evolution of domestic sheep. One SNP OY1 (g.88A>G) in the upstream region of SRY gene, and the microsatellite SRYM18 locus within ovine Y chromosome were analyzed in one hundred and forty five samples collected from eleven breeds in China. SNP OY1 was analyzed using PCR-SSCP method and sequencing. Two different PCR-SSCP patterns represented two specific sequences with sequence analysis revealing SNP-OY1 (g.88A>G) were observed, while SNP A-OY1 showed the most common frequency (82.8%). Sequencing of the SRYM18 region revealed one novel size fragment (A2) with different repetitive units. Seven haplotypes (H4, H5, H6, H7, H8, H9 and H12) and two novel haplotypes (Ha and Hb) were established using combined genotype analysis. H6 showed the highest frequency (43.4%) across all breeds, and H8 showed the second frequency (24.1%). Ha was only found in one breed (Tan), while Hb was present in three breeds (Gansu alpine, White Suffolk and Duolang). Our findings reveal one novel allele in SRYM18 region and two novel male haplotypes of domestic sheep in China. Copyright © 2015 Elsevier B.V. All rights reserved.
Liu, Yao-Shun; Chen, Jian-Gang; Mei, Ting; Guo, Yu-Xin; Meng, Hao-Tian; Li, Jian-Fei; Wei, Yuan-Yuan; Jin, Xiao-Ye; Zhu, Bo-Feng; Zhang, Li-Ping
2017-01-01
We analyzed the genetic polymorphisms of 15 autosomal and 10 Y-chromosomal STR loci in 214 individuals of Han population from Southern Shaanxi of China and studied the genetic relationships between Southern Shaanxi Han and other populations. We observed a total of 150 alleles at 15 autosomal STR loci with the corresponding allelic frequencies ranging from 0.0023 to 0.5210, and the combined power of discrimination and exclusion for the 15 autosomal STR loci were 0.99999999999999998866 and 0.999998491, respectively. For the 10 Y-STR loci, totally 100 different haplotypes were obtained, of which 94 were unique. The discriminatory capacity and haplotype diversity values of the 10 Y-STR loci were 0.9259 and 0.998269, respectively. The results demonstrated high genetic diversities of the 25 STR loci in the population for forensic applications. We constructed neighbor-joining tree and conducted principal component analysis based on 15 autosomal STR loci and conducted multidimensional scaling analysis and constructed neighbor-joining tree based on 10 Y-STR loci. The results of population genetic analyses based on both autosomal and Y-chromosome STRs indicated that the studied Southern Shaanxi Han population had relatively closer genetic relationship with Eastern Han population, and distant relationships with Croatian, Serbian and Moroccan populations. PMID:28903432
Liu, Yao-Shun; Chen, Jian-Gang; Mei, Ting; Guo, Yu-Xin; Meng, Hao-Tian; Li, Jian-Fei; Wei, Yuan-Yuan; Jin, Xiao-Ye; Zhu, Bo-Feng; Zhang, Li-Ping
2017-08-15
We analyzed the genetic polymorphisms of 15 autosomal and 10 Y-chromosomal STR loci in 214 individuals of Han population from Southern Shaanxi of China and studied the genetic relationships between Southern Shaanxi Han and other populations. We observed a total of 150 alleles at 15 autosomal STR loci with the corresponding allelic frequencies ranging from 0.0023 to 0.5210, and the combined power of discrimination and exclusion for the 15 autosomal STR loci were 0.99999999999999998866 and 0.999998491, respectively. For the 10 Y-STR loci, totally 100 different haplotypes were obtained, of which 94 were unique. The discriminatory capacity and haplotype diversity values of the 10 Y-STR loci were 0.9259 and 0.998269, respectively. The results demonstrated high genetic diversities of the 25 STR loci in the population for forensic applications. We constructed neighbor-joining tree and conducted principal component analysis based on 15 autosomal STR loci and conducted multidimensional scaling analysis and constructed neighbor-joining tree based on 10 Y-STR loci. The results of population genetic analyses based on both autosomal and Y-chromosome STRs indicated that the studied Southern Shaanxi Han population had relatively closer genetic relationship with Eastern Han population, and distant relationships with Croatian, Serbian and Moroccan populations.
Analysis of 12 X-STR loci in the population of south Croatia.
Mršić, Gordan; Ozretić, Petar; Crnjac, Josip; Merkaš, Siniša; Račić, Ivana; Rožić, Sara; Sukser, Viktorija; Popović, Maja; Korolija, Marina
2017-02-01
The aim of the study was to assess forensic pertinence of 12 short tandem repeats (STRs) on X-chromosome in south Croatia population. Investigator ® Argus X-12 kit was used to co-amplify 12 STR loci belonging to four linkage groups (LGs) on X-chromosome in 99 male and 98 female DNA samples of unrelated donors. PCR products were analyzed by capillary electrophoresis. Population genetic and forensic parameters were calculated by the Arlequin and POPTREE2 software, and an on-line tool available at ChrX-STR.org. Hardy-Weinberg equilibrium was confirmed for all X-STR markers in female samples. Biallelic patterns at DXS10079 locus were detected in four male samples. Polymorphism information content for the most (DXS10135) and the least (DXS8378) informative markers was 0.9212 and 0.6347, respectively. In both male and female samples, combined power of discrimination exceeded 0.999999999. As confirmed by linkage disequilibrium test, significant association of marker pair DXS10074-DXS10079 (P = 0.0004) within LG2 and marker pair DXS10101-DXS10103 (P = 0.0003) within LG3 was found only in male samples. Number of observed haplotypes in our sample pool amounted 3.01, 7.53, 5 and 3.25% of the number of possible haplotypes for LG1, LG2, LG3 and LG4, respectively. According to haplotype diversity value of 0.9981, LG1 was the most informative. In comparison of south Croatia with 26 world populations, pair-wise [Formula: see text] values increase in parallel with geographical distance. Overall statistical assessment confirmed suitability of Investigator ® Argus X-12 kit for forensic casework in both identification and familial testing in the population of south Croatia.
Saha, Anjana; Sharma, Swarkar; Bhat, Audesh; Pandit, Awadesh; Bamezai, Ramesh
2005-01-01
Four binary polymorphisms and four multiallelic short tandem repeat (STR) loci from the nonrecombining region of the human Y-chromosome were typed in different Indian population groups from Uttar Pradeh (UP), Bihar (BI), Punjab (PUNJ), and Bengal (WB) speaking the Indo-Aryan dialects and from South India (SI) with the root in the Dravidian language. We identified four major haplogroups [(P) 1+, (C and F) 2+, (R1a) 3, (K) 26+] and 114 combinations of Y-STR haplotypes. Analyses of the haplogroups indicated no single origin from any lineage but a result of a conglomeration of different lineages from time to time. The phylogenetic analyses indicate a high degree of population admixture and a greater genetic proximity for the studied population groups when compared with other world populations.
Fernández-Domínguez, Eva; Bertoncini, Stefania; Chimonas, Marios; Christofi, Vasilis; King, Jonathan; Budowle, Bruce; Manoli, Panayiotis
2017-01-01
Genetics can provide invaluable information on the ancestry of the current inhabitants of Cyprus. A Y-chromosome analysis was performed to (i) determine paternal ancestry among the Greek Cypriot (GCy) community in the context of the Central and Eastern Mediterranean and the Near East; and (ii) identify genetic similarities and differences between Greek Cypriots (GCy) and Turkish Cypriots (TCy). Our haplotype-based analysis has revealed that GCy and TCy patrilineages derive primarily from a single gene pool and show very close genetic affinity (low genetic differentiation) to Calabrian Italian and Lebanese patrilineages. In terms of more recent (past millennium) ancestry, as indicated by Y-haplotype sharing, GCy and TCy share much more haplotypes between them than with any surrounding population (7–8% of total haplotypes shared), while TCy also share around 3% of haplotypes with mainland Turks, and to a lesser extent with North Africans. In terms of Y-haplogroup frequencies, again GCy and TCy show very similar distributions, with the predominant haplogroups in both being J2a-M410, E-M78, and G2-P287. Overall, GCy also have a similar Y-haplogroup distribution to non-Turkic Anatolian and Southwest Caucasian populations, as well as Cretan Greeks. TCy show a slight shift towards Turkish populations, due to the presence of Eastern Eurasian (some of which of possible Ottoman origin) Y-haplogroups. Overall, the Y-chromosome analysis performed, using both Y-STR haplotype and binary Y-haplogroup data puts Cypriot in the middle of a genetic continuum stretching from the Levant to Southeast Europe and reveals that despite some differences in haplotype sharing and haplogroup structure, Greek Cypriots and Turkish Cypriots share primarily a common pre-Ottoman paternal ancestry. PMID:28622394
Heraclides, Alexandros; Bashiardes, Evy; Fernández-Domínguez, Eva; Bertoncini, Stefania; Chimonas, Marios; Christofi, Vasilis; King, Jonathan; Budowle, Bruce; Manoli, Panayiotis; Cariolou, Marios A
2017-01-01
Genetics can provide invaluable information on the ancestry of the current inhabitants of Cyprus. A Y-chromosome analysis was performed to (i) determine paternal ancestry among the Greek Cypriot (GCy) community in the context of the Central and Eastern Mediterranean and the Near East; and (ii) identify genetic similarities and differences between Greek Cypriots (GCy) and Turkish Cypriots (TCy). Our haplotype-based analysis has revealed that GCy and TCy patrilineages derive primarily from a single gene pool and show very close genetic affinity (low genetic differentiation) to Calabrian Italian and Lebanese patrilineages. In terms of more recent (past millennium) ancestry, as indicated by Y-haplotype sharing, GCy and TCy share much more haplotypes between them than with any surrounding population (7-8% of total haplotypes shared), while TCy also share around 3% of haplotypes with mainland Turks, and to a lesser extent with North Africans. In terms of Y-haplogroup frequencies, again GCy and TCy show very similar distributions, with the predominant haplogroups in both being J2a-M410, E-M78, and G2-P287. Overall, GCy also have a similar Y-haplogroup distribution to non-Turkic Anatolian and Southwest Caucasian populations, as well as Cretan Greeks. TCy show a slight shift towards Turkish populations, due to the presence of Eastern Eurasian (some of which of possible Ottoman origin) Y-haplogroups. Overall, the Y-chromosome analysis performed, using both Y-STR haplotype and binary Y-haplogroup data puts Cypriot in the middle of a genetic continuum stretching from the Levant to Southeast Europe and reveals that despite some differences in haplotype sharing and haplogroup structure, Greek Cypriots and Turkish Cypriots share primarily a common pre-Ottoman paternal ancestry.
Gaspar, Paulo; Seixas, Susana; Rocha, Jorge
2004-04-01
The genetic variation at a compound nonrecombining haplotype system, consisting of the previously reported SB19.3 Alu insertion polymorphism and a newly identified adjacent short tandem repeat (STR), was studied in population samples from Portugal and São Tomé (Gulf of Guinea, West Africa). Age estimates based on the linked microsatellite variation suggest that the Alu insertion occurred about 190,000 years ago. In accordance with the global patterns of distribution of human genetic variation, the highest haplotype diversity was found in the African sample. This excess in African diversity was due to both a substantial reduction in heterozygosity at the Alu polymorphism and a lower STR variability associated with the predominant Alu insertion allele in the Portuguese sample. The high level of interpopulation differentiation observed at the Alu locus (F(ST) = 0.43) was interpreted under alternative selective and demographic scenarios. The need for compatibility between patterns of variation at the STR and Alu loci could be used to restrict the range of selection coefficients in selection-driven genetic hitchhiking frameworks and to favor demographic scenarios dominated by larger pre-expansion African population sizes. Taken together, the data show that the SB19.3 Alu-STR system is an informative marker that can be included in more extended batteries of compound haplotypes used in human evolutionary studies.
Kwon, So Yeun; Lee, Hwan Young; Kim, Eun Hye; Lee, Eun Young; Shin, Kyoung-Jin
2016-11-01
Next-generation sequencing (NGS) can produce massively parallel sequencing (MPS) data for many targeted regions with a high depth of coverage, suggesting its successful application to the amplicons of forensic genetic markers. In the present study, we evaluated the practical utility of MPS in Y-chromosome short tandem repeat (Y-STR) analysis using a multiplex polymerase chain reaction (PCR) system. The multiplex PCR system simultaneously amplified 24 Y-chromosomal markers, including the PowerPlex ® Y23 loci (DYS19, DYS385ab, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS437, DYS438, DYS439, DYS448, DYS456, DYS458, DYS481, DYS533, DYS549, DYS570, DYS576, DYS635, DYS643, and YGATAH4) and the M175 marker with the small-sized amplicons ranging from 85 to 253bp. The barcoded libraries for the amplicons of the 24 Y-chromosomal markers were produced using a simplified PCR-based library preparation method and successfully sequenced using MPS on a MiSeq ® System with samples from 250 unrelated Korean males. The genotyping concordance between MPS and the capillary electrophoresis (CE) method, as well as the sequence structure of the 23 Y-STRs, were investigated. Three samples exhibited discordance between the MPS and CE results at DYS385, DYS439, and DYS576. There were 12 Y-STR loci that showed sequence variations in the alleles by a fragment size determination, and the most varied alleles occurred in DYS389II with a different sequence structure in the repeat region. The largest increase in gene diversity between the CE and MPS results was in DYS437 at +34.41%. Single nucleotide polymorphisms (SNPs), insertions, and deletions (indels) were observed in the flanking regions of DYS481, DYS576, and DYS385, respectively. Stutter and noise ratios of the 23 Y-STRs using the developed MPS system were also investigated. Based on these results, the MPS analysis system used in this study could facilitate the investigation into the sequences of the 23 Y-STRs in forensic
Current state-of-art of STR sequencing in forensic genetics.
Alonso, Antonio; Barrio, Pedro A; Müller, Petra; Köcher, Steffi; Berger, Burkhard; Martin, Pablo; Bodner, Martin; Willuweit, Sascha; Parson, Walther; Roewer, Lutz; Budowle, Bruce
2018-05-11
The current state of validation and implementation strategies of MPS technology for the analysis of STR markers for forensic genetics use is described, covering the topics of the current catalogue of commercial MPS-STR panels, leading MPS-platforms, and MPS-STR data analysis tools. In addition, the developmental and internal validation studies carried out to date to evaluate reliability, sensitivity, mixture analysis, concordance, and the ability to analyze challenged samples are summarized. The results of various MPS-STR population studies that showed a large number of new STR sequence variants that increase the power of discrimination in several forensically-relevant loci are also presented. Finally, various initiatives developed by several international projects and standardization (or guidelines) groups to facilitate application of MPS technology for STR marker analyses are discussed in regard to promoting a standard STR sequence nomenclature, performing population studies to detect sequence variants, and developing a universal system to translate sequence variants into a simple STR nomenclature (numbers and letters) compatible with national STR databases. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Iuvaro, Alessandra; Bini, Carla; Dilloo, Silvia; Sarno, Stefania; Pelotti, Susi
2018-04-17
The collection of biological debris beneath fingernails can be useful in forensic casework when a struggle between the victim and the offender is suspected. In the present study, we set up a controlled scratching experiment in which female volunteers scratched the male volunteers' forearms, simulating a defensive action during an assault. A total of 160 fingernail samples were collected: 80 "control samples" before the scratching, 40 samples immediately after the scratching (t = 0 h), and 40 samples 5 h after the scratching (t = 5 h). The aim was to evaluate, using a real-time PCR approach and Y-STR profiling, the transfer and the persistence of male DNA under female fingernails after scratching. A significant reduction in DNA yield was observed between fingernail samples collected immediately and those collected 5 h after scratching, with a corresponding decrease in Y-STR profile quality. Overall, 38/40 (95%) of the fingernail samples collected immediately (t = 0 h) and 24/40 (60%) of those collected 5 h later (t = 5 h) were suitable for comparison and the scratched male volunteers could not be excluded as donors of the foreign DNA from 37 (92.5%) of the t = 0 h and from 10 (25%) of the t = 5 h profiles. The analysis of male DNA under female fingernails showed that Y-chromosome STR typing may provide extremely valuable genetic information of the male contributor(s), although 5 h after scratching the profile of the scratched male was lost in three-quarters of samples.
Nuclear, chloroplast, and mitochondrial data of a US cannabis DNA database.
Houston, Rachel; Birck, Matthew; LaRue, Bobby; Hughes-Stamm, Sheree; Gangitano, David
2018-05-01
As Cannabis sativa (marijuana) is a controlled substance in many parts of the world, the ability to track biogeographical origin of cannabis could provide law enforcement with investigative leads regarding its trade and distribution. Population substructure and inbreeding may cause cannabis plants to become more genetically related. This genetic relatedness can be helpful for intelligence purposes. Analysis of autosomal, chloroplast, and mitochondrial DNA allows for not only prediction of biogeographical origin of a plant but also discrimination between individual plants. A previously validated, 13-autosomal STR multiplex was used to genotype 510 samples. Samples were analyzed from four different sites: 21 seizures at the US-Mexico border, Northeastern Brazil, hemp seeds purchased in the US, and the Araucania area of Chile. In addition, a previously reported multi-loci system was modified and optimized to genotype five chloroplast and two mitochondrial markers. For this purpose, two methods were designed: a homopolymeric STR pentaplex and a SNP triplex with one chloroplast (Cscp001) marker shared by both methods for quality control. For successful mitochondrial and chloroplast typing, a novel real-time PCR quantitation method was developed and validated to accurately estimate the quantity of the chloroplast DNA (cpDNA) using a synthetic DNA standard. Moreover, a sequenced allelic ladder was also designed for accurate genotyping of the homopolymeric STR pentaplex. For autosomal typing, 356 unique profiles were generated from the 425 samples that yielded full STR profiles and 25 identical genotypes within seizures were observed. Phylogenetic analysis and case-to-case pairwise comparisons of 21 seizures at the US-Mexico border, using the Fixation Index (F ST ) as genetic distance, revealed the genetic association of nine seizures that formed a reference population. For mitochondrial and chloroplast typing, subsampling was performed, and 134 samples were genotyped
Daneshpour, Maryam Sadat; Hosseinzadeh, Nima; Zarkesh, Maryam; Azizi, Fereidoun
2012-03-01
Different variants of haplotype frequencies may lead to various frequencies of the same variants in individuals with drug resistance and disease susceptibility at the population level. In this study, the haplotype frequencies of 4 STR loci including the D8S1132, D8S1779, D8S514 and D8S1743, and 3 STR loci including D11S1304, D11S1998 and D11S934 were investigated in 563 individuals of four Iranian ethnic groups in the capital city of Iran, Tehran. One hundred thirty subjects had the metabolic syndrome. Haplotype frequencies of all markers were calculated. There were significant differences in the haplotype frequencies in short and long alleles between the metabolic affected subjects and controls. In addition, haplotype frequencies were significant in the four ethnic groups in both chromosomes 8 and 11. Our findings show a relation between the short allele of D8S1743 in all related haplotype frequencies of subjects with metabolic syndrome. These findings may require more studies of some candidate genes, including the lipoprotein lipase gene, in this chromosomal region. Copyright © 2011. Published by Elsevier B.V.
Xin, Y P; Zan, L S; Wang, Y H; Liu, Y F; Tian, W Q; Fan, Y Y
2011-01-01
The correlations between Y chromosome polymorphisms and the carcass traits were studied in five Chinese beef cattle populations by PCR, single strand conformation polymorphism and Y-STR sequence analysis. Nine alleles and their frequencies were identified on Y-STR UMN0929 region in Qinchuan (n=116), Luxi (n=112), Jinnan (n=104) pure breeds, Simmental×Qinchuan crossbred (n=80) and Angus×Qinchuan crossbred (n=96). The most popular A-176 and B-178 alleles were presented in all 5 cattle populations in the range of 12% (Jinnan) to 66% (Simmental×Qinchuan). The allele I-194 presented Luxi and Angus×Qinchuan. In Qinchun cattle, G-190 and E-186 alleles had bigger effect on BPI (4.23±0.32 and 4.22±0.48 kg/cm, P<0.01) and CW (325.40±49.42 and 316.73±45.29 kg, P<0.01), respectively. In Luxi cattle, I-194 allele affected higher BPI (4.08±0.35 kg/cm, P<0.01) and CW (302.07±17.55 kg, P<0.01), respectively. In Jinnan cattle breed, H-192 had higher BPI (4.32±0.50 kg/cm, P<0.05) and CW (327.87±59.37 kg, P<0.05), respectively. In Simmental×Qinchuan cross breed, C-180 allele affected largely on BPI (5.16±0.25 kg/cm, P<0.05) and CW (393.16±25.92 kg, P<0.05). In Angus×Qinchuan cross breed, I-194 had higher BPI (4.43±0.33 kg, P<0.05) and CW (346.63±29.77 kg, P<0.05). Correlations between alleles and other carcass traits (net meat weight, top grade weight, slaughter rate, net meat rate, loin-eye muscle area, carcass length, meet tenderness and shear force) were also analyzed using mixed-effect model. Cattle Y-STR UMN0929 loci alleles and its correlation with carcass traits in beef cattle populations could be implemented into the cattle breeding program for choosing beef cattle with better carcass traits.
Niederstätter, Harald; Rampl, Gerhard; Erhart, Daniel; Pitterl, Florian; Oberacher, Herbert; Neuhuber, Franz; Hausner, Isolde; Gassner, Christoph; Schennach, Harald; Berger, Burkhard; Parson, Walther
2012-01-01
The small alpine district of East Tyrol (Austria) has an exceptional demographic history. It was contemporaneously inhabited by members of the Romance, the Slavic and the Germanic language groups for centuries. Since the Late Middle Ages, however, the population of the principally agrarian-oriented area is solely Germanic speaking. Historic facts about East Tyrol's colonization are rare, but spatial density-distribution analysis based on the etymology of place-names has facilitated accurate spatial mapping of the various language groups' former settlement regions. To test for present-day Y chromosome population substructure, molecular genetic data were compared to the information attained by the linguistic analysis of pasture names. The linguistic data were used for subdividing East Tyrol into two regions of former Romance (A) and Slavic (B) settlement. Samples from 270 East Tyrolean men were genotyped for 17 Y-chromosomal microsatellites (Y-STRs) and 27 single nucleotide polymorphisms (Y-SNPs). Analysis of the probands' surnames revealed no evidence for spatial genetic structuring. Also, spatial autocorrelation analysis did not indicate significant correlation between genetic (Y-STR haplotypes) and geographic distance. Haplogroup R-M17 chromosomes, however, were absent in region A, but constituted one of the most frequent haplogroups in region B. The R-M343 (R1b) clade showed a marked and complementary frequency distribution pattern in these two regions. To further test East Tyrol's modern Y-chromosomal landscape for geographic patterning attributable to the early history of settlement in this alpine area, principal coordinates analysis was performed. The Y-STR haplotypes from region A clearly clustered with those of Romance reference populations and the samples from region B matched best with Germanic speaking reference populations. The combined use of onomastic and molecular genetic data revealed and mapped the marked structuring of the distribution of Y
Niederstätter, Harald; Rampl, Gerhard; Erhart, Daniel; Pitterl, Florian; Oberacher, Herbert; Neuhuber, Franz; Hausner, Isolde; Gassner, Christoph; Schennach, Harald; Berger, Burkhard; Parson, Walther
2012-01-01
The small alpine district of East Tyrol (Austria) has an exceptional demographic history. It was contemporaneously inhabited by members of the Romance, the Slavic and the Germanic language groups for centuries. Since the Late Middle Ages, however, the population of the principally agrarian-oriented area is solely Germanic speaking. Historic facts about East Tyrol's colonization are rare, but spatial density-distribution analysis based on the etymology of place-names has facilitated accurate spatial mapping of the various language groups' former settlement regions. To test for present-day Y chromosome population substructure, molecular genetic data were compared to the information attained by the linguistic analysis of pasture names. The linguistic data were used for subdividing East Tyrol into two regions of former Romance (A) and Slavic (B) settlement. Samples from 270 East Tyrolean men were genotyped for 17 Y-chromosomal microsatellites (Y-STRs) and 27 single nucleotide polymorphisms (Y-SNPs). Analysis of the probands' surnames revealed no evidence for spatial genetic structuring. Also, spatial autocorrelation analysis did not indicate significant correlation between genetic (Y-STR haplotypes) and geographic distance. Haplogroup R-M17 chromosomes, however, were absent in region A, but constituted one of the most frequent haplogroups in region B. The R-M343 (R1b) clade showed a marked and complementary frequency distribution pattern in these two regions. To further test East Tyrol's modern Y-chromosomal landscape for geographic patterning attributable to the early history of settlement in this alpine area, principal coordinates analysis was performed. The Y-STR haplotypes from region A clearly clustered with those of Romance reference populations and the samples from region B matched best with Germanic speaking reference populations. The combined use of onomastic and molecular genetic data revealed and mapped the marked structuring of the distribution of Y
Short Tandem Repeat DNA Internet Database
National Institute of Standards and Technology Data Gateway
SRD 130 Short Tandem Repeat DNA Internet Database (Web, free access) Short Tandem Repeat DNA Internet Database is intended to benefit research and application of short tandem repeat DNA markers for human identity testing. Facts and sequence information on each STR system, population data, commonly used multiplex STR systems, PCR primers and conditions, and a review of various technologies for analysis of STR alleles have been included.
Mitchell, R J; Earl, L; Fricke, B
1997-10-01
Variation on the Y chromosome may permit our understanding the evolution of the human paternal lineage and male gene flow. This study reports upon the distribution and non random association of alleles at four Y-chromosome specific loci in four populations, three Caucasoid (Italian, Greek and Slav) and one Asian. The markers include insertion/deletion (p12f), point mutation (92R7 and pY alpha I), and repeat sequence (p21A1) polymorphisms. Our data confirm that the p12f/TaqI 8 kb allele is a Caucasoid marker and that Asians are monomorphic at three of the loci (p12f, 92R7, and pY alpha I). The alleles at 92R7 and pY alpha I were found to be in complete disequilibrium in Europeans. Y-haplotype diversity was highly significant between Asians and all three European groups (P < 0.001), but the Greeks and Italians were also significantly different with respect to some alleles and haplotypes (P < 0.02). We find strong evidence that the p12f/TaqI 8 kb allele may have arisen only once, as a deletion event, and, additionally, that the present-day frequency distribution of Y chromosomes carrying the p12f/8 kb allele suggests that it may have been spread by colonising sea-faring peoples from the Near East, possibly the Phoenicians, rather than by expansion of Neolithic farmers into continental Europe. The p12f deletion is the key marker of a unique Y chromosome, found only in Caucasians to date, labelled 'Mediterranean' and this further increases the level of Y-chromosome diversity seen among Caucasoids when compared to the other major population groups.
Padilla-Gutiérrez, Jorge Ramón; Valle, Yeminia; Quintero-Ramos, Antonio; Hernández, Guillermo; Rodarte, Katya; Ortiz, Rocío; Olivares, Norma; Rivas, Fernando
2008-11-01
Nine Y-STR (DYS19, DYS390, DYS391, DYS392, DYS446, DYS447, DYS448, DYS456 and DYS458) were analyzed in a male sample of 285 unrelated individuals from Guadalajara, Jalisco, México. The haplotype diversity (0.996) and discrimination capacity (0.986) were calculated. A family study of around 200 father/son pairs and among 1828 meiosis showed five mutational events. All mutations were single step. The overall mutation rate estimated across the nine Y-STRs was 2.7 x 10(-3) (95% CI 1.2-6.4 x 10(-3))/locus/meiosis. The results indicate that these nine loci are useful Y-linked markers for forensic applications.
Brown, Sarah K; Pedersen, Niels C; Jafarishorijeh, Sardar; Bannasch, Danika L; Ahrens, Kristen D; Wu, Jui-Te; Okon, Michaella; Sacks, Benjamin N
2011-01-01
Modern genetic samples are commonly used to trace dog origins, which entails untested assumptions that village dogs reflect indigenous ancestry or that breed origins can be reliably traced to particular regions. We used high-resolution Y chromosome markers (SNP and STR) and mitochondrial DNA to analyze 495 village dogs/dingoes from the Middle East and Southeast Asia, along with 138 dogs from >35 modern breeds to 1) assess genetic divergence between Middle Eastern and Southeast Asian village dogs and their phylogenetic affinities to Australian dingoes and gray wolves (Canis lupus) and 2) compare the genetic affinities of modern breeds to regional indigenous village dog populations. The Y chromosome markers indicated that village dogs in the two regions corresponded to reciprocally monophyletic clades, reflecting several to many thousand years divergence, predating the Neolithic ages, and indicating long-indigenous roots to those regions. As expected, breeds of the Middle East and East Asia clustered within the respective regional village dog clade. Australian dingoes also clustered in the Southeast Asian clade. However, the European and American breeds clustered almost entirely within the Southeast Asian clade, even sharing many haplotypes, suggesting a substantial and recent influence of East Asian dogs in the creation of European breeds. Comparison to 818 published breed dog Y STR haplotypes confirmed this conclusion and indicated that some African breeds reflect another distinct patrilineal origin. The lower-resolution mtDNA marker consistently supported Y-chromosome results. Both marker types confirmed previous findings of higher genetic diversity in dogs from Southeast Asia than the Middle East. Our findings demonstrate the importance of village dogs as windows into the past and provide a reference against which ancient DNA can be used to further elucidate origins and spread of the domestic dog.
Brown, Sarah K.; Pedersen, Niels C.; Jafarishorijeh, Sardar; Bannasch, Danika L.; Ahrens, Kristen D.; Wu, Jui-Te; Okon, Michaella; Sacks, Benjamin N.
2011-01-01
Modern genetic samples are commonly used to trace dog origins, which entails untested assumptions that village dogs reflect indigenous ancestry or that breed origins can be reliably traced to particular regions. We used high-resolution Y chromosome markers (SNP and STR) and mitochondrial DNA to analyze 495 village dogs/dingoes from the Middle East and Southeast Asia, along with 138 dogs from >35 modern breeds to 1) assess genetic divergence between Middle Eastern and Southeast Asian village dogs and their phylogenetic affinities to Australian dingoes and gray wolves (Canis lupus) and 2) compare the genetic affinities of modern breeds to regional indigenous village dog populations. The Y chromosome markers indicated that village dogs in the two regions corresponded to reciprocally monophyletic clades, reflecting several to many thousand years divergence, predating the Neolithic ages, and indicating long-indigenous roots to those regions. As expected, breeds of the Middle East and East Asia clustered within the respective regional village dog clade. Australian dingoes also clustered in the Southeast Asian clade. However, the European and American breeds clustered almost entirely within the Southeast Asian clade, even sharing many haplotypes, suggesting a substantial and recent influence of East Asian dogs in the creation of European breeds. Comparison to 818 published breed dog Y STR haplotypes confirmed this conclusion and indicated that some African breeds reflect another distinct patrilineal origin. The lower-resolution mtDNA marker consistently supported Y-chromosome results. Both marker types confirmed previous findings of higher genetic diversity in dogs from Southeast Asia than the Middle East. Our findings demonstrate the importance of village dogs as windows into the past and provide a reference against which ancient DNA can be used to further elucidate origins and spread of the domestic dog. PMID:22194840
Benschop, Corina C G; van der Beek, Cornelis P; Meiland, Hugo C; van Gorp, Ankie G M; Westen, Antoinette A; Sijen, Titia
2011-08-01
To analyze DNA samples with very low DNA concentrations, various methods have been developed that sensitize short tandem repeat (STR) typing. Sensitized DNA typing is accompanied by stochastic amplification effects, such as allele drop-outs and drop-ins. Therefore low template (LT) DNA profiles are interpreted with care. One can either try to infer the genotype by a consensus method that uses alleles confirmed in replicate analyses, or one can use a statistical model to evaluate the strength of the evidence in a direct comparison with a known DNA profile. In this study we focused on the first strategy and we show that the procedure by which the consensus profile is assembled will affect genotyping reliability. In order to gain insight in the roles of replicate number and requested level of reproducibility, we generated six independent amplifications of samples of known donors. The LT methods included both increased cycling and enhanced capillary electrophoresis (CE) injection [1]. Consensus profiles were assembled from two to six of the replications using four methods: composite (include all alleles), n-1 (include alleles detected in all but one replicate), n/2 (include alleles detected in at least half of the replicates) and 2× (include alleles detected twice). We compared the consensus DNA profiles with the DNA profile of the known donor, studied the stochastic amplification effects and examined the effect of the consensus procedure on DNA database search results. From all these analyses we conclude that the accuracy of LT DNA typing and the efficiency of database searching improve when the number of replicates is increased and the consensus method is n/2. The most functional number of replicates within this n/2 method is four (although a replicate number of three suffices for samples showing >25% of the alleles in standard STR typing). This approach was also the optimal strategy for the analysis of 2-person mixtures, although modified search strategies may be
Bodner, Martin; Bastisch, Ingo; Butler, John M; Fimmers, Rolf; Gill, Peter; Gusmão, Leonor; Morling, Niels; Phillips, Christopher; Prinz, Mechthild; Schneider, Peter M; Parson, Walther
2016-09-01
The statistical evaluation of autosomal Short Tandem Repeat (STR) genotypes is based on allele frequencies. These are empirically determined from sets of randomly selected human samples, compiled into STR databases that have been established in the course of population genetic studies. There is currently no agreed procedure of performing quality control of STR allele frequency databases, and the reliability and accuracy of the data are largely based on the responsibility of the individual contributing research groups. It has been demonstrated with databases of haploid markers (EMPOP for mitochondrial mtDNA, and YHRD for Y-chromosomal loci) that centralized quality control and data curation is essential to minimize error. The concepts employed for quality control involve software-aided likelihood-of-genotype, phylogenetic, and population genetic checks that allow the researchers to compare novel data to established datasets and, thus, maintain the high quality required in forensic genetics. Here, we present STRidER (http://strider.online), a publicly available, centrally curated online allele frequency database and quality control platform for autosomal STRs. STRidER expands on the previously established ENFSI DNA WG STRbASE and applies standard concepts established for haploid and autosomal markers as well as novel tools to reduce error and increase the quality of autosomal STR data. The platform constitutes a significant improvement and innovation for the scientific community, offering autosomal STR data quality control and reliable STR genotype estimates. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Human Chromosome Y and Haplogroups; introducing YDHS Database.
Tiirikka, Timo; Moilanen, Jukka S
2015-12-01
As the high throughput sequencing efforts generate more biological information, scientists from different disciplines are interpreting the polymorphisms that make us unique. In addition, there is an increasing trend in general public to research their own genealogy, find distant relatives and to know more about their biological background. Commercial vendors are providing analyses of mitochondrial and Y-chromosomal markers for such purposes. Clearly, an easy-to-use free interface to the existing data on the identified variants would be in the interest of general public and professionals less familiar with the field. Here we introduce a novel metadatabase YDHS that aims to provide such an interface for Y-chromosomal DNA (Y-DNA) haplogroups and sequence variants. The database uses ISOGG Y-DNA tree as the source of mutations and haplogroups and by using genomic positions of the mutations the database links them to genes and other biological entities. YDHS contains analysis tools for deeper Y-SNP analysis. YDHS addresses the shortage of Y-DNA related databases. We have tested our database using a set of different cases from literature ranging from infertility to autism. The database is at http://www.semanticgen.net/ydhs Y-chromosomal DNA (Y-DNA) haplogroups and sequence variants have not been in the scientific limelight, excluding certain specialized fields like forensics, mainly because there is not much freely available information or it is scattered in different sources. However, as we have demonstrated Y-SNPs do play a role in various cases on the haplogroup level and it is possible to create a free Y-DNA dedicated bioinformatics resource.
Y-STR Haplogroup Diversity in the Jat Population Reveals Several Different Ancient Origins.
Mahal, David G; Matsoukas, Ianis G
2017-01-01
The Jats represent a large ethnic community that has inhabited the northwest region of India and Pakistan for several thousand years. It is estimated the community has a population of over 123 million people. Many historians and academics have asserted that the Jats are descendants of Aryans, Scythians, or other ancient people that arrived and lived in northern India at one time. Essentially, the specific origin of these people has remained a matter of contention for a long time. This study demonstrated that the origins of Jats can be clarified by identifying their Y-chromosome haplogroups and tracing their genetic markers on the Y-DNA haplogroup tree. A sample of 302 Y-chromosome haplotypes of Jats in India and Pakistan was analyzed. The results showed that the sample population had several different lines of ancestry and emerged from at least nine different geographical regions of the world. It also became evident that the Jats did not have a unique set of genes, but shared an underlying genetic unity with several other ethnic communities in the Indian subcontinent. A startling new assessment of the genetic ancient origins of these people was revealed with DNA science.
Reduction of Powerplex(®) Y23 reaction volume for genotyping buccal cell samples on FTA(TM) cards.
Raziel, Aliza; Dell'Ariccia-Carmon, Aviva; Zamir, Ashira
2015-01-01
PowerPlex(®) Y23 is a novel kit for Y-STR typing that includes new highly discriminating loci. The Israel DNA Database laboratory has recently adopted it for routine Y-STR analysis. This study examined PCR amplification from 1.2-mm FTA punch in reduced volumes of 5 and 10 μL. Direct amplification and washing of the FTA punches were examined in different PCR cycle numbers. One short robotically performed wash was found to improve the quality and the percent of profiles obtained. The optimal PCR cycle number was determined for 5 and 10 μL reaction volumes. The percent of obtained profiles, color balance, and reproducibility were examined. High-quality profiles were achieved in 90% and 88% of the samples amplified in 5 and 10 μL, respectively, in the first attempt. Volume reduction to 5 μL has a vast economic impact especially for DNA database laboratories. © 2014 American Academy of Forensic Sciences.
Variation analysis and gene annotation of eight MHC haplotypes: The MHC Haplotype Project
Horton, Roger; Gibson, Richard; Coggill, Penny; Miretti, Marcos; Allcock, Richard J.; Almeida, Jeff; Forbes, Simon; Gilbert, James G. R.; Halls, Karen; Harrow, Jennifer L.; Hart, Elizabeth; Howe, Kevin; Jackson, David K.; Palmer, Sophie; Roberts, Anne N.; Sims, Sarah; Stewart, C. Andrew; Traherne, James A.; Trevanion, Steve; Wilming, Laurens; Rogers, Jane; de Jong, Pieter J.; Elliott, John F.; Sawcer, Stephen; Todd, John A.; Trowsdale, John
2008-01-01
The human major histocompatibility complex (MHC) is contained within about 4 Mb on the short arm of chromosome 6 and is recognised as the most variable region in the human genome. The primary aim of the MHC Haplotype Project was to provide a comprehensively annotated reference sequence of a single, human leukocyte antigen-homozygous MHC haplotype and to use it as a basis against which variations could be assessed from seven other similarly homozygous cell lines, representative of the most common MHC haplotypes in the European population. Comparison of the haplotype sequences, including four haplotypes not previously analysed, resulted in the identification of >44,000 variations, both substitutions and indels (insertions and deletions), which have been submitted to the dbSNP database. The gene annotation uncovered haplotype-specific differences and confirmed the presence of more than 300 loci, including over 160 protein-coding genes. Combined analysis of the variation and annotation datasets revealed 122 gene loci with coding substitutions of which 97 were non-synonymous. The haplotype (A3-B7-DR15; PGF cell line) designated as the new MHC reference sequence, has been incorporated into the human genome assembly (NCBI35 and subsequent builds), and constitutes the largest single-haplotype sequence of the human genome to date. The extensive variation and annotation data derived from the analysis of seven further haplotypes have been made publicly available and provide a framework and resource for future association studies of all MHC-associated diseases and transplant medicine. PMID:18193213
Vullo, Carlos; Gomes, Verónica; Romanini, Carola; Oliveira, Andréa M; Rocabado, Omar; Aquino, Juliana; Amorim, António; Gusmão, Leonor
2015-07-01
For the correct evaluation of the weight of genetic evidence in a forensic context, databases must reflect the structure of the population, with all possible groups being represented. Countries with a recent history of admixture between strongly differentiated populations are usually highly heterogeneous and sub-structured. Bolivia is one of these countries, with a high diversity of ethnic groups and different levels of admixture (among Native Americans, Europeans and Africans) across the territory. For a better characterization of the male lineages in Bolivia, 17 Y-STR and 42 Y-SNP loci were genotyped in samples from La Paz and Chuquisaca. Only European and Native American Y-haplogroups were detected, and no sub-Saharan African chromosomes were found. Significant differences were observed between the two samples, with a higher frequency of European lineages in Chuquisaca than in La Paz. A sample belonging to haplogroup Q1a3a1a1-M19 was detected in La Paz, in a haplotype background different from those previously found in Argentina. This result supports an old M19 North-south dispersion in South America, possibly via two routes. When comparing the ancestry of each individual assessed through his Y chromosome with the one estimated using autosomal AIMs, (a) increased European ancestry in individuals with European Y chromosomes and (b) higher Native American ancestry in the carriers of Native American Y-haplogroups were observed, revealing an association between autosomal and Y-chromosomal markers. The results of this study demonstrate that a sub-structure does exist in Bolivia at both inter- and intrapopulation levels, a fact which must be taken into account in the evaluation of forensic genetic evidence.
Paleolithic Y-haplogroup heritage predominates in a Cretan highland plateau.
Martinez, Laisel; Underhill, Peter A; Zhivotovsky, Lev A; Gayden, Tenzin; Moschonas, Nicholas K; Chow, Cheryl-Emiliane T; Conti, Simon; Mamolini, Elisabetta; Cavalli-Sforza, L Luca; Herrera, Rene J
2007-04-01
The island of Crete, credited by some historical scholars as a central crucible of western civilization, has been under continuous archeological investigation since the second half of the nineteenth century. In the present work, the geographic stratification of the contemporary Cretan Y-chromosome gene pool was assessed by high-resolution haplotyping to investigate the potential imprints of past colonization episodes and the population substructure. In addition to analyzing the possible geographic origins of Y-chromosome lineages in relatively accessible areas of the island, this study includes samples from the isolated interior of the Lasithi Plateau--a mountain plain located in eastern Crete. The potential significance of the results from the latter region is underscored by the possibility that this region was used as a Minoan refugium. Comparisons of Y-haplogroup frequencies among three Cretan populations as well as with published data from additional Mediterranean locations revealed significant differences in the frequency distributions of Y-chromosome haplogroups within the island. The most outstanding differences were observed in haplogroups J2 and R1, with the predominance of haplogroup R lineages in the Lasithi Plateau and of haplogroup J lineages in the more accessible regions of the island. Y-STR-based analyses demonstrated the close affinity that R1a1 chromosomes from the Lasithi Plateau shared with those from the Balkans, but not with those from lowland eastern Crete. In contrast, Cretan R1b microsatellite-defined haplotypes displayed more resemblance to those from Northeast Italy than to those from Turkey and the Balkans.
Toward Male Individualization with Rapidly Mutating Y-Chromosomal Short Tandem Repeats
Ballantyne, Kaye N; Ralf, Arwin; Aboukhalid, Rachid; Achakzai, Niaz M; Anjos, Maria J; Ayub, Qasim; Balažic, Jože; Ballantyne, Jack; Ballard, David J; Berger, Burkhard; Bobillo, Cecilia; Bouabdellah, Mehdi; Burri, Helen; Capal, Tomas; Caratti, Stefano; Cárdenas, Jorge; Cartault, François; Carvalho, Elizeu F; Carvalho, Monica; Cheng, Baowen; Coble, Michael D; Comas, David; Corach, Daniel; D'Amato, Maria E; Davison, Sean; de Knijff, Peter; De Ungria, Maria Corazon A; Decorte, Ronny; Dobosz, Tadeusz; Dupuy, Berit M; Elmrghni, Samir; Gliwiński, Mateusz; Gomes, Sara C; Grol, Laurens; Haas, Cordula; Hanson, Erin; Henke, Jürgen; Henke, Lotte; Herrera-Rodríguez, Fabiola; Hill, Carolyn R; Holmlund, Gunilla; Honda, Katsuya; Immel, Uta-Dorothee; Inokuchi, Shota; Jobling, Mark A; Kaddura, Mahmoud; Kim, Jong S; Kim, Soon H; Kim, Wook; King, Turi E; Klausriegler, Eva; Kling, Daniel; Kovačević, Lejla; Kovatsi, Leda; Krajewski, Paweł; Kravchenko, Sergey; Larmuseau, Maarten H D; Lee, Eun Young; Lessig, Ruediger; Livshits, Ludmila A; Marjanović, Damir; Minarik, Marek; Mizuno, Natsuko; Moreira, Helena; Morling, Niels; Mukherjee, Meeta; Munier, Patrick; Nagaraju, Javaregowda; Neuhuber, Franz; Nie, Shengjie; Nilasitsataporn, Premlaphat; Nishi, Takeki; Oh, Hye H; Olofsson, Jill; Onofri, Valerio; Palo, Jukka U; Pamjav, Horolma; Parson, Walther; Petlach, Michal; Phillips, Christopher; Ploski, Rafal; Prasad, Samayamantri P R; Primorac, Dragan; Purnomo, Gludhug A; Purps, Josephine; Rangel-Villalobos, Hector; Rębała, Krzysztof; Rerkamnuaychoke, Budsaba; Gonzalez, Danel Rey; Robino, Carlo; Roewer, Lutz; Rosa, Alexandra; Sajantila, Antti; Sala, Andrea; Salvador, Jazelyn M; Sanz, Paula; Schmitt, Cornelia; Sharma, Anil K; Silva, Dayse A; Shin, Kyoung-Jin; Sijen, Titia; Sirker, Miriam; Siváková, Daniela; Škaro, Vedrana; Solano-Matamoros, Carlos; Souto, Luis; Stenzl, Vlastimil; Sudoyo, Herawati; Syndercombe-Court, Denise; Tagliabracci, Adriano; Taylor, Duncan; Tillmar, Andreas; Tsybovsky, Iosif S; Tyler-Smith, Chris; van der Gaag, Kristiaan J; Vanek, Daniel; Völgyi, Antónia; Ward, Denise; Willemse, Patricia; Yap, Eric PH; Yong, Rita YY; Pajnič, Irena Zupanič; Kayser, Manfred
2014-01-01
Relevant for various areas of human genetics, Y-chromosomal short tandem repeats (Y-STRs) are commonly used for testing close paternal relationships among individuals and populations, and for male lineage identification. However, even the widely used 17-loci Yfiler set cannot resolve individuals and populations completely. Here, 52 centers generated quality-controlled data of 13 rapidly mutating (RM) Y-STRs in 14,644 related and unrelated males from 111 worldwide populations. Strikingly, >99% of the 12,272 unrelated males were completely individualized. Haplotype diversity was extremely high (global: 0.9999985, regional: 0.99836–0.9999988). Haplotype sharing between populations was almost absent except for six (0.05%) of the 12,156 haplotypes. Haplotype sharing within populations was generally rare (0.8% nonunique haplotypes), significantly lower in urban (0.9%) than rural (2.1%) and highest in endogamous groups (14.3%). Analysis of molecular variance revealed 99.98% of variation within populations, 0.018% among populations within groups, and 0.002% among groups. Of the 2,372 newly and 156 previously typed male relative pairs, 29% were differentiated including 27% of the 2,378 father–son pairs. Relative to Yfiler, haplotype diversity was increased in 86% of the populations tested and overall male relative differentiation was raised by 23.5%. Our study demonstrates the value of RM Y-STRs in identifying and separating unrelated and related males and provides a reference database. PMID:24917567
Seman, Ali; Sapawi, Azizian Mohd; Salleh, Mohd Zaki
2015-06-01
Y-chromosome short tandem repeats (Y-STRs) are genetic markers with practical applications in human identification. However, where mass identification is required (e.g., in the aftermath of disasters with significant fatalities), the efficiency of the process could be improved with new statistical approaches. Clustering applications are relatively new tools for large-scale comparative genotyping, and the k-Approximate Modal Haplotype (k-AMH), an efficient algorithm for clustering large-scale Y-STR data, represents a promising method for developing these tools. In this study we improved the k-AMH and produced three new algorithms: the Nk-AMH I (including a new initial cluster center selection), the Nk-AMH II (including a new dominant weighting value), and the Nk-AMH III (combining I and II). The Nk-AMH III was the superior algorithm, with mean clustering accuracy that increased in four out of six datasets and remained at 100% in the other two. Additionally, the Nk-AMH III achieved a 2% higher overall mean clustering accuracy score than the k-AMH, as well as optimal accuracy for all datasets (0.84-1.00). With inclusion of the two new methods, the Nk-AMH III produced an optimal solution for clustering Y-STR data; thus, the algorithm has potential for further development towards fully automatic clustering of any large-scale genotypic data.
Alves, Cíntia; Gusmão, Leonor; Damasceno, Albertino; Soares, Benilde; Amorim, António
2004-01-28
Allele frequencies, together with some parameters of forensic interest, for 17 STRs included in the AmpF/STR Identifiler (CSF1PO, D2S1338, D3S1358, D5S818, D7S820, D8S1179, D13S317, D16S539, D18S51, D19S433, D21S11, FGA, TH01, TPO and VWA) and Powerplex 16 System (CSF1PO, D3S1358, D5S818, D7S820, D8S1179, D13S317, D16S539, D18S51, D21S11, FGA, Penta D, Penta E, TH01, TPO and VWA) were estimated from a sample of 135-144 unrelated individuals from Mozambique. No deviations from Hardy-Weinberg equilibrium were observed with the exception of the FGA locus (using the Bonferroni correction for the number of loci analysed, the departure observed at this locus was not significant). Comparative analyses between our population data and other African databases, namely Promega's African-Americans, AB Applied Biosystems African-Americans and two other population samples from Mozambique and Guiné Bissau, are presented and discussed. Genotype inconsistencies between both commercial kits (for D16S539 and D8S1179) and other genotypic variations (three-banded allele patterns for TPO) are also reported.
Da Fré, Nicole Nascimento; Rodenbusch, Rodrigo; Gastaldo, André Zoratto; Hanson, Erin; Ballantyne, Jack; Alho, Clarice Sampaio
2015-11-01
We evaluated haplotype and allele frequencies, as well as statistical forensic parameters, for 23 Y-chromosome short tandem repeats (STRs) loci of the PowerPlex®Y23 system (DYS19, DYS385a/b, DYS389I/II, DYS390, DYS391, DYS392, DYS393, DYS437, DYS438, DYS439, DYS448, DYS456, DYS458, DYS635, Y-GATA-H4, DYS481, DYS533, DYS549, DYS570, DYS576, DYS643) in a sample of 150 apparently healthy males, resident in South Brazil. A total of 150 different haplotypes were identified. The highest gene diversity (GD) was observed for the single locus marker DYS570 (GD = 0.7888) and for a two-locus system DYS385 (GD = 0.9009). We also examined 150 father-son pairs by the same system, and a total of 13 mutations were identified in the 3450 father-son allelic transfers, with an overall mutation rate across the 23 loci of 3.768 × 10(-3) (95% CI: 3.542 × 10(-3) to 3.944 × 10(-3)). In all cases there was only one locus mutated with gain/loss of repeats in the son (5 one-repeat gains, and 7 one-repeat and 1 two-repeat losses); we observed no instances of mutations involving a non-integral number of repeats.
Y-Chromosomal Diversity in Lebanon Is Structured by Recent Historical Events
Zalloua, Pierre A.; Xue, Yali; Khalife, Jade; Makhoul, Nadine; Debiane, Labib; Platt, Daniel E.; Royyuru, Ajay K.; Herrera, Rene J.; Hernanz, David F. Soria; Blue-Smith, Jason; Wells, R. Spencer; Comas, David; Bertranpetit, Jaume; Tyler-Smith, Chris
2008-01-01
Lebanon is an eastern Mediterranean country inhabited by approximately four million people with a wide variety of ethnicities and religions, including Muslim, Christian, and Druze. In the present study, 926 Lebanese men were typed with Y-chromosomal SNP and STR markers, and unusually, male genetic variation within Lebanon was found to be more strongly structured by religious affiliation than by geography. We therefore tested the hypothesis that migrations within historical times could have contributed to this situation. Y-haplogroup J∗(xJ2) was more frequent in the putative Muslim source region (the Arabian Peninsula) than in Lebanon, and it was also more frequent in Lebanese Muslims than in Lebanese non-Muslims. Conversely, haplogroup R1b was more frequent in the putative Christian source region (western Europe) than in Lebanon and was also more frequent in Lebanese Christians than in Lebanese non-Christians. The most common R1b STR-haplotype in Lebanese Christians was otherwise highly specific for western Europe and was unlikely to have reached its current frequency in Lebanese Christians without admixture. We therefore suggest that the Islamic expansion from the Arabian Peninsula beginning in the seventh century CE introduced lineages typical of this area into those who subsequently became Lebanese Muslims, whereas the Crusader activity in the 11th–13th centuries CE introduced western European lineages into Lebanese Christians. PMID:18374297
Meadows, J R S; Kijas, J W
2009-02-01
The male-specific region of the ovine Y chromosome (MSY) remains poorly characterized, yet sequence variants from this region have the potential to reveal the wild progenitor of domestic sheep or examples of domestic and wild paternal introgression. The 5' promoter region of the sex-determining gene SRY was re-sequenced using a subset of wild sheep including bighorn (Ovis canadensis), thinhorn (Ovis dalli spp.), urial (Ovis vignei), argali (Ovis ammon), mouflon (Ovis musimon) and domestic sheep (Ovis aries). Seven novel SNPs (oY2-oY8) were revealed; these were polymorphic between but not within species. Re-sequencing and fragment analysis was applied to the MSY microsatellite SRYM18. It contains a complex compound repeat structure and sequencing of three novel size fragments revealed that a pentanucleotide element remained fixed, whilst a dinucleotide element displayed variability within species. Comparison of the sequence between species revealed that urial and argali sheep grouped more closely to the mouflon and domestic breeds than the pachyceriforms (bighorn and thinhorn). SNP and microsatellite data were combined to define six previously undetected haplotypes. Analysis revealed the mouflon as the only species to share a haplotype with domestic sheep, consistent with its status as a feral domesticate that has undergone male-mediated exchange with domestic animals. A comparison of the remaining wild species and domestic sheep revealed that O. aries is free from signatures of wild sheep introgression.
Afghanistan from a Y-chromosome perspective.
Lacau, Harlette; Gayden, Tenzin; Regueiro, Maria; Chennakrishnaiah, Shilpa; Bukhari, Areej; Underhill, Peter A; Garcia-Bertrand, Ralph L; Herrera, Rene J
2012-10-01
Central Asia has served as a corridor for human migrations providing trading routes since ancient times. It has functioned as a conduit connecting Europe and the Middle East with South Asia and far Eastern civilizations. Therefore, the study of populations in this region is essential for a comprehensive understanding of early human dispersal on the Eurasian continent. Although Y- chromosome distributions in Central Asia have been widely surveyed, present-day Afghanistan remains poorly characterized genetically. The present study addresses this lacuna by analyzing 190 Pathan males from Afghanistan using high-resolution Y-chromosome binary markers. In addition, haplotype diversity for its most common lineages (haplogroups R1a1a*-M198 and L3-M357) was estimated using a set of 15 Y-specific STR loci. The observed haplogroup distribution suggests some degree of genetic isolation of the northern population, likely due to the Hindu Kush mountain range separating it from the southern Afghans who have had greater contact with neighboring Pathans from Pakistan and migrations from the Indian subcontinent. Our study demonstrates genetic similarities between Pathans from Afghanistan and Pakistan, both of which are characterized by the predominance of haplogroup R1a1a*-M198 (>50%) and the sharing of the same modal haplotype. Furthermore, the high frequencies of R1a1a-M198 and the presence of G2c-M377 chromosomes in Pathans might represent phylogenetic signals from Khazars, a common link between Pathans and Ashkenazi groups, whereas the absence of E1b1b1a2-V13 lineage does not support their professed Greek ancestry.
Manco, Licínio; Albuquerque, Joana; Sousa, Maria Francisca; Martiniano, Rui; de Oliveira, Ricardo Costa; Marques, Sofia; Gomes, Verónica; Amorim, António; Alvarez, Luís; Prata, Maria João
2018-03-01
We examined internal lineages and haplotype diversity in Portuguese samples belonging to J-M304 to improve the spatial and temporal understanding of the introduction of this haplogroup in Iberia, using the available knowledge about the phylogeography of its main branches, J1-M267 and J2-M172. A total of 110 males of Portuguese descent were analyzed for 17 Y-chromosome bi-allelic markers and seven Y-chromosome short tandem repeats (Y-STR) loci. Among J1-M267 individuals (n = 36), five different sub-haplogroups were identified, with the most common being J1a2b2-L147.1 (∼72%), which encompassed the majority of representatives of the J1a2b-P58 subclade. One sample belonged to the rare J1a1-M365.1 lineage and presented a core Y-STR haplotype consistent with the Iberian settlement during the fifth century by the Alans, a people of Iranian heritage. The analysis of J2-M172 Portuguese males (n = 74) enabled the detection of the two main subclades at very dissimilar frequencies, J2a-M410 (∼80%) and J2b-M12 (∼20%), among which the most common branches were J2a1(xJ2a1b,h)-L26 (22.9%), J2a1b(xJ2a1b1)-M67 (20.3%), J2a1h-L24 (27%), and J2b2-M241 (20.3%). While previous inferences based on modern haplogroup J Y-chromosomes implicated a main Neolithic dissemination, here we propose a later arrival of J lineages into Iberia using a combination of novel Portuguese Y-chromosomal data and recent evidence from ancient DNA. Our analysis suggests that a substantial tranche of J1-M267 lineages was likely carried into the Iberian Peninsula as a consequence of the trans-Mediterranean contacts during the first millennium BC, while most of the J2-M172 lineages may be associated with post-Neolithic population movements within Europe. © 2017 Wiley Periodicals, Inc.
Male-specific contributions to the Brazilian population of Espirito Santo.
de F Figueiredo, Raquel; Ambrosio, Isabela B; Braganholi, Danilo F; Chemale, Gustavo; Martins, Joyce A; Gomes, Veronica; Gusmão, Leonor; Cicarelli, Regina M B
2016-05-01
Y chromosome markers have been widely studied due to their various applications in the fields of forensic and evolutionary genetics. In this study, 35 Y-SNPs and 17 Y-STRs were genotyped in 253 males from the State of Espirito Santo, Brazil. A total of 18 haplogroups and 243 haplotypes were detected; the haplogroup and haplotype diversities were 0.7794 and 0.9997, respectively. Genetic distance analysis using the Y-STR data showed no statistically significant differences between Espirito Santo and other admixed populations from Brazil. The classification of paternal lineages based on haplogroups showed a predominant European contribution (85.88%), followed by African (11.37%) and Amerindian (2.75%) contributions.
Forensic parameters of the X-STR Decaplex system in Mexican populations.
Mariscal Ramos, C; Martínez-Cortes, G; Ramos-González, B; Rangel-Villalobos, H
2018-03-01
We studied the X-STR decaplex system in 529 DNA female samples of Mexican populations from five geographic regions. Allele frequencies and forensic parameters were estimated in each region and in the pooled Mexican population. Genotype distribution by locus was in agreement with Hardy-Weinberg expectations in each Mexican population sample. Similarly, linkage equilibrium was demonstrated between pair of loci. Pairwise comparisons and genetic distances between Mexican, Iberoamerican and one African populations were estimated and graphically represented. Interestingly, a non-significant interpopulation differentiation was detected (Fst = 0.0021; p = .74389), which allows using a global Mexican database for forensic interpretation of X-STR genotypes. Copyright © 2017 Elsevier B.V. All rights reserved.
Post-Mortem Identification of a Fire Carbonized Body by STR Genotyping.
Dumache, Raluca; Muresan, Camelia; Ciocan, Veronica; Rogobete, Alexandru F; Enache, Alexandra
2016-10-01
Identification of bodies of unknown identity that are victims of exposure to very high temperatures, resulting from fires, plane crashes, and terrorist attacks, represents one of the most difficult sides of forensic genetics, because of the advanced state of decomposition. The aim of this study was the identification of the carbonized cadaver of a fire victim through STR genotyping. We used blood samples obtained from the iliac artery during the autopsy examination as biological samples from the unidentified victim. After DNA isolation and quantification, we proceeded to its amplification using the multiplex PCR kit AmpFlSTR Identifiler. The DNA products were separated using an ABI 3500 genetic analyzer. Further analysis of the data was done using Gene Mapper ID-X version 1.4 software. In this case, it was possible to obtain a complete DNA profile from the biological samples. Due to the fact that the amelogenin gene presented two alleles, X and Y, we concluded that the victim was a man. We conclude that STR profiling of unidentified bodies (carbonized, decomposed) represents a powerful method of human identification in forensic medicine.
Allele frequency distribution for 21 autosomal STR loci in Nepal.
Kraaijenbrink, T; van Driem, G L; Opgenort, J R M L; Tuladhar, N M; de Knijff, P
2007-05-24
The allele frequency distributions of 21 autosomal loci contained in the AmpFlSTR Identifiler, the Powerplex 16 and the FFFL multiplex PCR kits, was studied in 953 unrelated individuals from Nepal. Several new alleles (i.e. not yet reported in the NIST Short Tandem Repeat DNA Internet DataBase [http://www.cstl.nist.gov/biotech/strbase/]) have been detected in the process.
Ancestral inference from haplotypes and mutations.
Griffiths, Robert C; Tavaré, Simon
2018-04-25
We consider inference about the history of a sample of DNA sequences, conditional upon the haplotype counts and the number of segregating sites observed at the present time. After deriving some theoretical results in the coalescent setting, we implement rejection sampling and importance sampling schemes to perform the inference. The importance sampling scheme addresses an extension of the Ewens Sampling Formula for a configuration of haplotypes and the number of segregating sites in the sample. The implementations include both constant and variable population size models. The methods are illustrated by two human Y chromosome datasets. Copyright © 2018. Published by Elsevier Inc.
Calacal, Gayvelline C; Delfin, Frederick C; Tan, Michelle Music M; Roewer, Lutz; Magtanong, Danilo L; Lara, Myra C; Fortun, Raquel dR; De Ungria, Maria Corazon A
2005-09-01
In a fire tragedy in Manila in December 1998, one of the worst tragic incidents which resulted in the reported death of 23 children, identity could not be established initially resulting in the burial of still unidentified bodies. Underscoring the importance of identifying each of the human remains, the bodies were exhumed 3 months after the tragedy. We describe here our work, which was the first national case handled by local laboratories wherein conventional and molecular-based techniques were successfully applied in forensic identification. The study reports analysis of DNA obtained from skeletal remains exposed to conditions of burning, burial, and exhumation. DNA typing methods using autosomal and Y-chromosomal short tandem repeat (Y-STR) markers reinforced postmortem examinations using conventional identification techniques. The strategy resulted in the identification of 18 out of the 21 human remains analyzed, overcoming challenges encountered due to the absence of established procedures for the recovery of mass disaster remains. There was incomplete antemortem information to match the postmortem data obtained from the remains of 3 female child victims. Two victims were readily identified due to the availability of antemortem tissues. In the absence of this biologic material, parentage testing was performed using reference blood samples collected from parents and relatives. Data on patrilineal lineage based on common Y-STR haplotypes augmented autosomal DNA typing, particularly in deficiency cases.
Zhang, RuiJie; Li, Xia; Jiang, YongShuai; Liu, GuiYou; Li, ChuanXing; Zhang, Fan; Xiao, Yun; Gong, BinSheng
2009-02-01
High-throughout single nucleotide polymorphism detection technology and the existing knowledge provide strong support for mining the disease-related haplotypes and genes. In this study, first, we apply four kinds of haplotype identification methods (Confidence Intervals, Four Gamete Tests, Solid Spine of LD and fusing method of haplotype block) into high-throughout SNP genotype data to identify blocks, then use cluster analysis to verify the effectiveness of the four methods, and select the alcoholism-related SNP haplotypes through risk analysis. Second, we establish a mapping from haplotypes to alcoholism-related genes. Third, we inquire NCBI SNP and gene databases to locate the blocks and identify the candidate genes. In the end, we make gene function annotation by KEGG, Biocarta, and GO database. We find 159 haplotype blocks, which relate to the alcoholism most possibly on chromosome 1 approximately 22, including 227 haplotypes, of which 102 SNP haplotypes may increase the risk of alcoholism. We get 121 alcoholism-related genes and verify their reliability by the functional annotation of biology. In a word, we not only can handle the SNP data easily, but also can locate the disease-related genes precisely by combining our novel strategies of mining alcoholism-related haplotypes and genes with existing knowledge framework.
Detecting local haplotype sharing and haplotype association
USDA-ARS?s Scientific Manuscript database
A novel haplotype association method is presented, and its power is demonstrated. Relying on a statistical model for linkage disequilibrium (LD), the method first infers ancestral haplotypes and their loadings at each marker for each individual. The loadings are then used to quantify local haplotype...
Doğan, Serkan; Doğan, Gŭlşen; Ašić, Adna; Besić, Larisa; Klimenta, Biljana; Hukić, Mirsada; Turan, Yusuf; Primorac, Dragan; Marjanović, Damir
2016-04-01
Analysis of Y-chromosome haplogroup distribution is widely used when investigating geographical clustering of different populations, which is why it plays an important role in population genetics, human migration patterns and even in forensic investigations. Individual determination of these haplogroups is mostly based on the analysis of single nucleotide polymorphism (SNP) markers located in the non-recombining part of Y-chromosome (NRY). On the other hand, the number of forensic and anthropology studies investigating short tandem repeats on the Y-chromosome (Y-STRs) increases rapidly every year. During the last few years, these markers have been successfully used as haplogroup prediction methods, which is why they have been used in this study. Previously obtained Y-STR haplotypes (23 loci) from 100 unrelated Turkish males recently settled in Sarajevo were used for the determination of haplogroups via 'Whit Athey's Haplogroup Predictor' software. The Bayesian probability of 90 of the studied haplotypes is greater than 92.2% and ranges from 51.4% to 84.3% for the remaining 10 haplotypes. A distribution of 17 different haplogroups was found, with the Y- haplogroup J2a being most prevalent, having been found in 26% of all the samples, whereas R1b, G2a and R1a were less prevalent, covering a range of 10% to 15% of all the samples. Together, these four haplogroups account for 63% of all Y-chromosomes. Eleven haplogroups (E1b1b, G1, I1, I2a, I2b, J1, J2b, L, Q, R2, and T) range from 2% to 5%, while E1b1a and N are found in 1% of all samples. Obtained results indicate that a large majority of the Turkish paternal line belongs to West Asia, Europe Caucasus, Western Europe, Northeast Europe, Middle East, Russia, Anatolia, and Black Sea Y-chromosome lineages. As the distribution of Y-chromosome haplogroups is consistent with the previously published data for the Turkish population residing in Turkey, it was concluded that the analyzed population could also be recognized as
An Ultra-High Discrimination Y Chromosome Short Tandem Repeat Multiplex DNA Typing System
Hanson, Erin K.; Ballantyne, Jack
2007-01-01
In forensic casework, Y chromosome short tandem repeat markers (Y-STRs) are often used to identify a male donor DNA profile in the presence of excess quantities of female DNA, such as is found in many sexual assault investigations. Commercially available Y-STR multiplexes incorporating 12–17 loci are currently used in forensic casework (Promega's PowerPlex® Y and Applied Biosystems' AmpFlSTR® Yfiler®). Despite the robustness of these commercial multiplex Y-STR systems and the ability to discriminate two male individuals in most cases, the coincidence match probabilities between unrelated males are modest compared with the standard set of autosomal STR markers. Hence there is still a need to develop new multiplex systems to supplement these for those cases where additional discriminatory power is desired or where there is a coincidental Y-STR match between potential male participants. Over 400 Y-STR loci have been identified on the Y chromosome. While these have the potential to increase the discrimination potential afforded by the commercially available kits, many have not been well characterized. In the present work, 91 loci were tested for their relative ability to increase the discrimination potential of the commonly used ‘core’ Y-STR loci. The result of this extensive evaluation was the development of an ultra high discrimination (UHD) multiplex DNA typing system that allows for the robust co-amplification of 14 non-core Y-STR loci. Population studies with a mixed African American and American Caucasian sample set (n = 572) indicated that the overall discriminatory potential of the UHD multiplex was superior to all commercial kits tested. The combined use of the UHD multiplex and the Applied Biosystems' AmpFlSTR® Yfiler® kit resulted in 100% discrimination of all individuals within the sample set, which presages its potential to maximally augment currently available forensic casework markers. It could also find applications in human evolutionary
Das, Manuj K; Chetry, Sumi; Kalita, Mohan C; Dutta, Prafulla
2016-12-01
North-east region of India has consistent role in the spread of multi drug resistant Plasmodium (P.) falciparum to other parts of Southeast Asia. After rapid clinical treatment failure of Artemisinin based combination therapy-Sulphadoxine/Pyrimethamine (ACT-SP) chemoprophylaxis, Artemether-Lumefantrine (ACT-AL) combination therapy was introduced in the year 2012 in this region for the treatment of uncomplicated P. falciparum malaria. In a DNA sequencing based polymorphism analysis, seven codons of P. falciparum dihydropteroate synthetase ( Pf dhps) gene were screened in a total of 127 P. falciparum isolates collected from Assam, Arunachal Pradesh and Tripura of North-east India during the year 2014 and 2015 to document current sulfadoxine resistant haplotypes. Sequences were analyzed to rearrange both nucleotide and protein haplotypes. Molecular diversity indices were analyzed in DNA Sequence Polymorphism software (DnaSP) on the basis of Pf dhps gene sequences. Disappearance from selective neutrality was assessed based on the ratio of non-synonomous to synonomous nucleotide substitutions [dN/dS ratio]. Moreover, two-tailed Z test was performed in search of the significance for probability of rejecting null hypothesis of strict neutrality [dN = dS]. Presence of mutant P. falciparum multidrug resistance protein1 ( Pf mdr1) was also checked in those isolates that were present with new Pf dhps haplotypes. Phylogenetic relationship based on Pf dhps gene was reconstructed in Molecular Evolutionary Genetics Analysis (MEGA). Among eight different sulfadoxine resistant haplotypes found, IS GNG A haplotype was documented in a total of five isolates from Tripura with association of a new mutant M538 R allele. Sequence analysis of Pf mdr1 gene in these five isolates came to notice that not all but only one isolate was mutant at codon 86 (N86 Y ; Y YSND) in the multidrug resistance protein. Molecular diversity based on Pf dhps haplotypes revealed that P. falciparum
STR data for 15 autosomal STR markers from Paraná (Southern Brazil).
Alves, Hemerson B; Leite, Fábio P N; Sotomaior, Vanessa S; Rueda, Fábio F; Silva, Rosane; Moura-Neto, Rodrigo S
2014-03-01
Allelic frequencies for 15 STR autosomal loci, using AmpFℓSTR® Identifiler™, forensic, and statistical parameters were calculated. All loci reached the Hardy-Weinberg equilibrium. The combined power of discrimination and mean power of exclusion were 0.999999999999999999 and 0.9999993, respectively. The MDS plot and NJ tree analysis, generated by FST matrix, corroborated the notion of the origins of the Paraná population as mainly European-derived. The combination of these 15 STR loci represents a powerful strategy for individual identification and parentage analyses for the Paraná population.
Soodyall, Himla
2013-10-11
Previous historical, anthropological and genetic data provided overwhelming support for the Semitic origins of the Lemba, a Bantu-speaking people in southern Africa. To revisit the question concerning genetic affinities between the Lemba and Jews. Y-chromosome variation was examined in two Lemba groups: one from South Africa (SA) and, for the first time, a group from Zimbabwe (Remba), to re-evaluate the previously reported Jewish link. A sample of 261 males (76 Lemba, 54 Remba, 43 Venda and 88 SA Jews) was initially analysed for 16 bi-allelic and 6 short tandem repeats (STRs) that resulted in the resolution of 102 STR haplotypes distributed across 13 haplogroups. The non-African component in the Lemba and Remba was estimated to be 73.7% and 79.6%, respectively. In addition, a subset of 91 individuals (35 Lemba, 24 Remba, 32 SA Jews) with haplogroup J were resolved further using 6 additional bi-allelic markers and 12 STRs to screen for the extended Cohen modal haplotype (CMH). Although 24 individuals (10 Lemba and 14 SA Jews) were identified as having the original CMH (six STRs), only one SA Jew harboured the extended CMH.CONCLUSIONS. While it was not possible to trace unequivocally the origins of the non-African Y chromosomes in the Lemba and Remba, this study does not support the earlier claims of their Jewish genetic heritage.
Y-chromosomal variation in Sub-Saharan Africa: insights into the history of Niger-Congo groups
de Filippo, Cesare; Barbieri, Chiara; Whitten, Mark; Mpoloka, Sununguko Wata; Gunnarsdóttir, Ellen Drofn; Bostoen, Koen; Nyambe, Terry; Beyer, Klaus; Schreiber, Henning; de Knijff, Peter; Luiselli, Donata; Stoneking, Mark; Pakendorf, Brigitte
2013-01-01
Technological and cultural innovations, as well as climate changes, are thought to have influenced the diffusion of major language phyla in sub-Saharan Africa. The most widespread and the richest in diversity is the Niger-Congo phylum, thought to have originated in West Africa ~10,000 years ago. The expansion of Bantu languages (a family within the Niger-Congo phylum) ~5,000 years ago represents a major event in the past demography of the continent. Many previous studies on Y chromosomal variation in Africa associated the Bantu expansion with haplogroup E1b1a (and sometimes its sub-lineage E1b1a7). However, the distribution of these two lineages extends far beyond the area occupied nowadays by Bantu speaking people, raising questions on the actual genetic structure behind this expansion. To address these issues, we directly genotyped 31 biallelic markers and 12 microsatellites on the Y chromosome in 1195 individuals of African ancestry focusing on areas that were previously poorly characterized (Botswana, Burkina Faso, D.R.C, and Zambia). With the inclusion of published data, we analyzed 2736 individuals from 26 groups representing all linguistic phyla and covering a large portion of Sub-Saharan Africa. Within the Niger-Congo phylum, we ascertain for the first time differences in haplogroup composition between Bantu and non-Bantu groups via two markers (U174 and U175) on the background of haplogroup E1b1a (and E1b1a7), which were directly genotyped in our samples and for which genotypes were inferred from published data using Linear Discriminant Analysis on STR haplotypes. No reduction in STR diversity levels was found across the Bantu groups, suggesting the absence of serial founder effects. In addition, the homogeneity of haplogroup composition and pattern of haplotype sharing between Western and Eastern Bantu groups suggest that their expansion throughout Sub-Saharan Africa reflects a rapid spread followed by backward and forward migrations. Overall, we found
Y-chromosomal variation in sub-Saharan Africa: insights into the history of Niger-Congo groups.
de Filippo, Cesare; Barbieri, Chiara; Whitten, Mark; Mpoloka, Sununguko Wata; Gunnarsdóttir, Ellen Drofn; Bostoen, Koen; Nyambe, Terry; Beyer, Klaus; Schreiber, Henning; de Knijff, Peter; Luiselli, Donata; Stoneking, Mark; Pakendorf, Brigitte
2011-03-01
Technological and cultural innovations as well as climate changes are thought to have influenced the diffusion of major language phyla in sub-Saharan Africa. The most widespread and the richest in diversity is the Niger-Congo phylum, thought to have originated in West Africa ∼ 10,000 years ago (ya). The expansion of Bantu languages (a family within the Niger-Congo phylum) ∼ 5,000 ya represents a major event in the past demography of the continent. Many previous studies on Y chromosomal variation in Africa associated the Bantu expansion with haplogroup E1b1a (and sometimes its sublineage E1b1a7). However, the distribution of these two lineages extends far beyond the area occupied nowadays by Bantu-speaking people, raising questions on the actual genetic structure behind this expansion. To address these issues, we directly genotyped 31 biallelic markers and 12 microsatellites on the Y chromosome in 1,195 individuals of African ancestry focusing on areas that were previously poorly characterized (Botswana, Burkina Faso, Democratic Republic of Congo, and Zambia). With the inclusion of published data, we analyzed 2,736 individuals from 26 groups representing all linguistic phyla and covering a large portion of sub-Saharan Africa. Within the Niger-Congo phylum, we ascertain for the first time differences in haplogroup composition between Bantu and non-Bantu groups via two markers (U174 and U175) on the background of haplogroup E1b1a (and E1b1a7), which were directly genotyped in our samples and for which genotypes were inferred from published data using linear discriminant analysis on short tandem repeat (STR) haplotypes. No reduction in STR diversity levels was found across the Bantu groups, suggesting the absence of serial founder effects. In addition, the homogeneity of haplogroup composition and pattern of haplotype sharing between Western and Eastern Bantu groups suggests that their expansion throughout sub-Saharan Africa reflects a rapid spread followed by
Barton, James C; Acton, Ronald T
2002-10-07
We wanted to quantify HLA-A and -B allele and haplotype frequencies in Alabama hemochromatosis probands with HFE C282Y homozygosity and controls, and to compare results to those in other populations. Alleles were detected using DNA-based typing (probands) and microlymphocytotoxicity (controls). Alleles were determined in 139 probands (1,321 controls) and haplotypes in 118 probands (605 controls). In probands, A*03 positivity was 0.7482 (0.2739 controls; p = or < 0.0001; odds ratio (OR) 7.9); positivity for B*07, B*14, and B*56 was also increased. In probands, haplotypes A*03-B*07 and A*03-B*14 were more frequent (p < 0.0001, respectively; OR = 12.3 and 11.1, respectively). The haplotypes A*01-B*60, A*02-B*39, A*02-B*62, A*03-B*13, A*03-B*15, A*03-B*27, A*03-B*35, A*03-B*44, A*03-B*47, and A*03-B*57 were also significantly more frequent in probands. 37.3% of probands were HLA-haploidentical with other proband(s). A*03 and A*03-B*07 frequencies are increased in Alabama probands, as in other hemochromatosis cohorts. Increased absolute frequencies of A*03-B*35 have been reported only in the present Alabama probands and in hemochromatosis patients in Italy. Increased absolute frequencies of A*01-B*60, A*02-B*39, A*02-B*62, A*03-B*13, A*03-B*15, A*03-B*27, A*03-B*44, A*03-B*47, and A*03-B*57 in hemochromatosis cohorts have not been reported previously.
Yoo, Seong Yeon; Cho, Nam Soo; Park, Myung Jin; Seong, Ki Min; Hwang, Jung Ho; Song, Seok Bean; Han, Myun Soo; Lee, Won Tae; Chung, Ki Wha
2011-01-01
Genotyping of highly polymorphic short tandem repeat (STR) markers is widely used for the genetic identification of individuals in forensic DNA analyses and in paternity disputes. The National DNA Profile Databank recently established by the DNA Identification Act in Korea contains the computerized STR DNA profiles of individuals convicted of crimes. For the establishment of a large autosomal STR loci population database, 1805 samples were obtained at random from Korean individuals and 15 autosomal STR markers were analyzed using the AmpFlSTR Identifiler PCR Amplification kit. For the 15 autosomal STR markers, no deviations from the Hardy-Weinberg equilibrium were observed. The most informative locus in our data set was the D2S1338 with a discrimination power of 0.9699. The combined matching probability was 1.521 × 10-17. This large STR profile dataset including atypical alleles will be important for the establishment of the Korean DNA database and for forensic applications. PMID:21597912
Yoo, Seong Yeon; Cho, Nam Soo; Park, Myung Jin; Seong, Ki Min; Hwang, Jung Ho; Song, Seok Bean; Han, Myun Soo; Lee, Won Tae; Chung, Ki Wha
2011-07-01
Genotyping of highly polymorphic short tandem repeat (STR) markers is widely used for the genetic identification of individuals in forensic DNA analyses and in paternity disputes. The National DNA Profile Databank recently established by the DNA Identification Act in Korea contains the computerized STR DNA profiles of individuals convicted of crimes. For the establishment of a large autosomal STR loci population database, 1805 samples were obtained at random from Korean individuals and 15 autosomal STR markers were analyzed using the AmpFlSTR Identifiler PCR Amplification kit. For the 15 autosomal STR markers, no deviations from the Hardy-Weinberg equilibrium were observed. The most informative locus in our data set was the D2S1338 with a discrimination power of 0.9699. The combined matching probability was 1.521 × 10(-17). This large STR profile dataset including atypical alleles will be important for the establishment of the Korean DNA database and for forensic applications.
The clinical application of single-sperm-based SNP haplotyping for PGD of osteogenesis imperfecta.
Chen, Linjun; Diao, Zhenyu; Xu, Zhipeng; Zhou, Jianjun; Yan, Guijun; Sun, Haixiang
2018-05-15
Osteogenesis imperfecta (OI) is a genetically heterogeneous disorder, presenting either autosomal dominant, autosomal recessive or X-linked inheritance patterns. The majority of OI cases are autosomal dominant and are caused by heterozygous mutations in either the COL1A1 or COL1A2 gene. In these dominant disorders, allele dropout (ADO) can lead to misdiagnosis in preimplantation genetic diagnosis (PGD). Polymorphic markers linked to the mutated genes have been used to establish haplotypes for identifying ADO and ensuring the accuracy of PGD. However, the haplotype of male patients cannot be determined without data from affected relatives. Here, we developed a method for single-sperm-based single-nucleotide polymorphism (SNP) haplotyping via next-generation sequencing (NGS) for the PGD of OI. After NGS, 10 informative polymorphic SNP markers located upstream and downstream of the COL1A1 gene and its pathogenic mutation site were linked to individual alleles in a single sperm from an affected male. After haplotyping, a normal blastocyst was transferred to the uterus for a subsequent frozen embryo transfer cycle. The accuracy of PGD was confirmed by amniocentesis at 19 weeks of gestation. A healthy infant weighing 4,250 g was born via vaginal delivery at the 40th week of gestation. Single-sperm-based SNP haplotyping can be applied for PGD of any monogenic disorders or de novo mutations in males in whom the haplotype of paternal mutations cannot be determined due to a lack of affected relatives. ADO: allele dropout; DI: dentinogenesis imperfect; ESHRE: European Society of Human Reproduction and Embryology; FET: frozen embryo transfer; gDNA: genomic DNA; ICSI: intracytoplasmic sperm injection; IVF: in vitro fertilization; MDA: multiple displacement amplification; NGS: next-generation sequencing; OI: osteogenesis imperfect; PBS: phosphate buffer saline; PCR: polymerase chain reaction; PGD: preimplantation genetic diagnosis; SNP: single-nucleotide polymorphism; STR
Phillips, C; Gettings, K Butler; King, J L; Ballard, D; Bodner, M; Borsuk, L; Parson, W
2018-05-01
The STR sequence template file published in 2016 as part of the considerations from the DNA Commission of the International Society for Forensic Genetics on minimal STR sequence nomenclature requirements, has been comprehensively revised and audited using the latest GRCh38 genome assembly. The list of forensic STRs characterized was expanded by including supplementary autosomal, X- and Y-chromosome microsatellites in less common use for routine DNA profiling, but some likely to be adopted in future massively parallel sequencing (MPS) STR panels. We outline several aspects of sequence alignment and annotation that required care and attention to detail when comparing sequences to GRCh37 and GRCh38 assemblies, as well as the necessary matching of MPS-based allele descriptions to previously established repeat region structures described in initial sequencing studies of the less well known forensic STRs. The revised sequence guide is now available in a dynamically updated FTP format from the STRidER website with a date-stamped change log to allow users to explore their own MPS data with the most up-to-date forensic STR sequence information compiled in a simple guide. Copyright © 2018 Elsevier B.V. All rights reserved.
Forensic evaluation of STR typing reliability in lung cancer.
Zhang, Peng; Zhu, Ying; Li, Yongguo; Zhu, Shisheng; Ma, Ruoxiang; Zhao, Minzhu; Li, Jianbo
2018-01-01
Short tandem repeats (STR) analysis is the gold standard method in the forensics field for personal identification and paternity testing. In cancerous tissues, STR markers are gaining attention, with some studies showing increased instability. Lung cancer, which is one of the most commonmalignancies, has become the most lethal among all cancers. In certain situations, lung cancer tissues may be the only resource available for forensic analysis. Therefore, evaluating the reliability of STR markers in lung cancer tissues is required to avoid false exclusions. In this study, 75 lung cancer tissue samples were examined to evaluate the reliability of various STR markers. Out of the 75 examined samples, 24 of the cancerous samples (32%) showed genetic alterations on at least one STR loci, totaling 55 times. The most common type of STR variation was a partial loss of heterozygosity, with the D5S818 loci having the highest variation frequency and no alterations detected on the D2S441 and Penta E loci. Moreover, STR variation frequencies were shown to increase with an increased patient age and increased clinical and pathological characteristics, thus an older patient with an advanced stage of progression exhibited a higher variation frequency. Overall, this study provides forensic scientists with further insight into STR analysis relating to lung cancer tissue. Copyright © 2017 Elsevier B.V. All rights reserved.
[Chromosome as a chronicler: Genetic dating, historical events, and DNA-genealogic temptation].
Balanovsky, O P; Zaporozhchenko, V V
2016-07-01
Nonrecombinant portions of the genome, Y chromosome and mitochondrial DNA, are widely used for research on human population gene pools and reconstruction of their history. These systems allow the genetic dating of clusters of emerging haplotypes. The main method for age estimations is ρ statistics, which is an average number of mutations from founder haplotype to all modern-day haplotypes. A researcher can estimate the age of the cluster by multiplying this number by the mutation rate. The second method of estimation, ASD, is used for STR haplotypes of the Y chromosome and is based on the squared difference in the number of repeats. In addition to the methods of calculation, methods of Bayesian modeling assume a new significance. They have greater computational cost and complexity, but they allow obtaining an a posteriori distribution of the value of interest that is the most consistent with experimental data. The mutation rate must be known for both calculation methods and modeling methods. It can be determined either during the analysis of lineages or by providing calibration points based on populations with known formation time. These two approaches resulted in rate estimations for Y-chromosomal STR haplotypes with threefold difference. This contradiction was only recently refuted through the use of sequence data for the complete Y chromosome; “whole-genomic” rates of single nucleotide mutations obtained by both methods are mutually consistent and mark the area of application for different rates of STR markers. An issue even more crucial than that of the rates is correlation of the reconstructed history of the haplogroup (a cluster of haplotypes) and the history of the population. Although the need for distinguishing “lineage history” and “population history” arose in the earliest days of phylogeographic research, reconstructing the population history using genetic dating requires a number of methods and conditions. It is known that population
Matamoros-Angles, Andreu; Gayosso, Lucía Mayela; Richaud-Patin, Yvonne; di Domenico, Angelique; Vergara, Cristina; Hervera, Arnau; Sousa, Amaya; Fernández-Borges, Natalia; Consiglio, Antonella; Gavín, Rosalina; López de Maturana, Rakel; Ferrer, Isidro; López de Munain, Adolfo; Raya, Ángel; Castilla, Joaquín; Sánchez-Pernaute, Rosario; Del Río, José Antonio
2018-04-01
Gerstmann-Sträussler-Scheinker (GSS) syndrome is a fatal autosomal dominant neurodegenerative prionopathy clinically characterized by ataxia, spastic paraparesis, extrapyramidal signs and dementia. In some GSS familiar cases carrying point mutations in the PRNP gene, patients also showed comorbid tauopathy leading to mixed pathologies. In this study we developed an induced pluripotent stem (iPS) cell model derived from fibroblasts of a GSS patient harboring the Y218N PRNP mutation, as well as an age-matched healthy control. This particular PRNP mutation is unique with very few described cases. One of the cases presented neurofibrillary degeneration with relevant Tau hyperphosphorylation. Y218N iPS-derived cultures showed relevant astrogliosis, increased phospho-Tau, altered microtubule-associated transport and cell death. However, they failed to generate proteinase K-resistant prion. In this study we set out to test, for the first time, whether iPS cell-derived neurons could be used to investigate the appearance of disease-related phenotypes (i.e, tauopathy) identified in the GSS patient.
STRBase: a short tandem repeat DNA database for the human identity testing community
Ruitberg, Christian M.; Reeder, Dennis J.; Butler, John M.
2001-01-01
The National Institute of Standards and Technology (NIST) has compiled and maintained a Short Tandem Repeat DNA Internet Database (http://www.cstl.nist.gov/biotech/strbase/) since 1997 commonly referred to as STRBase. This database is an information resource for the forensic DNA typing community with details on commonly used short tandem repeat (STR) DNA markers. STRBase consolidates and organizes the abundant literature on this subject to facilitate on-going efforts in DNA typing. Observed alleles and annotated sequence for each STR locus are described along with a review of STR analysis technologies. Additionally, commercially available STR multiplex kits are described, published polymerase chain reaction (PCR) primer sequences are reported, and validation studies conducted by a number of forensic laboratories are listed. To supplement the technical information, addresses for scientists and hyperlinks to organizations working in this area are available, along with the comprehensive reference list of over 1300 publications on STRs used for DNA typing purposes. PMID:11125125
Analysis of 17 STR data on 5362 southern Portuguese individuals-an update on reference database.
Cabezas Silva, Raquel; Ribeiro, Teresa; Lucas, Isabel; Porto, Maria João; Costa Santos, Jorge; Dario, Paulo
2016-03-01
The main objective of this work consisted of the updating of allele frequencies and other relevant forensic parameters for the 17 autosomal STR loci provided by the combination of the two types of kits used routinely in our laboratory casework: AmpF/STR Identifiler(®) and the Powerplex(®) 16 Systems. This aim was of significant importance, given that the last study on these kits within the southern Portuguese population dates back to 2006, and, as a consequence, it was necessary to correct the deviation caused by population evolution over the last ten years so that they might be better applied to our forensic casework. For this reason genetic data from 5362 unrelated Caucasian Portuguese individuals from the south of Portugal who were involved in paternity testing casework from 2005 to 2014 was used. Of all the markers, TPOX proved to be the least polymorphic, and Penta E the most. Secondly, this up-to-date southern Portuguese population was compared not only with the northern and central Portuguese populations, but also with that of southern Portugal in 2006, along with populations from Spain, Italy, Greece, Romania, Morocco, Angola and Korea in order to infer information about the relatedness of these respective populations, and the variation of the southern Portuguese population over time. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Gusmão, Leonor; Gomes, Veronica; González, Miguel; Corach, Daniel; Sala, Andrea; Alechine, Evguenia; Palha, Teresinha; Santos, Ney; Ribeiro-dos-Santos, Andrea; Geppert, Maria; Willuweit, Sascha; Nagy, Marion; Zweynert, Sarah; Baeta, Miriam; Núñez, Carolina; Martínez-Jarreta, Begoña; González-Andrade, Fabricio; Fagundes de Carvalho, Elizeu; da Silva, Dayse Aparecida; Builes, Juan José; Turbón, Daniel; Lopez Parra, Ana Maria; Arroyo-Pardo, Eduardo; Toscanini, Ulises; Borjas, Lisbeth; Barletta, Claudia; Ewart, Elizabeth; Santos, Sidney; Krawczak, Michael
2013-01-01
Numerous studies of human populations in Europe and Asia have revealed a concordance between their extant genetic structure and the prevailing regional pattern of geography and language. For native South Americans, however, such evidence has been lacking so far. Therefore, we examined the relationship between Y-chromosomal genotype on the one hand, and male geographic origin and linguistic affiliation on the other, in the largest study of South American natives to date in terms of sampled individuals and populations. A total of 1,011 individuals, representing 50 tribal populations from 81 settlements, were genotyped for up to 17 short tandem repeat (STR) markers and 16 single nucleotide polymorphisms (Y-SNPs), the latter resolving phylogenetic lineages Q and C. Virtually no structure became apparent for the extant Y-chromosomal genetic variation of South American males that could sensibly be related to their inter-tribal geographic and linguistic relationships. This continent-wide decoupling is consistent with a rapid peopling of the continent followed by long periods of isolation in small groups. Furthermore, for the first time, we identified a distinct geographical cluster of Y-SNP lineages C-M217 (C3*) in South America. Such haplotypes are virtually absent from North and Central America, but occur at high frequency in Asia. Together with the locally confined Y-STR autocorrelation observed in our study as a whole, the available data therefore suggest a late introduction of C3* into South America no more than 6,000 years ago, perhaps via coastal or trans-Pacific routes. Extensive simulations revealed that the observed lack of haplogroup C3* among extant North and Central American natives is only compatible with low levels of migration between the ancestor populations of C3* carriers and non-carriers. In summary, our data highlight the fact that a pronounced correlation between genetic and geographic/cultural structure can only be expected under very specific
Gombos, Z; Hermann, R; Kiviniemi, M; Nejentsev, S; Reimand, K; Fadeyev, V; Peterson, P; Uibo, R; Ilonen, J
2007-12-01
Addison's disease is an organ-specific autoimmune disorder with a polygenic background. The aim of the study was to identify non-class II human leukocyte antigen (HLA) susceptibility genes for Addison's disease. Addison's disease patients from three European populations were analysed for selected HLA-DR-DQ alleles and for 11 microsatellite markers covering approximately 4 Mb over the HLA region. Subjects were 69 patients with Addison's disease from Estonia (24), Finland (14) and Russia (31). Consecutively recruited healthy newborns from the same geographical regions were used as controls (269 Estonian, 1000 Finnish and 413 Russian). Association measures for HLA-DRB1, DQB1, DQA1 and 11 microsatellites between D6S273 and D6S2223 were taken. A low-resolution full-house typing was used for HLA class II genes, while microsatellite markers were studied using fluorescence-based DNA fragment sizing technology. We confirmed that the HLA-DR3-DQ2 and the DQB1*0302-DRB1*0404 haplotypes confer disease susceptibility. In Russian patients, we also found an increase of DRB1*0403 allele, combined with DQB1*0305 allele in three out of six cases (P<0.0001). Analysis of 11 microsatellite markers including STR MICA confirmed the strong linkage in DR3-DQ2 haplotypes but DRB1*0404-DQB1*0302 haplotypes were diverse. MICA5.1 allele was found in 22 out of 24 Estonian patients, but results from Finnish and Russian patients did not support its independent role in disease susceptibility. HLA-DRB1*0403 was identified as a novel susceptibility allele for Addison's disease. Additionally, we found no evidence of a non-class II HLA disease susceptibility locus; however, the HLA-DR3-DQ2 haplotype appeared more conserved in patient groups with high DR-DQ2 frequencies.
Gurkan, Cemal; Sevay, Huseyin; Demirdov, Damla Kanliada; Hossoz, Sinem; Ceker, Deren; Teralı, Kerem; Erol, Ayla Sevim
2017-03-01
Cyprus is an island in the Eastern Mediterranean Sea with a documented history of human settlements dating back over 10,000 years. To investigate the paternal lineages of a representative population from Cyprus in the context of the larger Near Eastern/Southeastern European genetic landscape. Three hundred and eighty samples from the second most populous ethnic group in Cyprus (Turkish Cypriots) were analysed at 17 Y-chromosomal short tandem repeat (Y-STR) loci. A haplotype diversity of 0.9991 was observed, along with a number of allelic variants, multi-allelic patterns and a most frequent haplotype that have not previously been reported elsewhere. Pairwise genetic distance comparisons of the Turkish Cypriot Y-STR dataset and Y-chromosomal haplogroup distribution with those from Near East/Southeastern Europe both suggested a closer genetic connection with the Near Eastern populations. Median-joining network analyses of the most frequent haplogroups also revealed some evidence towards in situ radiation. Turkish Cypriot paternal lineages seem to bear an autochthonous character and closest genetic connection with the neighbouring Near Eastern populations. These observations are further underscored by the fact that the haplogroups associated with the spread of Neolithic Agricultural Revolution from the Fertile Crescent (E1b1b/J1/J2/G2a) dominate (>70%) the Turkish Cypriot haplogroup distribution.
Pesik, V Yu; Fedunin, A A; Agdzhoyan, A T; Utevska, O M; Chukhraeva, M I; Evseeva, I V; Churnosov, M I; Lependina, I N; Bogunov, Yu V; Bogunova, A A; Ignashkin, M A; Yankovsky, N K; Balanovska, E V; Orekhov, V A; Balanovsky, O P
2014-06-01
We conducted the first genetic analysis of a wide a range of rural Russian populations in European Russia with a panel of common DNA markers commonly used in criminalistics genetic identification. We examined a total of 647 samples from indigenous ethnic Russian populations in Arkhangelsk, Belgorod, Voronezh, Kursk, Rostov, Ryazan, and Orel regions. We employed a multiplex genotyping kit, COrDIS Plus, to genotype Short Tandem Repeat (STR) loci, which included the genetic marker panel officially recommended for DNA identification in the Russian Federation, the United States, and the European Union. In the course of our study, we created a database of allelic frequencies, examined the distribution of alleles and genotypes in seven rural Russian populations, and defined the genetic relationships between these populations. We found that, although multidimensional analysis indicated a difference between the Northern gene pool and the rest of the Russian European populations, a pairwise comparison using 19 STR markers among all populations did not reveal significant differences. This is in concordance with previous studies, which examined up to 12 STR markers of urban Russian populations. Therefore, the database of allelic frequencies created in this study can be applied for forensic examinations and DNA identification among the ethnic Russian population over European Russia. We also noted a decrease in the levels of heterozygosity in the northern Russian population compared to ethnic populations in southern and central Russia, which is consistent with trends identified previously using classical gene markers and analysis of mitochondrial DNA.
Mutation rates at 42 Y chromosomal short tandem repeats in Chinese Han population in Eastern China.
Wu, Weiwei; Ren, Wenyan; Hao, Honglei; Nan, Hailun; He, Xin; Liu, Qiuling; Lu, Dejian
2018-01-31
Mutation analysis of 42 Y chromosomal short tandem repeats (Y-STRs) loci was performed using a sample of 1160 father-son pairs from the Chinese Han population in Eastern China. The results showed that the average mutation rate across the 42 Y-STR loci was 0.0041 (95% CI 0.0036-0.0047) per locus per generation. The locus-specific mutation rates varied from 0.000 to 0.0190. No mutation was found at DYS388, DYS437, DYS448, DYS531, and GATA_H4. DYS627, DYS570, DYS576, and DYS449 could be classified as rapidly mutating Y-STRs, with mutation rates higher than 1.0 × 10 -2 . DYS458, DYS630, and DYS518 were moderately mutating Y-STRs, with mutation rates ranging from 8 × 10 -3 to 1 × 10 -2 . Although the characteristics of the Y-STR mutations were consistent with those in previous studies, mutation rate differences between our data and previous published data were found at some rapidly mutating Y-STRs. The single-copy loci located on the short arm of the Y chromosome (Yp) showed relatively higher mutation rates more frequently than the multi-copy loci. These results will not only extend the data for Y-STR mutations but also be important for kinship analysis, paternal lineage identification, and family relationship reconstruction in forensic Y-STR analysis.
The Geographic Distribution of Human Y Chromosome Variation
Hammer, M. F.; Spurdle, A. B.; Karafet, T.; Bonner, M. R.; Wood, E. T.; Novelletto, A.; Malaspina, P.; Mitchell, R. J.; Horai, S.; Jenkins, T.; Zegura, S. L.
1997-01-01
We examined variation on the nonrecombining portion of the human Y chromosome to investigate human evolution during the last 200,000 years. The Y-specific polymorphic sites included the Y Alu insertional polymorphism or ``YAP'' element (DYS287), the poly(A) tail associated with the YAP element, three point mutations in close association with the YAP insertion site, an A-G polymorphic transition (DYS271), and a tetranucleotide microsatellite (DYS19). Global variation at the five bi-allelic sites (DYS271, DYS287, and the three point mutations) gave rise to five ``YAP haplotypes'' in 60 populations from Africa, Europe, Asia, Australasia, and the New World (n = 1500). Combining the multi-allelic variation at the microsatellite loci (poly(A) tail and DYS19) with the YAP haplotypes resulted in a total of 27 ``combination haplotypes''. All five of the YAP haplotypes and 21 of the 27 combination haplotypes were found in African populations, which had greater haplotype diversity than did populations from other geographical locations. Only subsets of the five YAP haplotypes were found outside of Africa. Patterns of observed variation were compatible with a variety of hypotheses, including multiple human migrations and range expansions. PMID:9055088
StrAuto: automation and parallelization of STRUCTURE analysis.
Chhatre, Vikram E; Emerson, Kevin J
2017-03-24
Population structure inference using the software STRUCTURE has become an integral part of population genetic studies covering a broad spectrum of taxa including humans. The ever-expanding size of genetic data sets poses computational challenges for this analysis. Although at least one tool currently implements parallel computing to reduce computational overload of this analysis, it does not fully automate the use of replicate STRUCTURE analysis runs required for downstream inference of optimal K. There is pressing need for a tool that can deploy population structure analysis on high performance computing clusters. We present an updated version of the popular Python program StrAuto, to streamline population structure analysis using parallel computing. StrAuto implements a pipeline that combines STRUCTURE analysis with the Evanno Δ K analysis and visualization of results using STRUCTURE HARVESTER. Using benchmarking tests, we demonstrate that StrAuto significantly reduces the computational time needed to perform iterative STRUCTURE analysis by distributing runs over two or more processors. StrAuto is the first tool to integrate STRUCTURE analysis with post-processing using a pipeline approach in addition to implementing parallel computation - a set up ideal for deployment on computing clusters. StrAuto is distributed under the GNU GPL (General Public License) and available to download from http://strauto.popgen.org .
Phillips, C; Ballard, D; Gill, P; Court, D Syndercombe; Carracedo, A; Lareu, M V
2012-05-01
Family studies can be used to measure the genetic distance between same-chromosome (syntenic) STRs in order to detect physical linkage or linkage disequilibrium. However, family studies are expensive and time consuming, in many cases uninformative, and lack a reliable means to infer the phase of the diplotypes obtained. HapMap provides a more comprehensive and fine-scale estimation of recombination rates using high density multi-point SNP data (average inter-SNP distance: 900 nucleotides). Data at this fine scale detects sub-kilobase genetic distances across the whole recombining human genome. We have used the most recent HapMap SNP data release 22 to measure and compare genetic distances, and by inference fine-scale recombination rates, between 29 syntenic STR pairs identified from 39 validated STRs currently available for forensic use. The 39 STRs comprise 23 core loci: SE33, Penta D & E, 13 CODIS and 7 non-CODIS European Standard Set STRs, plus supplementary STRs in the recently released Promega CS-7™ and Qiagen Investigator HDplex™ kits. Also included were D9S1120, a marker we developed for forensic use unique to chromosome 9, and the novel D6S1043 component STR of SinoFiler™ (Applied Biosystems). The data collated provides reliable estimates of recombination rates between each STR pair, that can then be placed into haplotype frequency calculators for short pedigrees with multiple meiotic inputs and which just requires the addition of allele frequencies. This allows all current STR sets or their combinations to be used in supplemented paternity analyses without the need for further adjustment for physical linkage. The detailed analysis of recombination rates made for autosomal forensic STRs was extended to the more than 50 X chromosome STRs established or in development for complex kinship analyses. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
Szymanski, Maciej; Barciszewska, Miroslawa Z.; Barciszewski, Jan; Erdmann, Volker A.
2000-01-01
This paper presents the updated version (Y2K) of the database of ribosomal 5S ribonucleic acids (5S rRNA) and their genes (5S rDNA), http://rose.man/poznan. pl/5SData/index.html . This edition of the database contains 1985 primary structures of 5S rRNA and 5S rDNA. They include 60 archaebacterial, 470 eubacterial, 63 plastid, nine mitochondrial and 1383 eukaryotic sequences. The nucleotide sequences of the 5S rRNAs or 5S rDNAs are divided according to the taxonomic position of the source organisms. PMID:10592212
5S ribosomal RNA database Y2K.
Szymanski, M; Barciszewska, M Z; Barciszewski, J; Erdmann, V A
2000-01-01
This paper presents the updated version (Y2K) of the database of ribosomal 5S ribonucleic acids (5S rRNA) and their genes (5S rDNA), http://rose.man/poznan.pl/5SData/index.html. This edition of the database contains 1985primary structures of 5S rRNA and 5S rDNA. They include 60 archaebacterial, 470 eubacterial, 63 plastid, nine mitochondrial and 1383 eukaryotic sequences. The nucleotide sequences of the 5S rRNAs or 5S rDNAs are divided according to the taxonomic position of the source organisms.
Olsson, K Sigvard; Ritter, Bernd; Hansson, Norbeth; Chowdhury, Ruma R
2008-07-01
The hemochromatosis mutation, C282Y of the HFE gene, seems to have originated from a single event which once occurred in a person living in the north west of Europe carrying human leukocyte antigen (HLA)-A3-B7. In descendants of this ancestor also other haplotypes appear probably caused by local recombinations and founder effects. The background of these associations is unknown. Isolated river valley populations may be fruitful for the mapping of genetic disorders such as hemochromatosis. In this study, we try to test this hypothesis in a study from central Sweden where the haplotyope A1-B8 was common. HLA haplotypes and HFE mutations were studied in hemochromatosis patients with present or past parental origin in a sparsely populated (1/km(2)) rural district (n = 8366 in the year of 2005), in central Sweden. Pedigrees were constructed from the Swedish church book registry. Extended haplotypes were studied to evaluate origin of recombinations. There were 87 original probands, 36 females and 51 males identified during 30 yr, of whom 86% carried C282Y/C282Y and 14% C282Y/H63D. Of 32 different HLA haplotypes A1-B8 was the most common (34%), followed by A3-B7 (16%), both in strong linkage disequilibrium with controls, (P < 0.001). Twenty-nine different families with A1-B8 had a common founder origin 15 generations ago in small bottleneck populations of the late 16th century. A second A1-B8 founder born 1655 was of Norwegian origin. Most of the A3 carriers (n = 26) had a common founder origin 16 generations ago in an even smaller nearby river valley. A fourth founder family carrying HLA-A2 seems to have originated from a recombination along the descendant lines from the A3 ancestor supported by extended haplotype studies. A1-haplotypes with alleles at the B locus different from B8 had a similar recombination origin as HLA-A2 alleles and a common founder origin 11 generations ago. The intergenerational time interval averaged 35.5 +/- 7.9 yr in men and 31.9 +/- 5.9 in
Gill, Peter; Haned, Hinda; Bleka, Oyvind; Hansson, Oskar; Dørum, Guro; Egeland, Thore
2015-09-01
The introduction of Short Tandem Repeat (STR) DNA was a revolution within a revolution that transformed forensic DNA profiling into a tool that could be used, for the first time, to create National DNA databases. This transformation would not have been possible without the concurrent development of fluorescent automated sequencers, combined with the ability to multiplex several loci together. Use of the polymerase chain reaction (PCR) increased the sensitivity of the method to enable the analysis of a handful of cells. The first multiplexes were simple: 'the quad', introduced by the defunct UK Forensic Science Service (FSS) in 1994, rapidly followed by a more discriminating 'six-plex' (Second Generation Multiplex) in 1995 that was used to create the world's first national DNA database. The success of the database rapidly outgrew the functionality of the original system - by the year 2000 a new multiplex of ten-loci was introduced to reduce the chance of adventitious matches. The technology was adopted world-wide, albeit with different loci. The political requirement to introduce pan-European databases encouraged standardisation - the development of European Standard Set (ESS) of markers comprising twelve-loci is the latest iteration. Although development has been impressive, the methods used to interpret evidence have lagged behind. For example, the theory to interpret complex DNA profiles (low-level mixtures), had been developed fifteen years ago, but only in the past year or so, are the concepts starting to be widely adopted. A plethora of different models (some commercial and others non-commercial) have appeared. This has led to a confusing 'debate' about the 'best' to use. The different models available are described along with their advantages and disadvantages. A section discusses the development of national DNA databases, along with details of an associated controversy to estimate the strength of evidence of matches. Current methodology is limited to
[Genetic polymorphisms of 21 non-CODIS STR loci].
Shao, Wei-bo; Zhang, Su-hua; Li, Li
2011-02-01
To investigate genetic polymorphisms of 21 non-CODIS STR loci in Han population from the east of China and to explore their forensic application value. Twenty-one non-CODIS STR loci, were amplified with AGCU 21+1 STR kit and DNA samples were obtained from 225 unrelated individuals of the Han population from the east of China. The PCR products were analyzed with 3130 Genetic Analyzer and genotyped with GeneMapper ID v3.2 software. The genetic data were statistically analyzed with PowerStats v12.xls and Cervus 2.0 software. The distributions of 21 non-CODIS STR loci satisfied the Hardy-Weinberg equilibration. The heterozygosity (H) distributions were 0.596-0.804, the discrimination power (DP) were 0.764-0.948, the probability of exclusion of duo-testing (PEduo) were 0.176-0.492, the probability of exclusion of trios-testing (PEtrio) were 0.334-0.663, and the polymorphic information content (PIC) were 0.522-0.807. The cumulative probability of exclusion (CPE) of duo-testing was 0.999707, the CPE of trios-testing was 0.9999994, and the cumulated discrimination power (CDP) was 0.99999999999999999994. Twenty-one non-CODIS STR loci are highly polymorphic. They can be effectively used in personal identification and paternity testing in trios cases. They can also be used as supplement in the difficult cases of diad paternity testing.
Feng, Chunmei; Wang, Xin; Wang, Xiaolong; Yu, Hao; Zhang, Guohua
2018-03-01
We investigated the frequencies of 15 autosomal STR loci in the Kazak population of the Ili Kazak Autonomous Prefecture with the aim of expanding the available population information in human genetic databases and for forensic DNA analysis. Genetic polymorphisms of 15 autosomal short tandem repeat (STR) loci were analysed in 456 individuals of the Kazak population from Ili Kazakh Autonomous Prefecture, northwestern China. A total of 173 alleles at 15 autosomal STR loci were found; the allele frequencies ranged from 0.5022-0.0011. The combined power of discrimination and exclusion statistics for the 15 STR loci were 0.999 999 999 85 and 0.999 998 800 65, respectively. In addition, phylogenetic analysis involving the Ili Uygur population and other relevant populations was carried out. A neighbour-joining tree and multidimensional scaling plot were generated based on Nei's standard genetic distance. Results of the population comparison indicated that the Ili Uygur population was most closely related genetically to the Uygur populations from other regions in China. These findings are consistent with the historical and geographic backgrounds of these populations.
Efficient algorithms for polyploid haplotype phasing.
He, Dan; Saha, Subrata; Finkers, Richard; Parida, Laxmi
2018-05-09
Inference of haplotypes, or the sequence of alleles along the same chromosomes, is a fundamental problem in genetics and is a key component for many analyses including admixture mapping, identifying regions of identity by descent and imputation. Haplotype phasing based on sequencing reads has attracted lots of attentions. Diploid haplotype phasing where the two haplotypes are complimentary have been studied extensively. In this work, we focused on Polyploid haplotype phasing where we aim to phase more than two haplotypes at the same time from sequencing data. The problem is much more complicated as the search space becomes much larger and the haplotypes do not need to be complimentary any more. We proposed two algorithms, (1) Poly-Harsh, a Gibbs Sampling based algorithm which alternatively samples haplotypes and the read assignments to minimize the mismatches between the reads and the phased haplotypes, (2) An efficient algorithm to concatenate haplotype blocks into contiguous haplotypes. Our experiments showed that our method is able to improve the quality of the phased haplotypes over the state-of-the-art methods. To our knowledge, our algorithm for haplotype blocks concatenation is the first algorithm that leverages the shared information across multiple individuals to construct contiguous haplotypes. Our experiments showed that it is both efficient and effective.
A massively parallel strategy for STR marker development, capture, and genotyping.
Kistler, Logan; Johnson, Stephen M; Irwin, Mitchell T; Louis, Edward E; Ratan, Aakrosh; Perry, George H
2017-09-06
Short tandem repeat (STR) variants are highly polymorphic markers that facilitate powerful population genetic analyses. STRs are especially valuable in conservation and ecological genetic research, yielding detailed information on population structure and short-term demographic fluctuations. Massively parallel sequencing has not previously been leveraged for scalable, efficient STR recovery. Here, we present a pipeline for developing STR markers directly from high-throughput shotgun sequencing data without a reference genome, and an approach for highly parallel target STR recovery. We employed our approach to capture a panel of 5000 STRs from a test group of diademed sifakas (Propithecus diadema, n = 3), endangered Malagasy rainforest lemurs, and we report extremely efficient recovery of targeted loci-97.3-99.6% of STRs characterized with ≥10x non-redundant sequence coverage. We then tested our STR capture strategy on P. diadema fecal DNA, and report robust initial results and suggestions for future implementations. In addition to STR targets, this approach also generates large, genome-wide single nucleotide polymorphism (SNP) panels from flanking regions. Our method provides a cost-effective and scalable solution for rapid recovery of large STR and SNP datasets in any species without needing a reference genome, and can be used even with suboptimal DNA more easily acquired in conservation and ecological studies. Published by Oxford University Press on behalf of Nucleic Acids Research 2017.
Short-read, high-throughput sequencing technology for STR genotyping
Bornman, Daniel M.; Hester, Mark E.; Schuetter, Jared M.; Kasoji, Manjula D.; Minard-Smith, Angela; Barden, Curt A.; Nelson, Scott C.; Godbold, Gene D.; Baker, Christine H.; Yang, Boyu; Walther, Jacquelyn E.; Tornes, Ivan E.; Yan, Pearlly S.; Rodriguez, Benjamin; Bundschuh, Ralf; Dickens, Michael L.; Young, Brian A.; Faith, Seth A.
2013-01-01
DNA-based methods for human identification principally rely upon genotyping of short tandem repeat (STR) loci. Electrophoretic-based techniques for variable-length classification of STRs are universally utilized, but are limited in that they have relatively low throughput and do not yield nucleotide sequence information. High-throughput sequencing technology may provide a more powerful instrument for human identification, but is not currently validated for forensic casework. Here, we present a systematic method to perform high-throughput genotyping analysis of the Combined DNA Index System (CODIS) STR loci using short-read (150 bp) massively parallel sequencing technology. Open source reference alignment tools were optimized to evaluate PCR-amplified STR loci using a custom designed STR genome reference. Evaluation of this approach demonstrated that the 13 CODIS STR loci and amelogenin (AMEL) locus could be accurately called from individual and mixture samples. Sensitivity analysis showed that as few as 18,500 reads, aligned to an in silico referenced genome, were required to genotype an individual (>99% confidence) for the CODIS loci. The power of this technology was further demonstrated by identification of variant alleles containing single nucleotide polymorphisms (SNPs) and the development of quantitative measurements (reads) for resolving mixed samples. PMID:25621315
Strömende Flüssigkeiten und Gase
NASA Astrophysics Data System (ADS)
Heintze, Joachim
Die Bemerkung über die Probleme eines allgemeingültigen Ansatzes, die wir zu Anfang von Kap. 1 machten, gilt in noch höherem Maße für die Mechanik von strömenden Flüssigkeiten; dort erreicht man sogar ziemlich rasch die Grenze der Leistungsfähigkeit der heutigen Mathematik, d. h. wir können zwar - ausgehend von den Newtonschen Gesetzen (Bd. I/3) - eine Differentialgleichung für die Strömung von Flüssigkeiten aufstellen, die sog. Navier-Stokes-Gleichung, es sind aber keine allgemein anwendbaren Lösungsverfahren für diese Gleichung bekannt. Ein Blick in die Natur und auf die vielfältigen Strömungsphänomene zeigt, dass diese Tatsache nicht verwunderlich ist.
Wendt, Frank R; Churchill, Jennifer D; Novroski, Nicole M M; King, Jonathan L; Ng, Jillian; Oldt, Robert F; McCulloh, Kelly L; Weise, Jessica A; Smith, David Glenn; Kanthaswamy, Sreetharan; Budowle, Bruce
2016-09-01
Forensically-relevant genetic markers were typed for sixty-two Yavapai Native Americans using the ForenSeq™ DNA Signature Prep Kit.These data are invaluable to the human identity community due to the greater genetic differentiation among Native American tribes than among other subdivisions within major populations of the United States. Autosomal, X-chromosomal, and Y-chromosomal short tandem repeat (STR) and identity-informative (iSNPs), ancestry-informative (aSNPs), and phenotype-informative (pSNPs) single nucleotide polymorphism (SNP) allele frequencies are reported. Sequence-based allelic variants were observed in 13 autosomal, 3 X, and 3 Y STRs. These observations increased observed and expected heterozygosities for autosomal STRs by 0.081±0.068 and 0.073±0.063, respectively, and decreased single-locus random match probabilities by 0.051±0.043 for 13 autosomal STRs. The autosomal random match probabilities (RMPs) were 2.37×10-26 and 2.81×10-29 for length-based and sequence-based alleles, respectively. There were 22 and 25 unique Y-STR haplotypes among 26 males, generating haplotype diversities of 0.95 and 0.96, for length-based and sequencebased alleles, respectively. Of the 26 haplotypes generated, 17 were assigned to haplogroup Q, three to haplogroup R1b, two each to haplogroups E1b1b and L, and one each to haplogroups R1a and I1. Male and female sequence-based X-STR random match probabilities were 3.28×10-7 and 1.22×10-6, respectively. The average observed and expected heterozygosities for 94 iSNPs were 0.39±0.12 and 0.39±0.13, respectively, and the combined iSNP RMP was 1.08×10-32. The combined STR and iSNP RMPs were 2.55×10-58 and 3.02×10-61 for length-based and sequence-based STR alleles, respectively. Ancestry and phenotypic SNP information, performed using the ForenSeq™ Universal Analysis Software, predicted black hair, brown eyes, and some probability of East Asian ancestry for all but one sample that clustered between European and
Determination of haplotypes at structurally complex regions using emulsion haplotype fusion PCR.
Tyson, Jess; Armour, John A L
2012-12-11
Genotyping and massively-parallel sequencing projects result in a vast amount of diploid data that is only rarely resolved into its constituent haplotypes. It is nevertheless this phased information that is transmitted from one generation to the next and is most directly associated with biological function and the genetic causes of biological effects. Despite progress made in genome-wide sequencing and phasing algorithms and methods, problems assembling (and reconstructing linear haplotypes in) regions of repetitive DNA and structural variation remain. These dynamic and structurally complex regions are often poorly understood from a sequence point of view. Regions such as these that are highly similar in their sequence tend to be collapsed onto the genome assembly. This is turn means downstream determination of the true sequence haplotype in these regions poses a particular challenge. For structurally complex regions, a more focussed approach to assembling haplotypes may be required. In order to investigate reconstruction of spatial information at structurally complex regions, we have used an emulsion haplotype fusion PCR approach to reproducibly link sequences of up to 1kb in length to allow phasing of multiple variants from neighbouring loci, using allele-specific PCR and sequencing to detect the phase. By using emulsion systems linking flanking regions to amplicons within the CNV, this led to the reconstruction of a 59kb haplotype across the DEFA1A3 CNV in HapMap individuals. This study has demonstrated a novel use for emulsion haplotype fusion PCR in addressing the issue of reconstructing structural haplotypes at multiallelic copy variable regions, using the DEFA1A3 locus as an example.
Determination of haplotypes at structurally complex regions using emulsion haplotype fusion PCR
2012-01-01
Background Genotyping and massively-parallel sequencing projects result in a vast amount of diploid data that is only rarely resolved into its constituent haplotypes. It is nevertheless this phased information that is transmitted from one generation to the next and is most directly associated with biological function and the genetic causes of biological effects. Despite progress made in genome-wide sequencing and phasing algorithms and methods, problems assembling (and reconstructing linear haplotypes in) regions of repetitive DNA and structural variation remain. These dynamic and structurally complex regions are often poorly understood from a sequence point of view. Regions such as these that are highly similar in their sequence tend to be collapsed onto the genome assembly. This is turn means downstream determination of the true sequence haplotype in these regions poses a particular challenge. For structurally complex regions, a more focussed approach to assembling haplotypes may be required. Results In order to investigate reconstruction of spatial information at structurally complex regions, we have used an emulsion haplotype fusion PCR approach to reproducibly link sequences of up to 1kb in length to allow phasing of multiple variants from neighbouring loci, using allele-specific PCR and sequencing to detect the phase. By using emulsion systems linking flanking regions to amplicons within the CNV, this led to the reconstruction of a 59kb haplotype across the DEFA1A3 CNV in HapMap individuals. Conclusion This study has demonstrated a novel use for emulsion haplotype fusion PCR in addressing the issue of reconstructing structural haplotypes at multiallelic copy variable regions, using the DEFA1A3 locus as an example. PMID:23231411
Gomes, Iva; Pereira, Plácido J P; Harms, Sonja; Oliveira, Andréa M; Schneider, Peter M; Brehm, António
2017-11-01
A male West African sample from Guinea-Bissau (West-African coast) was genetically analyzed using 12 X chromosomal short tandem repeats that are grouped into four haplotype groups. Linkage disequilibrium was tested (p≤0.0008) and association was detected for the majority of markers in three out of the four studied haplotype clusters. The sample of 332 unrelated individuals analyzed in this study belonged to several recognized ethnic groups (n=18) which were used to evaluate the genetic variation of Guinea-Bissau's population. Pairwise genetic distances (F ST ) did not reveal significant differences among the majority of groups. An additional 110 samples from other countries also belonging to West Africa were as well compared with the sample of Guinea-Bissau. No significant differences were found between these two groups of West African individuals, supporting the genetic homogeneity of this region on the X chromosome level. The generation of over 100 DNA West African sequences provided new insights into the repeat sequence structure of some of the present X-STRs. Parameters for forensic evaluation were also calculated for each X-STR, supporting the potential application of these markers in typical kinship scenarios. Also, the high power of discrimination values for samples of female and male origin observed in this study, confirms the usefulness of the present X-STRs in identification analysis. Copyright © 2017 Elsevier B.V. All rights reserved.
Niemi, Marianna; Bläuer, Auli; Iso-Touru, Terhi; Harjula, Janne; Nyström Edmark, Veronica; Rannamäe, Eve; Lõugas, Lembi; Sajantila, Antti; Lidén, Kerstin; Taavitsainen, Jussi-Pekka
2015-01-01
Ancient DNA analysis offers a way to detect changes in populations over time. To date, most studies of ancient cattle have focused on their domestication in prehistory, while only a limited number of studies have analysed later periods. Conversely, the genetic structure of modern cattle populations is well known given the undertaking of several molecular and population genetic studies. Bones and teeth from ancient cattle populations from the North-East Baltic Sea region dated to the Prehistoric (Late Bronze and Iron Age, 5 samples), Medieval (14), and Post-Medieval (26) periods were investigated by sequencing 667 base pairs (bp) from the mitochondrial DNA (mtDNA) and 155 bp of intron 19 in the Y-chromosomal UTY gene. Comparison of maternal (mtDNA haplotypes) genetic diversity in ancient cattle (45 samples) with modern cattle populations in Europe and Asia (2094 samples) revealed 30 ancient mtDNA haplotypes, 24 of which were shared with modern breeds, while 6 were unique to the ancient samples. Of seven Y-chromosomal sequences determined from ancient samples, six were Y2 and one Y1 haplotype. Combined data including Swedish samples from the same periods (64 samples) was compared with the occurrence of Y-chromosomal haplotypes in modern cattle (1614 samples). The diversity of haplogroups was highest in the Prehistoric samples, where many haplotypes were unique. The Medieval and Post-Medieval samples also show a high diversity with new haplotypes. Some of these haplotypes have become frequent in modern breeds in the Nordic Countries and North-Western Russia while other haplotypes have remained in only a few local breeds or seem to have been lost. A temporal shift in Y-chromosomal haplotypes from Y2 to Y1 was detected that corresponds with the appearance of new mtDNA haplotypes in the Medieval and Post-Medieval period. This suggests a replacement of the Prehistoric mtDNA and Y chromosomal haplotypes by new types of cattle.
Niemi, Marianna; Bläuer, Auli; Iso-Touru, Terhi; Harjula, Janne; Nyström Edmark, Veronica; Rannamäe, Eve; Lõugas, Lembi; Sajantila, Antti; Lidén, Kerstin; Taavitsainen, Jussi-Pekka
2015-01-01
Background Ancient DNA analysis offers a way to detect changes in populations over time. To date, most studies of ancient cattle have focused on their domestication in prehistory, while only a limited number of studies have analysed later periods. Conversely, the genetic structure of modern cattle populations is well known given the undertaking of several molecular and population genetic studies. Results Bones and teeth from ancient cattle populations from the North-East Baltic Sea region dated to the Prehistoric (Late Bronze and Iron Age, 5 samples), Medieval (14), and Post-Medieval (26) periods were investigated by sequencing 667 base pairs (bp) from the mitochondrial DNA (mtDNA) and 155 bp of intron 19 in the Y-chromosomal UTY gene. Comparison of maternal (mtDNA haplotypes) genetic diversity in ancient cattle (45 samples) with modern cattle populations in Europe and Asia (2094 samples) revealed 30 ancient mtDNA haplotypes, 24 of which were shared with modern breeds, while 6 were unique to the ancient samples. Of seven Y-chromosomal sequences determined from ancient samples, six were Y2 and one Y1 haplotype. Combined data including Swedish samples from the same periods (64 samples) was compared with the occurrence of Y-chromosomal haplotypes in modern cattle (1614 samples). Conclusions The diversity of haplogroups was highest in the Prehistoric samples, where many haplotypes were unique. The Medieval and Post-Medieval samples also show a high diversity with new haplotypes. Some of these haplotypes have become frequent in modern breeds in the Nordic Countries and North-Western Russia while other haplotypes have remained in only a few local breeds or seem to have been lost. A temporal shift in Y-chromosomal haplotypes from Y2 to Y1 was detected that corresponds with the appearance of new mtDNA haplotypes in the Medieval and Post-Medieval period. This suggests a replacement of the Prehistoric mtDNA and Y chromosomal haplotypes by new types of cattle. PMID:25992976
yStreX: yeast stress expression database
Wanichthanarak, Kwanjeera; Nookaew, Intawat; Petranovic, Dina
2014-01-01
Over the past decade genome-wide expression analyses have been often used to study how expression of genes changes in response to various environmental stresses. Many of these studies (such as effects of oxygen concentration, temperature stress, low pH stress, osmotic stress, depletion or limitation of nutrients, addition of different chemical compounds, etc.) have been conducted in the unicellular Eukaryal model, yeast Saccharomyces cerevisiae. However, the lack of a unifying or integrated, bioinformatics platform that would permit efficient and rapid use of all these existing data remain an important issue. To facilitate research by exploiting existing transcription data in the field of yeast physiology, we have developed the yStreX database. It is an online repository of analyzed gene expression data from curated data sets from different studies that capture genome-wide transcriptional changes in response to diverse environmental transitions. The first aim of this online database is to facilitate comparison of cross-platform and cross-laboratory gene expression data. Additionally, we performed different expression analyses, meta-analyses and gene set enrichment analyses; and the results are also deposited in this database. Lastly, we constructed a user-friendly Web interface with interactive visualization to provide intuitive access and to display the queried data for users with no background in bioinformatics. Database URL: http://www.ystrexdb.com PMID:25024351
High-throughput STR analysis for DNA database using direct PCR.
Sim, Jeong Eun; Park, Su Jeong; Lee, Han Chul; Kim, Se-Yong; Kim, Jong Yeol; Lee, Seung Hwan
2013-07-01
Since the Korean criminal DNA database was launched in 2010, we have focused on establishing an automated DNA database profiling system that analyzes short tandem repeat loci in a high-throughput and cost-effective manner. We established a DNA database profiling system without DNA purification using a direct PCR buffer system. The quality of direct PCR procedures was compared with that of conventional PCR system under their respective optimized conditions. The results revealed not only perfect concordance but also an excellent PCR success rate, good electropherogram quality, and an optimal intra/inter-loci peak height ratio. In particular, the proportion of DNA extraction required due to direct PCR failure could be minimized to <3%. In conclusion, the newly developed direct PCR system can be adopted for automated DNA database profiling systems to replace or supplement conventional PCR system in a time- and cost-saving manner. © 2013 American Academy of Forensic Sciences Published 2013. This article is a U.S. Government work and is in the public domain in the U.S.A.
Haplotype-Based Genotyping in Polyploids.
Clevenger, Josh P; Korani, Walid; Ozias-Akins, Peggy; Jackson, Scott
2018-01-01
Accurate identification of polymorphisms from sequence data is crucial to unlocking the potential of high throughput sequencing for genomics. Single nucleotide polymorphisms (SNPs) are difficult to accurately identify in polyploid crops due to the duplicative nature of polyploid genomes leading to low confidence in the true alignment of short reads. Implementing a haplotype-based method in contrasting subgenome-specific sequences leads to higher accuracy of SNP identification in polyploids. To test this method, a large-scale 48K SNP array (Axiom Arachis2) was developed for Arachis hypogaea (peanut), an allotetraploid, in which 1,674 haplotype-based SNPs were included. Results of the array show that 74% of the haplotype-based SNP markers could be validated, which is considerably higher than previous methods used for peanut. The haplotype method has been implemented in a standalone program, HAPLOSWEEP, which takes as input bam files and a vcf file and identifies haplotype-based markers. Haplotype discovery can be made within single reads or span paired reads, and can leverage long read technology by targeting any length of haplotype. Haplotype-based genotyping is applicable in all allopolyploid genomes and provides confidence in marker identification and in silico-based genotyping for polyploid genomics.
A case of false mother included with 46 autosomal STR markers.
Li, Li; Lin, Yuan; Liu, Yan; Zhu, Ruxin; Zhao, Zhenmin; Que, Tingzhi
2015-01-01
For solving a maternity case, 19 autosomal short tandem repeats (STRs) were amplified using the AmpFℓSTR(®) Sinofiler(TM) kit and PowerPlex(®) 16 System. Additional 27 autosomal STR loci were analyzed using two domestic kits AGCU 21+1 and STRtyper-10G. The combined maternity index (CMI) was calculated to be 3.3 × 10(13), but the putative mother denied that she had given birth to the child. In order to reach an accurate conclusion, further testing of 20 X-chromosomal short tandem repeats (X-STRs), 40 single nucleotide polymorphism (SNP) loci, and mitochondrial DNA (mtDNA) was carried out. The putative mother and the boy shared at least one allele at all 46 tested autosomal STR loci. But, according to the profile data of 20 X-STR and 40 SNP markers, different genotypes at 13 X-STR loci and five SNP loci excluded maternity. Mitochondrial profiles also clearly excluded the mother as a parent of the son because they have multiple differences. It was finally found that the putative mother is the sister of the biological father. Different kinds of genetic markers needfully supplement the use of autosomal STR loci in case where the putative parent is suspected to be related to the true parent.
Interim analysis of STR effectiveness
DOT National Transportation Integrated Search
1978-01-01
The present report describes the status of the NHTSA Short Term Rehabilitation Study (STR) as of December, 1977, and summarizes the progress of data collection efforts by the eleven participating ASAP projects. Outcome measures considered as indicati...
Jin, Han Jun; Kim, Ki Cheol; Yoon, Cha Eun; Kim, Wook
2013-11-01
We analyzed the variation of eighteen miniSTR loci in 411 randomly chosen individuals from Korea to increase the probability that a degraded sample can be typed, as well as to provide an expanded and reliable population database. Six multiplex PCR systems were developed (multiplex I: D1S1677, D2S441 and D4S2364; multiplex II: D10S1248, D14S1434 and D22S1045; multiplex III: D12S391, D16S3253 and D20S161; multiplex IV: D3S4529, D8S1115 and D18S853; multiplex V: D6S1017, D11S4463 and D17S1301; multiplex VI: D5S2500, D9S1122 and D21S1437). Allele frequencies and forensic parameters were calculated to evaluate the suitability and robustness of these non-CODIS miniSTR systems. No significant deviation from Hardy-Weinberg equilibrium expectations were observed, except for D4S2364, D5S2500 and D20S161 loci. A multidimensional scaling plot based on allele frequencies of the six miniSTR loci (D1S1677, D2S441, D4S2364, D10S1248, D14S1434 and D22S1045) showed that Koreans appeared to have most genetic affinity with Chinese and Japanese than to other Eurasian populations compared here. The combined probability of match calculated from the 18 miniSTR loci was 2.902 × 10(-17), indicating a high degree of polymorphism. Thus, the 18 miniSTR loci can be suitable for recovering useful information for analyzing degraded forensic casework samples and for adding supplementary genetic information for a variety of analyses involving closely related individuals where there is a need for additional genetic information. Copyright © 2013 Elsevier Ltd and Faculty of Forensic and Legal Medicine. All rights reserved.
Intrahaplotypic Variants Differentiate Complex Linkage Disequilibrium within Human MHC Haplotypes
Lam, Tze Hau; Tay, Matthew Zirui; Wang, Bei; Xiao, Ziwei; Ren, Ee Chee
2015-01-01
Distinct regions of long-range genetic fixation in the human MHC region, known as conserved extended haplotypes (CEHs), possess unique genomic characteristics and are strongly associated with numerous diseases. While CEHs appear to be homogeneous by SNP analysis, the nature of fine variations within their genomic structure is unknown. Using multiple, MHC-homozygous cell lines, we demonstrate extensive sequence conservation in two common Asian MHC haplotypes: A33-B58-DR3 and A2-B46-DR9. However, characterization of phase-resolved MHC haplotypes revealed unique intra-CEH patterns of variation and uncovered 127 single nucleotide variants (SNVs) which are missing from public databases. We further show that the strong linkage disequilibrium structure within the human MHC that typically confounds precise identification of genetic features can be resolved using intra-CEH variants, as evidenced by rs3129063 and rs448489, which affect expression of ZFP57, a gene important in methylation and epigenetic regulation. This study demonstrates an improved strategy that can be used towards genetic dissection of diseases. PMID:26593880
Niemcunowicz-Janica, Anna; Pepiński, Witold; Janica, Jacek Robert; Janica, Jerzy; Skawrońska, Małgorzata; Koc-Zórawska, Ewa
2007-01-01
In cases of decomposed bodies, Y chromosomal STR markers may be useful in identification of a male relative. The authors assessed typeability of PowerPlex Y (Promega) loci in post mortem tissue material stored in various environments. Kidney, spleen and pancreas specimens were collected during autopsies of five persons aged 20-30 years, whose time of death was determined within the limit of 14 hours. Tissue material was incubated at 21 degrees C and 4 degrees C in various environmental conditions. DNA was extracted by the organic method from tissue samples collected in 7-day intervals and subsequently typed using the PowerPlexY-STR kit and ABI 310. A fast decrease in the typeability rate was seen in specimens incubated in peat soil and in sand. Kidney tissue samples were typeable in all PowerPlexY-STR loci within 63 days of incubation at 4 degrees C. Faster DNA degradation was recorded in spleen and pancreas specimens. In samples with negative genotyping results, no DNA was found by fluorometric quantitation. Decomposed soft tissues are a potential material for DNA typing.
CCD Strömvil Photometry of M 37
NASA Astrophysics Data System (ADS)
Boyle, R. P.; Janusz, R.; Kazlauskas, A.; Philip, A. G. Davis
2001-12-01
We have been working on a program of setting up standards in the Strömvil photometric system and have been doing CCD photometry of globular and open clusters. A previous paper (Boyle et al. BAAS, AAS Meeting #193, #68.08) described the results of observations made in the open cluster M 67, which we are setting up as one of the prime standard fields for Strömvil photometry. Now we discuss our observations of M 37, made on the Vatican Advanced Technology Telescope on Mt. Graham, Arizona. One of us (R.J.) has automated the data processing by a novel method. The Strömvil group is multinational. By use of this innovative automated, yet interactive processing method, one systematically applies the same processing steps to run in IRAF by capturing them as presented in html files and submitting them to the IRAF command language. Use of the mouse avoids errors and accelerates the processing from raw data frames to calibrated photometry. From several G2 V stars in M 67 we have calculated their mean color indices and compare them to stars in M 37 to identify candidate G2 V stars there. Identifying such stars relates to the search for terrestrial exoplanets. Ultimately we will use the calibrated Strömvil indices to make photometric determinations of log g and Teff.
Gu, Ming-liang; Chu, Jia-you
2007-12-01
Human genome has structures of haplotype and haplotype block which provide valuable information on human evolutionary history and may lead to the development of more efficient strategies to identify genetic variants that increase susceptibility to complex diseases. Haplotype block can be divided into discrete blocks of limited haplotype diversity. In each block, a small fraction of ptag SNPsq can be used to distinguish a large fraction of the haplotypes. These tag SNPs can be potentially useful for construction of haplotype and haplotype block, and association studies in complex diseases. There are two general classes of methods to construct haplotype and haplotype blocks based on genotypes on large pedigrees and statistical algorithms respectively. The author evaluate several construction methods to assess the power of different association tests with a variety of disease models and block-partitioning criteria. The advantages, limitations and applications of each method and the application in the association studies are discussed equitably. With the completion of the HapMap and development of statistical algorithms for addressing haplotype reconstruction, ideas of construction of haplotype based on combination of mathematics, physics, and computer science etc will have profound impacts on population genetics, location and cloning for susceptible genes in complex diseases, and related domain with life science etc.
Allele frequency distribution for 21 autosomal STR loci in Bhutan.
Kraaijenbrink, Thirsa; van Driem, George L; Tshering of Gaselô, Karma; de Knijff, Peter
2007-07-20
We studied the allele frequency distribution of 21 autosomal STR loci contained in the AmpFlSTR Identifiler (Applied Biosystems), the Powerplex 16 (Promega) and the FFFL (Promega) multiplex PCR kits among 936 individuals from the Royal Kingdom of Bhutan. As such these are the first published autosomal DNA results from this country.
Gonzalez-Perez, E; Moral, P; Via, M; Vona, G; Varesi, L; Santamaria, J; Gaya-Vidal, M; Esteban, E
2007-01-01
The islands of the West Mediterranean have played a central role in numerous archaeological, historical and anthropological studies due to their active participation in the history of main Mediterranean civilisations. However, genetic data failed to fit in both their degree of internal differentiation and relationships. A set of 18 Alu markers and three short tandem repeats (STRs) closely linked to the CD4, F13B and DM Alu have been analysed in seven samples from Majorca, Corsica, Sardinia and Sicily to explore some of these issues. Our samples show a high genetic heterogeneity inside and among islands for the Alu data. Global differentiation among islands (F(ST) 2.2%) is slightly higher than that described for Europeans and North Africans. Both the estimated divergence times among samples and the high population heterogeneity revealed by Alu data are compatible with population differences since the first islands' settlement in the Paleolithic period. However, the high within-population diversities and the remarkable homogeneity observed in both STR and Alu/STR haplotype variation indicated that, at least since Neolithic times, gene flow has been acting in west Mediterranean. Genetic drift in west-coast Sardinia and gene flow in west Sicily have contributed to their general differentiation, whereas Corsica, Majorca and east Sicily seem to reflect more recent historical relationships from continental south Europe.
Alzualde, Ainhoa; Indakoetxea, Begoña; Ferrer, Isidre; Moreno, Fermin; Barandiaran, Myriam; Gorostidi, Ana; Estanga, Ainara; Ruiz, Irune; Calero, Miguel; van Leeuwen, Fred W; Atares, Begoña; Juste, Ramón; Rodriguez-Martínez, Ana Belén; López de Munain, Adolfo
2010-08-01
Gerstmann-Sträussler-Scheinker (GSS) disease is a prion disease associated with prion protein gene (PRNP) mutations. We report a novel PRNP mutation (Y218N) associated with GSS disease in a pathologically confirmed case and in two other affected family members. The clinical features of these cases met criteria for possible Alzheimer disease and possible frontotemporal dementia. Neuropathologic analysis revealed deposition of proteinase K-resistant prion protein (PrP(res)), widespread hyperphosphorylated tau pathology, abnormal accumulation of mitochondria in the vicinity of PrP deposits, and expression of mutant ubiquitin (UBB(+1)) in neurofibrillary tangles and dystrophic neurites. Prion protein immunoblotting using 3F4 and 1E4 antibodies disclosed multiple bands ranging from approximately 20 kd to 80 kd and lower bands of 15 kd and approximately 10 kd, the latter only seen after a long incubation. These bands were partially resistant to proteinase K pretreatment. This pattern differs from those seen in Creutzfeldt-Jakob disease andresembles those reported in other GSS cases. The approximately 10kd band was recognized with anti-PrP C-terminus antibodies but not with anti-N terminus antibodies, suggesting PrP truncation at the N terminal. This new mutation extends the list of known mutations responsible for GSS disease and reinforces its clinical heterogeneity. Genetic examination of the PRNP gene should be included in the workup of patients with poorly classifiable dementia.
Population Structure With Localized Haplotype Clusters
Browning, Sharon R.; Weir, Bruce S.
2010-01-01
We propose a multilocus version of FST and a measure of haplotype diversity using localized haplotype clusters. Specifically, we use haplotype clusters identified with BEAGLE, which is a program implementing a hidden Markov model for localized haplotype clustering and performing several functions including inference of haplotype phase. We apply this methodology to HapMap phase 3 data. With this haplotype-cluster approach, African populations have highest diversity and lowest divergence from the ancestral population, East Asian populations have lowest diversity and highest divergence, and other populations (European, Indian, and Mexican) have intermediate levels of diversity and divergence. These relationships accord with expectation based on other studies and accepted models of human history. In contrast, the population-specific FST estimates obtained directly from single-nucleotide polymorphisms (SNPs) do not reflect such expected relationships. We show that ascertainment bias of SNPs has less impact on the proposed haplotype-cluster-based FST than on the SNP-based version, which provides a potential explanation for these results. Thus, these new measures of FST and haplotype-cluster diversity provide an important new tool for population genetic analysis of high-density SNP data. PMID:20457877
TUMOR HAPLOTYPE ASSEMBLY ALGORITHMS FOR CANCER GENOMICS
AGUIAR, DEREK; WONG, WENDY S.W.; ISTRAIL, SORIN
2014-01-01
The growing availability of inexpensive high-throughput sequence data is enabling researchers to sequence tumor populations within a single individual at high coverage. But, cancer genome sequence evolution and mutational phenomena like driver mutations and gene fusions are difficult to investigate without first reconstructing tumor haplotype sequences. Haplotype assembly of single individual tumor populations is an exceedingly difficult task complicated by tumor haplotype heterogeneity, tumor or normal cell sequence contamination, polyploidy, and complex patterns of variation. While computational and experimental haplotype phasing of diploid genomes has seen much progress in recent years, haplotype assembly in cancer genomes remains uncharted territory. In this work, we describe HapCompass-Tumor a computational modeling and algorithmic framework for haplotype assembly of copy number variable cancer genomes containing haplotypes at different frequencies and complex variation. We extend our polyploid haplotype assembly model and present novel algorithms for (1) complex variations, including copy number changes, as varying numbers of disjoint paths in an associated graph, (2) variable haplotype frequencies and contamination, and (3) computation of tumor haplotypes using simple cycles of the compass graph which constrain the space of haplotype assembly solutions. The model and algorithm are implemented in the software package HapCompass-Tumor which is available for download from http://www.brown.edu/Research/Istrail_Lab/. PMID:24297529
Investigator® HDplex (Qiagen) reference population database for forensic use in Argentina.
Martínez, Gustavo; Borosky, Alicia; Corach, Daniel; Llull, Cintia; Locarno, Laura; Lojo, Mercedes; Marino, Miguel; Miozzo, María Cecilia; Modesti, Nidia; Pacharoni, Carla; Pilili, Juan Pablo; Ramella, María Isabel; Sala, Andrea; Schaller, Cecilia; Vullo, Carlos; Toscanini, Ulises
2017-01-01
Currently, autosomal Short Tandem Repeat (STR) markers represent the method of election in forensic human identification. Commercial kits of most common use nowadays -e.g. PowerPlex ® Fusion, Promega Corp.; AmpFlSTR GlobalFiler, Thermofisher scientific; Investigator 24Plex QS,Qiagen-, allow the co-amplification of 23 highly polymorphic STR loci providing a high discrimination power in human identity testing. However, in complex kinship analysis and familial database searches involving distant relationships, additional DNA typing is often required in order to achieve well-founded conclusions. The recently developed kit Investigator ® HDplex (Qiagen) co-amplify twelve autosomal STRs markers (D7S1517, D3S1744, D12S391, D2S1360, D6S474, D4S2366, D8S1132, D5S2500, D18S51, D21S2055, D10S2325, SE33), nine of which are not present in the above mentioned kits, providing a set of efficient supplementary markers for human identification purposes. In this study we genotyped a sample of 980 individuals from urban areas of ten Argentinean provinces using the Investigator ® HDplex kit, aiming to provide forensic estimates for use in forensic casework and parentage testing in Argentina. We report reference allelic frequency databases for each of the provinces studied as well as for the combined samples. No deviation of Hardy-Weinberg equilibrium was observed. A reasonable discrimination capacity and power of exclusion was estimated which allowed predicting an acceptable forensic behavior of this kit, either to be used as the main STR panel for simple cases or as an auxiliary tool in complex cases. Additionally, population comparison tests showed that the studied samples are relatively homogeneous across the country for these STR set. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Analysis of the NMI01 marker for a population database of cannabis seeds.
Shirley, Nicholas; Allgeier, Lindsay; Lanier, Tommy; Coyle, Heather Miller
2013-01-01
We have analyzed the distribution of genotypes at a single hexanucleotide short tandem repeat (STR) locus in a Cannabis sativa seed database along with seed-packaging information. This STR locus is defined by the polymerase chain reaction amplification primers CS1F and CS1R and is referred to as NMI01 (for National Marijuana Initiative) in our study. The population database consists of seed seizures of two categories: seed samples from labeled and unlabeled packages regarding seed bank source. Of a population database of 93 processed seeds including 12 labeled Cannabis varieties, the observed genotypes generated from single seeds exhibited between one and three peaks (potentially six alleles if in homozygous state). The total number of observed genotypes was 54 making this marker highly specific and highly individualizing even among seeds of common lineage. Cluster analysis associated many but not all of the handwritten labeled seed varieties tested to date as well as the National Park seizure to our known reference database containing Mr. Nice Seedbank and Sensi Seeds commercially packaged reference samples. © 2012 American Academy of Forensic Sciences.
Modifications of Gait as Predictors of Natural Osteoarthritis Progression in STR/Ort Mice
Poulet, Blandine; de Souza, Roberto; Knights, Chancie B; Gentry, Clive; Wilson, Alan M; Bevan, Stuart; Chang, Yu-Mei; Pitsillides, Andrew A
2014-01-01
Objective Osteoarthritis (OA) is a common chronic disease for which disease-modifying therapies are not currently available. Studies to seek new targets for slowing the progress of OA rely on mouse models, but these do not allow for longitudinal monitoring of disease development. This study was undertaken to determine whether gait can be used to measure disease severity in the STR/Ort mouse model of spontaneous OA and whether gait changes are related to OA joint pain. Methods Gait was monitored using a treadmill-based video system. Correlations between OA severity and gait at 3 treadmill speeds were assessed in STR/Ort mice. Gait and pain behaviors of STR/Ort mice and control CBA mice were analyzed longitudinally, with monthly assessments. Results The best speed to identify paw area changes associated with OA severity in STR/Ort mice was found to be 17 cm · seconds−1. Paw area was modified with age in CBA and STR/Ort mice, but this began earlier in STR/Ort mice and correlated with the onset of OA at 20 weeks of age. In addition, task noncompliance appeared at 20 weeks. Surprisingly, STR/Ort mice did not show any signs of pain with OA development, even when treated with the opioid antagonist naloxone, but did exhibit normal pain behaviors in response to complete Freund's adjuvant–induced arthritis. Conclusion The present results identify an animal model in which OA severity and OA pain can be studied in isolation from one another. The findings suggest that paw area and treadmill noncompliance may be useful tools to longitudinally monitor nonpainful OA development in STR/Ort mice. This will help in providing a noninvasive means of assessing new therapies to slow the progression of OA. PMID:24623711
Just, Rebecca S; Irwin, Jodi A
2018-05-01
Some of the expected advantages of next generation sequencing (NGS) for short tandem repeat (STR) typing include enhanced mixture detection and genotype resolution via sequence variation among non-homologous alleles of the same length. However, at the same time that NGS methods for forensic DNA typing have advanced in recent years, many caseworking laboratories have implemented or are transitioning to probabilistic genotyping to assist the interpretation of complex autosomal STR typing results. Current probabilistic software programs are designed for length-based data, and were not intended to accommodate sequence strings as the product input. Yet to leverage the benefits of NGS for enhanced genotyping and mixture deconvolution, the sequence variation among same-length products must be utilized in some form. Here, we propose use of the longest uninterrupted stretch (LUS) in allele designations as a simple method to represent sequence variation within the STR repeat regions and facilitate - in the nearterm - probabilistic interpretation of NGS-based typing results. An examination of published population data indicated that a reference LUS region is straightforward to define for most autosomal STR loci, and that using repeat unit plus LUS length as the allele designator can represent greater than 80% of the alleles detected by sequencing. A proof of concept study performed using a freely available probabilistic software demonstrated that the LUS length can be used in allele designations when a program does not require alleles to be integers, and that utilizing sequence information improves interpretation of both single-source and mixed contributor STR typing results as compared to using repeat unit information alone. The LUS concept for allele designation maintains the repeat-based allele nomenclature that will permit backward compatibility to extant STR databases, and the LUS lengths themselves will be concordant regardless of the NGS assay or analysis tools
Edwards, Ceiridwen J.; Ginja, Catarina; Kantanen, Juha; Pérez-Pardal, Lucía; Tresset, Anne; Stock, Frauke; Gama, Luis T.; Penedo, M. Cecilia T.; Bradley, Daniel G.; Lenstra, Johannes A.; Nijman, Isaäc J.
2011-01-01
Background Diversity patterns of livestock species are informative to the history of agriculture and indicate uniqueness of breeds as relevant for conservation. So far, most studies on cattle have focused on mitochondrial and autosomal DNA variation. Previous studies of Y-chromosomal variation, with limited breed panels, identified two Bos taurus (taurine) haplogroups (Y1 and Y2; both composed of several haplotypes) and one Bos indicus (indicine/zebu) haplogroup (Y3), as well as a strong phylogeographic structuring of paternal lineages. Methodology and Principal Findings Haplogroup data were collected for 2087 animals from 138 breeds. For 111 breeds, these were resolved further by genotyping microsatellites INRA189 (10 alleles) and BM861 (2 alleles). European cattle carry exclusively taurine haplotypes, with the zebu Y-chromosomes having appreciable frequencies in Southwest Asian populations. Y1 is predominant in northern and north-western Europe, but is also observed in several Iberian breeds, as well as in Southwest Asia. A single Y1 haplotype is predominant in north-central Europe and a single Y2 haplotype in central Europe. In contrast, we found both Y1 and Y2 haplotypes in Britain, the Nordic region and Russia, with the highest Y-chromosomal diversity seen in the Iberian Peninsula. Conclusions We propose that the homogeneous Y1 and Y2 regions reflect founder effects associated with the development and expansion of two groups of dairy cattle, the pied or red breeds from the North Sea and Baltic coasts and the spotted, yellow or brown breeds from Switzerland, respectively. The present Y1-Y2 contrast in central Europe coincides with historic, linguistic, religious and cultural boundaries. PMID:21253012
Hanchard, Neil; Elzein, Abier; Trafford, Clare; Rockett, Kirk; Pinder, Margaret; Jallow, Muminatou; Harding, Rosalind; Kwiatkowski, Dominic; McKenzie, Colin
2007-08-10
The sickle (betas) mutation in the beta-globin gene (HBB) occurs on five "classical" betas haplotype backgrounds in ethnic groups of African ancestry. Strong selection in favour of the betas allele - a consequence of protection from severe malarial infection afforded by heterozygotes - has been associated with a high degree of extended haplotype similarity. The relationship between classical betas haplotypes and long-range haplotype similarity may have both anthropological and clinical implications, but to date has not been explored. Here we evaluate the haplotype similarity of classical betas haplotypes over 400 kb in population samples from Jamaica, The Gambia, and among the Yoruba of Nigeria (Hapmap YRI). The most common betas sub-haplotype among Jamaicans and the Yoruba was the Benin haplotype, while in The Gambia the Senegal haplotype was observed most commonly. Both subtypes exhibited a high degree of long-range haplotype similarity extending across approximately 400 kb in all three populations. This long-range similarity was significantly greater than that seen for other haplotypes sampled in these populations (P < 0.001), and was independent of marker choice and marker density. Among the Yoruba, Benin haplotypes were highly conserved, with very strong linkage disequilibrium (LD) extending a megabase across the betas mutation. Two different classical betas haplotypes, sampled from different populations, exhibit comparable and extensive long-range haplotype similarity and strong LD. This LD extends across the adjacent recombination hotspot, and is discernable at distances in excess of 400 kb. Although the multi-centric geographic distribution of betas haplotypes indicates strong subdivision among early Holocene sub-Saharan populations, we find no evidence that selective pressures imposed by falciparum malaria varied in intensity or timing between these subpopulations. Our observations also suggest that cis-acting loci, which may influence outcomes in sickle
Polymorphism of 11 Y Chromosome Short Tandem Repeat Markers among Malaysian Aborigines.
Mohd Yussup, Sofia Sakina; Marzukhi, Marlia; Md-Zain, Badrul Munir; Mamat, Kamaruddin; Mohd Yusof, Farida Zuraina
2017-01-01
The conventional technique such as patrilocality suggests some substantial effects on population diversity. With that, this particular study investigated the paternal line, specifically Scientific Working Group on DNA Analysis Methods (SWGDAM)-recommended Y-STR markers, namely, DYS19, DYS385, DYS389I/II, DYS390, DYS391, DYS392, DYS393, DYS438, and DYS439. These markers were tested to compare 184 Orang Asli individuals from 3 tribes found in Peninsular Malaysia. As a result, the haplotype diversity and the discrimination capacity obtained were 0.9987 and 0.9076, respectively. Besides, the most diverse marker was DYS385b, whereas the least was DYS391. Furthermore, the Senoi and Proto-Malay tribes were found to be the most distant, whereas the Senoi and Negrito clans were almost similar to each other. In addition, the analysis of molecular variance analysis revealed 82% of variance within the population, but only 18% of difference between the tribes. Finally, the phylogenetic trees constructed using Neighbour Joining and UPGMA (Unweighted Pair Group Method with Arithmetic Mean) displayed several clusters that were tribe specific. With that, future studies are projected to analyse individuals based on more specific sub-tribes.
Uniparental genetic markers in South Amerindians
Bisso-Machado, Rafael; Bortolini, Maria Cátira; Salzano, Francisco Mauro
2012-01-01
A comprehensive review of uniparental systems in South Amerindians was undertaken. Variability in the Y-chromosome haplogroups were assessed in 68 populations and 1,814 individuals whereas that of Y-STR markers was assessed in 29 populations and 590 subjects. Variability in the mitochondrial DNA (mtDNA) haplogroup was examined in 108 populations and 6,697 persons, and sequencing studies used either the complete mtDNA genome or the highly variable segments 1 and 2. The diversity of the markers made it difficult to establish a general picture of Y-chromosome variability in the populations studied. However, haplogroup Q1a3a* was almost always the most prevalent whereas Q1a3* occurred equally in all regions, which suggested its prevalence among the early colonizers. The STR allele frequencies were used to derive a possible ancient Native American Q-clade chromosome haplotype and five of six STR loci showed significant geographic variation. Geographic and linguistic factors moderately influenced the mtDNA distributions (6% and 7%, respectively) and mtDNA haplogroups A and D correlated positively and negatively, respectively, with latitude. The data analyzed here provide rich material for understanding the biological history of South Amerindians and can serve as a basis for comparative studies involving other types of data, such as cultural data. PMID:22888284
Sakai, Satoki
2016-08-01
I developed a gametophytic self-incompatibility (SI) model to study the conditions leading to diversification in SI haplotypes. In the model, the SI system is assumed to be incomplete, and the pollen expressing a given specificity is not fully rejected by the pistils expressing the same specificity. I also assumed that mutations can occur that enhance the rejection of pollen by pistils with the same haplotype variant and reduce rejection by pistils with other variants in the same haplotype. I found that if such mutations occur, the new haplotypes (mutant variants) can stably coexist with the ancestral haplotype in which the mutant arose. This is because pollen bearing the new haplotype is most strongly rejected by pistils bearing the same new haplotype among the pistils in the population; hence, negative frequency-dependent selection prevents their fixation. I also performed simulations and found that the nearly complete SI system evolves from completely self-compatible populations and that SI haplotypes can increase to about 40-50 within a few thousand generations. On the basis of my findings, I propose that diversification of SI haplotypes occurred during the evolution of SI from self-compatibility.
Patterns of haplotypes for 92 cystic fibrosis mutations: Variability, association and recurrence
DOE Office of Scientific and Technical Information (OSTI.GOV)
Morral, N.; Llevadot, R.; Estivill, X.
1994-09-01
Most CFTR mutations are very uncommon among the cystic fibrosis population, with frequencies of less than 1%, and many are found only in specific areas. We have analyzed 92 CF mutations for several markers (4 microsatellites and 3 other polymorphisms) scattered in the CFTR gene. Haplotypes associated with these mutations can be used as a framework in the screening of chromosomes carrying unknown mutations. The association between mutation and haplotype reduces the number of mutations it is necessary to search for to a maximum of 16 for the same haplotype. Only mutations {triangle}F508, G542X and N1303K are associated with moremore » than one haplotype as a result of slippage at more than one microsatellite loci, suggesting that these three are the most ancient CF mutations. Recurrence has been found for at least 7 mutations: H199Y, R347P, L558S, R553X, 2184insA, 3272-26A{r_arrow}G, 3849+10kbC{r_arrow}T and R1162X. Also microsatellite analysis of chromosomes of several ethnic origins (Czech, Italian, Russian, Slovac and Spanish) suggested that possibility of three or more independent origins for mutations R334W, R347P, R1162X, and 3849+10kbC{r_arrow}T, which was confirmed by analysis of markers flanking these mutations.« less
Devesse, Laurence; Ballard, David; Davenport, Lucinda; Riethorst, Immy; Mason-Buck, Gabriella; Syndercombe Court, Denise
2018-05-01
By using sequencing technology to genotype loci of forensic interest it is possible to simultaneously target autosomal, X and Y STRs as well as identity, ancestry and phenotypic informative SNPs, resulting in a breadth of data obtained from a single run that is considerable when compared to that generated with standard technologies. It is important however that this information aligns with the genotype data currently obtained using commercially available kits for CE-based investigations such that results are compatible with existing databases and hence can be of use to the forensic community. In this work, 400 samples were typed using commercially available STR kits and CE, as well as using the Ilumina ForenSeq™ DNA Signature Prep Kit and MiSeq ® FGx to assess concordance of autosomal STRs and population variability. Results show a concordance rate between the two technologies exceeding 99.98% while numerous novel sequence based alleles are described. In order to make use of the sequence variation observed, sequence specific allele frequencies were generated for White British and British Chinese populations. Copyright © 2017 Elsevier B.V. All rights reserved.
Hanchard, Neil; Elzein, Abier; Trafford, Clare; Rockett, Kirk; Pinder, Margaret; Jallow, Muminatou; Harding, Rosalind; Kwiatkowski, Dominic; McKenzie, Colin
2007-01-01
Background The sickle (βs) mutation in the beta-globin gene (HBB) occurs on five "classical" βs haplotype backgrounds in ethnic groups of African ancestry. Strong selection in favour of the βs allele – a consequence of protection from severe malarial infection afforded by heterozygotes – has been associated with a high degree of extended haplotype similarity. The relationship between classical βs haplotypes and long-range haplotype similarity may have both anthropological and clinical implications, but to date has not been explored. Here we evaluate the haplotype similarity of classical βs haplotypes over 400 kb in population samples from Jamaica, The Gambia, and among the Yoruba of Nigeria (Hapmap YRI). Results The most common βs sub-haplotype among Jamaicans and the Yoruba was the Benin haplotype, while in The Gambia the Senegal haplotype was observed most commonly. Both subtypes exhibited a high degree of long-range haplotype similarity extending across approximately 400 kb in all three populations. This long-range similarity was significantly greater than that seen for other haplotypes sampled in these populations (P < 0.001), and was independent of marker choice and marker density. Among the Yoruba, Benin haplotypes were highly conserved, with very strong linkage disequilibrium (LD) extending a megabase across the βs mutation. Conclusion Two different classical βs haplotypes, sampled from different populations, exhibit comparable and extensive long-range haplotype similarity and strong LD. This LD extends across the adjacent recombination hotspot, and is discernable at distances in excess of 400 kb. Although the multi-centric geographic distribution of βs haplotypes indicates strong subdivision among early Holocene sub-Saharan populations, we find no evidence that selective pressures imposed by falciparum malaria varied in intensity or timing between these subpopulations. Our observations also suggest that cis-acting loci, which may influence
Krak, Karol; Vít, Petr; Belyayev, Alexander; Douda, Jan; Hreusová, Lucia; Mandák, Bohumil
2016-01-01
Reticulate evolution is characterized by occasional hybridization between two species, creating a network of closely related taxa below and at the species level. In the present research, we aimed to verify the hypothesis of the allopolyploid origin of hexaploid C. album s. str., identify its putative parents and estimate the frequency of allopolyploidization events. We sampled 122 individuals of the C. album aggregate, covering most of its distribution range in Eurasia. Our samples included putative progenitors of C. album s. str. of both ploidy levels, i.e. diploids (C. ficifolium, C. suecicum) and tetraploids (C. striatiforme, C. strictum). To fulfil these objectives, we analysed sequence variation in the nrDNA ITS region and the rpl32-trnL intergenic spacer of cpDNA and performed genomic in-situ hybridization (GISH). Our study confirms the allohexaploid origin of C. album s. str. Analysis of cpDNA revealed tetraploids as the maternal species. In most accessions of hexaploid C. album s. str., ITS sequences were completely or nearly completely homogenized towards the tetraploid maternal ribotype; a tetraploid species therefore served as one genome donor. GISH revealed a strong hybridization signal on the same eighteen chromosomes of C. album s. str. with both diploid species C. ficifolium and C. suecicum. The second genome donor was therefore a diploid species. Moreover, some individuals with completely unhomogenized ITS sequences were found. Thus, hexaploid individuals of C. album s. str. with ITS sequences homogenized to different degrees may represent hybrids of different ages. This proves the existence of at least two different allopolyploid lineages, indicating a polyphyletic origin of C. album s. str. PMID:27513342
Zhao, Xiaohong; Chen, Xiaogang; Zhao, Yuancun; Zhang, Shu; Gao, Zehua; Yang, Yiwen; Wang, Yufang; Zhang, Ji
2018-05-01
Insertion/deletion polymorphisms (indels), which combine the advantages of both short tandem repeats and single-nucleotide polymorphisms, are suitable for parentage testing. To overcome the limitations of the low polymorphism of di-allelic indels, we constructed a set of haplotypes with physically linked, multi-allelic indels. Candidate haplotypes were selected from the 1000 Genomes Project database, and were subject to the following criteria for inclusion: (i) each marker must have a minimum allele frequency (MAF) of ≥0.1 in the Han population of China; (ii) markers must exist in a non-coding region; (iii) the physical distance between a pair of candidate indels must be <500 bp; (iv) the allele length variation of each indel from 1 to 20 bp; (v) different haplotypes must be located on different chromosomes or chromosomal arms, or be more than 10 Mb apart if on the same chromosomal arm; and (vi) they must not be located across a recombination hotspot. A multiplex system with 11 haplotype markers, comprising 22 tri-allelic indel loci distributed over 10 chromosomes was developed. To validate the multiplex panel, we investigated the haplotype distribution in sets of two and three-generation pedigrees. The results demonstrated that the haplotypes consisting of multi-allelic indel markers exhibited higher polymorphism than a single indel locus, and thus provide Supplementary information for forensic kinship identification. Copyright © 2018 Elsevier B.V. All rights reserved.
Ginja, C; Penedo, M C T; Melucci, L; Quiroz, J; Martínez López, O R; Revidatti, M A; Martínez-Martínez, A; Delgado, J V; Gama, L T
2010-04-01
The ancestry of New World cattle was investigated through the analysis of mitochondrial and Y chromosome variation in Creoles from Argentina, Brazil, Mexico, Paraguay and the United States of America. Breeds that influenced the Creoles, such as Iberian native, British and Zebu, were also studied. Creoles showed high mtDNA diversity (H = 0.984 +/- 0.003) with a total of 78 haplotypes, and the European T3 matriline was the most common (72.1%). The African T1a haplogroup was detected (14.6%), as well as the ancestral African-derived AA matriline (11.9%), which was absent in the Iberian breeds. Genetic proximity among Creoles, Iberian and Atlantic Islands breeds was inferred through their sharing of mtDNA haplotypes. Y-haplotype diversity in Creoles was high (H = 0.779 +/- 0.019), with several Y1, Y2 and Y3 haplotypes represented. Iberian patrilines in Creoles were more difficult to infer and were reflected by the presence of H3Y1 and H6Y2. Y-haplotypes confirmed crossbreeding with British cattle, mainly of Hereford with Pampa Chaqueño and Texas Longhorn. Male-mediated Bos indicus introgression into Creoles was found in all populations, except Argentino1 (herd book registered) and Pampa Chaqueño. The detection of the distinct H22Y3 patriline with the INRA189-90 allele in Caracú suggests introduction of bulls directly from West Africa. Further studies of Spanish and African breeds are necessary to elucidate the origins of Creole cattle, and determine the exact source of their African lineages.
Forensic genetic study of 29 Y-STRs in Korean population.
Jung, Ju Yeon; Park, Ji-Hye; Oh, Yu-Li; Kwon, Han-Sol; Park, Hyun-Chul; Park, Kyung-Hwa; Kim, Eun Hye; Lee, Dong-Sub; Lim, Si-Keun
2016-11-01
In this study, we compared two recently released commercial Y-chromosomal short tandem repeat (Y-STR) kits: the PowerPlex Y23 System (PPY23) and Yfiler® Plus PCR amplification kit (YPlus). We performed validation studies, including sensitivity, tolerance to PCR inhibitors, and mixture analysis, and a population genetics study using 306 unrelated South Korean males. PPY23 and YPlus showed similar sensitivity, but PPY23 showed higher tolerance to humic acid than YPlus. Furthermore, the detection rate of unique minor alleles called from male/male mixtures was higher for PPY23 than for YPlus. Comparing the newly added loci, the mean values of gene diversity for PPY23 and YPlus were 0.6715 and 0.8158, respectively. The discrimination capacity in the 306 unrelated South Korean males for PPY23 was 0.9837, and that for YPlus was 0.9935. These results will inform the selection of suitable Y-STR kits based on the purpose of forensic DNA analysis. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Barton, James C; Barton, J Clayborn; Acton, Ronald T
2010-11-01
Human leukocyte antigen (HLA) haplotypes may influence iron phenotypes in patients with HFE hemochromatosis and could affect survival. We tabulated general characteristics of HLA-A and -B types and haplotypes of HFE C282Y/C282Y probands diagnosed in medical care and analyzed these data to identify HLA survival modifiers. There were 212 probands (130 men, 82 women). Mean follow-up was 12.0 ± 6.4 yr (0.1-41.2 yr; 34 deaths). HLA-A*03 was more prevalent in men (76.9% vs. 61.0% women; P = 0.0129); 35.4% of men and 29.3% of women had A*03, B*07; and 7.7% of men and 8.5% of women had A*03, B*14. Twenty-three probands had cirrhosis; none had A*03, B*14. Positivity for A*03 or A*03, B*07 was not a significant predictor or modifier of survival. In multiple regression analyses, A*03, B*14 predicted longer survival (P = 0.0004). Kaplan-Meier analysis confirmed longer survival in probands with A*03, B*14 (P = 0.0199, log-rank test). After excluding the 23 non-A*03, B*14 probands with cirrhosis, survival of probands with A*03, B*14 was still greater than that of probands without A*03, B*14 (P = 0.0254; log-rank test). Twenty-four years after diagnosis, cumulative survival of probands with and without A*03, B*14 was 100% and 58%, respectively. The percentage of deaths due to iron overload was lower in probands with A*03, B*14 (0% vs. 21.9%; P = 0.0392). In hemochromatosis probands with HFE C282Y/C282Y, survival was longer in those with HLA-A*03, B*14. Earlier age at diagnosis and less severe iron overload in probands with A*03, B*14 could explain this difference. © 2010 John Wiley & Sons A/S.
Santos, Sara; Oliveira, Manuela; Amorim, António; van Asch, Barbara
2014-11-01
The grapevine (Vitis vinifera subsp. vinifera) is one of the most important agricultural crops worldwide. A long interest in the historical origins of ancient and cultivated current grapevines, as well as the need to establish phylogenetic relationships and parentage, solve homonymies and synonymies, fingerprint cultivars and clones, and assess the authenticity of plants and wines has encouraged the development of genetic identification methods. STR analysis is currently the most commonly used method for these purposes. A large dataset of grapevines genotypes for many cultivars worldwide has been produced in the last decade using a common set of recommended dinucleotide nuclear STRs. This type of marker has been replaced by long core-repeat loci in standardized state-of-the-art human forensic genotyping. The first steps toward harmonized grapevine genotyping have already been taken to bring the genetic identification methods closer to human forensic STR standards by previous authors. In this context, we bring forward a set of basic suggestions that reinforce the need to (i) guarantee trueness-to-type of the sample; (ii) use the long core-repeat markers; (iii) verify the specificity and amplification consistency of PCR primers; (iv) sequence frequent alleles and use these standardized allele ladders; (v) consider mutation rates when evaluating results of STR-based parentage and pedigree analysis; (vi) genotype large and representative samples in order to obtain allele frequency databases; (vii) standardize genotype data by establishing allele nomenclature based on repeat number to facilitate information exchange and data compilation. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Indian Ocean Crossroads: Human Genetic Origin and Population Structure in the Maldives
Pijpe, Jeroen; Voogt, Alex; Oven, Mannis; Henneman, Peter; Gaag, Kristiaan J; Kayser, Manfred; Knijff, Peter
2013-01-01
The Maldives are an 850 km-long string of atolls located centrally in the northern Indian Ocean basin. Because of this geographic situation, the present-day Maldivian population has potential for uncovering genetic signatures of historic migration events in the region. We therefore studied autosomal DNA-, mitochondrial DNA-, and Y-chromosomal DNA markers in a representative sample of 141 unrelated Maldivians, with 119 from six major settlements. We found a total of 63 different mtDNA haplotypes that could be allocated to 29 mtDNA haplogroups, mostly within the M, R, and U clades. We found 66 different Y-STR haplotypes in 10 Y-chromosome haplogroups, predominantly H1, J2, L, R1a1a, and R2. Parental admixture analysis for mtDNA- and Y-haplogroup data indicates a strong genetic link between the Maldive Islands and mainland South Asia, and excludes significant gene flow from Southeast Asia. Paternal admixture from West Asia is detected, but cannot be distinguished from admixture from South Asia. Maternal admixture from West Asia is excluded. Within the Maldives, we find a subtle genetic substructure in all marker systems that is not directly related to geographic distance or linguistic dialect. We found reduced Y-STR diversity and reduced male-mediated gene flow between atolls, suggesting independent male founder effects for each atoll. Detected reduced female-mediated gene flow between atolls confirms a Maldives-specific history of matrilocality. In conclusion, our new genetic data agree with the commonly reported Maldivian ancestry in South Asia, but furthermore suggest multiple, independent immigration events and asymmetrical migration of females and males across the archipelago. Am J Phys Anthropol 151:58–67, 2013. © 2013 Wiley Periodicals, Inc. PMID:23526367
[Analysis of free foetal DNA in maternal plasma using STR loci].
Vodicka, R; Vrtel, R; Procházka, M; Santavá, A; Dusek, L; Vrbická, D; Singh, R; Krejciríková, E; Schneiderová, E; Santavý, J
2006-01-01
Problems of maternal and foetal genotype differentiation of maternal plasma in pregnant women are solved generally by real-time systems. In this case the specific probes are used to distinguish particular genotype. Mostly gonosomal sequences are utilised to recognise the male foetus. This work describes possibilities in free foetal DNA detection and quantification by STR. Artificial genotype mixtures ranging from 0,2 % to 100 % to simulate maternal and paternal genotypes and 27 DNA samples from pregnant women in different stage of pregnancy were used for DNA quantification and detection. Foetal genotype was confirmed by biological father genotyping. The detection was performed in STR from 21st chromosome Down syndrome (DS) responsible region by innovated (I) QF PCR which allows to reveal and quantify even very rare DNA mosaics. The STR quantification was assessed in artificial mixtures of genotypes and discriminability of particular genotypes was on the level of few percent. Foetal DNA was detected in 74 % of tested samples. The IQF PCR application in quantification and differentiation between maternal and foetal genotypes by STR loci could have importance in non-invasive prenatal diagnostics as another possible marker for DS risk assessment.
Haplotyping for disease association: a combinatorial approach.
Lancia, Giuseppe; Ravi, R; Rizzi, Romeo
2008-01-01
We consider a combinatorial problem derived from haplotyping a population with respect to a genetic disease, either recessive or dominant. Given a set of individuals, partitioned into healthy and diseased, and the corresponding sets of genotypes, we want to infer "bad'' and "good'' haplotypes to account for these genotypes and for the disease. Assume e.g. the disease is recessive. Then, the resolving haplotypes must consist of bad and good haplotypes, so that (i) each genotype belonging to a diseased individual is explained by a pair of bad haplotypes and (ii) each genotype belonging to a healthy individual is explained by a pair of haplotypes of which at least one is good. We prove that the associated decision problem is NP-complete. However, we also prove that there is a simple solution, provided the data satisfy a very weak requirement.
Dogan, Serkan; Kovacević, Lejla; Marjanović, Damir
2013-12-01
Allele frequencies of 15 STRs included in the PowerPlex 16 System (D3S1358, TH01, D21S11, D18S51, Penta E, D5S818, D13S317, D7S820, D16S539, CSF1PO, Penta D, VWA, D8S1179, TPOX and FGA) were calculated from the referent sample of 100 unrelated individuals of both sexes from Turkish student population living in Sarajevo, Bosnia and Herzegovina. Buccal swab, as a source of DNA, was collected from the volunteers from whom the informed consent form was obtained. DNA extraction was performed using QIAamp DNA Micro kit by Qiagen. DNA template ranging from 0.5 to 2 ng was used to amplify 15 STR loci by PCR multiplex amplification which was performed by using the PowerPlex 16 kit (Promega Corp., Madison, WI, USA) according to the manufacturer's protocol. The amplifications were carried out in a PE Gene Amp PCR System thermal cycler (Applied Biosystems) and capillary electrophoresis was carried out in an ABI PRISM 310 Genetic Analyzer (Applied Biosystems) in accordance with the manufacturer's recommendations. The frequency of each locus was calculated from the numbers of each observed genotype. Deviation from Hardy-Weinberg equilibrium and observed heterozygosity were calculated. Data were analyzed by using Microsoft Excel workbook template--Powerstats V12 and the power of discrimination (PD), power of exclusion (PE), as well as other population genetic indices for the 15 STR loci were calculated. Obtained results contribute to existing Turkish DNA database, as well as insight of differences and similarities in comparison to population of Bosnia and Herzegovina. In addition, 13 autosomal STR loci frequencies (D3S1358, TH01, D21S11, D18S51, Penta E, D5S818, D13S317, D7S820, D16S539, CSFIPO, Penta D, VWA, D8S1 179, TPOX, and FGA) were studied in 15 different worldwide populations (Turkish, Bosnian, Croatian, Serbian, Montenegrin, Macedonian, Albanian, Kosovan, Greek, Russian, Japanese, Korean, Lithuanian, Iraqi, Belarusian). For the proof of corresponding data, two different
Van Neste, Christophe; Vandewoestyne, Mado; Van Criekinge, Wim; Deforce, Dieter; Van Nieuwerburgh, Filip
2014-03-01
Forensic scientists are currently investigating how to transition from capillary electrophoresis (CE) to massive parallel sequencing (MPS) for analysis of forensic DNA profiles. MPS offers several advantages over CE such as virtually unlimited multiplexy of loci, combining both short tandem repeat (STR) and single nucleotide polymorphism (SNP) loci, small amplicons without constraints of size separation, more discrimination power, deep mixture resolution and sample multiplexing. We present our bioinformatic framework My-Forensic-Loci-queries (MyFLq) for analysis of MPS forensic data. For allele calling, the framework uses a MySQL reference allele database with automatically determined regions of interest (ROIs) by a generic maximal flanking algorithm which makes it possible to use any STR or SNP forensic locus. Python scripts were designed to automatically make allele calls starting from raw MPS data. We also present a method to assess the usefulness and overall performance of a forensic locus with respect to MPS, as well as methods to estimate whether an unknown allele, which sequence is not present in the MySQL database, is in fact a new allele or a sequencing error. The MyFLq framework was applied to an Illumina MiSeq dataset of a forensic Illumina amplicon library, generated from multilocus STR polymerase chain reaction (PCR) on both single contributor samples and multiple person DNA mixtures. Although the multilocus PCR was not yet optimized for MPS in terms of amplicon length or locus selection, the results show excellent results for most loci. The results show a high signal-to-noise ratio, correct allele calls, and a low limit of detection for minor DNA contributors in mixed DNA samples. Technically, forensic MPS affords great promise for routine implementation in forensic genomics. The method is also applicable to adjacent disciplines such as molecular autopsy in legal medicine and in mitochondrial DNA research. Copyright © 2013 The Authors. Published by
Phylogeny and Haplotype Analysis of Fungi Within the Fusarium incarnatum-equiseti Species Complex.
Ramdial, H; Latchoo, R K; Hosein, F N; Rampersad, S N
2017-01-01
Fusarium spp. are ranked among the top 10 most economically and scientifically important plant-pathogenic fungi in the world and are associated with plant diseases that include fruit decay of a number of crops. Fusarium isolates infecting bell pepper in Trinidad were identified based on sequence comparisons of the translation elongation factor gene (EF-1a) with sequences of Fusarium incarnatum-equiseti species complex (FIESC) verified in the FUSARIUM-ID database. Eighty-two isolates were identified as belonging to one of four phylogenetic species within the subclades FIESC-1, FIESC-15, FIESC-16, and FIESC-26, with the majority of isolates belonging to FIESC-15. A comparison of the level of DNA polymorphism and phylogenetic inference for sequences of the internal transcribed spacer region (ITS1-5.8S-ITS2) and EF-1a sequences for Trinidad and FUSARIUM-ID type species was carried out. The ITS sequences were less informative, had lower haplotype diversity and restricted haplotype distribution, and resulted in poor resolution and taxa placement in the consensus maximum-likelihood tree. EF-1a sequences enabled strongly supported phylogenetic inference with highly resolved branching patterns of the 30 phylogenetic species within the FIESC and placement of representative Trinidad isolates. Therefore, global phylogeny was inferred from EF-1a sequences representing 11 countries, and separation into distinct Incarnatum and Equiseti clades was again evident. In total, 42 haplotypes were identified: 12 were shared and the remaining were unique haplotypes. The most diverse haplotype was represented by sequences from China, Indonesia, Malaysia, and Trinidad and consisted exclusively of F. incarnatum isolates. Spain had the highest haplotype diversity, perhaps because both F. equiseti and F. incarnatum sequences were represented; followed by the United States, which contributed both F. equiseti and F. incarnatum sequences to the data set; then by countries representing Southeast
Factor IX gene haplotypes in Amerindians.
Franco, R F; Araújo, A G; Zago, M A; Guerreiro, J F; Figueiredo, M S
1997-02-01
We have determined the haplotypes of the factor IX gene for 95 Indians from 5 Brazilian Amazon tribes: Wayampí, Wayana-Apalaí, Kayapó, Arára, and Yanomámi. Eight polymorphisms linked to the factor IX gene were investigated: MseI (at 5', nt -698), BamHI (at 5', nt -561), DdeI (intron 1), BamHI (intron 2), XmnI (intron 3), TaqI (intron 4), MspI (intron 4), and HhaI (at 3', approximately 8 kb). The results of the haplotype distribution and the allele frequencies for each of the factor IX gene polymorphisms in Amerindians were similar to the results reported for Asian populations but differed from results for other ethnic groups. Only five haplotypes were identified within the entire Amerindian study population, and the haplotype distribution was significantly different among the five tribes, with one (Arára) to four (Wayampí) haplotypes being found per tribe. These findings indicate a significant heterogeneity among the Indian tribes and contrast with the homogeneous distribution of the beta-globin gene cluster haplotypes but agree with our recent findings on the distribution of alpha-globin gene cluster haplotypes and the allele frequencies for six VNTRs in the same Amerindian tribes. Our data represent the first study of factor IX-associated polymorphisms in Amerindian populations and emphasizes the applicability of these genetic markers for population and human evolution studies.
Vitamin K epoxide reductase complex subunit 1 (Vkorc1) haplotype diversity in mouse priority strains
Song, Ying; Vera, Nicole; Kohn, Michael H
2008-01-01
Background Polymorphisms in the vitamin K-epoxide reductase complex subunit 1 gene, Vkorc1, could affect blood coagulation and other vitamin K-dependent proteins, such as osteocalcin (bone Gla protein, BGP). Here we sequenced the Vkorc1 gene in 40 mouse priority strains. We analyzed Vkorc1 haplotypes with respect to prothrombin time (PT) and bone mineral density and composition (BMD and BMC); phenotypes expected to be vitamin K-dependent and represented by data in the Mouse Phenome Database (MPD). Findings In the commonly used laboratory strains of Mus musculus domesticus we identified only four haplotypes differing in the intron or 5' region sequence of the Vkorc1. Six haplotypes differing by coding and non-coding polymorphisms were identified in the other subspecies of Mus. We detected no significant association of Vkorc1 haplotypes with PT, BMD and BMC within each subspecies of Mus. Vkorc1 haplotype sequences divergence between subspecies was associated with PT, BMD and BMC. Conclusion Phenotypic variation in PT, BMD and BMC within subspecies of Mus, while substantial, appears to be dominated by genetic variation in genes other than the Vkorc1. This was particularly evident for M. m. domesticus, where a single haplotype was observed in conjunction with virtually the entire range of PT, BMD and BMC values of all 5 subspecies of Mus included in this study. Differences in these phenotypes between subspecies also should not be attributed to Vkorc1 variants, but should be viewed as a result of genome wide genetic divergence. PMID:19046458
Stochastic sampling effects in STR typing: Implications for analysis and interpretation.
Timken, Mark D; Klein, Sonja B; Buoncristiani, Martin R
2014-07-01
The analysis and interpretation of forensic STR typing results can become more complicated when reduced template amounts are used for PCR amplification due to increased stochastic effects. These effects are typically observed as reduced heterozygous peak-height balance and increased frequency of undetected alleles (allelic "dropout"). To investigate the origins of these effects, a study was performed using the AmpFlSTR(®) Identifiler Plus(®) and MiniFiler(®) kits to amplify replicates from a dilution series of NIST Human DNA Quantitation Standard (SRM(®) 2372A). The resulting amplicons were resolved and detected on two different genetic analyzer platforms, the Applied Biosystems 3130xL and 3500 analyzers. Results from our study show that the four different STR/genetic analyzer combinations exhibited very similar peak-height ratio statistics when normalized for the amount of template DNA in the PCR. Peak-height ratio statistics were successfully modeled using the Poisson distribution to simulate pre-PCR stochastic sampling of the alleles, confirming earlier explanations that sampling is the primary source for peak-height imbalance in reduced template dilutions. In addition, template-based pre-PCR sampling simulations also successfully predicted allelic dropout frequencies, as modeled by logistic regression methods, for the low-template DNA dilutions. We discuss the possibility that an accurately quantified DNA template might be used to characterize the linear signal response for data collected using different STR kits or genetic analyzer platforms, so as to provide a standardized approach for comparing results obtained from different STR/CE combinations and to aid in validation studies. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Detecting structure of haplotypes and local ancestry
USDA-ARS?s Scientific Manuscript database
We present a two-layer hidden Markov model to detect the structure of haplotypes for unrelated individuals. This allows us to model two scales of linkage disequilibrium (one within a group of haplotypes and one between groups), thereby taking advantage of rich haplotype information to infer local an...
Extended Islands of Tractability for Parsimony Haplotyping
NASA Astrophysics Data System (ADS)
Fleischer, Rudolf; Guo, Jiong; Niedermeier, Rolf; Uhlmann, Johannes; Wang, Yihui; Weller, Mathias; Wu, Xi
Parsimony haplotyping is the problem of finding a smallest size set of haplotypes that can explain a given set of genotypes. The problem is NP-hard, and many heuristic and approximation algorithms as well as polynomial-time solvable special cases have been discovered. We propose improved fixed-parameter tractability results with respect to the parameter "size of the target haplotype set" k by presenting an O *(k 4k )-time algorithm. This also applies to the practically important constrained case, where we can only use haplotypes from a given set. Furthermore, we show that the problem becomes polynomial-time solvable if the given set of genotypes is complete, i.e., contains all possible genotypes that can be explained by the set of haplotypes.
Y-Chromosome Markers for the Red Fox.
Rando, Halie M; Stutchman, Jeremy T; Bastounes, Estelle R; Johnson, Jennifer L; Driscoll, Carlos A; Barr, Christina S; Trut, Lyudmila N; Sacks, Benjamin N; Kukekova, Anna V
2017-09-01
The de novo assembly of the red fox (Vulpes vulpes) genome has facilitated the development of genomic tools for the species. Efforts to identify the population history of red foxes in North America have previously been limited by a lack of information about the red fox Y-chromosome sequence. However, a megabase of red fox Y-chromosome sequence was recently identified over 2 scaffolds in the reference genome. Here, these scaffolds were scanned for repeated motifs, revealing 194 likely microsatellites. Twenty-three of these loci were selected for primer development and, after testing, produced a panel of 11 novel markers that were analyzed alongside 2 markers previously developed for the red fox from dog Y-chromosome sequence. The markers were genotyped in 76 male red foxes from 4 populations: 7 foxes from Newfoundland (eastern Canada), 12 from Maryland (eastern United States), and 9 from the island of Great Britain, as well as 48 foxes of known North American origin maintained on an experimental farm in Novosibirsk, Russia. The full marker panel revealed 22 haplotypes among these red foxes, whereas the 2 previously known markers alone would have identified only 10 haplotypes. The haplotypes from the 4 populations clustered primarily by continent, but unidirectional gene flow from Great Britain and farm populations may influence haplotype diversity in the Maryland population. The development of new markers has increased the resolution at which red fox Y-chromosome diversity can be analyzed and provides insight into the contribution of males to red fox population diversity and patterns of phylogeography. © The American Genetic Association 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
STR-validator: an open source platform for validation and process control.
Hansson, Oskar; Gill, Peter; Egeland, Thore
2014-11-01
This paper addresses two problems faced when short tandem repeat (STR) systems are validated for forensic purposes: (1) validation is extremely time consuming and expensive, and (2) there is strong consensus about what to validate but not how. The first problem is solved by powerful data processing functions to automate calculations. Utilising an easy-to-use graphical user interface, strvalidator (hereafter referred to as STR-validator) can greatly increase the speed of validation. The second problem is exemplified by a series of analyses, and subsequent comparison with published material, highlighting the need for a common validation platform. If adopted by the forensic community STR-validator has the potential to standardise the analysis of validation data. This would not only facilitate information exchange but also increase the pace at which laboratories are able to switch to new technology. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
FaSTR DNA: a new expert system for forensic DNA analysis.
Power, Timothy; McCabe, Brendan; Harbison, Sally Ann
2008-06-01
The automation of DNA profile analysis of reference and crime samples continues to gain pace driven in part by a realisation by the criminal justice system of the positive impact DNA technology can have in aiding in the solution of crime and the apprehension of suspects. Expert systems to automate the profile analysis component of the process are beginning to be developed. In this paper, we report the validation of a new expert system FaSTR DNA, an expert system suitable for the analysis of DNA profiles from single source reference samples and from crime samples. We compare the performance of FaSTR DNA with that of other equivalent systems, GeneMapper ID v3.2 (Applied Biosystems, Foster City, CA) and FSS-i(3) v4 (The Forensic Science Service((R)) DNA expert System Suite FSS-i(3), Forensic Science Service, Birmingham, UK) with GeneScan Analysis v3.7/Genotyper v3.7 software (Applied Biosystems, Foster City, CA, USA) with manual review. We have shown that FaSTR DNA provides an alternative solution to automating DNA profile analysis and is appropriate for implementation into forensic laboratories. The FaSTR DNA system was demonstrated to be comparable in performance to that of GeneMapper ID v3.2 and superior to that of FSS-i(3) v4 for the analysis of DNA profiles from crime samples.
Reconstruction of Haplotype-Blocks Selected during Experimental Evolution.
Franssen, Susanne U; Barton, Nicholas H; Schlötterer, Christian
2017-01-01
The genetic analysis of experimentally evolving populations typically relies on short reads from pooled individuals (Pool-Seq). While this method provides reliable allele frequency estimates, the underlying haplotype structure remains poorly characterized. With small population sizes and adaptive variants that start from low frequencies, the interpretation of selection signatures in most Evolve and Resequencing studies remains challenging. To facilitate the characterization of selection targets, we propose a new approach that reconstructs selected haplotypes from replicated time series, using Pool-Seq data. We identify selected haplotypes through the correlated frequencies of alleles carried by them. Computer simulations indicate that selected haplotype-blocks of several Mb can be reconstructed with high confidence and low error rates, even when allele frequencies change only by 20% across three replicates. Applying this method to real data from D. melanogaster populations adapting to a hot environment, we identify a selected haplotype-block of 6.93 Mb. We confirm the presence of this haplotype-block in evolved populations by experimental haplotyping, demonstrating the power and accuracy of our haplotype reconstruction from Pool-Seq data. We propose that the combination of allele frequency estimates with haplotype information will provide the key to understanding the dynamics of adaptive alleles. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
StrBioLib: a Java library for development of custom computationalstructural biology applications
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chandonia, John-Marc
2007-05-14
Summary: StrBioLib is a library of Java classes useful fordeveloping software for computational structural biology research.StrBioLib contains classes to represent and manipulate proteinstructures, biopolymer sequences, sets of biopolymer sequences, andalignments between biopolymers based on either sequence or structure.Interfaces are provided to interact with commonly used bioinformaticsapplications, including (PSI)-BLAST, MODELLER, MUSCLE, and Primer3, andtools are provided to read and write many file formats used to representbioinformatic data. The library includes a general-purpose neural networkobject with multiple training algorithms, the Hooke and Jeeves nonlinearoptimization algorithm, and tools for efficient C-style string parsingand formatting. StrBioLib is the basis for the Pred2ary secondarystructure predictionmore » program, is used to build the ASTRAL compendium forsequence and structure analysis, and has been extensively tested throughuse in many smaller projects. Examples and documentation are available atthe site below.Availability: StrBioLib may be obtained under the terms ofthe GNU LGPL license from http://strbio.sourceforge.net/Contact:JMChandonia@lbl.gov« less
Park, Joonhong; Kim, Myungshin; Jang, Woori; Chae, Hyojin; Kim, Yonggoo; Chung, Nack-Gyun; Lee, Jae-Wook; Cho, Bin; Jeong, Dae-Chul; Park, In Yang; Park, Mi Sun
2015-05-01
A common ancestral haplotype is strongly suggested in the Korean and Japanese patients with Fanconi anemia (FA), because common mutations have been frequently found: c.2546delC and c.3720_3724delAAACA of FANCA; c.307+1G>C, c.1066C>T, and c.1589_1591delATA of FANCG. Our aim in this study was to investigate the origin of these common mutations of FANCA and FANCG. We genotyped 13 FA patients consisting of five FA-A patients and eight FA-G patients from the Korean FA population. Microsatellite markers used for haplotype analysis included four CA repeat markers which are closely linked with FANCA and eight CA repeat markers which are contiguous with FANCG. As a result, Korean FA-A patients carrying c.2546delC or c.3720_3724delAAACA did not share the same haplotypes. However, three unique haplotypes carrying c.307+1G>C, c.1066C > T, or c.1589_1591delATA, that consisted of eight polymorphic loci covering a flanking region were strongly associated with Korean FA-G, consistent with founder haplotypes reported previously in the Japanese FA-G population. Our finding confirmed the common ancestral haplotypes on the origins of the East Asian FA-G patients, which will improve our understanding of the molecular population genetics of FA-G. To the best of our knowledge, this is the first report on the association between disease-linked mutations and common ancestral haplotypes in the Korean FA population. © 2015 John Wiley & Sons Ltd/University College London.
Tomazetto, Geizecler; Hahnke, Sarah; Wibberg, Daniel; Pühler, Alfred; Klocke, Michael; Schlüter, Andreas
2018-06-01
Proteiniphilum saccharofermentans str. M3/6 T is a recently described species within the family Porphyromonadaceae (phylum Bacteroidetes ), which was isolated from a mesophilic laboratory-scale biogas reactor. The genome of the strain was completely sequenced and manually annotated to reconstruct its metabolic potential regarding biomass degradation and fermentation pathways. The P. saccharofermentans str. M3/6 T genome consists of a 4,414,963 bp chromosome featuring an average GC-content of 43.63%. Genome analyses revealed that the strain possesses 3396 protein-coding sequences. Among them are 158 genes assigned to the carbohydrate-active-enzyme families as defined by the CAZy database, including 116 genes encoding glycosyl hydrolases (GHs) involved in pectin, arabinogalactan, hemicellulose (arabinan, xylan, mannan, β-glucans), starch, fructan and chitin degradation. The strain also features several transporter genes, some of which are located in polysaccharide utilization loci (PUL). PUL gene products are involved in glycan binding, transport and utilization at the cell surface. In the genome of strain M3/6 T , 64 PUL are present and most of them in association with genes encoding carbohydrate-active enzymes. Accordingly, the strain was predicted to metabolize several sugars yielding carbon dioxide, hydrogen, acetate, formate, propionate and isovalerate as end-products of the fermentation process. Moreover, P. saccharofermentans str. M3/6 T encodes extracellular and intracellular proteases and transporters predicted to be involved in protein and oligopeptide degradation. Comparative analyses between P. saccharofermentans str. M3/6 T and its closest described relative P. acetatigenes str. DSM 18083 T indicate that both strains share a similar metabolism regarding decomposition of complex carbohydrates and fermentation of sugars.
Martins, Joyce A; Martins, Denise P; Oliveira-Brancati, Camila I F; Martinez, Juliana; Cicarelli, Regina M B; Souza, Dorotéia R S
2017-11-01
Studies with X-STR loci show population genetic substructure, which makes necessary the characterization of such markers in the different geographical and/or ethnic populations. Therefore, this study assessed the distribution and forensic efficiency of an X-STR decaplex system in the population of the State of Mato Grosso, as well as analysed the population structure of this State based on the aforementioned system. All X-STR markers were in Hardy-Weinberg equilibrium and linkage equilibrium, and the DXS6809 was the most informative marker. The power of discrimination value in females and males was 0.99999999995 and 0.9999994, respectively. Analysis of molecular variance indicated 1.10% (p < 0.00001) of heterogeneity among Europeans, Africans, Brazilians and other Latin Americans, and in relation to such groups, the population of the State of Mato Grosso showed lower genetic variation when compared with the Brazilian group (-0.10%, p = 0.67327). The genetic distance analysis showed lower values of F ST (0.0004 ≤ F ST ≤ 0.00331), with non-significant p value (p > 0.00024), between the populations of Mato Grosso and Mato Grosso do Sul, Paraná and the Southeast region of Brazil (except for one sample of Rio de Janeiro). F ST values with significant p values (p ≤ 0.00024) were obtained between the population of Mato Grosso and Iberian, African and some Latin American populations. The X-STR decaplex system proved to be extremely useful in the population of the State of Mato Grosso, and the data obtained does not show the need for a specific forensic database for this State in relation to the Brazilian populations compared in this study, except for population of Rio de Janeiro.
Evaluation of a 13-loci STR multiplex system for Cannabis sativa genetic identification.
Houston, Rachel; Birck, Matthew; Hughes-Stamm, Sheree; Gangitano, David
2016-05-01
Marijuana (Cannabis sativa) is the most commonly used illicit substance in the USA. The development of a validated method using Cannabis short tandem repeats (STRs) could aid in the individualization of samples as well as serve as an intelligence tool to link multiple cases. For this purpose, a modified 13-loci STR multiplex method was optimized and evaluated according to ISFG and SWGDAM guidelines. A real-time PCR quantification method for C. sativa was developed and validated, and a sequenced allelic ladder was also designed to accurately genotype 199 C. sativa samples from 11 U.S. Customs and Border Protection seizures. Distinguishable DNA profiles were generated from 127 samples that yielded full STR profiles. Four duplicate genotypes within seizures were found. The combined power of discrimination of this multilocus system is 1 in 70 million. The sensitivity of the multiplex STR system is 0.25 ng of template DNA. None of the 13 STR markers cross-reacted with any of the studied species, except for Humulus lupulus (hops) which generated unspecific peaks. Phylogenetic analysis and case-to-case pairwise comparison of 11 cases using F st as genetic distance revealed the genetic association of four groups of cases. Moreover, due to their genetic similarity, a subset of samples (N = 97) was found to form a homogeneous population in Hardy-Weinberg and linkage equilibrium. The results of this research demonstrate the applicability of this 13-loci STR system in associating Cannabis cases for intelligence purposes.
Haapalainen, Minna L; Wang, Jinhui; Latvala, Satu; Lehtonen, Mikko T; Pirhonen, Minna; Nissinen, Anne I
2018-03-30
'Candidatus Liberibacter solanacearum' (CLso) haplotype C is associated with disease in carrots and transmitted by the carrot psyllid Trioza apicalis. To identify possible other sources and vectors of this pathogen in Finland, samples were taken of wild plants within and near the carrot fields, the psyllids feeding on these plants, parsnips growing next to carrots, and carrot seeds. For analyzing the genotype of the CLso positive samples, a multi-locus sequence typing (MLST) scheme was developed. CLso haplotype C was detected in 11% of the Trioza anthrisci samples, in 35% of the Anthriscus sylvestris plants with discoloration, and in parsnips showing leaf discoloration. MLST revealed that the CLso in T. anthrisci and most A. sylvestris plants represent different strains than the bacteria found in T. apicalis and the cultivated plants. CLso haplotype D was detected in two of the 34 carrot seed lots tested, but was not detected in the plants grown from these seeds. Phylogenetic analysis by UPGMA clustering suggested that the haplotype D is more closely related to the haplotype A than to C. A novel, sixth haplotype of CLso, most closely related to A and D, was found in the psyllid Trioza urticae and stinging nettle (Urtica dioica, Urticaceae), and named as haplotype U.
Mutation rate estimation for 15 autosomal STR loci in a large population from Mainland China.
Zhao, Zhuo; Zhang, Jie; Wang, Hua; Liu, Zhi-Peng; Liu, Ming; Zhang, Yuan; Sun, Li; Zhang, Hui
2015-09-01
STR, short tandem repeats, are well known as a type of powerful genetic marker and widely used in studying human population genetics. Compared with the conventional genetic markers, the mutation rate of STR is higher. Additionally, the mutations of STR loci do not lead to genetic inconsistencies between the genotypes of parents and children; therefore, the analysis of STR mutation is more suited to assess the population mutation. In this study, we focused on 15 autosomal STR loci. DNA samples from a total of 42,416 unrelated healthy individuals (19,037 trios) from the population of Mainland China collected between Jan 2012 and May 2014 were successfully investigated. In our study, the allele frequencies, paternal mutation rates, maternal mutation rates and average mutation rates were detected. Furthermore, we also investigated the relationship between paternal ages, maternal ages, area, the time of pregnancy and average mutation rate. We found that the paternal mutation rate was higher than the maternal mutation rate and the paternal, maternal, and average mutation rates had a positive correlation with paternal age, maternal age and the time of pregnancy respectively. Additionally, the average mutation rate of coastal areas was higher than that of inland areas.
Klitz, W; Maiers, M; Spellman, S; Baxter-Lowe, L A; Schmeckpeper, B; Williams, T M; Fernandez-Viña, M
2003-10-01
A collaborative study involving a large sample of European Americans was typed for the histocompatibility loci of the HLA DR-DQ region and subjected to intensive typing validation measures in order to accurately determine haplotype composition and frequency. The resulting tables have immediate application to HLA typing and allogeneic transplantation. The loci within the DR-DQ region are especially valuable for such an undertaking because of their tight linkage and high linkage disequilibrium. The 3798 haplotypes, derived from 1899 unrelated individuals, had a total of 75 distinct DRB1-DQA1-DQB1 haplotypes. The frequency distribution of the haplotypes was right skewed with haplotypes occurring at a frequency of less than 1% numbering 59 and yet constituting less than 12% of the total sample. Given DRB1 typing, it was possible to infer the exact DQA1 and DQB1 composition of a haplotype with high confidence (>90% likelihood) in 21 of the 35 high-resolution DRB1 alleles present in the sample. Of the DRB1 alleles without high reliability for DQ haplotype inference, only *0401, *0701 and *1302 were common, the remaining 11 DRB1 alleles constituting less than 5% of the total sample. This approach failed for the 13 serologically equivalent DR alleles in which only 33% of DQ haplotypes could be reliably inferred. The 36 DQA1-DQB1 haplotypes present in the total sample conformed to the known pattern of permissible heterodimers. Four DQA1-DQB1 haplotypes, all rare, are reported here for the first time. The haplotype frequency tables are suitable as a reference standard for HLA typing of the DR and DQ loci in European Americans.
Haplotype diversity in 11 candidate genes across four populations.
Beaty, T H; Fallin, M D; Hetmanski, J B; McIntosh, I; Chong, S S; Ingersoll, R; Sheng, X; Chakraborty, R; Scott, A F
2005-09-01
Analysis of haplotypes based on multiple single-nucleotide polymorphisms (SNP) is becoming common for both candidate gene and fine-mapping studies. Before embarking on studies of haplotypes from genetically distinct populations, however, it is important to consider variation both in linkage disequilibrium (LD) and in haplotype frequencies within and across populations, as both vary. Such diversity will influence the choice of "tagging" SNPs for candidate gene or whole-genome association studies because some markers will not be polymorphic in all samples and some haplotypes will be poorly represented or completely absent. Here we analyze 11 genes, originally chosen as candidate genes for oral clefts, where multiple markers were genotyped on individuals from four populations. Estimated haplotype frequencies, measures of pairwise LD, and genetic diversity were computed for 135 European-Americans, 57 Chinese-Singaporeans, 45 Malay-Singaporeans, and 46 Indian-Singaporeans. Patterns of pairwise LD were compared across these four populations and haplotype frequencies were used to assess genetic variation. Although these populations are fairly similar in allele frequencies and overall patterns of LD, both haplotype frequencies and genetic diversity varied significantly across populations. Such haplotype diversity has implications for designing studies of association involving samples from genetically distinct populations.
DeitY-TU face database: its design, multiple camera capturing, characteristics, and evaluation
NASA Astrophysics Data System (ADS)
Bhowmik, Mrinal Kanti; Saha, Kankan; Saha, Priya; Bhattacharjee, Debotosh
2014-10-01
The development of the latest face databases is providing researchers different and realistic problems that play an important role in the development of efficient algorithms for solving the difficulties during automatic recognition of human faces. This paper presents the creation of a new visual face database, named the Department of Electronics and Information Technology-Tripura University (DeitY-TU) face database. It contains face images of 524 persons belonging to different nontribes and Mongolian tribes of north-east India, with their anthropometric measurements for identification. Database images are captured within a room with controlled variations in illumination, expression, and pose along with variability in age, gender, accessories, make-up, and partial occlusion. Each image contains the combined primary challenges of face recognition, i.e., illumination, expression, and pose. This database also represents some new features: soft biometric traits such as mole, freckle, scar, etc., and facial anthropometric variations that may be helpful for researchers for biometric recognition. It also gives an equivalent study of the existing two-dimensional face image databases. The database has been tested using two baseline algorithms: linear discriminant analysis and principal component analysis, which may be used by other researchers as the control algorithm performance score.
The effect of using genealogy-based haplotypes for genomic prediction.
Edriss, Vahid; Fernando, Rohan L; Su, Guosheng; Lund, Mogens S; Guldbrandtsen, Bernt
2013-03-06
Genomic prediction uses two sources of information: linkage disequilibrium between markers and quantitative trait loci, and additive genetic relationships between individuals. One way to increase the accuracy of genomic prediction is to capture more linkage disequilibrium by regression on haplotypes instead of regression on individual markers. The aim of this study was to investigate the accuracy of genomic prediction using haplotypes based on local genealogy information. A total of 4429 Danish Holstein bulls were genotyped with the 50K SNP chip. Haplotypes were constructed using local genealogical trees. Effects of haplotype covariates were estimated with two types of prediction models: (1) assuming that effects had the same distribution for all haplotype covariates, i.e. the GBLUP method and (2) assuming that a large proportion (π) of the haplotype covariates had zero effect, i.e. a Bayesian mixture method. About 7.5 times more covariate effects were estimated when fitting haplotypes based on local genealogical trees compared to fitting individuals markers. Genealogy-based haplotype clustering slightly increased the accuracy of genomic prediction and, in some cases, decreased the bias of prediction. With the Bayesian method, accuracy of prediction was less sensitive to parameter π when fitting haplotypes compared to fitting markers. Use of haplotypes based on genealogy can slightly increase the accuracy of genomic prediction. Improved methods to cluster the haplotypes constructed from local genealogy could lead to additional gains in accuracy.
iXora: exact haplotype inferencing and trait association.
Utro, Filippo; Haiminen, Niina; Livingstone, Donald; Cornejo, Omar E; Royaert, Stefan; Schnell, Raymond J; Motamayor, Juan Carlos; Kuhn, David N; Parida, Laxmi
2013-06-06
We address the task of extracting accurate haplotypes from genotype data of individuals of large F1 populations for mapping studies. While methods for inferring parental haplotype assignments on large F1 populations exist in theory, these approaches do not work in practice at high levels of accuracy. We have designed iXora (Identifying crossovers and recombining alleles), a robust method for extracting reliable haplotypes of a mapping population, as well as parental haplotypes, that runs in linear time. Each allele in the progeny is assigned not just to a parent, but more precisely to a haplotype inherited from the parent. iXora shows an improvement of at least 15% in accuracy over similar systems in literature. Furthermore, iXora provides an easy-to-use, comprehensive environment for association studies and hypothesis checking in populations of related individuals. iXora provides detailed resolution in parental inheritance, along with the capability of handling very large populations, which allows for accurate haplotype extraction and trait association. iXora is available for non-commercial use from http://researcher.ibm.com/project/3430.
Facilitating Comprehension of Non-Native English Speakers during Lectures in English with STR-Texts
ERIC Educational Resources Information Center
Shadiev, Rustam; Wu, Ting-Ting; Huang, Yueh-Min
2018-01-01
We provided texts generated by speech-to text-recognition (STR) technology for non-native English speaking students during lectures in English in order to test whether STR-texts were useful for enhancing students' comprehension of lectures. To this end, we carried out an experiment in which 60 participants were randomly assigned to a control group…
StrBioLib: a Java library for development of custom computational structural biology applications.
Chandonia, John-Marc
2007-08-01
StrBioLib is a library of Java classes useful for developing software for computational structural biology research. StrBioLib contains classes to represent and manipulate protein structures, biopolymer sequences, sets of biopolymer sequences, and alignments between biopolymers based on either sequence or structure. Interfaces are provided to interact with commonly used bioinformatics applications, including (psi)-blast, modeller, muscle and Primer3, and tools are provided to read and write many file formats used to represent bioinformatic data. The library includes a general-purpose neural network object with multiple training algorithms, the Hooke and Jeeves non-linear optimization algorithm, and tools for efficient C-style string parsing and formatting. StrBioLib is the basis for the Pred2ary secondary structure prediction program, is used to build the astral compendium for sequence and structure analysis, and has been extensively tested through use in many smaller projects. Examples and documentation are available at the site below. StrBioLib may be obtained under the terms of the GNU LGPL license from http://strbio.sourceforge.net/
Lim, K B; Jeevan, N H; Jaya, P; Othman, M I; Lee, Y H
2001-06-01
Allele frequencies for the nine STRs genetic loci included in the AmpFlSTR Profiler kit were obtained from samples of unrelated individuals comprising 139-156 Malays, 149-153 Chinese and 132-135 Indians, residing in Malaysia.
Haplotype-Based Association Analysis via Variance-Components Score Test
Tzeng, Jung-Ying ; Zhang, Daowen
2007-01-01
Haplotypes provide a more informative format of polymorphisms for genetic association analysis than do individual single-nucleotide polymorphisms. However, the practical efficacy of haplotype-based association analysis is challenged by a trade-off between the benefits of modeling abundant variation and the cost of the extra degrees of freedom. To reduce the degrees of freedom, several strategies have been considered in the literature. They include (1) clustering evolutionarily close haplotypes, (2) modeling the level of haplotype sharing, and (3) smoothing haplotype effects by introducing a correlation structure for haplotype effects and studying the variance components (VC) for association. Although the first two strategies enjoy a fair extent of power gain, empirical evidence showed that VC methods may exhibit only similar or less power than the standard haplotype regression method, even in cases of many haplotypes. In this study, we report possible reasons that cause the underpowered phenomenon and show how the power of the VC strategy can be improved. We construct a score test based on the restricted maximum likelihood or the marginal likelihood function of the VC and identify its nontypical limiting distribution. Through simulation, we demonstrate the validity of the test and investigate the power performance of the VC approach and that of the standard haplotype regression approach. With suitable choices for the correlation structure, the proposed method can be directly applied to unphased genotypic data. Our method is applicable to a wide-ranging class of models and is computationally efficient and easy to implement. The broad coverage and the fast and easy implementation of this method make the VC strategy an effective tool for haplotype analysis, even in modern genomewide association studies. PMID:17924336
Kling, Daniel; Egeland, Thore; Mostad, Petter
2012-01-01
In a number of applications there is a need to determine the most likely pedigree for a group of persons based on genetic markers. Adequate models are needed to reach this goal. The markers used to perform the statistical calculations can be linked and there may also be linkage disequilibrium (LD) in the population. The purpose of this paper is to present a graphical Bayesian Network framework to deal with such data. Potential LD is normally ignored and it is important to verify that the resulting calculations are not biased. Even if linkage does not influence results for regular paternity cases, it may have substantial impact on likelihood ratios involving other, more extended pedigrees. Models for LD influence likelihoods for all pedigrees to some degree and an initial estimate of the impact of ignoring LD and/or linkage is desirable, going beyond mere rules of thumb based on marker distance. Furthermore, we show how one can readily include a mutation model in the Bayesian Network; extending other programs or formulas to include such models may require considerable amounts of work and will in many case not be practical. As an example, we consider the two STR markers vWa and D12S391. We estimate probabilities for population haplotypes to account for LD using a method based on data from trios, while an estimate for the degree of linkage is taken from the literature. The results show that accounting for haplotype frequencies is unnecessary in most cases for this specific pair of markers. When doing calculations on regular paternity cases, the markers can be considered statistically independent. In more complex cases of disputed relatedness, for instance cases involving siblings or so-called deficient cases, or when small differences in the LR matter, independence should not be assumed. (The networks are freely available at http://arken.umb.no/~dakl/BayesianNetworks.) PMID:22984448
Mitochondrial haplotype variation and phylogeography of Iberian brown trout populations.
MacHordom, A; Suárez, J; Almodóvar, A; Bautista, J M
2000-09-01
The biogeographical distribution of brown trout mitochondrial DNA haplotypes throughout the Iberian Peninsula was established by polymerase chain reaction-restriction fragment polymorphism analysis. The study of 507 specimens from 58 localities representing eight widely separated Atlantic-slope (north and west Iberian coasts) and six Mediterranean drainage systems served to identify five main groups of mitochondrial haplotypes: (i) haplotypes corresponding to non-native, hatchery-reared brown trout that were widely distributed but also found in wild populations of northern Spain (Cantabrian slope); (ii) a widespread Atlantic haplotype group; (iii) a haplotype restricted to the Duero Basin; (iv) a haplotype shown by southern Iberian populations; and (v) a Mediterranean haplotype. The Iberian distribution of these haplotypes reflects both the current fishery management policy of introducing non-native brown trout, and Messinian palaeobiogeography. Our findings complement and extend previous allozyme studies on Iberian brown trout and improve present knowledge of glacial refugia and postglacial movement of brown trout lineages.
Minimizing inhibition of PCR-STR typing using digital agarose droplet microfluidics.
Geng, Tao; Mathies, Richard A
2015-01-01
The presence of PCR inhibitors in forensic and other biological samples reduces the amplification efficiency, sometimes resulting in complete PCR failure. Here we demonstrate a high-performance digital agarose droplet microfluidics technique for single-cell and single-molecule forensic short tandem repeat (STR) typing of samples contaminated with high concentrations of PCR inhibitors. In our multifaceted strategy, the mitigation of inhibitory effects is achieved by the efficient removal of inhibitors from the porous agarose microgel droplets carrying the DNA template through washing and by the significant dilution of targets and remaining inhibitors to the stochastic limit within the ultralow nL volume droplet reactors. Compared to conventional tube-based bulk PCR, our technique shows enhanced (20 ×, 10 ×, and 16 ×) tolerance of urea, tannic acid, and humic acid, respectively, in STR typing of GM09948 human lymphoid cells. STR profiling of single cells is not affected by small soluble molecules like urea and tannic acid because of their effective elimination from the agarose droplets; however, higher molecular weight humic acid still partially inhibits single-cell PCR when the concentration is higher than 200 ng/μL. Nevertheless, the full STR profile of 9948 male genomic DNA contaminated with 500 ng/μL humic acid was generated by pooling and amplifying beads carrying single-molecule 9948 DNA PCR products in a single secondary reaction. This superior performance suggests that our digital agarose droplet microfluidics technology is a promising approach for analyzing low-abundance DNA targets in the presence of inhibitors. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
The effect of using genealogy-based haplotypes for genomic prediction
2013-01-01
Background Genomic prediction uses two sources of information: linkage disequilibrium between markers and quantitative trait loci, and additive genetic relationships between individuals. One way to increase the accuracy of genomic prediction is to capture more linkage disequilibrium by regression on haplotypes instead of regression on individual markers. The aim of this study was to investigate the accuracy of genomic prediction using haplotypes based on local genealogy information. Methods A total of 4429 Danish Holstein bulls were genotyped with the 50K SNP chip. Haplotypes were constructed using local genealogical trees. Effects of haplotype covariates were estimated with two types of prediction models: (1) assuming that effects had the same distribution for all haplotype covariates, i.e. the GBLUP method and (2) assuming that a large proportion (π) of the haplotype covariates had zero effect, i.e. a Bayesian mixture method. Results About 7.5 times more covariate effects were estimated when fitting haplotypes based on local genealogical trees compared to fitting individuals markers. Genealogy-based haplotype clustering slightly increased the accuracy of genomic prediction and, in some cases, decreased the bias of prediction. With the Bayesian method, accuracy of prediction was less sensitive to parameter π when fitting haplotypes compared to fitting markers. Conclusions Use of haplotypes based on genealogy can slightly increase the accuracy of genomic prediction. Improved methods to cluster the haplotypes constructed from local genealogy could lead to additional gains in accuracy. PMID:23496971
[Analysis on genetic polymorphism of 5 STR loci selected from X chromosome].
Liu, Qi-ji; Gong, Yao-qin; Zhang, Xi-yu; Gao, Gui-min; Li, Jiang-xia; Guo, Yi-shou
2005-02-01
To select short tandem repeats(STR) from X chromosome. STR is a universal genetic marker that has changeable polymorphism and stable heredity in human genome. It is a specific DNA segment composed of 2-6 base pairs as its core sequence. It is an ideal DNA marker used in linkage analysis and gene mapping. In this study, 8 short tandem repeats were selected from two genomic clones on X chromosome by using BCM Search Launcher. Primers amplifying the STR loci were designed by using Primer 3.0 according to the unique sequence flanking the STRs. Polymorphisms of the short tandem repeats in Chinese population were evaluated by PCR amplification and PAGE. Five of these STRs were polymorphic. Chi-square test indicated that the distribution of genotypes agreed with Hardy-Weinberg equilibrium (P>0.05). Five polymorphic short tandem repeats have been identified on chromosome X and will be useful for linkage analysis and gene mapping.
Zhang, Xiufeng; Hu, Liping; Du, Lei; Nie, Aiting; Rao, Min; Pang, Jing Bo; Nie, Shengjie
2017-05-01
The genetic polymorphisms of 20 autosomal short tandem repeat (STR) loci included in the PowerPlex® 21 kit were evaluated in 522 healthy unrelated Vietnamese from Yunnan, China. All of the loci reached the Hardy-Weinberg equilibrium. These loci were examined to determine allele frequencies and forensic statistical parameters. The combined discrimination power and probability of excluding paternity of the 20 STR loci were 0.999999999999999999999991 26 and 0.999999975, respectively. Results suggested that the 20 STR loci are highly polymorphic, which is suitable for forensic personal identification and paternity testing.
MHC Class II haplotypes of Colombian Amerindian tribes
Yunis, Juan J.; Yunis, Edmond J.; Yunis, Emilio
2013-01-01
We analyzed 1041 individuals belonging to 17 Amerindian tribes of Colombia, Chimila, Bari and Tunebo (Chibcha linguistic family), Embera, Waunana (Choco linguistic family), Puinave and Nukak (Maku-Puinave linguistic families), Cubeo, Guanano, Tucano, Desano and Piratapuyo (Tukano linguistic family), Guahibo and Guayabero (Guayabero Linguistic Family), Curripaco and Piapoco (Arawak linguistic family) and Yucpa (Karib linguistic family). for MHC class II haplotypes (HLA-DRB1, DQA1, DQB1). Approximately 90% of the MHC class II haplotypes found among these tribes are haplotypes frequently encountered in other Amerindian tribes. Nonetheless, striking differences were observed among Chibcha and non-Chibcha speaking tribes. The DRB1*04:04, DRB1*04:11, DRB1*09:01 carrying haplotypes were frequently found among non-Chibcha speaking tribes, while the DRB1*04:07 haplotype showed significant frequencies among Chibcha speaking tribes, and only marginal frequencies among non-Chibcha speaking tribes. Our results suggest that the differences in MHC class II haplotype frequency found among Chibcha and non-Chibcha speaking tribes could be due to genetic differentiation in Mesoamerica of the ancestral Amerindian population into Chibcha and non-Chibcha speaking populations before they entered into South America. PMID:23885196
Sparse Tensor Decomposition for Haplotype Assembly of Diploids and Polyploids.
Hashemi, Abolfazl; Zhu, Banghua; Vikalo, Haris
2018-03-21
Haplotype assembly is the task of reconstructing haplotypes of an individual from a mixture of sequenced chromosome fragments. Haplotype information enables studies of the effects of genetic variations on an organism's phenotype. Most of the mathematical formulations of haplotype assembly are known to be NP-hard and haplotype assembly becomes even more challenging as the sequencing technology advances and the length of the paired-end reads and inserts increases. Assembly of haplotypes polyploid organisms is considerably more difficult than in the case of diploids. Hence, scalable and accurate schemes with provable performance are desired for haplotype assembly of both diploid and polyploid organisms. We propose a framework that formulates haplotype assembly from sequencing data as a sparse tensor decomposition. We cast the problem as that of decomposing a tensor having special structural constraints and missing a large fraction of its entries into a product of two factors, U and [Formula: see text]; tensor [Formula: see text] reveals haplotype information while U is a sparse matrix encoding the origin of erroneous sequencing reads. An algorithm, AltHap, which reconstructs haplotypes of either diploid or polyploid organisms by iteratively solving this decomposition problem is proposed. The performance and convergence properties of AltHap are theoretically analyzed and, in doing so, guarantees on the achievable minimum error correction scores and correct phasing rate are established. The developed framework is applicable to diploid, biallelic and polyallelic polyploid species. The code for AltHap is freely available from https://github.com/realabolfazl/AltHap . AltHap was tested in a number of different scenarios and was shown to compare favorably to state-of-the-art methods in applications to haplotype assembly of diploids, and significantly outperforms existing techniques when applied to haplotype assembly of polyploids.
Yund, Philip O; Collins, Catherine; Johnson, Sheri L
2015-06-01
The colonial ascidian Botryllus schlosseri should be considered cryptogenic (i.e., not definitively classified as either native or introduced) in the Northwest Atlantic. Although all the evidence is quite circumstantial, over the last 15 years most research groups have accepted the scenario of human-mediated dispersal and classified B. schlosseri as introduced; others have continued to consider it native or cryptogenic. We address the invasion status of this species by adding 174 sequences to the growing worldwide database for the mitochondrial gene cytochrome c oxidase subunit I (COI) and analyzing 1077 sequences to compare genetic diversity of one clade of haplotypes in the Northwest Atlantic with two hypothesized source regions (the Northeast Atlantic and Mediterranean). Our results lead us to reject the prevailing view of the directionality of transport across the Atlantic. We argue that the genetic diversity patterns at COI are far more consistent with the existence of at least one haplotype clade in the Northwest Atlantic (and possibly a second) that substantially pre-dates human colonization from Europe, with this native North American clade subsequently introduced to three sites in Northeast Atlantic and Mediterranean waters. However, we agree with past researchers that some sites in the Northwest Atlantic have more recently been invaded by alien haplotypes, so that some populations are currently composed of a mixture of native and invader haplotypes. © 2015 Marine Biological Laboratory.
Tucker, Valerie C; Hopwood, Andrew J; Sprecher, Cynthia J; McLaren, Robert S; Rabbach, Dawn R; Ensenberger, Martin G; Thompson, Jonelle M; Storts, Douglas R
2011-11-01
In response to the ENFSI and EDNAP groups' call for new STR multiplexes for Europe, Promega(®) developed a suite of four new DNA profiling kits. This paper describes the developmental validation study performed on the PowerPlex(®) ESI 16 (European Standard Investigator 16) and the PowerPlex(®) ESI 17 Systems. The PowerPlex(®) ESI 16 System combines the 11 loci compatible with the UK National DNA Database(®), contained within the AmpFlSTR(®) SGM Plus(®) PCR Amplification Kit, with five additional loci: D2S441, D10S1248, D22S1045, D1S1656 and D12S391. The multiplex was designed to reduce the amplicon size of the loci found in the AmpFlSTR(®) SGM Plus(®) kit. This design facilitates increased robustness and amplification success for the loci used in the national DNA databases created in many countries, when analyzing degraded DNA samples. The PowerPlex(®) ESI 17 System amplifies the same loci as the PowerPlex(®) ESI 16 System, but with the addition of a primer pair for the SE33 locus. Tests were designed to address the developmental validation guidelines issued by the Scientific Working Group on DNA Analysis Methods (SWGDAM), and those of the DNA Advisory Board (DAB). Samples processed include DNA mixtures, PCR reactions spiked with inhibitors, a sensitivity series, and 306 United Kingdom donor samples to determine concordance with data generated with the AmpFlSTR(®) SGM Plus(®) kit. Allele frequencies from 242 white Caucasian samples collected in the United Kingdom are also presented. The PowerPlex(®) ESI 16 and ESI 17 Systems are robust and sensitive tools, suitable for the analysis of forensic DNA samples. Full profiles were routinely observed with 62.5pg of a fully heterozygous single source DNA template. This high level of sensitivity was found to impact on mixture analyses, where 54-86% of unique minor contributor alleles were routinely observed in a 1:19 mixture ratio. Improved sensitivity combined with the robustness afforded by smaller amplicons
Pemberton, T J; Jakobsson, M; Conrad, D F; Coop, G; Wall, J D; Pritchard, J K; Patel, P I; Rosenberg, N A
2008-07-01
When performing association studies in populations that have not been the focus of large-scale investigations of haplotype variation, it is often helpful to rely on genomic databases in other populations for study design and analysis - such as in the selection of tag SNPs and in the imputation of missing genotypes. One way of improving the use of these databases is to rely on a mixture of database samples that is similar to the population of interest, rather than using the single most similar database sample. We demonstrate the effectiveness of the mixture approach in the application of African, European, and East Asian HapMap samples for tag SNP selection in populations from India, a genetically intermediate region underrepresented in genomic studies of haplotype variation.
Yuasa, Isao; Jin, Feng; Harihara, Shinji; Matsusue, Aya; Fujihara, Junko; Takeshita, Haruo; Akane, Atsushi; Umetsu, Kazuo; Saitou, Naruya; Chattopadhyay, Prasanta K
2013-09-01
Previous studies of four populations revealed that a hypervariable short tandem repeat (iSTR) in intron 7 of the human complement factor I (CFI) gene on chromosome 4q was unique, with 17 possible East Asian-specific group H alleles observed at relatively high frequencies. To develop a deeper anthropological and forensic understanding of iSTR, 1161 additional individuals from 11 Asian populations were investigated. Group H alleles of iSTR and c.1217A allele of a SNP in exon 11 of the CFI gene were associated with each other and were almost entirely confined to East Asian populations. Han Chinese in Changsha, southern China, showed the highest frequency for East Asian-specific group H alleles (0.201) among 15 populations. Group H alleles were observed to decrease gradually from south to north in 11 East Asian populations. This expansion of group H alleles provides evidence that southern China and Southeast Asia are a hotspot of Asian diversity and a genetic reservoir of Asians after they entered East Asia. The expected heterozygosity values of iSTR ranged from 0.927 in Thais to 0.874 in Oroqens, higher than those of an STR in the fibrinogen alpha chain (FGA) gene on chromosome 4q. Thus, iSTR is a useful marker for anthropological and forensic genetics. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Lepez, Trees; Vandewoestyne, Mado; Van Hoofstat, David; Deforce, Dieter
2014-11-01
The success rate of STR profiling of hairs found at a crime scene is quite low and negative results of hair analysis are frequently reported. To increase the success rate of DNA analysis of hairs in forensics, nuclei in hair roots can be counted after staining the hair root with DAPI. Two staining methods were tested: a longer method with two 1h incubations in respectively a DAPI- and a wash-solution, and a fast, direct staining of the hair root on microscope slides. The two staining methods were not significantly different. The results of the STR analysis for both procedures showed that 20 nuclei are necessary to obtain at least partial STR profiles. When more than 50 nuclei were counted, full STR profiles were always obtained. In 96% of the cases where no nuclei were detected, no STR profile could be obtained. However, 4% of the DAPI-negative hair roots resulted in at least partial STR profiles. Therefore, each forensic case has to be evaluated separately in function of the importance of the evidential value of the found hair. The fast staining method was applied in 36 forensic cases on 279 hairs in total. A fast screening method using DAPI can be used to increase the success rate of hair analysis in forensics. Copyright © 2014 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.
Hongdan, Wang; Bing, Kang; Ning, Su; Miao, He; Bo, Zhang; Yuxin, Guo; Bofeng, Zhu; Shixiu, Liao; Zhaoshu, Zeng
2017-01-01
At present, the Han nationality is China's main ethnic group and also the most populous nation in the world. This is a great resource to study microsatellite mutations and for the study of ethnogeny. The aim of this study is to investigate the genetic polymorphisms and mutations of 22 autosomal STR loci in 2475 individuals from Henan province, China. DNA is amplified and genotyped using PowerPlex™24 system. The gene frequencies, forensic parameters, and the mutation rate of the 22 STR loci are analyzed. A total of 295 alleles are observed in this Henan Han population, and the allelic frequencies ranged from 0.0003 to 0.5036. In order to investigate the genetic relationships between the Henan Han and the other 14 different populations, our present data were compared with previously published data for the same 15 STR loci. The results indicated that the Henan Han had closer genetic relationships the groups including Minnan Han, Maonan, Yi and Guangdong Han groups while the South morocco population, the Moroccan population, the Malay group, and the Uigur stand away from Henan Han. Except of D2S441, D13S317, PentaE, D2S1338, D5S818, TPOX and D19S433, the mutation events are found in the other 15 STR loci. A total of 40 mutation events are observed in the 15 STR loci. The mutation rates are ranged from 0 to 4.85 × 10 -3 . In this study, 39 mutations are single-step mutations, and only one at FGA comprised two steps. STR mutation is commonly existed in paternity testing, while there are no STR mutation studies of the 22 STR loci in the Henan Han population. It is of great importance in forensic individual discrimination and paternal testing.
Evaluation of reliability on STR typing at leukemic patients used for forensic purposes.
Filoglu, G; Bulbul, O; Rayimoglu, G; Yediay, F E; Zorlu, T; Ongoren, S; Altuncul, H
2014-06-01
Over the past decades, main advances in the field of molecular biology, coupled with benefits in genomic technologies, have led to detailed molecular investigations in the genetic diversity generated by researchers. Short tandem repeat (STR) loci are polymorphic loci found throughout all eukaryotic genome. DNA profiling identification, parental testing and kinship analysis by analysis of STR loci have been widely used in forensic sciences since 1993. Malignant tissues may sometimes be the source of biological material for forensic analysis, including identification of individuals or paternity testing. There are a number of studies on microsatellite instability in different types of tumors by comparing the STR profiles of malignant and healthy tissues on the same individuals. Defects in DNA repair pathways (non-repair or mis-repair) and metabolism lead to an accumulation of microsatellite alterations in genomic DNA of various cancer types that result genomic instabilities on forensic analyses. Common forms of genomic instability are loss of heterozygosity (LOH) and microsatellite instability (MSI). In this study, the applicability of autosomal STR markers, which are routinely used in forensic analysis, were investigated in order to detect genotypes in blood samples collected from leukemic patients to estimate the reliability of the results when malignant tissues are used as a source of forensic individual identification. Specimens were collected from 90 acute and 10 chronic leukemia volunteers with oral swabs as well as their paired peripheral blood samples from the Oncology Centre of the Department of Hematology at Istanbul University, during the years 2010-2011. Specimens were tested and compared with 16 somatic STR loci (CSFIPO, THO1, TPOX, vWA, D2S1338, D3S1358, D5S818, D7S820, D8S1179, D13S317, D16S539, D18S51, D19S433, D21S11 and FGA) widely used in forensic identification and kinship. Only two STR instabilities were encountered among 100 specimens. An MSI in
The STR/ort mouse model of spontaneous osteoarthritis - an update.
Staines, K A; Poulet, B; Wentworth, D N; Pitsillides, A A
2017-06-01
Osteoarthritis is a degenerative joint disease and a world-wide healthcare burden. Characterized by cartilage degradation, subchondral bone thickening and osteophyte formation, osteoarthritis inflicts much pain and suffering, for which there are currently no disease-modifying treatments available. Mouse models of osteoarthritis are proving critical in advancing our understanding of the underpinning molecular mechanisms. The STR/ort mouse is a well-recognized model which develops a natural form of osteoarthritis very similar to the human disease. In this Review we discuss the use of the STR/ort mouse in understanding this multifactorial disease with an emphasis on recent advances in its genetics and its bone, endochondral and immune phenotypes. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.
Prion gene haplotypes of U.S. cattle
Clawson, Michael L; Heaton, Michael P; Keele, John W; Smith, Timothy PL; Harhay, Gregory P; Laegreid, William W
2006-01-01
Background Bovine spongiform encephalopathy (BSE) is a fatal neurological disorder characterized by abnormal deposits of a protease-resistant isoform of the prion protein. Characterizing linkage disequilibrium (LD) and haplotype networks within the bovine prion gene (PRNP) is important for 1) testing rare or common PRNP variation for an association with BSE and 2) interpreting any association of PRNP alleles with BSE susceptibility. The objective of this study was to identify polymorphisms and haplotypes within PRNP from the promoter region through the 3'UTR in a diverse sample of U.S. cattle genomes. Results A 25.2-kb genomic region containing PRNP was sequenced from 192 diverse U.S. beef and dairy cattle. Sequence analyses identified 388 total polymorphisms, of which 287 have not previously been reported. The polymorphism alleles define PRNP by regions of high and low LD. High LD is present between alleles in the promoter region through exon 2 (6.7 kb). PRNP alleles within the majority of intron 2, the entire coding sequence and the untranslated region of exon 3 are in low LD (18.0 kb). Two haplotype networks, one representing the region of high LD and the other the region of low LD yielded nineteen different combinations that represent haplotypes spanning PRNP. The haplotype combinations are tagged by 19 polymorphisms (htSNPS) which characterize variation within and across PRNP. Conclusion The number of polymorphisms in the prion gene region of U.S. cattle is nearly four times greater than previously described. These polymorphisms define PRNP haplotypes that may influence BSE susceptibility in cattle. PMID:17092337
Development of a rapid 21-plex autosomal STR typing system for forensic applications.
Yang, Meng; Yin, Caiyong; Lv, Yuexin; Yang, Yaran; Chen, Jing; Yu, Zailiang; Liu, Xu; Xu, Meibo; Chen, Feng; Wu, Huijuan; Yan, Jiangwei
2016-10-01
DNA-STR genotyping technology has been widely used in forensic investigations. Even with such success, there is a great need to reduce the analysis time. In this study, we established a new rapid 21-plex STR typing system, including 13 CODIS loci, Penta D, Penta E, D12S391, D2S1338, D6S1043, D19S433, D2S441 and Amelogenin loci. This system could shorten the amplification time to a minimum of 90 min and does not require DNA extraction from the samples. Validation of the typing system complied with the Scientific Working Group on DNA Analysis Methods (SWGDAM) and the Chinese National Standard (GA/T815-2009) guidelines. The results demonstrated that this 21-plex STR typing system was a valuable tool for rapid criminal investigation. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Genetic diversity and paternal origin of domestic donkeys.
Han, H; Chen, N; Jordana, J; Li, C; Sun, T; Xia, X; Zhao, X; Ji, C; Shen, S; Yu, J; Ainhoa, F; Chen, H; Lei, C; Dang, R
2017-12-01
Numerous studies have been conducted to investigate genetic diversity, origins and domestication of donkey using autosomal microsatellites and the mitochondrial genome, whereas the male-specific region of the Y chromosome of modern donkeys is largely uncharacterized. In the current study, 14 published equine Y chromosome-specific microsatellites (Y-STR) were investigated in 395 male donkey samples from China, Egypt, Spain and Peru using fluorescent labeled microsatellite markers. The results showed that seven Y-STRs-EcaYP9, EcaYM2, EcaYE2, EcaYE3, EcaYNO1, EcaYNO2 and EcaYNO4-were male specific and polymorphic, showing two to eight alleles in the donkeys studied. A total of 21 haplotypes corresponding to three haplogroups were identified, indicating three independent patrilines in domestic donkey. These markers are useful for the study the Y-chromosome diversity and population genetics of donkeys in Africa, Europe, South America and China. © 2017 Stichting International Foundation for Animal Genetics.
Marín, J C; Romero, K; Rivera, R; Johnson, W E; González, B A
2017-10-01
Investigations of genetic diversity and domestication in South American camelids (SAC) have relied on autosomal microsatellite and maternally-inherited mitochondrial data. We present the first integrated analysis of domestic and wild SAC combining male and female sex-specific markers (male specific Y-chromosome and female-specific mtDNA sequence variation) to assess: (i) hypotheses about the origin of domestic camelids, (ii) directionality of introgression among domestic and/or wild taxa as evidence of hybridization and (iii) currently recognized subspecies patterns. Three male-specific Y-chromosome markers and control region sequences of mitochondrial DNA are studied here. Although no sequence variation was found in SRY and ZFY, there were seven variable sites in DBY generating five haplotypes on the Y-chromosome. The haplotype network showed clear separation between haplogroups of guanaco-llama and vicuña-alpaca, indicating two genetically distinct patrilineages with near absence of shared haplotypes between guanacos and vicuñas. Although we document some examples of directional hybridization, the patterns strongly support the hypothesis that llama (Lama glama) is derived from guanaco (Lama guanicoe) and the alpaca (Vicugna pacos) from vicuña (Vicugna vicugna). Within male guanacos we identified a haplogroup formed by three haplotypes with different geographical distributions, the northernmost of which (Peru and northern Chile) was also observed in llamas, supporting the commonly held hypothesis that llamas were domesticated from the northernmost populations of guanacos (L. g. cacilensis). Southern guanacos shared the other two haplotypes. A second haplogroup, consisting of two haplotypes, was mostly present in vicuñas and alpacas. However, Y-chromosome variation did not distinguish the two subspecies of vicuñas. © 2017 Stichting International Foundation for Animal Genetics.
Alpha-globin gene haplotypes in South American Indians.
Zago, M A; Melo Santos, E J; Clegg, J B; Guerreiro, J F; Martinson, J J; Norwich, J; Figueiredo, M S
1995-08-01
The haplotypes of the alpha-globin gene cluster were determined for 99 Indians from the Brazilian Amazon region who belong to 5 tribes: Wayampí, Wayana-Apalaí, Kayapó, Arára, and Yanomámi. Three predominant haplotypes were identified: Ia (present in 38.9% of chromosomes), IIIa (25.8%), and IIe (22.1%). The only alpha-globin gene rearrangement detected was alpha alpha alpha 3.7 I gene triplication associated with haplotype IIIa, found in high frequencies (5.6% and 10.6%) in two tribes and absent in the others. alpha-Globin gene deletions that cause alpha-thalassemia were not seen, supporting the argument that malaria was absent in these populations until recently. The heterogeneous distribution of alpha-globin gene haplotypes and rearrangements among the different tribes differs markedly from the homogeneous distribution of beta-globin gene cluster haplotypes and reflects the action of various genetic mechanisms (genetic drift, founder effect, consanguinity) on small isolated population groups with a complicated history of divergence-fusion events. The alpha-globin gene haplotype distribution has some similarities to distributions observed in Southeast Asian and Pacific Island populations, indicating that these populations have considerable genetic affinities. However, the absence of several features of the alpha-globin gene cluster that are consistently present among the Pacific Islanders suggests that the similarity of haplotypes between Brazilian Indians and people from Polynesia, Micronesia, and Melanesia is more likely to result of ancient common ancestry rather than the consequence of recent direct genetic contribution through immigration.
Gao, Su-Qing; Cheng, Xi; Li, Qian; Li, Yu-Zhu; Deng, Zhi-Hui
2009-06-01
This study was aimed to discover the novel HLA recombination haplotypes and investigate the distribution of haplotypes in Chinese Han population. Based on the HLA-A, B, DRB1 typing results of 179 family members, 791 haplotypes were assigned by the mode of inheritance. The results showed that a total of 4 novel recombinant haplotypes in HLA-DRB1 locus region were observed in 4 families, which ratio of paternal to maternal chromosomes was 3:1. The recombination ratio between HLA-DRB1 and HLA-A or B loci was 0.92% (4/433). There were a total of 362 kinds of HLA-A, -B, -DRB1 haplotypes to be confirmed in Chinese Han partial population. A33-B58-DR17, A2-B46-DR9, A30-B13-DR7, A11-B13-DR15, A11-B75-DR12 and A2-B46-DR14 were the most common haplotypes that was consistent with the distribution of HLA alleles in unrelated donors. There were A1-B63-DR12, A29-B46-DR15, A1-B61-DR10, A34-B35-DR9, A29-B54-DR4, A23-B13-DR16 and A34-B62-DR15 haplotypes and so on, which were rare haplotypes not yet reported in Chinese. It is concluded that the HLA-A-B-DRB1 haplotypes would be confirmed by analysis of their family pedigree. The results obtained in this study are basic data for study of Chinese anthropology, organ transplantation and disease correlation analysis.
Zhang, Ting; Hu, Siyu; Li, Guoli; Li, Hui; Liu, Xiaoli; Niu, Jianjun; Wang, Feng; Wen, Huixin; Xu, Ye; Li, Qingge
2015-03-01
Rapid and comprehensive detection of drug-resistance is essential for the control of tuberculosis, which has facilitated the development of molecular assays for the detection of drug-resistant mutations in Mycobacterium tuberculosis. We hereby assessed the analytical and clinical performance of an assay for streptomycin-resistant mutations. MeltPro TB/STR is a closed-tube, dual-color, melting curve analysis-based, real-time PCR test designed to detect 15 streptomycin-resistant mutations in rpsL 43, rpsL 88, rrs 513, rrs 514, rrs 517, and rrs 905-908 of M. tuberculosis. Analytical studies showed that the accuracy was 100%, the limit of detection was 50-500 bacilli per reaction, the reproducibility in the form of Tm variation was within 1.0 °C, and we could detect 20% STR resistance in mixed bacterial samples. The cross-platform study demonstrated that the assay could be performed on six models of real-time PCR instruments. A multicenter clinical study was conducted using 1056 clinical isolates, which were collected from three geographically different healthcare units, including 709 STR-susceptible and 347 STR-resistant isolates characterized on Löwenstein-Jensen solid medium by traditional drug susceptibility testing. The results showed that the clinical sensitivity and specificity of the MeltPro TB/STR was 88.8% and 95.8%, respectively. Sequencing analysis confirmed the accuracy of the mutation types. Among all the 8 mutation types detected, rpsL K43R (AAG → AGG), rpsL K88R (AAG → AGG) and rrs 514 A → C accounted for more than 90%. We concluded that MeltPro TB/STR represents a rapid and reliable assay for the detection of STR resistance in clinical isolates. Copyright © 2014. Published by Elsevier Ltd.
NASA Astrophysics Data System (ADS)
Nieva, M. F.; Pintado, O. I.; Adelman, S.; Rayle, K. E.; Sanders, S. E., Jr.
Las temperaturas efectivas (Teff) y gravedades superficiales (log g) de un grupo de estrellas de tipo B y A de Secuencia Principal se determinaron en varias etapas. En una primera aproximación se usaron los índices fotométricos de Strömgren para realizar el cálculo con el programa de Napiwotski et al.(1993). Luego se hizo un ajuste comparando datos espectrofotométricos con flujos obtenidos con el modelo ATLAS9 en la región visible. Y a continuación se hizo un mejor ajuste comparando los perfiles de la línea Hγ con espectros sintéticos calculados con SYNTHE. Además, se analizó el efecto de usar el modelo de Canuto y Mazzitelli (1991), donde se considera The Mixing Length Theory, en modelos de atmósferas de estrellas.
Haplotype Reconstruction in Large Pedigrees with Many Untyped Individuals
NASA Astrophysics Data System (ADS)
Li, Xin; Li, Jing
Haplotypes, as they specify the linkage patterns between dispersed genetic variations, provide important information for understanding the genetics of human traits. However haplotypes are not directly available from current genotyping platforms, and hence there are extensive investigations of computational methods to recover such information. Two major computational challenges arising in current family-based disease studies are large family sizes and many ungenotyped family members. Traditional haplotyping methods can neither handle large families nor families with missing members. In this paper, we propose a method which addresses these issues by integrating multiple novel techniques. The method consists of three major components: pairwise identical-bydescent (IBD) inference, global IBD reconstruction and haplotype restoring. By reconstructing the global IBD of a family from pairwise IBD and then restoring the haplotypes based on the inferred IBD, this method can scale to large pedigrees, and more importantly it can handle families with missing members. Compared with existing methods, this method demonstrates much higher power to recover haplotype information, especially in families with many untyped individuals.
In Vivo Characterization of Human APOA5 Haplotypes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ahituv, Nadav; Akiyama, Jennifer; Chapman-Helleboid, Audrey
2006-10-01
Increased plasma triglycerides concentrations are an independent risk factor for cardiovascular disease. Numerous studies support a reproducible genetic association between two minor haplotypes in the human apolipoprotein A5 gene (APOA5) and increased plasma triglyceride concentrations. We thus sought to investigate the effect of these minor haplotypes (APOA5*2 and APOA5*3) on ApoAV plasma levels through the precise insertion of single-copy intact APOA5 haplotypes at a targeted location in the mouse genome. While we found no difference in the amount of human plasma ApoAV in mice containing the common APOA5*1 and minor APOA5*2 haplotype, the introduction of the single APOA5*3 defining allelemore » (19W) resulted in 3-fold lower ApoAV plasma levels consistent with existing genetic association studies. These results indicate that S19W polymorphism is likely to be functional and explain the strong association of this variant with plasma triglycerides supporting the value of sensitive in vivo assays to define the functional nature of human haplotypes.« less
A parsimonious tree-grow method for haplotype inference.
Li, Zhenping; Zhou, Wenfeng; Zhang, Xiang-Sun; Chen, Luonan
2005-09-01
Haplotype information has become increasingly important in analyzing fine-scale molecular genetics data, such as disease genes mapping and drug design. Parsimony haplotyping is one of haplotyping problems belonging to NP-hard class. In this paper, we aim to develop a novel algorithm for the haplotype inference problem with the parsimony criterion, based on a parsimonious tree-grow method (PTG). PTG is a heuristic algorithm that can find the minimum number of distinct haplotypes based on the criterion of keeping all genotypes resolved during tree-grow process. In addition, a block-partitioning method is also proposed to improve the computational efficiency. We show that the proposed approach is not only effective with a high accuracy, but also very efficient with the computational complexity in the order of O(m2n) time for n single nucleotide polymorphism sites in m individual genotypes. The software is available upon request from the authors, or from http://zhangroup.aporc.org/bioinfo/ptg/ chen@elec.osaka-sandai.ac.jp Supporting materials is available from http://zhangroup.aporc.org/bioinfo/ptg/bti572supplementary.pdf
Tack, Lois C; Thomas, Michelle; Reich, Karl
2007-03-01
Forensic labs globally face the same problem-a growing need to process a greater number and wider variety of samples for DNA analysis. The same forensic lab can be tasked all at once with processing mixed casework samples from crime scenes, convicted offender samples for database entry, and tissue from tsunami victims for identification. Besides flexibility in the robotic system chosen for forensic automation, there is a need, for each sample type, to develop new methodology that is not only faster but also more reliable than past procedures. FTA is a chemical treatment of paper, unique to Whatman Bioscience, and is used for the stabilization and storage of biological samples. Here, the authors describe optimization of the Whatman FTA Purification Kit protocol for use with the AmpFlSTR Identifiler PCR Amplification Kit.
Detecting disease-predisposing variants: the haplotype method.
Valdes, A M; Thomson, G
1997-01-01
For many HLA-associated diseases, multiple alleles-- and, in some cases, multiple loci--have been suggested as the causative agents. The haplotype method for identifying disease-predisposing amino acids in a genetic region is a stratification analysis. We show that, for each haplotype combination containing all the amino acid sites involved in the disease process, the relative frequencies of amino acid variants at sites not involved in disease but in linkage disequilibrium with the disease-predisposing sites are expected to be the same in patients and controls. The haplotype method is robust to mode of inheritance and penetrance of the disease and can be used to determine unequivocally whether all amino acid sites involved in the disease have not been identified. Using a resampling technique, we developed a statistical test that takes account of the nonindependence of the sites sampled. Further, when multiple sites in the genetic region are involved in disease, the test statistic gives a closer fit to the null expectation when some--compared with none--of the true predisposing factors are included in the haplotype analysis. Although the haplotype method cannot distinguish between very highly correlated sites in one population, ethnic comparisons may help identify the true predisposing factors. The haplotype method was applied to insulin-dependent diabetes mellitus (IDDM) HLA class II DQA1-DQB1 data from Caucasian, African, and Japanese populations. Our results indicate that the combination DQA1#52 (Arg predisposing) DQB1#57 (Asp protective), which has been proposed as an important IDDM agent, does not include all the predisposing elements. With rheumatoid arthritis HLA class II DRB1 data, the results were consistent with the shared-epitope hypothesis. PMID:9042931
Li, Zhenghui; Zhang, Jian; Zhang, Hantao; Lin, Ziqing; Ye, Jian
2018-05-01
Short tandem repeats (STRs) play a vitally important role in forensics. Population data is needed to improve the field. There is currently no large population data-based data set in Chamdo Tibetan. In our study, the allele frequencies and forensic statistical parameters of 18 autosomal STR loci (D5S818, D21S11, D7S820, CSF1PO, D2S1338, D3S1358, VWA, D8S1179, D16S539, PentaE, TPOX, TH01, D19S433, D18S51, FGA, D6S1043, D13S317, and D12S391) included in the DNATyper™19 kit were investigated in 2249 healthy, unrelated Tibetan subjects living in Tibet Chamdo, Southwest China. The combined power of discrimination and the combined probability of exclusion of all 18 loci were 0.9999999999999999999998174 and 0.99999994704, respectively. Furthermore, the genetic relationship between our Tibetan group and 33 previously published populations was also investigated. Phylogenetic analyses revealed that the Chamdo Tibetan population is more closely related genetically with the Lhasa Tibetan group. Our results suggest that these autosomal STR loci are highly polymorphic in the Tibetan population living in Tibet Chamdo and can be used as a powerful tool in forensics, linguistics, and population genetic analyses.
Wilson, Paul J; Rutledge, Linda Y; Wheeldon, Tyler J; Patterson, Brent R; White, Bradley N
2012-09-01
There has been considerable discussion on the origin of the red wolf and eastern wolf and their evolution independent of the gray wolf. We analyzed mitochondrial DNA (mtDNA) and a Y-chromosome intron sequence in combination with Y-chromosome microsatellites from wolves and coyotes within the range of extensive wolf-coyote hybridization, that is, eastern North America. The detection of divergent Y-chromosome haplotypes in the historic range of the eastern wolf is concordant with earlier mtDNA findings, and the absence of these haplotypes in western coyotes supports the existence of the North American evolved eastern wolf (Canis lycaon). Having haplotypes observed exclusively in eastern North America as a result of insufficient sampling in the historic range of the coyote or that these lineages subsequently went extinct in western geographies is unlikely given that eastern-specific mtDNA and Y-chromosome haplotypes represent lineages divergent from those observed in extant western coyotes. By combining Y-chromosome and mtDNA distributional patterns, we identified hybrid genomes of eastern wolf, coyote, gray wolf, and potentially dog origin in Canis populations of central and eastern North America. The natural contemporary eastern Canis populations represent an important example of widespread introgression resulting in hybrid genomes across the original C. lycaon range that appears to be facilitated by the eastern wolf acting as a conduit for hybridization. Applying conventional taxonomic nomenclature and species-based conservation initiatives, particularly in human-modified landscapes, may be counterproductive to the effective management of these hybrids and fails to consider their evolutionary potential.
Antiquity and diversity of aboriginal Australian Y-chromosomes.
Nagle, Nano; Ballantyne, Kaye N; van Oven, Mannis; Tyler-Smith, Chris; Xue, Yali; Taylor, Duncan; Wilcox, Stephen; Wilcox, Leah; Turkalov, Rust; van Oorschot, Roland A H; McAllister, Peter; Williams, Lesley; Kayser, Manfred; Mitchell, Robert J
2016-03-01
Understanding the origins of Aboriginal Australians is crucial in reconstructing the evolution and spread of Homo sapiens as evidence suggests they represent the descendants of the earliest group to leave Africa. This study analyzed a large sample of Y-chromosomes to answer questions relating to the migration routes of their ancestors, the age of Y-haplogroups, date of colonization, as well as the extent of male-specific variation. Knowledge of Y-chromosome variation among Aboriginal Australians is extremely limited. This study examined Y-SNP and Y-STR variation among 657 self-declared Aboriginal males from locations across the continent. 17 Y-STR loci and 47 Y-SNPs spanning the Y-chromosome phylogeny were typed in total. The proportion of non-indigenous Y-chromosomes of assumed Eurasian origin was high, at 56%. Y lineages of indigenous Sahul origin belonged to haplogroups C-M130*(xM8,M38,M217,M347) (1%), C-M347 (19%), K-M526*(xM147,P308,P79,P261,P256,M231,M175,M45,P202) (12%), S-P308 (12%), and M-M186 (0.9%). Haplogroups C-M347, K-M526*, and S-P308 are Aboriginal Australian-specific. Dating of C-M347, K-M526*, and S-P308 indicates that all are at least 40,000 years old, confirming their long-term presence in Australia. Haplogroup C-M347 comprised at least three sub-haplogroups: C-DYS390.1del, C-M210, and the unresolved paragroup C-M347*(xDYS390.1del,M210). There was some geographic structure to the Y-haplogroup variation, but most haplogroups were present throughout Australia. The age of the Australian-specific Y-haplogroups suggests New Guineans and Aboriginal Australians have been isolated for over 30,000 years, supporting findings based on mitochondrial DNA data. Our data support the hypothesis of more than one route (via New Guinea) for males entering Sahul some 50,000 years ago and give no support for colonization events during the Holocene, from either India or elsewhere. © 2015 Wiley Periodicals, Inc.
Kullback-Leibler divergence for detection of rare haplotype common disease association.
Lin, Shili
2015-11-01
Rare haplotypes may tag rare causal variants of common diseases; hence, detection of such rare haplotypes may also contribute to our understanding of complex disease etiology. Because rare haplotypes frequently result from common single-nucleotide polymorphisms (SNPs), focusing on rare haplotypes is much more economical compared with using rare single-nucleotide variants (SNVs) from sequencing, as SNPs are available and 'free' from already amassed genome-wide studies. Further, associated haplotypes may shed light on the underlying disease causal mechanism, a feat unmatched by SNV-based collapsing methods. In recent years, data mining approaches have been adapted to detect rare haplotype association. However, as they rely on an assumed underlying disease model and require the specification of a null haplotype, results can be erroneous if such assumptions are violated. In this paper, we present a haplotype association method based on Kullback-Leibler divergence (hapKL) for case-control samples. The idea is to compare haplotype frequencies for the cases versus the controls by computing symmetrical divergence measures. An important property of such measures is that both the frequencies and logarithms of the frequencies contribute in parallel, thus balancing the contributions from rare and common, and accommodating both deleterious and protective, haplotypes. A simulation study under various scenarios shows that hapKL has well-controlled type I error rates and good power compared with existing data mining methods. Application of hapKL to age-related macular degeneration (AMD) shows a strong association of the complement factor H (CFH) gene with AMD, identifying several individual rare haplotypes with strong signals.
Deep divergence and apparent sex-biased dispersal revealed by a Y-linked marker in rainbow trout
Brunelli, Joseph P.; Steele, Craig A.; Thorgaard, Gary H.
2010-01-01
Y-chromosome and mitochondrial DNA markers can reveal phylogenetic patterns by allowing tracking of male and female lineages, respectively. We used sequence data from a recently discovered Y-linked marker and a mitochondrial marker to examine phylogeographic structure in the widespread and economically important rainbow trout (Oncorhynchus mykiss). Two distinct geographic groupings that generally correspond to coastal and inland subspecies were evident within the Y marker network while the mtDNA haplotype network showed little geographic structure. Our results suggest that male-specific behavior has prevented widespread admixture of Y haplotypes and that gene flow between the coastal and inland subspecies has largely occurred through females. This new Y marker may also aid conservation efforts by genetically identifying inland populations that have not hybridized with widely stocked coastal-derived hatchery fish. PMID:20546904
Prediction of autosomal STR typing success in ancient and Second World War bone samples.
Zupanič Pajnič, Irena; Zupanc, Tomaž; Balažic, Jože; Geršak, Živa Miriam; Stojković, Oliver; Skadrić, Ivan; Črešnar, Matija
2017-03-01
Human-specific quantitative PCR (qPCR) has been developed for forensic use in the last 10 years and is the preferred DNA quantification technique since it is very accurate, sensitive, objective, time-effective and automatable. The amount of information that can be gleaned from a single quantification reaction using commercially available quantification kits has increased from the quantity of nuclear DNA to the amount of male DNA, presence of inhibitors and, most recently, to the degree of DNA degradation. In skeletal remains samples from disaster victims, missing persons and war conflict victims, the DNA is usually degraded. Therefore the new commercial qPCR kits able to assess the degree of degradation are potentially able to predict the success of downstream short tandem repeat (STR) typing. The goal of this study was to verify the quantification step using the PowerQuant kit with regard to its suitability as a screening method for autosomal STR typing success on ancient and Second World War (WWII) skeletal remains. We analysed 60 skeletons excavated from five archaeological sites and four WWII mass graves from Slovenia. The bones were cleaned, surface contamination was removed and the bones ground to a powder. Genomic DNA was obtained from 0.5g of bone powder after total demineralization. The DNA was purified using a Biorobot EZ1 device. Following PowerQuant quantification, DNA samples were subjected to autosomal STR amplification using the NGM kit. Up to 2.51ng DNA/g of powder were extracted. No inhibition was detected in any of bones analysed. 82% of the WWII bones gave full profiles while 73% of the ancient bones gave profiles not suitable for interpretation. Four bone extracts yielded no detectable amplification or zero quantification results and no profiles were obtained from any of them. Full or useful partial profiles were produced only from bone extracts where short autosomal (Auto) and long degradation (Deg) PowerQuant targets were detected. It is
Listman, Jennifer B; Hasin, Deborah; Kranzler, Henry R; Malison, Robert T; Mutirangura, Apiwat; Sughondhabirom, Atapol; Aharonovich, Efrat; Spivak, Baruch; Gelernter, Joel
2010-06-14
Detecting population substructure is a critical issue for association studies of health behaviors and other traits. Whether inherent in the population or an artifact of marker choice, determining aspects of a population's genetic history as potential sources of substructure can aid in design of future genetic studies. Jewish populations, among which association studies are often conducted, have a known history of migrations. As a necessary step in understanding population structure to conduct valid association studies of health behaviors among Israeli Jews, we investigated genetic signatures of this history and quantified substructure to facilitate future investigations of these phenotypes in this population. Using 32 autosomal STR markers and the program STRUCTURE, we differentiated between Ashkenazi (AJ, N = 135) and non-Ashkenazi (NAJ, N = 226) Jewish populations in the form of Northern and Southern geographic genetic components (AJ north 73%, south 23%, NAJ north 33%, south 60%). The ability to detect substructure within these closely related populations using a small STR panel was contingent on including additional samples representing major continental populations in the analyses. Although clustering programs such as STRUCTURE are designed to assign proportions of ancestry to individuals without reference population information, when Jewish samples were analyzed in the absence of proxy parental populations, substructure within Jews was not detected. Generally, for samples with a given grandparental country of birth, STRUCTURE assignment values to Northern, Southern, African and Asian clusters agreed with mitochondrial DNA and Y-chromosomal data from previous studies as well as historical records of migration and intermarriage.
2010-01-01
Background Detecting population substructure is a critical issue for association studies of health behaviors and other traits. Whether inherent in the population or an artifact of marker choice, determining aspects of a population's genetic history as potential sources of substructure can aid in design of future genetic studies. Jewish populations, among which association studies are often conducted, have a known history of migrations. As a necessary step in understanding population structure to conduct valid association studies of health behaviors among Israeli Jews, we investigated genetic signatures of this history and quantified substructure to facilitate future investigations of these phenotypes in this population. Results Using 32 autosomal STR markers and the program STRUCTURE, we differentiated between Ashkenazi (AJ, N = 135) and non-Ashkenazi (NAJ, N = 226) Jewish populations in the form of Northern and Southern geographic genetic components (AJ north 73%, south 23%, NAJ north 33%, south 60%). The ability to detect substructure within these closely related populations using a small STR panel was contingent on including additional samples representing major continental populations in the analyses. Conclusions Although clustering programs such as STRUCTURE are designed to assign proportions of ancestry to individuals without reference population information, when Jewish samples were analyzed in the absence of proxy parental populations, substructure within Jews was not detected. Generally, for samples with a given grandparental country of birth, STRUCTURE assignment values to Northern, Southern, African and Asian clusters agreed with mitochondrial DNA and Y-chromosomal data from previous studies as well as historical records of migration and intermarriage. PMID:20546593
Wallner, Barbara; Vogl, Claus; Shukla, Priyank; Burgstaller, Joerg P.; Druml, Thomas; Brem, Gottfried
2013-01-01
The paternally inherited Y chromosome displays the population genetic history of males. While modern domestic horses (Equus caballus) exhibit abundant diversity within maternally inherited mitochondrial DNA, no significant Y-chromosomal sequence diversity has been detected. We used high throughput sequencing technology to identify the first polymorphic Y-chromosomal markers useful for tracing paternal lines. The nucleotide variability of the modern horse Y chromosome is extremely low, resulting in six haplotypes (HT), all clearly distinct from the Przewalski horse (E. przewalskii). The most widespread HT1 is ancestral and the other five haplotypes apparently arose on the background of HT1 by mutation or gene conversion after domestication. Two haplotypes (HT2 and HT3) are widely distributed at high frequencies among modern European horse breeds. Using pedigree information, we trace the distribution of Y-haplotype diversity to particular founders. The mutation leading to HT3 occurred in the germline of the famous English Thoroughbred stallion “Eclipse” or his son or grandson and its prevalence demonstrates the influence of this popular paternal line on modern sport horse breeds. The pervasive introgression of Thoroughbred stallions during the last 200 years to refine autochthonous breeds has strongly affected the distribution of Y-chromosomal variation in modern horse breeds and has led to the replacement of autochthonous Y chromosomes. Only a few northern European breeds bear unique variants at high frequencies or fixed within but not shared among breeds. Our Y-chromosomal data complement the well established mtDNA lineages and document the male side of the genetic history of modern horse breeds and breeding practices. PMID:23573227
Fang, Yating; Guo, Yuxin; Xie, Tong; Jin, Xiaoye; Lan, Qiong; Zhou, Yongsong; Zhu, Bofeng
2018-03-26
In present study, the genetic polymorphisms of 22 autosomal short tandem repeat (STR) loci were analyzed in 496 unrelated Chinese Xinjiang Hui individuals. These autosomal STR loci were multiplex amplified and genotyped based on a novel STR panel. There were 246 observed alleles with the allele frequencies ranging from 0.0010 to 0.3609. All polymorphic information content values were higher than 0.7. The combined power of discrimination and the combined probability of exclusion were 0.999999999999999999999999999426766 and 0.999999999860491, respectively. Based on analysis of molecular variance method, genetic differentiation analysis between the Xinjiang Hui and other reported groups were conducted at these 22 loci. The results indicated that there were no significant differences in statistics between Hui group and Northern Han group (including Han groups from Hebei, Henan, Shaanxi provinces), and significant deviations with Southern Han group (including those from Guangdong, Guangxi provinces) at 7 loci, and Uygur group at 10 loci. To sum up, these 22 autosomal STR loci were high genetic polymorphic in Xinjiang Hui group.
HaploForge: a comprehensive pedigree drawing and haplotype visualization web application.
Tekman, Mehmet; Medlar, Alan; Mozere, Monika; Kleta, Robert; Stanescu, Horia
2017-12-15
Haplotype reconstruction is an important tool for understanding the aetiology of human disease. Haplotyping infers the most likely phase of observed genotypes conditional on constraints imposed by the genotypes of other pedigree members. The results of haplotype reconstruction, when visualized appropriately, show which alleles are identical by descent despite the presence of untyped individuals. When used in concert with linkage analysis, haplotyping can help delineate a locus of interest and provide a succinct explanation for the transmission of the trait locus. Unfortunately, the design choices made by existing haplotype visualization programs do not scale to large numbers of markers. Indeed, following haplotypes from generation to generation requires excessive scrolling back and forth. In addition, the most widely used program for haplotype visualization produces inconsistent recombination artefacts for the X chromosome. To resolve these issues, we developed HaploForge, a novel web application for haplotype visualization and pedigree drawing. HaploForge takes advantage of HTML5 to be fast, portable and avoid the need for local installation. It can accurately visualize autosomal and X-linked haplotypes from both outbred and consanguineous pedigrees. Haplotypes are coloured based on identity by descent using a novel A* search algorithm and we provide a flexible viewing mode to aid visual inspection. HaploForge can currently process haplotype reconstruction output from Allegro, GeneHunter, Merlin and Simwalk. HaploForge is licensed under GPLv3 and is hosted and maintained via GitHub. https://github.com/mtekman/haploforge. r.kleta@ucl.ac.uk. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Genetic polymorphisms of 15 STR loci in two Tibetan populations from Tibet Changdu and Naqu, China.
Kang, LongLi; Yuan, Dongya; Yang, Fengying; Liu, Kai; Za, Xi
2007-07-04
The allelic distribution of 15 short tandem repeat (STR) loci included in the AmpFl STR Identifiler kit was examined in 100 Changdu Tibetan and 118 Naqu Tibetan unrelated individuals living in the Tibet Province, PR China. The distribution of these observed genotypes was not significantly different from the expected distribution according to Hardy-Weinberg equilibrium.
A new mathematical modeling for pure parsimony haplotyping problem.
Feizabadi, R; Bagherian, M; Vaziri, H R; Salahi, M
2016-11-01
Pure parsimony haplotyping (PPH) problem is important in bioinformatics because rational haplotyping inference plays important roles in analysis of genetic data, mapping complex genetic diseases such as Alzheimer's disease, heart disorders and etc. Haplotypes and genotypes are m-length sequences. Although several integer programing models have already been presented for PPH problem, its NP-hardness characteristic resulted in ineffectiveness of those models facing the real instances especially instances with many heterozygous sites. In this paper, we assign a corresponding number to each haplotype and genotype and based on those numbers, we set a mixed integer programing model. Using numbers, instead of sequences, would lead to less complexity of the new model in comparison with previous models in a way that there are neither constraints nor variables corresponding to heterozygous nucleotide sites in it. Experimental results approve the efficiency of the new model in producing better solution in comparison to two state-of-the art haplotyping approaches. Copyright © 2016 Elsevier Inc. All rights reserved.
Haplotype assembly in polyploid genomes and identical by descent shared tracts.
Aguiar, Derek; Istrail, Sorin
2013-07-01
Genome-wide haplotype reconstruction from sequence data, or haplotype assembly, is at the center of major challenges in molecular biology and life sciences. For complex eukaryotic organisms like humans, the genome is vast and the population samples are growing so rapidly that algorithms processing high-throughput sequencing data must scale favorably in terms of both accuracy and computational efficiency. Furthermore, current models and methodologies for haplotype assembly (i) do not consider individuals sharing haplotypes jointly, which reduces the size and accuracy of assembled haplotypes, and (ii) are unable to model genomes having more than two sets of homologous chromosomes (polyploidy). Polyploid organisms are increasingly becoming the target of many research groups interested in the genomics of disease, phylogenetics, botany and evolution but there is an absence of theory and methods for polyploid haplotype reconstruction. In this work, we present a number of results, extensions and generalizations of compass graphs and our HapCompass framework. We prove the theoretical complexity of two haplotype assembly optimizations, thereby motivating the use of heuristics. Furthermore, we present graph theory-based algorithms for the problem of haplotype assembly using our previously developed HapCompass framework for (i) novel implementations of haplotype assembly optimizations (minimum error correction), (ii) assembly of a pair of individuals sharing a haplotype tract identical by descent and (iii) assembly of polyploid genomes. We evaluate our methods on 1000 Genomes Project, Pacific Biosciences and simulated sequence data. HapCompass is available for download at http://www.brown.edu/Research/Istrail_Lab/. Supplementary data are available at Bioinformatics online.
BCL11A Enhancer Haplotypes and Fetal Hemoglobin in Sickle Cell Anemia
Sebastiani, P.; Farrell, J.J.; Alsultan, A.; Wang, S.; Edward, H. L.; Shappell, H.; Bae, H.; Milton, J. N.; Baldwin, C.T.; Al-Rubaish, A.M.; Naserullah, Z.; Al-Muhanna, F.; Alsuliman, A.; Patra, P. K.; Farrer, L.A.; Ngo, D.; Vathipadiekal, V.; Chui, D.H.K.; Al-Ali, A.K.; Steinberg, M.H.
2015-01-01
Background Fetal hemoglobin (HbF) levels in sickle cell anemia patients vary. We genotyped polymorphisms in the erythroid-specific enhancer of BCL11A to see if they might account for the very high HbF associated with the Arab-Indian (AI) haplotype and Benin haplotype of sickle cell anemia. Methods and Results Six BCL112A enhancer SNPs and their haplotypes were studied in Saudi Arabs from the Eastern Province and Indian patients with AI haplotype (HbF ~20%), African Americans (HbF ~7%), and Saudi Arabs from the Southwestern Province (HbF ~12%). Four SNPs (rs1427407, rs6706648, rs6738440, and rs7606173) and their haplotypes were consistently associated with HbF levels. The distributions of haplotypes differ in the 3 cohorts but not their genetic effects: the haplotype TCAG was associated with the lowest HbF level and the haplotype GTAC was associated with the highest HbF level and differences in HbF levels between carriers of these haplotypes in all cohorts was approximately 6%. Conclusions Common HbF BCL11A enhancer haplotypes in patients with African origin and AI sickle cell anemia have similar effects on HbF but they do not explain their differences in HbF. PMID:25703683
Conwell, J L; Creek, K L; Pozzi, A R; Whyte, H M
2001-02-01
The Industrial Hygiene and Safety Group at Los Alamos National Laboratory (LANL) developed a database application known as IH DataView, which manages industrial hygiene monitoring data. IH DataView replaces a LANL legacy system, IHSD, that restricted user access to a single point of data entry needed enhancements that support new operational requirements, and was not Year 2000 (Y2K) compliant. IH DataView features a comprehensive suite of data collection and tracking capabilities. Through the use of Oracle database management and application development tools, the system is Y2K compliant and Web enabled for easy deployment and user access via the Internet. System accessibility is particularly important because LANL operations are spread over 43 square miles, and industrial hygienists (IHs) located across the laboratory will use the system. IH DataView shows promise of being useful in the future because it eliminates these problems. It has a flexible architecture and sophisticated capability to collect, track, and analyze data in easy-to-use form.
Modeling haplotype block variation using Markov chains.
Greenspan, G; Geiger, D
2006-04-01
Models of background variation in genomic regions form the basis of linkage disequilibrium mapping methods. In this work we analyze a background model that groups SNPs into haplotype blocks and represents the dependencies between blocks by a Markov chain. We develop an error measure to compare the performance of this model against the common model that assumes that blocks are independent. By examining data from the International Haplotype Mapping project, we show how the Markov model over haplotype blocks is most accurate when representing blocks in strong linkage disequilibrium. This contrasts with the independent model, which is rendered less accurate by linkage disequilibrium. We provide a theoretical explanation for this surprising property of the Markov model and relate its behavior to allele diversity.
Modeling Haplotype Block Variation Using Markov Chains
Greenspan, G.; Geiger, D.
2006-01-01
Models of background variation in genomic regions form the basis of linkage disequilibrium mapping methods. In this work we analyze a background model that groups SNPs into haplotype blocks and represents the dependencies between blocks by a Markov chain. We develop an error measure to compare the performance of this model against the common model that assumes that blocks are independent. By examining data from the International Haplotype Mapping project, we show how the Markov model over haplotype blocks is most accurate when representing blocks in strong linkage disequilibrium. This contrasts with the independent model, which is rendered less accurate by linkage disequilibrium. We provide a theoretical explanation for this surprising property of the Markov model and relate its behavior to allele diversity. PMID:16361244
Ivanov, P L; Leonov, S N; Zemskova, E Iu; Kobylianskiĭ, A G; Dziubenko, E V
2013-01-01
This study was designed to estimate the effectiveness of special technical procedures for the enhancement of sensitivity of multiplex analysis of DNA, such as the use of low-plexity PCR systems and the whole genome preamplification technology, and the possibility of their application for the purpose of forensic medical genotyping of polymorphous STR-loci of chromosomal DNA in individual cells. The authors refused to use the imitation model (equivalent DNA dilutions) for the sake of obtaining the maximally informative data and chose to work with real preparations of solitary buccal epithelial cells isolated by the laser microdissection technique. It was shown that neither the use of the low-plexity multilocus PCR systems nor the whole genome pre-amplification technology makes possible reliable genotyping of STR-loci of chromosomal DNA in individual cells. The proposed techniques allow for DNA genotyping in preparations consisting of 10 diploid cells whereas the methods for reliable genotyping of STR-loci of chromosomal DNA in individual cells remains to be developed.
Mineralocorticoid receptor haplotype, oral contraceptives and emotional information processing.
Hamstra, D A; de Kloet, E R; van Hemert, A M; de Rijk, R H; Van der Does, A J W
2015-02-12
Oral contraceptives (OCs) affect mood in some women and may have more subtle effects on emotional information processing in many more users. Female carriers of mineralocorticoid receptor (MR) haplotype 2 have been shown to be more optimistic and less vulnerable to depression. To investigate the effects of oral contraceptives on emotional information processing and a possible moderating effect of MR haplotype. Cross-sectional study in 85 healthy premenopausal women of West-European descent. We found significant main effects of oral contraceptives on facial expression recognition, emotional memory and decision-making. Furthermore, carriers of MR haplotype 1 or 3 were sensitive to the impact of OCs on the recognition of sad and fearful faces and on emotional memory, whereas MR haplotype 2 carriers were not. Different compounds of OCs were included. No hormonal measures were taken. Most naturally cycling participants were assessed in the luteal phase of their menstrual cycle. Carriers of MR haplotype 2 may be less sensitive to depressogenic side-effects of OCs. Copyright © 2015 IBRO. Published by Elsevier Ltd. All rights reserved.
Y-chromosome diversity in Catalan surname samples: insights into surname origin and frequency
Solé-Morata, Neus; Bertranpetit, Jaume; Comas, David; Calafell, Francesc
2015-01-01
The biological behavior of the Y chromosome, which is paternally inherited, implies that males sharing the same surname may also share a similar Y chromosome. However, socio-cultural factors, such as polyphyletism, non-paternity, adoption, or matrilineal surname transmission, may prevent the joint transmission of the surname and the Y chromosome. By genotyping 17 Y-STRs and 68 SNPs in ~2500 male samples that each carried one of the 50 selected Catalan surnames, we could determine sets of descendants of a common ancestor, the population of origin of the common ancestor, and the date when such a common ancestor lived. Haplotype diversity was positively correlated with surname frequency, that is, rarer surnames showed the strongest signals of coancestry. Introgression rates of Y chromosomes into a surname by non-paternity, adoption, and transmission of the maternal surname were estimated at 1.5−2.6% per generation, with some local variation. Average ages for the founders of the surnames were estimated at ~500 years, suggesting a delay between the origin of surnames (twelfth and thirteenth centuries) and the systematization of their paternal transmission. We have found that, in general, a foreign etymology for a surname does not often result in a non-indigenous origin of surname founders; however, bearers of some surnames with an Arabic etymology show an excess of North African haplotypes. Finally, we estimate that surname prediction from a Y-chromosome haplotype, which may have interesting forensic applications, has a ~60% sensitivity but a 17% false discovery rate. PMID:25689924
Ultraaccurate genome sequencing and haplotyping of single human cells.
Chu, Wai Keung; Edge, Peter; Lee, Ho Suk; Bansal, Vikas; Bafna, Vineet; Huang, Xiaohua; Zhang, Kun
2017-11-21
Accurate detection of variants and long-range haplotypes in genomes of single human cells remains very challenging. Common approaches require extensive in vitro amplification of genomes of individual cells using DNA polymerases and high-throughput short-read DNA sequencing. These approaches have two notable drawbacks. First, polymerase replication errors could generate tens of thousands of false-positive calls per genome. Second, relatively short sequence reads contain little to no haplotype information. Here we report a method, which is dubbed SISSOR (single-stranded sequencing using microfluidic reactors), for accurate single-cell genome sequencing and haplotyping. A microfluidic processor is used to separate the Watson and Crick strands of the double-stranded chromosomal DNA in a single cell and to randomly partition megabase-size DNA strands into multiple nanoliter compartments for amplification and construction of barcoded libraries for sequencing. The separation and partitioning of large single-stranded DNA fragments of the homologous chromosome pairs allows for the independent sequencing of each of the complementary and homologous strands. This enables the assembly of long haplotypes and reduction of sequence errors by using the redundant sequence information and haplotype-based error removal. We demonstrated the ability to sequence single-cell genomes with error rates as low as 10 -8 and average 500-kb-long DNA fragments that can be assembled into haplotype contigs with N50 greater than 7 Mb. The performance could be further improved with more uniform amplification and more accurate sequence alignment. The ability to obtain accurate genome sequences and haplotype information from single cells will enable applications of genome sequencing for diverse clinical needs. Copyright © 2017 the Author(s). Published by PNAS.
RTEL1 tagging SNPs and haplotypes were associated with glioma development.
Li, Gang; Jin, Tianbo; Liang, Hongjuan; Zhang, Zhiguo; He, Shiming; Tu, Yanyang; Yang, Haixia; Geng, Tingting; Cui, Guangbin; Chen, Chao; Gao, Guodong
2013-05-17
As glioma ranks as the first most prevalent solid tumors in primary central nervous system, certain single-nucleotide polymorphisms (SNPs) may be related to increased glioma risk, and have implications in carcinogenesis. The present case-control study was carried out to elucidate how common variants contribute to glioma susceptibility. Ten candidate tagging SNPs (tSNPs) were selected from seven genes whose polymorphisms have been proven by classical literatures and reliable databases to be tended to relate with gliomas, and with the minor allele frequency (MAF)>5% in the HapMap Asian population. The selected tSNPs were genotyped in 629 glioma patients and 645 controls from a Han Chinese population using the multiplexed SNP MassEXTEND assay calibrated. Two significant tSNPs in RTEL1 gene were observed to be associated with glioma risk (rs6010620, P=0.0016, OR: 1.32, 95% CI: 1.11-1.56; rs2297440, P=0.001, OR: 1.33, 95% CI: 1.12-1.58) by χ2 test. It was identified the genotype "GG" of rs6010620 acted as the protective genotype for glioma (OR, 0.46; 95% CI, 0.31-0.7; P=0.0002), while the genotype "CC" of rs2297440 as the protective genotype in glioma (OR, 0.47; 95% CI, 0.31-0.71; P=0.0003). Furthermore, haplotype "GCT" in RTEL1 gene was found to be associated with risk of glioma (OR, 0.7; 95% CI, 0.57-0.86; Fisher's P=0.0005; Pearson's P=0.0005), and haplotype "ATT" was detected to be associated with risk of glioma (OR, 1.32; 95% CI, 1.12-1.57; Fisher's P=0.0013; Pearson's P=0.0013). Two single variants, the genotypes of "GG" of rs6010620 and "CC" of rs2297440 (rs6010620 and rs2297440) in the RTEL1 gene, together with two haplotypes of GCT and ATT, were identified to be associated with glioma development. And it might be used to evaluate the glioma development risks to screen the above RTEL1 tagging SNPs and haplotypes. The virtual slides for this article can be found here: http://www.diagnosticpathology.diagnomx.eu/vs/1993021136961998.
Haplotype Phasing and Inheritance of Copy Number Variants in Nuclear Families
Palta, Priit; Kaplinski, Lauris; Nagirnaja, Liina; Veidenberg, Andres; Möls, Märt; Nelis, Mari; Esko, Tõnu; Metspalu, Andres; Laan, Maris; Remm, Maido
2015-01-01
DNA copy number variants (CNVs) that alter the copy number of a particular DNA segment in the genome play an important role in human phenotypic variability and disease susceptibility. A number of CNVs overlapping with genes have been shown to confer risk to a variety of human diseases thus highlighting the relevance of addressing the variability of CNVs at a higher resolution. So far, it has not been possible to deterministically infer the allelic composition of different haplotypes present within the CNV regions. We have developed a novel computational method, called PiCNV, which enables to resolve the haplotype sequence composition within CNV regions in nuclear families based on SNP genotyping microarray data. The algorithm allows to i) phase normal and CNV-carrying haplotypes in the copy number variable regions, ii) resolve the allelic copies of rearranged DNA sequence within the haplotypes and iii) infer the heritability of identified haplotypes in trios or larger nuclear families. To our knowledge this is the first program available that can deterministically phase null, mono-, di-, tri- and tetraploid genotypes in CNV loci. We applied our method to study the composition and inheritance of haplotypes in CNV regions of 30 HapMap Yoruban trios and 34 Estonian families. For 93.6% of the CNV loci, PiCNV enabled to unambiguously phase normal and CNV-carrying haplotypes and follow their transmission in the corresponding families. Furthermore, allelic composition analysis identified the co-occurrence of alternative allelic copies within 66.7% of haplotypes carrying copy number gains. We also observed less frequent transmission of CNV-carrying haplotypes from parents to children compared to normal haplotypes and identified an emergence of several de novo deletions and duplications in the offspring. PMID:25853576
Haplotype phasing and inheritance of copy number variants in nuclear families.
Palta, Priit; Kaplinski, Lauris; Nagirnaja, Liina; Veidenberg, Andres; Möls, Märt; Nelis, Mari; Esko, Tõnu; Metspalu, Andres; Laan, Maris; Remm, Maido
2015-01-01
DNA copy number variants (CNVs) that alter the copy number of a particular DNA segment in the genome play an important role in human phenotypic variability and disease susceptibility. A number of CNVs overlapping with genes have been shown to confer risk to a variety of human diseases thus highlighting the relevance of addressing the variability of CNVs at a higher resolution. So far, it has not been possible to deterministically infer the allelic composition of different haplotypes present within the CNV regions. We have developed a novel computational method, called PiCNV, which enables to resolve the haplotype sequence composition within CNV regions in nuclear families based on SNP genotyping microarray data. The algorithm allows to i) phase normal and CNV-carrying haplotypes in the copy number variable regions, ii) resolve the allelic copies of rearranged DNA sequence within the haplotypes and iii) infer the heritability of identified haplotypes in trios or larger nuclear families. To our knowledge this is the first program available that can deterministically phase null, mono-, di-, tri- and tetraploid genotypes in CNV loci. We applied our method to study the composition and inheritance of haplotypes in CNV regions of 30 HapMap Yoruban trios and 34 Estonian families. For 93.6% of the CNV loci, PiCNV enabled to unambiguously phase normal and CNV-carrying haplotypes and follow their transmission in the corresponding families. Furthermore, allelic composition analysis identified the co-occurrence of alternative allelic copies within 66.7% of haplotypes carrying copy number gains. We also observed less frequent transmission of CNV-carrying haplotypes from parents to children compared to normal haplotypes and identified an emergence of several de novo deletions and duplications in the offspring.
Mineralocorticoid receptor haplotype, estradiol, progesterone and emotional information processing.
Hamstra, Danielle A; de Kloet, E Ronald; Quataert, Ina; Jansen, Myrthe; Van der Does, Willem
2017-02-01
Carriers of MR-haplotype 1 and 3 (GA/CG; rs5522 and rs2070951) are more sensitive to the influence of oral contraceptives (OC) and menstrual cycle phase on emotional information processing than MR-haplotype 2 (CA) carriers. We investigated whether this effect is associated with estradiol (E2) and/or progesterone (P4) levels. Healthy MR-genotyped premenopausal women were tested twice in a counterbalanced design. Naturally cycling (NC) women were tested in the early-follicular and mid-luteal phase and OC-users during OC-intake and in the pill-free week. At both sessions E2 and P4 were assessed in saliva. Tests included implicit and explicit positive and negative affect, attentional blink accuracy, emotional memory, emotion recognition, and risky decision-making (gambling). MR-haplotype 2 homozygotes had higher implicit happiness scores than MR-haplotype 2 heterozygotes (p=0.031) and MR-haplotype 1/3 carriers (p<0.001). MR-haplotype 2 homozygotes also had longer reaction times to happy faces in an emotion recognition test than MR-haplotype 1/3 (p=0.001). Practice effects were observed for most measures. The pattern of correlations between information processing and P4 or E2 differed between sessions, as well as the moderating effects of the MR genotype. In the first session the MR-genotype moderated the influence of P4 on implicit anxiety (sr=-0.30; p=0.005): higher P4 was associated with reduction in implicit anxiety, but only in MR-haplotype 2 homozygotes (sr=-0.61; p=0.012). In the second session the MR-genotype moderated the influence of E2 on the recognition of facial expressions of happiness (sr=-0.21; p=0.035): only in MR-haplotype 1/3 higher E2 was correlated with happiness recognition (sr=0.29; p=0.005). In the second session higher E2 and P4 were negatively correlated with accuracy in lag2 trials of the attentional blink task (p<0.001). Thus NC women, compared to OC-users, performed worse on lag 2 trials (p=0.041). The higher implicit happiness scores of MR-haplotype
A phased SNP-based classification of sickle cell anemia HBB haplotypes.
Shaikho, Elmutaz M; Farrell, John J; Alsultan, Abdulrahman; Qutub, Hatem; Al-Ali, Amein K; Figueiredo, Maria Stella; Chui, David H K; Farrer, Lindsay A; Murphy, George J; Mostoslavsky, Gustavo; Sebastiani, Paola; Steinberg, Martin H
2017-08-11
Sickle cell anemia causes severe complications and premature death. Five common β-globin gene cluster haplotypes are each associated with characteristic fetal hemoglobin (HbF) levels. As HbF is the major modulator of disease severity, classifying patients according to haplotype is useful. The first method of haplotype classification used restriction fragment length polymorphisms (RFLPs) to detect single nucleotide polymorphisms (SNPs) in the β-globin gene cluster. This is labor intensive, and error prone. We used genome-wide SNP data imputed to the 1000 Genomes reference panel to obtain phased data distinguishing parental alleles. We successfully haplotyped 813 sickle cell anemia patients previously classified by RFLPs with a concordance >98%. Four SNPs (rs3834466, rs28440105, rs10128556, and rs968857) marking four different restriction enzyme sites unequivocally defined most haplotypes. We were able to assign a haplotype to 86% of samples that were either partially or misclassified using RFLPs. Phased data using only four SNPs allowed unequivocal assignment of a haplotype that was not always possible using a larger number of RFLPs. Given the availability of genome-wide SNP data, our method is rapid and does not require high computational resources.
Genomic evolution in domestic cattle: ancestral haplotypes and healthy beef.
Williamson, Joseph F; Steele, Edward J; Lester, Susan; Kalai, Oscar; Millman, John A; Wolrige, Lindsay; Bayard, Dominic; McLure, Craig; Dawkins, Roger L
2011-05-01
We have identified numerous Ancestral Haplotypes encoding a 14-Mb region of Bota C19. Three are frequent in Simmental, Angus and Wagyu and have been conserved since common progenitor populations. Others are more relevant to the differences between these 3 breeds including fat content and distribution in muscle. SREBF1 and Growth Hormone, which have been implicated in the production of healthy beef, are included within these haplotypes. However, we conclude that alleles at these 2 loci are less important than other sequences within the haplotypes. Identification of breeds and hybrids is improved by using haplotypes rather than individual alleles. Copyright © 2010 Elsevier Inc. All rights reserved.
Tong, Da Yue; Wu, Xin Yao; Sun, Hong Yu; Zhao, Hu; Lu, Hui Ling
2010-11-01
Knowledge of allele and genotype frequencies is an essential prerequisite to the use of any human polymorphism in forensic work. To study the genetic polymorphism and evaluate the application value of nine STR loci. Genotyping of nine STR loci, including D11S2368, D12S391, D13S325, D18S1364, D22-GATA198B05, D6S1043, D2S1772, D7S3048 and D8S1132, of 1050 unrelated individuals was performed with the STR_Typer_10_v1 kit and Genetic Analyzer 3100 and analyzed with PowerState V12.xls and Arlequin ver 3.11 analyzing software. Allele frequency distribution was statistically analyzed and Hardy-Weinberg equilibrium determined. Several common parameters used in forensic sciences were found: the heterozygosity (H) ranged from 0.827 to 0.892; the matching probability (MP) ranged from 0.029 to 0.074; the power of discrimination (PD) ranged from 0.926 to 0.971; the power of exclusion (PE) ranged from 0.649 to 0.779; the polymorphic information content (PIC) ranged from 0.77 to 0.86; and the typical paternity index (TPI) ranged from 2.88 to 4.62. The results indicate that nine STR loci are high polymorphic among the Han population in Southern China. This set of polymorphic STR loci is a useful tool in forensic paternity testing and anthropological study.
TNF-alpha SNP haplotype frequencies in equidae.
Brown, J J; Ollier, W E R; Thomson, W; Matthews, J B; Carter, S D; Binns, M; Pinchbeck, G; Clegg, P D
2006-05-01
Tumour necrosis factor alpha (TNF-alpha) is a pro-inflammatory cytokine that plays a crucial role in the regulation of inflammatory and immune responses. In all vertebrate species the genes encoding TNF-alpha are located within the major histocompatability complex. In the horse TNF-alpha has been ascribed a role in a variety of important disease processes. Previously two single nucleotide polymorphisms (SNPs) have been reported within the 5' un-translated region of the equine TNF-alpha gene. We have examined the equine TNF-alpha promoter region further for additional SNPs by analysing DNA from 131 horses (Equus caballus), 19 donkeys (E. asinus), 2 Grant's zebras (E. burchellii boehmi) and one onager (E. hemionus). Two further SNPs were identified at nucleotide positions 24 (T/G) and 452 (T/C) relative to the first nucleotide of the 522 bp polymerase chain reaction product. A sequence variant at position 51 was observed between equidae. SNaPSHOT genotyping assays for these and the two previously reported SNPs were performed on 457 horses comprising seven different breeds and 23 donkeys to determine the gene frequencies. SNP frequencies varied considerably between different horse breeds and also between the equine species. In total, nine different TNF-alpha promoter SNP haplotypes and their frequencies were established amongst the various equidae examined, with some haplotypes being found only in horses and others only in donkeys or zebras. The haplotype frequencies observed varied greatly between different horse breeds. Such haplotypes may relate to levels of TNF-alpha production and disease susceptibility and further investigation is required to identify associations between particular haplotypes and altered risk of disease.
Population-specific FST values for forensic STR markers: A worldwide survey
Buckleton, John; Curran, James; Goudet, Jérôme; Taylor, Duncan; Thiery, Alexandre; Weir, B.S.
2016-01-01
The interpretation of matching between DNA profiles of a person of interest and an item of evidence is undertaken using population genetic models to predict the probability of matching by chance. Calculation of matching probabilities is straightforward if allelic probabilities are known, or can be estimated, in the relevant population. It is more often the case, however, that the relevant population has not been sampled and allele frequencies are available only from a broader collection of populations as might be represented in a national or regional database. Variation of allele probabilities among the relevant populations is quantified by the population structure quantity FST and this quanity affects matching propoptions. Matching within a population can be interpreted only with respect to matching between populations and we show here that FST, can be estimated from sample allelic matching proportions within and between populations. We report such estimates from data we extracted from 250 papers in the forensic literature, representing STR profiles at up to 24 loci from nearly 500,000 people in 446 different populations. The results suggest that theta values in current forensic use do not have the buffer of conservativism often thought. PMID:27082756
Population data of 21 autosomal STR loci in the Hausa, Igbo and Yoruba people of Nigeria.
Okolie, Victoria O; Cisana, Selena; Schanfield, Moses S; Adekoya, Khalid O; Oyedeji, Olufemi A; Podini, Daniele
2018-05-01
The three major ethnic groups of Nigerian population namely the Hausa, Igbo and Yoruba make up 29, 21 and 18% of the total population, respectively. To provide genetic information necessary for forensic analysis, this study was carried out to determine STR allele frequencies in 102 Hausa, 128 Igbo and 134 Yoruba individuals in Nigeria using 21 STR loci including the 20 CODIS (Combined DNA Index System) loci plus SE33.
A spatial haplotype copying model with applications to genotype imputation.
Yang, Wen-Yun; Hormozdiari, Farhad; Eskin, Eleazar; Pasaniuc, Bogdan
2015-05-01
Ever since its introduction, the haplotype copy model has proven to be one of the most successful approaches for modeling genetic variation in human populations, with applications ranging from ancestry inference to genotype phasing and imputation. Motivated by coalescent theory, this approach assumes that any chromosome (haplotype) can be modeled as a mosaic of segments copied from a set of chromosomes sampled from the same population. At the core of the model is the assumption that any chromosome from the sample is equally likely to contribute a priori to the copying process. Motivated by recent works that model genetic variation in a geographic continuum, we propose a new spatial-aware haplotype copy model that jointly models geography and the haplotype copying process. We extend hidden Markov models of haplotype diversity such that at any given location, haplotypes that are closest in the genetic-geographic continuum map are a priori more likely to contribute to the copying process than distant ones. Through simulations starting from the 1000 Genomes data, we show that our model achieves superior accuracy in genotype imputation over the standard spatial-unaware haplotype copy model. In addition, we show the utility of our model in selecting a small personalized reference panel for imputation that leads to both improved accuracy as well as to a lower computational runtime than the standard approach. Finally, we show our proposed model can be used to localize individuals on the genetic-geographical map on the basis of their genotype data.
APC Yin-Yang haplotype associated with colorectal cancer risk
GARRE, P.; DE LA HOYA, M.; INIESTA, P.; ROMERA, A.; LLOVET, P.; GONZALEZ, S.; PEREZ-SEGURA, P.; CAPELLA, G.; DIAZ-RUBIO, E.; CALDES, T.
2010-01-01
The Yin-Yang haplotype is defined as two mismatched haplotypes (Yin and Yang) representing the majority of the existing haplotypes in a particular genomic region. The human adenomatous polyposis coli (APC) gene shows a Yin-Yang haplotype pattern accounting for 84% of all of the haplotypes existing in the Spanish population. Several association studies have been published regarding APC gene variants (SNPs and haplotypes) and colorectal cancer (CRC) risk. However, no studies concerning diplotype structure and CRC risk have been conducted. The aim of the present study was to investigate whether the APC Yin-Yang homozygote diplotype is over-represented in patients with sporadic CRC when compared to its distribution in controls, and its association with CRC risk. TaqMan® assays were used to genotype three tagSNPs selected across the APC Yin-Yang region. Frequencies of the APC Yin-Yang tagSNP alleles, haplotype and diplotype of 378 CRC cases and 642 controls were compared. Two Spanish CRC group samples were included [Hospital Clínico San Carlos in Madrid (HCSC) and Instituto Catalán de Oncología in Barcelona (ICO)]. Analysis of 157 consecutive CRC patients and 405 control subjects from HCSC showed a significative effect for the risk of CRC (OR=1.93; 95% CI 1.32–2.81; P=0.001). However, this effect was not confirmed in 221 CRC patients and 237 control subjects from ICO (OR=0.89; 95% CI 0.61–1.28; P=0.521). We found a significant association between the APC homozygote Yin-Yang diplotype and the risk of colorectal cancer in the HCSC samples. However, we did not observe this association in the ICO samples. These observations suggest that a study with a larger Spanish cohort is necessary to confirm the effects of the APC Yin-Yang diplotype on the risk of CRC. PMID:22993613
APC Yin-Yang haplotype associated with colorectal cancer risk.
Garre, P; DE LA Hoya, M; Iniesta, P; Romera, A; Llovet, P; Gonzalez, S; Perez-Segura, P; Capella, G; Diaz-Rubio, E; Caldes, T
2010-09-01
The Yin-Yang haplotype is defined as two mismatched haplotypes (Yin and Yang) representing the majority of the existing haplotypes in a particular genomic region. The human adenomatous polyposis coli (APC) gene shows a Yin-Yang haplotype pattern accounting for 84% of all of the haplotypes existing in the Spanish population. Several association studies have been published regarding APC gene variants (SNPs and haplotypes) and colorectal cancer (CRC) risk. However, no studies concerning diplotype structure and CRC risk have been conducted. The aim of the present study was to investigate whether the APC Yin-Yang homozygote diplotype is over-represented in patients with sporadic CRC when compared to its distribution in controls, and its association with CRC risk. TaqMan(®) assays were used to genotype three tagSNPs selected across the APC Yin-Yang region. Frequencies of the APC Yin-Yang tagSNP alleles, haplotype and diplotype of 378 CRC cases and 642 controls were compared. Two Spanish CRC group samples were included [Hospital Clínico San Carlos in Madrid (HCSC) and Instituto Catalán de Oncología in Barcelona (ICO)]. Analysis of 157 consecutive CRC patients and 405 control subjects from HCSC showed a significative effect for the risk of CRC (OR=1.93; 95% CI 1.32-2.81; P=0.001). However, this effect was not confirmed in 221 CRC patients and 237 control subjects from ICO (OR=0.89; 95% CI 0.61-1.28; P=0.521). We found a significant association between the APC homozygote Yin-Yang diplotype and the risk of colorectal cancer in the HCSC samples. However, we did not observe this association in the ICO samples. These observations suggest that a study with a larger Spanish cohort is necessary to confirm the effects of the APC Yin-Yang diplotype on the risk of CRC.
Beta-globin gene cluster haplotypes of Amerindian populations from the Brazilian Amazon region.
Guerreiro, J F; Figueiredo, M S; Zago, M A
1994-01-01
We have determined the beta-globin cluster haplotypes for 80 Indians from four Brazilian Amazon tribes: Kayapó, Wayampí, Wayana-Apalaí, and Arára. The results are analyzed together with 20 Yanomámi previously studied. From 2 to 4 different haplotypes were identified for each tribe, and 7 of the possible 32 haplotypes were found in a sample of 172 chromosomes for which the beta haplotypes were directly determined or derived from family studies. The haplotype distribution does not differ significantly among the five populations. The two most common haplotypes in all tribes were haplotypes 2 and 6, with average frequencies of 0.843 and 0.122, respectively. The genetic affinities between Brazilian Indians and other human populations were evaluated by estimates of genetic distance based on haplotype data. The lowest values were observed in relation to Asians, especially Chinese, Polynesians, and Micronesians.
Van Neste, Christophe; Van Criekinge, Wim; Deforce, Dieter; Van Nieuwerburgh, Filip
2016-01-01
It is difficult to predict if and when massively parallel sequencing of forensic STR loci will replace capillary electrophoresis as the new standard technology in forensic genetics. The main benefits of sequencing are increased multiplexing scales and SNP detection. There is not yet a consensus on how sequenced profiles should be reported. We present the Forensic Loci Allele Database (FLAD) service, made freely available on http://forensic.ugent.be/FLAD/. It offers permanent identifiers for sequenced forensic alleles (STR or SNP) and their microvariants for use in forensic allele nomenclature. Analogous to Genbank, its aim is to provide permanent identifiers for forensically relevant allele sequences. Researchers that are developing forensic sequencing kits or are performing population studies, can register on http://forensic.ugent.be/FLAD/ and add loci and allele sequences with a short and simple application interface (API). Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Mathematical properties and bounds on haplotyping populations by pure parsimony.
Wang, I-Lin; Chang, Chia-Yuan
2011-06-01
Although the haplotype data can be used to analyze the function of DNA, due to the significant efforts required in collecting the haplotype data, usually the genotype data is collected and then the population haplotype inference (PHI) problem is solved to infer haplotype data from genotype data for a population. This paper investigates the PHI problem based on the pure parsimony criterion (HIPP), which seeks the minimum number of distinct haplotypes to infer a given genotype data. We analyze the mathematical structure and properties for the HIPP problem, propose techniques to reduce the given genotype data into an equivalent one of much smaller size, and analyze the relations of genotype data using a compatible graph. Based on the mathematical properties in the compatible graph, we propose a maximal clique heuristic to obtain an upper bound, and a new polynomial-sized integer linear programming formulation to obtain a lower bound for the HIPP problem. Copyright © 2011 Elsevier Inc. All rights reserved.
Mirabal, Sheyla; Herrera, Kristian J; Gayden, Tenzin; Regueiro, Maria; Underhill, Peter A; Garcia-Bertrand, Ralph L; Herrera, Rene J
2012-01-25
The Austronesian expansion has left its fingerprint throughout two thirds of the circumference of the globe reaching the island of Madagascar in East Africa to the west and Easter Island, off the coast of Chile, to the east. To date, several theories exist to explain the current genetic distribution of Austronesian populations, with the "slow boat" model being the most widely accepted, though other conjectures (i.e., the "express train" and "entangled bank" hypotheses) have also been widely discussed. In the current study, 158 Y chromosomes from the Polynesian archipelagos of Samoa and Tonga were typed using high resolution binary markers and compared to populations across Mainland East Asia, Taiwan, Island Southeast Asia, Melanesia and Polynesia in order to establish their patrilineal genetic relationships. Y-STR haplotypes on the C2 (M38), C2a (M208), O1a (M119), O3 (M122) and O3a2 (P201) backgrounds were utilized in an attempt to identify the differing sources of the current Y-chromosomal haplogroups present throughout Polynesia (of Melanesian and/or Asian descent). We find that, while haplogroups C2a, S and K3-P79 suggest a Melanesian component in 23%-42% of the Samoan and Tongan Y chromosomes, the majority of the paternal Polynesian gene pool exhibits ties to East Asia. In particular, the prominence of sub-haplogroup O3a2c* (P164), which has previously been observed at only minimal levels in Mainland East Asians (2.0-4.5%), in both Polynesians (ranging from 19% in Manua to 54% in Tonga) and Ami aborigines from Taiwan (37%) provides, for the first time, evidence for a genetic connection between the Polynesian populations and the Ami. Copyright © 2011 Elsevier B.V. All rights reserved.
Röper, Andrea; Reichert, Walter; Mattern, Rainer
2007-01-01
In the field of forensic DNA typing, the analysis of Short Tandem Repeats (STRs) can fail in cases of degraded DNA. The typing of coding region Single Nucleotide Polymorphisms (SNPs) of the mitochondrial genome provides an approach to acquire additional information. In the examined case of aggravated theft, both suspects could be excluded of having left the analyzed hair on the crime scene by SNP typing. This conclusion was not possible subsequent to STR typing. SNP typing of the trace on the torch light left on the crime scene increased the likelihood for suspect no. 2 to be the origin of this trace. This finding was already indicated by STR analysis. Suspect no. 1 was excluded for being the origin of this trace by SNP typing which was also indicated by STR analysis. A limiting factor for the analysis of SNPs is the maternal inheritance of mitochondrial DNA. Individualisation is not possible. In conclusion, it can be said that in the case of traces which cause problems with conventional STR typing the supplementary analysis of coding region SNPs from the mitochondrial genome is very reasonable and greatly contributes to the refinement of analysis methods in the field of forensic genetics.
MiniX-STR multiplex system population study in Japan and application to degraded DNA analysis.
Asamura, H; Sakai, H; Kobayashi, K; Ota, M; Fukushima, H
2006-05-01
We sought to evaluate a more effective system for analyzing X-chromosomal short tandem repeats (X-STRs) in highly degraded DNA. To generate smaller amplicon lengths, we designed new polymerase chain reaction (PCR) primers for DXS7423, DXS6789, DXS101, GATA31E08, DXS8378, DXS7133, DXS7424, and GATA165B12 at X-linked short tandem repeat (STR) loci, devising two miniX-multiplex PCR systems. Among 333 Japanese individuals, these X-linked loci were detected in amplification products ranging in length from 76 to 169 bp, and statistical analyses of the eight loci indicated a high usefulness for the Japanese forensic practice. Results of tests on highly degraded DNA indicated the miniX-STR multiplex strategies to be an effective system for analyzing degraded DNA. We conclude that analysis by the current miniX-STR multiplex systems offers high effectiveness for personal identification from degraded DNA samples.
Origin and Diversification Dynamics of Self-Incompatibility Haplotypes
Gervais, Camille E.; Castric, Vincent; Ressayre, Adrienne; Billiard, Sylvain
2011-01-01
Self-incompatibility (SI) is a genetic system found in some hermaphrodite plants. Recognition of pollen by pistils expressing cognate specificities at two linked genes leads to rejection of self pollen and pollen from close relatives, i.e., to avoidance of self-fertilization and inbred matings, and thus increased outcrossing. These genes generally have many alleles, yet the conditions allowing the evolution of new alleles remain mysterious. Evolutionary changes are clearly necessary in both genes, since any mutation affecting only one of them would result in a nonfunctional self-compatible haplotype. Here, we study diversification at the S-locus (i.e., a stable increase in the total number of SI haplotypes in the population, through the incorporation of new SI haplotypes), both deterministically (by investigating analytically the fate of mutations in an infinite population) and by simulations of finite populations. We show that the conditions allowing diversification are far less stringent in finite populations with recurrent mutations of the pollen and pistil genes, suggesting that diversification is possible in a panmictic population. We find that new SI haplotypes emerge fastest in populations with few SI haplotypes, and we discuss some implications for empirical data on S-alleles. However, allele numbers in our simulations never reach values as high as observed in plants whose SI systems have been studied, and we suggest extensions of our models that may reconcile the theory and data. PMID:21515570
Zhang, Yu-Long; Zhi, Yong-Chao; Zhang, Dao-Chuan
2018-04-23
A new species (i.e. Bryodemella (s. str.) rufifemura sp. nov. from China is described in this paper. It is similar to Bryodemella (s. str.) diamesum (Bei-Bienko, 1930), but differs from the latter by red inner side of hind femur, median keel of pronotum indistinct in metazoan, and vertical diameter of eye shorter than subocular groove in female. The type specimens are deposited in the Museum of Hebei University (MHU), China.
PWHATSHAP: efficient haplotyping for future generation sequencing.
Bracciali, Andrea; Aldinucci, Marco; Patterson, Murray; Marschall, Tobias; Pisanti, Nadia; Merelli, Ivan; Torquati, Massimo
2016-09-22
Haplotype phasing is an important problem in the analysis of genomics information. Given a set of DNA fragments of an individual, it consists of determining which one of the possible alleles (alternative forms of a gene) each fragment comes from. Haplotype information is relevant to gene regulation, epigenetics, genome-wide association studies, evolutionary and population studies, and the study of mutations. Haplotyping is currently addressed as an optimisation problem aiming at solutions that minimise, for instance, error correction costs, where costs are a measure of the confidence in the accuracy of the information acquired from DNA sequencing. Solutions have typically an exponential computational complexity. WHATSHAP is a recent optimal approach which moves computational complexity from DNA fragment length to fragment overlap, i.e., coverage, and is hence of particular interest when considering sequencing technology's current trends that are producing longer fragments. Given the potential relevance of efficient haplotyping in several analysis pipelines, we have designed and engineered PWHATSHAP, a parallel, high-performance version of WHATSHAP. PWHATSHAP is embedded in a toolkit developed in Python and supports genomics datasets in standard file formats. Building on WHATSHAP, PWHATSHAP exhibits the same complexity exploring a number of possible solutions which is exponential in the coverage of the dataset. The parallel implementation on multi-core architectures allows for a relevant reduction of the execution time for haplotyping, while the provided results enjoy the same high accuracy as that provided by WHATSHAP, which increases with coverage. Due to its structure and management of the large datasets, the parallelisation of WHATSHAP posed demanding technical challenges, which have been addressed exploiting a high-level parallel programming framework. The result, PWHATSHAP, is a freely available toolkit that improves the efficiency of the analysis of genomics
Gutiérrez-Preciado, Ana; Vargas-Chávez, Carlos; Reyes-Prieto, Mariana; Ordoñez, Omar F; Santos-García, Diego; Rosas-Pérez, Tania; Valdivia-Anistro, Jorge; Rebollar, Eria A; Saralegui, Andrés; Moya, Andrés; Merino, Enrique; Farías, María Eugenia; Latorre, Amparo; Souza, Valeria
2017-01-01
We report the genome sequence of Exiguobacterium chiriqhucha str. N139, isolated from a high-altitude Andean lake. Comparative genomic analyses of the Exiguobacterium genomes available suggest that our strain belongs to the same species as the previously reported E. pavilionensis str. RW-2 and Exiguobacterium str. GIC 31. We describe this species and propose the chiriqhucha name to group them. 'Chiri qhucha' in Quechua means 'cold lake', which is a common origin of these three cosmopolitan Exiguobacteria. The 2,952,588-bp E. chiriqhucha str. N139 genome contains one chromosome and three megaplasmids. The genome analysis of the Andean strain suggests the presence of enzymes that confer E. chiriqhucha str. N139 the ability to grow under multiple environmental extreme conditions, including high concentrations of different metals, high ultraviolet B radiation, scavenging for phosphorous and coping with high salinity. Moreover, the regulation of its tryptophan biosynthesis suggests that novel pathways remain to be discovered, and that these pathways might be fundamental in the amino acid metabolism of the microbial community from Laguna Negra, Argentina.
Reyes-Prieto, Mariana; Ordoñez, Omar F.; Santos-García, Diego; Rosas-Pérez, Tania; Valdivia-Anistro, Jorge; Rebollar, Eria A.; Saralegui, Andrés; Moya, Andrés; Merino, Enrique; Farías, María Eugenia
2017-01-01
We report the genome sequence of Exiguobacterium chiriqhucha str. N139, isolated from a high-altitude Andean lake. Comparative genomic analyses of the Exiguobacterium genomes available suggest that our strain belongs to the same species as the previously reported E. pavilionensis str. RW-2 and Exiguobacterium str. GIC 31. We describe this species and propose the chiriqhucha name to group them. ‘Chiri qhucha’ in Quechua means ‘cold lake’, which is a common origin of these three cosmopolitan Exiguobacteria. The 2,952,588-bp E. chiriqhucha str. N139 genome contains one chromosome and three megaplasmids. The genome analysis of the Andean strain suggests the presence of enzymes that confer E. chiriqhucha str. N139 the ability to grow under multiple environmental extreme conditions, including high concentrations of different metals, high ultraviolet B radiation, scavenging for phosphorous and coping with high salinity. Moreover, the regulation of its tryptophan biosynthesis suggests that novel pathways remain to be discovered, and that these pathways might be fundamental in the amino acid metabolism of the microbial community from Laguna Negra, Argentina. PMID:28439458
[Mutation Analysis of 19 STR Loci in 20 723 Cases of Paternity Testing].
Bi, J; Chang, J J; Li, M X; Yu, C Y
2017-06-01
To observe and analyze the confirmed cases of paternity testing, and to explore the mutation rules of STR loci. The mutant STR loci were screened from 20 723 confirmed cases of paternity testing by Goldeneye 20A system.The mutation rates, and the sources, fragment length, steps and increased or decreased repeat sequences of mutant alleles were counted for the analysis of the characteristics of mutation-related factors. A total of 548 mutations were found on 19 STR loci, and 557 mutation events were observed. The loci mutation rate was 0.07‰-2.23‰. The ratio of paternal to maternal mutant events was 3.06:1. One step mutation was the main mutation, and the number of the increased repeat sequences was almost the same as the decreased repeat sequences. The repeat sequences were more likely to decrease in two steps mutation and above. Mutation mainly occurred in the medium allele, and the number of the increased repeat sequences was almost the same as the decreased repeat sequences. In long allele mutations, the decreased repeat sequences were significantly more than the increased repeat sequences. The number of the increased repeat sequences was almost the same as the decreased repeat sequences in paternal mutation, while the decreased repeat sequences were more than the increased in maternal mutation. There are significant differences in the mutation rate of each locus. When one or two loci do not conform to the genetic law, other detection system should be added, and PI value should be calculated combined with the information of the mutate STR loci in order to further clarify the identification opinions. Copyright© by the Editorial Department of Journal of Forensic Medicine
Recent Advances in Experimental Whole Genome Haplotyping Methods
Huang, Mengting; Lu, Zuhong
2017-01-01
Haplotype plays a vital role in diverse fields; however, the sequencing technologies cannot resolve haplotype directly. Pioneers demonstrated several approaches to resolve haplotype in the early years, which was extensively reviewed. Since then, numerous methods have been developed recently that have significantly improved phasing performance. Here, we review experimental methods that have emerged mainly over the past five years, and categorize them into five classes according to their maximum scale of contiguity: (i) encapsulation, (ii) 3D structure capture and construction, (iii) compartmentalization, (iv) fluorography, (v) long-read sequencing. Several subsections of certain methods are attached to each class as instances. We also discuss the relative advantages and disadvantages of different classes and make comparisons among representative methods of each class. PMID:28891974
DOT National Transportation Integrated Search
1976-09-01
The present report discusses the development, implementation, and current status of the Short Term Rehabilitation (STR) Study initiates by the NHTSA in 1974. Experimental designs employed by each of the 11 ASAP/STR sites for the assignment of mid-ran...
Zhang, Xiufeng; Hu, Liping; Du, Lei; Nie, Aiting; Rao, Min; Pang, Jing Bo; Xiran, Zeng; Nie, Shengjie
2017-05-01
The genetic polymorphisms of 20 autosomal short tandem repeat (STR) loci included in the PowerPlex ® 21 kit were evaluated from 748 unrelated healthy individuals of the Miao ethnic minority living in the Yunnan province in southwestern China. All of the loci reached Hardy-Weinberg equilibrium. These loci were examined to determine allele frequencies and forensic statistical parameters. The genetic relationship between the Miao population and other Chinese populations were also estimated. The combined discrimination power and probability of excluding paternity of the 20 STR loci were 0.999 999 999 999 999 999 999 991 26 and 0.999 999 975, respectively. The results suggested that the 20 STR loci were highly polymorphic, which makes them suitable for forensic personal identification and paternity testing. Copyright © 2017 Elsevier B.V. All rights reserved.
Schlebusch, Carina M; Soodyall, Himlya
2012-12-01
The San and Khoe people currently represent remnant groups of a much larger and widely distributed population of hunter-gatherers and pastoralists who had exclusive occupation of southern Africa before the arrival of Bantu-speaking groups in the past 1,200 years and sea-borne immigrants within the last 350 years. Genetic studies [mitochondrial deoxyribonucleic acid (DNA) and Y-chromosome] conducted on San and Khoe groups revealed that they harbor some of the most divergent lineages found in living peoples throughout the world. Recently, high-density, autosomal, single-nucleotide polymorphism (SNP)-array studies confirmed the early divergence of Khoe-San population groups from all other human populations. The present study made use of 220 autosomal SNP markers (in the format of both haplotypes and genotypes) to examine the population structure of various San and Khoe groups and their relationship to other neighboring groups. Whereas analyses based on the genotypic SNP data only supported the division of the included populations into three main groups-Khoe-San, Bantu-speakers, and non-African populations-haplotype analyses revealed finer structure within Khoe-San populations. By the use of only 44 short SNP haplotypes (compiled from a total of 220 SNPs), most of the Khoe-San groups could be resolved as separate groups by applying STRUCTURE analyses. Therefore, by carefully selecting a few SNPs and combining them into haplotypes, we were able to achieve the same level of population distinction that was achieved previously in high-density SNP studies on the same population groups. Using haplotypes proved to be a very efficient and cost-effective way to study population structure. Copyright © 2013 Wayne State University Press, Detroit, Michigan 48201-1309.
De novo assembly of a haplotype-resolved human genome.
Cao, Hongzhi; Wu, Honglong; Luo, Ruibang; Huang, Shujia; Sun, Yuhui; Tong, Xin; Xie, Yinlong; Liu, Binghang; Yang, Hailong; Zheng, Hancheng; Li, Jian; Li, Bo; Wang, Yu; Yang, Fang; Sun, Peng; Liu, Siyang; Gao, Peng; Huang, Haodong; Sun, Jing; Chen, Dan; He, Guangzhu; Huang, Weihua; Huang, Zheng; Li, Yue; Tellier, Laurent C A M; Liu, Xiao; Feng, Qiang; Xu, Xun; Zhang, Xiuqing; Bolund, Lars; Krogh, Anders; Kristiansen, Karsten; Drmanac, Radoje; Drmanac, Snezana; Nielsen, Rasmus; Li, Songgang; Wang, Jian; Yang, Huanming; Li, Yingrui; Wong, Gane Ka-Shu; Wang, Jun
2015-06-01
The human genome is diploid, and knowledge of the variants on each chromosome is important for the interpretation of genomic information. Here we report the assembly of a haplotype-resolved diploid genome without using a reference genome. Our pipeline relies on fosmid pooling together with whole-genome shotgun strategies, based solely on next-generation sequencing and hierarchical assembly methods. We applied our sequencing method to the genome of an Asian individual and generated a 5.15-Gb assembled genome with a haplotype N50 of 484 kb. Our analysis identified previously undetected indels and 7.49 Mb of novel coding sequences that could not be aligned to the human reference genome, which include at least six predicted genes. This haplotype-resolved genome represents the most complete de novo human genome assembly to date. Application of our approach to identify individual haplotype differences should aid in translating genotypes to phenotypes for the development of personalized medicine.
HERC1 polymorphisms: population-specific variations in haplotype composition.
Yuasa, Isao; Umetsu, Kazuo; Nishimukai, Hiroaki; Fukumori, Yasuo; Harihara, Shinji; Saitou, Naruya; Jin, Feng; Chattopadhyay, Prasanta K; Henke, Lotte; Henke, Jürgen
2009-08-01
Human HERC1 is one of six HERC proteins and may play an important role in intracellular membrane trafficking. The human HERC1 gene is suggested to have been affected by local positive selection. To assess the global frequency distributions of coding and non-coding single nucleotide polymorphisms (SNPs) in the HERC1 gene, we developed a new simultaneous genotyping method for four SNPs, and applied this method to investigate 1213 individuals from 12 global populations. The results confirmed remarked differences in the allele and haplotype frequencies between East Asian and non-East Asian populations. One of the three common haplotypes observed was found to be characteristic of East Asians, who showed a relatively uniform distribution of haplotypes. Information on haplotypes would be useful for testing the function of polymorphisms in the HERC1 gene. This is the first study to investigate the distribution of HERC1 polymorphisms in various populations. (c) 2009 John Wiley & Sons, Ltd.
Honey bee-inspired algorithms for SNP haplotype reconstruction problem
NASA Astrophysics Data System (ADS)
PourkamaliAnaraki, Maryam; Sadeghi, Mehdi
2016-03-01
Reconstructing haplotypes from SNP fragments is an important problem in computational biology. There have been a lot of interests in this field because haplotypes have been shown to contain promising data for disease association research. It is proved that haplotype reconstruction in Minimum Error Correction model is an NP-hard problem. Therefore, several methods such as clustering techniques, evolutionary algorithms, neural networks and swarm intelligence approaches have been proposed in order to solve this problem in appropriate time. In this paper, we have focused on various evolutionary clustering techniques and try to find an efficient technique for solving haplotype reconstruction problem. It can be referred from our experiments that the clustering methods relying on the behaviour of honey bee colony in nature, specifically bees algorithm and artificial bee colony methods, are expected to result in more efficient solutions. An application program of the methods is available at the following link. http://www.bioinf.cs.ipm.ir/software/haprs/
Short communication: casein haplotype variability in sicilian dairy goat breeds.
Gigli, I; Maizon, D O; Riggio, V; Sardina, M T; Portolano, B
2008-09-01
In the Mediterranean region, goat milk production is an important economic activity. In the present study, 4 casein genes were genotyped in 5 Sicilian goat breeds to 1) identify casein haplotypes present in the Argentata dell'Etna, Girgentana, Messinese, Derivata di Siria, and Maltese goat breeds; and 2) describe the structure of the Sicilian goat breeds based on casein haplotypes and allele frequencies. In a sample of 540 dairy goats, 67 different haplotypes with frequency >or=0.01 and 27 with frequency >or=0.03 were observed. The most common CSN1S1-CSN2-CSN1S2-CSN3 haplotype for Derivata di Siria and Maltese was FCFB (0.17 and 0.22, respectively), whereas for Argentata dell'Etna, Girgentana and Messinese was ACAB (0.06, 0.23, and 0.10, respectively). According to the haplotype reconstruction, Argentata dell'Etna, Girgentana, and Messinese breeds presented the most favorable haplotype for cheese production, because the casein concentration in milk of these breeds might be greater than that in Derivata di Siria and Maltese breeds. Based on a cluster analysis, the breeds formed 2 main groups: Derivata di Siria, and Maltese in one group, and Argentata dell'Etna and Messinese in the other; the Girgentana breed was between these groups but closer to the latter.
Vinkers, Christiaan H; Joëls, Marian; Milaneschi, Yuri; Gerritsen, Lotte; Kahn, René S; Penninx, Brenda W J H; Boks, Marco P M
2015-04-01
The MR is an important regulator of the hypothalamic-pituitary-adrenal (HPA) axis and a prime target for corticosteroids. There is increasing evidence from both clinical and preclinical studies that the MR has different effects on behavior and mood in males and females. To investigate the hypothesis that the MR sex-dependently influences the relation between childhood maltreatment and depression, we investigated three common and functional MR haplotypes (GA, CA, and CG haplotype, based on rs5522 and rs2070951) in a population-based cohort (N = 665) and an independent clinical cohort from the Netherlands Study of Depression and Anxiety (NESDA) (N = 1639). The CA haplotype sex-dependently moderated the relation between childhood maltreatment and depressive symptoms both in the population-based sample (sex × maltreatment × haplotype: β = -4.07, P = 0.029) and in the clinical sample (sex × maltreatment × haplotype, β = -2.40, P = 0.011). Specifically, female individuals in the population-based sample were protected (β = -4.58, P = 2.0 e(-5)), whereas males in the clinical sample were at increased risk (β = 2.54, P = 0.0022). In line with these results, female GA haplotype carriers displayed increased vulnerability in the population-based sample (β = 4.58, P = 7.5 e(-5)) whereas male CG-carriers showed increased resilience in the clinical sample (β = -2.71, P = 0.016). Consistently, we found a decreased lifetime MDD risk for male GA haplotype carriers following childhood maltreatment but an increased risk for male CA haplotype carriers in the clinical sample. In both samples, sex-dependent effects were observed for GA-GA diplotype carriers. In summary, sex plays an important role in determining whether functional genetic variation in MR is beneficial or detrimental, with an apparent female advantage for the CA haplotype but male advantage for the GA and CG haplotype. These sex-dependent effects of MR on depression susceptibility following childhood
The whole genome sequences and experimentally phased haplotypes of over 100 personal genomes.
Mao, Qing; Ciotlos, Serban; Zhang, Rebecca Yu; Ball, Madeleine P; Chin, Robert; Carnevali, Paolo; Barua, Nina; Nguyen, Staci; Agarwal, Misha R; Clegg, Tom; Connelly, Abram; Vandewege, Ward; Zaranek, Alexander Wait; Estep, Preston W; Church, George M; Drmanac, Radoje; Peters, Brock A
2016-10-11
Since the completion of the Human Genome Project in 2003, it is estimated that more than 200,000 individual whole human genomes have been sequenced. A stunning accomplishment in such a short period of time. However, most of these were sequenced without experimental haplotype data and are therefore missing an important aspect of genome biology. In addition, much of the genomic data is not available to the public and lacks phenotypic information. As part of the Personal Genome Project, blood samples from 184 participants were collected and processed using Complete Genomics' Long Fragment Read technology. Here, we present the experimental whole genome haplotyping and sequencing of these samples to an average read coverage depth of 100X. This is approximately three-fold higher than the read coverage applied to most whole human genome assemblies and ensures the highest quality results. Currently, 114 genomes from this dataset are freely available in the GigaDB repository and are associated with rich phenotypic data; the remaining 70 should be added in the near future as they are approved through the PGP data release process. For reproducibility analyses, 20 genomes were sequenced at least twice using independent LFR barcoded libraries. Seven genomes were also sequenced using Complete Genomics' standard non-barcoded library process. In addition, we report 2.6 million high-quality, rare variants not previously identified in the Single Nucleotide Polymorphisms database or the 1000 Genomes Project Phase 3 data. These genomes represent a unique source of haplotype and phenotype data for the scientific community and should help to expand our understanding of human genome evolution and function.
Ancestral association between HLA and HFE H63D and C282Y gene mutations from northwest Colombia
Rodriguez, Libia M; Giraldo, Mabel C; Velasquez, Laura I; Alvarez, Cristiam M; Garcia, Luis F; Jimenez-Del-Rio, Marlene; Velez-Pardo, Carlos
2015-01-01
A significant association between HFE gene mutations and the HLA-A*03-B*07 and HLA-A*29-B*44 haplotypes has been reported in the Spanish population. It has been proposed that these mutations are probably connected with Celtic and North African ancestry, respectively. We aimed to find the possible ancestral association between HLA alleles and haplotypes associated with the HFE gene (C282Y and H63D) mutations in 214 subjects from Antioquia, Colombia. These were 18 individuals with presumed hereditary hemochromatosis (“HH”) and 196 controls. The HLA-B*07 allele was in linkage disequilibrium (LD) with C282Y, while HLA-A*23, A*29, HLA-B*44, and B*49 were in LD with H63D. Altogether, our results show that, although the H63D mutation is more common in the Antioquia population, it is not associated with any particular HLA haplotype, whereas the C282Y mutation is associated with HLA-A*03-B*07, this supporting a northern Spaniard ancestry. PMID:25983618
Ancestral association between HLA and HFE H63D and C282Y gene mutations from northwest Colombia.
Rodriguez, Libia M; Giraldo, Mabel C; Velasquez, Laura I; Alvarez, Cristiam M; Garcia, Luis F; Jimenez-Del-Rio, Marlene; Velez-Pardo, Carlos
2015-03-01
A significant association between HFE gene mutations and the HLA-A*03-B*07 and HLA-A*29-B*44 haplotypes has been reported in the Spanish population. It has been proposed that these mutations are probably connected with Celtic and North African ancestry, respectively. We aimed to find the possible ancestral association between HLA alleles and haplotypes associated with the HFE gene (C282Y and H63D) mutations in 214 subjects from Antioquia, Colombia. These were 18 individuals with presumed hereditary hemochromatosis ("HH") and 196 controls. The HLA-B*07 allele was in linkage disequilibrium (LD) with C282Y, while HLA-A*23, A*29, HLA-B*44, and B*49 were in LD with H63D. Altogether, our results show that, although the H63D mutation is more common in the Antioquia population, it is not associated with any particular HLA haplotype, whereas the C282Y mutation is associated with HLA-A*03-B*07, this supporting a northern Spaniard ancestry.
Fitchi: haplotype genealogy graphs based on the Fitch algorithm.
Matschiner, Michael
2016-04-15
: In population genetics and phylogeography, haplotype genealogy graphs are important tools for the visualization of population structure based on sequence data. In this type of graph, node sizes are often drawn in proportion to haplotype frequencies and edge lengths represent the minimum number of mutations separating adjacent nodes. I here present Fitchi, a new program that produces publication-ready haplotype genealogy graphs based on the Fitch algorithm. http://www.evoinformatics.eu/fitchi.htm : michaelmatschiner@mac.com Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Development of a 20-locus fluorescent multiplex system as a valuable tool for national DNA database.
Jiang, Xianhua; Guo, Fei; Jia, Fei; Jin, Ping; Sun, Zhu
2013-02-01
The multiplex system allows the detection of 19 autosomal short tandem repeat (STR) loci [including all Combined DNA Index System (CODIS) STR loci as well as D2S1338, D6S1043, D12S391, D19S433, Penta D and Penta E] plus the sex-determining locus Amelogenin in a single reaction, comprising all STR loci in various commercial kits used in the China national DNA database (NDNAD). Primers are designed so that the amplicons are distributed ranging from 90 base pairs (bp) to 450 bp within a five-dye fluorescent design with the fifth dye reserved for the internal size standard. With 30 cycles, 125 pg to 2 ng DNA template showed optimal profiling result, while robust profiles could also be achieved by adjusting the cycle numbers for the DNA template beyond that optimal DNA input range. Mixture studies showed that 83% and 87% of minor alleles were detected at 9:1 and 1:9 ratios, respectively. When 4 ng of degraded DNA was digested by 2-min DNase and 1 ng undegraded DNA was added to 400 μM haematin, the complete profiles were still observed. Polymerase chain reaction (PCR)-based procedures were examined and optimized including the concentrations of primer set, magnesium and the Taq polymerase as well as volume, cycle number and annealing temperature. In addition, the system has been validated by 3000 bloodstain samples and 35 common case samples in line with the Chinese National Standards and Scientific Working Group on DNA Analysis Methods (SWGDAM) guidelines. The total probability of identity (TPI) can reach to 8×10(-24), where DNA database can be improved at the level of 10 million DNA profiles or more because the number of expected match is far from one person (4×10(-10)) and can be negligible. Further, our system also demonstrates its good performance in case samples and it will be an ideal tool for forensic DNA typing and databasing with potential application. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
Houston, Rachel; Birck, Matthew; Hughes-Stamm, Sheree; Gangitano, David
2017-05-01
Marijuana (Cannabis sativa L.) is a plant cultivated and trafficked worldwide as a source of fiber (hemp), medicine, and intoxicant. The development of a validated method using molecular techniques such as short tandem repeats (STRs) could serve as an intelligence tool to link multiple cases by means of genetic individualization or association of cannabis samples. For this purpose, a 13 loci STR multiplex method was developed, optimized, and validated according to relevant ISFG and SWGDAM guidelines. The STR multiplex consists of 13 previously described C. sativa STR loci: ANUCS501, 9269, 4910, 5159, ANUCS305, 9043, B05, 1528, 3735, CS1, D02, C11, and H06. A sequenced allelic ladder consisting of 56 alleles was designed to accurately genotype 101 C. sativa samples from three seizures provided by a U.S. Customs and Border Protection crime lab. Using an optimal range of DNA (0.5-1.0ng), validation studies revealed well-balanced electropherograms (inter-locus balance range: 0.500-1.296), relatively balanced heterozygous peaks (mean peak height ratio of 0.83 across all loci) with minimal artifacts and stutter ratio (mean stutter of 0.021 across all loci). This multi-locus system is relatively sensitive (0.13ng of template DNA) with a combined power of discrimination of 1 in 55 million. The 13 STR panel was found to be species specific for C. sativa; however, non-specific peaks were produced with Humulus lupulus. The results of this research demonstrate the robustness and applicability of this 13 loci STR system for forensic DNA profiling of marijuana samples. Copyright © 2017 Elsevier B.V. All rights reserved.
RTEL1 tagging SNPs and haplotypes were associated with glioma development
2013-01-01
Abstract As glioma ranks as the first most prevalent solid tumors in primary central nervous system, certain single-nucleotide polymorphisms (SNPs) may be related to increased glioma risk, and have implications in carcinogenesis. The present case–control study was carried out to elucidate how common variants contribute to glioma susceptibility. Ten candidate tagging SNPs (tSNPs) were selected from seven genes whose polymorphisms have been proven by classical literatures and reliable databases to be tended to relate with gliomas, and with the minor allele frequency (MAF) > 5% in the HapMap Asian population. The selected tSNPs were genotyped in 629 glioma patients and 645 controls from a Han Chinese population using the multiplexed SNP MassEXTEND assay calibrated. Two significant tSNPs in RTEL1 gene were observed to be associated with glioma risk (rs6010620, P = 0.0016, OR: 1.32, 95% CI: 1.11-1.56; rs2297440, P = 0.001, OR: 1.33, 95% CI: 1.12-1.58) by χ2 test. It was identified the genotype “GG” of rs6010620 acted as the protective genotype for glioma (OR, 0.46; 95% CI, 0.31-0.7; P = 0.0002), while the genotype “CC” of rs2297440 as the protective genotype in glioma (OR, 0.47; 95% CI, 0.31-0.71; P = 0.0003). Furthermore, haplotype “GCT” in RTEL1 gene was found to be associated with risk of glioma (OR, 0.7; 95% CI, 0.57-0.86; Fisher’s P = 0.0005; Pearson’s P = 0.0005), and haplotype “ATT” was detected to be associated with risk of glioma (OR, 1.32; 95% CI, 1.12-1.57; Fisher’s P = 0.0013; Pearson’s P = 0.0013). Two single variants, the genotypes of “GG” of rs6010620 and “CC” of rs2297440 (rs6010620 and rs2297440) in the RTEL1 gene, together with two haplotypes of GCT and ATT, were identified to be associated with glioma development. And it might be used to evaluate the glioma development risks to screen the above RTEL1 tagging SNPs and haplotypes. Virtual slides The virtual slides for this article
Polymorphism at Expressed DQ and DR Loci in Five Common Equine MHC Haplotypes
Miller, Donald; Tallmadge, Rebecca L.; Binns, Matthew; Zhu, Baoli; Mohamoud, Yasmin Ali; Ahmed, Ayeda; Brooks, Samantha A.; Antczak, Douglas F.
2016-01-01
The polymorphism of Major Histocompatibility Complex (MHC) class II DQ and DR genes in five common Equine Leukocyte Antigen (ELA) haplotypes was determined through sequencing of mRNA transcripts isolated from lymphocytes of eight ELA homozygous horses. Ten expressed MHC class II genes were detected in horses of the ELA-A3 haplotype carried by the donor horses of the equine Bacterial Artificial Chromosome (BAC) library and the reference genome sequence: four DR genes and six DQ genes. The other four ELA haplotypes contained at least eight expressed polymorphic MHC class II loci. Next Generation Sequencing (NGS) of genomic DNA of these four MHC haplotypes revealed stop codons in the DQA3 gene in the ELA-A2, ELA-A5, and ELA-A9 haplotypes. Few NGS reads were obtained for the other MHC class II genes that were not amplified in these horses. The amino acid sequences across haplotypes contained locus-specific residues, and the locus clusters produced by phylogenetic analysis were well supported. The MHC class II alleles within the five tested haplotypes were largely non-overlapping between haplotypes. The complement of equine MHC class II DQ and DR genes appears to be well conserved between haplotypes, in contrast to the recently described variation in class I gene loci between equine MHC haplotypes. The identification of allelic series of equine MHC class II loci will aid comparative studies of mammalian MHC conservation and evolution and may also help to interpret associations between the equine MHC class II region and diseases of the horse. PMID:27889800
An Accelerated Analytical Process for the Development of STR Profiles for Casework Samples.
Laurin, Nancy; Frégeau, Chantal J
2015-07-01
Significant efforts are being devoted to the development of methods enabling rapid generation of short tandem repeat (STR) profiles in order to reduce turnaround times for the delivery of human identification results from biological evidence. Some of the proposed solutions are still costly and low throughput. This study describes the optimization of an analytical process enabling the generation of complete STR profiles (single-source or mixed profiles) for human identification in approximately 5 h. This accelerated process uses currently available reagents and standard laboratory equipment. It includes a 30-min lysis step, a 27-min DNA extraction using the Promega Maxwell(®) 16 System, DNA quantification in <1 h using the Qiagen Investigator(®) Quantiplex HYres kit, fast amplification (<26 min) of the loci included in AmpFℓSTR(®) Identifiler(®), and analysis of the profiles on the 3500-series Genetic Analyzer. This combination of fast individual steps produces high-quality profiling results and offers a cost-effective alternative approach to rapid DNA analysis. © 2015 American Academy of Forensic Sciences.
Association of HLA haplotype with alopecia areata in Chinese Hans.
Xiao, F-L; Ye, D-Q; Yang, S; Zhou, F-S; Zhou, S-M; Zhu, Y-G; Liang, Y-H; Ren, Y-Q; Zhang, X-J
2006-11-01
Some studies have shown discrepancies in human leucocyte antigen (HLA) associated with alopecia areata (AA) between different ethnic populations. To investigate whether HLA-I, -DQA1 and -DQB1 alleles and the HLA haplotype are associated with AA, and the correlation between the HLA haplotype profile, age of onset and severity of AA in Chinese Hans. The polymerase chain reaction-sequence specific primer (PCR-SSP) method was used to analyse the frequencies of HLA class I, -DQA1 and -DQB1 alleles in 192 patients with AA and 252 controls in Chinese Hans. The linkage disequilibrium was calculated using the 2 x 2 table. The 24 two-locus haplotypes [including A*02-B*18, A*02-B*27, A*02-B*52, A*02-Cw*0704, A*02-DQA1*0104, A*02-DQB1*0604, A*02-DQB1*0606, B*18-Cw*0704, B*18-DQA1*0104, B*18-DQA1*0302, B*18-DQB1*0606, B*27-Cw*0704, B*27-DQA1*0104, B*27-DQA1*0302, B*52-Cw*0704, B*52-DQA1*0104, B*52-DQA1*0302, B52-DQB1*0606, Cw*0704-DQA1*0104, Cw*0704-DQA1*0302, Cw*0704-DQB1*0606, DQA1*0104-DQB1*0604, DQA1*0104-DQB1*0606, DQA1*0302-DQB1*0606 (P<0.05)] were associated with AA, while eight extended haplotypes (A*02-B*18-DQA1*0104, A*02-B*27-DQA1*0104, A*02-B*52-DQA1*0104, A*02-B*52-DQA1*0302, A*02-B*52-DQB1*0606, B*52-Cw*0704-DQA1*0104, B*52-Cw*0704-DQA1*0302, A*02-B*52-DQA1*0302-DQB1*0606) were found to be related to AA in Chinese Hans. Through stratified analysis, we found that the extended haplotype B*52-Cw*0704-DQA1*0302 was related to early onset of AA, and no haplotype was only associated with severe AA. This is the first detailed report to elucidate HLA haplotypes associated with AA and that demonstrates the significant HLA haplotypes in Chinese Hans AA. The haplotype B*52-Cw*0704-DQA1*0302 was identified to be related to early onset of AA. Our results provide some information for future research on predisposing genes in HLA regions in Chinese Hans.
Dimensional Anxiety Mediates Linkage of GABRA2 Haplotypes With Alcoholism
Enoch, Mary-Anne; Schwartz, Lori; Albaugh, Bernard; Virkkunen, Matti; Goldman, David
2015-01-01
The GABAAα2 receptor gene (GABRA2) modulates anxiety and stress response. Three recent association studies implicate GABRA2 in alcoholism, however in these papers both common, opposite-configuration haplotypes in the region distal to intron3 predict risk. We have now replicated the GABRA2 association with alcoholism in 331 Plains Indian men and women and 461 Finnish Caucasian men. Using a dimensional measure of anxiety, harm avoidance (HA), we also found that the association with alcoholism is mediated, or moderated, by anxiety. Nine SNPs were genotyped revealing two haplotype blocks. Within the previously implicated block 2 region, we identified the two common, opposite-configuration risk haplotypes, A and B. Their frequencies differed markedly in Finns and Plains Indians. In both populations, most block 2 SNPs were significantly associated with alcoholism. The associations were due to increased frequencies of both homozygotes in alcoholics, indicating the possibility of alcoholic subtypes with opposite genotypes. Congruently, there was no significant haplotype association. Using HA as an indicator variable for anxiety, we found haplotype linkage to alcoholism with high and low dimensional anxiety, and to HA itself, in both populations. High HA alcoholics had the highest frequency of the more abundant haplotype (A in Finns, B in Plains Indians); low HA alcoholics had the highest frequency of the less abundant haplotype (B in Finns, A in Plains Indians) (Finns: P α0.007, OR α2.1, Plains Indians: P α0.040, OR α1.9). Non-alcoholics had intermediate frequencies. Our results suggest that within the distal GABRA2 region is a functional locus or loci that may differ between populations but that alters risk for alcoholism via the mediating action of anxiety. PMID:16874763
HLA-G Haplotypes Are Differentially Associated with Asthmatic Features.
Ribeyre, Camille; Carlini, Federico; René, Céline; Jordier, François; Picard, Christophe; Chiaroni, Jacques; Abi-Rached, Laurent; Gouret, Philippe; Marin, Grégory; Molinari, Nicolas; Chanez, Pascal; Paganini, Julien; Gras, Delphine; Di Cristofaro, Julie
2018-01-01
Human leukocyte antigen (HLA)-G, a HLA class Ib molecule, interacts with receptors on lymphocytes such as T cells, B cells, and natural killer cells to influence immune responses. Unlike classical HLA molecules, HLA-G expression is not found on all somatic cells, but restricted to tissue sites, including human bronchial epithelium cells (HBEC). Individual variation in HLA-G expression is linked to its genetic polymorphism and has been associated with many pathological situations such as asthma, which is characterized by epithelium abnormalities and inflammatory cell activation. Studies reported both higher and equivalent soluble HLA-G (sHLA-G) expression in different cohorts of asthmatic patients. In particular, we recently described impaired local expression of HLA-G and abnormal profiles for alternatively spliced isoforms in HBEC from asthmatic patients. sHLA-G dosage is challenging because of its many levels of polymorphism (dimerization, association with β2-microglobulin, and alternative splicing), thus many clinical studies focused on HLA-G single-nucleotide polymorphisms as predictive biomarkers, but few analyzed HLA-G haplotypes. Here, we aimed to characterize HLA-G haplotypes and describe their association with asthmatic clinical features and sHLA-G peripheral expression and to describe variations in transcription factor (TF) binding sites and alternative splicing sites. HLA - G haplotypes were differentially distributed in 330 healthy and 580 asthmatic individuals. Furthermore, HLA-G haplotypes were associated with asthmatic clinical features showed. However, we did not confirm an association between sHLA-G and genetic, biological, or clinical parameters. HLA-G haplotypes were phylogenetically split into distinct groups, with each group displaying particular variations in TF binding or RNA splicing sites that could reflect differential HLA-G qualitative or quantitative expression, with tissue-dependent specificities. Our results, based on a multicenter
HLA-G Haplotypes Are Differentially Associated with Asthmatic Features
Ribeyre, Camille; Carlini, Federico; René, Céline; Jordier, François; Picard, Christophe; Chiaroni, Jacques; Abi-Rached, Laurent; Gouret, Philippe; Marin, Grégory; Molinari, Nicolas; Chanez, Pascal; Paganini, Julien; Gras, Delphine; Di Cristofaro, Julie
2018-01-01
Human leukocyte antigen (HLA)-G, a HLA class Ib molecule, interacts with receptors on lymphocytes such as T cells, B cells, and natural killer cells to influence immune responses. Unlike classical HLA molecules, HLA-G expression is not found on all somatic cells, but restricted to tissue sites, including human bronchial epithelium cells (HBEC). Individual variation in HLA-G expression is linked to its genetic polymorphism and has been associated with many pathological situations such as asthma, which is characterized by epithelium abnormalities and inflammatory cell activation. Studies reported both higher and equivalent soluble HLA-G (sHLA-G) expression in different cohorts of asthmatic patients. In particular, we recently described impaired local expression of HLA-G and abnormal profiles for alternatively spliced isoforms in HBEC from asthmatic patients. sHLA-G dosage is challenging because of its many levels of polymorphism (dimerization, association with β2-microglobulin, and alternative splicing), thus many clinical studies focused on HLA-G single-nucleotide polymorphisms as predictive biomarkers, but few analyzed HLA-G haplotypes. Here, we aimed to characterize HLA-G haplotypes and describe their association with asthmatic clinical features and sHLA-G peripheral expression and to describe variations in transcription factor (TF) binding sites and alternative splicing sites. HLA-G haplotypes were differentially distributed in 330 healthy and 580 asthmatic individuals. Furthermore, HLA-G haplotypes were associated with asthmatic clinical features showed. However, we did not confirm an association between sHLA-G and genetic, biological, or clinical parameters. HLA-G haplotypes were phylogenetically split into distinct groups, with each group displaying particular variations in TF binding or RNA splicing sites that could reflect differential HLA-G qualitative or quantitative expression, with tissue-dependent specificities. Our results, based on a multicenter
Population-specific FST values for forensic STR markers: A worldwide survey.
Buckleton, John; Curran, James; Goudet, Jérôme; Taylor, Duncan; Thiery, Alexandre; Weir, B S
2016-07-01
The interpretation of matching between DNA profiles of a person of interest and an item of evidence is undertaken using population genetic models to predict the probability of matching by chance. Calculation of matching probabilities is straightforward if allelic probabilities are known, or can be estimated, in the relevant population. It is more often the case, however, that the relevant population has not been sampled and allele frequencies are available only from a broader collection of populations as might be represented in a national or regional database. Variation of allele probabilities among the relevant populations is quantified by the population structure quantity FST and this quantity affects matching proportions. Matching within a population can be interpreted only with respect to matching between populations and we show here that FST, can be estimated from sample allelic matching proportions within and between populations. We report such estimates from data we extracted from 250 papers in the forensic literature, representing STR profiles at up to 24 loci from nearly 500,000 people in 446 different populations. The results suggest that theta values in current forensic use do not have the buffer of conservatism often thought. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
β3 Integrin Haplotype Influences Gene Regulation and Plasma von Willebrand Factor Activity
Payne, Katie E; Bray, Paul F; Grant, Peter J; Carter, Angela M
2008-01-01
The Leu33Pro polymorphism of the gene encoding β3 integrin (ITGB3) is associated with acute coronary syndromes and influences platelet aggregation. Three common promoter polymorphisms have also been identified. The aims of this study were to (1) investigate the influence of the ITGB3 −400C/A, −425A/C and −468G/A promoter polymorphisms on reporter gene expression and nuclear protein binding and (2) determine genotype and haplotype associations with platelet αIIbβ3 receptor density. Promoter haplotypes were introduced into an ITGB3 promoter-pGL3 construct by site directed mutagenesis and luciferase reporter gene expression analysed in HEL and HMEC-1 cells. Binding of nuclear proteins was assessed by electrophoretic mobility shift assay. The association of ITGB3 haplotype with platelet αIIbβ3 receptor density was determined in 223 subjects. Species conserved motifs were identified in the ITGB3 promoter in the vicinity of the 3 polymorphisms. The GAA, GCC, AAC, AAA and ACC constructs induced ~50% increased luciferase expression relative to the GAC construct in both cell types. Haplotype analysis including Leu33Pro indicated 5 common haplotypes; no associations between ITGB3 haplotypes and receptor density were found. However, the GCC-Pro33 haplotype was associated with significantly higher vWF activity (128.6 [112.1–145.1]%) compared with all other haplotypes (107.1 [101.2–113.0]%, p=0.02). In conclusion, the GCC-Pro33 haplotype was associated with increased vWF activity but not with platelet αIIbβ3 receptor density, which may indicate ITGB3 haplotype influences endothelial function. PMID:18045606
Islam, Kazi T; Bond, Jason P; Fakhoury, Ahmad M
2017-08-01
The soil-borne fungus Fusarium virguliforme causes sudden death syndrome (SDS), one of the most devastating diseases of soybean in North and South America. Despite the importance of SDS, a clear understanding of the fungal pathogenicity factors that affect the development of this disease is still lacking. We have identified FvSTR1, a F. virguliforme gene, which encodes a protein similar to a family of striatin proteins previously reported to regulate signalling pathways, cell differentiation, conidiation, sexual development, and virulence in filamentous fungi. Striatins are multi-domain proteins that serve as scaffolding units in the striatin-interacting phosphatase and kinase (STRIPAK) complex in fungi and animals. To address the function of a striatin homologue in F. virguliforme, FvSTR1 was disrupted and functionally characterized using a gene knock out strategy. The resulting Fvstr1 mutants were largely impaired in conidiation and pigmentation, and displayed defective conidia and conidiophore morphology compared to the wild-type and ectopic transformants. Greenhouse virulence assays revealed that the disruption of FvSTR1 resulted in complete loss of virulence in F. virguliforme. Microtome studies using fluorescence microscopy showed that the Fvstr1 mutants were defective in their ability to colonize the vascular system. The Fvstr1 mutants also showed a reduced transcript level of genes involved in asexual reproduction and in the production of secondary metabolites. These results suggest that FvSTR1 has a critical role in asexual development and virulence in F. virguliforme.
MOHANTY, APARAJITA; MARTÍN, JUAN PEDRO; GONZÁLEZ, LUIS MIGUEL; AGUINAGALDE, ITZIAR
2003-01-01
Chloroplast DNA (cpDNA) and mitochondrial DNA (mtDNA) were studied in 24 populations of Prunus spinosa sampled across Europe. The cpDNA and mtDNA fragments were amplified using universal primers and subsequently digested with restriction enzymes to obtain the polymorphisms. Combinations of all the polymorphisms resulted in 33 cpDNA haplotypes and two mtDNA haplotypes. Strict association between the cpDNA haplotypes and the mtDNA haplotypes was detected in most cases, indicating conjoint inheritance of the two genomes. The most frequent and abundant cpDNA haplotype (C20; frequency, 51 %) is always associated with the more frequent and abundant mtDNA haplotype (M1; frequency, 84 %). All but two of the cpDNA haplotypes associated with the less frequent mtDNA haplotype (M2) are private haplotypes. These private haplotypes are phylogenetically related but geographically unrelated. They form a separate cluster on the minimum‐length spanning tree. PMID:14534199
β-globin gene cluster haplotypes in ethnic minority populations of southwest China
Sun, Hao; Liu, Hongxian; Huang, Kai; Lin, Keqin; Huang, Xiaoqin; Chu, Jiayou; Ma, Shaohui; Yang, Zhaoqing
2017-01-01
The genetic diversity and relationships among ethnic minority populations of southwest China were investigated using seven polymorphic restriction enzyme sites in the β-globin gene cluster. The haplotypes of 1392 chromosomes from ten ethnic populations living in southwest China were determined. Linkage equilibrium and recombination hotspot were found between the 5′ sites and 3′ sites of the β-globin gene cluster. 5′ haplotypes 2 (+−−−), 6 (−++−+), 9 (−++++) and 3′ haplotype FW3 (−+) were the predominant haplotypes. Notably, haplotype 9 frequency was significantly high in the southwest populations, indicating their difference with other Chinese. The interpopulation differentiation of southwest Chinese minority populations is less than those in populations of northern China and other continents. Phylogenetic analysis shows that populations sharing same ethnic origin or language clustered to each other, indicating current β-globin cluster diversity in the Chinese populations reflects their ethnic origin and linguistic affiliations to a great extent. This study characterizes β-globin gene cluster haplotypes in southwest Chinese minorities for the first time, and reveals the genetic variability and affinity of these populations using β-globin cluster haplotype frequencies. The results suggest that ethnic origin plays an important role in shaping variations of the β-globin gene cluster in the southwestern ethnic populations of China. PMID:28205625
Ancient mitochondrial haplotypes and evidence for intragenic recombination in a gynodioecious plant.
Städler, Thomas; Delph, Lynda F
2002-09-03
Because of their extremely low nucleotide mutation rates, plant mitochondrial genes are generally not expected to show variation within species. Remarkably, we found nine distinct cytochrome b sequence haplotypes in the gynodioecious alpine plant Silene acaulis, with two or more haplotypes coexisting locally in each of three sampled regions. Moreover, there is evidence for intragenic recombination in the history of the haplotype sample, implying at least transient heteroplasmy of mitochondrial DNA (mtDNA). Heteroplasmy might be achieved by one of two potential mechanisms, either continuous coexistence of subgenomic fragments in low stoichiometry, or occasional paternal leakage of mtDNA. On the basis of levels of synonymous nucleotide substitutions, the average divergence time between haplotypes is estimated to be at least 15 million years. Ancient coalescence of extant haplotypes is further indicated by the paucity of fixed differences in haplotypes obtained from related species, a pattern expected under trans-specific evolution. Our data are consistent with models of frequency-dependent selection on linked cytoplasmic male-sterility factors, the putative molecular basis of females in gynodioecious populations. However, associations between marker loci and the inferred male-sterility genes can be maintained only with very low rates of recombination. Heteroplasmy and recombination between divergent haplotypes imply unexplored consequences for the evolutionary dynamics of gynodioecy, a widespread plant breeding system.
Identification and genetic effect of haplotype in the bovine BMP7 gene.
Huang, Yong-Zhen; Wang, Xin-Lei; He, Hua; Lan, Xian-Yong; Lei, Chu-Zhao; Zhang, Chun-Lei; Chen, Hong
2013-12-15
Bone morphogenetic proteins (BMPs) are peptide growth factors belonging to the transforming growth factor-beta (TGF-β) superfamily, and some members of the BMP family support white adipocyte differentiation. In this study, we focused on the BMP7 which singularly promotes the differentiation of brown preadipocytes. Haplotypes involving 5 single nucleotide polymorphism (SNP) sites in the bovine BMP7 gene were identified and their effect on body weight was analyzed. 16 haplotypes and 18 combined haplotypes were revealed and the linkage disequilibrium was assessed in the cattle population with 602 individuals representing three main cattle breeds from China. The results showed that haplotypes 3, 10 and 14 were predominant and accounted for 75.64%, 69.85%, and 83.36% in Nanyang, Qinchuan and Jiaxian cattle breeds, respectively. The statistical analyses indicated that the SNP 1, 4, and 5 are associated with the body weight, body length, and heart girth at 12 and 24 months in Nanyang cattle population (P<0.05), whereas there is no significant association between their 16 haplotypes and 18 combined haplotypes. Our results provide evidence that some SNPs and haplotypes in BMP7 are associated with growth traits, and may be utilized as a genetic marker in marker-assisted selection for beef cattle breeding programs. Copyright © 2013. Published by Elsevier B.V.
Lee, So-Yeon; Ha, Eun-Ju; Woo, Seung-Kyun; Lee, So-Min; Lim, Kyung-Hee; Eom, Yong-Bin
2017-07-01
Telogen hairs presented in the crime scene are commonly encountered as trace evidence. However, short tandem repeat (STR) profiling of the hairs currently have low and limited use due to poor success rate. To increase the success rate of STR profiling of telogen hairs, we developed a rapid and cost-effective method to estimate the number of nuclei in the hair roots. Five cationic dyes, Methyl green (MG), Harris hematoxylin (HH), Methylene blue (MB), Toluidine blue (TB), and Safranin O (SO) were evaluated in this study. We conducted a screening test based on microscopy and the percentage of loss with nuclear DNA, in order to select the best dye. MG was selected based on its specific nuclei staining and low adverse effect on the hair-associated nuclear DNA. We examined 330 scalp and 100 pubic telogen hairs with MG. Stained hairs were classified into five groups and analyzed by STR. The fast staining method revealed 70% (head hair) and 33.4% (pubic hair) of full (30 alleles) and high partial (18-29 alleles) STR profiling proportion from the lowest nuclei count group (one to ten nuclei). The results of this study demonstrated a rapid, specific, nondestructive, and high yield DNA profiling method applicable for screening telogen hairs. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Geographic distribution of haplotype diversity at the bovine casein locus
Jann, Oliver C; Ibeagha-Awemu, Eveline M; Özbeyaz, Ceyhan; Zaragoza, Pilar; Williams, John L; Ajmone-Marsan, Paolo; Lenstra, Johannes A; Moazami-Goudarzi, Katy; Erhardt, Georg
2004-01-01
The genetic diversity of the casein locus in cattle was studied on the basis of haplotype analysis. Consideration of recently described genetic variants of the casein genes which to date have not been the subject of diversity studies, allowed the identification of new haplotypes. Genotyping of 30 cattle breeds from four continents revealed a geographically associated distribution of haplotypes, mainly defined by frequencies of alleles at CSN1S1 and CSN3. The genetic diversity within taurine breeds in Europe was found to decrease significantly from the south to the north and from the east to the west. Such geographic patterns of cattle genetic variation at the casein locus may be a result of the domestication process of modern cattle as well as geographically differentiated natural or artificial selection. The comparison of African Bos taurus and Bos indicus breeds allowed the identification of several Bos indicus specific haplotypes (CSN1S1*C-CSN2*A2-CSN3*AI/CSN3*H) that are not found in pure taurine breeds. The occurrence of such haplotypes in southern European breeds also suggests that an introgression of indicine genes into taurine breeds could have contributed to the distribution of the genetic variation observed. PMID:15040901
Discovery, evaluation and distribution of haplotypes of the wheat Ppd-D1 gene.
Guo, Zhiai; Song, Yanxia; Zhou, Ronghua; Ren, Zhenglong; Jia, Jizeng
2010-02-01
Ppd-D1 is one of the most potent genes affecting the photoperiod response of wheat (Triticum aestivum). Only two alleles, insensitive Ppd-D1a and sensitive Ppd-D1b, were known previously, and these did not adequately explain the broad adaptation of wheat to photoperiod variation. In this study, five diagnostic molecular markers were employed to identify Ppd-D1 haplotypes in 492 wheat varieties from diverse geographic locations and 55 accessions of Aegilops tauschii, the D genome donor species of wheat. Six Ppd-D1 haplotypes, designated I-VI, were identified. Types II, V and VI were considered to be more ancient and types I, III and IV were considered to be derived from type II. The transcript abundances of the Ppd-D1 haplotypes showed continuous variation, being highest for haplotype I, lowest for haplotype III, and correlating negatively with varietal differences in heading time. These haplotypes also significantly affected other agronomic traits. The distribution frequency of Ppd-D1 haplotypes showed partial correlations with both latitudes and altitudes of wheat cultivation regions. The evolution, expression and distribution of Ppd-D1 haplotypes were consistent evidentially with each other. What was regarded as a pair of alleles in the past can now be considered a series of alleles leading to continuous variation.
Haplotype estimation using sequencing reads.
Delaneau, Olivier; Howie, Bryan; Cox, Anthony J; Zagury, Jean-François; Marchini, Jonathan
2013-10-03
High-throughput sequencing technologies produce short sequence reads that can contain phase information if they span two or more heterozygote genotypes. This information is not routinely used by current methods that infer haplotypes from genotype data. We have extended the SHAPEIT2 method to use phase-informative sequencing reads to improve phasing accuracy. Our model incorporates the read information in a probabilistic model through base quality scores within each read. The method is primarily designed for high-coverage sequence data or data sets that already have genotypes called. One important application is phasing of single samples sequenced at high coverage for use in medical sequencing and studies of rare diseases. Our method can also use existing panels of reference haplotypes. We tested the method by using a mother-father-child trio sequenced at high-coverage by Illumina together with the low-coverage sequence data from the 1000 Genomes Project (1000GP). We found that use of phase-informative reads increases the mean distance between switch errors by 22% from 274.4 kb to 328.6 kb. We also used male chromosome X haplotypes from the 1000GP samples to simulate sequencing reads with varying insert size, read length, and base error rate. When using short 100 bp paired-end reads, we found that using mixtures of insert sizes produced the best results. When using longer reads with high error rates (5-20 kb read with 4%-15% error per base), phasing performance was substantially improved. Copyright © 2013 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
He, GuangLin; Li, Ye; Zou, Xing; Li, Ping; Chen, PengYu; Song, Feng; Gao, Tianzhen; Liao, Miao; Yan, Jing; Wu, Jin
2017-09-01
The demographic characteristics and genetic polymorphism data of 56 Chinese nationalities or 31 administrative divisions in Chinese mainland have repeatedly been the genetic research hotspots. While most genetic studies focused on some particular Chinese populations based on autosomal or Y-chromosomal genetic markers, the forensic characteristics and phylogenetic analyses of the seventh largest Chinese population (Yi ethnicity) on the X-chromosomal genetic markers are scarce. Here, allele frequencies and forensic statistical parameters for 19 X-chromosomal short tandem repeat loci (DXS7424-DXS101, DXS6789-DXS6809, DXS7423-DXS10134, DXS10103-HPRTB-DXS10101, DXS10159-DXS10162-DXS10164, DXS10148-DXS10135-DXS8378, and DXS7132-DXS10079-DXS10074-DXS10075) of 331 Chinese Yi individuals were obtained. All 19 X-chromosomal short tandem repeat (STR) loci in females were consistent with the Hardy-Weinberg equilibrium test. A total of 214 alleles were identified with the corresponding allele frequencies spanned from 0.0019 to 0.6106. The combined PE, PDF, and PDM were 0.9999999214, 0.9999999999999999999993, and 0.9999999999998, respectively. The high combined MEC Krüger , MEC Kishida , MEC Desmarais , and MEC Desmarais Duo were achieved as 0.9999999617638, 0.9999999999971, 0.9999999999971, and 0.9999999931538, respectively. The findings suggested that the panel of 19 X-STR loci is highly polymorphic and informative in the Yi ethnic population and can be considered to be a powerful tool in forensic complex kinship identification. Population differentiation analyses among 12 populations indicated that significant differences in genetic structure were observed in between the Yi ethnicity and the Chinese Uyghur as well as Kazakh, and genetic homogeneity existed in similar ethno-origin or geographic origin populations.
Congruence as a measurement of extended haplotype structure across the genome
2012-01-01
Background Historically, extended haplotypes have been defined using only a few data points, such as alleles for several HLA genes in the MHC. High-density SNP data, and the increasing affordability of whole genome SNP typing, creates the opportunity to define higher resolution extended haplotypes. This drives the need for new tools that support quantification and visualization of extended haplotypes as defined by as many as 2000 SNPs. Confronted with high-density SNP data across the major histocompatibility complex (MHC) for 2,300 complete families, compiled by the Type 1 Diabetes Genetics Consortium (T1DGC), we developed software for studying extended haplotypes. Methods The software, called ExHap (Extended Haplotype), uses a similarity measurement we term congruence to identify and quantify long-range allele identity. Using ExHap, we analyzed congruence in both the T1DGC data and family-phased data from the International HapMap Project. Results Congruent chromosomes from the T1DGC data have between 96.5% and 99.9% allele identity over 1,818 SNPs spanning 2.64 megabases of the MHC (HLA-DRB1 to HLA-A). Thirty-three of 132 DQ-DR-B-A defined haplotype groups have > 50% congruent chromosomes in this region. For example, 92% of chromosomes within the DR3-B8-A1 haplotype are congruent from HLA-DRB1 to HLA-A (99.8% allele identity). We also applied ExHap to all 22 autosomes for both CEU and YRI cohorts from the International HapMap Project, identifying multiple candidate extended haplotypes. Conclusions Long-range congruence is not unique to the MHC region. Patterns of allele identity on phased chromosomes provide a simple, straightforward approach to visually and quantitatively inspect complex long-range structural patterns in the genome. Such patterns aid the biologist in appreciating genetic similarities and differences across cohorts, and can lead to hypothesis generation for subsequent studies. PMID:22369243
Olofsson, Jill Katharina; Pereira, Vania; Børsting, Claus; Morling, Niels
2015-01-01
The human population in Greenland is characterized by migration events of Paleo- and Neo-Eskimos, as well as admixture with Europeans. In this study, the Y-chromosomal variation in male Greenlanders was investigated in detail by typing 73 Y-chromosomal single nucleotide polymorphisms (Y-SNPs) and 17 Y-chromosomal short tandem repeats (Y-STRs). Approximately 40% of the analyzed Greenlandic Y chromosomes were of European origin (I-M170, R1a-M513 and R1b-M343). Y chromosomes of European origin were mainly found in individuals from the west and south coasts of Greenland, which is in agreement with the historic records of the geographic placements of European settlements in Greenland. Two Inuit Y-chromosomal lineages, Q-M3 (xM19, M194, L663, SA01 and L766) and Q-NWT01 (xM265) were found in 23% and 31% of the male Greenlanders, respectively. The time to the most recent common ancestor (TMRCA) of the Q-M3 lineage of the Greenlanders was estimated to be between 4,400 and 10,900 years ago (y. a.) using two different methods. This is in agreement with the theory that the North Circumpolar Region was populated via a second expansion of humans in the North American continent. The TMRCA of the Q-NWT01 (xM265) lineage in Greenland was estimated to be between 7,000 and 14,300 y. a. using two different methods, which is older than the previously reported TMRCA of this lineage in other Inuit populations. Our results indicate that Inuit individuals carrying the Q-NWT01 (xM265) lineage may have their origin in the northeastern parts of North America and could be descendants of the Dorset culture. This in turn points to the possibility that the current Inuit population in Greenland is comprised of individuals of both Thule and Dorset descent.
Xu, Meixiang; Cross, Courtney E; Speidel, Jordan T; Abdel-Rahman, Sherif Z
2016-10-01
The O 6 -methylguanine-DNA methyltransferase (MGMT) protein removes O 6 -alkyl-guanine adducts from DNA. MGMT expression can thus alter the sensitivity of cells and tissues to environmental and chemotherapeutic alkylating agents. Previously, we defined the haplotype structure encompassing single nucleotide polymorphisms (SNPs) in the MGMT promoter/enhancer (P/E) region and found that haplotypes, rather than individual SNPs, alter MGMT promoter activity. The exact mechanism(s) by which these haplotypes exert their effect on MGMT promoter activity is currently unknown, but we noted that many of the SNPs comprising the MGMT P/E haplotypes are located within or in close proximity to putative transcription factor binding sites. Thus, these haplotypes could potentially affect transcription factor binding and, subsequently, alter MGMT promoter activity. In this study, we test the hypothesis that MGMT P/E haplotypes affect MGMT promoter activity by altering transcription factor (TF) binding to the P/E region. We used a promoter binding TF profiling array and a reporter assay to evaluate the effect of different P/E haplotypes on TF binding and MGMT expression, respectively. Our data revealed a significant difference in TF binding profiles between the different haplotypes evaluated. We identified TFs that consistently showed significant haplotype-dependent binding alterations (p ≤ 0.01) and revealed their role in regulating MGMT expression using siRNAs and a dual-luciferase reporter assay system. The data generated support our hypothesis that promoter haplotypes alter the binding of TFs to the MGMT P/E and, subsequently, affect their regulatory function on MGMT promoter activity and expression level.
Cox, Jordan O; DeCarmen, Teresa Sikes; Ouyang, Yiwen; Strachan, Briony; Sloane, Hillary; Connon, Cathey; Gibson, Kemper; Jackson, Kimberly; Landers, James P; Cruz, Tracey Dawson
2016-12-01
This work describes the development of a novel microdevice for forensic DNA processing of reference swabs. This microdevice incorporates an enzyme-based assay for DNA preparation, which allows for faster processing times and reduced sample handling. Infrared-mediated PCR (IR-PCR) is used for STR amplification using a custom reaction mixture, allowing for amplification of STR loci in 45 min while circumventing the limitations of traditional block thermocyclers. Uniquely positioned valves coupled with a simple rotational platform are used to exert fluidic control, eliminating the need for bulky external equipment. All microdevices were fabricated using laser ablation and thermal bonding of PMMA layers. Using this microdevice, the enzyme-mediated DNA liberation module produced DNA yields similar to or higher than those produced using the traditional (tube-based) protocol. Initial microdevice IR-PCR experiments to test the amplification module and reaction (using Phusion Flash/SpeedSTAR) generated near-full profiles that suffered from interlocus peak imbalance and poor adenylation (significant -A). However, subsequent attempts using KAPA 2G and Pfu Ultra polymerases generated full STR profiles with improved interlocus balance and the expected adenylated product. A fully integrated run designed to test microfluidic control successfully generated CE-ready STR amplicons in less than 2 h (<1 h of hands-on time). Using this approach, high-quality STR profiles were developed that were consistent with those produced using conventional DNA purification and STR amplification methods. This method is a smaller, more elegant solution than current microdevice methods and offers a cheaper, hands-free, closed-system alternative to traditional forensic methods. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Martiniano, Rui; McLaughlin, Russell; Silva, Nuno M.; Manco, Licinio; Pereira, Tania; Coelho, Maria J.; Serra, Miguel; Burger, Joachim; Parreira, Rui; Moran, Elena; Valera, Antonio C.; Silva, Ana M.
2017-01-01
We analyse new genomic data (0.05–2.95x) from 14 ancient individuals from Portugal distributed from the Middle Neolithic (4200–3500 BC) to the Middle Bronze Age (1740–1430 BC) and impute genomewide diploid genotypes in these together with published ancient Eurasians. While discontinuity is evident in the transition to agriculture across the region, sensitive haplotype-based analyses suggest a significant degree of local hunter-gatherer contribution to later Iberian Neolithic populations. A more subtle genetic influx is also apparent in the Bronze Age, detectable from analyses including haplotype sharing with both ancient and modern genomes, D-statistics and Y-chromosome lineages. However, the limited nature of this introgression contrasts with the major Steppe migration turnovers within third Millennium northern Europe and echoes the survival of non-Indo-European language in Iberia. Changes in genomic estimates of individual height across Europe are also associated with these major cultural transitions, and ancestral components continue to correlate with modern differences in stature. PMID:28749934
Kyselková, Martina; Chrudimský, Tomáš; Husník, Filip; Chroňáková, Alica; Heuer, Holger; Smalla, Kornelia; Elhottová, Dana
2016-06-01
Manure from dairy farms has been shown to contain diverse tetracycline resistance genes that are transferable to soil. Here, we focus on conjugative plasmids that may spread tetracycline resistance at a conventional dairy farm. We performed exogenous plasmid isolation from cattle feces using chlortetracycline for transconjugant selection. The transconjugants obtained harbored LowGC-type plasmids and tet(Y). A representative plasmid (pFK2-7) was fully sequenced and this was compared with previously described LowGC plasmids from piggery manure-treated soil and a GenBank record from Acinetobacter nosocomialis that we also identified as a LowGC plasmid. The pFK2-7 plasmid had the conservative backbone typical of LowGC plasmids, though this region was interrupted with an insert containing the tet(Y)-tet(R) tetracycline resistance genes and the strA-strB streptomycin resistance genes. Despite Acinetobacter populations being considered natural hosts of LowGC plasmids, these plasmids were not found in three Acinetobacter isolates from the study farm. The isolates harbored tet(Y)-tet(R) genes in identical genetic surroundings as pFK2-7, however, suggesting genetic exchange between Acinetobacter and LowGC plasmids. Abundance of LowGC plasmids and tet(Y) was correlated in manure and soil samples from the farm, indicating that LowGC plasmids may be involved in the spread of tet(Y) in the environment. © FEMS 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
A TNF region haplotype offers protection from typhoid fever in Vietnamese patients
2009-01-01
The genomic region surrounding the TNF locus on human chromosome 6 has previously been associated with typhoid fever in Vietnam. We used a haplotypic approach to understand this association further. Eighty single nucleotide polymorphisms (SNPs) spanning a 150 kb region were genotyped in 95 Vietnamese individuals (typhoid case/mother/father trios). A subset of data from 33 SNPs with a minor allele frequency of >4.3% was used to construct haplotypes. Fifteen SNPs, which tagged the 42 constructed haplotypes were selected. The haplotype tagging SNPs (T1-T15) were genotyped in 380 confirmed typhoid cases and 380 Vietnamese ethnically matched controls. Allelic frequencies of seven SNPs (T1, T2, T3, T5, T6, T7, T8) were significantly different between typhoid cases and controls. Logistic regression results support the hypothesis that there is just one signal associated with disease at this locus. Haplotype-based analysis of the tag SNPs provided positive evidence of association with typhoid (posterior probability 0.821). The analysis highlighted a low-risk cluster of haplotypes that each carry the minor allele of T1 or T7, but not both, and otherwise carry the combination of alleles *12122*1111 at T1-T11, further supporting the one associated signal hypothesis. Finally, individuals that carry the typhoid fever protective haplotype *12122*1111 also produce a relatively low TNF-α response to LPS. PMID:17503085
Louzoun, Yoram; Alter, Idan; Gragert, Loren; Albrecht, Mark; Maiers, Martin
2018-05-01
Regardless of sampling depth, accurate genotype imputation is limited in regions of high polymorphism which often have a heavy-tailed haplotype frequency distribution. Many rare haplotypes are thus unobserved. Statistical methods to improve imputation by extending reference haplotype distributions using linkage disequilibrium patterns that relate allele and haplotype frequencies have not yet been explored. In the field of unrelated stem cell transplantation, imputation of highly polymorphic human leukocyte antigen (HLA) genes has an important application in identifying the best-matched stem cell donor when searching large registries totaling over 28,000,000 donors worldwide. Despite these large registry sizes, a significant proportion of searched patients present novel HLA haplotypes. Supporting this observation, HLA population genetic models have indicated that many extant HLA haplotypes remain unobserved. The absent haplotypes are a significant cause of error in haplotype matching. We have applied a Bayesian inference methodology for extending haplotype frequency distributions, using a model where new haplotypes are created by recombination of observed alleles. Applications of this joint probability model offer significant improvement in frequency distribution estimates over the best existing alternative methods, as we illustrate using five-locus HLA frequency data from the National Marrow Donor Program registry. Transplant matching algorithms and disease association studies involving phasing and imputation of rare variants may benefit from this statistical inference framework.
HLA-A*02 allele frequencies and haplotypic associations in Koreans.
Park, M H; Whang, D H; Kang, S J; Han, K S
2000-03-01
We have investigated the frequencies of HLA-A*02 alleles and their haplotypic associations with HLA-B and -DRB1 loci in 439 healthy unrelated Koreans, including 214 parents from 107 families. All of the 227 samples (51.7%) typed as A2 by serology were analyzed for A*02 alleles using polymerase chain reaction (PCR)-low ionic strength-single-strand conformation polymorphism (LIS-SSCP) method. A total of six different A*02 alleles were detected (A*02 allele frequency 29.6%): A*0201/9 (16.6%), *0203 (0.5%), *0206 (9.3%), *0207 (3.0%), and one each case of *0210 and *02 undetermined type. Two characteristic haplotypes showing the strongest linkage disequilibrium were A*0203-B38-DRB]*1502 and A*0207-B46-DRB1*0803. Besides these strong associations, significant two-locus associations (P<0.001) were observed for A*0201 with B61, DRB1*0901 and DRB1*1401, and for A*0206 with B48 and B61. HLA haplotypes carrying HLA-A2 showed a variable distribution of A*02 alleles, and all of the eight most common A2-B-DR haplotypes occurring at frequencies of > or =1% were variably associated with two different A*02 alleles. These results demonstrate that substantial heterogeneity is present in the distribution of HLA-A*02 alleles and related haplotypes in Koreans.
A Genome-Wide Scan for Breast Cancer Risk Haplotypes among African American Women
Song, Chi; Chen, Gary K.; Millikan, Robert C.; Ambrosone, Christine B.; John, Esther M.; Bernstein, Leslie; Zheng, Wei; Hu, Jennifer J.; Ziegler, Regina G.; Nyante, Sarah; Bandera, Elisa V.; Ingles, Sue A.; Press, Michael F.; Deming, Sandra L.; Rodriguez-Gil, Jorge L.; Chanock, Stephen J.; Wan, Peggy; Sheng, Xin; Pooler, Loreall C.; Van Den Berg, David J.; Le Marchand, Loic; Kolonel, Laurence N.; Henderson, Brian E.; Haiman, Chris A.; Stram, Daniel O.
2013-01-01
Genome-wide association studies (GWAS) simultaneously investigating hundreds of thousands of single nucleotide polymorphisms (SNP) have become a powerful tool in the investigation of new disease susceptibility loci. Haplotypes are sometimes thought to be superior to SNPs and are promising in genetic association analyses. The application of genome-wide haplotype analysis, however, is hindered by the complexity of haplotypes themselves and sophistication in computation. We systematically analyzed the haplotype effects for breast cancer risk among 5,761 African American women (3,016 cases and 2,745 controls) using a sliding window approach on the genome-wide scale. Three regions on chromosomes 1, 4 and 18 exhibited moderate haplotype effects. Furthermore, among 21 breast cancer susceptibility loci previously established in European populations, 10p15 and 14q24 are likely to harbor novel haplotype effects. We also proposed a heuristic of determining the significance level and the effective number of independent tests by the permutation analysis on chromosome 22 data. It suggests that the effective number was approximately half of the total (7,794 out of 15,645), thus the half number could serve as a quick reference to evaluating genome-wide significance if a similar sliding window approach of haplotype analysis is adopted in similar populations using similar genotype density. PMID:23468962
The JAK2 GGCC (46/1) Haplotype in Myeloproliferative Neoplasms: Causal or Random?
Anelli, Luisa; Zagaria, Antonella; Specchia, Giorgina; Albano, Francesco
2018-04-11
The germline JAK2 haplotype known as "GGCC or 46/1 haplotype" (haplotype GGCC_46/1 ) consists of a combination of single nucleotide polymorphisms (SNPs) mapping in a region of about 250 kb, extending from the JAK2 intron 10 to the Insulin-like 4 ( INLS4 ) gene. Four main SNPs (rs3780367, rs10974944, rs12343867, and rs1159782) generating a "GGCC" combination are more frequently indicated to represent the JAK2 haplotype. These SNPs are inherited together and are frequently associated with the onset of myeloproliferative neoplasms (MPN) positive for both JAK2 V617 and exon 12 mutations. The association between the JAK2 haplotype GGCC_46/1 and mutations in other genes, such as thrombopoietin receptor ( MPL ) and calreticulin ( CALR ), or the association with triple negative MPN, is still controversial. This review provides an overview of the frequency and the role of the JAK2 haplotype GGCC_46/1 in the pathogenesis of different myeloid neoplasms and describes the hypothetical mechanisms at the basis of the association with JAK2 gene mutations. Moreover, possible clinical implications are discussed, as different papers reported contrasting data about the correlation between the JAK2 haplotype GGCC_46/1 and blood cell count, survival, or disease progression.
Patterns of linkage disequilibrium and haplotype distribution in disease candidate genes.
Long, Ji-Rong; Zhao, Lan-Juan; Liu, Peng-Yuan; Lu, Yan; Dvornyk, Volodymyr; Shen, Hui; Liu, Yong-Jun; Zhang, Yuan-Yuan; Xiong, Dong-Hai; Xiao, Peng; Deng, Hong-Wen
2004-05-24
The adequacy of association studies for complex diseases depends critically on the existence of linkage disequilibrium (LD) between functional alleles and surrounding SNP markers. We examined the patterns of LD and haplotype distribution in eight candidate genes for osteoporosis and/or obesity using 31 SNPs in 1,873 subjects. These eight genes are apolipoprotein E (APOE), type I collagen alpha1 (COL1A1), estrogen receptor-alpha (ER-alpha), leptin receptor (LEPR), parathyroid hormone (PTH)/PTH-related peptide receptor type 1 (PTHR1), transforming growth factor-beta1 (TGF-beta1), uncoupling protein 3 (UCP3), and vitamin D (1,25-dihydroxyvitamin D3) receptor (VDR). Yin yang haplotypes, two high-frequency haplotypes composed of completely mismatching SNP alleles, were examined. To quantify LD patterns, two common measures of LD, D' and r2, were calculated for the SNPs within the genes. The haplotype distribution varied in the different genes. Yin yang haplotypes were observed only in PTHR1 and UCP3. D' ranged from 0.020 to 1.000 with the average of 0.475, whereas the average r2 was 0.158 (ranging from 0.000 to 0.883). A decay of LD was observed as the intermarker distance increased, however, there was a great difference in LD characteristics of different genes or even in different regions within gene. The differences in haplotype distributions and LD patterns among the genes underscore the importance of characterizing genomic regions of interest prior to association studies.
Mikhailova, S V; Babenko, V N; Ivanoshchuk, D E; Gubina, M A; Maksimov, V N; Solovjova, I G; Voevoda, M I
2016-06-17
Previously, it was shown that the HFE gene (associated with human hereditary hemochromatosis) has several haplotypes of intronic polymorphisms. Some haplotype frequencies are race specific and hence can be used in phylogenetic analysis. We assumed that analysis of Caucasoid patients-living now in Western Siberia and having diseases associated with dietary habits and metabolic rate-will allow us to understand the processes of possible selection during settling of the northern part of Asia. Haplotype analysis of Northern Eurasian native and recently settled ethnic groups was performed on polymorphisms rs1799945, rs1800730, rs1800562, rs2071303, rs1800708, rs1572982, rs2794719, rs807209, and rs2032451 of this gene. The CCA haplotype of the rs2071303, rs1800708, and rs1572982 was found to be associated with HLA-A2 (39 %) in Asian populations. Haplotype analysis for the rs1799945, rs1800730, rs1800562, rs2071303, rs1800708, and rs1572982 was performed on Russian patients with some metabolic disorders or stomach cancer and among long-lived people. Decreased frequencies of the TTA haplotype (T in rs2071303, T in rs1800708, and A in rs1572982) were observed in the groups of patients with diseases associated with overweight (fatty liver disease, type 2 diabetes mellitus, or metabolic syndrome + arterial hypertension) as compared with the control sample. We detected significant differences in this haplotype's frequency between the patients with type 2 diabetes mellitus and Russian adolescents, elderly citizens, and long-lived people (χ(2) P value = 0.003, 0.010, and 0.015, respectively). No significant differences in frequencies of the alleles with mutations in coding regions of the HFE gene (C282Y, H63D, and S65C) were detected between the analyzed patients (with stomach cancer, metabolic syndrome, fatty liver disease, or type 2 diabetes mellitus) and the control Caucasoid sample. Monophyletic origin of H63D (rs1799945) was confirmed in Caucasoids and Northern
Adams, Susan M.; Bosch, Elena; Balaresque, Patricia L.; Ballereau, Stéphane J.; Lee, Andrew C.; Arroyo, Eduardo; López-Parra, Ana M.; Aler, Mercedes; Grifo, Marina S. Gisbert; Brion, Maria; Carracedo, Angel; Lavinha, João; Martínez-Jarreta, Begoña; Quintana-Murci, Lluis; Picornell, Antònia; Ramon, Misericordia; Skorecki, Karl; Behar, Doron M.; Calafell, Francesc; Jobling, Mark A.
2008-01-01
Most studies of European genetic diversity have focused on large-scale variation and interpretations based on events in prehistory, but migrations and invasions in historical times could also have had profound effects on the genetic landscape. The Iberian Peninsula provides a suitable region for examination of the demographic impact of such recent events, because its complex recent history has involved the long-term residence of two very different populations with distinct geographical origins and their own particular cultural and religious characteristics—North African Muslims and Sephardic Jews. To address this issue, we analyzed Y chromosome haplotypes, which provide the necessary phylogeographic resolution, in 1140 males from the Iberian Peninsula and Balearic Islands. Admixture analysis based on binary and Y-STR haplotypes indicates a high mean proportion of ancestry from North African (10.6%) and Sephardic Jewish (19.8%) sources. Despite alternative possible sources for lineages ascribed a Sephardic Jewish origin, these proportions attest to a high level of religious conversion (whether voluntary or enforced), driven by historical episodes of social and religious intolerance, that ultimately led to the integration of descendants. In agreement with the historical record, analysis of haplotype sharing and diversity within specific haplogroups suggests that the Sephardic Jewish component is the more ancient. The geographical distribution of North African ancestry in the peninsula does not reflect the initial colonization and subsequent withdrawal and is likely to result from later enforced population movement—more marked in some regions than in others—plus the effects of genetic drift. PMID:19061982
VNTR alleles associated with the {alpha}-globin locus are haplotype and population related
DOE Office of Scientific and Technical Information (OSTI.GOV)
Martinson, J.J.; Clegg, J.B.; Boyce, A.J.
1994-09-01
The human {alpha}-globin complex contains several polymorphic restriction-enzyme sites (i.e., RFLPs) linked to form haplotypes and is flanked by two hypervariable VNTR loci, the 5{prime} hypervariable region (HVR) and the more highly polymorphic 3{prime}HVR. Using a combination of RFLP analysis and PCR, the authors have characterized the 5{prime}HVR and 3{prime}HVR alleles associated with the {alpha}-globin haplotypes of 133 chromosomes, and they here show that specific {alpha}-globin haplotypes are each associated with discrete subsets of the alleles observed at these two VNTR loci. This statistically highly significant association is observed over a region spanning {approximately} 100 kb. With the exception ofmore » closely related haplotypes, different haplotypes do not share identically sized 3{prime}HVR alleles. Earlier studies have shown that {alpha}-globin haplotype distributions differ between populations; the current findings also reveal extensive population substructure in the repertoire of {alpha}-globin VNTRs. If similar features are characteristic of other VNTR loci, this will have important implications for forensic and anthropological studies. 42 refs., 5 figs., 5 tabs.« less
SAM: String-based sequence search algorithm for mitochondrial DNA database queries
Röck, Alexander; Irwin, Jodi; Dür, Arne; Parsons, Thomas; Parson, Walther
2011-01-01
The analysis of the haploid mitochondrial (mt) genome has numerous applications in forensic and population genetics, as well as in disease studies. Although mtDNA haplotypes are usually determined by sequencing, they are rarely reported as a nucleotide string. Traditionally they are presented in a difference-coded position-based format relative to the corrected version of the first sequenced mtDNA. This convention requires recommendations for standardized sequence alignment that is known to vary between scientific disciplines, even between laboratories. As a consequence, database searches that are vital for the interpretation of mtDNA data can suffer from biased results when query and database haplotypes are annotated differently. In the forensic context that would usually lead to underestimation of the absolute and relative frequencies. To address this issue we introduce SAM, a string-based search algorithm that converts query and database sequences to position-free nucleotide strings and thus eliminates the possibility that identical sequences will be missed in a database query. The mere application of a BLAST algorithm would not be a sufficient remedy as it uses a heuristic approach and does not address properties specific to mtDNA, such as phylogenetically stable but also rapidly evolving insertion and deletion events. The software presented here provides additional flexibility to incorporate phylogenetic data, site-specific mutation rates, and other biologically relevant information that would refine the interpretation of mitochondrial DNA data. The manuscript is accompanied by freeware and example data sets that can be used to evaluate the new software (http://stringvalidation.org). PMID:21056022
A powerful approach reveals numerous expression quantitative trait haplotypes in multiple tissues.
Ying, Dingge; Li, Mulin Jun; Sham, Pak Chung; Li, Miaoxin
2018-04-26
Recently many studies showed single nucleotide polymorphisms (SNPs) affect gene expression and contribute to development of complex traits/diseases in a tissue context-dependent manner. However, little is known about haplotype's influence on gene expression and complex traits, which reflects the interaction effect between SNPs. In the present study, we firstly proposed a regulatory region guided eQTL haplotype association analysis approach, and then systematically investigate the expression quantitative trait loci (eQTL) haplotypes in 20 different tissues by the approach. The approach has a powerful design of reducing computational burden by the utilization of regulatory predictions for candidate SNP selection and multiple testing corrections on non-independent haplotypes. The application results in multiple tissues showed that haplotype-based eQTLs not only increased the number of eQTL genes in a tissue specific manner, but were also enriched in loci that associated with complex traits in a tissue-matched manner. In addition, we found that tag SNPs of eQTL haplotypes from whole blood were selectively enriched in certain combination of regulatory elements (e.g. promoters and enhancers) according to predicted chromatin states. In summary, this eQTL haplotype detection approach, together with the application results, shed insights into synergistic effect of sequence variants on gene expression and their susceptibility to complex diseases. The executable application "eHaplo" is implemented in Java and is publicly available at http://grass.cgs.hku.hk/limx/ehaplo/. jonsonfox@gmail.com, limiaoxin@mail.sysu.edu.cn. Supplementary data are available at Bioinformatics online.
Mapping of HLA- DQ haplotypes in a group of Danish patients with celiac disease.
Lund, Flemming; Hermansen, Mette N; Pedersen, Merete F; Hillig, Thore; Toft-Hansen, Henrik; Sölétormos, György
2015-10-01
A cost-effective identification of HLA- DQ risk haplotypes using the single nucleotide polymorphism (SNP) technique has recently been applied in the diagnosis of celiac disease (CD) in four European populations. The objective of the study was to map risk HLA- DQ haplotypes in a group of Danish CD patients using the SNP technique. Cohort A: Among 65 patients with gastrointestinal symptoms we compared the HLA- DQ2 and HLA- DQ8 risk haplotypes obtained by the SNP technique (method 1) with results based on a sequence specific primer amplification technique (method 2) and a technique used in an assay from BioDiagene (method 3). Cohort B: 128 patients with histologically verified CD were tested for CD risk haplotypes (method 1). Patients with negative results were further tested for sub-haplotypes of HLA- DQ2 (methods 2 and 3). Cohort A: The three applied methods provided the same HLA- DQ2 and HLA- DQ8 results among 61 patients. Four patients were negative for the HLA- DQ2 and HLA- DQ8 haplotypes (method 1) but were positive for the HLA- DQ2.5-trans and HLA- DQ2.2 haplotypes (methods 2 and 3). Cohort B: A total of 120 patients were positive for the HLA- DQ2.5-cis and HLA- DQ8 haplotypes (method 1). The remaining seven patients were positive for HLA- DQ2.5-trans or HLA- DQ2.2 haplotypes (methods 2 and 3). One patient was negative with all three HLA methods. The HLA- DQ risk haplotypes were detected in 93.8% of the CD patients using the SNP technique (method 1). The sensitivity increased to 99.2% by combining methods 1 - 3.
Haplotype-based approach to known MS-associated regions increases the amount of explained risk
Khankhanian, Pouya; Gourraud, Pierre-Antoine; Lizee, Antoine; Goodin, Douglas S
2015-01-01
Genome-wide association studies (GWAS), using single nucleotide polymorphisms (SNPs), have yielded 110 non-human leucocyte antigen genomic regions that are associated with multiple sclerosis (MS). Despite this large number of associations, however, only 28% of MS-heritability can currently be explained. Here we compare the use of multi-SNP-haplotypes to the use of single-SNPs as alternative methods to describe MS genetic risk. SNP-haplotypes (of various lengths from 1 up to 15 contiguous SNPs) were constructed at each of the 110 previously identified, MS-associated, genomic regions. Even after correcting for the larger number of statistical comparisons made when using the haplotype-method, in 32 of the regions, the SNP-haplotype based model was markedly more significant than the single-SNP based model. By contrast, in no region was the single-SNP based model similarly more significant than the SNP-haplotype based model. Moreover, when we included the 932 MS-associated SNP-haplotypes (that we identified from 102 regions) as independent variables into a logistic linear model, the amount of MS-heritability, as assessed by Nagelkerke's R-squared, was 38%, which was considerably better than 29%, which was obtained by using only single-SNPs. This study demonstrates that SNP-haplotypes can be used to fine-map the genetic associations within regions of interest previously identified by single-SNP GWAS. Moreover, the amount of the MS genetic risk explained by the SNP-haplotype associations in the 110 MS-associated genomic regions was considerably greater when using SNP-haplotypes than when using single-SNPs. Also, the use of SNP-haplotypes can lead to the discovery of new regions of interest, which have not been identified by a single-SNP GWAS. PMID:26185143
Diegoli, Toni Marie; Rohde, Heinrich; Borowski, Stefan; Krawczak, Michael; Coble, Michael D; Nothnagel, Michael
2016-11-01
Typing of X chromosomal short tandem repeat (X STR) markers has become a standard element of human forensic genetic analysis. Joint consideration of many X STR markers at a time increases their discriminatory power but, owing to physical linkage, requires inter-marker recombination rates to be accurately known. We estimated the recombination rates between 15 well established X STR markers using genotype data from 158 families (1041 individuals) and following a previously proposed likelihood-based approach that allows for single-step mutations. To meet the computational requirements of this family-based type of analysis, we modified a previous implementation so as to allow multi-core parallelization on a high-performance computing system. While we obtained recombination rate estimates larger than zero for all but one pair of adjacent markers within the four previously proposed linkage groups, none of the three X STR pairs defining the junctions of these groups yielded a recombination rate estimate of 0.50. Corroborating previous studies, our results therefore argue against a simple model of independent X chromosomal linkage groups. Moreover, the refined recombination fraction estimates obtained in our study will facilitate the appropriate joint consideration of all 15 investigated markers in forensic analysis. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
2011-01-01
Background Yami and Ivatan islanders are Austronesian speakers from Orchid Island and the Batanes archipelago that are located between Taiwan and the Philippines. The paternal genealogies of the Yami tribe from 1962 monograph of Wei and Liu were compared with our dataset of non-recombining Y (NRY) chromosomes from the corresponding families. Then mitochondrial DNA polymorphism was also analyzed to determine the matrilineal relationships between Yami, Ivatan, and other East Asian populations. Results The family relationships inferred from the NRY Phylogeny suggested a low number of paternal founders and agreed with the genealogy of Wei and Liu (P < 0.01). Except for one Y short tandem repeat lineage (Y-STR), seen in two unrelated Yami families, no other Y-STR lineages were shared between villages, whereas mtDNA haplotypes were indiscriminately distributed throughout Orchid Island. The genetic affinity seen between Yami and Taiwanese aborigines or between Ivatan and the Philippine people was closer than that between Yami and Ivatan, suggesting that the Orchid islanders were colonized separately by their nearest neighbors and bred in isolation. However a northward gene flow to Orchid Island from the Philippines was suspected as Yami and Ivatan peoples both speak Western Malayo-Polynesian languages which are not spoken in Taiwan. Actually, only very little gene flow was observed between Yami and Ivatan or between Yami and the Philippines as indicated by the sharing of mtDNA haplogroup B4a1a4 and one O1a1* Y-STR lineage. Conclusions The NRY and mtDNA genetic information among Yami tribe peoples fitted well the patrilocal society model proposed by Wei and Liu. In this proposal, there were likely few genetic exchanges among Yami and the Philippine people. Trading activities may have contributed to the diffusion of Malayo-Polynesian languages among them. Finally, artifacts dating 4,000 YBP, found on Orchid Island and indicating association with the Out of Taiwan hypothesis
Assessing transmission of ‘Candidatus Liberibacter solanacearum’ haplotypes through seed potato
USDA-ARS?s Scientific Manuscript database
Conflicting data has previously been reported concerning the impact of zebra chip disease transmission through seed tubers. These discrepancies may be due to the experimental design of each study, whereby different pathogen haplotypes, insect vector haplotypes, and potato plant varieties were used....
Zumárraga, Mercedes; Arrúe, Aurora; Basterreche, Nieves; Macías, Isabel; Catalán, Ana; Madrazo, Arantza; Bustamante, Sonia; Zamalloa, María I; Erkoreka, Leire; Gordo, Estibaliz; Arnaiz, Ainara; Olivas, Olga; Arroita, Ariane; Marín, Elena; González-Torres, Miguel A
2016-06-01
We examined the association of COMT haplotypes and plasma metabolites of catecholamines in relation to the clinical response to antipsychotics in schizophrenic and bipolar patients. We studied 165 patients before and after four weeks of treatment, and 163 healthy controls. We assessed four COMT haplotypes and the plasma concentrations of HVA, DOPAC and MHPG. Bipolar patients: haplotypes are associated with age at onset and clinical evolution. In schizophrenic patients, an haplotype previously associated with increased risk, is related to better response of negative symptoms. Haplotypes would be good indicators of the clinical status and the treatment response in bipolar and schizophrenic patients. Larger studies are required to elucidate the clinical usefulness of these findings.
Two families from New England with usher syndrome type IC with distinct haplotypes.
DeAngelis, M M; McGee, T L; Keats, B J; Slim, R; Berson, E L; Dryja, T P
2001-03-01
To search for patients with Usher syndrome type IC among those with Usher syndrome type I who reside in New England. Genotype analysis of microsatellite markers closely linked to the USH1C locus was done using the polymerase chain reaction. We compared the haplotype of our patients who were homozygous in the USH1C region with the haplotypes found in previously reported USH1C Acadian families who reside in southwestern Louisiana and from a single family residing in Lebanon. Of 46 unrelated cases of Usher syndrome type I residing in New England, two were homozygous at genetic markers in the USH1C region. Of these, one carried the Acadian USH1C haplotype and had Acadian ancestors (that is, from Nova Scotia) who did not participate in the 1755 migration of Acadians to Louisiana. The second family had a haplotype that proved to be the same as that of a family with USH1C residing in Lebanon. Each of the two families had haplotypes distinct from the other. This is the first report that some patients residing in New England have Usher syndrome type IC. Patients with Usher syndrome type IC can have the Acadian haplotype or the Lebanese haplotype compatible with the idea that at least two independently arising pathogenic mutations have occurred in the yet-to-be identified USH1C gene.
Areškeviciute, Aušrine; Melchior, Linea Cecilie; Broholm, Helle; Krarup, Lars-Henrik; Granhøj Lindquist, Suzanne; Johansen, Peter; McKenzie, Neil; Green, Alison; Nielsen, Jørgen Erik; Laursen, Henning; Løbner Lund, Eva
2018-06-07
This is the first report of presumed sporadic Creutzfeldt-Jakob disease (sCJD) and Gerstmann-Sträussler-Scheinker disease (GSS) with the prion protein gene c.305C>T mutation (p.P102L) occurring in one family. The father and son were affected with GSS and the mother had a rapidly progressive form of CJD. Diagnosis of genetic, variant, and iatrogenic CJD was ruled out based on the mother's clinical history, genetic tests, and biochemical investigations, all of which supported the diagnosis of sCJD. However, given the low incidence of sCJD and GSS, their co-occurrence in one family is extraordinary and challenging. Thus, a hypothesis for the transmission of infectious prion proteins (PrPSc) via microchimerism was proposed and investigated. DNA from 15 different brain regions and plasma samples of the CJD patient was subjected to PCR and shallow sequencing for detection of a male sex-determining chromosome Y (chr. Y). However, no trace of chr. Y was found. A long CJD incubation period or presumed small concentrations of chr. Y may explain the obtained results. Further studies of CJD and GSS animal models with controlled genetic and proteomic features are needed to determine whether maternal CJD triggered via microchimerism by a GSS fetus might present a new PrPSc transmission route.
Two Orangutan Species Have Evolved Different KIR Alleles and Haplotypes1
Guethlein, Lisbeth A.; Norman, Paul J.; Heijmans, Corinne M. C.; de Groot, Natasja G.; Hilton, Hugo G.; Babrzadeh, Farbod; Abi-Rached, Laurent; Bontrop, Ronald E.; Parham, Peter
2017-01-01
The immune and reproductive functions of human Natural Killer (NK) cells are regulated by interactions of the C1 and C2 epitopes of HLA-C with C1-specific and C2-specific lineage III killer cell immunoglobulin-like receptors (KIR). This rapidly evolving and diverse system of ligands and receptors is restricted to humans and great apes. In this context, the orangutan has particular relevance because it represents an evolutionary intermediate, one having the C1 epitope and corresponding KIR, but lacking the C2 epitope. Through a combination of direct sequencing, KIR genotyping and data mining from the Great Ape Genome Project (GAGP) we characterized the KIR alleles and haplotypes for panels of ten Bornean orangutans and 19 Sumatran orangutans. The orangutan KIR haplotypes have between five and ten KIR genes. The seven orangutan lineage III KIR genes all locate to the centromeric region of the KIR locus, whereas their human counterparts also populate the telomeric region. One lineage III KIR gene is Bornean-specific, one is Sumatran-specific and five are shared. Of twelve KIR gene-content haplotypes five are Bornean-specific, five are Sumatran-specific and two are shared. The haplotypes have different combinations of genes encoding activating and inhibitory C1 receptors that can be of higher or lower affinity. All haplotypes encode an inhibitory C1 receptor, but only some haplotypes encode an activating C1 receptor. Of 130 KIR alleles, 55 are Bornean-specific, 65 are Sumatran specific and ten are shared. PMID:28264973
Fetal hemoglobin in sickle cell anemia: The Arab-Indian haplotype and new therapeutic agents.
Habara, Alawi H; Shaikho, Elmutaz M; Steinberg, Martin H
2017-11-01
Fetal hemoglobin (HbF) has well-known tempering effects on the symptoms of sickle cell disease and its levels vary among patients with different haplotypes of the sickle hemoglobin gene. Compared with sickle cell anemia haplotypes found in patients of African descent, HbF levels in Saudi and Indian patients with the Arab-Indian (AI) haplotype exceed that in any other haplotype by nearly twofold. Genetic association studies have identified some loci associated with high HbF in the AI haplotype but these observations require functional confirmation. Saudi patients with the Benin haplotype have HbF levels almost twice as high as African patients with this haplotype but this difference is unexplained. Hydroxyurea is still the only FDA approved drug for HbF induction in sickle cell disease. While most patients treated with hydroxyurea have an increase in HbF and some clinical improvement, 10 to 20% of adults show little response to this agent. We review the genetic basis of HbF regulation focusing on sickle cell anemia in Saudi Arabia and discuss new drugs that can induce increased levels of HbF. © 2017 Wiley Periodicals, Inc.
Morales Colón, Emely; Hernández, Mireya; Candelario, Mariel; Meléndez, María; Dawson Cruz, Tracey
2018-03-01
Traditional methods for bone pulverization typically generate heat, risking stability of DNA sample. SPEX™ has developed cryogenic grinders which introduce liquid nitrogen to cool the sample and aid in the grinding process. In this study, the Freezer Mill 6970 EFM was used with two DNA extraction methods and routine downstream STR analysis procedures. DNA from as little as 0.1 g of bone powder was used to develop full STR profiles after freezer mill pulverization, and the method was reproducible. Further, no contamination was detected upon cleaning/reuse of the sample vials. There were no significant differences in DNA yield, STR alleles detected, or peak heights using the freezer mill as compared to traditional grinding, and successful DNA profiles were achieved from as low as 0.1 g of bone powder with this method. Overall, this work indicates that this cryogenic mill method may be used as a viable alternative to traditional tissue grinders. © 2017 American Academy of Forensic Sciences.
Larson, D.L.; Galatowitsch, S.M.; Larson, J.L.
2011-01-01
Phragmites australis (common reed) is known to have occurred along the Platte River historically, but recent rapid increases in both distribution and density have begun to impact habitat for migrating sandhill cranes and nesting piping plovers and least terns. Invasiveness in Phragmites has been associated with the incursion of a European genotype (haplotype M) in other areas; determining the genotype of Phragmites along the central Platte River has implications for proper management of the river system. In 2008 we sampled Phragmites patches along the central Platte River from Lexington to Chapman, NE, stratified by bridge segments, to determine the current distribution of haplotype E (native) and haplotype M genotypes. In addition, we did a retrospective analysis of historical Phragmites collections from the central Platte watershed (1902-2006) at the Bessey Herbarium. Fresh tissue from the 2008 survey and dried tissue from the herbarium specimens were classified as haplotype M or E using the restriction fragment length polymorphism procedure. The European haplotype was predominant in the 2008 samples: only 14 Phragmites shoots were identified as native haplotype E; 224 were non-native haplotype M. The retrospective analysis revealed primarily native haplotype individuals. Only collections made in Lancaster County, near Lincoln, NE, were haplotype M, and the earliest of these was collected in 1973. ?? 2011 Copyright by the Center for Great Plains Studies, University of Nebraska-Lincoln.
Wang, Hongdan; Kang, Bing; Gao, Yue; Huo, Xiaodong; Li, Tao; Guo, Qiannan; Zhu, Bofeng; Liao, Shixiu
2017-04-10
To study the genetic polymorphisms and mutations of 20 frequently used autosomal microsatellites among ethnic Hans from Henan. Peripheral blood samples of 2604 individuals were collected. DNA was amplified and genotyped using a PowerPlex(TM) 21 system. The frequencies, forensic parameters and mutation rates of the 20 short tandem repeat (STR) loci were analyzed. A total of 323 alleles were found in this population and the allelic frequencies have ranged from 0.0003 to 0.5144. Except for D3S1358, TH01 and TPOX, mutations have been found in all of the remaining 17 STR loci, which totaled 47, with mutation rates ranging from 0 to 3.46 × 10 -3 . The 20 STR loci selected by the PowerPlex(TM) 21 system are highly polymorphic among ethnic Hans from Henan, and may be of great value in forensic and human population studies. As no similar study has been carried out previously, above results may be of great value for individual discrimination and paternal testing.
Wang, Zheng; Zhou, Di; Wang, Hui; Jia, Zhenjun; Liu, Jing; Qian, Xiaoqin; Li, Chengtao; Hou, Yiping
2017-11-01
Massively parallel sequencing (MPS) technologies have proved capable of sequencing the majority of the key forensic STR markers. By MPS, not only the repeat-length size but also sequence variations could be detected. Recently, Thermo Fisher Scientific has designed an advanced MPS 32-plex panel, named the Precision ID GlobalFiler™ NGS STR Panel, where the primer set has been designed specifically for the purpose of MPS technologies and the data analysis are supported by a new version HID STR Genotyper Plugin (V4.0). In this study, a series of experiments that evaluated concordance, reliability, sensitivity of detection, mixture analysis, and the ability to analyze case-type and challenged samples were conducted. In addition, 106 unrelated Han individuals were sequenced to perform genetic analyses of allelic diversity. As expected, MPS detected broader allele variations and gained higher power of discrimination and exclusion rate. MPS results were found to be concordant with current capillary electrophoresis methods, and single source complete profiles could be obtained stably using as little as 100pg of input DNA. Moreover, this MPS panel could be adapted to case-type samples and partial STR genotypes of the minor contributor could be detected up to 19:1 mixture. Aforementioned results indicate that the Precision ID GlobalFiler™ NGS STR Panel is reliable, robust and reproducible and have the potential to be used as a tool for human forensics. Copyright © 2017 Elsevier B.V. All rights reserved.
Haplotype-Based Genome-Wide Prediction Models Exploit Local Epistatic Interactions Among Markers
Jiang, Yong; Schmidt, Renate H.; Reif, Jochen C.
2018-01-01
Genome-wide prediction approaches represent versatile tools for the analysis and prediction of complex traits. Mostly they rely on marker-based information, but scenarios have been reported in which models capitalizing on closely-linked markers that were combined into haplotypes outperformed marker-based models. Detailed comparisons were undertaken to reveal under which circumstances haplotype-based genome-wide prediction models are superior to marker-based models. Specifically, it was of interest to analyze whether and how haplotype-based models may take local epistatic effects between markers into account. Assuming that populations consisted of fully homozygous individuals, a marker-based model in which local epistatic effects inside haplotype blocks were exploited (LEGBLUP) was linearly transformable into a haplotype-based model (HGBLUP). This theoretical derivation formally revealed that haplotype-based genome-wide prediction models capitalize on local epistatic effects among markers. Simulation studies corroborated this finding. Due to its computational efficiency the HGBLUP model promises to be an interesting tool for studies in which ultra-high-density SNP data sets are studied. Applying the HGBLUP model to empirical data sets revealed higher prediction accuracies than for marker-based models for both traits studied using a mouse panel. In contrast, only a small subset of the traits analyzed in crop populations showed such a benefit. Cases in which higher prediction accuracies are observed for HGBLUP than for marker-based models are expected to be of immediate relevance for breeders, due to the tight linkage a beneficial haplotype will be preserved for many generations. In this respect the inheritance of local epistatic effects very much resembles the one of additive effects. PMID:29549092
Haplotype-Based Genome-Wide Prediction Models Exploit Local Epistatic Interactions Among Markers.
Jiang, Yong; Schmidt, Renate H; Reif, Jochen C
2018-05-04
Genome-wide prediction approaches represent versatile tools for the analysis and prediction of complex traits. Mostly they rely on marker-based information, but scenarios have been reported in which models capitalizing on closely-linked markers that were combined into haplotypes outperformed marker-based models. Detailed comparisons were undertaken to reveal under which circumstances haplotype-based genome-wide prediction models are superior to marker-based models. Specifically, it was of interest to analyze whether and how haplotype-based models may take local epistatic effects between markers into account. Assuming that populations consisted of fully homozygous individuals, a marker-based model in which local epistatic effects inside haplotype blocks were exploited (LEGBLUP) was linearly transformable into a haplotype-based model (HGBLUP). This theoretical derivation formally revealed that haplotype-based genome-wide prediction models capitalize on local epistatic effects among markers. Simulation studies corroborated this finding. Due to its computational efficiency the HGBLUP model promises to be an interesting tool for studies in which ultra-high-density SNP data sets are studied. Applying the HGBLUP model to empirical data sets revealed higher prediction accuracies than for marker-based models for both traits studied using a mouse panel. In contrast, only a small subset of the traits analyzed in crop populations showed such a benefit. Cases in which higher prediction accuracies are observed for HGBLUP than for marker-based models are expected to be of immediate relevance for breeders, due to the tight linkage a beneficial haplotype will be preserved for many generations. In this respect the inheritance of local epistatic effects very much resembles the one of additive effects. Copyright © 2018 Jiang et al.
Jannink, Jean-Luc
2010-01-01
Genome-wide association studies (GWAS) may benefit from utilizing haplotype information for making marker-phenotype associations. Several rationales for grouping single nucleotide polymorphisms (SNPs) into haplotype blocks exist, but any advantage may depend on such factors as genetic architecture of traits, patterns of linkage disequilibrium in the study population, and marker density. The objective of this study was to explore the utility of haplotypes for GWAS in barley (Hordeum vulgare) to offer a first detailed look at this approach for identifying agronomically important genes in crops. To accomplish this, we used genotype and phenotype data from the Barley Coordinated Agricultural Project and constructed haplotypes using three different methods. Marker-trait associations were tested by the efficient mixed-model association algorithm (EMMA). When QTL were simulated using single SNPs dropped from the marker dataset, a simple sliding window performed as well or better than single SNPs or the more sophisticated methods of blocking SNPs into haplotypes. Moreover, the haplotype analyses performed better 1) when QTL were simulated as polymorphisms that arose subsequent to marker variants, and 2) in analysis of empirical heading date data. These results demonstrate that the information content of haplotypes is dependent on the particular mutational and recombinational history of the QTL and nearby markers. Analysis of the empirical data also confirmed our intuition that the distribution of QTL alleles in nature is often unlike the distribution of marker variants, and hence utilizing haplotype information could capture associations that would elude single SNPs. We recommend routine use of both single SNP and haplotype markers for GWAS to take advantage of the full information content of the genotype data. PMID:21124933
Hoogenboom, Jerry; van der Gaag, Kristiaan J; de Leeuw, Rick H; Sijen, Titia; de Knijff, Peter; Laros, Jeroen F J
2017-03-01
Massively parallel sequencing (MPS) is on the advent of a broad scale application in forensic research and casework. The improved capabilities to analyse evidentiary traces representing unbalanced mixtures is often mentioned as one of the major advantages of this technique. However, most of the available software packages that analyse forensic short tandem repeat (STR) sequencing data are not well suited for high throughput analysis of such mixed traces. The largest challenge is the presence of stutter artefacts in STR amplifications, which are not readily discerned from minor contributions. FDSTools is an open-source software solution developed for this purpose. The level of stutter formation is influenced by various aspects of the sequence, such as the length of the longest uninterrupted stretch occurring in an STR. When MPS is used, STRs are evaluated as sequence variants that each have particular stutter characteristics which can be precisely determined. FDSTools uses a database of reference samples to determine stutter and other systemic PCR or sequencing artefacts for each individual allele. In addition, stutter models are created for each repeating element in order to predict stutter artefacts for alleles that are not included in the reference set. This information is subsequently used to recognise and compensate for the noise in a sequence profile. The result is a better representation of the true composition of a sample. Using Promega Powerseq™ Auto System data from 450 reference samples and 31 two-person mixtures, we show that the FDSTools correction module decreases stutter ratios above 20% to below 3%. Consequently, much lower levels of contributions in the mixed traces are detected. FDSTools contains modules to visualise the data in an interactive format allowing users to filter data with their own preferred thresholds. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
Genetic analysis of autoimmune regulator haplotypes in alopecia areata.
Wengraf, D A; McDonagh, A J G; Lovewell, T R J; Vasilopoulos, Y; Macdonald-Hull, S P; Cork, M J; Messenger, A G; Tazi-Ahnini, R
2008-03-01
Alopecia areata is an immune-mediated disorder, occurring with the highest observed frequency in the rare recessive autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED) syndrome caused by mutations of the autoimmune regulator (AIRE) gene on chromosome 21q22.3. We have previously detected association between alopecia areata and a single nucleotide polymorphism (SNP) in the AIRE gene in patients without APECED, and we now report the findings of an extended examination of the association of alopecia areata with haplotype analysis including six SNPs in the AIRE gene: C-103T, C4144G, T5238C, G6528A, T7215C and T11787C. In Caucasian groups of 295 patients and 363 controls, we found strong association between the AIRE 7215C allele and AA [P = 3.8 x 10(-8), OR (95% CI): 2.69 (1.8-4.0)]. The previously reported association between AA and the AIRE 4144G allele was no longer significant on correction for multiple testing. The AIRE haplotypes CCTGCT and CGTGCC showed a highly significant association with AA [P = 6.05 x 10(-6), 9.47 (2.91-30.8) and P = 0.001, 3.51 (1.55-7.95), respectively]. To select the haplotypes most informative for analysis, we tagged the polymorphisms using SNPTag software. Employing AIRE C-103T, G6528A, T7215C and T11787C as tag SNPs, two haplotypes were associated with AA; AIRE CGCT and AIRE CGCC [P = 3.84 x 10(-7), 11.40 (3.53-36.9) and P = 3.94 x 10(-4), 2.13 (1.39-3.24) respectively]. The AIRE risk haplotypes identified in this study potentially account for a major component of the genetic risk of developing alopecia areata.
Haplotypes of CYP3A4 and their close linkage with CYP3A5 haplotypes in a Japanese population.
Fukushima-Uesaka, Hiromi; Saito, Yoshiro; Watanabe, Hidemi; Shiseki, Kisho; Saeki, Mayumi; Nakamura, Takahiro; Kurose, Kouichi; Sai, Kimie; Komamura, Kazuo; Ueno, Kazuyuki; Kamakura, Shiro; Kitakaze, Masafumi; Hanai, Sotaro; Nakajima, Toshiharu; Matsumoto, Kenji; Saito, Hirohisa; Goto, Yu-ichi; Kimura, Hideo; Katoh, Masaaki; Sugai, Kenji; Minami, Narihiro; Shirao, Kuniaki; Tamura, Tomohide; Yamamoto, Noboru; Minami, Hironobu; Ohtsu, Atsushi; Yoshida, Teruhiko; Saijo, Nagahiro; Kitamura, Yutaka; Kamatani, Naoyuki; Ozawa, Shogo; Sawada, Jun-ichi
2004-01-01
In order to identify single nucleotide polymorphisms (SNPs) and haplotype frequencies of CYP3A4 in a Japanese population, the distal enhancer and proximal promoter regions, all exons, and the surrounding introns were sequenced from genomic DNA of 416 Japanese subjects. We found 24 SNPs, including 17 novel ones: two in the distal enhancer, four in the proximal promoter, one in the 5'-untranslated region (UTR), seven in the introns, and three in the 3'-UTR. The most common SNP was c.1026+12G>A (IVS10+12G>A), with a 0.249 frequency. Four non-synonymous SNPs, c.554C>G (p.T185S, CYP3A4(*)16), c.830_831insA (p.E277fsX8, (*)6), c.878T>C (p.L293P, (*)18), and c.1088 C>T (p.T363M, (*)11) were found with frequencies of 0.014, 0.001, 0.028, and 0.002, respectively. No SNP was found in the known nuclear transcriptional factor-binding sites in the enhancer and promoter regions. Using these 24 SNPs, 16 haplotypes were unambiguously identified, and nine haplotypes were inferred by aid of an expectation-maximization-based program. In addition, using data from 186 subjects enabled a close linkage to be found between CYP3A4 and CYP3A5 SNPs, especially among the SNPs at c.1026+12 in CYP3A4 and c.219-237 (IVS3-237, a key SNP site for CYP3A5(*)3), c.865+77 (IVS9+77) and c.1523 in CYP3A5. This result suggested that CYP3A4 and CYP3A5 are within the same gene block. Haplotype analysis between CYP3A4 and CYP3A5 revealed several major haplotype combinations in the CYP3A4-CYP3A5 block. Our findings provide fundamental and useful information for genotyping CYP3A4 (and CYP3A5) in the Japanese, and probably Asian populations. Copyright 2003 Wiley-Liss, Inc.
Contu, L; Carcassi, C; Dausset, J
1989-01-01
The C4 and 21-OH loci of the class III HLA have been studied by specific DNA probes and the restriction enzyme Taq 1 in 24 unrelated Sardinian individuals selected from completely HLA-typed families. All 24 individuals had the HLA extended haplotype A30,Cw5,B18, BfF1,DR3,DRw52,DQw2, named "Sardinian" in the present paper because of its frequency of 15% in the Sardinian population. Eighteen of these were homozygous for the entire haplotype, and six were heterozygous at the A locus and blank (or homozygous) at all the other loci. In all completely homozygous cells and in four heterozygous cells at the A locus, the restriction fragments of the 21-OHA (3.2 kb) and C4B (5.8 kb or 5.4 kb) genes were absent, and the fragments of the C4A (7.0 kb) and 21-OHB (3.7 kb) genes were present. It is suggested that the "Sardinian" haplotype is an ancestral haplotype without duplication of the C4 and 21-OH genes, practically always identical in its structure, also in unrelated individuals. The diversity of this haplotype in the class III region (about 30 kb less) may be at least partially responsible for its misalignment with most haplotypes, which have duplicated C4 and 21-OH genes, and therefore also for its decreased probability to recombine. This can help explain its high stability and frequency in the Sardinian population. The same conclusion can be suggested for the Caucasian extended haplotype A1,B8,DR3 that always seems to lack the C4A and 21-OHA genes.
Blocks of limited haplotype diversity revealed by high-resolution scanning of human chromosome 21.
Patil, N; Berno, A J; Hinds, D A; Barrett, W A; Doshi, J M; Hacker, C R; Kautzer, C R; Lee, D H; Marjoribanks, C; McDonough, D P; Nguyen, B T; Norris, M C; Sheehan, J B; Shen, N; Stern, D; Stokowski, R P; Thomas, D J; Trulson, M O; Vyas, K R; Frazer, K A; Fodor, S P; Cox, D R
2001-11-23
Global patterns of human DNA sequence variation (haplotypes) defined by common single nucleotide polymorphisms (SNPs) have important implications for identifying disease associations and human traits. We have used high-density oligonucleotide arrays, in combination with somatic cell genetics, to identify a large fraction of all common human chromosome 21 SNPs and to directly observe the haplotype structure defined by these SNPs. This structure reveals blocks of limited haplotype diversity in which more than 80% of a global human sample can typically be characterized by only three common haplotypes.
Better ILP models for haplotype assembly.
Etemadi, Maryam; Bagherian, Mehri; Chen, Zhi-Zhong; Wang, Lusheng
2018-02-19
The haplotype assembly problem for diploid is to find a pair of haplotypes from a given set of aligned Single Nucleotide Polymorphism (SNP) fragments (reads). It has many applications in association studies, drug design, and genetic research. Since this problem is computationally hard, both heuristic and exact algorithms have been designed for it. Although exact algorithms are much slower, they are still of great interest because they usually output significantly better solutions than heuristic algorithms in terms of popular measures such as the Minimum Error Correction (MEC) score, the number of switch errors, and the QAN50 score. Exact algorithms are also valuable because they can be used to witness how good a heuristic algorithm is. The best known exact algorithm is based on integer linear programming (ILP) and it is known that ILP can also be used to improve the output quality of every heuristic algorithm with a little decline in speed. Therefore, faster ILP models for the problem are highly demanded. As in previous studies, we consider not only the general case of the problem but also its all-heterozygous case where we assume that if a column of the input read matrix contains at least one 0 and one 1, then it corresponds to a heterozygous SNP site. For both cases, we design new ILP models for the haplotype assembly problem which aim at minimizing the MEC score. The new models are theoretically better because they contain significantly fewer constraints. More importantly, our experimental results show that for both simulated and real datasets, the new model for the all-heterozygous (respectively, general) case can usually be solved via CPLEX (an ILP solver) at least 5 times (respectively, twice) faster than the previous bests. Indeed, the running time can sometimes be 41 times better. This paper proposes a new ILP model for the haplotype assembly problem and its all-heterozygous case, respectively. Experiments with both real and simulated datasets show that the
Missing data imputation and haplotype phase inference for genome-wide association studies
Browning, Sharon R.
2009-01-01
Imputation of missing data and the use of haplotype-based association tests can improve the power of genome-wide association studies (GWAS). In this article, I review methods for haplotype inference and missing data imputation, and discuss their application to GWAS. I discuss common features of the best algorithms for haplotype phase inference and missing data imputation in large-scale data sets, as well as some important differences between classes of methods, and highlight the methods that provide the highest accuracy and fastest computational performance. PMID:18850115
Effects of IL-10 haplotype and atomic bomb radiation exposure on gastric cancer risk.
Hayashi, Tomonori; Ito, Reiko; Cologne, John; Maki, Mayumi; Morishita, Yukari; Nagamura, Hiroko; Sasaki, Keiko; Hayashi, Ikue; Imai, Kazue; Yoshida, Kengo; Kajimura, Junko; Kyoizumi, Seishi; Kusunoki, Yoichiro; Ohishi, Waka; Fujiwara, Saeko; Akahoshi, Masazumi; Nakachi, Kei
2013-07-01
Gastric cancer (GC) is one of the cancers that reveal increased risk of mortality and incidence in atomic bomb survivors. The incidence of gastric cancer in the Life Span Study cohort of the Radiation Effects Research Foundation (RERF) increased with radiation dose (gender-averaged excess relative risk per Gy = 0.28) and remains high more than 65 years after exposure. To assess a possible role of gene-environment interaction, we examined the dose response for gastric cancer incidence based on immunosuppression-related IL-10 genotype, in a cohort study with 200 cancer cases (93 intestinal, 96 diffuse and 11 other types) among 4,690 atomic bomb survivors participating in an immunological substudy. Using a single haplotype block composed of four haplotype-tagging SNPs (comprising the major haplotype allele IL-10-ATTA and the minor haplotype allele IL-10-GGCG, which are categorized by IL-10 polymorphisms at -819A>G and -592T>G, +1177T>C and +1589A>G), multiplicative and additive models for joint effects of radiation and this IL-10 haplotyping were examined. The IL-10 minor haplotype allele(s) was a risk factor for intestinal type gastric cancer but not for diffuse type gastric cancer. Radiation was not associated with intestinal type gastric cancer. In diffuse type gastric cancer, the haplotype-specific excess relative risk (ERR) for radiation was statistically significant only in the major homozygote category of IL-10 (ERR = 0.46/Gy, P = 0.037), whereas estimated ERR for radiation with the minor IL-10 homozygotes was close to 0 and nonsignificant. Thus, the minor IL-10 haplotype might act to reduce the radiation related risk of diffuse-type gastric cancer. The results suggest that this IL-10 haplotyping might be involved in development of radiation-associated gastric cancer of the diffuse type, and that IL-10 haplotypes may explain individual differences in the radiation-related risk of gastric cancer. © 2013 by Radiation Research Society
Association between endothelin type A receptor haplotypes and mortality in coronary heart disease.
Ellis, Katrina L; Pilbrow, Anna P; Potter, Howard C; Frampton, Chris M; Doughty, Rob N; Whalley, Gillian A; Ellis, Chris J; Palmer, Barry R; Skelton, Lorraine; Yandle, Tim G; Troughton, Richard W; Richards, A Mark; A Cameron, Vicky
2012-05-01
The endothelin type A receptor, encoded by EDNRA, mediates the effects of endothelin-1 to promote vasoconstriction, vascular cell growth, adhesion, fibrosis and thrombosis. We investigated the association between EDNRA haplotype and cardiovascular outcomes in patients with coronary artery disease. Coronary disease patients (n = 1007) were genotyped for the His323His (rs5333) variant and one tag SNP from each of the major EDNRA haplotype blocks (rs6537484, rs1568136, rs5335 and rs10003447). EDNRA haplotype associations with clinical history, natriuretic peptides cardiac function and cardiovascular outcomes were tested over a median 3.8 years. Univariate analysis identified a 'low-risk' EDNRA haplotype associated with later age of Type 2 diabetes onset (p = 0.004) smaller BMI (p = 0.021), and reduced mortality (log rank p = 0.001). Cox proportional hazards analysis including established cardiovascular risk factors revealed an independent association between haplotype and mortality (p < 0.0001). These data highlight the potential importance of the endothelin system, and in particular EDNRA in coronary disease.
Analysis of MHC class I genes across horse MHC haplotypes
Tallmadge, Rebecca L.; Campbell, Julie A.; Miller, Donald C.; Antczak, Douglas F.
2010-01-01
The genomic sequences of 15 horse Major Histocompatibility Complex (MHC) class I genes and a collection of MHC class I homozygous horses of five different haplotypes were used to investigate the genomic structure and polymorphism of the equine MHC. A combination of conserved and locus-specific primers was used to amplify horse MHC class I genes with classical and non-classical characteristics. Multiple clones from each haplotype identified three to five classical sequences per homozygous animal, and two to three non-classical sequences. Phylogenetic analysis was applied to these sequences and groups were identified which appear to be allelic series, but some sequences were left ungrouped. Sequences determined from MHC class I heterozygous horses and previously described MHC class I sequences were then added, representing a total of ten horse MHC haplotypes. These results were consistent with those obtained from the MHC homozygous horses alone, and 30 classical sequences were assigned to four previously confirmed loci and three new provisional loci. The non-classical genes had few alleles and the classical genes had higher levels of allelic polymorphism. Alleles for two classical loci with the expected pattern of polymorphism were found in the majority of haplotypes tested, but alleles at two other commonly detected loci had more variation outside of the hypervariable region than within. Our data indicate that the equine Major Histocompatibility Complex is characterized by variation in the complement of class I genes expressed in different haplotypes in addition to the expected allelic polymorphism within loci. PMID:20099063
Maiers, M; Gragert, L; Madbouly, A; Steiner, D; Marsh, S G E; Gourraud, P-A; Oudshoorn, M; Zanden, H; Schmidt, A H; Pingel, J; Hofmann, J; Müller, C; Eberhard, H-P
2013-01-01
This project has the goal to validate bioinformatics methods and tools for HLA haplotype frequency analysis specifically addressing unique issues of haematopoietic stem cell registry data sets. In addition to generating new methods and tools for the analysis of registry data sets, the intent is to produce a comprehensive analysis of HLA data from 20 million donors from the Bone Marrow Donors Worldwide (BMDW) database. This report summarizes the activity on this project as of the 16IHIW meeting in Liverpool. PMID:23280139
Development of a novel forensic STR multiplex for ancestry analysis and extended identity testing.
Phillips, Chris; Fernandez-Formoso, Luis; Gelabert-Besada, Miguel; Garcia-Magariños, Manuel; Santos, Carla; Fondevila, Manuel; Carracedo, Angel; Lareu, Maria Victoria
2013-04-01
There is growing interest in developing additional DNA typing techniques to provide better investigative leads in forensic analysis. These include inference of genetic ancestry and prediction of common physical characteristics of DNA donors. To date, forensic ancestry analysis has centered on population-divergent SNPs but these binary loci cannot reliably detect DNA mixtures, common in forensic samples. Furthermore, STR genotypes, forming the principal DNA profiling system, are not routinely combined with forensic SNPs to strengthen frequency data available for ancestry inference. We report development of a 12-STR multiplex composed of ancestry informative marker STRs (AIM-STRs) selected from 434 tetranucleotide repeat loci. We adapted our online Bayesian classifier for AIM-SNPs: Snipper, to handle multiallele STR data using frequency-based training sets. We assessed the ability of the 12-plex AIM-STRs to differentiate CEPH Human Genome Diversity Panel populations, plus their informativeness combined with established forensic STRs and AIM-SNPs. We found combining STRs and SNPs improves the success rate of ancestry assignments while providing a reliable mixture detection system lacking from SNP analysis alone. As the 12 STRs generally show a broad range of alleles in all populations, they provide highly informative supplementary STRs for extended relationship testing and identification of missing persons with incomplete reference pedigrees. Lastly, mixed marker approaches (combining STRs with binary loci) for simple ancestry inference tests beyond forensic analysis bring advantages and we discuss the genotyping options available. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Haplotype analysis of the apolipoprotein gene cluster on human chromosome 11
Olivier, Michael; Wang, Xujing; Cole, Regina; Gau, Brian; Kim, Jessica; Rubin, Edward M.; Pennacchio, Len A.
2009-01-01
Members of the apolipoprotein gene cluster (APOA1/C3/A4/A5) on human chromosome 11q23 play an important role in lipid metabolism. Polymorphisms in both APOA5 and APOC3 are strongly associated with plasma triglyceride concentrations. The close genomic locations of these two genes as well as their functional similarity have hindered efforts to define whether each gene independently influences human triglyceride concentrations. In this study, we examined the linkage disequilibrium and haplotype structure of 49 SNPs in a 150-kb region spanning the gene cluster. We identified a total of five common APOA5 haplotypes with a frequency of greater than 8% in samples of northern European origin. The APOA5 haplotype block did not extend past the 7 SNPs in the gene and was separated from the other apolipoprotein gene in the cluster by a region of significantly increased recombination. Furthermore, one previously identified triglyceride risk haplotype of APOA5 (APOA5*3) showed no association with three APOC3 SNPs previously associated with triglyceride concentrations, in contrast to the other risk haplotype (APOA5*2), which was associated with all three minor APOC3 SNP alleles. These results highlight the complex genetic relationship between APOA5 and APOC3 and support the notion that APOA5 represents an independent risk gene affecting plasma triglyceride concentrations in humans. PMID:15081120
Molecular pathology and haplotype analysis of Wilson disease in Mediterranean populations
DOE Office of Scientific and Technical Information (OSTI.GOV)
Figus, A.; Farcia, A.M.G.; Nurchi, A.
1995-12-01
We analyzed mutations and defined the chromosomal haplotype in 127 patients of Mediterranean descent who were affected in Wilson disease (WD): 39 Sardinians, 49 Italians, 33 Turks, and 6 Albanians. Haplotypes were derived by use of the microsatellite markers D13S301, D13S296, D13S297, and D13S298, which are linked to the WD locus. There were five common haplotypes in Sardinians, three in Italians, and two in Turks, which accounted for 85%, 32%, and 30% of the WD chromosomes, respectively. We identified 16 novel mutations: 8 frameshifts, 7 missense mutations, and 1 splicing defect. In addition, we detected the previously described mutations: 2302insC,more » 3404delC, Arg1320ter, Gly944Ser, and His1070Gin. Of the new mutations detected, two, the 1515insT on haplotype I and 2464delC on haplotype XVI, accounted for 6% and 13%, respectively, of the mutations in WD chromsomes in the Sardinian populations. Mutations H1070Q, 2302insC, and 2533delA represented 13%, 8%, and 8%, respectively, of the mutations in WD chromsomes in other Mediterranean populations. The remaining mutations were rare and limited to one or two patients from different populations. Thus, WD results from some frequent mutations and many rare defects. 28 refs., 1 fig., 3 tabs.« less
The JAK2 GGCC (46/1) Haplotype in Myeloproliferative Neoplasms: Causal or Random?
Anelli, Luisa; Zagaria, Antonella; Specchia, Giorgina
2018-01-01
The germline JAK2 haplotype known as “GGCC or 46/1 haplotype” (haplotypeGGCC_46/1) consists of a combination of single nucleotide polymorphisms (SNPs) mapping in a region of about 250 kb, extending from the JAK2 intron 10 to the Insulin-like 4 (INLS4) gene. Four main SNPs (rs3780367, rs10974944, rs12343867, and rs1159782) generating a “GGCC” combination are more frequently indicated to represent the JAK2 haplotype. These SNPs are inherited together and are frequently associated with the onset of myeloproliferative neoplasms (MPN) positive for both JAK2 V617 and exon 12 mutations. The association between the JAK2 haplotypeGGCC_46/1 and mutations in other genes, such as thrombopoietin receptor (MPL) and calreticulin (CALR), or the association with triple negative MPN, is still controversial. This review provides an overview of the frequency and the role of the JAK2 haplotypeGGCC_46/1 in the pathogenesis of different myeloid neoplasms and describes the hypothetical mechanisms at the basis of the association with JAK2 gene mutations. Moreover, possible clinical implications are discussed, as different papers reported contrasting data about the correlation between the JAK2 haplotypeGGCC_46/1 and blood cell count, survival, or disease progression. PMID:29641446
Mendel-GPU: haplotyping and genotype imputation on graphics processing units
Chen, Gary K.; Wang, Kai; Stram, Alex H.; Sobel, Eric M.; Lange, Kenneth
2012-01-01
Motivation: In modern sequencing studies, one can improve the confidence of genotype calls by phasing haplotypes using information from an external reference panel of fully typed unrelated individuals. However, the computational demands are so high that they prohibit researchers with limited computational resources from haplotyping large-scale sequence data. Results: Our graphics processing unit based software delivers haplotyping and imputation accuracies comparable to competing programs at a fraction of the computational cost and peak memory demand. Availability: Mendel-GPU, our OpenCL software, runs on Linux platforms and is portable across AMD and nVidia GPUs. Users can download both code and documentation at http://code.google.com/p/mendel-gpu/. Contact: gary.k.chen@usc.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:22954633
STR-typing of ancient skeletal remains: which multiplex-PCR kit is the best?
Harder, Melanie; Renneberg, Rebecca; Meyer, Patrick; Krause-Kyora, Ben; von Wurmb-Schwark, Nicole
2012-01-01
Aim To comparatively test nine commercially available short tandem repeat (STR)-multiplex kits (PowerPlex 16, 16HS, ES, ESI17, ESX17, S5 [all Promega]; AmpFiSTR Identifiler, NGM and SEfiler [all Applied Biosystems]) for their efficiency and applicability to analyze ancient and thus highly degraded DNA samples. Methods Fifteen human skeletal remains from the late medieval age were obtained and analyzed using the nine polymerase chain reaction assays with slightly modified protocols. Data were systematically compared to find the most meaningful and sensitive assay. Results The ESI, ESX, and NGM kits showed the best overall results regarding amplification success, detection rate, identification of heterozygous alleles, sex determination, and reproducibility of the obtained data. Conclusion Since application of these three kits enables the employment of different primer sequences for all the investigated amplicons, a combined application is recommended for best possible and – most importantly – reliable genetic analysis of ancient skeletal material or otherwise highly degraded samples, eg, from forensic cases. PMID:23100203
Wilmanns, J C
1982-12-01
The 100th anniversary of the first description of paroxysmal nocturnal hemoglobinuria by Paul Strübing presents an opportunity to analyze the premises valid for the description of this disease in addition to an attempt at an extensive pathophysiological analysis. Strübing's two papers of 1882 were way ahead of his time, when pathophysiology was just at its beginning, particularly considering the fact that neither Marchiafava, who is still commonly credited wit the first description of this disease (1911) and its recognition as a clinical entity (1928), nor his student Micheli analyzed the PNH syndrome in pathophysiological terms as carefully as Strübing. Both of the former names were given to the disease, which is generally referred to as the Marchiafava-Micheli Anemia. William Crosby, who in 1951 in a historical review of PHN first pointed out the pioneering achievement of Strübing, suggested that it was mainly due to the lack of the right "intellectual climate" at the time that so little attention was paid to his work. Still another important aspect of the early history of PNH will be described in the present paper. The analysis of Strübing's publications leads to the conclusion that he was only able to make his important contribution to medical science because he not only had the appropriate clinical setting but also the scientific backup of the famous physiologist Leonhard Landois and his institute at the University of Greifswald, which is an excellent example of scientific progress through cooperation between a clinician and a research scientist.
Genetic data for 26 autosomal STR markers from Brazilian population.
Pereira, Tamiris Fátima Correia; Malaghini, Marcelo; Magalhães, João Carlos Maciel; Moura-Neto, Rodrigo; Sotomaior, Vanessa Santos
2018-01-19
The allelic frequency distributions and statistical forensic parameters of 26 mini short tandem repeat (mini-STR) loci in a sample of 1575 unrelated individuals from five different Brazilian regions were obtained. All the analyzed loci showed great diversity and were highly informative. The results were compared with those of the US Caucasian, African American, and Hispanic population studies. This study aimed to contribute to forensic analysis for human identification and inference of the evidential value in familial bond tests.
Fritz, Sébastien; Capitan, Aurelien; Djari, Anis; Rodriguez, Sabrina C; Barbat, Anne; Baur, Aurélia; Grohs, Cécile; Weiss, Bernard; Boussaha, Mekki; Esquerré, Diane; Klopp, Christophe; Rocha, Dominique; Boichard, Didier
2013-01-01
The regular decrease of female fertility over time is a major concern in modern dairy cattle industry. Only half of this decrease is explained by indirect response to selection on milk production, suggesting the existence of other factors such as embryonic lethal genetic defects. Genomic regions harboring recessive deleterious mutations were detected in three dairy cattle breeds by identifying frequent haplotypes (>1%) showing a deficit in homozygotes among Illumina Bovine 50k Beadchip haplotyping data from the French genomic selection database (47,878 Holstein, 16,833 Montbéliarde, and 11,466 Normande animals). Thirty-four candidate haplotypes (p<10(-4)) including previously reported regions associated with Brachyspina, CVM, HH1, and HH3 in Holstein breed were identified. Haplotype length varied from 1 to 4.8 Mb and frequencies from 1.7 up to 9%. A significant negative effect on calving rate, consistent in heifers and in lactating cows, was observed for 9 of these haplotypes in matings between carrier bulls and daughters of carrier sires, confirming their association with embryonic lethal mutations. Eight regions were further investigated using whole genome sequencing data from heterozygous bull carriers and control animals (45 animals in total). Six strong candidate causative mutations including polymorphisms previously reported in FANCI (Brachyspina), SLC35A3 (CVM), APAF1 (HH1) and three novel mutations with very damaging effect on the protein structure, according to SIFT and Polyphen-2, were detected in GART, SHBG and SLC37A2 genes. In conclusion, this study reveals a yet hidden consequence of the important inbreeding rate observed in intensively selected and specialized cattle breeds. Counter-selection of these mutations and management of matings will have positive consequences on female fertility in dairy cattle.
Fritz, Sébastien; Capitan, Aurelien; Djari, Anis; Rodriguez, Sabrina C.; Barbat, Anne; Baur, Aurélia; Grohs, Cécile; Weiss, Bernard; Boussaha, Mekki; Esquerré, Diane; Klopp, Christophe; Rocha, Dominique; Boichard, Didier
2013-01-01
The regular decrease of female fertility over time is a major concern in modern dairy cattle industry. Only half of this decrease is explained by indirect response to selection on milk production, suggesting the existence of other factors such as embryonic lethal genetic defects. Genomic regions harboring recessive deleterious mutations were detected in three dairy cattle breeds by identifying frequent haplotypes (>1%) showing a deficit in homozygotes among Illumina Bovine 50k Beadchip haplotyping data from the French genomic selection database (47,878 Holstein, 16,833 Montbéliarde, and 11,466 Normande animals). Thirty-four candidate haplotypes (p<10−4) including previously reported regions associated with Brachyspina, CVM, HH1, and HH3 in Holstein breed were identified. Haplotype length varied from 1 to 4.8 Mb and frequencies from 1.7 up to 9%. A significant negative effect on calving rate, consistent in heifers and in lactating cows, was observed for 9 of these haplotypes in matings between carrier bulls and daughters of carrier sires, confirming their association with embryonic lethal mutations. Eight regions were further investigated using whole genome sequencing data from heterozygous bull carriers and control animals (45 animals in total). Six strong candidate causative mutations including polymorphisms previously reported in FANCI (Brachyspina), SLC35A3 (CVM), APAF1 (HH1) and three novel mutations with very damaging effect on the protein structure, according to SIFT and Polyphen-2, were detected in GART, SHBG and SLC37A2 genes. In conclusion, this study reveals a yet hidden consequence of the important inbreeding rate observed in intensively selected and specialized cattle breeds. Counter-selection of these mutations and management of matings will have positive consequences on female fertility in dairy cattle. PMID:23762392
The first successful use of a low stringency familial match in a French criminal investigation.
Pham-Hoai, Emmanuel; Crispino, Frank; Hampikian, Greg
2014-05-01
We describe how a very simple application of familial searching resolved a decade-old, high-profile rape/murder in France. This was the first use of familial searching in a criminal case using the French STR DNA database, which contains approximately 1,800,000 profiles. When an unknown forensic profile (18 loci) was searched against the French arrestee/offender database using CODIS configured for a low stringency search, a single low stringency match was identified. This profile was attributed to the father of the man suspected to be the source of the semen recovered from the murder victim Elodie Kulik. The identification was confirmed using Y-chromosome DNA from the putative father, an STR profile from the mother, and finally a tissue sample from the exhumed body of the man who left the semen. Because of this identification, the investigators are now pursuing possible co-conspirators. © 2014 American Academy of Forensic Sciences.
Ng, Kevin Kit Siong; Lee, Soon Leong; Tnah, Lee Hong; Nurul-Farhanah, Zakaria; Ng, Chin Hong; Lee, Chai Ting; Tani, Naoki; Diway, Bibian; Lai, Pei Sing; Khoo, Eyen
2016-07-01
Illegal logging and smuggling of Gonystylus bancanus (Thymelaeaceae) poses a serious threat to this fragile valuable peat swamp timber species. Using G. bancanus as a case study, DNA markers were used to develop identification databases at the species, population and individual level. The species level database for Gonystylus comprised of an rDNA (ITS2) and two cpDNA (trnH-psbA and trnL) markers based on a 20 Gonystylus species database. When concatenated, taxonomic species recognition was achieved with a resolution of 90% (18 out of the 20 species). In addition, based on 17 natural populations of G. bancanus throughout West (Peninsular Malaysia) and East (Sabah and Sarawak) Malaysia, population and individual identification databases were developed using cpDNA and STR markers respectively. A haplotype distribution map for Malaysia was generated using six cpDNA markers, resulting in 12 unique multilocus haplotypes, from 24 informative intraspecific variable sites. These unique haplotypes suggest a clear genetic structuring of West and East regions. A simulation procedure based on the composition of the samples was used to test whether a suspected sample conformed to a given regional origin. Overall, the observed type I and II errors of the databases showed good concordance with the predicted 5% threshold which indicates that the databases were useful in revealing provenance and establishing conformity of samples from West and East Malaysia. Sixteen STRs were used to develop the DNA profiling databases for individual identification. Bayesian clustering analyses divided the 17 populations into two main genetic clusters, corresponding to the regions of West and East Malaysia. Population substructuring (K=2) was observed within each region. After removal of bias resulting from sampling effects and population subdivision, conservativeness tests showed that the West and East Malaysia databases were conservative. This suggests that both databases can be used independently
Two independent apolipoprotein A5 haplotypes influence human plasma triglyceride levels.
Pennacchio, Len A; Olivier, Michael; Hubacek, Jaroslav A; Krauss, Ronald M; Rubin, Edward M; Cohen, Jonathan C
2002-11-15
The recently identified apolipoprotein A5 gene (APOA5) has been shown to play an important role in determining plasma triglyceride concentrations in humans and mice. We previously identified an APOA5 haplotype (designated APOA5*2) that is present in approximately 16% of Caucasians and is associated with increased plasma triglyceride concentrations. In this report we describe another APOA5 haplotype (APOA5*3) containing the rare allele of the single nucleotide polymorphism c.56C>G that changes serine to tryptophan at codon 19 and is independently associated with high plasma triglyceride levels in three different populations. In a sample of 264 Caucasian men and women with plasma triglyceride concentrations above the 90th percentile or below the 10th percentile, the APOA5*3 haplotype was more than three-fold more common in the group with high plasma triglyceride levels. In a second independently ascertained sample of Caucasian men and women (n=419) who were studied while consuming their self-selected diets as well as after high-carbohydrate diets and high-fat diets, the APOA5*3 haplotype was associated with increased plasma triglyceride levels on all three dietary regimens. In a third population comprising 2660 randomly selected individuals, the APOA5*3 haplotype was found in 12% of Caucasians, 14% of African-Americans and 28% of Hispanics and was associated with increased plasma triglyceride levels in both men and women in each ethnic group. These findings establish that the APOA5 locus contributes significantly to inter-individual variation in plasma triglyceride levels in humans. Together, the APOA5*2 and APOA5*3 haplotypes are found in 25-50% of African-Americans, Hispanics and Caucasians and support the contribution of common human variation to quantitative phenotypes in the general population.
Two independent apolipoprotein a5 Haplotypes influence human plasma triglyceride levels
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pennacchio, Len A.; Olivier, Michael; Hubacek, Jaroslav A.
2002-09-16
The recently identified apolipoprotein A5 gene (APOA5) has been shown to play an important role in determining plasma triglyceride concentrations in humans and mice. We previously identified an APOA5 haplotype (designated APOA5*2) that is present in {approx}16 percent of Caucasians and is associated with increased plasma triglyceride concentrations. In this report we describe another APOA5 haplotype (APOA5*3) containing the rare allele of the single nucleotide polymorphism c.56C>G that changes serine to tryptophan at codon 19 and is independently associated with high plasma triglyceride levels in three different populations. In a sample of 264 Caucasian men and women with plasma triglyceridemore » concentrations above the 90th percentile or below the 10th percentile, the APOA5*3 haplotype was more than three-fold more common in the group with high plasma triglyceride levels. In a second independently ascertained sample of Caucasian men and women (n 1/4 419) who were studied while consuming their self-selected diets as well as after high-carbohydrate diets and high-fat diets, the APOA5*3 haplotype was associated with increased plasma triglyceride levels on all three dietary regimens. In a third population comprising 2660 randomly selected individuals, the APOA5*3 haplotype was found in 12 percent of Caucasians, 14 percent of African-Americans and 28 percent of Hispanics and was associated with increased plasma triglyceride levels in both men and women in each ethnic group. These findings establish that the APOA5 locus contributes significantly to inter-individual variation in plasma triglyceride levels in humans. Together, the APOA5*2 and APOA5*3 haplotypes are found in 25 50 percent of African-Americans, Hispanics and Caucasians and support the contribution of common human variation to quantitative phenotypes in the general population.« less
Paleolithic spread of Y-chromosomal lineage of tribes in eastern and northeastern India.
Borkar, Minal; Ahmad, Fahim; Khan, Faisal; Agrawal, Suraksha
2011-11-01
The Indian peninsula provides a suitable region for examination of the demographic impact of migrations and invasions in historical times, because its complex recent history has involved the long-term residence of different populations with distinct geographical origins and their own particular cultural characteristics. The aim of the present study was to analyse Y chromosome haplotypes in tribes from eastern and north-eastern India, which provided the necessary phylogeographic resolution. A total of 32 Y-chromosome SNPs and 17 Y-STRs were genotyped in 607 males from nine populations (Munda, Birhor, Oraon, Paharia, Santhal, Ho, Lachung, Mech and Rajbanshi) residing in East and Northeastern India. Y-chromosomal analysis revealed high frequency of the O2a haplogroup in Austroasiatic tribes and high haplotype diversity within specific haplogroups demonstrating a lesser degree of admixture of these populations with neighbouring populations in eastern India. In addition, the presence of O3a haplogroups in Sino-Tibetan populations reflects the influx from Southeast Asia during the demographic expansion through the Northeastern corridor. The study suggested that the majority of the male gene flow of Austroasiatic tribes occurred during the late Pleistocene period. The results suggest gene flow from Southeast Asia to Northeast India, albeit more significantly among Tibeto-Burman than Austroasiatic-speaking populations.
Pingel, Julia; Solloch, Ute V; Hofmann, Jan A; Lange, Vinzenz; Ehninger, Gerhard; Schmidt, Alexander H
2013-03-01
In hematopoietic stem cell transplantation, human leukocyte antigens (HLA), usually HLA loci A, B, C, DRB1 and DQB1, are required to check histocompatibility between a potential donor and the recipient suffering from a malignant or non-malignant blood disease. As databases of potential unrelated donors are very heterogeneous with respect to typing resolution and number of typed loci, donor registries make use of haplotype frequency-based algorithms to provide matching probabilities for each potentially matching recipient/donor pair. However, it is well known that HLA allele and haplotype frequencies differ significantly between populations. We estimated high-resolution HLA-A, -B, -C, -DRB1 haplotype and allele frequencies of donors within DKMS German Bone Marrow Donor Center with parentage from 17 different countries: Turkey, Poland, Italy, Russian Federation, Croatia, Greece, Austria, Kazakhstan, France, The Netherlands, Republic of China, Romania, Portugal, USA, Spain, United Kingdom and Bosnia and Herzegovina. 5-locus haplotypes including HLA-DQB1 are presented for Turkey, Poland, Italy and Russian Federation. We calculated linkage disequilibria for each sample. Genetic distances between included countries could be shown to reflect geography. We further demonstrate how genetic differences between populations are reflected in matching probabilities of recipient/donor pairs and how they influence the search for unrelated donors as well as strategic donor center typings. Copyright © 2012 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.
High-Density SNP Genotyping to Define β-Globin Locus Haplotypes
Liu, Li; Muralidhar, Shalini; Singh, Manisha; Sylvan, Caprice; Kalra, Inderdeep S.; Quinn, Charles T.; Onyekwere, Onyinye C.; Pace, Betty S.
2014-01-01
Five major β-globin locus haplotypes have been established in individuals with sickle cell disease (SCD) from the Benin, Bantu, Senegal, Cameroon, and Arab-Indian populations. Historically, β-haplotypes were established using restriction fragment length polymorphism (RFLP) analysis across the β-locus, which consists of five functional β-like globin genes located on chromosome 11. Previous attempts to correlate these haplotypes as robust predictors of clinical phenotypes observed in SCD have not been successful. We speculate that the coverage and distribution of the RFLP sites located proximal to or within the globin genes are not sufficiently dense to accurately reflect the complexity of this region. To test our hypothesis, we performed RFLP analysis and high-density single nucleotide polymorphism (SNP) genotyping across the β-locus using DNA samples from either healthy African Americans with normal hemoglobin A (HbAA) or individuals with homozygous SS (HbSS) disease. Using the genotyping data from 88 SNPs and Haploview analysis, we generated a greater number of haplotypes than that observed with RFLP analysis alone. Furthermore, a unique pattern of long-range linkage disequilibrium between the locus control region and the β-like globin genes was observed in the HbSS group. Interestingly, we observed multiple SNPs within the HindIII restriction site located in the Gγ-globin intervening sequence II which produced the same RFLP pattern. These findings illustrated the inability of RFLP analysis to decipher the complexity of sequence variations that impacts genomic structure in this region. Our data suggest that high density SNP mapping may be required to accurately define β-haplotypes that correlate with the different clinical phenotypes observed in SCD. PMID:18829352
Powers, T O; Bernard, E C; Harris, T; Higgins, R; Olson, M; Lodema, M; Mullin, P; Sutton, L; Powers, K S
2014-07-03
Without applying an a priori bias for species boundaries, specimen identities in the plant-parasitic nematode genus Mesocriconema were evaluated by examining mitochondrial COI nucleotide sequences, morphology, and biogeography. A total of 242 specimens that morphologically conformed to the genus were individually photographed, measured, and amplified by a PCR primer set to preserve the linkage between specimen morphology and a specific DNA barcode sequence. Specimens were extracted from soil samples representing 45 locations across 23 ecoregions in North America. Dendrograms constructed by neighbor-joining, maximum likelihood, and Bayesian Inference using a 721-bp COI barcode were used to group COI haplotypes. Each tree-building approach resulted in 24 major haplotype groups within the dataset. The distinctiveness of these groups was evaluated by node support, genetic distance, absence of intermediates, and several measures of distinctiveness included in software used for the exploration of species boundaries. Five of the 24 COI haplotype groups corresponded to morphologically characterized, Linnaean species. Morphospecies conforming to M. discus, Discocriconemella inarata, M. rusticum, M. onoense, and M. kirjanovae were represented by groups composed of multiple closely related or identical COI haplotypes. In other cases, morphospecies names could be equally applied to multiple haplotype groups that were genetically distant from each other. Identification based on morphology alone resulted in M. curvatum and M. ornatum species designations applied to seven and three groups, respectively. Morphological characters typically used for species level identification were demonstrably variable within haplotype groups, suggesting caution in assigning species names based on published compendia that solely consider morphological characters. Morphospecies classified as M. xenoplax formed a monophyletic group composed of seven genetically distinct COI subgroups. The species
Balam-Ortiz, Eros; Esquivel-Villarreal, Adolfo; Huerta-Hernandez, David; Fernandez-Lopez, Juan Carlos; Alfaro-Ruiz, Luis; Muñoz-Monroy, Omar; Gutierrez, Ruth; Figueroa-Genis, Enrique; Carrillo, Karol; Elizalde, Adela; Hidalgo, Alfredo; Rodriguez, Mauricio; Urushihara, Maki; Kobori, Hiroyuki; Jimenez-Sanchez, Gerardo
2012-04-01
The angiotensinogen gene locus has been associated with essential hypertension in most populations analyzed to date. Increased plasma angiotensinogen levels have been proposed as an underlying cause of essential hypertension in whites; however, differences in the genetic regulation of plasma angiotensinogen levels have also been reported for other populations. The aim of this study was to analyze the relationship between angiotensinogen gene polymorphisms and haplotypes with plasma angiotensinogen levels and the risk of essential hypertension in the Mexican population. We genotyped 9 angiotensinogen gene polymorphisms in 706 individuals. Four polymorphisms, A-6, C4072, C6309, and G12775, were associated with increased risk, and the strongest association was found for the C6309 allele (χ(2)=23.9; P=0.0000009), which resulted in an odds ratio of 3.0 (95% CI: 1.8-4.9; P=0.000006) in the recessive model. Two polymorphisms, A-20C (P=0.003) and C3389T (P=0.0001), were associated with increased plasma angiotensinogen levels but did not show association with essential hypertension. The haplotypes H1 (χ(2)=8.1; P=0.004) and H5 (χ(2)=5.1; P=0.02) were associated with essential hypertension. Using phylogenetic analysis, we found that haplotypes 1 and 5 are the human ancestral haplotypes. Our results suggest that the positive association between angiotensinogen gene polymorphisms and haplotypes with essential hypertension is not simply explained by an increase in plasma angiotensinogen concentration. Complex interactions between risk alleles suggest that these haplotypes act as "superalleles."
Balam-Ortiz, Eros; Esquivel-Villarreal, Adolfo; Huerta-Hernandez, David; Fernandez-Lopez, Juan Carlos; Alfaro-Ruiz, Luis; Muñoz-Monroy, Omar; Gutierrez, Ruth; Figueroa-Genis, Enrique; Carrillo, Karol; Elizalde, Adela; Hidalgo, Alfredo; Rodriguez, Mauricio; Urushihara, Maki; Kobori, Hiroyuki; Jimenez-Sanchez, Gerardo
2012-01-01
The angiotensinogen gene locus has been associated with essential hypertension in most populations analyzed to date. Increased plasma angiotensinogen levels have been proposed as an underlying cause of essential hypertension in whites; however, differences in the genetic regulation of plasma angiotensinogen levels have also been reported for other populations. The aim of this study was to analyze the relationship between angiotensinogen gene polymorphisms and haplotypes with plasma angiotensinogen levels and the risk of essential hypertension in the Mexican population. We genotyped 9 angiotensinogen gene polymorphisms in 706 individuals. Four polymorphisms, A-6, C4072, C6309, and G12775, were associated with increased risk, and the strongest association was found for the C6309 allele (χ2 = 23.9; P = 0.0000009), which resulted in an odds ratio of 3.0 (95% CI: 1.8–4.9; P = 0.000006) in the recessive model. Two polymorphisms, A-20C (P = 0.003) and C3389T (P = 0.0001), were associated with increased plasma angiotensinogen levels but did not show association with essential hypertension. The haplotypes H1 (χ2 = 8.1; P = 0.004) and H5 (χ2 = 5.1; P = 0.02) were associated with essential hypertension. Using phylogenetic analysis, we found that haplotypes 1 and 5 are the human ancestral haplotypes. Our results suggest that the positive association between angiotensinogen gene polymorphisms and haplotypes with essential hypertension is not simply explained by an increase in plasma angiotensinogen concentration. Complex interactions between risk alleles suggest that these haplotypes act as “superalleles.” PMID:22371359
The HLA-DRB9 gene and the origin of HLA-DR haplotypes.
Gongora, R; Figueroa, F; Klein, J
1996-11-01
HLA-DRB9 is a gene fragment consisting of exon 2 and flanking intron sequences. It is located at the extreme end of the DRB subregion, whose other end is demarcated by the DRB1 locus. We sequenced approximately 1400 base pairs of the segment encompassing the DRB9 locus from eight human haplotypes (DR1, DR10, DR2, DR3, DR5, DR6, DR8, and DR9, the DR4 and DR7 having been sequenced by others earlier), as well as two chimpanzee, five gorillas, one orangutan and one macaque haplotype. The analysis of these sequences indicates that the DRB9 locus, which we estimate to be more than 58 million years (my) old, has been coevolving with the DRB1 locus for the last 4.2 my. As a consequence of this coevolution, the human DRB9 alleles fall into groups that correlate with the DRB1 allelic groups and with the gene organization of the human haplotypes. This observation implies that the present-day HLA-DR haplotype groups (DR1, DR51, DR52, DR8, and DR53) were founded more than 4 my ago and have remained intact (barring minor internal rearrangements that did not recombine the DRB1 and DRB9 genes) for this period of time. The haplotypes have been transmitted during speciations from ancestral to emerging species just like allelic lineages at the DRB1 locus. Thus not only allelic but also haplotype polymorphism evolves trans-specifically.
Dos Santos Silva, Wellington; de Nazaré Klautau-Guimarães, Maria; Grisolia, Cesar Koppe
2010-07-01
Five restriction site polymorphisms in the β-globin gene cluster (HincII-5' ε, HindIII-(G) γ, HindIII-(A) γ, HincII- ψβ1 and HincII-3' ψβ1) were analyzed in three populations (n = 114) from Reconcavo Baiano, State of Bahia, Brazil. The groups included two urban populations from the towns of Cachoeira and Maragojipe and one rural Afro-descendant population, known as the "quilombo community", from Cachoeira municipality. The number of haplotypes found in the populations ranged from 10 to 13, which indicated higher diversity than in the parental populations. The haplotypes 2 (+ - - - -), 3 (- - - - +), 4 (- + - - +) and 6 (- + + - +) on the β(A) chromosomes were the most common, and two haplotypes, 9 (- + + + +) and 14 (+ + - - +), were found exclusively in the Maragojipe population. The other haplotypes (1, 5, 9, 11, 12, 13, 14 and 16) had lower frequencies. Restriction site analysis and the derived haplotypes indicated homogeneity among the populations. Thirty-two individuals with hemoglobinopathies (17 sickle cell disease, 12 HbSC disease and 3 HbCC disease) were also analyzed. The haplotype frequencies of these patients differed significantly from those of the general population. In the sickle cell disease subgroup, the predominant haplotypes were BEN (Benin) and CAR (Central African Republic), with frequencies of 52.9% and 32.4%, respectively. The high frequency of the BEN haplotype agreed with the historical origin of the afro-descendant population in the state of Bahia. However, this frequency differed from that of Salvador, the state capital, where the CAR and BEN haplotypes have similar frequencies, probably as a consequence of domestic slave trade and subsequent internal migrations to other regions of Brazil.
2010-01-01
Five restriction site polymorphisms in the β-globin gene cluster (HincII-5‘ ε, HindIII-G γ, HindIII-A γ, HincII- ψβ1 and HincII-3‘ ψβ1) were analyzed in three populations (n = 114) from Reconcavo Baiano, State of Bahia, Brazil. The groups included two urban populations from the towns of Cachoeira and Maragojipe and one rural Afro-descendant population, known as the “quilombo community”, from Cachoeira municipality. The number of haplotypes found in the populations ranged from 10 to 13, which indicated higher diversity than in the parental populations. The haplotypes 2 (+ - - - -), 3 (- - - - +), 4 (- + - - +) and 6 (- + + - +) on the βA chromosomes were the most common, and two haplotypes, 9 (- + + + +) and 14 (+ + - - +), were found exclusively in the Maragojipe population. The other haplotypes (1, 5, 9, 11, 12, 13, 14 and 16) had lower frequencies. Restriction site analysis and the derived haplotypes indicated homogeneity among the populations. Thirty-two individuals with hemoglobinopathies (17 sickle cell disease, 12 HbSC disease and 3 HbCC disease) were also analyzed. The haplotype frequencies of these patients differed significantly from those of the general population. In the sickle cell disease subgroup, the predominant haplotypes were BEN (Benin) and CAR (Central African Republic), with frequencies of 52.9% and 32.4%, respectively. The high frequency of the BEN haplotype agreed with the historical origin of the afro-descendant population in the state of Bahia. However, this frequency differed from that of Salvador, the state capital, where the CAR and BEN haplotypes have similar frequencies, probably as a consequence of domestic slave trade and subsequent internal migrations to other regions of Brazil. PMID:21637405
Black, F L
1984-11-01
HLA B-C haplotypes exhibit common disequilibria in populations drawn from four continents, indicating that they are subject to broadly active selective forces. However, the A-B and A-C associations we have examined show no consistent disequilibrium pattern, leaving open the possibility that these disequilibria are due to descent from common progenitors. By examining HLA haplotype distributions, I have explored the implications that would follow from the hypothesis that biological selection played no role in determining A-C disequilibria in 10 diverse tribes of the lower Amazon Basin. Certain haplotypes are in strong positive disequilibria across a broad geographic area, suggesting that members of diverse tribes descend from common ancestors. On the basis of the extent of diffusion of the components of these haplotypes, one can estimate that the progenitors lived less than 6,000 years ago. One widely encountered lineage entered the area within the last 1,200 years. When haplotype frequencies are used in genetic distance measurements, they give a pattern of relationships very similar to that obtained by conventional chord measurements based on several genetic markers; but more than that, when individual haplotype disequilibria in the several tribes are compared, multiple origins of a single tribe are discernible and relationships are revealed that correlate more closely to geographic and linguistic patterns than do the genetic distance measurements.
Three Novel Haplotypes of Theileria bicornis in Black and White Rhinoceros in Kenya.
Otiende, M Y; Kivata, M W; Jowers, M J; Makumi, J N; Runo, S; Obanda, V; Gakuya, F; Mutinda, M; Kariuki, L; Alasaad, S
2016-02-01
Piroplasms, especially those in the genera Babesia and Theileria, have been found to naturally infect rhinoceros. Due to natural or human-induced stress factors such as capture and translocations, animals often develop fatal clinical piroplasmosis, which causes death if not treated. This study examines the genetic diversity and occurrence of novel Theileria species infecting both black and white rhinoceros in Kenya. Samples collected opportunistically during routine translocations and clinical interventions from 15 rhinoceros were analysed by polymerase chain reaction (PCR) using a nested amplification of the small subunit ribosomal RNA (18S rRNA) gene fragments of Babesia and Theileria. Our study revealed for the first time in Kenya the presence of Theileria bicornis in white (Ceratotherium simum simum) and black (Diceros bicornis michaeli) rhinoceros and the existence of three new haplotypes: haplotypes H1 and H3 were present in white rhinoceros, while H2 was present in black rhinoceros. No specific haplotype was correlated to any specific geographical location. The Bayesian inference 50% consensus phylogram recovered the three haplotypes monophyleticly, and Theileria bicornis had very high support (BPP: 0.98). Furthermore, the genetic p-uncorrected distances and substitutions between T. bicornis and the three haplotypes were the same in all three haplotypes, indicating a very close genetic affinity. This is the first report of the occurrence of Theileria species in white and black rhinoceros from Kenya. The three new haplotypes reported here for the first time have important ecological and conservational implications, especially for population management and translocation programs and as a means of avoiding the transport of infected animals into non-affected areas. © 2014 Blackwell Verlag GmbH.
Intricacies in arrangement of SNP haplotypes suggest "Great Admixture" that created modern humans.
Dutta, Rajib; Mainsah, Joseph; Yatskiv, Yuriy; Chakrabortty, Sharmistha; Brennan, Patrick; Khuder, Basil; Qiu, Shuhao; Fedorova, Larisa; Fedorov, Alexei
2017-06-05
Inferring history from genomic sequences is challenging and problematic because chromosomes are mosaics of thousands of small Identicalby-descent (IBD) fragments, each of them having their own unique story. However, the main events in recent evolution might be deciphered from comparative analysis of numerous loci. A paradox of why humans, whose effective population size is only 10 4 , have nearly three million frequent SNPs is formulated and examined. We studied 5398 loci evenly covering all human autosomes. Common haplotypes built from frequent SNPs that are present in people from various populations have been examined. We demonstrated highly non-random arrangement of alleles in common haplotypes. Abundance of mutually exclusive pairs of common haplotypes that have different alleles at every polymorphic position (so-called Yin/Yang haplotypes) was found in 56% of loci. A novel widely spread category of common haplotypes named Mosaic has been described. Mosaic consists of numerous pieces of Yin/Yang haplotypes and represents an ancestral stage of one of them. Scenarios of possible appearance of large number of frequent human SNPs and their habitual arrangement in Yin/Yang common haplotypes have been evaluated with an advanced genomic simulation algorithm. Computer modeling demonstrated that the observed arrangement of 2.9 million frequent SNPs could not originate from a sole stand-alone population. A "Great Admixture" event has been proposed that can explain peculiarities with frequent SNP distributions. This Great Admixture presumably occurred 100-300 thousand years ago between two ancestral populations that had been separated from each other about a million years ago. Our programs and algorithms can be applied to other species to perform evolutionary and comparative genomics.
FamLBL: detecting rare haplotype disease association based on common SNPs using case-parent triads.
Wang, Meng; Lin, Shili
2014-09-15
In recent years, there has been an increasing interest in using common single-nucleotide polymorphisms (SNPs) amassed in genome-wide association studies to investigate rare haplotype effects on complex diseases. Evidence has suggested that rare haplotypes may tag rare causal single-nucleotide variants, making SNP-based rare haplotype analysis not only cost effective, but also more valuable for detecting causal variants. Although a number of methods for detecting rare haplotype association have been proposed in recent years, they are population based and thus susceptible to population stratification. We propose family-triad-based logistic Bayesian Lasso (famLBL) for estimating effects of haplotypes on complex diseases using SNP data. By choosing appropriate prior distribution, effect sizes of unassociated haplotypes can be shrunk toward zero, allowing for more precise estimation of associated haplotypes, especially those that are rare, thereby achieving greater detection power. We evaluate famLBL using simulation to gauge its type I error and power. Compared with its population counterpart, LBL, highlights famLBL's robustness property in the presence of population substructure. Further investigation by comparing famLBL with Family-Based Association Test (FBAT) reveals its advantage for detecting rare haplotype association. famLBL is implemented as an R-package available at http://www.stat.osu.edu/∼statgen/SOFTWARE/LBL/. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
The putative oncogene Pim-1 in the mouse: its linkage and variation among t haplotypes.
Nadeau, J H; Phillips, S J
1987-11-01
Pim-1, a putative oncogene involved in T-cell lymphomagenesis, was mapped between the pseudo-alpha globin gene Hba-4ps and the alpha-crystallin gene Crya-1 on mouse chromosome 17 and therefore within the t complex. Pim-1 restriction fragment variants were identified among t haplotypes. Analysis of restriction fragment sizes obtained with 12 endonucleases demonstrated that the Pim-1 genes in some t haplotypes were indistinguishable from the sizes for the Pim-1b allele in BALB/c inbred mice. There are now three genes, Pim-1, Crya-1 and H-2 I-E, that vary among independently derived t haplotypes and that have indistinguishable alleles in t haplotypes and inbred strains. These genes are closely linked within the distal inversion of the t complex. Because it is unlikely that these variants arose independently in t haplotypes and their wild-type homologues, we propose that an exchange of chromosomal segments, probably through double crossingover, was responsible for indistinguishable Pim-1 genes shared by certain t haplotypes and their wild-type homologues. There was, however, no apparent association between variant alleles of these three genes among t haplotypes as would be expected if a single exchange introduced these alleles into t haplotypes. If these variant alleles can be shown to be identical to the wild-type allele, then lack of association suggests that multiple exchanges have occurred during the evolution of the t complex.
Salem, Rany M; Wessel, Jennifer; Schork, Nicholas J
2005-03-01
Interest in the assignment and frequency analysis of haplotypes in samples of unrelated individuals has increased immeasurably as a result of the emphasis placed on haplotype analyses by, for example, the International HapMap Project and related initiatives. Although there are many available computer programs for haplotype analysis applicable to samples of unrelated individuals, many of these programs have limitations and/or very specific uses. In this paper, the key features of available haplotype analysis software for use with unrelated individuals, as well as pooled DNA samples from unrelated individuals, are summarised. Programs for haplotype analysis were identified through keyword searches on PUBMED and various internet search engines, a review of citations from retrieved papers and personal communications, up to June 2004. Priority was given to functioning computer programs, rather than theoretical models and methods. The available software was considered in light of a number of factors: the algorithm(s) used, algorithm accuracy, assumptions, the accommodation of genotyping error, implementation of hypothesis testing, handling of missing data, software characteristics and web-based implementations. Review papers comparing specific methods and programs are also summarised. Forty-six haplotyping programs were identified and reviewed. The programs were divided into two groups: those designed for individual genotype data (a total of 43 programs) and those designed for use with pooled DNA samples (a total of three programs). The accuracy of programs using various criteria are assessed and the programs are categorised and discussed in light of: algorithm and method, accuracy, assumptions, genotyping error, hypothesis testing, missing data, software characteristics and web implementation. Many available programs have limitations (eg some cannot accommodate missing data) and/or are designed with specific tasks in mind (eg estimating haplotype frequencies rather than
Choi, Seong Yeol; Kim, Sooyeon; Lyuck, Sungsoo; Kim, Seung Bum; Mitchell, Robert J.
2015-01-01
A violacein-producing bacterial strain was isolated and identified as a relative of Duganella violaceinigra YIM 31327 based upon phylogenetic analyses using the 16S rRNA, gyrB and vioA gene sequences and a fatty acid methyl ester (FAME) analysis. This new strain was designated D. violaceinigra str. NI28. Although these two strains appear related based upon these analyses, the new isolate was phenotypically different from the type strain as it grew 25% faster on nutrient media and produced 45-fold more violacein. When compared with several other violacein producing strains, including Janthinobacterium lividum, D. violaceinigra str. NI28 was the best violacein producer. For instance, the crude violacein yield with D. violaceinigra str. NI28 was 6.0 mg/OD at 24 hours, a value that was more than two-fold higher than all the other strains. Finally, the antibacterial activity of D. violaceinigra str. NI28 crude violacein was assayed using several multidrug resistant Staphylococcus aureus. Addition of 30 μM crude violacein led to a 96% loss in the initial S. aureus population while the minimum inhibitory concentration was 1.8 μM. Consequently, this novel isolate represents a phenotypic variant of D. violaceinigra capable of producing much greater quantities of crude violacein, an antibiotic effective against multidrug resistant S. aureus. PMID:26489441
Choi, Seong Yeol; Kim, Sooyeon; Lyuck, Sungsoo; Kim, Seung Bum; Mitchell, Robert J
2015-10-22
A violacein-producing bacterial strain was isolated and identified as a relative of Duganella violaceinigra YIM 31327 based upon phylogenetic analyses using the 16S rRNA, gyrB and vioA gene sequences and a fatty acid methyl ester (FAME) analysis. This new strain was designated D. violaceinigra str. NI28. Although these two strains appear related based upon these analyses, the new isolate was phenotypically different from the type strain as it grew 25% faster on nutrient media and produced 45-fold more violacein. When compared with several other violacein producing strains, including Janthinobacterium lividum, D. violaceinigra str. NI28 was the best violacein producer. For instance, the crude violacein yield with D. violaceinigra str. NI28 was 6.0 mg/OD at 24 hours, a value that was more than two-fold higher than all the other strains. Finally, the antibacterial activity of D. violaceinigra str. NI28 crude violacein was assayed using several multidrug resistant Staphylococcus aureus. Addition of 30 μM crude violacein led to a 96% loss in the initial S. aureus population while the minimum inhibitory concentration was 1.8 μM. Consequently, this novel isolate represents a phenotypic variant of D. violaceinigra capable of producing much greater quantities of crude violacein, an antibiotic effective against multidrug resistant S. aureus.
Boulanger, Jérôme; Muresan, Leila; Tiemann-Boege, Irene
2012-01-01
In spite of the many advances in haplotyping methods, it is still very difficult to characterize rare haplotypes in tissues and different environmental samples or to accurately assess the haplotype diversity in large mixtures. This would require a haplotyping method capable of analyzing the phase of single molecules with an unprecedented throughput. Here we describe such a haplotyping method capable of analyzing in parallel hundreds of thousands single molecules in one experiment. In this method, multiple PCR reactions amplify different polymorphic regions of a single DNA molecule on a magnetic bead compartmentalized in an emulsion drop. The allelic states of the amplified polymorphisms are identified with fluorescently labeled probes that are then decoded from images taken of the arrayed beads by a microscope. This method can evaluate the phase of up to 3 polymorphisms separated by up to 5 kilobases in hundreds of thousands single molecules. We tested the sensitivity of the method by measuring the number of mutant haplotypes synthesized by four different commercially available enzymes: Phusion, Platinum Taq, Titanium Taq, and Phire. The digital nature of the method makes it highly sensitive to detecting haplotype ratios of less than 1:10,000. We also accurately quantified chimera formation during the exponential phase of PCR by different DNA polymerases.
Ando, A; Imaeda, N; Ohshima, S; Miyamoto, A; Kaneko, N; Takasu, M; Shiina, T; Kulski, J K; Inoko, H; Kitagawa, H
2014-12-01
Microminipigs are extremely small-sized, novel miniature pigs that were recently developed for medical research. The inbred Microminipigs with defined swine leukocyte antigen (SLA) haplotypes are expected to be useful for allo- and xenotransplantation studies and also for association analyses between SLA haplotypes and immunological traits. To establish SLA-defined Microminipig lines, we characterized the polymorphic SLA alleles for three class I (SLA-1, SLA-2 and SLA-3) and two class II (SLA-DRB1 and SLA-DQB1) genes of 14 parental Microminipigs using a high-resolution nucleotide sequence-based typing method. Eleven class I and II haplotypes, including three recombinant haplotypes, were found in the offspring of the parental Microminipigs. Two class I and class II haplotypes, Hp-31.0 (SLA-1*1502-SLA-3*070102-SLA-2*1601) and Hp-0.37 (SLA-DRB1*0701-SLA-DQB1*0502), are novel and have not so far been reported in other pig breeds. Crossover regions were defined by the analysis of 22 microsatellite markers within the SLA class III region of three recombinant haplotypes. The SLA allele and haplotype information of Microminipigs in this study will be useful to establish SLA homozygous lines including three recombinants for transplantation and immunological studies. © 2014 Stichting International Foundation for Animal Genetics.
Haplotype analysis of the apolipoprotein A5 gene in obese pediatric patients.
Horvatovich, Katalin; Bokor, Szilvia; Baráth, Akos; Maász, Anita; Kisfali, Péter; Járomi, Luca; Polgár, Noémi; Tóth, Dénes; Répásy, Judit; Endreffy, Emoke; Molnár, Dénes; Melegh, Béla
2011-06-01
Apolipoprotein A5 (APOA5) gene variants have been shown to be associated with elevated TG levels; the T-1131C (rs662799) variant has been reported to confer risk for the metabolic syndrome in adult populations. Little is known about the APOA5 variants in pediatric population, no such information is available for pediatric obesity at all. Here we examined four haplotype-tagging polymorphisms (T-1131C, IVS3 + G476A [rs2072560], T1259C [rs2266788] and C56G [rs3135506]) and studied also the frequency of major naturally occurring haplotypes of APOA5 in obese children. The polymorphisms were analyzed in 232 obese children, and in 137 healthy, normal weight controls, using PCR-RFLP methods. In the pediatric patients we could confirm the already known adult subjects based association of -1131C, IVS3 + 476A and 1259C variants with elevated triglyceride concentrations, both in obese patients and in the controls. The prevalence of the APOA5*2 haplotype (containing the minor allele of T-1131C, IVS3 + G476A and T1259C SNPs together) was 15.5% in obese children, and 5.80% in the controls (p<0.001); multiple logistic regression analysis revealed that this haplotype confers susceptibility for development of obesity (OR=2.87; 95% CI: 1.29-6.37; p≤0.01). By contrast, the APOA5*4 haplotype (with -1131C alone) did not show similar associations. Our findings also suggest that the APOA5*5 haplotype (1259C alone) can be protective against obesity (OR=0.25; 95% CI: 0.07-0.80; p<0.05). While previous studies in adults demonstrated, that the APOA5 -1131C minor allele confers risk for adult metabolic syndrome, here we show, that the susceptibility nature of this SNP restricted to the APOA5*2 haplotype in pediatric obese subjects.
Neural network modeling of drying of rice in BAU-STR dryer
NASA Astrophysics Data System (ADS)
Alam, Md. Ashraful; Saha, Chayan Kumer; Alam, Md. Monjurul; Ashraf, Md. Ali; Bala, Bilash Kanti; Harvey, Jagger
2018-05-01
The experimental performance and artificial neural network modeling of rice drying in BAU-STR dryer is presented in this paper. The dryer consists of a biomass stove as a heat source, a perforated inner bin and a perforated outer bin with annular space for grains, and a blower (1 hp) to supply heated air. The dryer capacity was 500 kg of freshly harvested rice. Twenty experimental runs were conducted to investigate the experimental performance of the dryer for drying of rice. An independent multilayer neural network approach was used to predict the performance of the BAU-STR dryer for drying of rice. Ten sets of experimental data were used for training using back propagation algorithm and another ten sets of data were used for testing the artificial neural network model. The prediction of the performance of the dryer was found to be excellent after it was adequately trained. The statistical analysis showed that the errors (MSE and RMSE) were within and acceptable range of ±5% with a coefficient of determination (R2) of 99%. The model can be used to predict the potential of the dryer for different locations, and can also be used in a predictive optimal control algorithm.
The geographic mosaic of Ecuadorian Y-chromosome ancestry.
Toscanini, U; Gaviria, A; Pardo-Seco, J; Gómez-Carballa, A; Moscoso, F; Vela, M; Cobos, S; Lupero, A; Zambrano, A K; Martinón-Torres, F; Carabajo-Marcillo, A; Yunga-León, R; Ugalde-Noritz, N; Ordoñez-Ugalde, A; Salas, A
2018-03-01
Ecuadorians originated from a complex mixture of Native American indigenous people with Europeans and Africans. We analyzed Y-chromosome STRs (Y-STRs) in a sample of 415 Ecuadorians (145 using the AmpFlSTR ® Yfiler™ system [Life Technologies, USA] and 270 using the PowerPlex ® Y23 system [Promega Corp., USA]; hereafter Yfiler and PPY23, respectively) representing three main ecological continental regions of the country, namely Amazon rainforest, Andes, and Pacific coast. Diversity values are high in the three regions, and the PPY23 exhibits higher discrimination power than the Yfiler set. While summary statistics, AMOVA, and R ST distances show low to moderate levels of population stratification, inferred ancestry derived from Y-STRs reveal clear patterns of geographic variation. The major ancestry in Ecuadorian males is European (61%), followed by an important Native American component (34%); whereas the African ancestry (5%) is mainly concentrated in the Northwest corner of the country. We conclude that classical procedures for measuring population stratification do not have the desirable sensitivity. Statistical inference of ancestry from Y-STRS is a satisfactory alternative for revealing patterns of spatial variation that would pass unnoticed when using popular statistical summary indices. Copyright © 2017 Elsevier B.V. All rights reserved.
Global selection on sucrose synthase haplotypes during a century of wheat breeding.
Hou, Jian; Jiang, Qiyan; Hao, Chenyang; Wang, Yuquan; Zhang, Hongna; Zhang, Xueyong
2014-04-01
Spike number per unit area, number of grains per spike, and thousand kernel weight (TKW) are important yield components. In China, increases in wheat (Triticum aestivum) yields are mainly due to increases in grain number per spike and TKW. TKW mainly depends on starch content, as starch accounts for about 70% of the grain endosperm. Sucrose synthase catalysis is the first step in the conversion of sucrose to starch, that is, the conversion of sucrose to fructose and UDP-glucose by the wheat sucrose synthase genes (TaSus1 and TaSus2) that are located on chromosomes 7A/7B/7D and 2A/2B/2D, respectively. A total of 1,520 wheat accessions were genotyped at the six loci. Two, two, five, and two haplotypes were identified at the TaSus2-2A, TaSus2-2B, TaSus1-7A, and TaSus1-7B loci, respectively. Their main variations were detected within the introns. Significant differences between the haplotypes correlated with TKW differences among 348 modern Chinese cultivars from the core collection. Frequency changes for favored haplotypes showed gradual increases in cultivars released since beginning of the last century in China, Europe, and North America. Geographic distributions and time changes of favored haplotypes were characterized in six major wheat production regions worldwide. Strong selection bottlenecks to haplotype variations occurred at polyploidization and domestication and during breeding of wheat. Genetic-effect differences between haplotypes at the same locus influence the selection time and intensity. This work shows that the endosperm starch synthesis pathway is a major target of indirect selection in global wheat breeding for higher yield.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kanukollu, Saranya; Voget, Sonja; Pohlner, Marion
Shimia strain SK013 is an aerobic, Gram-negative, rod shaped alphaproteobacterium affiliated with the Roseobacter group within the family Rhodobacteraceae. The strain was isolated from surface sediment (0-1 cm) of the Skagerrak at 114 m below sea level. The 4,049,808 bp genome of Shimia str. SK013 comprises 3,981 protein-coding genes and 47 RNA genes. It contains one chromosome and no extrachromosomal elements. The genome analysis revealed the presence of genes for a dimethylsulfoniopropionate lyase, demethylase and the trimethylamine methyltransferase ( mttB) as well as genes for nitrate, nitrite and dimethyl sulfoxide reduction. This indicates that Shimia str. SK013 is able tomore » switch from aerobic to anaerobic metabolism and thus is capable of aerobic and anaerobic sulfur cycling at the seafloor. Among the ability to convert other sulfur compounds it has the genetic capacity to produce climatically active dimethyl sulfide. Growth on glutamate as a sole carbon source results in formation of cell-connecting filaments, a putative phenotypic adaptation of the surface-associated strain to the environmental conditions at the seafloor. Genome analysis revealed the presence of a flagellum ( fla1) and a type IV pilus biogenesis, which is speculated to be a prerequisite for biofilm formation. This is also related to genes responsible for signalling such as N-acyl homoserine lactones, as well as quip-genes responsible for quorum quenching and antibiotic biosynthesis. Pairwise similarities of 16S rRNA genes (98.56 % sequence similarity to the next relative S. haliotis) and the in silico DNA-DNA hybridization (21.20 % sequence similarity to S. haliotis) indicated Shimia str. SK013 to be considered as a new species. In conclusion, the genome analysis of Shimia str. SK013 offered first insights into specific physiological and phenotypic adaptation mechanisms of Roseobacter-affiliated bacteria to the benthic environment.« less
Kanukollu, Saranya; Voget, Sonja; Pohlner, Marion; ...
2016-03-12
Shimia strain SK013 is an aerobic, Gram-negative, rod shaped alphaproteobacterium affiliated with the Roseobacter group within the family Rhodobacteraceae. The strain was isolated from surface sediment (0-1 cm) of the Skagerrak at 114 m below sea level. The 4,049,808 bp genome of Shimia str. SK013 comprises 3,981 protein-coding genes and 47 RNA genes. It contains one chromosome and no extrachromosomal elements. The genome analysis revealed the presence of genes for a dimethylsulfoniopropionate lyase, demethylase and the trimethylamine methyltransferase ( mttB) as well as genes for nitrate, nitrite and dimethyl sulfoxide reduction. This indicates that Shimia str. SK013 is able tomore » switch from aerobic to anaerobic metabolism and thus is capable of aerobic and anaerobic sulfur cycling at the seafloor. Among the ability to convert other sulfur compounds it has the genetic capacity to produce climatically active dimethyl sulfide. Growth on glutamate as a sole carbon source results in formation of cell-connecting filaments, a putative phenotypic adaptation of the surface-associated strain to the environmental conditions at the seafloor. Genome analysis revealed the presence of a flagellum ( fla1) and a type IV pilus biogenesis, which is speculated to be a prerequisite for biofilm formation. This is also related to genes responsible for signalling such as N-acyl homoserine lactones, as well as quip-genes responsible for quorum quenching and antibiotic biosynthesis. Pairwise similarities of 16S rRNA genes (98.56 % sequence similarity to the next relative S. haliotis) and the in silico DNA-DNA hybridization (21.20 % sequence similarity to S. haliotis) indicated Shimia str. SK013 to be considered as a new species. In conclusion, the genome analysis of Shimia str. SK013 offered first insights into specific physiological and phenotypic adaptation mechanisms of Roseobacter-affiliated bacteria to the benthic environment.« less
Aldehyde dehydrogenase-2 genotypes and HLA haplotypes in Japanese patients with esophageal cancer.
Watanabe, Seishiro; Sasahara, Katsuyuki; Kinekawa, Fumihiko; Uchida, Naohito; Masaki, Tsutomu; Kurokohchi, Kazutaka; Murota, Masayuki; Touge, Tetsuo; Kawauchi, Kazuyoshi; Oda, Syuji; Kuriyama, Shigeki
2002-01-01
The aim of this study was to examine how aldehyde dehydrogenase-2 (ALDH2) genotypes and human leukocyte antigen (HLA) haplotypes contribute to the risk for esophageal cancer. We examined ALDH2 genotypes and HLA haplotypes in 29 Japanese patients with esophageal cancer. The ratio of patients who experienced current or former intense vasodilatation upon consuming alcohol (flushing type) was much higher in individuals with the inactive form of ALDH2 encoded by the ALDH2(2)/2(2) or ALDH2(1)/2(2) genotype than in those with the active form of ALDH2 encoded by the ALDH2(1)/2(1) genotype. The ratio of inactive ALDH2 was significantly higher in patients with esophageal cancer than in control normal subjects, suggesting that alcoholics with inactive ALDH2 were susceptible to esophageal cancer. HLA haplotypes A24, A26, B54, B61 and DR9 were prevalent in patients with esophageal cancer (82.8, 24.1, 34.5, 37.9 and 44.8%, respectively). HLA haplotype of A24 and inactive ALDH2 were simultaneously found in 58.6% of patients with esophageal cancer. Furthermore, we found other primary malignancies in 6 of 29 (20.7%) patients with esophageal cancer, and 4 of these 6 patients had both the inactive form of ALDH2 and the HLA A24 haplotype. The present study showed the high prevalence of the inactive form of ALDH2 and HLA haplotypes A24, A26, B54, B61 and DR9 in Japanese patients with esophageal cancer. Therefore, the examination of genotypes of ALDH2 loci and HLA haplotypes may allow the early detection of esophageal cancer in the Japanese population.
Association between β2-adrenoceptor (ADRB2) haplotypes and insulin resistance in PCOS.
Tellechea, Mariana L; Muzzio, Damián O; Iglesias Molli, Andrea E; Belli, Susana H; Graffigna, Mabel N; Levalle, Oscar A; Frechtel, Gustavo D; Cerrone, Gloria E
2013-04-01
The aim of this study was to explore β2-adrenoceptor (ADRB2) haplotype associations with phenotypes and quantitative traits related to insulin resistance (IR) and the metabolic syndrome (MS) in a polycystic ovary syndrome (PCOS) population. A secondary purpose was to assess the association between ADRB2 haplotype and PCOS. Genetic polymorphism analysis. Cross-sectional case-control association study. Medical University Hospital and research laboratory. One hundred and sixty-five unrelated women with PCOS and 116 unrelated women without PCOS (control sample). Clinical and biochemical measurements, and ADRB2 genotyping in PCOS patients and control subjects. ADRB2 haplotypes (comprising rs1042711, rs1801704, rs1042713 and rs1042714 in that order), genotyping and statistical analysis to evaluate associations with continuous variables and traits related to IR and MS in a PCOS population. Associations between ADRB2 haplotypes and PCOS were also assessed. We observed an age-adjusted association between ADRB2 haplotype CCGG and lower insulin (P = 0·018) and HOMA (P = 0·008) in the PCOS sample. Interestingly, the expected differences in surrogate measures of IR between cases and controls were not significant in CCGG/CCGG carriers. In the case-control study, genotype CCGG/CCGG was associated with a 14% decrease in PCOS risk (P = 0·043), taking into account confounding variables. Haplotype I (CCGG) has a protective role for IR and MS in PCOS. © 2012 Blackwell Publishing Ltd.
RENT+: an improved method for inferring local genealogical trees from haplotypes with recombination
Mirzaei, Sajad; Wu, Yufeng
2017-01-01
Abstract Motivation: Haplotypes from one or multiple related populations share a common genealogical history. If this shared genealogy can be inferred from haplotypes, it can be very useful for many population genetics problems. However, with the presence of recombination, the genealogical history of haplotypes is complex and cannot be represented by a single genealogical tree. Therefore, inference of genealogical history with recombination is much more challenging than the case of no recombination. Results: In this paper, we present a new approach called RENT+ for the inference of local genealogical trees from haplotypes with the presence of recombination. RENT+ builds on a previous genealogy inference approach called RENT, which infers a set of related genealogical trees at different genomic positions. RENT+ represents a significant improvement over RENT in the sense that it is more effective in extracting information contained in the haplotype data about the underlying genealogy than RENT. The key components of RENT+ are several greatly enhanced genealogy inference rules. Through simulation, we show that RENT+ is more efficient and accurate than several existing genealogy inference methods. As an application, we apply RENT+ in the inference of population demographic history from haplotypes, which outperforms several existing methods. Availability and Implementation: RENT+ is implemented in Java, and is freely available for download from: https://github.com/SajadMirzaei/RentPlus. Contacts: sajad@engr.uconn.edu or ywu@engr.uconn.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:28065901
Gu, Chao; Liu, Qing-Zhong; Yang, Ya-Nan; Zhang, Shu-Jun; Khan, Muhammad Awais; Wu, Jun; Zhang, Shao-Ling
2013-01-01
The breakdown of self-incompatibility, which could result from the accumulation of non-functional S-haplotypes or competitive interaction between two different functional S-haplotypes, has been studied extensively at the molecular level in tetraploid Rosaceae species. In this study, two tetraploid Chinese cherry (Prunus pseudocerasus) cultivars and one diploid sweet cherry (Prunus avium) cultivar were used to investigate the ploidy of pollen grains and inheritance of pollen-S alleles. Genetic analysis of the S-genotypes of two intercross-pollinated progenies showed that the pollen grains derived from Chinese cherry cultivars were hetero-diploid, and that the two S-haplotypes were made up of every combination of two of the four possible S-haplotypes. Moreover, the distributions of single S-haplotypes expressed in self- and intercross-pollinated progenies were in disequilibrium. The number of individuals of the two different S-haplotypes was unequal in two self-pollinated and two intercross-pollinated progenies. Notably, the number of individuals containing two different S-haplotypes (S1- and S5-, S5- and S8-, S1- and S4-haplotype) was larger than that of other individuals in the two self-pollinated progenies, indicating that some of these hetero-diploid pollen grains may have the capability to inactivate stylar S-RNase inside the pollen tube and grow better into the ovaries. PMID:23596519
Yao, Yining; Yang, Qinrui; Shao, Chengchen; Liu, Baonian; Zhou, Yuxiang; Xu, Hongmei; Zhou, Yueqin; Tang, Qiqun; Xie, Jianhui
2018-01-01
Rare variants are widely observed in human genome and sequence variations at primer binding sites might impair the process of PCR amplification resulting in dropouts of alleles, named as null alleles. In this study, 5 cases from routine paternity testing using PowerPlex ® 21 System for STR genotyping were considered to harbor null alleles at TH01, FGA, D5S818, D8S1179, and D16S539, respectively. The dropout of alleles was confirmed by using alternative commercial kits AGCU Expressmarker 22 PCR amplification kit and AmpFℓSTR ® . Identifiler ® Plus Kit, and sequencing results revealed a single base variation at the primer binding site of each STR locus. Results from the collection of previous reports show that null alleles at D5S818 were frequently observed in population detected by two PowerPlex ® typing systems and null alleles at D19S433 were mostly observed in Japanese population detected by two AmpFℓSTR™ typing systems. Furthermore, the most popular mutation type appeared the transition from C to T with G to A, which might have a potential relationship with DNA methylation. Altogether, these results can provide helpful information in forensic practice to the elimination of genotyping discrepancy and the development of primer sets. Copyright © 2017 Elsevier B.V. All rights reserved.
African gene flow to north Brazil as revealed by HBB*S gene haplotype analysis.
Lemos Cardoso, Greice; Farias Guerreiro, João
2006-01-01
Haplotypes linked to the HBB*S gene were analyzed in a sample of 260 chromosomes of Brazilian sickle cell anemia patients from the population of Belém, state of Pará, to evaluate if the present-day haplotype frequencies correlate as well as expected with historical information on the geographic origin of African slaves sent directly to Northern Brazil. The HBB*S gene haplotype distribution (66% Bantu, 21.8% Benin, 10.9% Senegal, and 1.3% Cameroon) is in agreement with those observed for other Brazilian populations regarding the highest proportion of the Bantu type, followed by the Benin type, but it differs significantly concerning the Senegal type as this haplotype is rare or absent in samples from other Brazilian regions already studied. In addition, our results are in accordance with historical records that establish that about 90% of the slaves sent to Northern Brazil were from Angola, Congo, and Mozambique, where the Bantu haplotype predominates, in contrast to 10% of slaves from Senegambia, Guine-Bissau, and Cape Verde, where the Senegal haplotype is the most common. On the other hand, the observed frequency of the Benin haplotype in Belém was much higher than that expected by historical data. This fact corroborates the suggestion that the high prevalence of the Benin type in Belém is due to domestic slave trade and later internal migrations, mainly from the Northeast, since there are no historical records of direct slave trade from Central West Africa to North Brazil. Am. J. Hum. Biol. 18:93-98, 2006. (c) 2005 Wiley-Liss, Inc.
Genetic distribution of 15 autosomal STR markers in the Punjabi population of Pakistan.
Shan, Muhammad Adnan; Hussain, Manzoor; Shafique, Muhammad; Shahzad, Muhammad; Perveen, Rukhsana; Idrees, Muhammad
2016-11-01
Genetic diversity of 15 autosomal short tandem repeat (STR) loci was evaluated in 713 unrelated individual samples of a Punjabi population of Pakistan. These loci were scrutinized to establish allelic frequencies and statistical parameters of forensic and paternity interests. A total of 165 alleles were observed with the corresponding allele frequencies ranging from 0.001 to 0.446. D2S1338 was found as the most informative locus while TPOX (0.611) was the least discriminating locus. The combined power of discrimination (CPD), the combined probability of exclusion (CPE), and cumulative probability of matching (CPM) were found equaled to 0.999999999999999998606227424808, 0.999995777557989, and 1.37543 × 10-18, respectively. All the loci followed the Hardy-Weinberg equilibrium after the Bonferroni correction (p < 0.0033) except one locus D3S1358. The study revealed that these STR loci are highly polymorphic, suitable for forensic and parentage analyses. In comparison to different populations (Asians and non-Asians), significant differences were recorded for these loci.
Ruaño, Gualberto; Kocherla, Mohan; Graydon, James S; Holford, Theodore R; Makowski, Gregory S; Goethe, John W
2016-05-01
We describe a population genetic approach to compare samples interpreted with expert calling (EC) versus automated calling (AC) for CYP2D6 haplotyping. The analysis represents 4812 haplotype calls based on signal data generated by the Luminex xMap analyzers from 2406 patients referred to a high-complexity molecular diagnostics laboratory for CYP450 testing. DNA was extracted from buccal swabs. We compared the results of expert calls (EC) and automated calls (AC) with regard to haplotype number and frequency. The ratio of EC to AC was 1:3. Haplotype frequencies from EC and AC samples were convergent across haplotypes, and their distribution was not statistically different between the groups. Most duplications required EC, as only expansions with homozygous or hemizygous haplotypes could be automatedly called. High-complexity laboratories can offer equivalent interpretation to automated calling for non-expanded CYP2D6 loci, and superior interpretation for duplications. We have validated scientific expert calling specified by scoring rules as standard operating procedure integrated with an automated calling algorithm. The integration of EC with AC is a practical strategy for CYP2D6 clinical haplotyping. Copyright © 2016 Elsevier B.V. All rights reserved.
Guo, Yu-xin; Chen, Jian-gang; Wang, Yan; Yan, Jiang-wei; Chen, Jing; Yao, Tian-hua; Zhang, Li-ping; Yang, Guang; Meng, Hao-tian; Zhang, Yu-dang; Mei, Ting; Liu, Yao-shun; Dong, Qian; Zhu, Bo-feng
2016-01-01
The population genetic data and forensic parameters of 19 X-chromosome short tandem repeat (X-STR) loci in Chinese Uygur ethnic minority are presented. These loci were detected in a sample of 233 (94 males and 139 females) unrelated healthy individuals. We observed 238 alleles at the 19 X-STR loci, with the corresponding gene frequencies spanning the range from 0.0021 to 0.5644. After Bonferroni correction (P>0.0026), there were no significant deviations from Hardy-Weinberg equilibrium. The cumulative power of discrimination in females and males, and the probability of exclusion of the 19 X-STR loci were 0.999 999 999 999 999 999 998 091, 0.999 999 999 999 966, and 0.999 999 986 35, respectively. The cumulative mean exclusion chance was 0.999 999 992 849 in deficiency cases, 0.999 999 999 999 628 in normal trios, and 0.999 999 998 722 in duo cases. The high value of the forensic parameters mentioned above revealed that the novel panel of 19 loci had important values for forensic applications in the Uygur group. PMID:27143264
Petrovski, K R; Grinberg, A; Williamson, N B; Abdalla, M E; Lopez-Villalobos, N; Parkinson, T J; Tucker, I G; Rapnicki, P
2015-07-01
To compare the antimicrobial susceptibility patterns of three common mastitis pathogens (Staphylococcus aureus, Streptococcus uberis and Str. dysgalactiae) isolated from milk samples from New Zealand and the USA. A total of 182 S. aureus, 126 Str. uberis and 89 Str. dysgalactiae isolates from New Zealand (107, 106 and 41, respectively) and the USA (75, 20 and 48, respectively) were assessed using the disk diffusion test. Susceptibility varied among the bacterial species. All isolates were susceptible to the amoxicillin-clavulanic acid combination. Resistance to lincomycin was most frequent (susceptibility of 8.6%) across all species. Non-susceptible (i.e. resistant or intermediate) isolates of S. aureus were identified for the three non-isoxazolyl penicillins (amoxicillin, ampicillin and penicillin: 20.6% and 36.0%) and lincomycin (99.9% and 94.6%) for NZ and the USA, respectively. Resistance to erythromycin (5.3%) and tetracyclines (6.7%) was detected only in isolates from the USA. There were differences in susceptibility between Str. uberis and Str. dysgalactiae; all streptococcal isolates demonstrated resistance to aminoglycosides (neomycin 52.4% and streptomycin 27.9%) and enrofloxacin (28%). Resistance of Str. dysgalactiae to tetracycline was almost 100.0% and to oxytetracycline 89.9%. Most of the isolates tested were susceptible to most of the antimicrobials commonly used for treatment of bovine mastitis, with the exception of the lincosamides. Susceptibility to a selected class-representative antimicrobial and at the genus level should be interpreted with caution. Differences between NZ and the USA confirm the value of national surveys to determine the susceptibility patterns of mastitis pathogens. © 2015 Australian Veterinary Association.
BAsE-Seq: a method for obtaining long viral haplotypes from short sequence reads.
Hong, Lewis Z; Hong, Shuzhen; Wong, Han Teng; Aw, Pauline P K; Cheng, Yan; Wilm, Andreas; de Sessions, Paola F; Lim, Seng Gee; Nagarajan, Niranjan; Hibberd, Martin L; Quake, Stephen R; Burkholder, William F
2014-01-01
We present a method for obtaining long haplotypes, of over 3 kb in length, using a short-read sequencer, Barcode-directed Assembly for Extra-long Sequences (BAsE-Seq). BAsE-Seq relies on transposing a template-specific barcode onto random segments of the template molecule and assembling the barcoded short reads into complete haplotypes. We applied BAsE-Seq on mixed clones of hepatitis B virus and accurately identified haplotypes occurring at frequencies greater than or equal to 0.4%, with >99.9% specificity. Applying BAsE-Seq to a clinical sample, we obtained over 9,000 viral haplotypes, which provided an unprecedented view of hepatitis B virus population structure during chronic infection. BAsE-Seq is readily applicable for monitoring quasispecies evolution in viral diseases.
Wang, Linsheng; Zeng, Zixian; Zhang, Wenli; Jiang, Jiming
2014-02-01
We report discoveries of different haplotypes associated with the centromeres of three potato chromosomes, including haplotypes composed of long arrays of satellite repeats and haplotypes lacking the same repeats. These results are in favor of the hypothesis that satellite repeat-based centromeres may originate from neocentromeres that lack repeats.
Occurrence of 15 Haplotypes of Linepithema micans (Hymenoptera: Formicidae) in Southern Brazil.
Ramalho, Manuela Oliveira; Martins, C; Campos, T; Nondillo, A; Botton, M; Bueno, O C
2017-08-01
The ant genus Linepithema is widely known, thanks to the pest species Linepithema humile (Mayr), which is easily mistaken for Linepithema micans (Forel) due to their morphological similarity. Like L. humile, L. micans is associated to the main grapevine pest in Brazil, Eurhizococcus brasiliensis (Wille), also known as ground pearl. Therefore, the present study uses mtDNA fragments to expand the knowledge of haplotype diversity and distribution of L. micans in the state of Rio Grande do Sul (Brazil), to understand the genetic differences of the populations identified in this study. We identified 15 haplotypes of L. micans spread across different localities. Twelve of these haplotypes were new for the species. The high haplotype diversity uncovered in Rio Grande do Sul (Brazil) for this species was predictable, as L. micans is in its native environment. Additional studies that take gene flow into account may reveal interesting aspects of diversity in these populations. © The Authors 2017. Published by Oxford University Press on behalf of Entomological Society of America. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Novel Harmful Recessive Haplotypes Identified for Fertility Traits in Nordic Holstein Cattle
Sahana, Goutam; Nielsen, Ulrik Sander; Aamand, Gert Pedersen; Lund, Mogens Sandø; Guldbrandtsen, Bernt
2013-01-01
Using genomic data, lethal recessives may be discovered from haplotypes that are common in the population but never occur in the homozygote state in live animals. This approach only requires genotype data from phenotypically normal (i.e. live) individuals and not from the affected embryos that die. A total of 7,937 Nordic Holstein animals were genotyped with BovineSNP50 BeadChip and haplotypes including 25 consecutive markers were constructed and tested for absence of homozygotes states. We have identified 17 homozygote deficient haplotypes which could be loosely clustered into eight genomic regions harboring possible recessive lethal alleles. Effects of the identified haplotypes were estimated on two fertility traits: non-return rates and calving interval. Out of the eight identified genomic regions, six regions were confirmed as having an effect on fertility. The information can be used to avoid carrier-by-carrier mattings in practical animal breeding. Further, identification of causative genes/polymorphisms responsible for lethal effects will lead to accurate testing of the individuals carrying a lethal allele. PMID:24376603
David, Sean P; Mezuk, Briana; Zandi, Peter P; Strong, David; Anthony, James C; Niaura, Raymond; Uhl, George R; Eaton, William W
2010-03-01
The 11q23.1 genomic region has been associated with nicotine dependence in Black and White Americans. By conducting linkage disequilibrium analyses of 7 informative single nucleotide polymorphisms (SNPs) within the tetratricopeptide repeat domain 12 (TTC12)/ankyrin repeat and kinase containing 1 (ANKK1)/dopamine (D2) receptor gene cluster, we identified haplotype block structures in 270 Black and 368 White (n = 638) participants, from the Baltimore Epidemiologic Catchment Area cohort study, spanning the TTC12 and ANKK1 genes consisting of three SNPs (rs2303380-rs4938015-rs11604671). Informative haplotypes were examined for sex-specific associations with daily tobacco smoking initiation and cessation using longitudinal data from 1993-1994 and 2004-2005 interviews. There was a Haplotype x Sex interaction such that Black men possessing the GTG haplotype who were smokers in 1993-2004 were more likely to have stopped smoking by 2004-2005 (55.6% GTG vs. 22.0% other haplotypes), while Black women were less likely to have quit smoking if they possessed the GTG (20.8%) versus other haplotypes (24.0%; p = .028). In Whites, the GTG haplotype (vs. other haplotypes) was associated with lifetime history of daily smoking (smoking initiation; odds ratio = 1.6; 95% CI = 1.1-2.4; p = .013). Moreover, there was a Haplotype x Sex interaction such that there was higher prevalence of smoking initiation with GTG (77.6%) versus other haplotypes (57.0%; p = .043). In 2 different ethnic American populations, we observed man-woman variation in the influence of the rs2303380-rs4938015-rs11604671 GTG haplotype on smoking initiation and cessation. These results should be replicated in larger cohorts to establish the relationship among the rs2303380-rs4938015-rs11604671 haplotype block, sex, and smoking behavior.
Mirabal, Sheyla; Varljen, Tatjana; Gayden, Tenzin; Regueiro, Maria; Vujovic, Slavica; Popovic, Danica; Djuric, Marija; Stojkovic, Oliver; Herrera, Rene J
2010-07-01
Southeastern Europe and, particularly, the Balkan Peninsula are especially useful when studying the mechanisms responsible for generating the current distribution of Paleolithic and Neolithic genetic signals observed throughout Europe. In this study, 404 individuals from Montenegro and 179 individuals from Serbia were typed for 17 Y-STR loci and compared across 9 Y-STR loci to geographically targeted previously published collections to ascertain the phylogenetic relationships of populations within the Balkan Peninsula and beyond. We aim to provide information on whether groups in the region represent an amalgamation of Paleolithic and Neolithic genetic substrata, or whether acculturation has played a critical role in the spread of agriculture. We have found genetic markers of Middle Eastern, south Asian and European descent in the area, however, admixture analyses indicate that over 80% of the Balkan gene pool is of European descent. Altogether, our data support the view that the diffusion of agriculture into the Balkan region was mostly a cultural phenomenon although some genetic infiltration from Africa, the Levant, the Caucasus, and the Near East has occurred. (c) 2010 Wiley-Liss, Inc.
Delaneau, Olivier; Marchini, Jonathan
2014-06-13
A major use of the 1000 Genomes Project (1000 GP) data is genotype imputation in genome-wide association studies (GWAS). Here we develop a method to estimate haplotypes from low-coverage sequencing data that can take advantage of single-nucleotide polymorphism (SNP) microarray genotypes on the same samples. First the SNP array data are phased to build a backbone (or 'scaffold') of haplotypes across each chromosome. We then phase the sequence data 'onto' this haplotype scaffold. This approach can take advantage of relatedness between sequenced and non-sequenced samples to improve accuracy. We use this method to create a new 1000 GP haplotype reference set for use by the human genetic community. Using a set of validation genotypes at SNP and bi-allelic indels we show that these haplotypes have lower genotype discordance and improved imputation performance into downstream GWAS samples, especially at low-frequency variants.
RENT+: an improved method for inferring local genealogical trees from haplotypes with recombination.
Mirzaei, Sajad; Wu, Yufeng
2017-04-01
: Haplotypes from one or multiple related populations share a common genealogical history. If this shared genealogy can be inferred from haplotypes, it can be very useful for many population genetics problems. However, with the presence of recombination, the genealogical history of haplotypes is complex and cannot be represented by a single genealogical tree. Therefore, inference of genealogical history with recombination is much more challenging than the case of no recombination. : In this paper, we present a new approach called RENT+ for the inference of local genealogical trees from haplotypes with the presence of recombination. RENT+ builds on a previous genealogy inference approach called RENT , which infers a set of related genealogical trees at different genomic positions. RENT+ represents a significant improvement over RENT in the sense that it is more effective in extracting information contained in the haplotype data about the underlying genealogy than RENT . The key components of RENT+ are several greatly enhanced genealogy inference rules. Through simulation, we show that RENT+ is more efficient and accurate than several existing genealogy inference methods. As an application, we apply RENT+ in the inference of population demographic history from haplotypes, which outperforms several existing methods. : RENT+ is implemented in Java, and is freely available for download from: https://github.com/SajadMirzaei/RentPlus . : sajad@engr.uconn.edu or ywu@engr.uconn.edu. : Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Duggan, Ana T; Whitten, Mark; Wiebe, Victor; Crawford, Michael; Butthof, Anne; Spitsyn, Victor; Makarov, Sergey; Novgorodov, Innokentiy; Osakovsky, Vladimir; Pakendorf, Brigitte
2013-01-01
Evenks and Evens, Tungusic-speaking reindeer herders and hunter-gatherers, are spread over a wide area of northern Asia, whereas their linguistic relatives the Udegey, sedentary fishermen and hunter-gatherers, are settled to the south of the lower Amur River. The prehistory and relationships of these Tungusic peoples are as yet poorly investigated, especially with respect to their interactions with neighbouring populations. In this study, we analyse over 500 complete mtDNA genome sequences from nine different Evenk and even subgroups as well as their geographic neighbours from Siberia and their linguistic relatives the Udegey from the Amur-Ussuri region in order to investigate the prehistory of the Tungusic populations. These data are supplemented with analyses of Y-chromosomal haplogroups and STR haplotypes in the Evenks, Evens, and neighbouring Siberian populations. We demonstrate that whereas the North Tungusic Evenks and Evens show evidence of shared ancestry both in the maternal and in the paternal line, this signal has been attenuated by genetic drift and differential gene flow with neighbouring populations, with isolation by distance further shaping the maternal genepool of the Evens. The Udegey, in contrast, appear quite divergent from their linguistic relatives in the maternal line, with a mtDNA haplogroup composition characteristic of populations of the Amur-Ussuri region. Nevertheless, they show affinities with the Evenks, indicating that they might be the result of admixture between local Amur-Ussuri populations and Tungusic populations from the north.
Rood, M; Keijsers, V; van der Linden, M W; Tong, T; Borggreve, S; Verweij, C; Breedveld, F; Huizinga, T
1999-01-01
OBJECTIVE—To investigate the association of interleukin 10 (IL10) promoter polymorphisms and neuropsychiatric manifestations of systemic lupus erythematosus (SLE). METHODS—IL10 haplotypes of 11 healthy volunteers were cloned to confirm that in the Dutch population, only the three common haplotypes (-1082/-819/-592) GCC, ACC and ATA exist. The IL10 promoter polymorphisms of 92 SLE patients and 162 healthy controls were determined. The medical records of the SLE patients were screened for the presence of neuropsychiatric involvement. RESULTS—All cloned haplotypes were either GCC, ACC or ATA. Forty two SLE patients had suffered from neuropsychiatric manifestations (NP-SLE). In NP-SLE patients, the frequency of the ATA haplotype is 30% versus 18% in the controls and 17% in the non-NP-SLE group (odds ratios 1.9, p=0.02, and 2.1, p=0.04, respectively), whereas the GCC haplotype frequency is lower in the NP-SLE group compared with controls and non-NP-SLE patients (40% versus 55% and 61%, odds ratios 0.6, p=0.02 and 0.4 p=0.006). The odds ratio for the presence of NP-SLE is inversely proportional to the number of GCC haplotypes per genotype when the NP-SLE group is compared with non-NP-SLE patients. CONCLUSIONS—The IL10 locus is associated with neuropsychiatric manifestations in SLE. This suggests that IL10 is implicated in the immunopathogenesis of neuropsychiatric manifestations in SLE. Keywords: systemic lupus erythematosus; neuropsychiatric manifestations; genetics; interleukin 10 promoter haplotypes PMID:10343522
mtDNA and Y-chromosome polymorphisms in four Native American populations from southern Mexico.
Torroni, A.; Chen, Y. S.; Semino, O.; Santachiara-Beneceretti, A. S.; Scott, C. R.; Lott, M. T.; Winter, M.; Wallace, D. C.
1994-01-01
mtDNA sequence variation was examined in 60 Native Americans (Mixtecs from the Alta, Mixtecs from the Baja, Valley Zapotecs, and Highland Mixe) from southern Mexico by PCR amplification and high-resolution restriction endonuclease analysis. Four groups of mtDNA haplotypes (haplogroups A, B, C, and D) characterize Amerind populations, but only three (haplogroups A, B, and C) were observed in these Mexican populations. The comparison of their mtDNA variation with that observed in other populations from Mexico and Central America permits a clear distinction among the different Middle American tribes and raises questions about some of their linguistic affiliations. The males of these population samples were also analyzed for Y-chromosome RFLPs with the probes 49a, 49f, and 12f2. This analysis suggests that certain Y-chromosome haplotypes were brought from Asia during the colonization of the Americas, and a differential gene flow was introduced into Native American populations from European males and females. Images Figure 4 PMID:8304347
Divergence at the casein haplotypes in dairy and meat goat breeds.
Küpper, Julia; Chessa, Stefania; Rignanese, Daniela; Caroli, Anna; Erhardt, Georg
2010-02-01
Casein genes have been proved to have an influence on milk properties, and are in addition appropriate for phylogeny studies. A large number of casein polymorphisms exist in goats, making their analysis quite complex. The four casein loci were analyzed by molecular techniques for genetic polymorphism detection in the two dairy goat breeds Bunte Deutsche Edelziege (BDE; n=96), Weisse Deutsche Edelziege (WDE; n=91), and the meat goat breed Buren (n=75). Of the 35 analyzed alleles, 18 were found in BDE, and 17 in Buren goats and WDE. In addition, a new allele was identified at the CSN1S1 locus in the BDE, showing a frequency of 0.05. This variant, named CSN1S1*A', is characterized by a t-->c transversion in intron 9. Linkage disequilibrium was found at the casein haplotype in all three breeds. A total of 30 haplotypes showed frequencies higher than 0.01. In the Buren breed only one haplotype showed a frequency higher than 0.1. The ancestral haplotype B-A-A-B (in the order: CSN1S1-CSN2-CSN1S2-CSN3) occurred in all three breeds, showing a very high frequency (>0.8) in the Buren.
Dehghanian, Fatemeh; Silawi, Mohammad; Tabei, Seyed M B
2017-02-01
Deficiency of phenylalanine hydroxylase (PAH) enzyme and elevation of phenylalanine in body fluids cause phenylketonuria (PKU). The gold standard for confirming PKU and PAH deficiency is detecting causal mutations by direct sequencing of the coding exons and splicing involved sequences of the PAH gene. Furthermore, haplotype analysis could be considered as an auxiliary approach for detecting PKU causative mutations before direct sequencing of the PAH gene by making comparisons between prior detected mutation linked-haplotypes and new PKU case haplotypes with undetermined mutations. In this study, 13 unrelated classical PKU patients took part in the study detecting causative mutations. Mutations were identified by polymerase chain reaction (PCR) and direct sequencing in all patients. After that, haplotype analysis was performed by studying VNTR and PAHSTR markers (linked genetic markers of the PAH gene) through application of PCR and capillary electrophoresis (CE). Mutation analysis was performed successfully and the detected mutations were as follows: c.782G>A, c.754C>T, c.842C>G, c.113-115delTCT, c.688G>A, and c.696A>G. Additionally, PAHSTR/VNTR haplotypes were detected to discover haplotypes linked to each mutation. Mutation detection is the best approach for confirming PAH enzyme deficiency in PKU patients. Due to the relatively large size of the PAH gene and high cost of the direct sequencing in developing countries, haplotype analysis could be used before DNA sequencing and mutation detection for a faster and cheaper way via identifying probable mutated exons.
NASA Astrophysics Data System (ADS)
Sell, Jerzy
2003-11-01
The distribution pattern of mtDNA haplotypes in distinct populations of the glacial relict crustacean Saduria entomon was examined to assess phylogeographic relationships among them. Populations from the Baltic, the White Sea and the Barents Sea were screened for mtDNA variation using PCR-based RFLP analysis of a 1150 bp fragment containing part of the CO I and CO II genes. Five mtDNA haplotypes were recorded. An analysis of geographical heterogeneity in haplotype frequency distributions revealed significant differences among populations. The isolated populations of S. entomon have diverged since the retreat of the last glaciation. The geographical pattern of variation is most likely the result of stochastic (founder effect, genetic drift) mechanisms and suggests that the haplotype differentiation observed is probably older than the isolation of the Baltic and Arctic seas.
[Observation and analysis on mutation of routine STR locus].
Li, Qiu-yang; Feng, Wei-jun; Yang, Qin-gen
2005-05-01
To observe and analyze the characteristic of mutation at STR locus. 27 mutant genes observed in 1211 paternity testing cases were checked by PAGE-silver stained and PowerPlex 16 System Kit and validated by sequencing. Mutant genes locate on 15 loci. The pattern of mutation was accord with stepwise mutation model. The mutation ratio of male-to-female was 8:1 and correlated to the age of father. Mutation rate is correlated to the geometric mean of the number of homogeneous repeats of locus. The higher the mean, the higher the mutation rate. These loci are not so appropriate for use in paternity testing.
Patterns of genetic diversity at the nine forensically approved STR loci in the Indian populations.
Dutta, Ranjan; Reddy, B Mohan; Chattopadhyay, P; Kashyap, V K; Sun, Guangyun; Deka, Ranjan
2002-02-01
Genetic diversity at the nine short tandem repeat (STR) loci, which are universally approved and widely used for forensic investigations, has been studied among nine Indian populations with diverse ethnic, linguistic, and geographic backgrounds. The nine STR loci were profiled on 902 individuals using fluorescent detection methods on an ABI377 System, with the aid of an Amp-F1 Profiler Plus Kit. The studied populations include two upper castes, Brahmin and Kayastha; a tribe, Garo, from West Bengal; a Hindu caste, Meitei, with historical links to Bengal Brahmins; a migrant group of Muslims; three tribal groups, Naga, Kuki and Hmar, from Manipur in northeast India; and a middle-ranking caste, Golla, who are seminomadic herders from Andhra Pradesh. Gene diversity analysis suggests that the average heterozygosity is uniformly high (>0.8) in the studied populations, with the coefficient of gene differentiation at 0.050 +/- 0.0054. Both neighbor-joining (NJ) and unweighted pair group method with arithmetic mean (UPGMA) trees based on DA distances bring out distinct clusters that are consistent with ethnic, linguistic, and/or geographic backgrounds of the populations. The fit of the Harpending and Ward model of regression of average heterozygosity on the gene frequency centroid is found to be good, and the observed outliers are consistent with the population structure and history of the studied populations. Our study suggests that the nine STR loci, used so far mostly for forensic investigations, can be used fruitfully for microevolutionary studies as well, and for reconstructing the phylogenetic history of human populations, at least at the local level.
Curry, Caitlin J.; White, Paula A.; Derr, James N.
2015-01-01
Analysis of DNA sequence diversity at the 12S to 16S mitochondrial genes of 165 African lions (Panthera leo) from five main areas in Zambia has uncovered haplotypes which link Southern Africa with East Africa. Phylogenetic analysis suggests Zambia may serve as a bridge connecting the lion populations in southern Africa to eastern Africa, supporting earlier hypotheses that eastern-southern Africa may represent the evolutionary cradle for the species. Overall gene diversity throughout the Zambian lion population was 0.7319 +/- 0.0174 with eight haplotypes found; three haplotypes previously described and the remaining five novel. The addition of these five novel haplotypes, so far only found within Zambia, nearly doubles the number of haplotypes previously reported for any given geographic location of wild lions. However, based on an AMOVA analysis of these haplotypes, there is little to no matrilineal gene flow (Fst = 0.47) when the eastern and western regions of Zambia are considered as two regional sub-populations. Crossover haplotypes (H9, H11, and Z1) appear in both populations as rare in one but common in the other. This pattern is a possible result of the lion mating system in which predominately males disperse, as all individuals with crossover haplotypes were male. The determination and characterization of lion sub-populations, such as done in this study for Zambia, represent a higher-resolution of knowledge regarding both the genetic health and connectivity of lion populations, which can serve to inform conservation and management of this iconic species. PMID:26674533
Kay, Chris; Tirado-Hurtado, Indira; Cornejo-Olivas, Mario; Collins, Jennifer A; Wright, Galen; Inca-Martinez, Miguel; Veliz-Otani, Diego; Ketelaar, Maria E; Slama, Ramy A; Ross, Colin J; Mazzetti, Pilar; Hayden, Michael R
2017-01-01
Huntington disease (HD) is a dominant neurodegenerative disorder caused by a CAG repeat expansion in the Huntingtin (HTT) gene. HD occurs worldwide, but the causative mutation is found on different HTT haplotypes in distinct ethnic groups. In Latin America, HD is thought to have European origins, but indigenous Amerindian ancestry has not been investigated. Here, we report dense HTT haplotypes in 62 mestizo Peruvian HD families, 17 HD families from across Latin America, and 42 controls of defined Peruvian Amerindian ethnicity to determine the origin of HD in populations of admixed Amerindian and European descent. HD in Peru occurs most frequently on the A1 HTT haplotype (73%), as in Europe, but on an unexpected indigenous variant also found in Amerindian controls. This Amerindian A1 HTT haplotype predominates over the European A1 variant among geographically disparate Latin American controls and in HD families from across Latin America, supporting an indigenous origin of the HD mutation in mestizo American populations. We also show that a proportion of HD mutations in Peru occur on a C1 HTT haplotype of putative Amerindian origin (14%). The majority of HD mutations in Latin America may therefore occur on haplotypes of Amerindian ancestry rather than on haplotypes resulting from European admixture. Despite the distinct ethnic ancestry of Amerindian and European A1 HTT, alleles on the parent A1 HTT haplotype allow for development of identical antisense molecules to selectively silence the HD mutation in the greatest proportion of patients in both Latin American and European populations. PMID:28000697
Curry, Caitlin J; White, Paula A; Derr, James N
2015-01-01
Analysis of DNA sequence diversity at the 12S to 16S mitochondrial genes of 165 African lions (Panthera leo) from five main areas in Zambia has uncovered haplotypes which link Southern Africa with East Africa. Phylogenetic analysis suggests Zambia may serve as a bridge connecting the lion populations in southern Africa to eastern Africa, supporting earlier hypotheses that eastern-southern Africa may represent the evolutionary cradle for the species. Overall gene diversity throughout the Zambian lion population was 0.7319 +/- 0.0174 with eight haplotypes found; three haplotypes previously described and the remaining five novel. The addition of these five novel haplotypes, so far only found within Zambia, nearly doubles the number of haplotypes previously reported for any given geographic location of wild lions. However, based on an AMOVA analysis of these haplotypes, there is little to no matrilineal gene flow (Fst = 0.47) when the eastern and western regions of Zambia are considered as two regional sub-populations. Crossover haplotypes (H9, H11, and Z1) appear in both populations as rare in one but common in the other. This pattern is a possible result of the lion mating system in which predominately males disperse, as all individuals with crossover haplotypes were male. The determination and characterization of lion sub-populations, such as done in this study for Zambia, represent a higher-resolution of knowledge regarding both the genetic health and connectivity of lion populations, which can serve to inform conservation and management of this iconic species.
Deletion analysis of male sterility effects of t-haplotypes in the mouse.
Bennett, D; Artzt, K
1990-01-01
We present data on the effects of three chromosome 17 deletions on transmission ratio distortion (TRD) and sterility of several t-haplotypes. All three deletions have similar effects on male TRD: that is, Tdel/tcomplete genotypes all transmit their t-haplotype in very high proportion. However, each deletion has different effects on sterility of heterozygous males, with TOr/t being fertile, Thp/t less fertile, and TOrl/t still less fertile. These data suggest that wild-type genes on chromosomes homologous to t-haplotypes can be important regulators of both TRD and fertility in males, and that the wild-type genes concerned with TRD and fertility are at least to some extent different. The data also provide a rough map of the positions of these genes.
Duggan, Ana T.; Whitten, Mark; Wiebe, Victor; Crawford, Michael; Butthof, Anne; Spitsyn, Victor; Makarov, Sergey; Novgorodov, Innokentiy; Osakovsky, Vladimir; Pakendorf, Brigitte
2013-01-01
Evenks and Evens, Tungusic-speaking reindeer herders and hunter-gatherers, are spread over a wide area of northern Asia, whereas their linguistic relatives the Udegey, sedentary fishermen and hunter-gatherers, are settled to the south of the lower Amur River. The prehistory and relationships of these Tungusic peoples are as yet poorly investigated, especially with respect to their interactions with neighbouring populations. In this study, we analyse over 500 complete mtDNA genome sequences from nine different Evenk and even subgroups as well as their geographic neighbours from Siberia and their linguistic relatives the Udegey from the Amur-Ussuri region in order to investigate the prehistory of the Tungusic populations. These data are supplemented with analyses of Y-chromosomal haplogroups and STR haplotypes in the Evenks, Evens, and neighbouring Siberian populations. We demonstrate that whereas the North Tungusic Evenks and Evens show evidence of shared ancestry both in the maternal and in the paternal line, this signal has been attenuated by genetic drift and differential gene flow with neighbouring populations, with isolation by distance further shaping the maternal genepool of the Evens. The Udegey, in contrast, appear quite divergent from their linguistic relatives in the maternal line, with a mtDNA haplogroup composition characteristic of populations of the Amur-Ussuri region. Nevertheless, they show affinities with the Evenks, indicating that they might be the result of admixture between local Amur-Ussuri populations and Tungusic populations from the north. PMID:24349531
Haplotype Sharing Provides Insights into Fine-Scale Population History and Disease in Finland.
Martin, Alicia R; Karczewski, Konrad J; Kerminen, Sini; Kurki, Mitja I; Sarin, Antti-Pekka; Artomov, Mykyta; Eriksson, Johan G; Esko, Tõnu; Genovese, Giulio; Havulinna, Aki S; Kaprio, Jaakko; Konradi, Alexandra; Korányi, László; Kostareva, Anna; Männikkö, Minna; Metspalu, Andres; Perola, Markus; Prasad, Rashmi B; Raitakari, Olli; Rotar, Oxana; Salomaa, Veikko; Groop, Leif; Palotie, Aarno; Neale, Benjamin M; Ripatti, Samuli; Pirinen, Matti; Daly, Mark J
2018-05-03
Finland provides unique opportunities to investigate population and medical genomics because of its adoption of unified national electronic health records, detailed historical and birth records, and serial population bottlenecks. We assembled a comprehensive view of recent population history (≤100 generations), the timespan during which most rare-disease-causing alleles arose, by comparing pairwise haplotype sharing from 43,254 Finns to that of 16,060 Swedes, Estonians, Russians, and Hungarians from geographically and linguistically adjacent countries with different population histories. We find much more extensive sharing in Finns, with at least one ≥ 5 cM tract on average between pairs of unrelated individuals. By coupling haplotype sharing with fine-scale birth records from more than 25,000 individuals, we find that although haplotype sharing broadly decays with geographical distance, there are pockets of excess haplotype sharing; individuals from northeast Finland typically share several-fold more of their genome in identity-by-descent segments than individuals from southwest regions. We estimate recent effective population-size changes through time across regions of Finland, and we find that there was more continuous gene flow as Finns migrated from southwest to northeast between the early- and late-settlement regions than was dichotomously described previously. Lastly, we show that haplotype sharing is locally enriched by an order of magnitude among pairs of individuals sharing rare alleles and especially among pairs sharing rare disease-causing variants. Our work provides a general framework for using haplotype sharing to reconstruct an integrative view of recent population history and gain insight into the evolutionary origins of rare variants contributing to disease. Copyright © 2018 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Genetic analysis of 7 medieval skeletons from Aragonese Pyrenees
Núńez, Carolina; Sosa, Cecilia; Baeta, Miriam; Geppert, Maria; Turnbough, Meredith; Phillips, Nicole; Casalod, Yolanda; Bolea, Miguel; Roby, Rhonda; Budowle, Bruce; Martínez-Jarreta, Begońa
2011-01-01
Aim To perform a genetic characterization of 7 skeletons from medieval age found in a burial site in the Aragonese Pyrenees. Methods Allele frequencies of autosomal short tandem repeats (STR) loci were determined by 3 different STR systems. Mitochondrial DNA (mtDNA) and Y-chromosome haplogroups were determined by sequencing of the hypervariable segment 1 of mtDNA and typing of phylogenetic Y chromosome single nucleotide polymorphisms (Y-SNP) markers, respectively. Possible familial relationships were also investigated. Results Complete or partial STR profiles were obtained in 3 of the 7 samples. Mitochondrial DNA haplogroup was determined in 6 samples, with 5 of them corresponding to the haplogroup H and 1 to the haplogroup U5a. Y-chromosome haplogroup was determined in 2 samples, corresponding to the haplogroup R. In one of them, the sub-branch R1b1b2 was determined. mtDNA sequences indicated that some of the individuals could be maternally related, while STR profiles indicated no direct family relationships. Conclusions Despite the antiquity of the samples and great difficulty that genetic analyses entail, the combined use of autosomal STR markers, Y-chromosome informative SNPs, and mtDNA sequences allowed us to genotype a group of skeletons from the medieval age. PMID:21674829
The effects of old and recent migration waves in the distribution of HBB*S globin gene haplotypes
Lindenau, Juliana D.; Wagner, Sandrine C.; de Castro, Simone M.; Hutz, Mara H.
2016-01-01
Abstract Sickle cell hemoglobin is the result of a mutation at the sixth amino acid position of the beta (β) globin chain. The HBB*S gene is in linkage disequilibrium with five main haplotypes in the β-globin-like gene cluster named according to their ethnic and geographic origins: Bantu (CAR), Benin (BEN), Senegal (SEN), Cameroon (CAM) and Arabian-Indian (ARAB). These haplotypes demonstrated that the sickle cell mutation arose independently at least five times in human history. The distribution of βS haplotypes among Brazilian populations showed a predominance of the CAR haplotype. American populations were clustered in two groups defined by CAR or BEN haplotype frequencies. This scenario is compatible with historical records about the slave trade in the Americas. When all world populations where the sickle cell gene occurs were analyzed, three clusters were disclosed based on CAR, BEN or ARAB haplotype predominance. These patterns may change in the next decades due to recent migrations waves. Since these haplotypes show different clinical characteristics, these recent migrations events raise the necessity to develop optimized public health programs for sickle cell disease screening and management. PMID:27706371
Unique haplotypes of cacao trees as revealed by trnH-psbA chloroplast DNA
Gutiérrez-López, Nidia; Ovando-Medina, Isidro; Salvador-Figueroa, Miguel; Molina-Freaner, Francisco; Avendaño-Arrazate, Carlos H.
2016-01-01
Cacao trees have been cultivated in Mesoamerica for at least 4,000 years. In this study, we analyzed sequence variation in the chloroplast DNA trnH-psbA intergenic spacer from 28 cacao trees from different farms in the Soconusco region in southern Mexico. Genetic relationships were established by two analysis approaches based on geographic origin (five populations) and genetic origin (based on a previous study). We identified six polymorphic sites, including five insertion/deletion (indels) types and one transversion. The overall nucleotide diversity was low for both approaches (geographic = 0.0032 and genetic = 0.0038). Conversely, we obtained moderate to high haplotype diversity (0.66 and 0.80) with 10 and 12 haplotypes, respectively. The common haplotype (H1) for both networks included cacao trees from all geographic locations (geographic approach) and four genetic groups (genetic approach). This common haplotype (ancient) derived a set of intermediate haplotypes and singletons interconnected by one or two mutational steps, which suggested directional selection and event purification from the expansion of narrow populations. Cacao trees from Soconusco region were grouped into one cluster without any evidence of subclustering based on AMOVA (FST = 0) and SAMOVA (FST = 0.04393) results. One population (Mazatán) showed a high haplotype frequency; thus, this population could be considered an important reservoir of genetic material. The indels located in the trnH-psbA intergenic spacer of cacao trees could be useful as markers for the development of DNA barcoding. PMID:27076998
HLA-G, -A haplotypes in Amerindians (Ecuador): HLA-G*01:05N World distribution.
Arnaiz-Villena, Antonio; Palacio-Gruber, Jose; Enriquez de Salamanca, Mercedes; Juárez, Ignacio; Campos, Cristina; Nieto, Jorge; Muñiz, Ester; Martin-Villa, Jose Manuel
2018-02-01
HLA-G and HLA-A frequencies have been analysed in Amerindians from Ecuador. HLA-G allele frequencies are found to be closer to those of other Amerindians (Mayas from Guatemala and Uros from Peru) and closer to European ones than to Far East Asians groups, particularly, regarding to HLA-G*01:04 allele. HLA-G/-A haplotypes have been calculated for the first time in Amerindians. It is remarkable that HLA-G*01:05N "null" allele is found in a very low frequency (like in Amerindian Mayas and Uros) and is also found in haplotypes belonging to the HLA-A19 group of alleles (HLA-A*30, -A*31, -A*33). It was previously postulated that HLA-G*01:05N appeared in HLA-A*30/-B*13 haplotypes in Middle East Mediterraneans. It may be hypothesized that in Evolution, HLA-G*01:05N existed primarily in one of the HLA extant or extinct -A19 haplotype, whether this haplotype was placed in Middle East or other World areas, including America. However, the highest present day HLA-G*01:05N frequencies are found in Middle East Mediterraneans. Copyright © 2017. Published by Elsevier Inc.
Niemi, Marianna; Bläuer, Auli; Iso-Touru, Terhi; Nyström, Veronica; Harjula, Janne; Taavitsainen, Jussi-Pekka; Storå, Jan; Lidén, Kerstin; Kantanen, Juha
2013-01-22
Several molecular and population genetic studies have focused on the native sheep breeds of Finland. In this work, we investigated their ancestral sheep populations from Iron Age, Medieval and Post-Medieval periods by sequencing a partial mitochondrial DNA D-loop and the 5'-promoter region of the SRY gene. We compared the maternal (mitochondrial DNA haplotypes) and paternal (SNP oY1) genetic diversity of ancient sheep in Finland with modern domestic sheep populations in Europe and Asia to study temporal changes in genetic variation and affinities between ancient and modern populations. A 523-bp mitochondrial DNA sequence was successfully amplified for 26 of 36 sheep ancient samples i.e. five, seven and 14 samples representative of Iron Age, Medieval and Post-Medieval sheep, respectively. Genetic diversity was analyzed within the cohorts. This ancient dataset was compared with present-day data consisting of 94 animals from 10 contemporary European breeds and with GenBank DNA sequence data to carry out a haplotype sharing analysis. Among the 18 ancient mitochondrial DNA haplotypes identified, 14 were present in the modern breeds. Ancient haplotypes were assigned to the highly divergent ovine haplogroups A and B, haplogroup B being the major lineage within the cohorts. Only two haplotypes were detected in the Iron Age samples, while the genetic diversity of the Medieval and Post-Medieval cohorts was higher. For three of the ancient DNA samples, Y-chromosome SRY gene sequences were amplified indicating that they originated from rams. The SRY gene of these three ancient ram samples contained SNP G-oY1, which is frequent in modern north-European sheep breeds. Our study did not reveal any sign of major population replacement of native sheep in Finland since the Iron Age. Variations in the availability of archaeological remains may explain differences in genetic diversity estimates and patterns within the cohorts rather than demographic events that occurred in the past
[Frequency distribution of HLA antigens and haplotypes in newly arrived inhabitants of Magadan].
Solovenchuk, L L; Pereverzeva, V V; Nevretdinova, Z G
1994-09-01
Peculiarities of the frequency distribution of antigens and haplotypes of A, B, and Cw subloci of the HLA system in 924 Slavic inhabitants of Magadan are described. Significant differences in gene and haplotype frequencies between inhabitants of Magadan and those of Moscow, Odessa, Poles'e, Latvia, and England were revealed, which could not be attributed solely to the specificity of migration processes. On the basis of an analysis of gamete associations of the A and B subloci, an attempt was made to explain the specificity of the frequency distribution of HLA system alleles and haplotypes in the investigated sample from an ecological point of view.
Lee, Daniel K C; Bates, Caroline E; Lipworth, Brian J
2004-01-01
The relationship between beta2-adrenoceptor polymorphisms at positions 16 and 27, and the acute systemic beta2-adrenoceptor effects of inhaled salbutamol is unclear. We therefore elected to evaluate the influence of common homozygous beta2-adrenoceptor haplotypes on the acute systemic beta2-adrenoceptor effects following inhaled salbutamol in asthmatic subjects. An initial database search of 531 asthmatic subjects identified the two commonest homozygous haplotypes at positions 16 and 27 to be Arg16-Gln27 (12%) and Gly16-Glu27 (19%). After a 1-week washout period where all beta2-adrenoceptor agonists were withdrawn, 16 Caucasian subjects (Arg16-Gln27: n = 8 and Gly16-Glu27: n = 8) were given a single dose of inhaled salbutamol (1200 microg), followed by serial blood sampling for serum potassium, along with measurements of diastolic blood pressure and heart rate, at 5-min intervals for 20 min. The two groups were well matched for age, sex, FEV1, and inhaled corticosteroid dose. Baseline values for serum potassium, diastolic blood pressure and heart rate were not significantly different comparing Arg16-Gln27 vs Gly16-Glu27. The mean +/- SEM maximum serum potassium change from baseline over 20 min was significantly greater (P = 0.04) for Arg16-Gln27: -0.37 +/- 0.05 mmol l(-1) vs Gly16-Glu27: -0.23 +/- 0.04 mmol l(-1); 95% CI for difference: -0.01 to -0.28 mmol l(-1). The maximum diastolic blood pressure change from baseline over 20 min was significantly greater (P = 0.0008) for Arg16-Gln27: -13 +/- 1 mmHg vs Gly16-Glu27: -4 +/- 2 mmHg; 95% CI for difference: -5, 14 mmHg. There was no significant difference comparing the maximum heart rate change from baseline for Arg16-Gln27: 10 +/- 3 beats min(-1) vs Gly16-Glu27: 10 +/- 3 beats min(-1). Caucasian asthmatic subjects with the Arg16-Gln27 haplotype exhibited a greater systemic response to inhaled salbutamol, compared with those with the Gly16-Glu27 haplotype. The attenuated beta2-adrenoceptor response in the Gly16-Glu27
Schäfer, Christian; Schmidt, Alexander H; Sauter, Jürgen
2017-05-30
Knowledge of HLA haplotypes is helpful in many settings as disease association studies, population genetics, or hematopoietic stem cell transplantation. Regarding the recruitment of unrelated hematopoietic stem cell donors, HLA haplotype frequencies of specific populations are used to optimize both donor searches for individual patients and strategic donor registry planning. However, the estimation of haplotype frequencies from HLA genotyping data is challenged by the large amount of genotype data, the complex HLA nomenclature, and the heterogeneous and ambiguous nature of typing records. To meet these challenges, we have developed the open-source software Hapl-o-Mat. It estimates haplotype frequencies from population data including an arbitrary number of loci using an expectation-maximization algorithm. Its key features are the processing of different HLA typing resolutions within a given population sample and the handling of ambiguities recorded via multiple allele codes or genotype list strings. Implemented in C++, Hapl-o-Mat facilitates efficient haplotype frequency estimation from large amounts of genotype data. We demonstrate its accuracy and performance on the basis of artificial and real genotype data. Hapl-o-Mat is a versatile and efficient software for HLA haplotype frequency estimation. Its capability of processing various forms of HLA genotype data allows for a straightforward haplotype frequency estimation from typing records usually found in stem cell donor registries.
Hamstra, Danielle A; de Kloet, E Ronald; Tollenaar, Marieke; Verkuil, Bart; Manai, Meriem; Putman, Peter; Van der Does, Willem
2016-10-01
The processing of emotional information is affected by menstrual cycle phase and by the use of oral contraceptives (OCs). The stress hormone cortisol is known to affect emotional information processing via the limbic mineralocorticoid receptor (MR). We investigated in an exploratory study whether the MR-genotype moderates the effect of both OC-use and menstrual cycle phase on emotional cognition. Healthy premenopausal volunteers (n=93) of West-European descent completed a battery of emotional cognition tests. Forty-nine participants were OC users and 44 naturally cycling, 21 of whom were tested in the early follicular (EF) and 23 in the mid-luteal (ML) phase of the menstrual cycle. In MR-haplotype 1/3 carriers, ML women gambled more than EF women when their risk to lose was relatively small. In MR-haplotype 2, ML women gambled more than EF women, regardless of their odds of winning. OC-users with MR-haplotype 1/3 recognised fewer facial expressions than ML women with MR-haplotype 1/3. MR-haplotype 1/3 carriers may be more sensitive to the influence of their female hormonal status. MR-haplotype 2 carriers showed more risky decision-making. As this may reflect optimistic expectations, this finding may support previous observations in female carriers of MR-haplotype 2 in a naturalistic cohort study. © The Author(s) 2016.
Gaibar, Maria; Esteban, María Esther; Via, Marc; Harich, Nourdin; Kandil, Mostafa; Fernández-Santander, Ana
2012-07-01
This work describes, for the first time, the profile of Middle Atlas Berbers and Arabic-speaking central Moroccans for 15 autosomal STR loci widely used in forensic sciences. The main objectives were to determine the degree of heterogeneity among different Moroccan samples to identify geographic or linguistic patterns and to evaluate the usefulness of forensic STRs in anthropological studies. Blood samples were collected from 71 Arabic-speakers and 75 Berbers from the regions of Doukkala (central-west coast) and Khenifra (Middle Atlas), respectively. The AmpFlSTR Identifier kit was used to genotype 15 autosomal STR in both samples. Middle Atlas Berbers showed slightly higher genetic variation values compared to Arabic-speakers, both in the number of alleles and heterozygosity. In order to assess population relationships, data from Morocco, Algeria, Tunisia, Libya, Egypt, Kuwait, Qatar, Palestine, Syria, South-Spain and Turkey were included in the analysis. Within Morocco, genetic distances followed a clear geographic pattern. In the Arabic-speaking sample the genetic proportion of 'Arabian' admixture was estimated in 13%. The low value of admixture suggests that the Arabization of Morocco had a reduced demographic impact, which should be taken with caution because it is based on autosomal STRs with low inter-population variation levels.
Haplotype diversity of the myostatin gene among beef cattle breeds
Dunner, Susana; Miranda, M Eugenia; Amigues, Yves; Cañón, Javier; Georges, Michel; Hanset, Roger; Williams, John; Ménissier, François
2003-01-01
A total of 678 individuals from 28 European bovine breeds were both phenotyped and analysed at the myostatin locus by the Single Strand Conformation Polymorphism (SSCP) method. Seven new mutations were identified which contribute to the high polymorphism (1 SNP every 100 bp) present in this small gene; twenty haplotypes were described and a genotyping method was set up using the Oligonucleotide Ligation Assay (OLA) method. Some haplotypes appeared to be exclusive to a particular breed; this was the case for 5 in the Charolaise (involving mutation Q204X) and 7 in the Maine-Anjou (involving mutation E226X). The relationships between the different haplotypes were studied, thus allowing to test the earlier hypothesis on the origin of muscular hypertrophy in Europe: muscular hypertrophy (namely nt821(del11)) was mainly spread in different waves from northern Europe milk purpose populations in most breeds; however, other mutations (mostly disruptive) arose in a single breed, were highly selected and have since scarcely evolved to other populations. PMID:12605853
Aarnes, Siv Grethe; Hagen, Snorre B; Andreassen, Rune; Schregel, Julia; Knappskog, Per M; Hailer, Frank; Stenhouse, Gordon; Janke, Axel; Eiken, Hans Geir
2015-11-01
High-resolution Y-chromosomal markers have been applied to humans and other primates to study population genetics, migration, social structures and reproduction. Y-linked markers allow the direct assessment of the genetic structure and gene flow of uniquely male inherited lineages and may also be useful for wildlife conservation and forensics, but have so far been available only for few wild species. Thus, we have developed two multiplex PCR reactions encompassing nine Y-STR markers identified from the brown bear (Ursus arctos) and tested them on hair, fecal and tissue samples. The multiplex PCR approach was optimized and analyzed for species specificity, sensitivity and stutter-peak ratios. The nine Y-STRs also showed specific STR-fragments for male black bears and male polar bears, while none of the nine markers produced any PCR products when using DNA from female bears or males from 12 other mammals. The multiplex PCR approach in two PCR reactions could be amplified with as low as 0.2 ng template input. Precision was high in DNA templates from hairs, fecal scats and tissues, with standard deviations less than 0.14 and median stutter ratios from 0.04 to 0.63. Among the eight di- and one tetra-nucleotide repeat markers, we detected simple repeat structures in seven of the nine markers with 9-25 repeat units. Allelic variation was found for eight of the nine Y-STRs, with 2-9 alleles for each marker and a total of 36 alleles among 453 male brown bears sampled mainly from Northern Europe. We conclude that the multiplex PCR approach with these nine Y-STRs would provide male bear Y-chromosomal specificity and evidence suited for samples from conservation and wildlife forensics. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Specific CAPN10 gene haplotypes influence the clinical profile of polycystic ovary patients.
Gonzalez, Alejandro; Abril, Eduardo; Roca, Alfredo; Aragón, Maria José; Figueroa, Maria José; Velarde, Pilar; Ruiz, Rocío; Fayez, Omar; Galán, José Jorge; Herreros, José Antonio; Real, Luis Miguel; Ruiz, Agustín
2003-11-01
Recently, several research groups have evaluated CAPN10 gene in polycystic ovarian syndrome (PCOS) patients and other phenotypes, including hirsutism or intermediate phenotypes of PCOS. Molecular genetic analysis of CAPN10 gene indicates that different alleles may play a role in PCOS susceptibility and could be associated with idiopathic hirsutism. However, these observations are not exempt from controversy, because independent studies cannot replicate these preliminary findings. We present a haplotype-phenotype correlation study of CAPN10 haplotypes in 148 women showing ecographically detected polycystic ovaries (PCO) combined with one or more of these clinical symptoms: amenorrhea or severe oligomenorrhea, hyperandrogenism, and anovulatory infertility, as well as 93 unrelated controls. We have reconstructed and analyzed 482 CAPN10 haplotypes in patients and controls. We detected the association of UCSNP-44 allele with PCO phenotype in the Spanish population (P = 0.02). In addition, we identified several CAPN10 alleles associated to phenotypic differences observed between PCO patients, such as the presence of hypercholesterolemia (haplotype 1121, P = 0.005), presence of hyperandrogenic features (P = 0.05), and familial cancer incidence (haplotype 1111, P = 0.0005). Our results confirm the association of UCSNP-44 allele with PCO phenotype in the Spanish population. Moreover, we have identified novel candidate risk alleles and genotypes, within CAPN10 gene, that could be associated with important phenotypic and prognosis differences observed in PCOS patients.
Xu, Meixiang; Nekhayeva, Ilona; Cross, Courtney E; Rondelli, Catherine M; Wickliffe, Jeffrey K; Abdel-Rahman, Sherif Z
2014-03-01
The O6-methylguanine-DNA methyltransferase gene (MGMT) encodes the direct reversal DNA repair protein that removes alkyl adducts from the O6 position of guanine. Several single-nucleotide polymorphisms (SNPs) exist in the MGMT promoter/enhancer (P/E) region. However, the haplotype structure encompassing these SNPs and their functional/biological significance are currently unknown. We hypothesized that MGMT P/E haplotypes, rather than individual SNPs, alter MGMT transcription and can thus alter human sensitivity to alkylating agents. To identify the haplotype structure encompassing the MGMT P/E region SNPs, we sequenced 104 DNA samples from healthy individuals and inferred the haplotypes using the data generated. We identified eight SNPs in this region, namely T7C (rs180989103), T135G (rs1711646), G290A (rs61859810), C485A (rs1625649), C575A (rs113813075), G666A (rs34180180), C777A (rs34138162) and C1099T (rs16906252). Phylogenetics and Sequence Evolution analysis predicted 21 potential haplotypes that encompass these SNPs ranging in frequencies from 0.000048 to 0.39. Of these, 10 were identified in our study population as 20 paired haplotype combinations. To determine the functional significance of these haplotypes, luciferase reporter constructs representing these haplotypes were transfected into glioblastoma cells and their effect on MGMT promoter activity was determined. Compared with the most common (reference) haplotype 1, seven haplotypes significantly upregulated MGMT promoter activity (18-119% increase; P < 0.05), six significantly downregulated MGMT promoter activity (29-97% decrease; P < 0.05) and one haplotype had no effect. Mechanistic studies conducted support the conclusion that MGMT P/E haplotypes, rather than individual SNPs, differentially regulate MGMT transcription and could thus play a significant role in human sensitivity to environmental and therapeutic alkylating agents.
STR melting curve analysis as a genetic screening tool for crime scene samples.
Nguyen, Quang; McKinney, Jason; Johnson, Donald J; Roberts, Katherine A; Hardy, Winters R
2012-07-01
In this proof-of-concept study, high-resolution melt curve (HRMC) analysis was investigated as a postquantification screening tool to discriminate human CSF1PO and THO1 genotypes amplified with mini-STR primers in the presence of SYBR Green or LCGreen Plus dyes. A total of 12 CSF1PO and 11 HUMTHO1 genotypes were analyzed on the LightScanner HR96 and LS-32 systems and were correctly differentiated based upon their respective melt profiles. Short STR amplicon melt curves were affected by repeat number, and single-source and mixed DNA samples were additionally differentiated by the formation of heteroduplexes. Melting curves were shown to be unique and reproducible from DNA quantities ranging from 20 to 0.4 ng and distinguished identical from nonidentical genotypes from DNA derived from different biological fluids and compromised samples. Thus, a method is described which can assess both the quantity and the possible probative value of samples without full genotyping. 2012 American Academy of Forensic Sciences. Published 2012. This article is a U.S. Government work and is in the public domain in the U.S.A.
Vasquez, Edward A.; Glenn, Edward P.; Brown, J. Jed; Guntenspergen, Glenn R.; Nelson, Stephen G.
2005-01-01
A distinct, non-native haplotype of the common reed Phragmites australis has become invasive in Atlantic coastal Spartina marshes. We compared the salt tolerance and other growth characteristics of the invasive M haplotype with 2 native haplotypes (F and AC) in greenhouse experiments. The M haplotype retained 50% of its growth potential up to 0.4 M NaCl, whereas the F and AC haplotypes did not grow above 0.1 M NaCl. The M haplotype produced more shoots per gram of rhizome tissue and had higher relative growth rates than the native haplotypes on both freshwater and saline water treatments. The M haplotype also differed from the native haplotypes in shoot water content and the biometrics of shoots and rhizomes. The results offer an explanation for how the M haplotype is able to spread in coastal salt marshes and support the conclusion of DNA analyses that the M haplotype is a distinct ecotype of P. australis.
Qiu, Anqi; Tuan, Ta Anh; Ong, Mei Lyn; Li, Yue; Chen, Helen; Rifkin-Graboi, Anne; Broekman, Birit F P; Kwek, Kenneth; Saw, Seang-Mei; Chong, Yap-Seng; Gluckman, Peter D; Fortier, Marielle V; Holbrook, Joanna Dawn; Meaney, Michael J
2015-02-01
Exposure to antenatal maternal anxiety and complex genetic variations may shape fetal brain development. In particular, the catechol-O-methyltransferase (COMT) gene, located on chromosome 22q11.2, regulates catecholamine signaling in the prefrontal cortex and is implicated in anxiety, pain, and stress responsivity. This study examined whether individual single-nucleotide polymorphisms (SNPs) of the COMT gene and their haplotypes moderate the association between antenatal maternal anxiety and in utero cortical development. A total of 146 neonates were genotyped and underwent MRI shortly after birth. Neonatal cortical morphology was characterized using cortical thickness. Antenatal maternal anxiety was assessed using the State-Trait Anxiety Inventory at week 26 of pregnancy. Individual COMT SNPs (val158met, rs737865, and rs165599) modulated the association between antenatal maternal anxiety and the prefrontal and parietal cortical thickness in neonates. Based on haplotype trend regression analysis, findings also showed that among rs737865-val158met-rs165599 haplotypes, the A-val-G (AGG) haplotype probabilities modulated positive associations of antenatal maternal anxiety with cortical thickness in the right ventrolateral prefrontal cortex and the right superior parietal cortex and precuneus. In contrast, the G-met-A (GAA) haplotype probabilities modulated negative associations of antenatal maternal anxiety with cortical thickness in bilateral precentral gyrus and the dorsolateral prefrontal cortex. These results suggest that the association between maternal anxiety and in utero neurodevelopment is modified through complex genetic variation in COMT. Such genetic moderation may explain, in part, the variation in phenotypic outcomes in offspring associated with maternal emotional well-being.
Haplotype Analysis of the Melanopsin Gene in Seasonal Affective Disorder and Controls
2007-06-19
Cole, P. A. (2002). Serotonin n-acetyltransferase: Mechanism and inhibition. Current Medicinal Chemistry , 9(12), 1187-1199. 152 APPENDIX A STRUCTURED ...such that low light levels fall below this threshold during winter in individuals with SAD. The present study investigated the haplotype structure of...Association Studies 51 Advantages of Population-Based Case-Control Samples 52 Haplotype Structure 53 Linkage Disequilibrium: A Measure of Correlation Between
Khrustaleva, A M; Volkov, A A; Stoklitskaia, D S; Miuge, N S; Zelenina, D A
2010-11-01
Sockeye salmon samples from five largest lacustrine-riverine systems of Kamchatka Peninsula were tested for polymorphism at six microsatellite (STR) and five single nucleotide polymorphism (SNP) loci. Statistically significant genetic differentiation among local populations from this part of the species range examined was demonstrated. The data presented point to pronounced genetic divergence of the populations from two geographical regions, Eastern and Western Kamchatka. For sockeye salmon, the individual identification test accuracy was higher for microsatellites compared to similar number of SNP markers. Pooling of the STR and SNP allele frequency data sets provided the highest accuracy of the individual fish population assignment.
Insights into HLA-G Genetics Provided by Worldwide Haplotype Diversity
Castelli, Erick C.; Ramalho, Jaqueline; Porto, Iane O. P.; Lima, Thálitta H. A.; Felício, Leandro P.; Sabbagh, Audrey; Donadi, Eduardo A.; Mendes-Junior, Celso T.
2014-01-01
Human leukocyte antigen G (HLA-G) belongs to the family of non-classical HLA class I genes, located within the major histocompatibility complex (MHC). HLA-G has been the target of most recent research regarding the function of class I non-classical genes. The main features that distinguish HLA-G from classical class I genes are (a) limited protein variability, (b) alternative splicing generating several membrane bound and soluble isoforms, (c) short cytoplasmic tail, (d) modulation of immune response (immune tolerance), and (e) restricted expression to certain tissues. In the present work, we describe the HLA-G gene structure and address the HLA-G variability and haplotype diversity among several populations around the world, considering each of its major segments [promoter, coding, and 3′ untranslated region (UTR)]. For this purpose, we developed a pipeline to reevaluate the 1000Genomes data and recover miscalled or missing genotypes and haplotypes. It became clear that the overall structure of the HLA-G molecule has been maintained during the evolutionary process and that most of the variation sites found in the HLA-G coding region are either coding synonymous or intronic mutations. In addition, only a few frequent and divergent extended haplotypes are found when the promoter, coding, and 3′UTRs are evaluated together. The divergence is particularly evident for the regulatory regions. The population comparisons confirmed that most of the HLA-G variability has originated before human dispersion from Africa and that the allele and haplotype frequencies have probably been shaped by strong selective pressures. PMID:25339953
Terasawa, Hideo; Oda, Masaya; Morino, Hiroyuki; Miyachi, Takafumi; Izumi, Yuishin; Maruyama, Hirofumi; Matsumoto, Masayasu; Kawakami, Hideshi
2004-03-25
The highest prevalence rate of spinocerebellar ataxia type 6 (SCA6) in the worldwide population is in the Chugoku and Kansai areas of Western Japan, but the reason of this geographic characteristics is unclear. We investigated the predisposing haplotypes and their geographic distribution. Genotyping of five microsatellite markers and three single nucleotide polymorphisms linked to the CACNA1A gene in 150 Japanese SCA6 patients from unrelated 118 families revealed three major haplotypes, carrying a pool of one common haplotype core. A founder chromosome was thought to have historically diverged into at least three types. One of the major haplotypes newly identified showed a strong geographical cluster around the Seto Inland Sea in the Chugoku and Kansai areas of Western Japan, whereas the others were widely distributed throughout Japan. The distribution of predisposing haplotypes contributes to the geographical differences in prevalence of SCA6.
Balaresque, Patricia; Poulet, Nicolas; Cussat-Blanc, Sylvain; Gerard, Patrice; Quintana-Murci, Lluis; Heyer, Evelyne; Jobling, Mark A
2015-01-01
High-frequency microsatellite haplotypes of the male-specific Y-chromosome can signal past episodes of high reproductive success of particular men and their patrilineal descendants. Previously, two examples of such successful Y-lineages have been described in Asia, both associated with Altaic-speaking pastoral nomadic societies, and putatively linked to dynasties descending, respectively, from Genghis Khan and Giocangga. Here we surveyed a total of 5321 Y-chromosomes from 127 Asian populations, including novel Y-SNP and microsatellite data on 461 Central Asian males, to ask whether additional lineage expansions could be identified. Based on the most frequent eight-microsatellite haplotypes, we objectively defined 11 descent clusters (DCs), each within a specific haplogroup, that represent likely past instances of high male reproductive success, including the two previously identified cases. Analysis of the geographical patterns and ages of these DCs and their associated cultural characteristics showed that the most successful lineages are found both among sedentary agriculturalists and pastoral nomads, and expanded between 2100 BCE and 1100 CE. However, those with recent origins in the historical period are almost exclusively found in Altaic-speaking pastoral nomadic populations, which may reflect a shift in political organisation in pastoralist economies and a greater ease of transmission of Y-chromosomes through time and space facilitated by the use of horses. PMID:25585703
[Application of Multiple Genetic Markers in a Case of Determination of Half Sibling].
Yang, Xue; Shi, Mei-sen; Yuan, Li; Lu, Di
2016-02-01
A case of half sibling was determined with multiple genetic markers, which could be potentially applied for determination of half sibling relationship from same father. Half sibling relationship was detected by 39 autosomal STR genetic markers, 23 Y-chromosomal STR genetic markers and 12 X -chromosomal STR genetic markers among ZHAO -1, ZHAO -2, ZHAO -3, ZHAO -4, and ZHAO-5. According to autosomal STR, Y-STR and X-STR genotyping results, it was determined that ZHAO-4 (alleged half sibling) was unrelated with ZHAO-1 and ZHAO-2; however, ZHAO-3 (alleged half sibling), ZHAO-5 (alleged half sibling) shared same genetic profile with ZHAO-1, and ZHAO-2 from same father. It is reliable to use multiple genetic markers and family gene reconstruction to determine half sibling relationship from same father, but it is difficult to determination by calculating half sibling index with ITO and discriminant functions.
Submegabase Clusters of Unstable Tandem Repeats Unique to the Tla Region of Mouse T Haplotypes
Uehara, H.; Ebersole, T.; Bennett, D.; Artzt, K.
1990-01-01
We describe here the identification and genomic organization of mouse t haplotype-specific elements (TSEs) 7.8 and 5.8 kb in length. The TSEs exist as submegabase-long clusters of tandem repeats localized in the Tla region of the major histocompatibility complex of all t haplotype chromosomes examined. In contrast, no such clusters were detected among 12 inbred strains of Mus musculus and other Mus species; thus, clusters of TSEs represent the first absolutely qualitative difference between t haplotypes and wild-type chromosomes. Pulsed field gel electrophoresis shows that the number of clusters, and the number of repeats in each cluster are extremely variable. Dramatic quantitative differences of TSEs uniquely distinguish every independent t haplotype from any other. The complete nucleotide sequence of one 7.8-kb TSE reveals significant homology to the ETn (a major transcript in the early embryo of the mouse), and some homologies to intracisternal A-particles and the mammary tumor virus env gene. Apart from the diagnostic relevance to t haplotypes, evolutionary and functional significances are discussed with respect to chromosome structure and genetic recombination. PMID:2076812
Searching for the mother missed since the Second World War.
Zupanič Pajnič, Irena; Petaros, Anja; Balažic, Jože; Geršak, Ksenija
2016-11-01
The aim of the study was to perform the genetic identification of a human cranium from a Second World War gravesite in Slovenia and find out if it belonged to the mother of a woman used as a family reference. Both genetic and anthropological examinations were carried out. The genetic examination was performed on 2 molars and petrous bone. Prior to DNA isolation 0.5 g of tooth and bone powder was decalcified. The DNA was purified in a Biorobot EZ1 (Qiagen) device. The nuclear DNA of the samples was quantified and short tandem repeat (STR) typing performed using two different autosomal and Y-STR kits. Up to 22.4 ng DNA/g of powder was obtained from samples analyzed. We managed to obtain nuclear DNA for successful STR typing from the left second molar and from the petrous bone. Full autosomal genetic profile including amelogenin locus revealed the male origin of the cranium that was further confirmed by the analyses of Y-STRs. The same conclusions were adopted after the anthropological analysis which identified the cranium as that of a very young Caucasoid male. The male origin of the cranium rejected the possibility of motherhood for the compared daughter. For traceability in the event of contamination, we created an elimination database including genetic profiles of the nuclear and Y-STRs of all persons that had been in contact with the analyzed cranium and no match was found. Copyright © 2016 Elsevier Ltd and Faculty of Forensic and Legal Medicine. All rights reserved.
Pedersen, Niels C; Brucker, Lynn; Tessier, Natalie Green; Liu, Hongwei; Penedo, Maria Cecilia T; Hughes, Shayne; Oberbauer, Anita; Sacks, Ben
2015-01-01
Sebaceous adenitis (SA) and Addison's disease (AD) increased rapidly in incidence among Standard Poodles after the mid-twentieth century. Previous attempts to identify specific genetic causes using genome wide association studies and interrogation of the dog leukocyte antigen (DLA) region have been non-productive. However, such studies led us to hypothesize that positive selection for desired phenotypic traits that arose in the mid-twentieth century led to intense inbreeding and the inadvertent amplification of AD and SA associated traits. This hypothesis was tested with genetic studies of 761 Standard, Miniature, and Miniature/Standard Poodle crosses from the USA, Canada and Europe, coupled with extensive pedigree analysis of thousands more dogs. Genome-wide diversity across the world-wide population was measured using a panel of 33 short tandem repeat (STR) loci. Allele frequency data were also used to determine the internal relatedness of individual dogs within the population as a whole. Assays based on linkage between STR genomic loci and DLA genes were used to identify class I and II haplotypes and disease associations. Genetic diversity statistics based on genomic STR markers indicated that Standard Poodles from North America and Europe were closely related and reasonably diverse across the breed. However, genetic diversity statistics, internal relatedness, principal coordinate analysis, and DLA haplotype frequencies showed a marked imbalance with 30 % of the diversity in 70 % of the dogs. Standard Poodles with SA and AD were strongly linked to this inbred population, with dogs suffering with SA being the most inbred. No single strong association was found between STR defined DLA class I or II haplotypes and SA or AD in the breed as a whole, although certain haplotypes present in a minority of the population appeared to confer moderate degrees of risk or protection against either or both diseases. Dogs possessing minor DLA class I haplotypes were half as
Swisher, Kylie D.; Henne, Donald C.; Crosslin, James M.
2014-01-01
Abstract The potato psyllid, Bactericera cockerelli (Sulc) (Hemiptera: Triozidae), is a pest of potato and other solanaceous crops in North and Central America and New Zealand. Previous genotyping studies have demonstrated the presence of three different haplotypes of B. cockerelli in the United States corresponding to three geographical regions: Central, Western, and Northwestern. These studies utilized psyllids collected in the western and central United States between 1998 and 2011. In an effort to further genotype potato psyllids collected in the 2012 growing season, a fourth B. cockerelli haplotype was discovered corresponding to the Southwestern United States geographical region. High-resolution melting analyses identified this new haplotype using an amplicon generated from a portion of the B. cockerelli mitochondrial cytochrome coxidase subunit I gene. Sequencing of this gene, as well as use of a restriction enzyme assay, confirmed the identification of the novel B. cockerelli haplotype in the United States. PMID:25368079
Drake, B.M.; Goto, R.M.; Miller, M.M.; Gee, G.F.; Briles, W.E.
1999-01-01
The major histocompatibility complex (MHC) is a group of genetic loci coding for haplotypes that have been associated with fitness traits in mammals and birds. Such associations suggest that MHC diversity may be an indicator of overall genetic fitness of endangered or threatened species. The MHC haplotypes of a captive population of 12 families of northern bobwhites (Colinus virginianus) were identified using a combination of immunogenetic and molecular techniques. Alloantisera were produced within families of northern bobwhites and were then tested for differential agglutination of erythrocytes of all members of each family. The pattern of reactions determined from testing these alloantisera identified a single genetic system of alloantigens in the northern bobwhites, resulting in the assignment of a tentative genotype to each individual within the quail families. Restriction fragment patterns of the DNA of each bird were determined using the chicken MHC B-G cDNA probe bg11. The concordance between the restriction fragment patterns and the alloantisera reactions showed that the alloantisera had identified the MHC of the northern bobwhite and supported the tentative genotype assignments, identifying at least 12 northern bobwhite MHC haplotypes.