Sample records for tandem repeat haplotype

  1. Submegabase Clusters of Unstable Tandem Repeats Unique to the Tla Region of Mouse T Haplotypes

    PubMed Central

    Uehara, H.; Ebersole, T.; Bennett, D.; Artzt, K.

    1990-01-01

    We describe here the identification and genomic organization of mouse t haplotype-specific elements (TSEs) 7.8 and 5.8 kb in length. The TSEs exist as submegabase-long clusters of tandem repeats localized in the Tla region of the major histocompatibility complex of all t haplotype chromosomes examined. In contrast, no such clusters were detected among 12 inbred strains of Mus musculus and other Mus species; thus, clusters of TSEs represent the first absolutely qualitative difference between t haplotypes and wild-type chromosomes. Pulsed field gel electrophoresis shows that the number of clusters, and the number of repeats in each cluster are extremely variable. Dramatic quantitative differences of TSEs uniquely distinguish every independent t haplotype from any other. The complete nucleotide sequence of one 7.8-kb TSE reveals significant homology to the ETn (a major transcript in the early embryo of the mouse), and some homologies to intracisternal A-particles and the mammary tumor virus env gene. Apart from the diagnostic relevance to t haplotypes, evolutionary and functional significances are discussed with respect to chromosome structure and genetic recombination. PMID:2076812

  2. Genetic variation in a compound short tandem repeat/Alu haplotype system at the SB19.3 locus: properties and interpretation.

    PubMed

    Gaspar, Paulo; Seixas, Susana; Rocha, Jorge

    2004-04-01

    The genetic variation at a compound nonrecombining haplotype system, consisting of the previously reported SB19.3 Alu insertion polymorphism and a newly identified adjacent short tandem repeat (STR), was studied in population samples from Portugal and São Tomé (Gulf of Guinea, West Africa). Age estimates based on the linked microsatellite variation suggest that the Alu insertion occurred about 190,000 years ago. In accordance with the global patterns of distribution of human genetic variation, the highest haplotype diversity was found in the African sample. This excess in African diversity was due to both a substantial reduction in heterozygosity at the Alu polymorphism and a lower STR variability associated with the predominant Alu insertion allele in the Portuguese sample. The high level of interpopulation differentiation observed at the Alu locus (F(ST) = 0.43) was interpreted under alternative selective and demographic scenarios. The need for compatibility between patterns of variation at the STR and Alu loci could be used to restrict the range of selection coefficients in selection-driven genetic hitchhiking frameworks and to favor demographic scenarios dominated by larger pre-expansion African population sizes. Taken together, the data show that the SB19.3 Alu-STR system is an informative marker that can be included in more extended batteries of compound haplotypes used in human evolutionary studies.

  3. Short Tandem Repeat DNA Internet Database

    National Institute of Standards and Technology Data Gateway

    SRD 130 Short Tandem Repeat DNA Internet Database (Web, free access)   Short Tandem Repeat DNA Internet Database is intended to benefit research and application of short tandem repeat DNA markers for human identity testing. Facts and sequence information on each STR system, population data, commonly used multiplex STR systems, PCR primers and conditions, and a review of various technologies for analysis of STR alleles have been included.

  4. Genetic analysis of haplotype data for 23 Y-chromosome short tandem repeat loci in the Turkish population recently settled in Sarajevo, Bosnia and Herzegovina

    PubMed Central

    Dogan, Serkan; Primorac, Dragan; Marjanović, Damir

    2014-01-01

    Aim To explore the distribution and polymorphisms of 23 short tandem repeat (STR) loci on the Y chromosome in the Turkish population recently settled in Sarajevo, Bosnia and Herzegovina and to investigate its genetic relationships with the homeland Turkish population and neighboring populations. Methods This study included 100 healthy unrelated male individuals from the Turkish population living in Sarajevo. Buccal swab samples were collected as a DNA source. Genomic DNA was extracted using the salting out method and amplification was performed using PowerPlex Y 23 amplification kit. The studied population was compared to other populations using pairwise genetic distances, which were represented with a multi-dimensional scaling plot. Results Haplotype and allele frequencies of the sample population were calculated and the results showed that all 100 samples had unique haplotypes. The most polymorphic locus was DYS458, and the least polymorphic DYS391. The observed haplotype diversity was 1.0000 ± 0.0014, with a discrimination capacity of 1.00 and the match probability of 0.01. Rst values showed that our sample population was closely related in both dimensions to the Lebanese and Iraqi populations, while it was more distant from Bosnian, Croatian, and Macedonian populations. Conclusion Turkish population residing in Sarajevo could be observed as a representative Turkish population, since our results were consistent with those previously published for the homeland Turkish population. Also, this study once again proved that geographically close populations were genetically more related to each other. PMID:25358886

  5. Genetic analysis of haplotype data for 23 Y-chromosome short tandem repeat loci in the Turkish population recently settled in Sarajevo, Bosnia and Herzegovina.

    PubMed

    Dogan, Serkan; Primorac, Dragan; Marjanović, Damir

    2014-10-01

    To explore the distribution and polymorphisms of 23 short tandem repeat (STR) loci on the Y chromosome in the Turkish population recently settled in Sarajevo, Bosnia and Herzegovina and to investigate its genetic relationships with the homeland Turkish population and neighboring populations. This study included 100 healthy unrelated male individuals from the Turkish population living in Sarajevo. Buccal swab samples were collected as a DNA source. Genomic DNA was extracted using the salting out method and amplification was performed using PowerPlex Y 23 amplification kit. The studied population was compared to other populations using pairwise genetic distances, which were represented with a multi-dimensional scaling plot. Haplotype and allele frequencies of the sample population were calculated and the results showed that all 100 samples had unique haplotypes. The most polymorphic locus was DYS458, and the least polymorphic DYS391. The observed haplotype diversity was 1.0000 ± 0.0014, with a discrimination capacity of 1.00 and the match probability of 0.01. Rst values showed that our sample population was closely related in both dimensions to the Lebanese and Iraqi populations, while it was more distant from Bosnian, Croatian, and Macedonian populations. Turkish population residing in Sarajevo could be observed as a representative Turkish population, since our results were consistent with those previously published for the homeland Turkish population. Also, this study once again proved that geographically close populations were genetically more related to each other.

  6. Toward Male Individualization with Rapidly Mutating Y-Chromosomal Short Tandem Repeats

    PubMed Central

    Ballantyne, Kaye N; Ralf, Arwin; Aboukhalid, Rachid; Achakzai, Niaz M; Anjos, Maria J; Ayub, Qasim; Balažic, Jože; Ballantyne, Jack; Ballard, David J; Berger, Burkhard; Bobillo, Cecilia; Bouabdellah, Mehdi; Burri, Helen; Capal, Tomas; Caratti, Stefano; Cárdenas, Jorge; Cartault, François; Carvalho, Elizeu F; Carvalho, Monica; Cheng, Baowen; Coble, Michael D; Comas, David; Corach, Daniel; D'Amato, Maria E; Davison, Sean; de Knijff, Peter; De Ungria, Maria Corazon A; Decorte, Ronny; Dobosz, Tadeusz; Dupuy, Berit M; Elmrghni, Samir; Gliwiński, Mateusz; Gomes, Sara C; Grol, Laurens; Haas, Cordula; Hanson, Erin; Henke, Jürgen; Henke, Lotte; Herrera-Rodríguez, Fabiola; Hill, Carolyn R; Holmlund, Gunilla; Honda, Katsuya; Immel, Uta-Dorothee; Inokuchi, Shota; Jobling, Mark A; Kaddura, Mahmoud; Kim, Jong S; Kim, Soon H; Kim, Wook; King, Turi E; Klausriegler, Eva; Kling, Daniel; Kovačević, Lejla; Kovatsi, Leda; Krajewski, Paweł; Kravchenko, Sergey; Larmuseau, Maarten H D; Lee, Eun Young; Lessig, Ruediger; Livshits, Ludmila A; Marjanović, Damir; Minarik, Marek; Mizuno, Natsuko; Moreira, Helena; Morling, Niels; Mukherjee, Meeta; Munier, Patrick; Nagaraju, Javaregowda; Neuhuber, Franz; Nie, Shengjie; Nilasitsataporn, Premlaphat; Nishi, Takeki; Oh, Hye H; Olofsson, Jill; Onofri, Valerio; Palo, Jukka U; Pamjav, Horolma; Parson, Walther; Petlach, Michal; Phillips, Christopher; Ploski, Rafal; Prasad, Samayamantri P R; Primorac, Dragan; Purnomo, Gludhug A; Purps, Josephine; Rangel-Villalobos, Hector; Rębała, Krzysztof; Rerkamnuaychoke, Budsaba; Gonzalez, Danel Rey; Robino, Carlo; Roewer, Lutz; Rosa, Alexandra; Sajantila, Antti; Sala, Andrea; Salvador, Jazelyn M; Sanz, Paula; Schmitt, Cornelia; Sharma, Anil K; Silva, Dayse A; Shin, Kyoung-Jin; Sijen, Titia; Sirker, Miriam; Siváková, Daniela; Škaro, Vedrana; Solano-Matamoros, Carlos; Souto, Luis; Stenzl, Vlastimil; Sudoyo, Herawati; Syndercombe-Court, Denise; Tagliabracci, Adriano; Taylor, Duncan; Tillmar, Andreas; Tsybovsky, Iosif S; Tyler-Smith, Chris; van der Gaag, Kristiaan J; Vanek, Daniel; Völgyi, Antónia; Ward, Denise; Willemse, Patricia; Yap, Eric PH; Yong, Rita YY; Pajnič, Irena Zupanič; Kayser, Manfred

    2014-01-01

    Relevant for various areas of human genetics, Y-chromosomal short tandem repeats (Y-STRs) are commonly used for testing close paternal relationships among individuals and populations, and for male lineage identification. However, even the widely used 17-loci Yfiler set cannot resolve individuals and populations completely. Here, 52 centers generated quality-controlled data of 13 rapidly mutating (RM) Y-STRs in 14,644 related and unrelated males from 111 worldwide populations. Strikingly, >99% of the 12,272 unrelated males were completely individualized. Haplotype diversity was extremely high (global: 0.9999985, regional: 0.99836–0.9999988). Haplotype sharing between populations was almost absent except for six (0.05%) of the 12,156 haplotypes. Haplotype sharing within populations was generally rare (0.8% nonunique haplotypes), significantly lower in urban (0.9%) than rural (2.1%) and highest in endogamous groups (14.3%). Analysis of molecular variance revealed 99.98% of variation within populations, 0.018% among populations within groups, and 0.002% among groups. Of the 2,372 newly and 156 previously typed male relative pairs, 29% were differentiated including 27% of the 2,378 father–son pairs. Relative to Yfiler, haplotype diversity was increased in 86% of the populations tested and overall male relative differentiation was raised by 23.5%. Our study demonstrates the value of RM Y-STRs in identifying and separating unrelated and related males and provides a reference database. PMID:24917567

  7. Three potato centromeres are associated with distinct haplotypes with or without megabase-sized satellite repeat arrays.

    PubMed

    Wang, Linsheng; Zeng, Zixian; Zhang, Wenli; Jiang, Jiming

    2014-02-01

    We report discoveries of different haplotypes associated with the centromeres of three potato chromosomes, including haplotypes composed of long arrays of satellite repeats and haplotypes lacking the same repeats. These results are in favor of the hypothesis that satellite repeat-based centromeres may originate from neocentromeres that lack repeats.

  8. TRedD—A database for tandem repeats over the edit distance

    PubMed Central

    Sokol, Dina; Atagun, Firat

    2010-01-01

    A ‘tandem repeat’ in DNA is a sequence of two or more contiguous, approximate copies of a pattern of nucleotides. Tandem repeats are common in the genomes of both eukaryotic and prokaryotic organisms. They are significant markers for human identity testing, disease diagnosis, sequence homology and population studies. In this article, we describe a new database, TRedD, which contains the tandem repeats found in the human genome. The database is publicly available online, and the software for locating the repeats is also freely available. The definition of tandem repeats used by TRedD is a new and innovative definition based upon the concept of ‘evolutive tandem repeats’. In addition, we have developed a tool, called TandemGraph, to graphically depict the repeats occurring in a sequence. This tool can be coupled with any repeat finding software, and it should greatly facilitate analysis of results. Database URL: http://tandem.sci.brooklyn.cuny.edu/ PMID:20624712

  9. TRAP: automated classification, quantification and annotation of tandemly repeated sequences.

    PubMed

    Sobreira, Tiago José P; Durham, Alan M; Gruber, Arthur

    2006-02-01

    TRAP, the Tandem Repeats Analysis Program, is a Perl program that provides a unified set of analyses for the selection, classification, quantification and automated annotation of tandemly repeated sequences. TRAP uses the results of the Tandem Repeats Finder program to perform a global analysis of the satellite content of DNA sequences, permitting researchers to easily assess the tandem repeat content for both individual sequences and whole genomes. The results can be generated in convenient formats such as HTML and comma-separated values. TRAP can also be used to automatically generate annotation data in the format of feature table and GFF files.

  10. Typing Clostridium difficile strains based on tandem repeat sequences

    PubMed Central

    2009-01-01

    Background Genotyping of epidemic Clostridium difficile strains is necessary to track their emergence and spread. Portability of genotyping data is desirable to facilitate inter-laboratory comparisons and epidemiological studies. Results This report presents results from a systematic screen for variation in repetitive DNA in the genome of C. difficile. We describe two tandem repeat loci, designated 'TR6' and 'TR10', which display extensive sequence variation that may be useful for sequence-based strain typing. Based on an investigation of 154 C. difficile isolates comprising 75 ribotypes, tandem repeat sequencing demonstrated excellent concordance with widely used PCR ribotyping and equal discriminatory power. Moreover, tandem repeat sequences enabled the reconstruction of the isolates' largely clonal population structure and evolutionary history. Conclusion We conclude that sequence analysis of the two repetitive loci introduced here may be highly useful for routine typing of C. difficile. Tandem repeat sequence typing resolves phylogenetic diversity to a level equivalent to PCR ribotypes. DNA sequences may be stored in databases accessible over the internet, obviating the need for the exchange of reference strains. PMID:19133124

  11. [Polymorphic loci and polymorphism analysis of short tandem repeats within XNP gene].

    PubMed

    Liu, Qi-Ji; Gong, Yao-Qin; Guo, Chen-Hong; Chen, Bing-Xi; Li, Jiang-Xia; Guo, Yi-Shou

    2002-01-01

    To select polymorphic short tandem repeat markers within X-linked nuclear protein (XNP) gene, genomic clones which contain XNP gene were recognized by homologous analysis with XNP cDNA. By comparing the cDNA with genomic DNA, non-exonic sequences were identified, and short tandem repeats were selected from non-exonic sequences by using BCM search Launcher. Polymorphisms of the short tandem repeats in Chinese population were evaluated by PCR amplification and PAGE. Five short tandem repeats were identified from XNP gene, two of which were polymorphic. Four and 11 alleles were observed in Chinese population for XNPSTR1 and XNPSTR4, respectively. Heterozygosities were 47% for XNPSTR1 and 70% for XNPSTR4. XNPSTR1 and XNPSTR4 localized within 3' end and intron 10, respectively. Two polymorphic short tandem repeats have been identified within XNP gene and will be useful for linkage analysis and gene diagnosis of XNP gene.

  12. The association of 22 Y chromosome short tandem repeat loci with initiative-aggressive behavior.

    PubMed

    Yang, Chun; Ba, Huajie; Zhang, Wei; Zhang, Shuyou; Zhao, Hanqing; Yu, Haiying; Gao, Zhiqin; Wang, Binbin

    2018-05-15

    Aggressive behavior represents an important public concern and a clinical challenge to behaviorists and psychiatrists. Aggression in humans is known to have an important genetic basis, so to investigate the association of Y chromosome short tandem repeat (Y-STR) loci with initiative-aggressive behavior, we compared allelic and haplotypic distributions of 22 Y-STRs in a group of Chinese males convicted of premeditated extremely violent crimes (n = 271) with a normal control group (n = 492). Allelic distributions of DYS533 and DYS437 loci differed significantly between the two groups (P < 0.05). The case group had higher frequencies of DYS533 allele 14, DYS437 allele 14, and haplotypes 11-14 of DYS533-DYS437 compared with the control group. Additionally, the DYS437 allele 15 frequency was significantly lower in cases than controls. No frequency differences were observed in the other 20 Y-STR loci between these two groups. Our results indicate a genetic role for Y-STR loci in the development of initiative aggression in non-psychiatric subjects. Copyright © 2018 Elsevier B.V. All rights reserved.

  13. Tandem-repeat protein domains across the tree of life.

    PubMed

    Jernigan, Kristin K; Bordenstein, Seth R

    2015-01-01

    Tandem-repeat protein domains, composed of repeated units of conserved stretches of 20-40 amino acids, are required for a wide array of biological functions. Despite their diverse and fundamental functions, there has been no comprehensive assessment of their taxonomic distribution, incidence, and associations with organismal lifestyle and phylogeny. In this study, we assess for the first time the abundance of armadillo (ARM) and tetratricopeptide (TPR) repeat domains across all three domains in the tree of life and compare the results to our previous analysis on ankyrin (ANK) repeat domains in this journal. All eukaryotes and a majority of the bacterial and archaeal genomes analyzed have a minimum of one TPR and ARM repeat. In eukaryotes, the fraction of ARM-containing proteins is approximately double that of TPR and ANK-containing proteins, whereas bacteria and archaea are enriched in TPR-containing proteins relative to ARM- and ANK-containing proteins. We show in bacteria that phylogenetic history, rather than lifestyle or pathogenicity, is a predictor of TPR repeat domain abundance, while neither phylogenetic history nor lifestyle predicts ARM repeat domain abundance. Surprisingly, pathogenic bacteria were not enriched in TPR-containing proteins, which have been associated within virulence factors in certain species. Taken together, this comparative analysis provides a newly appreciated view of the prevalence and diversity of multiple types of tandem-repeat protein domains across the tree of life. A central finding of this analysis is that tandem repeat domain-containing proteins are prevalent not just in eukaryotes, but also in bacterial and archaeal species.

  14. Tandem-repeat protein domains across the tree of life

    PubMed Central

    Jernigan, Kristin K.

    2015-01-01

    Tandem-repeat protein domains, composed of repeated units of conserved stretches of 20–40 amino acids, are required for a wide array of biological functions. Despite their diverse and fundamental functions, there has been no comprehensive assessment of their taxonomic distribution, incidence, and associations with organismal lifestyle and phylogeny. In this study, we assess for the first time the abundance of armadillo (ARM) and tetratricopeptide (TPR) repeat domains across all three domains in the tree of life and compare the results to our previous analysis on ankyrin (ANK) repeat domains in this journal. All eukaryotes and a majority of the bacterial and archaeal genomes analyzed have a minimum of one TPR and ARM repeat. In eukaryotes, the fraction of ARM-containing proteins is approximately double that of TPR and ANK-containing proteins, whereas bacteria and archaea are enriched in TPR-containing proteins relative to ARM- and ANK-containing proteins. We show in bacteria that phylogenetic history, rather than lifestyle or pathogenicity, is a predictor of TPR repeat domain abundance, while neither phylogenetic history nor lifestyle predicts ARM repeat domain abundance. Surprisingly, pathogenic bacteria were not enriched in TPR-containing proteins, which have been associated within virulence factors in certain species. Taken together, this comparative analysis provides a newly appreciated view of the prevalence and diversity of multiple types of tandem-repeat protein domains across the tree of life. A central finding of this analysis is that tandem repeat domain-containing proteins are prevalent not just in eukaryotes, but also in bacterial and archaeal species. PMID:25653910

  15. Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution.

    PubMed

    Melters, Daniël P; Bradnam, Keith R; Young, Hugh A; Telis, Natalie; May, Michael R; Ruby, J Graham; Sebra, Robert; Peluso, Paul; Eid, John; Rank, David; Garcia, José Fernando; DeRisi, Joseph L; Smith, Timothy; Tobias, Christian; Ross-Ibarra, Jeffrey; Korf, Ian; Chan, Simon W L

    2013-01-30

    Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution. While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes.

  16. Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution

    PubMed Central

    2013-01-01

    Background Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. Results Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution. Conclusions While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes. PMID:23363705

  17. Rational design of alpha-helical tandem repeat proteins with closed architectures

    PubMed Central

    Doyle, Lindsey; Hallinan, Jazmine; Bolduc, Jill; Parmeggiani, Fabio; Baker, David; Stoddard, Barry L.; Bradley, Philip

    2015-01-01

    Tandem repeat proteins, which are formed by repetition of modular units of protein sequence and structure, play important biological roles as macromolecular binding and scaffolding domains, enzymes, and building blocks for the assembly of fibrous materials1,2. The modular nature of repeat proteins enables the rapid construction and diversification of extended binding surfaces by duplication and recombination of simple building blocks3,4. The overall architecture of tandem repeat protein structures – which is dictated by the internal geometry and local packing of the repeat building blocks – is highly diverse, ranging from extended, super-helical folds that bind peptide, DNA, and RNA partners5–9, to closed and compact conformations with internal cavities suitable for small molecule binding and catalysis10. Here we report the development and validation of computational methods for de novo design of tandem repeat protein architectures driven purely by geometric criteria defining the inter-repeat geometry, without reference to the sequences and structures of existing repeat protein families. We have applied these methods to design a series of closed alpha-solenoid11 repeat structures (alpha-toroids) in which the inter-repeat packing geometry is constrained so as to juxtapose the N- and C-termini; several of these designed structures have been validated by X-ray crystallography. Unlike previous approaches to tandem repeat protein engineering12–20, our design procedure does not rely on template sequence or structural information taken from natural repeat proteins and hence can produce structures unlike those seen in nature. As an example, we have successfully designed and validated closed alpha-solenoid repeats with a left-handed helical architecture that – to our knowledge – is not yet present in the protein structure database21. PMID:26675735

  18. Variable number of tandem repeat polymorphisms of DRD4: re-evaluation of selection hypothesis and analysis of association with schizophrenia

    PubMed Central

    Hattori, Eiji; Nakajima, Mizuho; Yamada, Kazuo; Iwayama, Yoshimi; Toyota, Tomoko; Saitou, Naruya; Yoshikawa, Takeo

    2009-01-01

    Associations have been reported between the variable number of tandem repeat (VNTR) polymorphisms in the exon 3 of dopamine D4 receptor gene gene and multiple psychiatric illnesses/traits. We examined the distribution of VNTR alleles of different length in a Japanese cohort and found that, as reported earlier, the size of allele ‘7R' was much rarer (0.5%) in Japanese than in Caucasian populations (∼20%). This presents a challenge to an earlier proposed hypothesis that positive selection favoring the allele 7R has contributed to its high frequency. To further address the issue of selection, we carried out sequencing of the VNTR region not only from human but also from chimpanzee samples, and made inference on the ancestral repeat motif and haplotype by use of a phylogenetic analysis program. The most common 4R variant was considered to be the ancestral haplotype as earlier proposed. However, in a gene tree of VNTR constructed on the basis of this inferred ancestral haplotype, the allele 7R had five descendent haplotypes in relatively long lineage, where genetic drift can have major influence. We also tested this length polymorphism for association with schizophrenia, studying two Japanese sample sets (one with 570 cases and 570 controls, and the other with 124 pedigrees). No evidence of association between the allele 7R and schizophrenia was found in any of the two data sets. Collectively, this study suggests that the VNTR variation does not have an effect large enough to cause either selection or a detectable association with schizophrenia in a study of samples of moderate size. PMID:19092778

  19. Versatile communication strategies among tandem WW domain repeats

    PubMed Central

    Dodson, Emma Joy; Fishbain-Yoskovitz, Vered; Rotem-Bamberger, Shahar

    2015-01-01

    Interactions mediated by short linear motifs in proteins play major roles in regulation of cellular homeostasis since their transient nature allows for easy modulation. We are still far from a full understanding and appreciation of the complex regulation patterns that can be, and are, achieved by this type of interaction. The fact that many linear-motif-binding domains occur in tandem repeats in proteins indicates that their mutual communication is used extensively to obtain complex integration of information toward regulatory decisions. This review is an attempt to overview, and classify, different ways by which two and more tandem repeats cooperate in binding to their targets, in the well-characterized family of WW domains and their corresponding polyproline ligands. PMID:25710931

  20. Molecular characterization and distribution of a 145-bp tandem repeat family in the genus Populus.

    PubMed

    Rajagopal, J; Das, S; Khurana, D K; Srivastava, P S; Lakshmikumaran, M

    1999-10-01

    This report aims to describe the identification and molecular characterization of a 145-bp tandem repeat family that accounts for nearly 1.5% of the Populus genome. Three members of this repeat family were cloned and sequenced from Populus deltoides and P. ciliata. The dimers of the repeat were sequenced in order to confirm the head-to-tail organization of the repeat. Hybridization-based analysis using the 145-bp tandem repeat as a probe on genomic DNA gave rise to ladder patterns which were identified to be a result of methylation and (or) sequence heterogeneity. Analysis of the methylation pattern of the repeat family using methylation-sensitive isoschizomers revealed variable methylation of the C residues and lack of methylation of the A residues. Sequence comparisons between the monomers revealed a high degree of sequence divergence that ranged between 6% and 11% in P. deltoides and between 4.2% and 8.3% in P. ciliata. This indicated the presence of sub-families within the 145-bp tandem family of repeats. Divergence was mainly due to the accumulation of point mutations and was concentrated in the central region of the repeat. The 145-bp tandem repeat family did not show significant homology to known tandem repeats from plants. A short stretch of 36 bp was found to show homology of 66.7% to a centromeric repeat from Chironomus plumosus. Dot-blot analysis and Southern hybridization data revealed the presence of the repeat family in 13 of the 14 Populus species examined. The absence of the 145-bp repeat from P. euphratica suggested that this species is relatively distant from other members of the genus, which correlates with taxonomic classifications. The widespread occurrence of the tandem family in the genus indicated that this family may be of ancient origin.

  1. Two tandemly repeated telomere-associated sequences in Nicotiana plumbaginifolia.

    PubMed

    Chen, C M; Wang, C T; Wang, C J; Ho, C H; Kao, Y Y; Chen, C C

    1997-12-01

    Two tandemly repeated telomere-associated sequences, NP3R and NP4R, have been isolated from Nicotiana plumbaginifolia. The length of a repeating unit for NP3R and NP4R is 165 and 180 nucleotides respectively. The abundance of NP3R, NP4R and telomeric repeats is, respectively, 8.4 x 10(4), 6 x 10(3) and 1.5 x 10(6) copies per haploid genome of N. plumbaginifolia. Fluorescence in situ hybridization revealed that NP3R is located at the ends and/or in interstitial regions of all 10 chromosomes and NP4R on the terminal regions of three chromosomes in the haploid genome of N. plumbaginifolia. Sequence homology search revealed that not only are NP3R and NP4R homologous to HRS60 and GRS, respectively, two tandem repeats isolated from N. tabacum, but that NP3R and NP4R are also related to each other, suggesting that they originated from a common ancestral sequence. The role of these repeated sequences in chromosome healing is discussed based on the observation that two to three copies of a telomere-similar sequence were present in each repeating unit of NP3R and NP4R.

  2. Stabilization of perfect and imperfect tandem repeats by single-strand DNA exonucleases

    PubMed Central

    Feschenko, Vladimir V.; Rajman, Luis A.; Lovett, Susan T.

    2003-01-01

    Rearrangements between tandemly repeated DNA sequences are a common source of genetic instability. Such rearrangements underlie several human genetic diseases. In many organisms, the mismatch-repair (MMR) system functions to stabilize repeats when the repeat unit is short or when sequence imperfections are present between the repeats. We show here that the action of single-stranded DNA (ssDNA) exonucleases plays an additional, important role in stabilizing tandem repeats, independent of their role in MMR. For perfect repeats of ≈100 bp in Escherichia coli that are not susceptible to MMR, exonuclease (Exo)-I, ExoX, and RecJ exonuclease redundantly inhibit deletion. Our data suggest that >90% of potential deletion events are avoided by the combined action of these three exonucleases. Imperfect tandem repeats, less prone to rearrangements, are stabilized by both the MMR-pathway and ssDNA-specific exonucleases. For 100-bp repeats containing four mispairs, ExoI alone aborts most deletion events, even in the presence of a functional MMR system. By genetic analysis, we show that the inhibitory effect of ssDNA exonucleases on deletion formation is independent of the MutS and UvrD proteins. Exonuclease degradation of DNA displaced during the deletion process may abort slipped misalignment. Exonuclease action is therefore a significant force in genetic stabilization of many forms of repetitive DNA. PMID:12538867

  3. Variable Number Of Tandem Repeats (VNTR) and its application in bacterial epidemiology.

    PubMed

    Ramazanzadeh, Rashid; McNerney, Ruth

    2007-08-15

    Molecular epidemiology is the using of molecular techniques to study bacterial distribution in human populations. Recently molecular epidemiologist benefit from several techniques such as Variable Number Tandem Repeat (VNTR) typing method to typing bacterial strains. Variable Number Tandem Repeat (VNTR) typing is a tool for genotyping and provides data in a simple and numeric format based on the number of repetitive sequences. VNTR for first time identified in M. tuberculosis as Mycobacterial Interspersed Repeat Units (MIRUs). General terms of VNTR have now been reported in Bacillus anthracis, Legionella pneumophila, Pseudomonas aeruginosa, Salmonella enterica and Escherichia coli O157.

  4. Identification and characterization of tandem repeats in exon III of dopamine receptor D4 (DRD4) genes from different mammalian species.

    PubMed

    Larsen, Svend Arild; Mogensen, Line; Dietz, Rune; Baagøe, Hans Jørgen; Andersen, Mogens; Werge, Thomas; Rasmussen, Henrik Berg

    2005-12-01

    In this study we have identified and characterized dopamine receptor D4 (DRD4) exon III tandem repeats in 33 public available nucleotide sequences from different mammalian species. We found that the tandem repeat in canids could be described in a novel and simple way, namely, as a structure composed of 15- and 12- bp modules. Tandem repeats composed of 18-bp modules were found in sequences from the horse, zebra, onager, and donkey, Asiatic bear, polar bear, common raccoon, dolphin, harbor porpoise, and domestic cat. Several of these sequences have been analyzed previously without a tandem repeat being found. In the domestic cow and gray seal we identified tandem repeats composed of 36-bp modules, each consisting of two closely related 18-bp basic units. A tandem repeat consisting of 9-bp modules was identified in sequences from mink and ferret. In the European otter we detected an 18-bp tandem repeat, while a tandem repeat consisting of 27-bp modules was identified in a sequence from European badger. Both these tandem repeats were composed of 9-bp basic units, which were closely related with the 9-bp repeat modules identified in the mink and ferret. Tandem repeats could not be identified in sequences from rodents. All tandem repeats possessed a high GC content with a strong bias for C. On phylogenetic analysis of the tandem repeats evolutionary related species were clustered into the same groups. The degree of conservation of the tandem repeats varied significantly between species. The deduced amino acid sequences of most of the tandem repeats exhibited a high propensity for disorder. This was also the case with an amino acid sequence of the human DRD4 exon III tandem repeat, which was included in the study for comparative purposes. We identified proline-containing motifs for SH3 and WW domain binding proteins, potential phosphorylation sites, PDZ domain binding motifs, and FHA domain binding motifs in the amino acid sequences of the tandem repeats. The numbers of

  5. Fourteen short tandem repeat loci Y chromosome haplotypes: Genetic analysis in populations from northern Brazil.

    PubMed

    Palha, Teresinha; Ribeiro-Rodrigues, Elzemar; Ribeiro-dos-Santos, Andrea; Santos, Sidney

    2012-05-01

    Fourteen Y-STR loci (DYS458, DYS439, Y-GATA H4, DYS576, DYS447, DYS460, DYS456, YGATA A10, DYS437, DYS449, DYS570, DYS635 or Y-GATA C4, DYS448 and DYS438) were analysed in 873 males from eight northern Brazil populations: Belém (N=400), Santarém (N=69), Manaus (N=75), Macapá (N=65), Palmas (N=30), Rio Branco (N=32), Porto Velho (N=135) and Boa Vista (N=67). A total of 871 different haplotypes were identified, of which 869 were unique. The panel's estimated total haplotype diversity (HD) is 0.9988, and its discrimination capacity (DC) is 0.9980. The lowest estimates of genetic diversity correspond to markers Y-GATA H4 (0.550) and DYS460 (0.581), and the greatest (above 0.700) to markers DYS458, DYS576, DYS447, YS449, DYS570 and DYS635. The genetic parameters obtained were higher for the 14-Y-STR panel than that for the minimum haplotype set (HD=0.9969; DC=0.76) and the parameters were similar to those obtained with the panel of 17 YSTR of YHRD (HD=0.9987; DC=0. 9870). The analysis of molecular variance (AMOVA) indicated that most of the genetic variance is found within populations and a smaller, but significant part, is found among populations (R(ST)=0.027, p value=0.009). The data when compared with those from African, Amerindian and European populations have shown no significant genetic distance between northern Brazil populations and Europeans, but there is a significant genetic distance when compared to Africans and Amerindians. The discrimination capacity of the markers shows a high potential for forensic analysis. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.

  6. Filipino DNA variation at 12 X-chromosome short tandem repeat markers.

    PubMed

    Salvador, Jazelyn M; Apaga, Dame Loveliness T; Delfin, Frederick C; Calacal, Gayvelline C; Dennis, Sheila Estacio; De Ungria, Maria Corazon A

    2018-06-08

    Demands for solving complex kinship scenarios where only distant relatives are available for testing have risen in the past years. In these instances, other genetic markers such as X-chromosome short tandem repeat (X-STR) markers are employed to supplement autosomal and Y-chromosomal STR DNA typing. However, prior to use, the degree of STR polymorphism in the population requires evaluation through generation of an allele or haplotype frequency population database. This population database is also used for statistical evaluation of DNA typing results. Here, we report X-STR data from 143 unrelated Filipino male individuals who were genotyped via conventional polymerase chain reaction-capillary electrophoresis (PCR-CE) using the 12 X-STR loci included in the Investigator ® Argus X-12 kit (Qiagen) and via massively parallel sequencing (MPS) of seven X-STR loci included in the ForenSeq ™ DNA Signature Prep kit of the MiSeq ® FGx ™ Forensic Genomics System (Illumina). Allele calls between PCR-CE and MPS systems were consistent (100% concordance) across seven overlapping X-STRs. Allele and haplotype frequencies and other parameters of forensic interest were calculated based on length (PCR-CE, 12 X-STRs) and sequence (MPS, seven X-STRs) variations observed in the population. Results of our study indicate that the 12 X-STRs in the PCR-CE system are highly informative for the Filipino population. MPS of seven X-STR loci identified 73 X-STR alleles compared with 55 X-STR alleles that were identified solely by length via PCR-CE. Of the 73 sequence-based alleles observed, six alleles have not been reported in the literature. The population data presented here may serve as a reference Philippine frequency database of X-STRs for forensic casework applications. Copyright © 2018 Elsevier B.V. All rights reserved.

  7. Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution

    USDA-ARS?s Scientific Manuscript database

    Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres comprise of megabase-scale arrays of tandem repeats. The true prevalence of centromere tandem repeats, and whether they exhibit conserved seque...

  8. Tandemly repeated sequences in mtDNA control region of whitefish, Coregonus lavaretus.

    PubMed

    Brzuzan, P

    2000-06-01

    Length variation of the mitochondrial DNA control region was observed with PCR amplification of a sample of 138 whitefish (Coregonus lavaretus). Nucleotide sequences of representative PCR products showed that the variation was due to the presence of an approximately 100-bp motif tandemly repeated two, three, or five times in the region between the conserved sequence block-3 (CSB-3) and the gene for phenylalanine tRNA. This is the first report on the tandem array composed of long repeat units in mitochondrial DNA of salmonids.

  9. Tandem Repeat Proteins Inspired By Squid Ring Teeth

    NASA Astrophysics Data System (ADS)

    Pena-Francesch, Abdon

    Proteins are large biomolecules consisting of long chains of amino acids that hierarchically assemble into complex structures, and provide a variety of building blocks for biological materials. The repetition of structural building blocks is a natural evolutionary strategy for increasing the complexity and stability of protein structures. However, the relationship between amino acid sequence, structure, and material properties of protein systems remains unclear due to the lack of control over the protein sequence and the intricacies of the assembly process. In order to investigate the repetition of protein building blocks, a recently discovered protein from squids is examined as an ideal protein system. Squid ring teeth are predatory appendages located inside the suction cups that provide a strong grasp of prey, and are solely composed of a group of proteins with tandem repetition of building blocks. The objective of this thesis is the understanding of sequence, structure and property relationship in repetitive protein materials inspired in squid ring teeth for the first time. Specifically, this work focuses on squid-inspired structural proteins with tandem repeat units in their sequence (i.e., repetition of alternating building blocks) that are physically cross-linked via beta-sheet structures. The research work presented here tests the hypothesis that, in these systems, increasing the number of building blocks in the polypeptide chain decreases the protein network defects and improves the material properties. Hence, the sequence, nanostructure, and properties (thermal, mechanical, and conducting) of tandem repeat squid-inspired protein materials are examined. Spectroscopic structural analysis, advanced materials characterization, and entropic elasticity theory are combined to elucidate the structure and material properties of these repetitive proteins. This approach is applied not only to native squid proteins but also to squid-inspired synthetic polypeptides

  10. RepeatsDB-lite: a web server for unit annotation of tandem repeat proteins.

    PubMed

    Hirsh, Layla; Paladin, Lisanna; Piovesan, Damiano; Tosatto, Silvio C E

    2018-05-09

    RepeatsDB-lite (http://protein.bio.unipd.it/repeatsdb-lite) is a web server for the prediction of repetitive structural elements and units in tandem repeat (TR) proteins. TRs are a widespread but poorly annotated class of non-globular proteins carrying heterogeneous functions. RepeatsDB-lite extends the prediction to all TR types and strongly improves the performance both in terms of computational time and accuracy over previous methods, with precision above 95% for solenoid structures. The algorithm exploits an improved TR unit library derived from the RepeatsDB database to perform an iterative structural search and assignment. The web interface provides tools for analyzing the evolutionary relationships between units and manually refine the prediction by changing unit positions and protein classification. An all-against-all structure-based sequence similarity matrix is calculated and visualized in real-time for every user edit. Reviewed predictions can be submitted to RepeatsDB for review and inclusion.

  11. A TALE-inspired computational screen for proteins that contain approximate tandem repeats.

    PubMed

    Perycz, Malgorzata; Krwawicz, Joanna; Bochtler, Matthias

    2017-01-01

    TAL (transcription activator-like) effectors (TALEs) are bacterial proteins that are secreted from bacteria to plant cells to act as transcriptional activators. TALEs and related proteins (RipTALs, BurrH, MOrTL1 and MOrTL2) contain approximate tandem repeats that differ in conserved positions that define specificity. Using PERL, we screened ~47 million protein sequences for TALE-like architecture characterized by approximate tandem repeats (between 30 and 43 amino acids in length) and sequence variability in conserved positions, without requiring sequence similarity to TALEs. Candidate proteins were scored according to their propensity for nuclear localization, secondary structure, repeat sequence complexity, as well as covariation and predicted structural proximity of variable residues. Biological context was tentatively inferred from co-occurrence of other domains and interactome predictions. Approximate repeats with TALE-like features that merit experimental characterization were found in a protein of chestnut blight fungus, a eukaryotic plant pathogen.

  12. Tandem Repeated Irritation Test (TRIT) Studies and Clinical Relevance: Post 2006.

    PubMed

    Reddy, Rasika; Maibach, Howard

    2018-06-11

    Single or multiple applications of irritants can lead to occupational contact dermatitis, and most commonly irritant contact dermatitis (ICD). Tandem irritation, the sequential application of two irritants to a target skin area, has been studied using the Tandem Repeated Irritation Test (TRIT) to provide a more accurate representation of skin irritation. Here we present an update to Kartono's review on tandem irritation studies since 2006 [1]. We surveyed the literature available on PubMed, Embase, Google Scholar, and the UCSF Dermatology library databases since 2006. The studies included discuss the tandem effects of common chemical irritants, organic solvents, occlusion as well as clinical relevance - and enlarge our ability to discern whether multiple chemical exposures are more or less likely to enhance irritation.

  13. A novel species-specific tandem repeat DNA family from Sinapis arvensis: detection of telomere-like sequences.

    PubMed

    Kapila, R; Das, S; Srivastava, P S; Lakshmikumaran, M

    1996-08-01

    DNA sequences representing a tandemly repeated DNA family of the Sinapis arvensis genome were cloned and characterized. The 700-bp tandem repeat family is represented by two clones, pSA35 and pSA52, which are 697 and 709 bp in length, respectively. Dot matrix analysis of the sequences indicates the presence of repeated elements within each monomeric unit. Sequence analysis of the repetitive region of clones pSA35 and pSA52 shows that there are several copies of a 7-bp repeat element organized in tandem. The consensus sequence of this repeat element is 5'-TTTAGGG-3'. These elements are highly mutated and the difference in length between the two clones is due to different copy numbers of these elements. The repetitive region of clone pSA35 has 26 copies of the element TTTAGGG, whereas clone pSA52 has 28 copies. The repetitive region in both clones is flanked on either side by inverted repeats that may be footprints of a transposition event. Sequence comparison indicates that the element TTTAGGG is identical to telomeric repeats present in Arabidopsis, maize, tomato, and other plants. However, Bal31 digestion kinetics indicates non-telomeric localization of the 700-bp tandem repeats. The clones represent a novel repeat family as (i) they contain telomere-like motifs as subrepeats within each unit; and (ii) they do not hybridize to related crucifers and are species-specific in nature.

  14. Tandem repeat regions within the Burkholderia pseudomallei genome and their application for high resolution genotyping.

    PubMed

    U'Ren, Jana M; Schupp, James M; Pearson, Talima; Hornstra, Heidie; Friedman, Christine L Clark; Smith, Kimothy L; Daugherty, Rebecca R Leadem; Rhoton, Shane D; Leadem, Ben; Georgia, Shalamar; Cardon, Michelle; Huynh, Lynn Y; DeShazer, David; Harvey, Steven P; Robison, Richard; Gal, Daniel; Mayo, Mark J; Wagner, David; Currie, Bart J; Keim, Paul

    2007-03-30

    The facultative, intracellular bacterium Burkholderia pseudomallei is the causative agent of melioidosis, a serious infectious disease of humans and animals. We identified and categorized tandem repeat arrays and their distribution throughout the genome of B. pseudomallei strain K96243 in order to develop a genetic typing method for B. pseudomallei. We then screened 104 of the potentially polymorphic loci across a diverse panel of 31 isolates including B. pseudomallei, B. mallei and B. thailandensis in order to identify loci with varying degrees of polymorphism. A subset of these tandem repeat arrays were subsequently developed into a multiple-locus VNTR analysis to examine 66 B. pseudomallei and 21 B. mallei isolates from around the world, as well as 95 lineages from a serial transfer experiment encompassing ~18,000 generations. B. pseudomallei contains a preponderance of tandem repeat loci throughout its genome, many of which are duplicated elsewhere in the genome. The majority of these loci are composed of repeat motif lengths of 6 to 9 bp with 4 to 10 repeat units and are predominately located in intergenic regions of the genome. Across geographically diverse B. pseudomallei and B.mallei isolates, the 32 VNTR loci displayed between 7 and 28 alleles, with Nei's diversity values ranging from 0.47 and 0.94. Mutation rates for these loci are comparable (>10-5 per locus per generation) to that of the most diverse tandemly repeated regions found in other less diverse bacteria. The frequency, location and duplicate nature of tandemly repeated regions within the B. pseudomallei genome indicate that these tandem repeat regions may play a role in generating and maintaining adaptive genomic variation. Multiple-locus VNTR analysis revealed extensive diversity within the global isolate set containing B. pseudomallei and B. mallei, and it detected genotypic differences within clonal lineages of both species that were identical using previous typing methods. Given the health

  15. A TALE-inspired computational screen for proteins that contain approximate tandem repeats

    PubMed Central

    Krwawicz, Joanna

    2017-01-01

    TAL (transcription activator-like) effectors (TALEs) are bacterial proteins that are secreted from bacteria to plant cells to act as transcriptional activators. TALEs and related proteins (RipTALs, BurrH, MOrTL1 and MOrTL2) contain approximate tandem repeats that differ in conserved positions that define specificity. Using PERL, we screened ~47 million protein sequences for TALE-like architecture characterized by approximate tandem repeats (between 30 and 43 amino acids in length) and sequence variability in conserved positions, without requiring sequence similarity to TALEs. Candidate proteins were scored according to their propensity for nuclear localization, secondary structure, repeat sequence complexity, as well as covariation and predicted structural proximity of variable residues. Biological context was tentatively inferred from co-occurrence of other domains and interactome predictions. Approximate repeats with TALE-like features that merit experimental characterization were found in a protein of chestnut blight fungus, a eukaryotic plant pathogen. PMID:28617832

  16. Small tandemly repeated DNA sequences of higher plants likely originate from a tRNA gene ancestor.

    PubMed Central

    Benslimane, A A; Dron, M; Hartmann, C; Rode, A

    1986-01-01

    Several monomers (177 bp) of a tandemly arranged repetitive nuclear DNA sequence of Brassica oleracea have been cloned and sequenced. They share up to 95% homology between one another and up to 80% with other satellite DNA sequences of Cruciferae, suggesting a common ancestor. Both strands of these monomers show more than 50% homology with many tRNA genes; the best homologies have been obtained with Lys and His yeast mitochondrial tRNA genes (respectively 64% and 60%). These results suggest that small tandemly repeated DNA sequences of plants may have evolved from a tRNA gene ancestor. These tandem repeats have probably arisen via a process involving reverse transcription of polymerase III RNA intermediates, as is the case for interspersed DNA sequences of mammalians. A model is proposed to explain the formation of such small tandemly repeated DNA sequences. Images PMID:3774553

  17. The Association of a Novel Haplotype in the Dopamine Transporter with Preschool Age Posttraumatic Stress Disorder

    PubMed Central

    Brett, Zoë H.; Henry, Caitlin; Scheeringa, Michael

    2013-01-01

    Abstract Objective Significant evidence supports a genetic contribution to the development of posttraumatic stress disorder (PTSD). Three previous studies have demonstrated an association between PTSD and the nine repeat allele of the 3′ untranslated region (3′UTR) variable number tandem repeat (VNTR) in the dopamine transporter (DAT, rs28363170). Recently a novel, functionally significant C/T single-nucleotide polymorphism (SNP) in the 3′UTR (rs27072) with putative interactions with the 3′VNTR, has been identified. To provide enhanced support for the role of DAT and striatal dopamine regulation in the development of PTSD, this study examined the impact of a haplotype defined by the C allele of rs27072 and the nine repeat allele of the 3′VNTR on PTSD diagnosis in young trauma-exposed children. Methods DAT haplotypes were determined in 150 trauma-exposed 3–6 year-old children. PTSD was assessed with a semistructured interview. After excluding double heterozygotes, analysis was performed on 143 total subjects. Haplotype was examined in relation to categorical and continuous measures of PTSD, controlling for trauma type and race. Additional analysis within the two largest race categories was performed, as other means of controlling for ethnic stratification were not available. Results The number of haplotypes (0, 1, or 2) defined by the presence of the nine repeat allele of rs28363170 (VNTR in the 3′UTR) and the C allele of rs27072 (SNP in the 3′UTR) was significantly associated with both the diagnosis of PTSD and total PTSD symptoms. Specifically, children with one or two copies of the haplotype had significantly more PTSD symptoms and were more likely to be diagnosed with PTSD than were children without this haplotype. Conclusions These findings extend previous findings associating genetic variation in the DAT with PTSD. The association of a haplotype in DAT with PTSD provides incremental traction for a model of genetic vulnerability to PTSD, a

  18. Haplotype data for 23 Y-chromosome markers in a reference sample from Bosnia and Herzegovina

    PubMed Central

    Kovačević, Lejla; Fatur-Cerić, Vera; Hadžić, Negra; Čakar, Jasmina; Primorac, Dragan; Marjanović, Damir

    2013-01-01

    Aim To detect polymorphisms of 23 Y-chromosomal short tandem repeat (STR) loci, including 6 new loci, in a reference database of male population of Bosnia and Herzegovina, as well as to assess the importance of increasing the number of Y-STR loci utilized in forensic DNA analysis. Methods The reference sample consisted of 100 healthy, unrelated men originating from Bosnia and Herzegovina. Sample collection using buccal swabs was performed in all geographical regions of Bosnia and Herzegovina in the period from 2010 to 2011. DNA samples were typed for 23 Y STR loci, including 6 new loci: DYS576, DYS481, DYS549, DYS533, DYS570, and DYS643, which are included in the new PowerPlex® Y 23 amplification kit. Results The absolute frequency of generated haplotypes was calculated and results showed that 98 samples had unique Y 23 haplotypes, and that only two samples shared the same haplotype. The most polymorphic locus was DYS418, with 14 detected alleles and the least polymorphic loci were DYS389I, DYS391, DYS437, and DYS393. Conclusion This study showed that by increasing the number of highly polymorphic Y STR markers, to include those tested in our analysis, leads to a reduction of repeating haplotypes, which is very important in the application of forensic DNA analysis. PMID:23771760

  19. Haplotype data for 23 Y-chromosome markers in a reference sample from Bosnia and Herzegovina.

    PubMed

    Kovačević, Lejla; Fatur-Cerić, Vera; Hadzic, Negra; Čakar, Jasmina; Primorac, Dragan; Marjanović, Damir

    2013-06-01

    To detect polymorphisms of 23 Y-chromosomal short tandem repeat (STR) loci, including 6 new loci, in a reference database of male population of Bosnia and Herzegovina, as well as to assess the importance of increasing the number of Y-STR loci utilized in forensic DNA analysis. The reference sample consisted of 100 healthy, unrelated men originating from Bosnia and Herzegovina. Sample collection using buccal swabs was performed in all geographical regions of Bosnia and Herzegovina in the period from 2010 to 2011. DNA samples were typed for 23 Y STR loci, including 6 new loci: DYS576, DYS481, DYS549, DYS533, DYS570, and DYS643, which are included in the new PowerPlex® Y 23 amplification kit. The absolute frequency of generated haplotypes was calculated and results showed that 98 samples had unique Y 23 haplotypes, and that only two samples shared the same haplotype. The most polymorphic locus was DYS418, with 14 detected alleles and the least polymorphic loci were DYS389I, DYS391, DYS437, and DYS393. This study showed that by increasing the number of highly polymorphic Y STR markers, to include those tested in our analysis, leads to a reduction of repeating haplotypes, which is very important in the application of forensic DNA analysis.

  20. Molecular tandem repeat strategy for elucidating mechanical properties of high-strength proteins

    PubMed Central

    Jung, Huihun; Pena-Francesch, Abdon; Saadat, Alham; Sebastian, Aswathy; Kim, Dong Hwan; Hamilton, Reginald F.; Albert, Istvan; Allen, Benjamin D.; Demirel, Melik C.

    2016-01-01

    Many globular and structural proteins have repetitions in their sequences or structures. However, a clear relationship between these repeats and their contribution to the mechanical properties remains elusive. We propose a new approach for the design and production of synthetic polypeptides that comprise one or more tandem copies of a single unit with distinct amorphous and ordered regions. Our designed sequences are based on a structural protein produced in squid suction cups that has a segmented copolymer structure with amorphous and crystalline domains. We produced segmented polypeptides with varying repeat number, while keeping the lengths and compositions of the amorphous and crystalline regions fixed. We showed that mechanical properties of these synthetic proteins could be tuned by modulating their molecular weights. Specifically, the toughness and extensibility of synthetic polypeptides increase as a function of the number of tandem repeats. This result suggests that the repetitions in native squid proteins could have a genetic advantage for increased toughness and flexibility. PMID:27222581

  1. Variable-Number Tandem Repeats That Are Useful in Genotyping Isolates of Salmonella enterica subsp. enterica Serovars Typhimurium and Newport▿

    PubMed Central

    Witonski, D. ; Stefanova, R.; Ranganathan, A.; Schutze, G. E.; Eisenach, K. D.; Cave, M. D.

    2006-01-01

    The genome of Salmonella enterica subsp. enterica serovar Typhimurium strain LT2 was analyzed for direct repeats, and 54 sequences containing variable-number tandem repeat loci were identified. Ten primer pairs that anneal upstream and downstream of each selected locus were designed and used to amplify PCR targets in isolates of S. enterica serovars Typhimurium and Newport. Four of the 10 loci did not show polymorphism in the length of products. Six loci were selected for analysis. Isolates of S. enterica serovars Typhimurium and Newport that were related to specific outbreaks and showed identical pulsed-field gel electrophoresis patterns were indistinguishable by the length of the six variable-number tandem repeats. Isolates that differed in their pulsed-field gel electrophoresis patterns showed polymorphism in variable-number tandem repeat profiles. Length of the products was confirmed by DNA sequence analysis. Only 2 of the 10 loci contained exact integers of the direct repeat. Eight loci contained partial copies. The partial copies were maintained at the ends of the variable-number tandem repeat loci in all isolates. In spite of having partial copies that were maintained in all isolates, the number of direct repeats at a locus was polymorphic. Six variable-number tandem repeat loci were useful in distinguishing isolates of S. enterica serovars Typhimurium and Newport that had different pulsed-field gel electrophoresis patterns and in identifying outbreak-associated cases that shared a common pulsed-field gel pattern. PMID:16943354

  2. Chicken microsatellite markers isolated from libraries enriched for simple tandem repeats.

    PubMed

    Gibbs, M; Dawson, D A; McCamley, C; Wardle, A F; Armour, J A; Burke, T

    1997-12-01

    The total number of microsatellite loci is considered to be at least 10-fold lower in avian species than in mammalian species. Therefore, efficient large-scale cloning of chicken microsatellites, as required for the construction of a high-resolution linkage map, is facilitated by the construction of libraries using an enrichment strategy. In this study, a plasmid library enriched for tandem repeats was constructed from chicken genomic DNA by hybridization selection. Using this technique the proportion of recombinant clones that cross-hybridized to probes containing simple tandem repeats was raised to 16%, compared with < 0.1% in a non-enriched library. Primers were designed from 121 different sequences. Polymerase chain reaction (PCR) analysis of two chicken reference pedigrees enabled 72 loci to be localized within the collaborative chicken genetic map, and at least 30 of the remaining loci have been shown to be informative in these or other crosses.

  3. Sunflower centromeres consist of a centromere-specific LINE and a chromosome-specific tandem repeat.

    PubMed

    Nagaki, Kiyotaka; Tanaka, Keisuke; Yamaji, Naoki; Kobayashi, Hisato; Murata, Minoru

    2015-01-01

    The kinetochore is a protein complex including kinetochore-specific proteins that plays a role in chromatid segregation during mitosis and meiosis. The complex associates with centromeric DNA sequences that are usually species-specific. In plant species, tandem repeats including satellite DNA sequences and retrotransposons have been reported as centromeric DNA sequences. In this study on sunflowers, a cDNA-encoding centromere-specific histone H3 (CENH3) was isolated from a cDNA pool from a seedling, and an antibody was raised against a peptide synthesized from the deduced cDNA. The antibody specifically recognized the sunflower CENH3 (HaCENH3) and showed centromeric signals by immunostaining and immunohistochemical staining analysis. The antibody was also applied in chromatin immunoprecipitation (ChIP)-Seq to isolate centromeric DNA sequences and two different types of repetitive DNA sequences were identified. One was a long interspersed nuclear element (LINE)-like sequence, which showed centromere-specific signals on almost all chromosomes in sunflowers. This is the first report of a centromeric LINE sequence, suggesting possible centromere targeting ability. Another type of identified repetitive DNA was a tandem repeat sequence with a 187-bp unit that was found only on a pair of chromosomes. The HaCENH3 content of the tandem repeats was estimated to be much higher than that of the LINE, which implies centromere evolution from LINE-based centromeres to more stable tandem-repeat-based centromeres. In addition, the epigenetic status of the sunflower centromeres was investigated by immunohistochemical staining and ChIP, and it was found that centromeres were heterochromatic.

  4. 5meCpG epigenetic marks neighboring a primate-conserved core promoter short tandem repeat indicate X-chromosome inactivation.

    PubMed

    Machado, Filipe Brum; Machado, Fabricio Brum; Faria, Milena Amendro; Lovatel, Viviane Lamim; Alves da Silva, Antonio Francisco; Radic, Claudia Pamela; De Brasi, Carlos Daniel; Rios, Álvaro Fabricio Lopes; de Sousa Lopes, Susana Marina Chuva; da Silveira, Leonardo Serafim; Ruiz-Miranda, Carlos Ramon; Ramos, Ester Silveira; Medina-Acosta, Enrique

    2014-01-01

    X-chromosome inactivation (XCI) is the epigenetic transcriptional silencing of an X-chromosome during the early stages of embryonic development in female eutherian mammals. XCI assures monoallelic expression in each cell and compensation for dosage-sensitive X-linked genes between females (XX) and males (XY). DNA methylation at the carbon-5 position of the cytosine pyrimidine ring in the context of a CpG dinucleotide sequence (5meCpG) in promoter regions is a key epigenetic marker for transcriptional gene silencing. Using computational analysis, we revealed an extragenic tandem GAAA repeat 230-bp from the landmark CpG island of the human X-linked retinitis pigmentosa 2 RP2 promoter whose 5meCpG status correlates with XCI. We used this RP2 onshore tandem GAAA repeat to develop an allele-specific 5meCpG-based PCR assay that is highly concordant with the human androgen receptor (AR) exonic tandem CAG repeat-based standard HUMARA assay in discriminating active (Xa) from inactive (Xi) X-chromosomes. The RP2 onshore tandem GAAA repeat contains neutral features that are lacking in the AR disease-linked tandem CAG repeat, is highly polymorphic (heterozygosity rates approximately 0.8) and shows minimal variation in the Xa/Xi ratio. The combined informativeness of RP2/AR is approximately 0.97, and this assay excels at determining the 5meCpG status of alleles at the Xp (RP2) and Xq (AR) chromosome arms in a single reaction. These findings are relevant and directly translatable to nonhuman primate models of XCI in which the AR CAG-repeat is monomorphic. We conducted the RP2 onshore tandem GAAA repeat assay in the naturally occurring chimeric New World monkey marmoset (Callitrichidae) and found it to be informative. The RP2 onshore tandem GAAA repeat will facilitate studies on the variable phenotypic expression of dominant and recessive X-linked diseases, epigenetic changes in twins, the physiology of aging hematopoiesis, the pathogenesis of age-related hematopoietic

  5. 5meCpG Epigenetic Marks Neighboring a Primate-Conserved Core Promoter Short Tandem Repeat Indicate X-Chromosome Inactivation

    PubMed Central

    Machado, Filipe Brum; Machado, Fabricio Brum; Faria, Milena Amendro; Lovatel, Viviane Lamim; Alves da Silva, Antonio Francisco; Radic, Claudia Pamela; De Brasi, Carlos Daniel; Rios, Álvaro Fabricio Lopes; de Sousa Lopes, Susana Marina Chuva; da Silveira, Leonardo Serafim; Ruiz-Miranda, Carlos Ramon; Ramos, Ester Silveira; Medina-Acosta, Enrique

    2014-01-01

    X-chromosome inactivation (XCI) is the epigenetic transcriptional silencing of an X-chromosome during the early stages of embryonic development in female eutherian mammals. XCI assures monoallelic expression in each cell and compensation for dosage-sensitive X-linked genes between females (XX) and males (XY). DNA methylation at the carbon-5 position of the cytosine pyrimidine ring in the context of a CpG dinucleotide sequence (5meCpG) in promoter regions is a key epigenetic marker for transcriptional gene silencing. Using computational analysis, we revealed an extragenic tandem GAAA repeat 230-bp from the landmark CpG island of the human X-linked retinitis pigmentosa 2 RP2 promoter whose 5meCpG status correlates with XCI. We used this RP2 onshore tandem GAAA repeat to develop an allele-specific 5meCpG-based PCR assay that is highly concordant with the human androgen receptor (AR) exonic tandem CAG repeat-based standard HUMARA assay in discriminating active (Xa) from inactive (Xi) X-chromosomes. The RP2 onshore tandem GAAA repeat contains neutral features that are lacking in the AR disease-linked tandem CAG repeat, is highly polymorphic (heterozygosity rates approximately 0.8) and shows minimal variation in the Xa/Xi ratio. The combined informativeness of RP2/AR is approximately 0.97, and this assay excels at determining the 5meCpG status of alleles at the Xp (RP2) and Xq (AR) chromosome arms in a single reaction. These findings are relevant and directly translatable to nonhuman primate models of XCI in which the AR CAG-repeat is monomorphic. We conducted the RP2 onshore tandem GAAA repeat assay in the naturally occurring chimeric New World monkey marmoset (Callitrichidae) and found it to be informative. The RP2 onshore tandem GAAA repeat will facilitate studies on the variable phenotypic expression of dominant and recessive X-linked diseases, epigenetic changes in twins, the physiology of aging hematopoiesis, the pathogenesis of age-related hematopoietic

  6. [Family-based association study of a variable number of tandem repeat polymorphism of DAT1 gene with Tourette syndrome in a Chinese Han population].

    PubMed

    Zheng, Lanlan; Han, Zhen-liang; Zhang, Xin-hua; Wang, Xue-qin; Jiang, Wei-hua; Yi, Ming-ji; Liu, Shi-guo

    2013-10-01

    To assess the association of a 40 bp variable number of tandem repeat (VNTR) polymorphism within 3 untranslated region of dopamine transporter gene (DAT1) with Tourette syndrome (TS) in a Chinese Han population. A total of 160 TS patients and their parents were recruited. The VNTR polymorphism was detected with polymerase chain reaction-VNTR analysis, and its association with TS and its subtypes were assessed through a family-based association study comprising transmission disequilibrium test (TDT) and haplotype relative risk (HRR) analysis. The repeat numbers at the DAT1 40 bp locus were 11, 10, 9, 7.5 and 7 among the patients and their parents, with the most common type being a 10-repeat allele. No significant association was detected between the polymorphism and TS (TDT: X ² = 0.472, df = 1, P = 0.583; HRR: X ² = 0.313, P = 0.576, OR = 0.855, 95%CI: 0.493-1.481). Our data suggested that the VNTR polymorphism of DAT1 gene is not associated with susceptibility to TS in Chinese Han population. However, our results are to be validated in larger sets of patients collected from other populations.

  7. ST proteins, a new family of plant tandem repeat proteins with a DUF2775 domain mainly found in Fabaceae and Asteraceae.

    PubMed

    Albornos, Lucía; Martín, Ignacio; Iglesias, Rebeca; Jiménez, Teresa; Labrador, Emilia; Dopico, Berta

    2012-11-07

    Many proteins with tandem repeats in their sequence have been described and classified according to the length of the repeats: I) Repeats of short oligopeptides (from 2 to 20 amino acids), including structural cell wall proteins and arabinogalactan proteins. II) Repeats that range in length from 20 to 40 residues, including proteins with a well-established three-dimensional structure often involved in mediating protein-protein interactions. (III) Longer repeats in the order of 100 amino acids that constitute structurally and functionally independent units. Here we analyse ShooT specific (ST) proteins, a family of proteins with tandem repeats of unknown function that were first found in Leguminosae, and their possible similarities to other proteins with tandem repeats. ST protein sequences were only found in dicotyledonous plants, limited to several plant families, mainly the Fabaceae and the Asteraceae. ST mRNAs accumulate mainly in the roots and under biotic interactions. Most ST proteins have one or several Domain(s) of Unknown Function 2775 (DUF2775). All deduced ST proteins have a signal peptide, indicating that these proteins enter the secretory pathway, and the mature proteins have tandem repeat oligopeptides that share a hexapeptide (E/D)FEPRP followed by 4 partially conserved amino acids, which could determine a putative N-glycosylation signal, and a fully conserved tyrosine. In a phylogenetic tree, the sequences clade according to taxonomic group. A possible involvement in symbiosis and abiotic stress as well as in plant cell elongation is suggested, although different STs could play different roles in plant development. We describe a new family of proteins called ST whose presence is limited to the plant kingdom, specifically to a few families of dicotyledonous plants. They present 20 to 40 amino acid tandem repeat sequences with different characteristics (signal peptide, DUF2775 domain, conservative repeat regions) from the described group of 20 to 40

  8. ST proteins, a new family of plant tandem repeat proteins with a DUF2775 domain mainly found in Fabaceae and Asteraceae

    PubMed Central

    2012-01-01

    Background Many proteins with tandem repeats in their sequence have been described and classified according to the length of the repeats: I) Repeats of short oligopeptides (from 2 to 20 amino acids), including structural cell wall proteins and arabinogalactan proteins. II) Repeats that range in length from 20 to 40 residues, including proteins with a well-established three-dimensional structure often involved in mediating protein-protein interactions. (III) Longer repeats in the order of 100 amino acids that constitute structurally and functionally independent units. Here we analyse ShooT specific (ST) proteins, a family of proteins with tandem repeats of unknown function that were first found in Leguminosae, and their possible similarities to other proteins with tandem repeats. Results ST protein sequences were only found in dicotyledonous plants, limited to several plant families, mainly the Fabaceae and the Asteraceae. ST mRNAs accumulate mainly in the roots and under biotic interactions. Most ST proteins have one or several Domain(s) of Unknown Function 2775 (DUF2775). All deduced ST proteins have a signal peptide, indicating that these proteins enter the secretory pathway, and the mature proteins have tandem repeat oligopeptides that share a hexapeptide (E/D)FEPRP followed by 4 partially conserved amino acids, which could determine a putative N-glycosylation signal, and a fully conserved tyrosine. In a phylogenetic tree, the sequences clade according to taxonomic group. A possible involvement in symbiosis and abiotic stress as well as in plant cell elongation is suggested, although different STs could play different roles in plant development. Conclusions We describe a new family of proteins called ST whose presence is limited to the plant kingdom, specifically to a few families of dicotyledonous plants. They present 20 to 40 amino acid tandem repeat sequences with different characteristics (signal peptide, DUF2775 domain, conservative repeat regions) from the

  9. Tandem Repeats in Proteins: Prediction Algorithms and Biological Role.

    PubMed

    Pellegrini, Marco

    2015-01-01

    Tandem repetitions in protein sequence and structure is a fascinating subject of research which has been a focus of study since the late 1990s. In this survey, we give an overview on the multi-faceted aspects of research on protein tandem repeats (PTR for short), including prediction algorithms, databases, early classification efforts, mechanisms of PTR formation and evolution, and synthetic PTR design. We also touch on the rather open issue of the relationship between PTR and flexibility (or disorder) in proteins. Detection of PTR either from protein sequence or structure data is challenging due to inherent high (biological) signal-to-noise ratio that is a key feature of this problem. As early in silico analytic tools have been key enablers for starting this field of study, we expect that current and future algorithmic and statistical breakthroughs will have a high impact on the investigations of the biological role of PTR.

  10. Linking Y-chromosomal short tandem repeat loci to human male impulsive aggression.

    PubMed

    Yang, Chun; Ba, Huajie; Cao, Yin; Dong, Guoying; Zhang, Shuyou; Gao, Zhiqin; Zhao, Hanqing; Zhou, Xianju

    2017-11-01

    Men are more susceptible to impulsive behavior than women. Epidemiological studies revealed that the impulsive aggressive behavior is affected by genetic factors, and the male-specific Y chromosome plays an important role in this behavior. In this study, we investigated the association between the impulsive aggressive behavior and Y-chromosomal short tandem repeats (Y-STRs) loci. The collected biologic samples from 271 offenders with impulsive aggressive behavior and 492 healthy individuals without impulsive aggressive behavior were amplified by PowerPlex R Y23 PCR System and the resultant products were separated by electrophoresis and further genotyped. Then, comparisons in allele and haplotype frequencies of the selected 22 Y-STRs were made in the two groups. Our results showed that there were significant differences in allele frequencies at DYS448 and DYS456 between offenders and controls ( p  < .05). Univariate analysis further revealed significant frequency differences for alleles 18 and 22 at DYS448 (0.18 vs 0.27, compared to the controls, p  = .003, OR=0.57,95% CI=0.39-0.82; 0.03 vs 0.01, compared to the controls, p  = .003, OR=7.45, 95% CI=1.57-35.35, respectively) and for allele 17 at DYS456 (0.07 vs 0.14, compared to the controls, p  = .006, OR=0.48, 95% CI =0.28-0.82) between two groups. Interestingly, the frequency of haploid haplotype 22-15 on the DYS448-DYS456 (DYS448-DYS456-22-15) was significantly higher in offenders than in controls (0.033 vs 0.004, compared to the control, p  = .001, OR = 8.42, 95%CI =1.81-39.24). Moreover, there were no significant differences in allele frequencies of other Y-STRs loci between two groups. Furthermore, the unconditional logistic regression analysis confirmed that alleles 18 and 22 at DYS448 and allele 17 at DYS456 are associated with male impulsive aggression. However, the DYS448-DYS456-22-15 is less related to impulsive aggression. Our results suggest a link between Y-chromosomal allele types and male

  11. Genome-wide analysis of tandem repeats in plants and green algae

    Treesearch

    Zhixin Zhao; Cheng Guo; Sreeskandarajan Sutharzan; Pei Li; Craig Echt; Jie Zhang; Chun Liang

    2014-01-01

    Tandem repeats (TRs) extensively exist in the genomes of prokaryotes and eukaryotes. Based on the sequenced genomes and gene annotations of 31 plant and algal species in Phytozome version 8.0 (http://www.phytozome.net/), we examined TRs in a genome-wide scale, characterized their distributions and motif features, and explored their putative biological functions. Among...

  12. Medium-sized tandem repeats represent an abundant component of the Drosophila virilis genome.

    PubMed

    Abdurashitov, Murat A; Gonchar, Danila A; Chernukhin, Valery A; Tomilov, Victor N; Tomilova, Julia E; Schostak, Natalia G; Zatsepina, Olga G; Zelentsova, Elena S; Evgen'ev, Michael B; Degtyarev, Sergey K H

    2013-11-09

    Previously, we developed a simple method for carrying out a restriction enzyme analysis of eukaryotic DNA in silico, based on the known DNA sequences of the genomes. This method allows the user to calculate lengths of all DNA fragments that are formed after a whole genome is digested at the theoretical recognition sites of a given restriction enzyme. A comparison of the observed peaks in distribution diagrams with the results from DNA cleavage using several restriction enzymes performed in vitro have shown good correspondence between the theoretical and experimental data in several cases. Here, we applied this approach to the annotated genome of Drosophila virilis which is extremely rich in various repeats. Here we explored the combined approach to perform the restriction analysis of D. virilis DNA. This approach enabled to reveal three abundant medium-sized tandem repeats within the D. virilis genome. While the 225 bp repeats were revealed previously in intergenic non-transcribed spacers between ribosomal genes of D. virilis, two other families comprised of 154 bp and 172 bp repeats were not described. Tandem Repeats Finder search demonstrated that 154 bp and 172 bp units are organized in multiple clusters in the genome of D. virilis. Characteristically, only 154 bp repeats derived from Helitron transposon are transcribed. Using in silico digestion in combination with conventional restriction analysis and sequencing of repeated DNA fragments enabled us to isolate and characterize three highly abundant families of medium-sized repeats present in the D. virilis genome. These repeats comprise a significant portion of the genome and may have important roles in genome function and structural integrity. Therefore, we demonstrated an approach which makes possible to investigate in detail the gross arrangement and expression of medium-sized repeats basing on sequencing data even in the case of incompletely assembled and/or annotated genomes.

  13. A naturally occurring, noncanonical GTP aptamer made of simple tandem repeats

    PubMed Central

    Curtis, Edward A; Liu, David R

    2014-01-01

    Recently, we used in vitro selection to identify a new class of naturally occurring GTP aptamer called the G motif. Here we report the discovery and characterization of a second class of naturally occurring GTP aptamer, the “CA motif.” The primary sequence of this aptamer is unusual in that it consists entirely of tandem repeats of CA-rich motifs as short as three nucleotides. Several active variants of the CA motif aptamer lack the ability to form consecutive Watson-Crick base pairs in any register, while others consist of repeats containing only cytidine and adenosine residues, indicating that noncanonical interactions play important roles in its structure. The circular dichroism spectrum of the CA motif aptamer is distinct from that of A-form RNA and other major classes of nucleic acid structures. Bioinformatic searches indicate that the CA motif is absent from most archaeal and bacterial genomes, but occurs in at least 70 percent of approximately 400 eukaryotic genomes examined. These searches also uncovered several phylogenetically conserved examples of the CA motif in rodent (mouse and rat) genomes. Together, these results reveal the existence of a second class of naturally occurring GTP aptamer whose sequence requirements, like that of the G motif, are not consistent with those of a canonical secondary structure. They also indicate a new and unexpected potential biochemical activity of certain naturally occurring tandem repeats. PMID:24824832

  14. [Molecular cloning and characterization of a novel Clonorchis sinensis antigenic protein containing tandem repeat sequences].

    PubMed

    Liu, Qian; Xu, Xue-Nian; Zhou, Yan; Cheng, Na; Dong, Yu-Ting; Zheng, Hua-Jun; Zhu, Yong-Qiang; Zhu, Yong-Qiang

    2013-08-01

    To find and clone new antigen genes from the lambda-ZAP cDNA expression library of adult Clonorchis sinensis, and determine the immunological characteristics of the recombinant proteins. The cDNA expression library of adult C. sinensis was screened by pooled sera of clonorchiasis patients. The sequences of the positive phage clones were compared with the sequences in EST database, and the full-length sequence of the gene (Cs22 gene) was obtained by RT-PCR. cDNA fragments containing 2 and 3 times tandem repeat sequences were generated by jumping PCR. The sequence encoding the mature peptide or the tandem repeat sequence was respectively cloned into the prokaryotic expression vector pET28a (+), and then transformed into E. coli Rosetta DE3 cells for expression. The recombinant proteins (rCs22-2r, rCs22-3r, rCs22M-2r, and rCs22M-3r) were purified by His-bind-resin (Ni-NTA) affinity chromatography. The immunogenicity of rCs22-2r and rCs22-3r was identified by ELISA. To evaluate the immunological diagnostic value of rCs22-2r and rCs22-3r, serum samples from 35 clonorchiasis patients, 31 healthy individuals, 15 schistosomiasis patients, 15 paragonimiasis westermani patients and 13 cysticercosis patients were examined by ELISA. To locate antigenic determinants, the pooled sera of clonorchiasis patients and healthy persons were analyzed for specific antibodies by ELISA with recombinant protein rCs22M-2r and rCs22M-3r containing the tandem repeat sequences. The full-length sequence of Cs22 antigen gene of C. sinensis was obtained. It contained 13 times tandem repeat sequences of EQQDGDEEGMGGDGGRGKEKGKVEGEDGAGEQKEQA. Bioinformatics analysis indicated that the protein (Cs22) belonged to GPI-anchored proteins family. The recombinant proteins rCs22-2r and rCs22-3r showed a certain level of immunogenicity. The positive rate by ELISA coated with the purified PrCs22-2r and PrCs22-3r for sera of clonorchiasis patients both were 45.7% (16/35), and 3.2% (1/31) for those of healthy

  15. Cluster analysis of European Y-chromosomal STR haplotypes using the discrete Laplace method.

    PubMed

    Andersen, Mikkel Meyer; Eriksen, Poul Svante; Morling, Niels

    2014-07-01

    The European Y-chromosomal short tandem repeat (STR) haplotype distribution has previously been analysed in various ways. Here, we introduce a new way of analysing population substructure using a new method based on clustering within the discrete Laplace exponential family that models the probability distribution of the Y-STR haplotypes. Creating a consistent statistical model of the haplotypes enables us to perform a wide range of analyses. Previously, haplotype frequency estimation using the discrete Laplace method has been validated. In this paper we investigate how the discrete Laplace method can be used for cluster analysis to further validate the discrete Laplace method. A very important practical fact is that the calculations can be performed on a normal computer. We identified two sub-clusters of the Eastern and Western European Y-STR haplotypes similar to results of previous studies. We also compared pairwise distances (between geographically separated samples) with those obtained using the AMOVA method and found good agreement. Further analyses that are impossible with AMOVA were made using the discrete Laplace method: analysis of the homogeneity in two different ways and calculating marginal STR distributions. We found that the Y-STR haplotypes from e.g. Finland were relatively homogeneous as opposed to the relatively heterogeneous Y-STR haplotypes from e.g. Lublin, Eastern Poland and Berlin, Germany. We demonstrated that the observed distributions of alleles at each locus were similar to the expected ones. We also compared pairwise distances between geographically separated samples from Africa with those obtained using the AMOVA method and found good agreement. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  16. A novel tandem repeat sequence located on human chromosome 4p: isolation and characterization.

    PubMed

    Kogi, M; Fukushige, S; Lefevre, C; Hadano, S; Ikeda, J E

    1997-06-01

    In an effort to analyze the genomic region of the distal half of human chromosome 4p, to where Huntington disease and other diseases have been mapped, we have isolated the cosmid clone (CRS447) that was likely to contain a region with specific repeat sequences. Clone CRS447 was subjected to detailed analysis, including chromosome mapping, restriction mapping, and DNA sequencing. Chromosome mapping by both a human-CHO hybrid cell panel and FISH revealed that CRS447 was predominantly located in the 4p15.1-15.3 region. CRS447 was shown to consist of tandem repeats of 4.7-kb units present on chromosome 4p. A single EcoRI unit was subcloned (pRS447), and the complete sequence was determined as 4752 nucleotides. When pRS447 was used as a probe, the number of copies of this repeat per haploid genome was estimated to be 50-70. Sequence analysis revealed that it contained two internal CA repeats and one putative ORF. Database search established that this sequence was unreported. However, two homologous STS markers were found in the database. We concluded that CRS447/pRS447 is a novel tandem repeat sequence that is mainly specific to human chromosome 4p.

  17. Construction and forensic genetic characterization of 11 autosomal haplotypes consisting of 22 tri-allelic indels.

    PubMed

    Zhao, Xiaohong; Chen, Xiaogang; Zhao, Yuancun; Zhang, Shu; Gao, Zehua; Yang, Yiwen; Wang, Yufang; Zhang, Ji

    2018-05-01

    Insertion/deletion polymorphisms (indels), which combine the advantages of both short tandem repeats and single-nucleotide polymorphisms, are suitable for parentage testing. To overcome the limitations of the low polymorphism of di-allelic indels, we constructed a set of haplotypes with physically linked, multi-allelic indels. Candidate haplotypes were selected from the 1000 Genomes Project database, and were subject to the following criteria for inclusion: (i) each marker must have a minimum allele frequency (MAF) of ≥0.1 in the Han population of China; (ii) markers must exist in a non-coding region; (iii) the physical distance between a pair of candidate indels must be <500 bp; (iv) the allele length variation of each indel from 1 to 20 bp; (v) different haplotypes must be located on different chromosomes or chromosomal arms, or be more than 10 Mb apart if on the same chromosomal arm; and (vi) they must not be located across a recombination hotspot. A multiplex system with 11 haplotype markers, comprising 22 tri-allelic indel loci distributed over 10 chromosomes was developed. To validate the multiplex panel, we investigated the haplotype distribution in sets of two and three-generation pedigrees. The results demonstrated that the haplotypes consisting of multi-allelic indel markers exhibited higher polymorphism than a single indel locus, and thus provide Supplementary information for forensic kinship identification. Copyright © 2018 Elsevier B.V. All rights reserved.

  18. MULTIPLE-LOCUS VARIABLE-NUMBER TANDEM REPEAT ANALYSIS OF BRUCELLA ISOLATES FROM THAILAND.

    PubMed

    Kumkrong, Khurawan; Chankate, Phanita; Tonyoung, Wittawat; Intarapuk, Apiradee; Kerdsin, Anusak; Kalambaheti, Thareerat

    2017-01-01

    Brucellosis-induced abortion can result in significant economic loss to farm animals. Brucellosis can be transmitted to humans during slaughter of infected animals or via consumption of contaminated food products. Strain identification of Brucella isolates can reveal the route of transmission. Brucella strains were isolated from vaginal swabs of farm animal, cow milk and from human blood cultures. Multiplex PCR was used to identify Brucella species, and owing to high DNA homology among Brucella isolates, multiple-locus variable-number tandem repeat analysis (MLVA) based on the number of tandem repeats at 16 different genomic loci was used for strain identification. Multiplex PCR categorized the isolates into B. abortus (n = 7), B. melitensis (n = 37), B. suis (n = 3), and 5 of unknown Brucella spp. MLVA-16 clustering analysis differentiated the strains into various genotypes, with Brucella isolates from the same geographic region being closely related, and revealed that the Thai isolates were phylogenetically distinct from those in other countries, including within the Southeast Asian region. Thus, MLVA-16 typing has utility in epidemiological studies.

  19. Investigation of extended Y chromosome STR haplotypes in Sardinia.

    PubMed

    Lacerenza, D; Aneli, S; Di Gaetano, C; Critelli, R; Piazza, A; Matullo, G; Culigioni, C; Robledo, R; Robino, C; Calò, C

    2017-03-01

    Y-chromosomal variation of selected single nucleotide polymorphisms (SNPs) and 32 short tandem repeat (STR) loci was evaluated in Sardinia in three open population groups (Northern Sardinia, n=40; Central Sardinia, n=56; Southern Sardinia, n=91) and three isolates (Desulo, n=34; Benetutti, n=45, Carloforte, n=42). The tested Y-STRs consisted of Yfiler ® Plus markers and the seven rapidly mutating (RM) loci not included in the YFiler ® Plus kit (DYF399S1, DYF403S1ab, DYF404S1, DYS526ab, DYS547, DYS612, and DYS626). As expected, inclusion of additional Y-STR loci increased haplotype diversity (h), though complete differentiation of male lineages was impossible even by means of RM Y-STRs (h=0.99997). Analysis of molecular variance indicated that the three open populations were fairly homogeneous, whereas signs of genetic heterogeneity could be detected when the three isolates were also included in the analysis. Multidimensional scaling analysis showed that, even for extended haplotypes including RM Y-STR markers, Sardinians were clearly differentiated from populations of the Italian peninsula and Sicily. The only exception was represented by the Carloforte sample that, in accordance with its peculiar population history, clustered with Northern/Central Italian populations. The introduction of extended forensic Y-STR panels, including highly variable RM Y-STR markers, is expected to reduce the impact of population structure on haplotype frequency estimations. However, our results show that the availability of geographically detailed reference databases is still important for the assessment of the evidential value of a Y-haplotype match. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  20. STRBase: a short tandem repeat DNA database for the human identity testing community

    PubMed Central

    Ruitberg, Christian M.; Reeder, Dennis J.; Butler, John M.

    2001-01-01

    The National Institute of Standards and Technology (NIST) has compiled and maintained a Short Tandem Repeat DNA Internet Database (http://www.cstl.nist.gov/biotech/strbase/) since 1997 commonly referred to as STRBase. This database is an information resource for the forensic DNA typing community with details on commonly used short tandem repeat (STR) DNA markers. STRBase consolidates and organizes the abundant literature on this subject to facilitate on-going efforts in DNA typing. Observed alleles and annotated sequence for each STR locus are described along with a review of STR analysis technologies. Additionally, commercially available STR multiplex kits are described, published polymerase chain reaction (PCR) primer sequences are reported, and validation studies conducted by a number of forensic laboratories are listed. To supplement the technical information, addresses for scientists and hyperlinks to organizations working in this area are available, along with the comprehensive reference list of over 1300 publications on STRs used for DNA typing purposes. PMID:11125125

  1. An examination of the origin and evolution of additional tandem repeats in the mitochondrial DNA control region of Japanese sika deer (Cervus Nippon).

    PubMed

    Ba, Hengxing; Wu, Lang; Liu, Zongyue; Li, Chunyi

    2016-01-01

    Tandem repeat units are only detected in the left domain of the mitochondrial DNA control region in sika deer. Previous studies showed that Japanese sika deer have more tandem repeat units than its cousins from the Asian continent and Taiwan, which often have only three repeat units. To determine the origin and evolution of these additional repeat units in Japanese sika deer, we obtained the sequence of repeat units from an expanded dataset of the control region from all sika deer lineages. The functional constraint is inferred to act on the first repeat unit because this repeat has the least sequence divergence in comparison to the other units. Based on slipped-strand mispairing mechanisms, the illegitimate elongation model could account for the addition or deletion of these additional repeat units in the Japanese sika deer population. We also report that these additional repeat units could be occurring in the internal positions of tandem repeat regions, possibly via coupling with a homogenization mechanism within and among these lineages. Moreover, the increased number of repeat units in the Japanese sika deer population could reflect a balance between mutation and selection, as well as genetic drift.

  2. PolyQ repeat expansions in ATXN2 associated with ALS are CAA interrupted repeats.

    PubMed

    Yu, Zhenming; Zhu, Yongqing; Chen-Plotkin, Alice S; Clay-Falcone, Dana; McCluskey, Leo; Elman, Lauren; Kalb, Robert G; Trojanowski, John Q; Lee, Virginia M-Y; Van Deerlin, Vivianna M; Gitler, Aaron D; Bonini, Nancy M

    2011-03-29

    Amyotrophic lateral sclerosis (ALS) is a devastating, rapidly progressive disease leading to paralysis and death. Recently, intermediate length polyglutamine (polyQ) repeats of 27-33 in ATAXIN-2 (ATXN2), encoding the ATXN2 protein, were found to increase risk for ALS. In ATXN2, polyQ expansions of ≥ 34, which are pure CAG repeat expansions, cause spinocerebellar ataxia type 2. However, similar length expansions that are interrupted with other codons, can present atypically with parkinsonism, suggesting that configuration of the repeat sequence plays an important role in disease manifestation in ATXN2 polyQ expansion diseases. Here we determined whether the expansions in ATXN2 associated with ALS were pure or interrupted CAG repeats, and defined single nucleotide polymorphisms (SNPs) rs695871 and rs695872 in exon 1 of the gene, to assess haplotype association. We found that the expanded repeat alleles of 40 ALS patients and 9 long-repeat length controls were all interrupted, bearing 1-3 CAA codons within the CAG repeat. 21/21 expanded ALS chromosomes with 3CAA interruptions arose from one haplotype (GT), while 18/19 expanded ALS chromosomes with <3CAA interruptions arose from a different haplotype (CC). Moreover, age of disease onset was significantly earlier in patients bearing 3 interruptions vs fewer, and was distinct between haplotypes. These results indicate that CAG repeat expansions in ATXN2 associated with ALS are uniformly interrupted repeats and that the nature of the repeat sequence and haplotype, as well as length of polyQ repeat, may play a role in the neurological effect conferred by expansions in ATXN2.

  3. Tandem repeated application of organic solvents and sodium lauryl sulphate enhances cumulative skin irritation.

    PubMed

    Schliemann, Sibylle; Schmidt, Christina; Elsner, Peter

    2014-01-01

    The objective of our study was to investigate the tandem irritation potential of two organic solvents with concurrent exposure to the hydrophilic detergent irritant sodium lauryl sulphate (SLS). A tandem repeated irritation test was performed with two undiluted organic solvents, cumene (C) and octane (O), with either alternating application with SLS 0.5% or twice daily application of each irritant alone in 27 volunteers on the skin of the back. The cumulative irritation induced over 4 days was quantified using visual scoring and non-invasive bioengineering measurements (skin colour reflectance, skin hydration and transepidermal water loss). Repeated application of C/SLS and O/SLS induced more decline of stratum corneum hydration and higher degrees of clinical irritation and erythema compared to each irritant alone. Our results demonstrate a further example of additive harmful skin effects induced by particular skin irritants and indicate that exposure to organic solvents together with detergents may increase the risk of acquiring occupational contact dermatitis. © 2014 S. Karger AG, Basel.

  4. Multi-locus variable number tandem repeat analysis for Escherichia coli causing extraintestinal infections.

    PubMed

    Manges, Amee R; Tellis, Patricia A; Vincent, Caroline; Lifeso, Kimberley; Geneau, Geneviève; Reid-Smith, Richard J; Boerlin, Patrick

    2009-11-01

    Discriminatory genotyping methods for the analysis of Escherichia coli other than O157:H7 are necessary for public health-related activities. A new multi-locus variable number tandem repeat analysis protocol is presented; this method achieves an index of discrimination of 99.5% and is reproducible and valid when tested on a collection of 836 diverse E. coli.

  5. Whole-genome sequencing reveals a coding non-pathogenic variant tagging a non-coding pathogenic hexanucleotide repeat expansion in C9orf72 as cause of amyotrophic lateral sclerosis.

    PubMed

    Herdewyn, Sarah; Zhao, Hui; Moisse, Matthieu; Race, Valérie; Matthijs, Gert; Reumers, Joke; Kusters, Benno; Schelhaas, Helenius J; van den Berg, Leonard H; Goris, An; Robberecht, Wim; Lambrechts, Diether; Van Damme, Philip

    2012-06-01

    Motor neuron degeneration in amyotrophic lateral sclerosis (ALS) has a familial cause in 10% of patients. Despite significant advances in the genetics of the disease, many families remain unexplained. We performed whole-genome sequencing in five family members from a pedigree with autosomal-dominant classical ALS. A family-based elimination approach was used to identify novel coding variants segregating with the disease. This list of variants was effectively shortened by genotyping these variants in 2 additional unaffected family members and 1500 unrelated population-specific controls. A novel rare coding variant in SPAG8 on chromosome 9p13.3 segregated with the disease and was not observed in controls. Mutations in SPAG8 were not encountered in 34 other unexplained ALS pedigrees, including 1 with linkage to chromosome 9p13.2-23.3. The shared haplotype containing the SPAG8 variant in this small pedigree was 22.7 Mb and overlapped with the core 9p21 linkage locus for ALS and frontotemporal dementia. Based on differences in coverage depth of known variable tandem repeat regions between affected and non-affected family members, the shared haplotype was found to contain an expanded hexanucleotide (GGGGCC)(n) repeat in C9orf72 in the affected members. Our results demonstrate that rare coding variants identified by whole-genome sequencing can tag a shared haplotype containing a non-coding pathogenic mutation and that changes in coverage depth can be used to reveal tandem repeat expansions. It also confirms (GGGGCC)n repeat expansions in C9orf72 as a cause of familial ALS.

  6. Founder haplotype analysis of Fanconi anemia in the Korean population finds common ancestral haplotypes for a FANCG variant.

    PubMed

    Park, Joonhong; Kim, Myungshin; Jang, Woori; Chae, Hyojin; Kim, Yonggoo; Chung, Nack-Gyun; Lee, Jae-Wook; Cho, Bin; Jeong, Dae-Chul; Park, In Yang; Park, Mi Sun

    2015-05-01

    A common ancestral haplotype is strongly suggested in the Korean and Japanese patients with Fanconi anemia (FA), because common mutations have been frequently found: c.2546delC and c.3720_3724delAAACA of FANCA; c.307+1G>C, c.1066C>T, and c.1589_1591delATA of FANCG. Our aim in this study was to investigate the origin of these common mutations of FANCA and FANCG. We genotyped 13 FA patients consisting of five FA-A patients and eight FA-G patients from the Korean FA population. Microsatellite markers used for haplotype analysis included four CA repeat markers which are closely linked with FANCA and eight CA repeat markers which are contiguous with FANCG. As a result, Korean FA-A patients carrying c.2546delC or c.3720_3724delAAACA did not share the same haplotypes. However, three unique haplotypes carrying c.307+1G>C, c.1066C > T, or c.1589_1591delATA, that consisted of eight polymorphic loci covering a flanking region were strongly associated with Korean FA-G, consistent with founder haplotypes reported previously in the Japanese FA-G population. Our finding confirmed the common ancestral haplotypes on the origins of the East Asian FA-G patients, which will improve our understanding of the molecular population genetics of FA-G. To the best of our knowledge, this is the first report on the association between disease-linked mutations and common ancestral haplotypes in the Korean FA population. © 2015 John Wiley & Sons Ltd/University College London.

  7. Intratypic variability of a tandem repeat locus within the DNA polymerase gene of human herpes simplex virus type 2.

    PubMed

    Sun, Yongjiang; Chan, Roy Kum Wah; Tan, Suat Hoon

    2004-01-01

    In this study, the irntratypic variability of a tandem repeat locus within the DNA polymerase (pol) gene of human herpes simplex virus type 2 (HSV2) was uncovered. The locus contained variable numbers of tandem dodecanucleotide (5'-GAC GAG GAC GGG-3') repetitive units. Our result showed that approximately 95% of analyzed HSV2 clinical isolates and the current GenBank HSV2 strains contained two copies of the repetitive units. From genital herpes specimens, three new HSV2 strains, which respectively contained 1, 3, and 4 copies of the repetitive units, were identified. This variable number of tandem repeat (VNTR) locus is absent in HSV1, and thus it also contributes to the intertypic variability of HSV1 and HSV2. The intratypic variability of the locus may be useful for HSV2 strain genotyping and this application is discussed.

  8. Altered Methylation in Tandem Repeat Element and Elemental Component Levels in Inhalable Air Particles

    PubMed Central

    Hou, Lifang; Zhang, Xiao; Zheng, Yinan; Wang, Sheng; Dou, Chang; Guo, Liqiong; Byun, Hyang-Min; Motta, Valeria; McCracken, John; Díaz, Anaité; Kang, Choong-Min; Koutrakis, Petros; Bertazzi, Pier Alberto; Li, Jingyun; Schwartz, Joel; Baccarelli, Andrea A.

    2014-01-01

    Exposure to particulate matter (PM) has been associated with lung cancer risk in epidemiology investigations. Elemental components of PM have been suggested to have critical roles in PM toxicity, but the molecular mechanisms underlying their association with cancer risks remain poorly understood. DNA methylation has emerged as a promising biomarker for environmental-related diseases, including lung cancer. In this study, we evaluated the effects of PM elemental components on methylation of three tandem repeats in a highly-exposed population in Beijing, China. The Beijing Truck Driver Air Pollution Study was conducted shortly before the 2008 Beijing Olympic Games (June 15-July 27, 2008) and included 60 truck drivers and 60 office workers. On two days separated by 1-2 weeks, we measured blood DNA methylation of SATα, NBL2, D4Z4, and personal exposure to eight elemental components in PM2.5, including aluminum (Al), silicon (Si), sulfur (S), potassium (K), calcium (Ca) titanium (Ti), iron (Fe), and zinc (Zn). We estimated the associations of individual elemental component with each tandem repeat methylation in generalized estimating equations (GEE) models adjusted for PM2.5 mass and other covariates. Out of the eight examined elements, NBL2 methylation was positively associated with concentrations of Si (0.121, 95%CI: 0.030; 0.212, FDR=0.047) and Ca (0.065, 95%CI: 0.014; 0.115, FDR=0.047) in truck drivers. In office workers, SATα methylation was positively associated with concentrations of S (0.115, 95%CI: 0.034; 0.196, FDR=0.042). PM-associated differences in blood tandem-repeat methylation may help detect biological effects of the exposure and identify individuals who may eventually experience higher lung cancer risk. PMID:24273195

  9. Variable-number tandem repeats as molecular markers for biotypes of Pasteuria ramosa in Daphnia spp.

    PubMed

    Mouton, Laurence; Nong, Guang; Preston, James F; Ebert, Dieter

    2007-06-01

    Variable-number tandem repeats (VNTRs) have been identified in populations of Pasteuria ramosa, a castrating endobacterium of Daphnia species. The allelic polymorphisms at 14 loci in laboratory and geographically diverse soil samples showed that VNTRs may serve as biomarkers for the genetic characterization of P. ramosa isolates.

  10. De novo generation of plant centromeres at tandem repeats.

    PubMed

    Teo, Chee How; Lermontova, Inna; Houben, Andreas; Mette, Michael Florian; Schubert, Ingo

    2013-06-01

    Artificial minichromosomes are highly desirable tools for basic research, breeding, and biotechnology purposes. We present an option to generate plant artificial minichromosomes via de novo engineering of plant centromeres in Arabidopsis thaliana by targeting kinetochore proteins to tandem repeat arrays at non-centromeric positions. We employed the bacterial lactose repressor/lactose operator system to guide derivatives of the centromeric histone H3 variant cenH3 to LacO operator sequences. Tethering of cenH3 to non-centromeric loci led to de novo assembly of kinetochore proteins and to dicentric carrier chromosomes which potentially form anaphase bridges. This approach will be further developed and may contribute to generating minichromosomes from preselected genomic regions, potentially even in a diploid background.

  11. Stability of Tandem Repeats in the Drosophila Melanogaster HSR-Omega Nuclear RNA

    PubMed Central

    Hogan, N. C.; Slot, F.; Traverse, K. L.; Garbe, J. C.; Bendena, W. G.; Pardue, M. L.

    1995-01-01

    The Drosophila melanogaster Hsr-omega locus produces a nuclear RNA containing >5 kb of tandem repeat sequences. These repeats are unique to Hsr-omega and show concerted evolution similar to that seen with classical satellite DNAs. In D. melanogaster the monomer is ~280 bp. Sequences of 191/2 monomers differ by 8 +/- 5% (mean +/- SD), when all pairwise comparisons are considered. Differences are single nucleotide substitutions and 1-3 nucleotide deletions/insertions. Changes appear to be randomly distributed over the repeat unit. Outer repeats do not show the decrease in monomer homogeneity that might be expected if homogeneity is maintained by recombination. However, just outside the last complete repeat at each end, there are a few fragments of sequence similar to the monomer. The sequences in these flanking regions are not those predicted for sequences decaying in the absence of recombination. Instead, the fragmentation of the sequence homology suggests that flanking regions have undergone more severe disruptions, possibly during an insertion or amplification event. Hsr-omega alleles differing in the number of repeats are detected and appear to be stable over a few thousand generations; however, both increases and decreases in repeat numbers have been observed. The new alleles appear to be as stable as their predecessors. No alleles of less than ~5 kb nor more than ~16 kb of repeats were seen in any stocks examined. The evidence that there is a limit on the minimum number of repeats is consistent with the suggestion that these repeats are important in the function of the unusual Hsr-omega nuclear RNA. PMID:7540581

  12. Towards Development of Clustering Applications for Large-Scale Comparative Genotyping and Kinship Analysis Using Y-Short Tandem Repeats.

    PubMed

    Seman, Ali; Sapawi, Azizian Mohd; Salleh, Mohd Zaki

    2015-06-01

    Y-chromosome short tandem repeats (Y-STRs) are genetic markers with practical applications in human identification. However, where mass identification is required (e.g., in the aftermath of disasters with significant fatalities), the efficiency of the process could be improved with new statistical approaches. Clustering applications are relatively new tools for large-scale comparative genotyping, and the k-Approximate Modal Haplotype (k-AMH), an efficient algorithm for clustering large-scale Y-STR data, represents a promising method for developing these tools. In this study we improved the k-AMH and produced three new algorithms: the Nk-AMH I (including a new initial cluster center selection), the Nk-AMH II (including a new dominant weighting value), and the Nk-AMH III (combining I and II). The Nk-AMH III was the superior algorithm, with mean clustering accuracy that increased in four out of six datasets and remained at 100% in the other two. Additionally, the Nk-AMH III achieved a 2% higher overall mean clustering accuracy score than the k-AMH, as well as optimal accuracy for all datasets (0.84-1.00). With inclusion of the two new methods, the Nk-AMH III produced an optimal solution for clustering Y-STR data; thus, the algorithm has potential for further development towards fully automatic clustering of any large-scale genotypic data.

  13. Development of Multiple-Locus Variable-Number Tandem-Repeat Analysis for Molecular Subtyping of Campylobacter jejuni by Using Capillary Electrophoresis

    PubMed Central

    Techaruvichit, Punnida; Vesaratchavest, Mongkol; Keeratipibul, Suwimon; Kuda, Takashi; Kimura, Bon

    2015-01-01

    Campylobacter jejuni is a common cause of the frequently reported food-borne diseases in developed and developing nations. This study describes the development of multiple-locus variable-number tandem-repeat (VNTR) analysis (MLVA) using capillary electrophoresis as a novel typing method for microbial source tracking and epidemiological investigation of C. jejuni. Among 36 tandem repeat loci detected by the Tandem Repeat Finder program, 7 VNTR loci were selected and used for characterizing 60 isolates recovered from chicken meat samples from retail shops, samples from chicken meat processing factory, and stool samples. The discrimination ability of MLVA was compared with that of multilocus sequence typing (MLST). MLVA (diversity index of 0.97 with 31 MLVA types) provided slightly higher discrimination than MLST (diversity index of 0.95 with 25 MLST types). The overall concordance between MLVA and MLST was estimated at 63% by adjusted Rand coefficient. MLVA predicted MLST type better than MLST predicted MLVA type, as reflected by Wallace coefficient (Wallace coefficient for MLVA to MLST versus MLST to MLVA, 86% versus 51%). MLVA is a useful tool and can be used for effective monitoring of C. jejuni and investigation of epidemics caused by C. jejuni. PMID:26025899

  14. GENETIC VARIATION IN RED RASPBERRIES (RUBUS IDAEUS L.; ROSACEAE) FROM SITES DIFFERING IN ORGANIC POLLUTANTS COMPARED WITH SYNTHETIC TANDEM REPEAT DNA PROBES

    EPA Science Inventory

    Two synthetic tandem repetitive DNA probes were used to compare genetic variation at variable-number-tandem-repeat (VNTR) loci among Rubus idaeus L. var. strigosus (Michx.) Maxim. (Rosaceae) individuals sampled at eight sites contaminated by pollutants (N = 39) and eight adjacent...

  15. Fingerprinting of Cyanobacteria Based on PCR with Primers Derived from Short and Long Tandemly Repeated Repetitive Sequences

    PubMed Central

    Rasmussen, Ulla; Svenning, Mette M.

    1998-01-01

    The presence of repeated DNA (short tandemly repeated repetitive [STRR] and long tandemly repeated repetitive [LTRR]) sequences in the genome of cyanobacteria was used to generate a fingerprint method for symbiotic and free-living isolates. Primers corresponding to the STRR and LTRR sequences were used in the PCR, resulting in a method which generate specific fingerprints for individual isolates. The method was useful both with purified DNA and with intact cyanobacterial filaments or cells as templates for the PCR. Twenty-three Nostoc isolates from a total of 35 were symbiotic isolates from the angiosperm Gunnera species, including isolates from the same Gunnera species as well as from different species. The results show a genetic similarity among isolates from different Gunnera species as well as a genetic heterogeneity among isolates from the same Gunnera species. Isolates which have been postulated to be closely related or identical revealed similar results by the PCR method, indicating that the technique is useful for clustering of even closely related strains. The method was applied to nonheterocystus cyanobacteria from which a fingerprint pattern was obtained. PMID:16349487

  16. Multiple-Locus Variable-Number Tandem-Repeat Analysis in Genotyping Yersinia enterocolitica Strains from Human and Porcine Origins

    PubMed Central

    Laukkanen-Ninios, R.; Ortiz Martínez, P.; Siitonen, A.; Fredriksson-Ahomaa, M.; Korkeala, H.

    2013-01-01

    Sporadic and epidemiologically linked Yersinia enterocolitica strains (n = 379) isolated from fecal samples from human patients, tonsil or fecal samples from pigs collected at slaughterhouses, and pork samples collected at meat stores were genotyped using multiple-locus variable-number tandem-repeat analysis (MLVA) with six loci, i.e., V2A, V4, V5, V6, V7, and V9. In total, 312 different MLVA types were found. Similar types were detected (i) in fecal samples collected from human patients over 2 to 3 consecutive years, (ii) in samples from humans and pigs, and (iii) in samples from pigs that originated from the same farms. Among porcine strains, we found farm-specific MLVA profiles. Variations in the numbers of tandem repeats from one to four for variable-number tandem-repeat (VNTR) loci V2A, V5, V6, and V7 were observed within a farm. MLVA was applicable for serotypes O:3, O:5,27, and O:9 and appeared to be a highly discriminating tool for distinguishing sporadic and outbreak-related strains. With long-term use, interpretation of the results became more challenging due to variations in more-discriminating loci, as was observed for strains originating from pig farms. Additionally, we encountered unexpectedly short V2A VNTR fragments and sequenced them. According to the sequencing results, updated guidelines for interpreting V2A VNTR results were prepared. PMID:23637293

  17. Accurate typing of short tandem repeats from genome-wide sequencing data and its applications.

    PubMed

    Fungtammasan, Arkarachai; Ananda, Guruprasad; Hile, Suzanne E; Su, Marcia Shu-Wei; Sun, Chen; Harris, Robert; Medvedev, Paul; Eckert, Kristin; Makova, Kateryna D

    2015-05-01

    Short tandem repeats (STRs) are implicated in dozens of human genetic diseases and contribute significantly to genome variation and instability. Yet profiling STRs from short-read sequencing data is challenging because of their high sequencing error rates. Here, we developed STR-FM, short tandem repeat profiling using flank-based mapping, a computational pipeline that can detect the full spectrum of STR alleles from short-read data, can adapt to emerging read-mapping algorithms, and can be applied to heterogeneous genetic samples (e.g., tumors, viruses, and genomes of organelles). We used STR-FM to study STR error rates and patterns in publicly available human and in-house generated ultradeep plasmid sequencing data sets. We discovered that STRs sequenced with a PCR-free protocol have up to ninefold fewer errors than those sequenced with a PCR-containing protocol. We constructed an error correction model for genotyping STRs that can distinguish heterozygous alleles containing STRs with consecutive repeat numbers. Applying our model and pipeline to Illumina sequencing data with 100-bp reads, we could confidently genotype several disease-related long trinucleotide STRs. Utilizing this pipeline, for the first time we determined the genome-wide STR germline mutation rate from a deeply sequenced human pedigree. Additionally, we built a tool that recommends minimal sequencing depth for accurate STR genotyping, depending on repeat length and sequencing read length. The required read depth increases with STR length and is lower for a PCR-free protocol. This suite of tools addresses the pressing challenges surrounding STR genotyping, and thus is of wide interest to researchers investigating disease-related STRs and STR evolution. © 2015 Fungtammasan et al.; Published by Cold Spring Harbor Laboratory Press.

  18. Thermal denaturation of the BRCT tandem repeat region of human tumour suppressor gene product BRCA1.

    PubMed

    Pyrpassopoulos, Serapion; Ladopoulou, Angela; Vlassi, Metaxia; Papanikolau, Yannis; Vorgias, Constantinos E; Yannoukakos, Drakoulis; Nounesis, George

    2005-04-01

    Reduced stability of the tandem BRCT domains of human BReast CAncer 1 (BRCA1) due to missense mutations may be critical for loss of function in DNA repair and damage-induced checkpoint control. In the present thermal denaturation study of the BRCA1 BRCT region, high-precision differential scanning calorimetry (DSC) and circular dichroism (CD) spectroscopy provide evidence for the existence of a denatured state that is structurally very similar to the native. Consistency between theoretical structure-based estimates of the enthalpy (DeltaH) and heat capacity change (DeltaCp) and the calorimetric results is obtained when considering partial thermal unfolding contained in the region of the conserved hydrophobic pocket formed at the interface of the two BRCT repeats. The structural integrity of this region has been shown to be crucial for the interaction of BRCA1 with phosphorylated peptides. In addition, cancer-causing missense mutations located at the inter-BRCT-repeat interface have been linked to the destabilization of the tandem BRCT structure.

  19. Intergenic Variable-Number Tandem-Repeat Polymorphism Upstream of rocA Alters Toxin Production and Enhances Virulence in Streptococcus pyogenes.

    PubMed

    Zhu, Luchang; Olsen, Randall J; Horstmann, Nicola; Shelburne, Samuel A; Fan, Jia; Hu, Ye; Musser, James M

    2016-07-01

    Variable-number tandem-repeat (VNTR) polymorphisms are ubiquitous in bacteria. However, only a small fraction of them has been functionally studied. Here, we report an intergenic VNTR polymorphism that confers an altered level of toxin production and increased virulence in Streptococcus pyogenes The nature of the polymorphism is a one-unit deletion in a three-tandem-repeat locus upstream of the rocA gene encoding a sensor kinase. S. pyogenes strains with this type of polymorphism cause human infection and produce significantly larger amounts of the secreted cytotoxins S. pyogenes NADase (SPN) and streptolysin O (SLO). Using isogenic mutant strains, we demonstrate that deleting one or more units of the tandem repeats abolished RocA production, reduced CovR phosphorylation, derepressed multiple CovR-regulated virulence factors (such as SPN and SLO), and increased virulence in a mouse model of necrotizing fasciitis. The phenotypic effect of the VNTR polymorphism was nearly the same as that of inactivating the rocA gene. In summary, we identified and characterized an intergenic VNTR polymorphism in S. pyogenes that affects toxin production and virulence. These new findings enhance understanding of rocA biology and the function of VNTR polymorphisms in S. pyogenes. Copyright © 2016, American Society for Microbiology. All Rights Reserved.

  20. Functional centromeres in Astragalus sinicus include a compact centromere-specific histone H3 and a 20-bp tandem repeat.

    PubMed

    Tek, Ahmet L; Kashihara, Kazunari; Murata, Minoru; Nagaki, Kiyotaka

    2011-11-01

    The centromere plays an essential role for proper chromosome segregation during cell division and usually harbors long arrays of tandem repeated satellite DNA sequences. Although this function is conserved among eukaryotes, the sequences of centromeric DNA repeats are variable. Most of our understanding of functional centromeres, which are defined by localization of a centromere-specific histone H3 (CENH3) protein, comes from model organisms. The components of the functional centromere in legumes are poorly known. The genus Astragalus is a member of the legumes and bears the largest numbers of species among angiosperms. Therefore, we studied the components of centromeres in Astragalus sinicus. We identified the CenH3 homolog of A. sinicus, AsCenH3 that is the most compact in size among higher eukaryotes. A CENH3-based assay revealed the functional centromeric DNA sequences from A. sinicus, called CentAs. The CentAs repeat is localized in A. sinicus centromeres, and comprises an AT-rich tandem repeat with a monomer size of 20 nucleotides.

  1. The clinical application of single-sperm-based SNP haplotyping for PGD of osteogenesis imperfecta.

    PubMed

    Chen, Linjun; Diao, Zhenyu; Xu, Zhipeng; Zhou, Jianjun; Yan, Guijun; Sun, Haixiang

    2018-05-15

    : short tandem repeat; TE: trophectoderm; WGA: whole-genome amplification.

  2. The discrete Laplace exponential family and estimation of Y-STR haplotype frequencies.

    PubMed

    Andersen, Mikkel Meyer; Eriksen, Poul Svante; Morling, Niels

    2013-07-21

    Estimating haplotype frequencies is important in e.g. forensic genetics, where the frequencies are needed to calculate the likelihood ratio for the evidential weight of a DNA profile found at a crime scene. Estimation is naturally based on a population model, motivating the investigation of the Fisher-Wright model of evolution for haploid lineage DNA markers. An exponential family (a class of probability distributions that is well understood in probability theory such that inference is easily made by using existing software) called the 'discrete Laplace distribution' is described. We illustrate how well the discrete Laplace distribution approximates a more complicated distribution that arises by investigating the well-known population genetic Fisher-Wright model of evolution by a single-step mutation process. It was shown how the discrete Laplace distribution can be used to estimate haplotype frequencies for haploid lineage DNA markers (such as Y-chromosomal short tandem repeats), which in turn can be used to assess the evidential weight of a DNA profile found at a crime scene. This was done by making inference in a mixture of multivariate, marginally independent, discrete Laplace distributions using the EM algorithm to estimate the probabilities of membership of a set of unobserved subpopulations. The discrete Laplace distribution can be used to estimate haplotype frequencies with lower prediction error than other existing estimators. Furthermore, the calculations could be performed on a normal computer. This method was implemented in the freely available open source software R that is supported on Linux, MacOS and MS Windows. Copyright © 2013 Elsevier Ltd. All rights reserved.

  3. Linkage Study Revealed Complex Haplotypes in a Multifamily due to Different Mutations in CAPN3 Gene in an Iranian Ethnic Group.

    PubMed

    Mojbafan, Marzieh; Tonekaboni, Seyed Hassan; Abiri, Maryam; Kianfar, Soudeh; Sarhadi, Ameneh; Nilipour, Yalda; Tavakkoly-Bazzaz, Javad; Zeinali, Sirous

    2016-07-01

    Calpainopathy is an autosomal recessive form of limb girdle muscular dystrophies which is caused by mutation in CAPN3 gene. In the present study, co-segregation of this disorder was analyzed with four short tandem repeat markers linked to the CAPN3 gene. Three apparently unrelated Iranian families with same ethnicity were investigated. Haplotype analysis and sequencing of the CAPN3 gene were performed. DNA sample from one of the patients was simultaneously sent for next-generation sequencing. DNA sequencing identified two mutations. It was seen as a homozygous c.2105C>T in exon 19 in one family, a homozygous novel mutation c.380G>A in exon 3 in another family, and a compound heterozygote form of these two mutations in the third family. Next-generation sequencing also confirmed our results. It was expected that, due to the rare nature of limb girdle muscular dystrophies, affected individuals from the same ethnic group share similar mutations. Haplotype analysis showed two different homozygote patterns in two families, yet a compound heterozygote pattern in the third family as seen in the mutation analysis. This study shows that haplotype analysis would help in determining presence of different founders.

  4. A Dynamic Tandem Repeat in Monocotyledons Inferred from a Comparative Analysis of Chloroplast Genomes in Melanthiaceae.

    PubMed

    Do, Hoang Dang Khoa; Kim, Joo-Hwan

    2017-01-01

    Chloroplast genomes (cpDNA) are highly valuable resources for evolutionary studies of angiosperms, since they are highly conserved, are small in size, and play critical roles in plants. Slipped-strand mispairing (SSM) was assumed to be a mechanism for generating repeat units in cpDNA. However, research on the employment of different small repeated sequences through SSM events, which may induce the accumulation of distinct types of repeats within the same region in cpDNA, has not been documented. Here, we sequenced two chloroplast genomes from the endemic species Heloniopsis tubiflora (Korea) and Xerophyllum tenax (USA) to cover the gap between molecular data and explore "hot spots" for genomic events in Melanthiaceae. Comparative analysis of 23 complete cpDNA sequences revealed that there were different stages of deletion in the rps16 region across the Melanthiaceae. Based on the partial or complete loss of rps16 gene in cpDNA, we have firstly reported potential molecular markers for recognizing two sections ( Veratrum and Fuscoveratrum ) of Veratrum . Melathiaceae exhibits a significant change in the junction between large single copy and inverted repeat regions, ranging from trnH_GUG to a part of rps3 . Our results show an accumulation of tandem repeats in the rpl23-ycf2 regions of cpDNAs. Small conserved sequences exist and flank tandem repeats in further observation of this region across most of the examined taxa of Liliales. Therefore, we propose three scenarios in which different small repeated sequences were used during SSM events to generate newly distinct types of repeats. Occasionally, prior to the SSM process, point mutation event and double strand break repair occurred and induced the formation of initial repeat units which are indispensable in the SSM process. SSM may have likely occurred more frequently for short repeats than for long repeat sequences in tribe Parideae (Melanthiaceae, Liliales). Collectively, these findings add new evidence of dynamic

  5. High frequency of C9orf72 hexanucleotide repeat expansion in amyotrophic lateral sclerosis patients from two founder populations sharing the same risk haplotype.

    PubMed

    Goldstein, Orly; Gana-Weisz, Mali; Nefussy, Beatrice; Vainer, Batel; Nayshool, Omri; Bar-Shira, Anat; Traynor, Bryan J; Drory, Vivian E; Orr-Urtreger, Avi

    2018-04-01

    We characterized the C9orf72 hexanucleotide repeat expansion (RE) mutation in amyotrophic lateral sclerosis (ALS) patients of 2 distinct origins, Ashkenazi and North Africa Jews (AJ, NAJ), its frequency, and genotype-phenotype correlations. In AJ, 80% of familial ALS (fALS) and 11% of sporadic ALS carried the RE, a total of 12.9% of all AJ-ALS compared to 0.3% in AJ controls (odds ratio [OR] = 44.3, p < 0.0001). In NAJ, 10% of fALS and 9% of sporadic ALS carried the RE, a total of 9.1% of all NAJ-ALS compared to 1% in controls (OR = 9.9, p = 0.0006). We identified a risk haplotype shared among all ALS patients, although an association with age at disease onset, fALS, and dementia were observed only in AJ. Variations were identified downstream the repeats. The risk haplotype and these polymorphisms were at high frequencies in alleles with 8 repeats or more, suggesting sequence instability. The different genotype-phenotype correlations and OR, together with the large range in age at onset, suggest that other modifiers and risk factors may affect penetrance and phenotype in ALS. Copyright © 2017 Elsevier Inc. All rights reserved.

  6. Identification of exhumed remains of fire tragedy victims using conventional methods and autosomal/Y-chromosomal short tandem repeat DNA profiling.

    PubMed

    Calacal, Gayvelline C; Delfin, Frederick C; Tan, Michelle Music M; Roewer, Lutz; Magtanong, Danilo L; Lara, Myra C; Fortun, Raquel dR; De Ungria, Maria Corazon A

    2005-09-01

    In a fire tragedy in Manila in December 1998, one of the worst tragic incidents which resulted in the reported death of 23 children, identity could not be established initially resulting in the burial of still unidentified bodies. Underscoring the importance of identifying each of the human remains, the bodies were exhumed 3 months after the tragedy. We describe here our work, which was the first national case handled by local laboratories wherein conventional and molecular-based techniques were successfully applied in forensic identification. The study reports analysis of DNA obtained from skeletal remains exposed to conditions of burning, burial, and exhumation. DNA typing methods using autosomal and Y-chromosomal short tandem repeat (Y-STR) markers reinforced postmortem examinations using conventional identification techniques. The strategy resulted in the identification of 18 out of the 21 human remains analyzed, overcoming challenges encountered due to the absence of established procedures for the recovery of mass disaster remains. There was incomplete antemortem information to match the postmortem data obtained from the remains of 3 female child victims. Two victims were readily identified due to the availability of antemortem tissues. In the absence of this biologic material, parentage testing was performed using reference blood samples collected from parents and relatives. Data on patrilineal lineage based on common Y-STR haplotypes augmented autosomal DNA typing, particularly in deficiency cases.

  7. Characterization of the variable-number tandem repeats in vrrA from different Bacillus anthracis isolates

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jackson, P.J.; Walthers, E.A.; Richmond, K.L.

    1997-04-01

    PCR analysis of 198 Bacillus anthracis isolates revealed a variable region of DNA sequence differing in length among the isolates. Five Polymorphisms differed by the presence Of two to six copies of the 12-bp tandem repeat 5{prime}-CAATATCAACAA-3{prime}. This variable-number tandem repeat (VNTR) region is located within a larger sequence containing one complete open reading frame that encodes a putative 30-kDa protein. Length variation did not change the reading frame of the encoded protein and only changed the copy number of a 4-amino-acid sequence (QYQQ) from 2 to 6. The structure of the VNTR region suggests that these multiple repeats aremore » generated by recombination or polymerase slippage. Protein structures predicted from the reverse-translated DNA sequence suggest that any structural changes in the encoded protein are confined to the region encoded by the VNTR sequence. Copy number differences in the VNTR region were used to define five different B. anthracis alleles. Characterization of 198 isolates revealed allele frequencies of 6.1, 17.7, 59.6, 5.6, and 11.1% sequentially from shorter to longer alleles. The high degree of polymorphism in the VNTR region provides a criterion for assigning isolates to five allelic categories. There is a correlation between categories and geographic distribution. Such molecular markers can be used to monitor the epidemiology of anthrax outbreaks in domestic and native herbivore populations. 22 refs., 4 figs., 3 tabs.« less

  8. The proliferation marker pKi-67 becomes masked to MIB-1 staining after expression of its tandem repeats.

    PubMed

    Schmidt, Mirko H H; Broll, Rainer; Bruch, Hans-Peter; Duchrow, Michael

    2002-11-01

    The Ki-67 antigen, pKi-67, is one of the most commonly used markers of proliferating cells. The protein can only be detected in dividing cells (G(1)-, S-, G(2)-, and M-phase) but not in quiescent cells (G(0)). The standard antibody to detect pKi-67 is MIB-1, which detects the so-called 'Ki-67 motif' FKELF in 9 of the protein's 16 tandem repeats. To investigate the function of these repeats we expressed three of them in an inducible gene expression system in HeLa cells. Surprisingly, addition of a nuclear localization sequence led to a complete absence of signal in the nuclei of MIB-1-stained cells. At the same time antibodies directed against different epitopes of pKi-67 did not fail to detect the protein. We conclude that the overexpression of the 'Ki-67 motif', which is present in the repeats, can lead to inability of MIB-1 to detect its antigen as demonstrated in adenocarcinoma tissue samples. Thereafter, in order to prevent the underestimation of Ki-67 proliferation indices in MIB-1-labeled preparations, additional antibodies (for example, MIB-21) should be used. Additionally, we could show in a mammalian two-hybrid assay that recombinant pKi-67 repeats are capable of self-associating with endogenous pKi-67. Speculating that the tandem repeats are intimately involved in its protein-protein interactions, this offers new insights in how access to these repeats is regulated by pKi-67 itself.

  9. Development of an Italian RM Y-STR haplotype database: Results of the 2013 GEFI collaborative exercise.

    PubMed

    Robino, C; Ralf, A; Pasino, S; De Marchi, M R; Ballantyne, K N; Barbaro, A; Bini, C; Carnevali, E; Casarino, L; Di Gaetano, C; Fabbri, M; Ferri, G; Giardina, E; Gonzalez, A; Matullo, G; Nutini, A L; Onofri, V; Piccinini, A; Piglionica, M; Ponzano, E; Previderè, C; Resta, N; Scarnicci, F; Seidita, G; Sorçaburu-Cigliero, S; Turrina, S; Verzeletti, A; Kayser, M

    2015-03-01

    Recently introduced rapidly mutating Y-chromosomal short tandem repeat (RM Y-STR) loci, displaying a multiple-fold higher mutation rate relative to any other Y-STRs, including those conventionally used in forensic casework, have been demonstrated to improve the resolution of male lineage differentiation and to allow male relative separation usually impossible with standard Y-STRs. However, large and geographically-detailed frequency haplotype databases are required to estimate the statistical weight of RM Y-STR haplotype matches if observed in forensic casework. With this in mind, the Italian Working Group (GEFI) of the International Society for Forensic Genetics launched a collaborative exercise aimed at generating an Italian quality controlled forensic RM Y-STR haplotype database. Overall 1509 male individuals from 13 regional populations covering northern, central and southern areas of the Italian peninsula plus Sicily were collected, including both "rural" and "urban" samples classified according to population density in the sampling area. A subset of individuals was additionally genotyped for Y-STR loci included in the Yfiler and PowerPlex Y23 (PPY23) systems (75% and 62%, respectively), allowing the comparison of RM and conventional Y-STRs. Considering the whole set of 13 RM Y-STRs, 1501 unique haplotypes were observed among the 1509 sampled Italian men with a haplotype diversity of 0.999996, largely superior to Yfiler and PPY23 with 0.999914 and 0.999950, respectively. AMOVA indicated that 99.996% of the haplotype variation was within populations, confirming that genetic-geographic structure is almost undetected by RM Y-STRs. Haplotype sharing among regional Italian populations was not observed at all with the complete set of 13 RM Y-STRs. Haplotype sharing within Italian populations was very rare (0.27% non-unique haplotypes), and lower in urban (0.22%) than rural (0.29%) areas. Additionally, 422 father-son pairs were investigated, and 20.1% of them could

  10. Effect of Repeat Copy Number on Variable-Number Tandem Repeat Mutations in Escherichia coli O157:H7

    PubMed Central

    Vogler, Amy J.; Keys, Christine; Nemoto, Yoshimi; Colman, Rebecca E.; Jay, Zack; Keim, Paul

    2006-01-01

    Variable-number tandem repeat (VNTR) loci have shown a remarkable ability to discriminate among isolates of the recently emerged clonal pathogen Escherichia coli O157:H7, making them a very useful molecular epidemiological tool. However, little is known about the rates at which these sequences mutate, the factors that affect mutation rates, or the mechanisms by which mutations occur at these loci. Here, we measure mutation rates for 28 VNTR loci and investigate the effects of repeat copy number and mismatch repair on mutation rate using in vitro-generated populations for 10 E. coli O157:H7 strains. We find single-locus rates as high as 7.0 × 10−4 mutations/generation and a combined 28-locus rate of 6.4 × 10−4 mutations/generation. We observed single- and multirepeat mutations that were consistent with a slipped-strand mispairing mutation model, as well as a smaller number of large repeat copy number mutations that were consistent with recombination-mediated events. Repeat copy number within an array was strongly correlated with mutation rate both at the most mutable locus, O157-10 (r2 = 0.565, P = 0.0196), and across all mutating loci. The combined locus model was significant whether locus O157-10 was included (r2 = 0.833, P < 0.0001) or excluded (r2 = 0.452, P < 0.0001) from the analysis. Deficient mismatch repair did not affect mutation rate at any of the 28 VNTRs with repeat unit sizes of >5 bp, although a poly(G) homomeric tract was destabilized in the mutS strain. Finally, we describe a general model for VNTR mutations that encompasses insertions and deletions, single- and multiple-repeat mutations, and their relative frequencies based upon our empirical mutation rate data. PMID:16740932

  11. Effect of repeat copy number on variable-number tandem repeat mutations in Escherichia coli O157:H7.

    PubMed

    Vogler, Amy J; Keys, Christine; Nemoto, Yoshimi; Colman, Rebecca E; Jay, Zack; Keim, Paul

    2006-06-01

    Variable-number tandem repeat (VNTR) loci have shown a remarkable ability to discriminate among isolates of the recently emerged clonal pathogen Escherichia coli O157:H7, making them a very useful molecular epidemiological tool. However, little is known about the rates at which these sequences mutate, the factors that affect mutation rates, or the mechanisms by which mutations occur at these loci. Here, we measure mutation rates for 28 VNTR loci and investigate the effects of repeat copy number and mismatch repair on mutation rate using in vitro-generated populations for 10 E. coli O157:H7 strains. We find single-locus rates as high as 7.0 x 10(-4) mutations/generation and a combined 28-locus rate of 6.4 x 10(-4) mutations/generation. We observed single- and multirepeat mutations that were consistent with a slipped-strand mispairing mutation model, as well as a smaller number of large repeat copy number mutations that were consistent with recombination-mediated events. Repeat copy number within an array was strongly correlated with mutation rate both at the most mutable locus, O157-10 (r2= 0.565, P = 0.0196), and across all mutating loci. The combined locus model was significant whether locus O157-10 was included (r2= 0.833, P < 0.0001) or excluded (r2= 0.452, P < 0.0001) from the analysis. Deficient mismatch repair did not affect mutation rate at any of the 28 VNTRs with repeat unit sizes of >5 bp, although a poly(G) homomeric tract was destabilized in the mutS strain. Finally, we describe a general model for VNTR mutations that encompasses insertions and deletions, single- and multiple-repeat mutations, and their relative frequencies based upon our empirical mutation rate data.

  12. APE1 incision activity at abasic sites in tandem repeat sequences.

    PubMed

    Li, Mengxia; Völker, Jens; Breslauer, Kenneth J; Wilson, David M

    2014-05-29

    Repetitive DNA sequences, such as those present in microsatellites and minisatellites, telomeres, and trinucleotide repeats (linked to fragile X syndrome, Huntington disease, etc.), account for nearly 30% of the human genome. These domains exhibit enhanced susceptibility to oxidative attack to yield base modifications, strand breaks, and abasic sites; have a propensity to adopt non-canonical DNA forms modulated by the positions of the lesions; and, when not properly processed, can contribute to genome instability that underlies aging and disease development. Knowledge on the repair efficiencies of DNA damage within such repetitive sequences is therefore crucial for understanding the impact of such domains on genomic integrity. In the present study, using strategically designed oligonucleotide substrates, we determined the ability of human apurinic/apyrimidinic endonuclease 1 (APE1) to cleave at apurinic/apyrimidinic (AP) sites in a collection of tandem DNA repeat landscapes involving telomeric and CAG/CTG repeat sequences. Our studies reveal the differential influence of domain sequence, conformation, and AP site location/relative positioning on the efficiency of APE1 binding and strand incision. Intriguingly, our data demonstrate that APE1 endonuclease efficiency correlates with the thermodynamic stability of the DNA substrate. We discuss how these results have both predictive and mechanistic consequences for understanding the success and failure of repair protein activity associated with such oxidatively sensitive, conformationally plastic/dynamic repetitive DNA domains. Published by Elsevier Ltd.

  13. TRStalker: an efficient heuristic for finding fuzzy tandem repeats.

    PubMed

    Pellegrini, Marco; Renda, M Elena; Vecchio, Alessio

    2010-06-15

    Genomes in higher eukaryotic organisms contain a substantial amount of repeated sequences. Tandem Repeats (TRs) constitute a large class of repetitive sequences that are originated via phenomena such as replication slippage and are characterized by close spatial contiguity. They play an important role in several molecular regulatory mechanisms, and also in several diseases (e.g. in the group of trinucleotide repeat disorders). While for TRs with a low or medium level of divergence the current methods are rather effective, the problem of detecting TRs with higher divergence (fuzzy TRs) is still open. The detection of fuzzy TRs is propaedeutic to enriching our view of their role in regulatory mechanisms and diseases. Fuzzy TRs are also important as tools to shed light on the evolutionary history of the genome, where higher divergence correlates with more remote duplication events. We have developed an algorithm (christened TRStalker) with the aim of detecting efficiently TRs that are hard to detect because of their inherent fuzziness, due to high levels of base substitutions, insertions and deletions. To attain this goal, we developed heuristics to solve a Steiner version of the problem for which the fuzziness is measured with respect to a motif string not necessarily present in the input string. This problem is akin to the 'generalized median string' that is known to be an NP-hard problem. Experiments with both synthetic and biological sequences demonstrate that our method performs better than current state of the art for fuzzy TRs and that the fuzzy TRs of the type we detect are indeed present in important biological sequences. TRStalker will be integrated in the web-based TRs Discovery Service (TReaDS) at bioalgo.iit.cnr.it. Supplementary data are available at Bioinformatics online.

  14. Microevolution of Pandemic Vibrio parahaemolyticus Assessed by the Number of Repeat Units in Short Sequence Tandem Repeat Regions

    PubMed Central

    García, Katherine; Gavilán, Ronnie G.; Höfle, Manfred G.; Martínez-Urtaza, Jaime; Espejo, Romilio T.

    2012-01-01

    The emergence of the pandemic strain Vibrio parahaemolyticus O3:K6 in 1996 caused a large increase of diarrhea outbreaks related to seafood consumption in Southeast Asia, and later worldwide. Isolates of this strain constitutes a clonal complex, and their effectual differentiation is possible by comparison of their variable number tandem repeats (VNTRs). The differentiation of the isolates by the differences in VNTRs will allow inferring the population dynamics and microevolution of this strain but this requires knowing the rate and mechanism of VNTRs' variation. Our study of mutants obtained after serial cultivation of clones showed that mutation rates of the six VNTRs examined are on the order of 10−4 mutant per generation and that difference increases by stepwise addition of single mutations. The single stepwise mutation (SSM) was deduced because mutants with 1, 2, 3, or more repeat unit deletions or insertions follow a geometric distribution. Plausible phylogenetic trees are obtained when, according to SSM, the genetic distance between clusters with different number of repeats is assessed by the absolute differences in repeats. Using this approach, mutants originated from different isolates of pandemic V. parahaemolyticus after serial cultivation are clustered with their parental isolates. Additionally, isolates of pandemic V. parahaemolyticus from Southeast Asia, Tokyo, and northern and southern Chile are clustered according their geographical origin. The deepest split in these four populations is observed between the Tokyo and southern Chile populations. We conclude that proper phylogenetic relations and successful tracing of pandemic V. parahaemolyticus requires measuring the differences between isolates by the absolute number of repeats in the VNTRs considered. PMID:22292049

  15. Production of monoclonal antibody, PR81, recognizing the tandem repeat region of MUC1 mucin.

    PubMed

    Paknejad, M; Rasaee, M J; Tehrani, F Karami; Kashanian, S; Mohagheghi, M A; Omidfar, K; Bazl, M Rajabi

    2003-06-01

    A monoclonal antibody (MAb) was generated by immunizing BALB/c mice with homogenized breast cancerous tissues. This antibody (PR81) was found to be of IgG(1) class and subclass, containing kappa light chain. PR81 reacted with either the membrane extracts of several breast cancerous tissues or the cell surface of some MUC1 positive cell lines (MCF-7, BT-20 and T-47D) tested by enzyme immunoassay and for MCF-7 by immunofluorescence method. PR81 also reacted with two synthetic 27 and 16-amino acid peptides, TSA-P1-24 and A-P1-15, respectively, which included the core tandem repeat sequence of MUC1. However, this antibody did not react with a synthetic 14 amino acid peptide that has no similarity with tandem repeat found in MUC1. The generated antibody had good and similar affinities (2.19 x 10(8) M(-1)) toward TSA-P1-24 and A-P1-15, which are mainly shared in the hydrophilic sequence of PDTRPAP. Through Western blot analysis of homogenized breast tissues, PR81 recognized only a major band of 250 kDa. This band is stronger in malignant tissue than benign and normal tissues.

  16. DNA Fingerprint Analysis of Three Short Tandem Repeat (STR) Loci for Biochemistry and Forensic Science Laboratory Courses

    ERIC Educational Resources Information Center

    McNamara-Schroeder, Kathleen; Olonan, Cheryl; Chu, Simon; Montoya, Maria C.; Alviri, Mahta; Ginty, Shannon; Love, John J.

    2006-01-01

    We have devised and implemented a DNA fingerprinting module for an upper division undergraduate laboratory based on the amplification and analysis of three of the 13 short tandem repeat loci that are required by the Federal Bureau of Investigation Combined DNA Index System (FBI CODIS) data base. Students first collect human epithelial (cheek)…

  17. Visualization of tandem repeat mutagenesis in Bacillus subtilis.

    PubMed

    Dormeyer, Miriam; Lentes, Sabine; Ballin, Patrick; Wilkens, Markus; Klumpp, Stefan; Kohlheyer, Dietrich; Stannek, Lorena; Grünberger, Alexander; Commichau, Fabian M

    2018-03-01

    Mutations are crucial for the emergence and evolution of proteins with novel functions, and thus for the diversity of life. Tandem repeats (TRs) are mutational hot spots that are present in the genomes of all organisms. Understanding the molecular mechanism underlying TR mutagenesis at the level of single cells requires the development of mutation reporter systems. Here, we present a mutation reporter system that is suitable to visualize mutagenesis of TRs occurring in single cells of the Gram-positive model bacterium Bacillus subtilis using microfluidic single-cell cultivation. The system allows measuring the elimination of TR units due to growth rate recovery. The cultivation of bacteria carrying the mutation reporter system in microfluidic chambers allowed us for the first time to visualize the emergence of a specific mutation at the level of single cells. The application of the mutation reporter system in combination with microfluidics might be helpful to elucidate the molecular mechanism underlying TR (in)stability in bacteria. Moreover, the mutation reporter system might be useful to assess whether mutations occur in response to nutrient starvation. Copyright © 2018 Elsevier B.V. All rights reserved.

  18. The production and characterization of novel heavy-chain antibodies against the tandem repeat region of MUC1 mucin.

    PubMed

    Rahbarizadeh, Fatemeh; Rasaee, Mohammad J; Forouzandeh, Mehdi; Allameh, Abdolamir; Sarrami, Ramin; Nasiry, Habib; Sadeghizadeh, Majid

    2005-01-01

    Camelidae are known to produce immunoglobulins (Igs) devoid of light chains and constant heavy-chain domains (CH1). Antigen-specific fragments of these heavy-chain IgGs (VHH) are of great interest in biotechnology applications. This paper describes the first example of successfully raised heavy-chain antibodies in Camelus dromedarius (single-humped camel) and Camelus bactrianus (two-humped camel) against a MUC1 related peptide that is found to be an important epitope expressed in cancerous tissue. Camels were immunized against a synthetic peptide corresponding to the tandem repeat region of MUC1 mucin and cancerous tissue preparation obtained from patients suffering from breast carcinoma. Three IgG subclasses with different binding properties to protein A and G were purified by affinity chromatography. Both conventional and heavy-chain IgG antibodies were produced in response to MUC1-related peptide. The elicited antibodies could react specifically with the tandem repeat region of MUC1 mucin in an enzyme linked immunosorbant assay (ELISA). Anti-peptide antibodies were purified after passing antiserum over two affinity chromatography columns. Using ELISA, immunocytochemistry and Western blotting, the interaction of purified antibodies with different antigens was evaluated. The antibodies were observed to be selectively bound to antigens namely: MUC1 peptide (tandem repeat region), human milk fat globule membrane (HMFG), deglycosylated human milk fat globule membrane (D-HMFG), homogenized cancerous breast tissue and a native MUC1 purified from ascitic fluid. Ka values of specific polyclonal antipeptide antibodies were estimated in C. dromedarius and C. bactrianus, as 7 x 10(10) M(-1) and 1.4 x 10(10) M(-1) respectively.

  19. Ehrlichia chaffeensis Tandem Repeat Proteins and Ank200 are Type 1 Secretion System Substrates Related to the Repeats-in-Toxin Exoprotein Family

    PubMed Central

    Wakeel, Abdul; den Dulk-Ras, Amke; Hooykaas, Paul J. J.; McBride, Jere W.

    2011-01-01

    Ehrlichia chaffeensis has type 1 and 4 secretion systems (T1SS and T4SS), but the substrates have not been identified. Potential substrates include secreted tandem repeat protein (TRP) 47, TRP120, and TRP32, and the ankyrin repeat protein, Ank200, that are involved in molecular host–pathogen interactions including DNA binding and a network of protein–protein interactions with host targets associated with signaling, transcriptional regulation, vesicle trafficking, and apoptosis. In this study we report that E. chaffeensis TRP47, TRP32, TRP120, and Ank200 were not secreted in the Agrobacterium tumefaciens Cre recombinase reporter assay routinely used to identify T4SS substrates. In contrast, all TRPs and the Ank200 proteins were secreted by the Escherichia coli complemented with the hemolysin secretion system (T1SS), and secretion was reduced in a T1SS mutant (ΔTolC), demonstrating that these proteins are T1SS substrates. Moreover, T1SS secretion signals were identified in the C-terminal domains of the TRPs and Ank200, and a detailed bioinformatic analysis of E. chaffeensis TRPs and Ank200 revealed features consistent with those described in the repeats-in-toxins (RTX) family of exoproteins, including glycine- and aspartate-rich tandem repeats, homology with ATP-transporters, a non-cleavable C-terminal T1SS signal, acidic pIs, and functions consistent with other T1SS substrates. Using a heterologous E. coli T1SS, this investigation has identified the first Ehrlichia T1SS substrates supporting the conclusion that the T1SS and corresponding substrates are involved in molecular host–pathogen interactions that contribute to Ehrlichia pathobiology. Further investigation of the relationship between Ehrlichia TRPs, Ank200, and the RTX exoprotein family may lead to a greater understanding of the importance of T1SS substrates and specific functions of T1SS in the pathobiology of obligately intracellular bacteria. PMID:22919588

  20. Exceptionally long 5' UTR short tandem repeats specifically linked to primates.

    PubMed

    Namdar-Aligoodarzi, P; Mohammadparast, S; Zaker-Kandjani, B; Talebi Kakroodi, S; Jafari Vesiehsari, M; Ohadi, M

    2015-09-10

    We have previously reported genome-scale short tandem repeats (STRs) in the core promoter interval (i.e. -120 to +1 to the transcription start site) of protein-coding genes that have evolved identically in primates vs. non-primates. Those STRs may function as evolutionary switch codes for primate speciation. In the current study, we used the Ensembl database to analyze the 5' untranslated region (5' UTR) between +1 and +60 of the transcription start site of the entire human protein-coding genes annotated in the GeneCards database, in order to identify "exceptionally long" STRs (≥5-repeats), which may be of selective/adaptive advantage. The importance of this critical interval is its function as core promoter, and its effect on transcription and translation. In order to minimize ascertainment bias, we analyzed the evolutionary status of the human 5' UTR STRs of ≥5-repeats in several species encompassing six major orders and superorders across mammals, including primates, rodents, Scandentia, Laurasiatheria, Afrotheria, and Xenarthra. We introduce primate-specific STRs, and STRs which have expanded from mouse to primates. Identical co-occurrence of the identified STRs of rare average frequency between 0.006 and 0.0001 in primates supports a role for those motifs in processes that diverged primates from other mammals, such as neuronal differentiation (e.g. APOD and FGF4), and craniofacial development (e.g. FILIP1L). A number of the identified STRs of ≥5-repeats may be human-specific (e.g. ZMYM3 and DAZAP1). Future work is warranted to examine the importance of the listed genes in primate/human evolution, development, and disease. Copyright © 2015 Elsevier B.V. All rights reserved.

  1. Isolation of human simple repeat loci by hybridization selection.

    PubMed

    Armour, J A; Neumann, R; Gobert, S; Jeffreys, A J

    1994-04-01

    We have isolated short tandem repeat arrays from the human genome, using a rapid method involving filter hybridization to enrich for tri- or tetranucleotide tandem repeats. About 30% of clones from the enriched library cross-hybridize with probes containing trimeric or tetrameric tandem arrays, facilitating the rapid isolation of large numbers of clones. In an initial analysis of 54 clones, 46 different tandem arrays were identified. Analysis of these tandem repeat loci by PCR showed that 24 were polymorphic in length; substantially higher levels of polymorphism were displayed by the tetrameric repeat loci isolated than by the trimeric repeats. Primary mapping of these loci by linkage analysis showed that they derive from 17 chromosomes, including the X chromosome. We anticipate the use of this strategy for the efficient isolation of tandem repeats from other sources of genomic DNA, including DNA from flow-sorted chromosomes, and from other species.

  2. Tandem repeats analysis for the high resolution phylogenetic analysis of Yersinia pestis

    PubMed Central

    Pourcel, C; André-Mazeaud, F; Neubauer, H; Ramisse, F; Vergnaud, G

    2004-01-01

    Background Yersinia pestis, the agent of plague, is a young and highly monomorphic species. Three biovars, each one thought to be associated with the last three Y. pestis pandemics, have been defined based on biochemical assays. More recently, DNA based assays, including DNA sequencing, IS typing, DNA arrays, have significantly improved current knowledge on the origin and phylogenetic evolution of Y. pestis. However, these methods suffer either from a lack of resolution or from the difficulty to compare data. Variable number of tandem repeats (VNTRs) provides valuable polymorphic markers for genotyping and performing phylogenetic analyses in a growing number of pathogens and have given promising results for Y. pestis as well. Results In this study we have genotyped 180 Y. pestis isolates by multiple locus VNTR analysis (MLVA) using 25 markers. Sixty-one different genotypes were observed. The three biovars were distributed into three main branches, with some exceptions. In particular, the Medievalis phenotype is clearly heterogeneous, resulting from different mutation events in the napA gene. Antiqua strains from Asia appear to hold a central position compared to Antiqua strains from Africa. A subset of 7 markers is proposed for the quick comparison of a new strain with the collection typed here. This can be easily achieved using a Web-based facility, specifically set-up for running such identifications. Conclusion Tandem-repeat typing may prove to be a powerful complement to the existing phylogenetic tools for Y. pestis. Typing can be achieved quickly at a low cost in terms of consumables, technical expertise and equipment. The resulting data can be easily compared between different laboratories. The number and selection of markers will eventually depend upon the type and aim of investigations. PMID:15186506

  3. Interpreting short tandem repeat variations in humans using mutational constraint

    PubMed Central

    Gymrek, Melissa; Willems, Thomas; Reich, David; Erlich, Yaniv

    2017-01-01

    Identifying regions of the genome that are depleted of mutations can reveal potentially deleterious variants. Short tandem repeats (STRs), also known as microsatellites, are among the largest contributors of de novo mutations in humans. However, per-locus studies of STR mutations have been limited to highly ascertained panels of several dozen loci. Here, we harnessed bioinformatics tools and a novel analytical framework to estimate mutation parameters for each STR in the human genome by correlating STR genotypes with local sequence heterozygosity. We applied our method to obtain robust estimates of the impact of local sequence features on mutation parameters and used this to create a framework for measuring constraint at STRs by comparing observed vs. expected mutation rates. Constraint scores identified known pathogenic variants with early onset effects. Our metric will provide a valuable tool for prioritizing pathogenic STRs in medical genetics studies. PMID:28892063

  4. Functional Haplotypes of Fc gamma (Fcγ) receptor (FcγRIIA and FcγRIIIB) predict risk to repeated episodes of severe malarial anemia and mortality in Kenyan children

    PubMed Central

    Ouma, Collins; Davenport, Gregory C.; Garcia, Steven; Kempaiah, Prakasha; Chaudhary, Ateefa; Were, Tom; Anyona, Samuel B.; Raballah, Evans; Konah, Stephen N.; Hittner, James B.; Vulule, John M.; Ong’echa, John M.; Perkins, Douglas J.

    2011-01-01

    Development of protective immunity against Plasmodium falciparum is partially mediated through binding of malaria-specific IgG to Fc gamma (γ) receptors. Variation in human FcγRIIA-H/R-131 and FcγRIIIB-NA1/NA2 affect differential binding of IgG sub-classes. Since variability in FcγR may play an important role in severe malarial anemia (SMA) pathogenesis by mediating phagocytosis of red blood cells and triggering cytokine production, the relationship between FcγRIIA-H/R131 and FcγRIIIB-NA1/NA2 haplotypes and susceptibility to SMA (Hb<6.0g/dL) was investigated in Kenyan children (n=528) with acute malaria residing in a holoendemic P. falciparum transmission region. In addition, the association between carriage of the haplotypes and repeated episodes of SMA and all-cause mortality were investigated over a three-year follow-up period. Since variability in FcγR can alter interferon (IFN)-γ production, a mediator of innate and adaptive immune responses, functional associations between the haplotypes and IFN-γ were also explored. During acute malaria, children with SMA had elevated peripheral IFN-γ levels (P=0.006). Although multivariate logistic regression analyses (controlling for covariates) revealed no associations between the FcγR haplotypes and susceptibility to SMA during acute infection, the FcγRIIA-131H/FcγRIIIB-NA1 haplotype was associated with decreased peripheral IFN-γ (P=0.046). Longitudinal analyses showed that carriage of the FcγRIIA-131H/FcγRIIIB-NA1 haplotype was associated with reduced risk of SMA (RR; 0.65, 95%CI, 0.46-0.90; P=0.012) and all-cause mortality (P=0.002). In contrast, carriers of the FcγRIIA-131H/FcγRIIIB-NA2 haplotype had increased susceptibility to SMA (RR; 1.47, 95%CI, 1.06-2.04; P=0.020). Results here demonstrate that variation in the FcγR gene alters susceptibility to repeated episodes of SMA and mortality, as well as functional changes in IFN-γ production. PMID:21818580

  5. The evolution and function of protein tandem repeats in plants.

    PubMed

    Schaper, Elke; Anisimova, Maria

    2015-04-01

    Sequence tandem repeats (TRs) are abundant in proteomes across all domains of life. For plants, little is known about their distribution or contribution to protein function. We exhaustively annotated TRs and studied the evolution of TR unit variations for all Ensembl plants. Using phylogenetic patterns of TR units, we detected conserved TRs with unit number and order preserved during evolution, and those TRs that have diverged via recent TR unit gains/losses. We correlated the mode of evolution of TRs to protein function. TR number was strongly correlated with proteome size, with about one-half of all TRs recognized as common protein domains. The majority of TRs have been highly conserved over long evolutionary distances, some since the separation of red algae and green plants c. 1.6 billion yr ago. Conversely, recurrent recent TR unit mutations were rare. Our results suggest that the first TRs by far predate the first plants, and that TR appearance is an ongoing process with similar rates across the plant kingdom. Interestingly, the few detected highly mutable TRs might provide a source of variation for rapid adaptation. In particular, such TRs are enriched in leucine-rich repeats (LRRs) commonly found in R genes, where TR unit gain/loss may facilitate resistance to emerging pathogens. © 2014 The Authors. New Phytologist © 2014 New Phytologist Trust.

  6. Analysis of tandem repeat units of the promoter of capsanthin/capsorubin synthase (Ccs) gene in pepper fruit.

    PubMed

    Tian, Shi-Lin; Li, Zheng; Li, Li; Shah, S N M; Gong, Zhen-Hui

    2017-07-01

    Capsanthin/capsorubin synthase ( Ccs ) gene is a key gene that regulates the synthesis of capsanthin and the development of red coloration in pepper fruits. There are three tandem repeat units in the promoter region of Ccs , but the potential effects of the number of repetitive units on the transcriptional regulation of Ccs has been unclear. In the present study, expression vectors carrying different numbers of repeat units of the Ccs promoter were constructed, and the transient expression of the β-glucuronidase ( GUS ) gene was used to detect differences in expression levels associated with the promoter fragments. These repeat fragments and the plant expression vector PBI121 containing the 35s CaMV promoter were ligated to form recombinant vectors that were transfected into Agrobacterium tumefaciens GV3101. A fluorescence spectrophotometer was used to analyze the expression associated with the various repeat units. It was concluded that the constructs containing at least one repeat were associated with GUS expression, though they did not differ from one another. This repeating unit likely plays a role in transcription and regulation of Ccs expression.

  7. Characterization of toxin-producing cyanobacteria by using an oligonucleotide probe containing a tandemly repeated heptamer.

    PubMed Central

    Rouhiainen, L; Sivonen, K; Buikema, W J; Haselkorn, R

    1995-01-01

    Cyanobacteria produce toxins that kill animals. The two main classes of cyanobacterial toxins are cyclic peptides that cause liver damage and alkaloids that block nerve transmission. Many toxin-producing strains from Finnish lakes were brought into axenic culture, and their toxins were characterized. Restriction fragment length polymorphism analysis, probing with a short tandemly repeated DNA sequence found at many locations in the chromosome of Anabaena sp. strain PCC 7120, distinguishes hepatotoxic Anabaena isolates from neurotoxin-producing strains and from Nostoc spp. PMID:7592362

  8. Population-scale whole genome sequencing identifies 271 highly polymorphic short tandem repeats from Japanese population.

    PubMed

    Hirata, Satoshi; Kojima, Kaname; Misawa, Kazuharu; Gervais, Olivier; Kawai, Yosuke; Nagasaki, Masao

    2018-05-01

    Forensic DNA typing is widely used to identify missing persons and plays a central role in forensic profiling. DNA typing usually uses capillary electrophoresis fragment analysis of PCR amplification products to detect the length of short tandem repeat (STR) markers. Here, we analyzed whole genome data from 1,070 Japanese individuals generated using massively parallel short-read sequencing of 162 paired-end bases. We have analyzed 843,473 STR loci with two to six basepair repeat units and cataloged highly polymorphic STR loci in the Japanese population. To evaluate the performance of the cataloged STR loci, we compared 23 STR loci, widely used in forensic DNA typing, with capillary electrophoresis based STR genotyping results in the Japanese population. Seventeen loci had high correlations and high call rates. The other six loci had low call rates or low correlations due to either the limitations of short-read sequencing technology, the bioinformatics tool used, or the complexity of repeat patterns. With these analyses, we have also purified the suitable 218 STR loci with four basepair repeat units and 53 loci with five basepair repeat units both for short read sequencing and PCR based technologies, which would be candidates to the actual forensic DNA typing in Japanese population.

  9. Short tandem repeat analysis in Japanese population.

    PubMed

    Hashiyada, M

    2000-01-01

    Short tandem repeats (STRs), known as microsatellites, are one of the most informative genetic markers for characterizing biological materials. Because of the relatively small size of STR alleles (generally 100-350 nucleotides), amplification by polymerase chain reaction (PCR) is relatively easy, affording a high sensitivity of detection. In addition, STR loci can be amplified simultaneously in a multiplex PCR. Thus, substantial information can be obtained in a single analysis with the benefits of using less template DNA, reducing labor, and reducing the contamination. We investigated 14 STR loci in a Japanese population living in Sendai by three multiplex PCR kits, GenePrint PowerPlex 1.1 and 2.2. Fluorescent STR System (Promega, Madison, WI, USA) and AmpF/STR Profiler (Perkin-Elmer, Norwalk, CT, USA). Genomic DNA was extracted using sodium dodecyl sulfate (SDS) proteinase K or Chelex 100 treatment followed by the phenol/chloroform extraction. PCR was performed according to the manufacturer's protocols. Electrophoresis was carried out on an ABI 377 sequencer and the alleles were determined by GeneScan 2.0.2 software (Perkin-Elmer). In 14 STRs loci, statistical parameters indicated a relatively high rate, and no significant deviation from Hardy-Weinberg equilibrium was detected. We apply this STR system to paternity testing and forensic casework, e.g., personal identification in rape cases. This system is an effective tool in the forensic sciences to obtain information on individual identification.

  10. Ten tandem repeats of {beta}-hCG 109-118 enhance immunogenicity and anti-tumor effects of {beta}-hCG C-terminal peptide carried by mycobacterial heat-shock protein HSP65

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhang Yankai; Yan Rong; He Yi

    2006-07-14

    The {beta}-subunit of human chorionic gonadotropin ({beta}-hCG) is secreted by many kinds of tumors and it has been used as an ideal target antigen to develop vaccines against tumors. In view of the low immunogenicity of this self-peptide,we designed a method based on isocaudamer technique to repeat tandemly the 10-residue sequence X of {beta}-hCG (109-118), then 10 tandemly repeated copies of the 10-residue sequence combined with {beta}-hCG C-terminal 37 peptides were fused to mycobacterial heat-shock protein 65 to construct a fusion protein HSP65-X10-{beta}hCGCTP37 as an immunogen. In this study, we examined the effect of the tandem repeats of this 10-residuemore » sequence in eliciting an immune by comparing the immunogenicity and anti-tumor effects of the two immunogens, HSP65-X10-{beta}hCGCTP37 and HSP65-{beta}hCGCTP37 (without the 10 tandem repeats). Immunization of mice with the fusion protein HSP65-X10-{beta}hCGCTP37 elicited much higher levels of specific anti-{beta}-hCG antibodies and more effectively inhibited the growth of Lewis lung carcinoma (LLC) in vivo than with HSP65-{beta}hCGCTP37, which should suggest that HSP65-X10-{beta}hCGCTP37 may be an effective protein vaccine for the treatment of {beta}-hCG-dependent tumors and multiple tandem repeats of a certain epitope are an efficient method to overcome the low immunogenicity of self-peptide antigens.« less

  11. Multilocus Variable-Number Tandem Repeat Typing of Mycobacterium ulcerans

    PubMed Central

    Ablordey, Anthony; Swings, Jean; Hubans, Christine; Chemlal, Karim; Locht, Camille; Portaels, Françoise; Supply, Philip

    2005-01-01

    The apparent genetic homogeneity of Mycobacterium ulcerans contributes to the poorly understood epidemiology of M. ulcerans infection. Here, we report the identification of variable number tandem repeat (VNTR) sequences as novel polymorphic elements in the genome of this species. A total of 19 potential VNTR loci identified in the closely related M. marinum genome sequence were screened in a collection of 23 M. ulcerans isolates, one Mycobacterium species referred to here as an intermediate species, and five M. marinum strains. Nine of the 19 loci were polymorphic in the three species (including the intermediate species) and revealed eight M. ulcerans and five M. marinum genotypes. The results from the VNTR analysis corroborated the genetic relationships of M. ulcerans isolates from various geographical origins, as defined by independent molecular markers. Although these results further highlight the extremely high clonal homogeneity within certain geographic regions, we report for the first time the discrimination of the two South American strains from Surinam and French Guyana. These findings support the potential of a VNTR-based genotyping method for strain discrimination within M. ulcerans and M. marinum. PMID:15814964

  12. Detecting local haplotype sharing and haplotype association

    USDA-ARS?s Scientific Manuscript database

    A novel haplotype association method is presented, and its power is demonstrated. Relying on a statistical model for linkage disequilibrium (LD), the method first infers ancestral haplotypes and their loadings at each marker for each individual. The loadings are then used to quantify local haplotype...

  13. Rapid carrier screening using short tandem repeats in the phenylalanine hydroxylase gene.

    PubMed

    Shawky, R M; el-Aleem, K A; Rifaat, M M; el-Naggar, R L; Marzouk, G M

    2002-01-01

    Phenylketonuria (PKU) is an autosomal recessive genetic disorder caused by defects in the phenylalanine hydroxylase (PAH) system. Our work aimed to screen the PAH locus for the presence of potentially useful short tandem repeats (STR) as markers for carrier detection in PKU families in Egypt, and to determine the level of PAH heterozygosity within the Egyptian population. The system contains at least eight independent alleles in the Egyptian population, transmitted in a Mendelian fashion. Variations in the number of STR in the 16 families studied gave rise to polymorphisms that proved to be suitable markers for PKU carrier detection and prenatal diagnosis. The most frequent allelic fragment size in PKU patients was 246 bp (35.7%), which together with a fragment of 254 bp accounted for 60.7% of the mutant chromosomes.

  14. Production of novel recombinant single-domain antibodies against tandem repeat region of MUC1 mucin.

    PubMed

    Rahbarizadeh, F; Rasaee, M J; Forouzandeh Moghadam, M; Allameh, A A; Sadroddiny, E

    2004-06-01

    Recently, the existence of "heavy-chain" antibody in Camelidae has been described. However, as yet there is no data on the binding of this type of antibody to peptides. In addition, there was not any report of production of single-domain antibodies in two-humped camels (Camelus bactrianus). In the present study, these questions are addressed. We showed the feasibility of immunizing old world camels, cloning the repertoire of the variable domain of their heavy-chain antibodies, panning and selection, leading to the successful identification of minimum-sized antigen binders. Antigen-specific fragments of the heavy-chain IgGs (V(HH)) are of great interest in biotechnology because they are very stable, highly soluble, and react specifically and with high affinity to the antigens. In this study, we immunized two camels (Camelus dromedarius and Camelus bactrianus) with homogenized cancerous tissues, synthetic peptide, and human milk fat globule membrane (HMFG), and generated two V(HH) libraries displayed on phage particles. Some single-domain antibody fragments have been isolated that specifically recognize the tandem repeat region of MUC1. The camels' single-domain V(HH) harbor the original, intact antigen binding site and reacted specifically and with high affinity to the tandem repeat region of MUC1. Indeed soluble, specific antigen binders and good affinities (in the range of 0.2 x 10(9) M(-1) to 0.6 x 10(9) M(-1)) were identified from these libraries. This is the first example of the isolation of camel anti-peptide V(HH) domains.

  15. The central domain of bovine submaxillary mucin consists of over 50 tandem repeats of 329 amino acids. Chromosomal localization of the BSM1 gene and relations to ovine and porcine counterparts.

    PubMed

    Jiang, W; Gupta, D; Gallagher, D; Davis, S; Bhavanandan, V P

    2000-04-01

    We previously elucidated five distinct protein domains (I-V) for bovine submaxillary mucin, which is encoded by two genes, BSM1 and BSM2. Using Southern blot analysis, genomic cloning and sequencing of the BSM1 gene, we now show that the central domain (V) consists of approximately 55 tandem repeats of 329 amino acids and that domains III-V are encoded by a 58.4-kb exon, the largest exon known for all genes to date. The BSM1 gene was mapped by fluorescence in situ hybridization to the proximal half of chromosome 5 at bands q2. 2-q2.3. The amino-acid sequence of six tandem repeats (two full and four partial) were found to have only 92-94% identities. We propose that the variability in the amino-acid sequences of the mucin tandem repeat is important for generating the combinatorial library of saccharides that are necessary for the protective function of mucins. The deduced peptide sequences of the central domain match those determined from the purified bovine submaxillary mucin and also show 68-94% identity to published peptide sequences of ovine submaxillary mucin. This indicates that the core protein of ovine submaxillary mucin is closely related to that of bovine submaxillary mucin and contains similar tandem repeats in the central domain. In contrast, the central domain of porcine submaxillary mucin is reported to consist of 81-amino-acid tandem repeats. However, both bovine submaxillary mucin and porcine submaxillary mucin contain similar N-terminal and C-terminal domains and the corresponding genes are in the conserved linkage regions of the respective genomes.

  16. Multilocus Variable-Number-Tandem-Repeats Analysis (MLVA) distinguishes a clonal complex of Clavibacter michiganensis subsp. michiganensis strains isolated from recent outbreaks of bacterial wilt and canker in Belgium

    PubMed Central

    2013-01-01

    Background Clavibacter michiganensis subsp. michiganensis (Cmm) causes bacterial wilt and canker in tomato. Cmm is present nearly in all European countries. During the last three years several local outbreaks were detected in Belgium. The lack of a convenient high-resolution strain-typing method has hampered the study of the routes of transmission of Cmm and epidemiology in tomato cultivation. In this study the genetic relatedness among a worldwide collection of Cmm strains and their relatives was approached by gyrB and dnaA gene sequencing. Further, we developed and applied a multilocus variable number of tandem repeats analysis (MLVA) scheme to discriminate among Cmm strains. Results A phylogenetic analysis of gyrB and dnaA gene sequences of 56 Cmm strains demonstrated that Belgian Cmm strains from recent outbreaks of 2010–2012 form a genetically uniform group within the Cmm clade, and Cmm is phylogenetically distinct from other Clavibacter subspecies and from non-pathogenic Clavibacter-like strains. MLVA conducted with eight minisatellite loci detected 25 haplotypes within Cmm. All strains from Belgian outbreaks, isolated between 2010 and 2012, together with two French strains from 2010 seem to form one monomorphic group. Regardless of the isolation year, location or tomato cultivar, Belgian strains from recent outbreaks belonged to the same haplotype. On the contrary, strains from diverse geographical locations or isolated over longer periods of time formed mostly singletons. Conclusions We hypothesise that the introduction might have originated from one lot of seeds or contaminated tomato seedlings that was the source of the outbreak in 2010 and that these Cmm strains persisted and induced infection in 2011 and 2012. Our results demonstrate that MLVA is a promising typing technique for a local surveillance and outbreaks investigation in epidemiological studies of Cmm. PMID:23738754

  17. Multilocus variable-number-tandem-repeats analysis (MLVA) distinguishes a clonal complex of Clavibacter michiganensis subsp. michiganensis strains isolated from recent outbreaks of bacterial wilt and canker in Belgium.

    PubMed

    Zaluga, Joanna; Stragier, Pieter; Van Vaerenbergh, Johan; Maes, Martine; De Vos, Paul

    2013-06-05

    Clavibacter michiganensis subsp. michiganensis (Cmm) causes bacterial wilt and canker in tomato. Cmm is present nearly in all European countries. During the last three years several local outbreaks were detected in Belgium. The lack of a convenient high-resolution strain-typing method has hampered the study of the routes of transmission of Cmm and epidemiology in tomato cultivation. In this study the genetic relatedness among a worldwide collection of Cmm strains and their relatives was approached by gyrB and dnaA gene sequencing. Further, we developed and applied a multilocus variable number of tandem repeats analysis (MLVA) scheme to discriminate among Cmm strains. A phylogenetic analysis of gyrB and dnaA gene sequences of 56 Cmm strains demonstrated that Belgian Cmm strains from recent outbreaks of 2010-2012 form a genetically uniform group within the Cmm clade, and Cmm is phylogenetically distinct from other Clavibacter subspecies and from non-pathogenic Clavibacter-like strains. MLVA conducted with eight minisatellite loci detected 25 haplotypes within Cmm. All strains from Belgian outbreaks, isolated between 2010 and 2012, together with two French strains from 2010 seem to form one monomorphic group. Regardless of the isolation year, location or tomato cultivar, Belgian strains from recent outbreaks belonged to the same haplotype. On the contrary, strains from diverse geographical locations or isolated over longer periods of time formed mostly singletons. We hypothesise that the introduction might have originated from one lot of seeds or contaminated tomato seedlings that was the source of the outbreak in 2010 and that these Cmm strains persisted and induced infection in 2011 and 2012. Our results demonstrate that MLVA is a promising typing technique for a local surveillance and outbreaks investigation in epidemiological studies of Cmm.

  18. Multiple-locus variable-number tandem repeat analysis of Salmonella Enteritidis isolates from human and non-human sources using a single multiplex PCR

    PubMed Central

    Cho, Seongbeom; Boxrud, David J; Bartkus, Joanne M; Whittam, Thomas S; Saeed, Mahdi

    2007-01-01

    Simplified multiple-locus variable-number tandem repeat analysis (MLVA) was developed using one-shot multiplex PCR for seven variable-number tandem repeats (VNTR) markers with high diversity capacity. MLVA, phage typing, and PFGE methods were applied on 34 diverse Salmonella Enteritidis isolates from human and non-human sources. MLVA detected allelic variations that helped to classify the S. Enteritidis isolates into more evenly distributed subtypes than other methods. MLVA-based S. Enteritidis clonal groups were largely associated with sources of the isolates. Nei's diversity indices for polymorphism ranged from 0.25 to 0.70 for seven VNTR loci markers. Based on Simpson's and Shannon's diversity indices, MLVA had a higher discriminatory power than pulsed field gel electrophoresis (PFGE), phage typing, or multilocus enzyme electrophoresis. Therefore, MLVA may be used along with PFGE to enhance the effectiveness of the molecular epidemiologic investigation of S. Enteritidis infections. PMID:17692097

  19. Variability of CAG tandem repeats in exon 1 of the androgen receptor gene is not related with dog intersexuality.

    PubMed

    Nowacka-Woszuk, J; Switonski, M

    2010-02-01

    Numerous mutations of the human androgen receptor (AR) gene cause an intersexual phenotype, called the androgen insensitivity syndrome. The intersexual phenotype is also quite often diagnosed in dogs. The aim of this study was to conduct a comparative analysis of the entire coding sequence (eight exons) of the AR gene in healthy and four intersex dogs, as well as in three other canids (the red fox, arctic fox and Chinese raccoon dog). The coding sequence of the studied species appeared to be conserved (similarity above 97%) and polymorphism was found in exon 1 only. Altogether, 2 SNPs were identified in healthy dogs, 14 in red foxes, 16 in arctic foxes and 6 were found in Chinese raccoon dogs, respectively. Moreover, a variable number of tandem repeats (CAG and CAA), encoding an array of glutamines, was also observed in this exon. The CAA codon numbers were invariable within species, but the CAG repeats were polymorphic. The highest number of the CAG and CAA repeats was found in dogs (from 40 to 42) and the observed variability was similar in intersex and healthy dogs. In the other canids the variability fell within the following ranges: 29-37 (red fox), 37-39 (arctic fox) and 29-32 (Chinese raccoon dog). In addition, a polymorphic microsatellite marker in intron 2 was found in the dog, red fox and Chinese raccoon dog. It was concluded that the polymorphism level of the AR gene in the dog was lower than in the other canids and none of the detected polymorphisms, including variability of the CAG tandem repeats, could be related with the intersexual phenotype of the studied dogs.

  20. Comparative and functional characterization of intragenic tandem repeats in 10 Aspergillus genomes.

    PubMed

    Gibbons, John G; Rokas, Antonis

    2009-03-01

    Intragenic tandem repeats (ITRs) are consecutive repeats of three or more nucleotides found in coding regions. ITRs are the underlying cause of several human genetic diseases and have been associated with phenotypic variation, including pathogenesis, in several clades of the tree of life. We have examined the evolution and functional role of ITRs in 10 genomes spanning the fungal genus Aspergillus, a clade of relevance to medicine, agriculture, and industry. We identified several hundred ITRs in each of the species examined. ITR content varied extensively between species, with an average 79% of ITRs unique to a given species. For the fraction of conserved ITR regions, sequence comparisons within species and between close relatives revealed that they were highly variable. ITR-containing proteins were evolutionarily less conserved, compositionally distinct, and overrepresented for domains associated with cell-surface localization and function relative to the rest of the proteome. Furthermore, ITRs were preferentially found in proteins involved in transcription, cellular communication, and cell-type differentiation but were underrepresented in proteins involved in metabolism and energy. Importantly, although ITRs were evolutionarily labile, their functional associations appeared. To be remarkably conserved across eukaryotes. Fungal ITRs likely participate in a variety of developmental processes and cell-surface-associated functions, suggesting that their contribution to fungal lifestyle and evolution may be more general than previously assumed.

  1. Whole genome evaluation of tandem repeat polymorphisms between two pathogenically similar strains of Xylella fastidiosa isolated from almond and grape in California

    USDA-ARS?s Scientific Manuscript database

    Whole genome tandem repeat polymorphisms were evaluated between two closely related Xylella fastidiosa strains, M23 and Temecula1, both cause almond leaf scorch disease (ALSD) and grape Pierce’s disease (PD) in California. Strain M23 was isolated from almond and the genome was sequenced in this stu...

  2. Multilocus variable-number tandem repeat analysis distinguishes outbreak and sporadic Escherichia coli O157:H7 isolates.

    PubMed

    Noller, Anna C; McEllistrem, M Catherine; Pacheco, Antonio G F; Boxrud, David J; Harrison, Lee H

    2003-12-01

    Escherichia coli O157:H7 is a major cause of food-borne illness in the United States. Outbreak detection involves traditional epidemiological methods and routine molecular subtyping by pulsed-field gel electrophoresis (PFGE). PFGE is labor-intensive, and the results are difficult to analyze and not easily transferable between laboratories. Multilocus variable-number tandem repeat (VNTR) analysis (MLVA) is a fast, portable method that analyzes multiple VNTR loci, which are areas of the bacterial genome that evolve quickly. Eighty isolates, including 21 isolates from five epidemiologically well-characterized outbreaks from Pennsylvania and Minnesota, were analyzed by PFGE and MLVA. Strains in PFGE clusters were defined as strains that differed by less than or equal to one band by using XbaI and the confirmatory enzyme SpeI. MLVA was performed by comparing the number of tandem repeats at seven loci. From 6 to 30 alleles were found at the seven loci, resulting in 64 MLVA types among the 80 isolates. MLVA correctly identified the isolates from all five outbreaks if only a single-locus variant was allowed. MLVA differentiated strains with unique PFGE types. Additionally, MLVA discriminated strains within PFGE-defined clusters that were not known to be part of an outbreak. In addition to being a simple and validated method for E. coli O157:H7 outbreak detection, MLVA appears to have a sensitivity equal to that of PFGE and a specificity superior to that of PFGE.

  3. Novel variable number of tandem repeats of gibbon MAOA gene and its evolutionary significance.

    PubMed

    Choi, Yuri; Jung, Yi-Deun; Ayarpadikannan, Selvam; Koga, Akihiko; Imai, Hiroo; Hirai, Hirohisa; Roos, Christian; Kim, Heui-Soo

    2014-08-01

    Variable number of tandem repeats (VNTRs) are scattered throughout the primate genome, and genetic variation of these VNTRs have been accumulated during primate radiation. Here, we analyzed VNTRs upstream of the monoamine oxidase A (MAOA) gene in 11 different gibbon species. An abundance of truncated VNTR sequences and copy number differences were observed compared to those of human VNTR sequences. To better understand the biological role of these VNTRs, a luciferase activity assay was conducted and results indicated that selected VNTR sequences of the MAOA gene from human and three different gibbon species (Hylobates klossii, Hylobates lar, and Nomascus concolor) showed silencing ability. Together, these data could be useful for understanding the evolutionary history and functional significance of MAOA VNTR sequences in gibbon species.

  4. Sex differences in TTC12/ANKK1 haplotype associations with daily tobacco smoking in Black and White Americans.

    PubMed

    David, Sean P; Mezuk, Briana; Zandi, Peter P; Strong, David; Anthony, James C; Niaura, Raymond; Uhl, George R; Eaton, William W

    2010-03-01

    The 11q23.1 genomic region has been associated with nicotine dependence in Black and White Americans. By conducting linkage disequilibrium analyses of 7 informative single nucleotide polymorphisms (SNPs) within the tetratricopeptide repeat domain 12 (TTC12)/ankyrin repeat and kinase containing 1 (ANKK1)/dopamine (D2) receptor gene cluster, we identified haplotype block structures in 270 Black and 368 White (n = 638) participants, from the Baltimore Epidemiologic Catchment Area cohort study, spanning the TTC12 and ANKK1 genes consisting of three SNPs (rs2303380-rs4938015-rs11604671). Informative haplotypes were examined for sex-specific associations with daily tobacco smoking initiation and cessation using longitudinal data from 1993-1994 and 2004-2005 interviews. There was a Haplotype x Sex interaction such that Black men possessing the GTG haplotype who were smokers in 1993-2004 were more likely to have stopped smoking by 2004-2005 (55.6% GTG vs. 22.0% other haplotypes), while Black women were less likely to have quit smoking if they possessed the GTG (20.8%) versus other haplotypes (24.0%; p = .028). In Whites, the GTG haplotype (vs. other haplotypes) was associated with lifetime history of daily smoking (smoking initiation; odds ratio = 1.6; 95% CI = 1.1-2.4; p = .013). Moreover, there was a Haplotype x Sex interaction such that there was higher prevalence of smoking initiation with GTG (77.6%) versus other haplotypes (57.0%; p = .043). In 2 different ethnic American populations, we observed man-woman variation in the influence of the rs2303380-rs4938015-rs11604671 GTG haplotype on smoking initiation and cessation. These results should be replicated in larger cohorts to establish the relationship among the rs2303380-rs4938015-rs11604671 haplotype block, sex, and smoking behavior.

  5. Haplotypes and gene expression implicate the MAPT region for Parkinson disease

    PubMed Central

    Tobin, J.E.; Latourelle, J.C.; Lew, M.F.; Klein, C.; Suchowersky, O.; Shill, H.A.; Golbe, L.I.; Mark, M.H.; Growdon, J.H.; Wooten, G.F.; Racette, B.A.; Perlmutter, J.S.; Watts, R.; Guttman, M.; Baker, K.B.; Goldwurm, S.; Pezzoli, G.; Singer, C.; Saint-Hilaire, M.H.; Hendricks, A.E.; Williamson, S.; Nagle, M.W.; Wilk, J.B.; Massood, T.; Laramie, J.M.; DeStefano, A.L.; Litvan, I.; Nicholson, G.; Corbett, A.; Isaacson, S.; Burn, D.J.; Chinnery, P.F.; Pramstaller, P.P.; Sherman, S.; Al-hinti, J.; Drasby, E.; Nance, M.; Moller, A.T.; Ostergaard, K.; Roxburgh, R.; Snow, B.; Slevin, J.T.; Cambi, F.; Gusella, J.F.; Myers, R.H.

    2009-01-01

    Background Microtubule-associated protein tau (MAPT) has been associated with several neurodegenerative disorders including forms of parkinsonism and Parkinson disease (PD). We evaluated the association of the MAPT region with PD in a large cohort of familial PD cases recruited by the GenePD Study. In addition, postmortem brain samples from patients with PD and neurologically normal controls were used to evaluate whether the expression of the 3-repeat and 4-repeat isoforms of MAPT, and neighboring genes Saitohin (STH) and KIAA1267, are altered in PD cerebellum. Methods Twenty-one single-nucleotide polymorphisms (SNPs) in the region of MAPT on chromosome 17q21 were genotyped in the GenePD Study. Single SNPs and haplotypes, including the H1 haplotype, were evaluated for association to PD. Relative quantification of gene expression was performed using real-time RT-PCR. Results After adjusting for multiple comparisons, SNP rs1800547 was significantly associated with PD affection. While the H1 haplotype was associated with a significantly increased risk for PD, a novel H1 subhaplotype was identified that predicted a greater increased risk for PD. The expression of 4-repeat MAPT, STH, and KIAA1267 was significantly increased in PD brains relative to controls. No difference in expression was observed for 3-repeat MAPT. Conclusions This study supports a role for MAPT in the pathogenesis of familial and idiopathic Parkinson disease (PD). Interestingly, the results of the gene expression studies suggest that other genes in the vicinity of MAPT, specifically STH and KIAA1267, may also have a role in PD and suggest complex effects for the genes in this region on PD risk. PMID:18509094

  6. Cis-acting mutation and duplication: History of molecular evolution in a P450 haplotype responsible for insecticide resistance in Culex quinquefasciatus.

    PubMed

    Itokawa, Kentaro; Komagata, Osamu; Kasai, Shinji; Masada, Masahiro; Tomita, Takashi

    2011-07-01

    A cytochrome P450 gene, Cyp9m10, is more than 200-fold overexpressed in a pyrethroid resistant strain of Culex quinquefasciatus, JPal-per. The haplotype of this strain contains two copies of Cyp9m10 resulted from recent tandem duplication. In this study, we discovered and isolated a Cyp9m10 haplotype closely related to this duplicated Cyp9m10 haplotype from JHB, a strain used for the recent genome project for this mosquito species. The isolated haplotype (JHB-NIID-B haplotype) shared the same insertion of a transposable element upstream of the coding region with JPal-per strain but not duplicated. The JHB-NIID-B haplotype was considered to have diverged from the JPal-per lineage just before the duplication event. Cyp9m10 was moderately overexpressed in larvae with the JHB-NIID-B haplotype. The overexpressions in JHB-NIID-B and JPal-per haplotypes were developmentally regulated in similar pattern indicating both haplotypes share a common cis-acting mutation responsible for the overexpressions. The isolated moderately overexpressed haplotype conferred resistance, however, its efficacy was relatively small. We hypothesized that the first cis-acting mutation modified the consequence of the subsequent duplication in JPal-per lineage to confer stronger phenotypic effect than that if it occurred before the first cis-acting mutation. Copyright © 2011 Elsevier Ltd. All rights reserved.

  7. Protein arginine methyltransferase 7 has a novel homodimer-like structure formed by tandem repeats.

    PubMed

    Hasegawa, Morio; Toma-Fukai, Sachiko; Kim, Jun-Dal; Fukamizu, Akiyoshi; Shimizu, Toshiyuki

    2014-05-21

    Protein arginine methyltransferase 7 (PRMT7) is a member of a family of enzymes that catalyze the transfer of methyl groups from S-adenosyl-l-methionine to nitrogen atoms on arginine residues. Here, we describe the crystal structure of Caenorhabditis elegans PRMT7 in complex with its reaction product S-adenosyl-L-homocysteine. The structural data indicated that PRMT7 harbors two tandem repeated PRMT core domains that form a novel homodimer-like structure. S-adenosyl-L-homocysteine bound to the N-terminal catalytic site only; the C-terminal catalytic site is occupied by a loop that inhibits cofactor binding. Mutagenesis demonstrated that only the N-terminal catalytic site of PRMT7 is responsible for cofactor binding. Copyright © 2014 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.

  8. Worldwide genealogy of Entamoeba histolytica: an overview to understand haplotype distribution and infection outcome.

    PubMed

    Zermeño, Valeria; Ximénez, Cecilia; Morán, Patricia; Valadez, Alicia; Valenzuela, Olivia; Rascón, Edgar; Diaz, Daniel; Cerritos, René

    2013-07-01

    Although Entamoeba histolytica is one of the most prevalent intestinal parasites, how the different strains of this species are distributed all over the world and how different genotypes are associated with the infection outcome are yet to be fully understood. Recently, the use of a number of molecular markers has made the characterization of several genotypes in those regions with high incidence of amoebiasis possible. This work proposes the first genealogy of E. histolytica, with an haplotype network based on two tRNA gene-linked array of Short Tandem Repeats (STRs) reported until today, and 47 sequences from 39 new isolates of Mexican Amoebic Liver Abscesses (ALA) samples. One hundred and three sequences were obtained from D-A locus, their information about the geographic region of isolation as well as clinical diagnosis were also collected. One hundred and five sequences from N-K2 locus were also obtained as well as the region of isolation, but the information about clinical diagnosis was not available in all cases. The most abundant and widely distributed haplotype in the world is the one of E. histolytica HM1:IMSS strain. This was found in Mexico, Bangladesh, Japan, China and USA and is associated to symptomatic patients as well as asymptomatic cyst passers. Many other haplotypes were found only in a single country. Both genealogies suggest that there are no lineages within the networks that may be related to a particular geographic region or infection outcome. A concatenated analysis of the two molecular markers revealed 12 different combinations, which suggests the possibility of genetic recombination events. The present study is the first to propose a global genealogy of this species and suggests that there are still many genotypes to be discovered. The genotyping of new isolates will help to understand the great diversity and genetic structure of this parasite. Copyright © 2013 Elsevier B.V. All rights reserved.

  9. Huntington disease in the South African population occurs on diverse and ethnically distinct genetic haplotypes

    PubMed Central

    Baine, Fiona K; Kay, Chris; Ketelaar, Maria E; Collins, Jennifer A; Semaka, Alicia; Doty, Crystal N; Krause, Amanda; Jacquie Greenberg, L; Hayden, Michael R

    2013-01-01

    Huntington disease (HD) is a neurodegenerative disorder resulting from the expansion of a CAG trinucleotide repeat in the huntingtin (HTT) gene. Worldwide prevalence varies geographically with the highest figures reported in populations of European ancestry. HD in South Africa has been reported in Caucasian, black and mixed subpopulations, with similar estimated prevalence in the Caucasian and mixed groups and a lower estimate in the black subpopulation. Recent studies have associated specific HTT haplotypes with HD in distinct populations. Expanded HD alleles in Europe occur predominantly on haplogroup A (specifically high-risk variants A1/A2), whereas in East Asian populations, HD alleles are associated with haplogroup C. Whether specific HTT haplotypes associate with HD in black Africans and how these compare with haplotypes found in European and East Asian populations remains unknown. The current study genotyped the HTT region in unaffected individuals and HD patients from each of the South African subpopulations, and haplotypes were constructed. CAG repeat sizes were determined and phased to haplotype. Results indicate that HD alleles from Caucasian and mixed patients are predominantly associated with haplogroup A, signifying a similar European origin for HD. However, in black patients, HD occurs predominantly on haplogroup B, suggesting several distinct origins of the mutation in South Africa. The absence of high-risk variants (A1/A2) in the black subpopulation may also explain the reported low prevalence of HD. Identification of haplotypes associated with HD-expanded alleles is particularly relevant to the development of population-specific therapeutic targets for selective suppression of the expanded HTT transcript. PMID:23463025

  10. Effective application of multiple locus variable number of tandem repeats analysis to tracing Staphylococcus aureus in food-processing environment.

    PubMed

    Rešková, Z; Koreňová, J; Kuchta, T

    2014-04-01

    A total of 256 isolates of Staphylococcus aureus were isolated from 98 samples (34 swabs and 64 food samples) obtained from small or medium meat- and cheese-processing plants in Slovakia. The strains were genotypically characterized by multiple locus variable number of tandem repeats analysis (MLVA), involving multiplex polymerase chain reaction (PCR) with subsequent separation of the amplified DNA fragments by an automated flow-through gel electrophoresis. With the panel of isolates, MLVA produced 31 profile types, which was a sufficient discrimination to facilitate the description of spatial and temporal aspects of contamination. Further data on MLVA discrimination were obtained by typing a subpanel of strains by multiple locus sequence typing (MLST). MLVA coupled to automated electrophoresis proved to be an effective, comparatively fast and inexpensive method for tracing S. aureus contamination of food-processing factories. Subspecies genotyping of microbial contaminants in food-processing factories may facilitate identification of spatial and temporal aspects of the contamination. This may help to properly manage the process hygiene. With S. aureus, multiple locus variable number of tandem repeats analysis (MLVA) proved to be an effective method for the purpose, being sufficiently discriminative, yet comparatively fast and inexpensive. The application of automated flow-through gel electrophoresis to separation of DNA fragments produced by multiplex PCR helped to improve the accuracy and speed of the method. © 2013 The Society for Applied Microbiology.

  11. A multiplex PCR system for 13 RM Y-STRs with separate amplification of two different repeat motif structures in DYF403S1a.

    PubMed

    Lee, Eun Young; Lee, Hwan Young; Kwon, So Yeun; Oh, Yu Na; Yang, Woo Ick; Shin, Kyoung-Jin

    2017-01-01

    In forensic science and human genetics, Y-chromosomal short tandem repeats (Y-STRs) have been used as very useful markers. Recently, more Y-STR markers have been analyzed to enhance the resolution power in haplotype analysis, and 13 rapidly mutating (RM) Y-STRs have been suggested as revolutionary tools that can widen Y-chromosomal application from paternal lineage differentiation to male individualization. We have constructed two multiplex PCR sets for the amplification of 13 RM Y-STRs, which yield small-sized amplicons (<400bp) and a more balanced PCR efficiency with minimum PCR cycling. In particular, with the developed multiplex PCR system, we could separate three copies of DYF403S1a into two copies of DYF403S1a and one of DYF403S1b1. This is because DYF403S1b1 possesses distinguishable sequences from DYF403S1a at both the front and rear flanking regions of the repeat motif; therefore, the locus could be separately amplified using sequence-specific primers. In addition, the other copy, defined as DYF403S1b by Ballantyne et al., was renamed DYF403S1b2 because of its similar flanking region sequence to DYF403S1b1. By redefining DYF403S1 with the developed multiplex system, all genotypes of four copies could be successfully typed and more diverse haplotypes were obtained. We analyzed haplotype distributions in 705 Korean males based on four different Y-STR subsets: Yfiler, PowerPlex Y23, Yfiler Plus, and RM Y-STRs. All haplotypes obtained from RM Y-STRs were the most diverse and showed strong discriminatory power in Korean population. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  12. Transferability of short tandem repeat markers for two wild Canid species inhabiting the Brazilian Cerrado.

    PubMed

    Rodrigues, F M; Telles, M P C; Resende, L V; Soares, T N; Diniz-Filho, J A F; Jácomo, A T A; Silveira, L

    2006-12-13

    The maned wolf (Chrysocyon brachyurus) and the crab-eating fox (Cerdocyon thous) are two wild-canid species found in the Brazilian Cerrado. We tested cross-amplification and transferability of 29 short tandem repeat primers originally developed for cattle and domestic dogs and cats on 38 individuals of each of these two species, collected in the Emas National Park, which is the largest national park in the Cerrado region. Six of these primers were successfully transferred (CSSM-038, PEZ-05, PEZ-12, LOCO-13, LOCO-15, and PEZ-20); five of which were found to be polymorphic. Genetic parameter values (number of alleles per locus, observed and expected heterozygosities, and fixation indices) were within the expected range reported for canid populations worldwide.

  13. Variable-number-of-tandem-repeats analysis of genetic diversity in Pasteuria ramosa.

    PubMed

    Mouton, L; Ebert, D

    2008-05-01

    Variable-number-of-tandem-repeats (VNTR) markers are increasingly being used in population genetic studies of bacteria. They were recently developed for Pasteuria ramosa, an endobacterium that infects Daphnia species. In the present study, we genotyped P. ramosa in 18 infected hosts from the United Kingdom, Belgium, and two lakes in the United States using seven VNTR markers. Two Daphnia species were collected: D. magna and D. dentifera. Six loci showed length polymorphism, with as many as five alleles identified for a single locus. Similarity coefficient calculations showed that the extent of genetic variation between pairs of isolates within populations differed according to the population, but it was always less than the genetic distances among populations. Analysis of the genetic distances performed using principal component analysis revealed strong clustering by location of origin, but not by host Daphnia species. Our study demonstrated that the VNTR markers available for P. ramosa are informative in revealing genetic differences within and among populations and may therefore become an important tool for providing detailed analysis of population genetics and epidemiology.

  14. PGLa-H tandem-repeat peptides active against multidrug resistant clinical bacterial isolates.

    PubMed

    Rončević, Tomislav; Gajski, Goran; Ilić, Nada; Goić-Barišić, Ivana; Tonkić, Marija; Zoranić, Larisa; Simunić, Juraj; Benincasa, Monica; Mijaković, Marijana; Tossi, Alessandro; Juretić, Davor

    2017-02-01

    Antimicrobial peptides (AMPs) are promising candidates for new antibiotic classes but often display an unacceptably high toxicity towards human cells. A naturally produced C-terminal fragment of PGLa, named PGLa-H, has been reported to have a very low haemolytic activity while maintaining a moderate antibacterial activity. A sequential tandem repeat of this fragment, diPGLa-H, was designed, as well as an analogue with a Val to Gly substitution at a key position. These peptides showed markedly improved in vitro bacteriostatic and bactericidal activity against both reference strains and multidrug resistant clinical isolates of Gram-negative and Gram-positive pathogens, with generally low toxicity for human cells as assessed by haemolysis, cell viability, and DNA damage assays. The glycine substitution analogue, kiadin, had a slightly better antibacterial activity and reduced haemolytic activity, which may correlate with an increased flexibility of its helical structure, as deduced using molecular dynamics simulations. These peptides may serve as useful lead compounds for developing anti-infective agents against resistant Gram-negative and Gram-positive species. Copyright © 2016 Elsevier B.V. All rights reserved.

  15. Clustering of Tuberculosis Cases Based on Variable-Number Tandem-Repeat Typing in Relation to the Population Structure of Mycobacterium tuberculosis in the Netherlands

    PubMed Central

    Sloot, Rosa; Borgdorff, Martien W.; de Beer, Jessica L.; van Ingen, Jakko; Supply, Philip

    2013-01-01

    The population structure of 3,776 Mycobacterium tuberculosis isolates was determined using variable-number tandem-repeat (VNTR) typing. The degree of clonality was so high that a more relaxed definition of clustering cannot be applied. Among recent immigrants with non-Euro-American isolates, transmission is overestimated if based on identical VNTR patterns. PMID:23658260

  16. TRDistiller: a rapid filter for enrichment of sequence datasets with proteins containing tandem repeats.

    PubMed

    Richard, François D; Kajava, Andrey V

    2014-06-01

    The dramatic growth of sequencing data evokes an urgent need to improve bioinformatics tools for large-scale proteome analysis. Over the last two decades, the foremost efforts of computer scientists were devoted to proteins with aperiodic sequences having globular 3D structures. However, a large portion of proteins contain periodic sequences representing arrays of repeats that are directly adjacent to each other (so called tandem repeats or TRs). These proteins frequently fold into elongated fibrous structures carrying different fundamental functions. Algorithms specific to the analysis of these regions are urgently required since the conventional approaches developed for globular domains have had limited success when applied to the TR regions. The protein TRs are frequently not perfect, containing a number of mutations, and some of them cannot be easily identified. To detect such "hidden" repeats several algorithms have been developed. However, the most sensitive among them are time-consuming and, therefore, inappropriate for large scale proteome analysis. To speed up the TR detection we developed a rapid filter that is based on the comparison of composition and order of short strings in the adjacent sequence motifs. Tests show that our filter discards up to 22.5% of proteins which are known to be without TRs while keeping almost all (99.2%) TR-containing sequences. Thus, we are able to decrease the size of the initial sequence dataset enriching it with TR-containing proteins which allows a faster subsequent TR detection by other methods. The program is available upon request. Copyright © 2014 Elsevier Inc. All rights reserved.

  17. Mechanisms of haplotype divergence at the RGA08 nucleotide-binding leucine-rich repeat gene locus in wild banana (Musa balbisiana).

    PubMed

    Baurens, Franc-Christophe; Bocs, Stéphanie; Rouard, Mathieu; Matsumoto, Takashi; Miller, Robert N G; Rodier-Goud, Marguerite; MBéguié-A-MBéguié, Didier; Yahiaoui, Nabila

    2010-07-16

    Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M. balbisiana haplotypes. A

  18. Mechanisms of haplotype divergence at the RGA08 nucleotide-binding leucine-rich repeat gene locus in wild banana (Musa balbisiana)

    PubMed Central

    2010-01-01

    Background Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Results Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M

  19. Haplotypic Background of a Private Allele at High Frequency in the Americas

    PubMed Central

    Schroeder, Kari B.; Jakobsson, Mattias; Crawford, Michael H.; Schurr, Theodore G.; Boca, Simina M.; Conrad, Donald F.; Tito, Raul Y.; Osipova, Ludmilla P.; Tarskaia, Larissa A.; Zhadanov, Sergey I.; Wall, Jeffrey D.; Pritchard, Jonathan K.; Malhi, Ripan S.; Smith, David G.; Rosenberg, Noah A.

    2009-01-01

    Recently, the observation of a high-frequency private allele, the 9-repeat allele at microsatellite D9S1120, in all sampled Native American and Western Beringian populations has been interpreted as evidence that all modern Native Americans descend primarily from a single founding population. However, this inference assumed that all copies of the 9-repeat allele were identical by descent and that the geographic distribution of this allele had not been influenced by natural selection. To investigate whether these assumptions are satisfied, we genotyped 34 single nucleotide polymorphisms across ∼500 kilobases (kb) around D9S1120 in 21 Native American and Western Beringian populations and 54 other worldwide populations. All chromosomes with the 9-repeat allele share the same haplotypic background in the vicinity of D9S1120, suggesting that all sampled copies of the 9-repeat allele are identical by descent. Ninety-one percent of these chromosomes share the same 76.26 kb haplotype, which we call the “American Modal Haplotype” (AMH). Three observations lead us to conclude that the high frequency and widespread distribution of the 9-repeat allele are unlikely to be the result of positive selection: 1) aside from its association with the 9-repeat allele, the AMH does not have a high frequency in the Americas, 2) the AMH is not unusually long for its frequency compared with other haplotypes in the Americas, and 3) in Latin American mestizo populations, the proportion of Native American ancestry at D9S1120 is not unusual compared with that observed at other genomewide microsatellites. Using a new method for estimating the time to the most recent common ancestor (MRCA) of all sampled copies of an allele on the basis of an estimate of the length of the genealogy descended from the MRCA, we calculate the mean time to the MRCA of the 9-repeat allele to be between 7,325 and 39,900 years, depending on the demographic model used. The results support the hypothesis that all

  20. Single-cell forensic short tandem repeat typing within microfluidic droplets.

    PubMed

    Geng, Tao; Novak, Richard; Mathies, Richard A

    2014-01-07

    A short tandem repeat (STR) typing method is developed for forensic identification of individual cells. In our strategy, monodisperse 1.5 nL agarose-in-oil droplets are produced with a high frequency using a microfluidic droplet generator. Statistically dilute single cells, along with primer-functionalized microbeads, are randomly compartmentalized in the droplets. Massively parallel single-cell droplet polymerase chain reaction (PCR) is performed to transfer replicas of desired STR targets from the single-cell genomic DNA onto the coencapsulated microbeads. These DNA-conjugated beads are subsequently harvested and reamplified under statistically dilute conditions for conventional capillary electrophoresis (CE) STR fragment size analysis. The 9-plex STR profiles of single cells from both pure and mixed populations of GM09947 and GM09948 human lymphoid cells show that all alleles are correctly called and allelic drop-in/drop-out is not observed. The cell mixture study exhibits a good linear relationship between the observed and input cell ratios in the range of 1:1 to 10:1. Additionally, the STR profile of GM09947 cells could be deduced even in the presence of a high concentration of cell-free contaminating 9948 genomic DNA. Our method will be valuable for the STR analysis of samples containing mixtures of cells/DNA from multiple contributors and for low-concentration samples.

  1. FMR1 CGG repeat expansion mutation detection and linked haplotype analysis for reliable and accurate preimplantation genetic diagnosis of fragile X syndrome.

    PubMed

    Rajan-Babu, Indhu-Shree; Lian, Mulias; Cheah, Felicia S H; Chen, Min; Tan, Arnold S C; Prasath, Ethiraj B; Loh, Seong Feei; Chong, Samuel S

    2017-07-19

    Fragile X mental retardation 1 (FMR1) full-mutation expansion causes fragile X syndrome. Trans-generational fragile X syndrome transmission can be avoided by preimplantation genetic diagnosis (PGD). We describe a robust PGD strategy that can be applied to virtually any couple at risk of transmitting fragile X syndrome. This novel strategy utilises whole-genome amplification, followed by triplet-primed polymerase chain reaction (TP-PCR) for robust detection of expanded FMR1 alleles, in parallel with linked multi-marker haplotype analysis of 13 highly polymorphic microsatellite markers located within 1 Mb of the FMR1 CGG repeat, and the AMELX/Y dimorphism for gender identification. The assay was optimised and validated on single lymphoblasts isolated from fragile X reference cell lines, and applied to a simulated PGD case and a clinical in vitro fertilisation (IVF)-PGD case. In the simulated PGD case, definitive diagnosis of the expected results was achieved for all 'embryos'. In the clinical IVF-PGD case, delivery of a healthy baby girl was achieved after transfer of an expansion-negative blastocyst. FMR1 TP-PCR reliably detects presence of expansion mutations and obviates reliance on informative normal alleles for determining expansion status in female embryos. Together with multi-marker haplotyping and gender determination, misdiagnosis and diagnostic ambiguity due to allele dropout is minimised, and couple-specific assay customisation can be avoided.

  2. Genetic variation and willingness to participate in epidemiologic research: data from three studies.

    PubMed

    Bhatti, Parveen; Sigurdson, Alice J; Wang, Sophia S; Chen, Jinbo; Rothman, Nathaniel; Hartge, Patricia; Bergen, Andrew W; Landi, Maria Teresa

    2005-10-01

    The differences in common genetic polymorphism frequencies by willingness to participate in epidemiologic studies are unexplored, but the same threats to internal validity operate as for studies with nongenetic information. We analyzed single nucleotide polymorphism genotypes, haplotypes, and short tandem repeats among control groups from three studies with different recruitment designs that included early, late, and never questionnaire responders, one or more participation incentives, and blood or buccal DNA collection. Among 2,955 individuals, we compared 108 genotypes, 8 haplotypes, and 9 to 15 short tandem repeats by respondent type. Among our main comparisons, single nucleotide polymorphism genotype frequencies differed significantly (P < 0.05) between respondent groups in six instances, with 13 expected by chance alone. When comparing the odds of carrying a variant among the various response groups, 19 odds ratios were /=1.40, levels that might be notably different. Among the various respondent group comparisons, haplotype and short tandem repeat frequencies were not significantly different by willingness to participate. We observed little evidence to suggest that genotype differences underlie response characteristics in molecular epidemiologic studies, but a greater variety of genes should be examined, including those related to behavioral traits potentially associated with willingness to participate. To the extent possible, investigators should evaluate their own genetic data for bias in response categories.

  3. The Effective Mutation Rate at Y Chromosome Short Tandem Repeats, with Application to Human Population-Divergence Time

    PubMed Central

    Zhivotovsky, Lev A.; Underhill, Peter A.; Cinnioğlu, Cengiz; Kayser, Manfred; Morar, Bharti; Kivisild, Toomas; Scozzari, Rosaria; Cruciani, Fulvio; Destro-Bisol, Giovanni; Spedini, Gabriella; Chambers, Geoffrey K.; Herrera, Rene J.; Yong, Kiau Kiun; Gresham, David; Tournev, Ivailo; Feldman, Marcus W.; Kalaydjieva, Luba

    2004-01-01

    We estimate an effective mutation rate at an average Y chromosome short-tandem repeat locus as 6.9×10-4 per 25 years, with a standard deviation across loci of 5.7×10-4, using data on microsatellite variation within Y chromosome haplogroups defined by unique-event polymorphisms in populations with documented short-term histories, as well as comparative data on worldwide populations at both the Y chromosome and various autosomal loci. This value is used to estimate the times of the African Bantu expansion, the divergence of Polynesian populations (the Maoris, Cook Islanders, and Samoans), and the origin of Gypsy populations from Bulgaria. PMID:14691732

  4. The evolution of filamin-a protein domain repeat perspective.

    PubMed

    Light, Sara; Sagit, Rauan; Ithychanda, Sujay S; Qin, Jun; Elofsson, Arne

    2012-09-01

    Particularly in higher eukaryotes, some protein domains are found in tandem repeats, performing broad functions often related to cellular organization. For instance, the eukaryotic protein filamin interacts with many proteins and is crucial for the cytoskeleton. The functional properties of long repeat domains are governed by the specific properties of each individual domain as well as by the repeat copy number. To provide better understanding of the evolutionary and functional history of repeating domains, we investigated the mode of evolution of the filamin domain in some detail. Among the domains that are common in long repeat proteins, sushi and spectrin domains evolve primarily through cassette tandem duplications while scavenger and immunoglobulin repeats appear to evolve through clustered tandem duplications. Additionally, immunoglobulin and filamin repeats exhibit a unique pattern where every other domain shows high sequence similarity. This pattern may be the result of tandem duplications, serve to avert aggregation between adjacent domains or it is the result of functional constraints. In filamin, our studies confirm the presence of interspersed integrin binding domains in vertebrates, while invertebrates exhibit more varied patterns, including more clustered integrin binding domains. The most notable case is leech filamin, which contains a 20 repeat expansion and exhibits unique dimerization topology. Clearly, invertebrate filamins are varied and contain examples of similar adjacent integrin-binding domains. Given that invertebrate integrin shows more similarity to the weaker filamin binder, integrin β3, it is possible that the distance between integrin-binding domains is not as crucial for invertebrate filamins as for vertebrates. Copyright © 2012 Elsevier Inc. All rights reserved.

  5. C9orf72 hexanucleotide repeat expansions in Chinese sporadic amyotrophic lateral sclerosis.

    PubMed

    He, Ji; Tang, Lu; Benyamin, Beben; Shah, Sonia; Hemani, Gib; Liu, Rong; Ye, Shan; Liu, Xiaolu; Ma, Yan; Zhang, Huagang; Cremin, Katie; Leo, Paul; Wray, Naomi R; Visscher, Peter M; Xu, Huji; Brown, Matthew A; Bartlett, Perry F; Mangelsdorf, Marie; Fan, Dongsheng

    2015-09-01

    A hexanucleotide repeat expansion (HRE) in the C9orf72 gene has been identified as the most common mutation in amyotrophic lateral sclerosis (ALS) among Caucasian populations. We sought to comprehensively evaluate genetic and epigenetic variants of C9orf72 and the contribution of the HRE in Chinese ALS cases. We performed fragment-length and repeat-primed polymerase chain reaction to determine GGGGCC copy number and expansion within the C9orf72 gene in 1092 sporadic ALS (sALS) and 1062 controls from China. We performed haplotype analysis of 23 single-nucleotide polymorphisms within and surrounding C9orf72. The C9orf72 HRE was found in 3 sALS patients (0.3%) but not in control subjects (p = 0.25). For 2 of the cases with the HRE, genotypes of 8 single-nucleotide polymorphisms flanking the HRE were inconsistent with the haplotype reported to be strongly associated with ALS in Caucasian populations. For these 2 individuals, we found hypermethylation of the CpG island upstream of the repeat, an observation not detected in other sALS patients (p < 10(-8)) or controls. The detailed analysis of the C9orf72 locus in a large cohort of Chinese samples provides robust evidence that may not be consistent with a single Caucasian founder event. Both the Caucasian and Chinese haplotypes associated with HRE were highly associated with repeat lengths >8 repeats implying that both haplotypes may confer instability of repeat length. Copyright © 2015 Elsevier Inc. All rights reserved.

  6. 6-mercaptopurine influences TPMT gene transcription in a TPMT gene promoter variable number of tandem repeats-dependent manner.

    PubMed

    Kotur, Nikola; Stankovic, Biljana; Kassela, Katerina; Georgitsi, Marianthi; Vicha, Anna; Leontari, Iliana; Dokmanovic, Lidija; Janic, Dragana; Krstovski, Nada; Klaassen, Kristel; Radmilovic, Milena; Stojiljkovic, Maja; Nikcevic, Gordana; Simeonidis, Argiris; Sivolapenko, Gregory; Pavlovic, Sonja; Patrinos, George P; Zukic, Branka

    2012-02-01

    TPMT activity is characterized by a trimodal distribution, namely low, intermediate and high methylator. TPMT gene promoter contains a variable number of GC-rich tandem repeats (VNTRs), namely A, B and C, ranging from three to nine repeats in length in an A(n)B(m)C architecture. We have previously shown that the VNTR architecture in the TPMT gene promoter affects TPMT gene transcription. MATERIALS, METHODS & RESULTS: Here we demonstrate, using reporter assays, that 6-mercaptopurine (6-MP) treatment results in a VNTR architecture-dependent decrease of TPMT gene transcription, mediated by the binding of newly recruited protein complexes to the TPMT gene promoter, upon 6-MP treatment. We also show that acute lymphoblastic leukemia patients undergoing 6-MP treatment display a VNTR architecture-dependent response to 6-MP. These data suggest that the TPMT gene promoter VNTR architecture can be potentially used as a pharmacogenomic marker to predict toxicity due to 6-MP treatment in acute lymphoblastic leukemia patients.

  7. Topological characteristics of helical repeat proteins.

    PubMed

    Groves, M R; Barford, D

    1999-06-01

    The recent elucidation of protein structures based upon repeating amino acid motifs, including the armadillo motif, the HEAT motif and tetratricopeptide repeats, reveals that they belong to the class of helical repeat proteins. These proteins share the common property of being assembled from tandem repeats of an alpha-helical structural unit, creating extended superhelical structures that are ideally suited to create a protein recognition interface.

  8. Highly Discriminatory Variable-Number Tandem-Repeat Markers for Genotyping of Trichophyton interdigitale Strains

    PubMed Central

    Drira, Ines; Hadrich, Ines; Neji, Sourour; Mahfouth, Nedia; Trabelsi, Houaida; Sellami, Hayet; Makni, Fattouma

    2014-01-01

    Trichophyton interdigitale is the second most frequent cause of superficial fungal infections of various parts of the human body. Studying the population structure and genotype differentiation of T. interdigitale strains may lead to significant improvements in clinical practice. The present study aimed to develop and select suitable variable-number tandem-repeat (VNTR) markers for 92 clinical strains of T. interdigitale. On the basis of an analysis of four VNTR markers, four to eight distinct alleles were detected for each marker. The marker with the highest discriminatory power had eight alleles and a D value of 0.802. The combination of all four markers yielded a D value of 0.969 with 29 distinct multilocus genotypes. VNTR typing revealed the genetic diversity of the strains, identifying three populations according to their colonization sites. A correlation between phenotypic characteristics and multilocus genotypes was observed. Seven patients harbored T. interdigitale strains with different genotypes. Typing of clinical T. interdigitale samples by VNTR markers displayed excellent discriminatory power and 100% reproducibility. PMID:24989614

  9. Analysis of an "off-ladder" allele at the Penta D short tandem repeat locus.

    PubMed

    Yang, Y L; Wang, J G; Wang, D X; Zhang, W Y; Liu, X J; Cao, J; Yang, S L

    2015-11-25

    Kinship testing of a father and his son from Guangxi, China, the location of the Zhuang minority people, was performed using the PowerPlex® 18D System with a short tandem repeat typing kit. The results indicated that both the father and his son had an off-ladder allele at the Penta D locus, with a genetic size larger than that of the maximal standard allelic ladder. To further identify this locus, monogenic amplification, gene cloning, and genetic sequencing were performed. Sequencing analysis demonstrated that the fragment size of the Penta D-OL locus was 469 bp and the core sequence was [AAAGA]21, also called Penta D-21. The rare Penta D-21 allele was found to be distributed among the Zhuang population from the Guangxi Zhuang Autonomous Region of China; therefore, this study improved the range of DNA data available for this locus and enhanced our ability for individual identification of gene loci.

  10. Neutral polymorphisms in putative housekeeping genes and tandem repeats unravels the population genetics and evolutionary history of Plasmodium vivax in India.

    PubMed

    Prajapati, Surendra K; Joshi, Hema; Carlton, Jane M; Rizvi, M Alam

    2013-01-01

    The evolutionary history and age of Plasmodium vivax has been inferred as both recent and ancient by several studies, mainly using mitochondrial genome diversity. Here we address the age of P. vivax on the Indian subcontinent using selectively neutral housekeeping genes and tandem repeat loci. Analysis of ten housekeeping genes revealed a substantial number of SNPs (n = 75) from 100 P. vivax isolates collected from five geographical regions of India. Neutrality tests showed a majority of the housekeeping genes were selectively neutral, confirming the suitability of housekeeping genes for inferring the evolutionary history of P. vivax. In addition, a genetic differentiation test using housekeeping gene polymorphism data showed a lack of geographical structuring between the five regions of India. The coalescence analysis of the time to the most recent common ancestor estimate yielded an ancient TMRCA (232,228 to 303,030 years) and long-term population history (79,235 to 104,008) of extant P. vivax on the Indian subcontinent. Analysis of 18 tandem repeat loci polymorphisms showed substantial allelic diversity and heterozygosity per locus, and analysis of potential bottlenecks revealed the signature of a stable P. vivax population, further corroborating our ancient age estimates. For the first time we report a comparable evolutionary history of P. vivax inferred by nuclear genetic markers (putative housekeeping genes) to that inferred from mitochondrial genome diversity.

  11. The evolution of filamin – A protein domain repeat perspective

    PubMed Central

    Light, Sara; Sagit, Rauan; Ithychanda, Sujay S.; Qin, Jun; Elofsson, Arne

    2013-01-01

    Particularly in higher eukaryotes, some protein domains are found in tandem repeats, performing broad functions often related to cellular organization. For instance, the eukaryotic protein filamin interacts with many proteins and is crucial for the cytoskeleton. The functional properties of long repeat domains are governed by the specific properties of each individual domain as well as by the repeat copy number. To provide better understanding of the evolutionary and functional history of repeating domains, we investigated the mode of evolution of the filamin domain in some detail. Among the domains that are common in long repeat proteins, sushi and spectrin domains evolve primarily through cassette tandem duplications while scavenger and immunoglobulin repeats appear to evolve through clustered tandem duplications. Additionally, immunoglobulin and filamin repeats exhibit a unique pattern where every other domain shows high sequence similarity. This pattern may be the result of tandem duplications, serve to avert aggregation between adjacent domains or it is the result of functional constraints. In filamin, our studies confirm the presence of interspersed integrin binding domains in vertebrates, while invertebrates exhibit more varied patterns, including more clustered integrin binding domains. The most notable case is leech filamin, which contains a 20 repeat expansion and exhibits unique dimerization topology. Clearly, invertebrate filamins are varied and contain examples of similar adjacent integrin-binding domains. Given that invertebrate integrin shows more similarity to the weaker filamin binder, integrin β3, it is possible that the distance between integrin-binding domains is not as crucial for invertebrate filamins as for vertebrates. PMID:22414427

  12. Efficient algorithms for polyploid haplotype phasing.

    PubMed

    He, Dan; Saha, Subrata; Finkers, Richard; Parida, Laxmi

    2018-05-09

    Inference of haplotypes, or the sequence of alleles along the same chromosomes, is a fundamental problem in genetics and is a key component for many analyses including admixture mapping, identifying regions of identity by descent and imputation. Haplotype phasing based on sequencing reads has attracted lots of attentions. Diploid haplotype phasing where the two haplotypes are complimentary have been studied extensively. In this work, we focused on Polyploid haplotype phasing where we aim to phase more than two haplotypes at the same time from sequencing data. The problem is much more complicated as the search space becomes much larger and the haplotypes do not need to be complimentary any more. We proposed two algorithms, (1) Poly-Harsh, a Gibbs Sampling based algorithm which alternatively samples haplotypes and the read assignments to minimize the mismatches between the reads and the phased haplotypes, (2) An efficient algorithm to concatenate haplotype blocks into contiguous haplotypes. Our experiments showed that our method is able to improve the quality of the phased haplotypes over the state-of-the-art methods. To our knowledge, our algorithm for haplotype blocks concatenation is the first algorithm that leverages the shared information across multiple individuals to construct contiguous haplotypes. Our experiments showed that it is both efficient and effective.

  13. Inbreeding drives maize centromere evolution

    PubMed Central

    Schneider, Kevin L.; Xie, Zidian; Wolfgruber, Thomas K.; Presting, Gernot G.

    2016-01-01

    Functional centromeres, the chromosomal sites of spindle attachment during cell division, are marked epigenetically by the centromere-specific histone H3 variant cenH3 and typically contain long stretches of centromere-specific tandem DNA repeats (∼1.8 Mb in maize). In 23 inbreds of domesticated maize chosen to represent the genetic diversity of maize germplasm, partial or nearly complete loss of the tandem DNA repeat CentC precedes 57 independent cenH3 relocation events that result in neocentromere formation. Chromosomal regions with newly acquired cenH3 are colonized by the centromere-specific retrotransposon CR2 at a rate that would result in centromere-sized CR2 clusters in 20,000–95,000 y. Three lines of evidence indicate that CentC loss is linked to inbreeding, including (i) CEN10 of temperate lineages, presumed to have experienced a genetic bottleneck, contain less CentC than their tropical relatives; (ii) strong selection for centromere-linked genes in domesticated maize reduced diversity at seven of the ten maize centromeres to only one or two postdomestication haplotypes; and (iii) the centromere with the largest number of haplotypes in domesticated maize (CEN7) has the highest CentC levels in nearly all domesticated lines. Rare recombinations introduced one (CEN2) or more (CEN5) alternate CEN haplotypes while retaining a single haplotype at domestication loci linked to these centromeres. Taken together, this evidence strongly suggests that inbreeding, favored by postdomestication selection for centromere-linked genes affecting key domestication or agricultural traits, drives replacement of the tandem centromere repeats in maize and other crop plants. Similar forces may act during speciation in natural systems. PMID:26858403

  14. Inbreeding drives maize centromere evolution.

    PubMed

    Schneider, Kevin L; Xie, Zidian; Wolfgruber, Thomas K; Presting, Gernot G

    2016-02-23

    Functional centromeres, the chromosomal sites of spindle attachment during cell division, are marked epigenetically by the centromere-specific histone H3 variant cenH3 and typically contain long stretches of centromere-specific tandem DNA repeats (∼1.8 Mb in maize). In 23 inbreds of domesticated maize chosen to represent the genetic diversity of maize germplasm, partial or nearly complete loss of the tandem DNA repeat CentC precedes 57 independent cenH3 relocation events that result in neocentromere formation. Chromosomal regions with newly acquired cenH3 are colonized by the centromere-specific retrotransposon CR2 at a rate that would result in centromere-sized CR2 clusters in 20,000-95,000 y. Three lines of evidence indicate that CentC loss is linked to inbreeding, including (i) CEN10 of temperate lineages, presumed to have experienced a genetic bottleneck, contain less CentC than their tropical relatives; (ii) strong selection for centromere-linked genes in domesticated maize reduced diversity at seven of the ten maize centromeres to only one or two postdomestication haplotypes; and (iii) the centromere with the largest number of haplotypes in domesticated maize (CEN7) has the highest CentC levels in nearly all domesticated lines. Rare recombinations introduced one (CEN2) or more (CEN5) alternate CEN haplotypes while retaining a single haplotype at domestication loci linked to these centromeres. Taken together, this evidence strongly suggests that inbreeding, favored by postdomestication selection for centromere-linked genes affecting key domestication or agricultural traits, drives replacement of the tandem centromere repeats in maize and other crop plants. Similar forces may act during speciation in natural systems.

  15. Determination of Sources of Escherichia coli on Beef by Multiple-Locus Variable-Number Tandem Repeat Analysis.

    PubMed

    Yang, Xianqin; Tran, Frances; Youssef, Mohamed K; Gill, Colin O

    2015-07-01

    The possible origin of Escherichia coli found on cuts and trimmings in the breaking facility of a beef packing plant was examined using multiple-locus variable-number tandem repeat analysis. Coliforms and E. coli were enumerated in samples obtained from 160 carcasses that would enter the breaking facility when work commenced and after each of the three production breaks throughout the day, from the conveyor belt before work and after each break, and from cuts and trimmings when work commenced and after each break. Most samples yielded no E. coli, irrespective of the surface types. E. coli was recovered from 7 (<5%) carcasses, at numbers mostly ≤1.0 log CFU/160,000 cm(2). The log total numbers of E. coli recovered from the conveyor belt, cuts, and trimmings were mostly between 1 and 2 log CFU/80,000 cm(2). A total of 554 E. coli isolates were recovered. Multiple-locus variable-number tandem repeat analysis of 327 selected isolates identified 80 distinct genotypes, with 37 (46%) each containing one isolate. However, 28% of the isolates were of genotypes that were recovered from more than one sampling day. Of the 80 genotypes, 65 and 2% were found in one or all four sampling periods throughout the day. However, they represented 23 and 14% of the isolates, respectively. Of the genotypes identified for each surface type, at least one contained ≥9 isolates. No unique genotypes were associated with carcasses, but 10, 17, and 19 were uniquely associated with cuts, trimmings, and the belt, respectively. Of the isolates recovered from cuts, 49, 3, and 19% were of genotypes that were found among isolates recovered from the belt, carcasses, or both the belt and carcasses, respectively. A similar composition was found for isolates recovered from trimmings. These findings show that the E. coli found on cuts and trimmings at this beef packing plant mainly originated from the conveyor belt and that small number of E. coli strains survived the daily cleaning and sanitation

  16. Tandem repeats of the 5' non-transcribed spacer of Tetrahymena rDNA function as high copy number autonomous replicons in the macronucleus but do not prevent rRNA gene dosage regulation.

    PubMed Central

    Pan, W J; Blackburn, E H

    1995-01-01

    The rRNA genes in the somatic macronucleus of Tetrahymena thermophila are normally on 21 kb linear palindromic molecules (rDNA). We examined the effect on rRNA gene dosage of transforming T.thermophila macronuclei with plasmid constructs containing a pair of tandemly repeated rDNA replication origin regions unlinked to the rRNA gene. A significant proportion of the plasmid sequences were maintained as high copy circular molecules, eventually consisting solely of tandem arrays of origin regions. As reported previously for cells transformed by a construct in which the same tandem rDNA origins were linked to the rRNA gene [Yu, G.-L. and Blackburn, E. H. (1990) Mol. Cell. Biol., 10, 2070-2080], origin sequences recombined to form linear molecules bearing several tandem repeats of the origin region, as well as rRNA genes. The total number of rDNA origin sequences eventually exceeded rRNA gene copies by approximately 20- to 40-fold and the number of circular replicons carrying only rDNA origin sequences exceeded rRNA gene copies by 2- to 3-fold. However, the rRNA gene dosage was unchanged. Hence, simply monitoring the total number of rDNA origin regions is not sufficient to regulate rRNA gene copy number. Images PMID:7784211

  17. Variation analysis and gene annotation of eight MHC haplotypes: The MHC Haplotype Project

    PubMed Central

    Horton, Roger; Gibson, Richard; Coggill, Penny; Miretti, Marcos; Allcock, Richard J.; Almeida, Jeff; Forbes, Simon; Gilbert, James G. R.; Halls, Karen; Harrow, Jennifer L.; Hart, Elizabeth; Howe, Kevin; Jackson, David K.; Palmer, Sophie; Roberts, Anne N.; Sims, Sarah; Stewart, C. Andrew; Traherne, James A.; Trevanion, Steve; Wilming, Laurens; Rogers, Jane; de Jong, Pieter J.; Elliott, John F.; Sawcer, Stephen; Todd, John A.; Trowsdale, John

    2008-01-01

    The human major histocompatibility complex (MHC) is contained within about 4 Mb on the short arm of chromosome 6 and is recognised as the most variable region in the human genome. The primary aim of the MHC Haplotype Project was to provide a comprehensively annotated reference sequence of a single, human leukocyte antigen-homozygous MHC haplotype and to use it as a basis against which variations could be assessed from seven other similarly homozygous cell lines, representative of the most common MHC haplotypes in the European population. Comparison of the haplotype sequences, including four haplotypes not previously analysed, resulted in the identification of >44,000 variations, both substitutions and indels (insertions and deletions), which have been submitted to the dbSNP database. The gene annotation uncovered haplotype-specific differences and confirmed the presence of more than 300 loci, including over 160 protein-coding genes. Combined analysis of the variation and annotation datasets revealed 122 gene loci with coding substitutions of which 97 were non-synonymous. The haplotype (A3-B7-DR15; PGF cell line) designated as the new MHC reference sequence, has been incorporated into the human genome assembly (NCBI35 and subsequent builds), and constitutes the largest single-haplotype sequence of the human genome to date. The extensive variation and annotation data derived from the analysis of seven further haplotypes have been made publicly available and provide a framework and resource for future association studies of all MHC-associated diseases and transplant medicine. PMID:18193213

  18. Application of multilocus variable number tandem repeat analysis to monitor Verocytotoxin-producing Escherichia coli O157 phage type 8 in England and Wales: emergence of a profile associated with a national outbreak.

    PubMed

    Perry, N; Cheasty, T; Dallman, T; Launders, N; Willshaw, G

    2013-10-01

    Evaluation of multilocus variable number tandem repeat analysis (MLVA) to subtype all isolates of Vero cytotoxin-producing Escherichia coli O157 phage type 8 in England and Wales. Over a 13 month period from December 2010, 483 isolates of VTEC O157 PT8 were tested by MLVA; 39% were received in the first 4 months of 2011, when infections are generally low. One profile, or single locus variants of it, was present in 249 (52%) isolates but was not common previously. These cases represented a national increase in PT8, associated epidemiologically with soil-contaminated vegetables. Most of the 177 other MLVA profiles were unique to a single isolate. Profiles shared by >1 isolate included cases from two small community, food-borne outbreaks and 11 households. Several shared profiles were found among 23 isolates without known links. Apart from one group, isolates linked to travel abroad had very diverse profiles. Multilocus variable number tandem repeat analysis discriminated apparent sporadic isolates of the same PT and assisted in detection of cases in an emerging national outbreak. Multilocus variable number tandem repeat analysis is an epidemiologically valid complement to surveillance and applicable as a rapid, practical test for large numbers of isolates. © 2013 The Society for Applied Microbiology.

  19. Concerted evolution of the tandemly repeated genes encoding primate U2 small nuclear RNA (the RNU2 locus) does not prevent rapid diversification of the (CT){sub n} {center_dot} (GA){sub n} microsatellite embedded within the U2 repeat unit

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Liao, D.; Weiner, A.M.

    1995-12-10

    The RNU2 locus encoding human U2 small nuclear RNA (snRNA) is organized as a nearly perfect tandem array containing 5 to 22 copies of a 5.8-kb repeat unit. Just downstream of the U2 snRNA gene in each 5.8-kb repeat unit lies a large (CT){sub n}{center_dot}(GA){sub n} dinucleotide repeat (n {approx} 70). This form of genomic organization, in which one repeat is embedded within another, provides an unusual opportunity to study the balance of forces maintaining the homogeneity of both kinds of repeats. Using a combination of field inversion gel electrophoresis and polymerase chain reaction, we have been able to studymore » the CT microsatellites within individual U2 tandem arrays. We find that the CT microsatellites within an RNU2 allele exhibit significant length polymorphism, despite the remarkable homogeneity of the surrounding U2 repeat units. Length polymorphism is due primarily to loss or gain of CT dinucleotide repeats, but other types of deletions, insertions, and substitutions are also frequent. Polymorphism is greatly reduced in regions where pure (CT){sub n} tracts are interrupted by occasional G residues, suggesting that irregularities stabilize both the length and the sequence of the dinucleotide repeat. We further show that the RNU2 loci of other catarrhine primates (gorilla, chimpanzee, ogangutan, and baboon) contain orthologous CT microsatellites; these also exhibit length polymorphism, but are highly divergent from each other. Thus, although the CT microsatellite is evolving far more rapidly than the rest of the U2 repeat unit, it has persisted through multiple speciation events spanning >35 Myr. The persistence of the CT microsatellite, despite polymorphism and rapid evolution, suggests that it might play a functional role in concerted evolution of the RNU2 loci, perhaps as an initiation site for recombination and/or gene conversion. 70 refs., 5 figs.« less

  20. Determination of haplotypes at structurally complex regions using emulsion haplotype fusion PCR.

    PubMed

    Tyson, Jess; Armour, John A L

    2012-12-11

    Genotyping and massively-parallel sequencing projects result in a vast amount of diploid data that is only rarely resolved into its constituent haplotypes. It is nevertheless this phased information that is transmitted from one generation to the next and is most directly associated with biological function and the genetic causes of biological effects. Despite progress made in genome-wide sequencing and phasing algorithms and methods, problems assembling (and reconstructing linear haplotypes in) regions of repetitive DNA and structural variation remain. These dynamic and structurally complex regions are often poorly understood from a sequence point of view. Regions such as these that are highly similar in their sequence tend to be collapsed onto the genome assembly. This is turn means downstream determination of the true sequence haplotype in these regions poses a particular challenge. For structurally complex regions, a more focussed approach to assembling haplotypes may be required. In order to investigate reconstruction of spatial information at structurally complex regions, we have used an emulsion haplotype fusion PCR approach to reproducibly link sequences of up to 1kb in length to allow phasing of multiple variants from neighbouring loci, using allele-specific PCR and sequencing to detect the phase. By using emulsion systems linking flanking regions to amplicons within the CNV, this led to the reconstruction of a 59kb haplotype across the DEFA1A3 CNV in HapMap individuals. This study has demonstrated a novel use for emulsion haplotype fusion PCR in addressing the issue of reconstructing structural haplotypes at multiallelic copy variable regions, using the DEFA1A3 locus as an example.

  1. Determination of haplotypes at structurally complex regions using emulsion haplotype fusion PCR

    PubMed Central

    2012-01-01

    Background Genotyping and massively-parallel sequencing projects result in a vast amount of diploid data that is only rarely resolved into its constituent haplotypes. It is nevertheless this phased information that is transmitted from one generation to the next and is most directly associated with biological function and the genetic causes of biological effects. Despite progress made in genome-wide sequencing and phasing algorithms and methods, problems assembling (and reconstructing linear haplotypes in) regions of repetitive DNA and structural variation remain. These dynamic and structurally complex regions are often poorly understood from a sequence point of view. Regions such as these that are highly similar in their sequence tend to be collapsed onto the genome assembly. This is turn means downstream determination of the true sequence haplotype in these regions poses a particular challenge. For structurally complex regions, a more focussed approach to assembling haplotypes may be required. Results In order to investigate reconstruction of spatial information at structurally complex regions, we have used an emulsion haplotype fusion PCR approach to reproducibly link sequences of up to 1kb in length to allow phasing of multiple variants from neighbouring loci, using allele-specific PCR and sequencing to detect the phase. By using emulsion systems linking flanking regions to amplicons within the CNV, this led to the reconstruction of a 59kb haplotype across the DEFA1A3 CNV in HapMap individuals. Conclusion This study has demonstrated a novel use for emulsion haplotype fusion PCR in addressing the issue of reconstructing structural haplotypes at multiallelic copy variable regions, using the DEFA1A3 locus as an example. PMID:23231411

  2. Slipped-strand mispairing at noncontiguous repeats in Poecilia reticulata: a model for minisatellite birth.

    PubMed Central

    Taylor, J S; Breden, F

    2000-01-01

    The standard slipped-strand mispairing (SSM) model for the formation of variable number tandem repeats (VNTRs) proposes that a few tandem repeats, produced by chance mutations, provide the "raw material" for VNTR expansion. However, this model is unlikely to explain the formation of VNTRs with long motifs (e.g., minisatellites), because the likelihood of a tandem repeat forming by chance decreases rapidly as the length of the repeat motif increases. Phylogenetic reconstruction of the birth of a mitochondrial (mt) DNA minisatellite in guppies suggests that VNTRs with long motifs can form as a consequence of SSM at noncontiguous repeats. VNTRs formed in this manner have motifs longer than the noncontiguous repeat originally formed by chance and are flanked by one unit of the original, noncontiguous repeat. SSM at noncontiguous repeats can therefore explain the birth of VNTRs with long motifs and the "imperfect" or "short direct" repeats frequently observed adjacent to both mtDNA and nuclear VNTRs. PMID:10880490

  3. A global analysis of Y-chromosomal haplotype diversity for 23 STR loci

    PubMed Central

    Purps, Josephine; Siegert, Sabine; Willuweit, Sascha; Nagy, Marion; Alves, Cíntia; Salazar, Renato; Angustia, Sheila M.T.; Santos, Lorna H.; Anslinger, Katja; Bayer, Birgit; Ayub, Qasim; Wei, Wei; Xue, Yali; Tyler-Smith, Chris; Bafalluy, Miriam Baeta; Martínez-Jarreta, Begoña; Egyed, Balazs; Balitzki, Beate; Tschumi, Sibylle; Ballard, David; Court, Denise Syndercombe; Barrantes, Xinia; Bäßler, Gerhard; Wiest, Tina; Berger, Burkhard; Niederstätter, Harald; Parson, Walther; Davis, Carey; Budowle, Bruce; Burri, Helen; Borer, Urs; Koller, Christoph; Carvalho, Elizeu F.; Domingues, Patricia M.; Chamoun, Wafaa Takash; Coble, Michael D.; Hill, Carolyn R.; Corach, Daniel; Caputo, Mariela; D’Amato, Maria E.; Davison, Sean; Decorte, Ronny; Larmuseau, Maarten H.D.; Ottoni, Claudio; Rickards, Olga; Lu, Di; Jiang, Chengtao; Dobosz, Tadeusz; Jonkisz, Anna; Frank, William E.; Furac, Ivana; Gehrig, Christian; Castella, Vincent; Grskovic, Branka; Haas, Cordula; Wobst, Jana; Hadzic, Gavrilo; Drobnic, Katja; Honda, Katsuya; Hou, Yiping; Zhou, Di; Li, Yan; Hu, Shengping; Chen, Shenglan; Immel, Uta-Dorothee; Lessig, Rüdiger; Jakovski, Zlatko; Ilievska, Tanja; Klann, Anja E.; García, Cristina Cano; de Knijff, Peter; Kraaijenbrink, Thirsa; Kondili, Aikaterini; Miniati, Penelope; Vouropoulou, Maria; Kovacevic, Lejla; Marjanovic, Damir; Lindner, Iris; Mansour, Issam; Al-Azem, Mouayyad; Andari, Ansar El; Marino, Miguel; Furfuro, Sandra; Locarno, Laura; Martín, Pablo; Luque, Gracia M.; Alonso, Antonio; Miranda, Luís Souto; Moreira, Helena; Mizuno, Natsuko; Iwashima, Yasuki; Neto, Rodrigo S. Moura; Nogueira, Tatiana L.S.; Silva, Rosane; Nastainczyk-Wulf, Marina; Edelmann, Jeanett; Kohl, Michael; Nie, Shengjie; Wang, Xianping; Cheng, Baowen; Núñez, Carolina; Pancorbo, Marian Martínez de; Olofsson, Jill K.; Morling, Niels; Onofri, Valerio; Tagliabracci, Adriano; Pamjav, Horolma; Volgyi, Antonia; Barany, Gusztav; Pawlowski, Ryszard; Maciejewska, Agnieszka; Pelotti, Susi; Pepinski, Witold; Abreu-Glowacka, Monica; Phillips, Christopher; Cárdenas, Jorge; Rey-Gonzalez, Danel; Salas, Antonio; Brisighelli, Francesca; Capelli, Cristian; Toscanini, Ulises; Piccinini, Andrea; Piglionica, Marilidia; Baldassarra, Stefania L.; Ploski, Rafal; Konarzewska, Magdalena; Jastrzebska, Emila; Robino, Carlo; Sajantila, Antti; Palo, Jukka U.; Guevara, Evelyn; Salvador, Jazelyn; Ungria, Maria Corazon De; Rodriguez, Jae Joseph Russell; Schmidt, Ulrike; Schlauderer, Nicola; Saukko, Pekka; Schneider, Peter M.; Sirker, Miriam; Shin, Kyoung-Jin; Oh, Yu Na; Skitsa, Iulia; Ampati, Alexandra; Smith, Tobi-Gail; Calvit, Lina Solis de; Stenzl, Vlastimil; Capal, Thomas; Tillmar, Andreas; Nilsson, Helena; Turrina, Stefania; De Leo, Domenico; Verzeletti, Andrea; Cortellini, Venusia; Wetton, Jon H.; Gwynne, Gareth M.; Jobling, Mark A.; Whittle, Martin R.; Sumita, Denilce R.; Wolańska-Nowak, Paulina; Yong, Rita Y.Y.; Krawczak, Michael; Nothnagel, Michael; Roewer, Lutz

    2014-01-01

    In a worldwide collaborative effort, 19,630 Y-chromosomes were sampled from 129 different populations in 51 countries. These chromosomes were typed for 23 short-tandem repeat (STR) loci (DYS19, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS385ab, DYS437, DYS438, DYS439, DYS448, DYS456, DYS458, DYS635, GATAH4, DYS481, DYS533, DYS549, DYS570, DYS576, and DYS643) and using the PowerPlex Y23 System (PPY23, Promega Corporation, Madison, WI). Locus-specific allelic spectra of these markers were determined and a consistently high level of allelic diversity was observed. A considerable number of null, duplicate and off-ladder alleles were revealed. Standard single-locus and haplotype-based parameters were calculated and compared between subsets of Y-STR markers established for forensic casework. The PPY23 marker set provides substantially stronger discriminatory power than other available kits but at the same time reveals the same general patterns of population structure as other marker sets. A strong correlation was observed between the number of Y-STRs included in a marker set and some of the forensic parameters under study. Interestingly a weak but consistent trend toward smaller genetic distances resulting from larger numbers of markers became apparent. PMID:24854874

  4. A global analysis of Y-chromosomal haplotype diversity for 23 STR loci.

    PubMed

    Purps, Josephine; Siegert, Sabine; Willuweit, Sascha; Nagy, Marion; Alves, Cíntia; Salazar, Renato; Angustia, Sheila M T; Santos, Lorna H; Anslinger, Katja; Bayer, Birgit; Ayub, Qasim; Wei, Wei; Xue, Yali; Tyler-Smith, Chris; Bafalluy, Miriam Baeta; Martínez-Jarreta, Begoña; Egyed, Balazs; Balitzki, Beate; Tschumi, Sibylle; Ballard, David; Court, Denise Syndercombe; Barrantes, Xinia; Bäßler, Gerhard; Wiest, Tina; Berger, Burkhard; Niederstätter, Harald; Parson, Walther; Davis, Carey; Budowle, Bruce; Burri, Helen; Borer, Urs; Koller, Christoph; Carvalho, Elizeu F; Domingues, Patricia M; Chamoun, Wafaa Takash; Coble, Michael D; Hill, Carolyn R; Corach, Daniel; Caputo, Mariela; D'Amato, Maria E; Davison, Sean; Decorte, Ronny; Larmuseau, Maarten H D; Ottoni, Claudio; Rickards, Olga; Lu, Di; Jiang, Chengtao; Dobosz, Tadeusz; Jonkisz, Anna; Frank, William E; Furac, Ivana; Gehrig, Christian; Castella, Vincent; Grskovic, Branka; Haas, Cordula; Wobst, Jana; Hadzic, Gavrilo; Drobnic, Katja; Honda, Katsuya; Hou, Yiping; Zhou, Di; Li, Yan; Hu, Shengping; Chen, Shenglan; Immel, Uta-Dorothee; Lessig, Rüdiger; Jakovski, Zlatko; Ilievska, Tanja; Klann, Anja E; García, Cristina Cano; de Knijff, Peter; Kraaijenbrink, Thirsa; Kondili, Aikaterini; Miniati, Penelope; Vouropoulou, Maria; Kovacevic, Lejla; Marjanovic, Damir; Lindner, Iris; Mansour, Issam; Al-Azem, Mouayyad; Andari, Ansar El; Marino, Miguel; Furfuro, Sandra; Locarno, Laura; Martín, Pablo; Luque, Gracia M; Alonso, Antonio; Miranda, Luís Souto; Moreira, Helena; Mizuno, Natsuko; Iwashima, Yasuki; Neto, Rodrigo S Moura; Nogueira, Tatiana L S; Silva, Rosane; Nastainczyk-Wulf, Marina; Edelmann, Jeanett; Kohl, Michael; Nie, Shengjie; Wang, Xianping; Cheng, Baowen; Núñez, Carolina; Pancorbo, Marian Martínez de; Olofsson, Jill K; Morling, Niels; Onofri, Valerio; Tagliabracci, Adriano; Pamjav, Horolma; Volgyi, Antonia; Barany, Gusztav; Pawlowski, Ryszard; Maciejewska, Agnieszka; Pelotti, Susi; Pepinski, Witold; Abreu-Glowacka, Monica; Phillips, Christopher; Cárdenas, Jorge; Rey-Gonzalez, Danel; Salas, Antonio; Brisighelli, Francesca; Capelli, Cristian; Toscanini, Ulises; Piccinini, Andrea; Piglionica, Marilidia; Baldassarra, Stefania L; Ploski, Rafal; Konarzewska, Magdalena; Jastrzebska, Emila; Robino, Carlo; Sajantila, Antti; Palo, Jukka U; Guevara, Evelyn; Salvador, Jazelyn; Ungria, Maria Corazon De; Rodriguez, Jae Joseph Russell; Schmidt, Ulrike; Schlauderer, Nicola; Saukko, Pekka; Schneider, Peter M; Sirker, Miriam; Shin, Kyoung-Jin; Oh, Yu Na; Skitsa, Iulia; Ampati, Alexandra; Smith, Tobi-Gail; Calvit, Lina Solis de; Stenzl, Vlastimil; Capal, Thomas; Tillmar, Andreas; Nilsson, Helena; Turrina, Stefania; De Leo, Domenico; Verzeletti, Andrea; Cortellini, Venusia; Wetton, Jon H; Gwynne, Gareth M; Jobling, Mark A; Whittle, Martin R; Sumita, Denilce R; Wolańska-Nowak, Paulina; Yong, Rita Y Y; Krawczak, Michael; Nothnagel, Michael; Roewer, Lutz

    2014-09-01

    In a worldwide collaborative effort, 19,630 Y-chromosomes were sampled from 129 different populations in 51 countries. These chromosomes were typed for 23 short-tandem repeat (STR) loci (DYS19, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS385ab, DYS437, DYS438, DYS439, DYS448, DYS456, DYS458, DYS635, GATAH4, DYS481, DYS533, DYS549, DYS570, DYS576, and DYS643) and using the PowerPlex Y23 System (PPY23, Promega Corporation, Madison, WI). Locus-specific allelic spectra of these markers were determined and a consistently high level of allelic diversity was observed. A considerable number of null, duplicate and off-ladder alleles were revealed. Standard single-locus and haplotype-based parameters were calculated and compared between subsets of Y-STR markers established for forensic casework. The PPY23 marker set provides substantially stronger discriminatory power than other available kits but at the same time reveals the same general patterns of population structure as other marker sets. A strong correlation was observed between the number of Y-STRs included in a marker set and some of the forensic parameters under study. Interestingly a weak but consistent trend toward smaller genetic distances resulting from larger numbers of markers became apparent. Copyright © 2014 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.

  5. Genetic diversity of Y-short tandem repeats in Chinese native cattle breeds.

    PubMed

    Xin, Y P; Zan, L S; Liu, Y F; Tian, W Q; Wang, H B; Cheng, G; Li, A N; Yang, W C

    2014-11-14

    The aim of this study is to use Y-chromosome gene polymorphism method to investigate regional differences in genetic variation and population evolution history of the Chinese native cattle breeds. Six Y-chromosome short tandem repeat (Y-STR) loci (UMN0929, UMN0108, UMN0920, INRA124, UMN2404, and UMN0103) were analyzed using 1016 healthy and heterogenetic males and 90 females of 9 native cattle breeds (Qinchuan, Jinnan, Zaosheng, Luxi, Nanyang, Jiaxian, Dabieshan, Yanbian, and Menggu) in China. Allele frequency and gene diversity were calculated for the various populations. The results indicated that Y-STRs in the 6 loci have polymorphisms and genetic diversity in Chinese cattle populations. The genetic diversity analysis revealed that the Chinese cattle populations have a close genetic relationship. The analysis of INRA124, UMN2404, and UMN0103 loci revealed the original history of Chinese cattle because of which cattle belonging to Bos taurus or Bos indicus could be determined. Interestingly, a declining zebu introgression was displayed from South to North and from East to West in the Chinese geographical distribution, which implied that cattle population from various regions of China had been subjected to somewhat different evolutionary history. This conclusion supported other evidences such as earlier archaeological, historical research, and blood protein polymorphism analysis.

  6. Evaluation of advanced multiplex short tandem repeat systems in pairwise kinship analysis.

    PubMed

    Tamura, Tomonori; Osawa, Motoki; Ochiai, Eriko; Suzuki, Takanori; Nakamura, Takashi

    2015-09-01

    The AmpFLSTR Identifiler Kit, comprising 15 autosomal short tandem repeat (STR) loci, is commonly employed in forensic practice for calculating match probabilities and parentage testing. The conventional system exhibits insufficient estimation for kinship analysis such as sibship testing because of shortness of examined loci. This study evaluated the power of the PowerPlex Fusion System, GlobalFiler Kit, and PowerPlex 21 System, which comprise more than 20 autosomal STR loci, to estimate pairwise blood relatedness (i.e., parent-child, full siblings, second-degree relatives, and first cousins). The genotypes of all 24 STR loci in 10,000 putative pedigrees were constructed by simulation. The likelihood ratio for each locus was calculated from joint probabilities for relatives and non-relatives. The combined likelihood ratio was calculated according to the product rule. The addition of STR loci improved separation between relatives and non-relatives. However, these systems were less effectively extended to the inference for first cousins. In conclusion, these advanced systems will be useful in forensic personal identification, especially in the evaluation of full siblings and second-degree relatives. Moreover, the additional loci may give rise to two major issues of more frequent mutational events and several pairs of linked loci on the same chromosome. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  7. Haplotype-Based Genotyping in Polyploids.

    PubMed

    Clevenger, Josh P; Korani, Walid; Ozias-Akins, Peggy; Jackson, Scott

    2018-01-01

    Accurate identification of polymorphisms from sequence data is crucial to unlocking the potential of high throughput sequencing for genomics. Single nucleotide polymorphisms (SNPs) are difficult to accurately identify in polyploid crops due to the duplicative nature of polyploid genomes leading to low confidence in the true alignment of short reads. Implementing a haplotype-based method in contrasting subgenome-specific sequences leads to higher accuracy of SNP identification in polyploids. To test this method, a large-scale 48K SNP array (Axiom Arachis2) was developed for Arachis hypogaea (peanut), an allotetraploid, in which 1,674 haplotype-based SNPs were included. Results of the array show that 74% of the haplotype-based SNP markers could be validated, which is considerably higher than previous methods used for peanut. The haplotype method has been implemented in a standalone program, HAPLOSWEEP, which takes as input bam files and a vcf file and identifies haplotype-based markers. Haplotype discovery can be made within single reads or span paired reads, and can leverage long read technology by targeting any length of haplotype. Haplotype-based genotyping is applicable in all allopolyploid genomes and provides confidence in marker identification and in silico-based genotyping for polyploid genomics.

  8. Multiple-locus variable-number tandem repeat analysis for strain discrimination of non-O157 Shiga toxin-producing Escherichia coli.

    PubMed

    Timmons, Chris; Trees, Eija; Ribot, Efrain M; Gerner-Smidt, Peter; LaFon, Patti; Im, Sung; Ma, Li Maria

    2016-06-01

    Non-O157 Shiga toxin-producing Escherichia coli (STEC) are foodborne pathogens of growing concern worldwide that have been associated with several recent multistate and multinational outbreaks of foodborne illness. Rapid and sensitive molecular-based bacterial strain discrimination methods are critical for timely outbreak identification and contaminated food source traceback. One such method, multiple-locus variable-number tandem repeat analysis (MLVA), is being used with increasing frequency in foodborne illness outbreak investigations to augment the current gold standard bacterial subtyping technique, pulsed-field gel electrophoresis (PFGE). The objective of this study was to develop a MLVA assay for intra- and inter-serogroup discrimination of six major non-O157 STEC serogroups-O26, O111, O103, O121, O45, and O145-and perform a preliminary internal validation of the method on a limited number of clinical isolates. The resultant MLVA scheme consists of ten variable number tandem repeat (VNTR) loci amplified in three multiplex PCR reactions. Sixty-five unique MLVA types were obtained among 84 clinical non-O157 STEC strains comprised of geographically diverse sporadic and outbreak related isolates. Compared to PFGE, the developed MLVA scheme allowed similar discrimination among serogroups O26, O111, O103, and O121 but not among O145 and O45. To more fully compare the discriminatory power of this preliminary MLVA method to PFGE and to determine its epidemiological congruence, a thorough internal and external validation needs to be performed on a carefully selected large panel of strains, including multiple isolates from single outbreaks. Copyright © 2016. Published by Elsevier B.V.

  9. NIST mixed stain study 3: signal intensity balance in commercial short tandem repeat multiplexes.

    PubMed

    Duewer, David L; Kline, Margaret C; Redman, Janette W; Butler, John M

    2004-12-01

    Short-tandem repeat (STR) allelic intensities were collected from more than 60 forensic laboratories for a suite of seven samples as part of the National Institute of Standards and Technology-coordinated 2001 Mixed Stain Study 3 (MSS3). These interlaboratory challenge data illuminate the relative importance of intrinsic and user-determined factors affecting the locus-to-locus balance of signal intensities for currently used STR multiplexes. To varying degrees, seven of the eight commercially produced multiplexes used by MSS3 participants displayed very similar patterns of intensity differences among the different loci probed by the multiplexes for all samples, in the hands of multiple analysts, with a variety of supplies and instruments. These systematic differences reflect intrinsic properties of the individual multiplexes, not user-controllable measurement practices. To the extent that quality systems specify minimum and maximum absolute intensities for data acceptability and data interpretation schema require among-locus balance, these intrinsic intensity differences may decrease the utility of multiplex results and surely increase the cost of analysis.

  10. Lymphatic filarial species differentiation using evolutionarily modified tandem repeats: generation of new genetic markers.

    PubMed

    Sakthidevi, Moorthy; Murugan, Vadivel; Hoti, Sugeerappa Laxmanappa; Kaliraj, Perumal

    2010-05-01

    Polymerase chain reaction based methods are promising tools for the monitoring and evaluation of the Global Program for the Elimination of Lymphatic Filariasis. The currently available PCR methods do not differentiate the DNA of Wuchereria bancrofti or Brugia malayi by a single PCR and hence are cumbersome. Therefore, we designed a single step PCR strategy for differentiating Bancroftian infection from Brugian infection based on a newly identified gene from the W. bancrofti genome, abundant larval transcript-2 (alt-2), which is abundantly expressed. The difference in PCR product sizes generated from the presence or absence of evolutionarily altered tandem repeats in alt-2 intron-3 differentiated W. bancrofti from B. malayi. The analysis was performed on the genomic DNA of microfilariae from a number of patient blood samples or microfilariae positive slides from different Indian geographical regions. The assay gave consistent results, differentiating the two filarial parasite species accurately. This alt-2 intron-3 based PCR assay can be a potential tool for the diagnosis and differentiation of co-infections by lymphatic filarial parasites. Copyright (c) 2010 Elsevier B.V. All rights reserved.

  11. GENETIC DIVERSITY OF TYPHA LATIFOLIA (TYPHACEAE) AND THE IMPACT OF POLLUTANTS EXAMINED WITH TANDEM-REPETITIVE DNA PROBES

    EPA Science Inventory

    Genetic diversity at variable-number-tandem-repeat (VNTR) loci was examined in the common cattail, Typha latifolia (Typhaceae), using three synthetic DNA probes composed of tandemly repeated "core" sequences (GACA, GATA, and GCAC). The principal objectives of this investigation w...

  12. Associations between mutations and a VNTR in the human phenylalanine hydroxylase gene

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Goltsov, A.A.; Eisensmith, R.C.; Woo, S.L.C.

    1992-09-01

    The HindIII RFLP in the human phenylalanine hydroxylase (PAH) gene is caused by the presence of an AT-rich (70%) minisatellite region. This region contains various multiples of 30-bp tandem repeats and is located 3 kb downstream of the final exon of the gene. PCR-mediated amplification of this region from haplotyped PAH chromosomes indicates that the previously reported 4.0-kb HindIII allele contains three of these repeats, while the 4.4-kb HindIII allele contains 12 of these repeats. The 4.2-kb HindIII fragment can contain six, seven, eight, or nine copies of this repeat. These variations permit more detailed analysis of mutant haplotypes 1,more » 5, 6, and, possibly, others. Kindred analysis in phenylketonuria families demonstrates Mendelian segregation of these VNTR alleles, as well as associations between theses alleles and certain PAH mutations. The R261Q mutation, associated with haplotype 1, is associated almost exclusively with an allele containing eight repeats; the R408W mutation, when occurring on a haplotype 1 background, may also be associated with the eight-repeat VNTR allele. Other PAH mutations associated with haplotype 1, R252W and P281L, do not appear to segregate with specific VNTR alleles. The IVS-10 mutation, when associated with haplotype 6, is associated exclusively with an allele containing seven repeats. The combined use of this VNTR system and the existing RFLP haplotype system will increase the performance of prenatal diagnostic tests based on haplotype analysis. In addition, this VNTR may prove useful in studies concerning the origins and distributions of PAH mutations in different human populations. 32 refs., 3 figs., 3 tabs.« less

  13. [Construction of haplotype and haplotype block based on tag single nucleotide polymorphisms and their applications in association studies].

    PubMed

    Gu, Ming-liang; Chu, Jia-you

    2007-12-01

    Human genome has structures of haplotype and haplotype block which provide valuable information on human evolutionary history and may lead to the development of more efficient strategies to identify genetic variants that increase susceptibility to complex diseases. Haplotype block can be divided into discrete blocks of limited haplotype diversity. In each block, a small fraction of ptag SNPsq can be used to distinguish a large fraction of the haplotypes. These tag SNPs can be potentially useful for construction of haplotype and haplotype block, and association studies in complex diseases. There are two general classes of methods to construct haplotype and haplotype blocks based on genotypes on large pedigrees and statistical algorithms respectively. The author evaluate several construction methods to assess the power of different association tests with a variety of disease models and block-partitioning criteria. The advantages, limitations and applications of each method and the application in the association studies are discussed equitably. With the completion of the HapMap and development of statistical algorithms for addressing haplotype reconstruction, ideas of construction of haplotype based on combination of mathematics, physics, and computer science etc will have profound impacts on population genetics, location and cloning for susceptible genes in complex diseases, and related domain with life science etc.

  14. The targetable A1 Huntington disease haplotype has distinct Amerindian and European origins in Latin America

    PubMed Central

    Kay, Chris; Tirado-Hurtado, Indira; Cornejo-Olivas, Mario; Collins, Jennifer A; Wright, Galen; Inca-Martinez, Miguel; Veliz-Otani, Diego; Ketelaar, Maria E; Slama, Ramy A; Ross, Colin J; Mazzetti, Pilar; Hayden, Michael R

    2017-01-01

    Huntington disease (HD) is a dominant neurodegenerative disorder caused by a CAG repeat expansion in the Huntingtin (HTT) gene. HD occurs worldwide, but the causative mutation is found on different HTT haplotypes in distinct ethnic groups. In Latin America, HD is thought to have European origins, but indigenous Amerindian ancestry has not been investigated. Here, we report dense HTT haplotypes in 62 mestizo Peruvian HD families, 17 HD families from across Latin America, and 42 controls of defined Peruvian Amerindian ethnicity to determine the origin of HD in populations of admixed Amerindian and European descent. HD in Peru occurs most frequently on the A1 HTT haplotype (73%), as in Europe, but on an unexpected indigenous variant also found in Amerindian controls. This Amerindian A1 HTT haplotype predominates over the European A1 variant among geographically disparate Latin American controls and in HD families from across Latin America, supporting an indigenous origin of the HD mutation in mestizo American populations. We also show that a proportion of HD mutations in Peru occur on a C1 HTT haplotype of putative Amerindian origin (14%). The majority of HD mutations in Latin America may therefore occur on haplotypes of Amerindian ancestry rather than on haplotypes resulting from European admixture. Despite the distinct ethnic ancestry of Amerindian and European A1 HTT, alleles on the parent A1 HTT haplotype allow for development of identical antisense molecules to selectively silence the HD mutation in the greatest proportion of patients in both Latin American and European populations. PMID:28000697

  15. Development of a Multiple-Locus Variable number of tandem repeat Analysis (MLVA) for Leptospira interrogans and its application to Leptospira interrogans serovar Australis isolates from Far North Queensland, Australia

    PubMed Central

    Slack, Andrew T; Dohnt, Michael F; Symonds, Meegan L; Smythe, Lee D

    2005-01-01

    Background Leptospirosis is a zoonotic disease caused by the genus, Leptospira. Leptospira interrogans is the most common genomospecies implicated in the disease. Epidemiological investigations are needed to distinguish outbreak situations or to trace reservoirs of the organisms. Current methodologies used for typing Leptospira have significant drawbacks. The development of an easy to perform yet high resolution method is needed for this organism. Methods In this study we have searched the available genomic sequence of L. interrogans serovar Copenhageni strain Fiocruz L1-130 for the presence of tandem repeats [1]. These repeats were evaluated against reference strains for diversity. Six loci were selected to create a Multiple Locus Variable Number of Tandem Repeats (VNTR) Analysis (MLVA) to explore the genetic diversity within L. interrogans serovar Australis clinical isolates from Far North Queensland. Results The 39 reference strains used for the development of the method displayed 39 distinct patterns. Diversity Indexes for the loci varied between 0.80 and 0.93 and the number of repeat units at each locus varied between less than one to 52 repeats. When the MLVA was applied to serovar Australis isolates three large clusters were distinguishable, each comprising various hosts including Rattus species, human and canines. Conclusion The MLVA described in this report, was easy to perform, analyse and was reproducible. The loci selected had high diversity allowing discrimination between serovars and also between strains within a serovar. This method provides a starting point on which improvements to the method and comparisons to other techniques can be made. PMID:15987533

  16. Association between the dopamine D4 receptor gene exon III variable number of tandem repeats and political attitudes in female Han Chinese

    PubMed Central

    Ebstein, Richard P.; Monakhov, Mikhail V.; Lu, Yunfeng; Jiang, Yushi; Lai, Poh San; Chew, Soo Hong

    2015-01-01

    Twin and family studies suggest that political attitudes are partially determined by an individual's genotype. The dopamine D4 receptor gene (DRD4) exon III repeat region that has been extensively studied in connection with human behaviour, is a plausible candidate to contribute to individual differences in political attitudes. A first United States study provisionally identified this gene with political attitude along a liberal–conservative axis albeit contingent upon number of friends. In a large sample of 1771 Han Chinese university students in Singapore, we observed a significant main effect of association between the DRD4 exon III variable number of tandem repeats and political attitude. Subjects with two copies of the 4-repeat allele (4R/4R) were significantly more conservative. Our results provided evidence for a role of the DRD4 gene variants in contributing to individual differences in political attitude particularly in females and more generally suggested that associations between individual genes, and neurochemical pathways, contributing to traits relevant to the social sciences can be provisionally identified. PMID:26246555

  17. Association between the dopamine D4 receptor gene exon III variable number of tandem repeats and political attitudes in female Han Chinese.

    PubMed

    Ebstein, Richard P; Monakhov, Mikhail V; Lu, Yunfeng; Jiang, Yushi; Lai, Poh San; Chew, Soo Hong

    2015-08-22

    Twin and family studies suggest that political attitudes are partially determined by an individual's genotype. The dopamine D4 receptor gene (DRD4) exon III repeat region that has been extensively studied in connection with human behaviour, is a plausible candidate to contribute to individual differences in political attitudes. A first United States study provisionally identified this gene with political attitude along a liberal-conservative axis albeit contingent upon number of friends. In a large sample of 1771 Han Chinese university students in Singapore, we observed a significant main effect of association between the DRD4 exon III variable number of tandem repeats and political attitude. Subjects with two copies of the 4-repeat allele (4R/4R) were significantly more conservative. Our results provided evidence for a role of the DRD4 gene variants in contributing to individual differences in political attitude particularly in females and more generally suggested that associations between individual genes, and neurochemical pathways, contributing to traits relevant to the social sciences can be provisionally identified. © 2015 The Author(s).

  18. Identification and characterization of short tandem repeats in the Tibetan macaque genome based on resequencing data.

    PubMed

    Liu, San-Xu; Hou, Wei; Zhang, Xue-Yan; Peng, Chang-Jun; Yue, Bi-Song; Fan, Zhen-Xin; Li, Jing

    2018-07-18

    The Tibetan macaque, which is endemic to China, is currently listed as a Near Endangered primate species by the International Union for Conservation of Nature (IUCN). Short tandem repeats (STRs) refer to repetitive elements of genome sequence that range in length from 1-6 bp. They are found in many organisms and are widely applied in population genetic studies. To clarify the distribution characteristics of genome-wide STRs and understand their variation among Tibetan macaques, we conducted a genome-wide survey of STRs with next-generation sequencing of five macaque samples. A total of 1 077 790 perfect STRs were mined from our assembly, with an N50 of 4 966 bp. Mono-nucleotide repeats were the most abundant, followed by tetra- and di-nucleotide repeats. Analysis of GC content and repeats showed consistent results with other macaques. Furthermore, using STR analysis software (lobSTR), we found that the proportion of base pair deletions in the STRs was greater than that of insertions in the five Tibetan macaque individuals (P<0.05, t-test). We also found a greater number of homozygous STRs than heterozygous STRs (P<0.05, t-test), with the Emei and Jianyang Tibetan macaques showing more heterozygous loci than Huangshan Tibetan macaques. The proportion of insertions and mean variation of alleles in the Emei and Jianyang individuals were slightly higher than those in the Huangshan individuals, thus revealing differences in STR allele size between the two populations. The polymorphic STR loci identified based on the reference genome showed good amplification efficiency and could be used to study population genetics in Tibetan macaques. The neighbor-joining tree classified the five macaques into two different branches according to their geographical origin, indicating high genetic differentiation between the Huangshan and Sichuan populations. We elucidated the distribution characteristics of STRs in the Tibetan macaque genome and provided an effective method for

  19. An improved genome assembly uncovers prolific tandem repeats in Atlantic cod.

    PubMed

    Tørresen, Ole K; Star, Bastiaan; Jentoft, Sissel; Reinar, William B; Grove, Harald; Miller, Jason R; Walenz, Brian P; Knight, James; Ekholm, Jenny M; Peluso, Paul; Edvardsen, Rolf B; Tooming-Klunderud, Ave; Skage, Morten; Lien, Sigbjørn; Jakobsen, Kjetill S; Nederbragt, Alexander J

    2017-01-18

    The first Atlantic cod (Gadus morhua) genome assembly published in 2011 was one of the early genome assemblies exclusively based on high-throughput 454 pyrosequencing. Since then, rapid advances in sequencing technologies have led to a multitude of assemblies generated for complex genomes, although many of these are of a fragmented nature with a significant fraction of bases in gaps. The development of long-read sequencing and improved software now enable the generation of more contiguous genome assemblies. By combining data from Illumina, 454 and the longer PacBio sequencing technologies, as well as integrating the results of multiple assembly programs, we have created a substantially improved version of the Atlantic cod genome assembly. The sequence contiguity of this assembly is increased fifty-fold and the proportion of gap-bases has been reduced fifteen-fold. Compared to other vertebrates, the assembly contains an unusual high density of tandem repeats (TRs). Indeed, retrospective analyses reveal that gaps in the first genome assembly were largely associated with these TRs. We show that 21% of the TRs across the assembly, 19% in the promoter regions and 12% in the coding sequences are heterozygous in the sequenced individual. The inclusion of PacBio reads combined with the use of multiple assembly programs drastically improved the Atlantic cod genome assembly by successfully resolving long TRs. The high frequency of heterozygous TRs within or in the vicinity of genes in the genome indicate a considerable standing genomic variation in Atlantic cod populations, which is likely of evolutionary importance.

  20. [Association of aggressive behaviors of schizophrenia with short tandem repeats loci].

    PubMed

    Yang, Chun; Ba, Huajie; Tan, Xingqi; Zhao, Hanqing; Zhang, Shuyou; Yu, Haiying

    2017-12-10

    To assess the association of short tandem repeats (STRs) loci with aggressive behaviors of schizophrenia. Blood samples from 123 schizophrenic patients with aggressive behaviors and 489 schizophrenic patients without aggressive behaviors were collected. DNA from all samples was amplified with a PowerPlex 21 system and separated by electrophoresis to determine the genotypes and allelic frequencies of 20 STR loci including D3S1368, D1S1656, D6S1043, D13S317, Penta E, D16S639, D18S51, D2S1338, CSF1PO, Penta D, TH01, vWA, D21S11, D7S820, D5S818, TPOX, D8S1179, D12S391, D19S433, and FGA. All of the 20 STR loci have reached Hardy-Weinberg equilibrium in both groups. A significant difference was found in allelic and genotypic frequencies of loci Penta D between the two groups (alleles: P=0.042; genotypes: P=0.014) but not for the remaining 19 loci (P> 0.05). Univariate analysis also showed a significant difference for allele 10 and genotypes 10-12 of Penta D between the two groups (P=0.0027, P=0.0001), with the OR being 1.81 (95%CI: 1.22-2.67) and 4.33 (95%CI: 1.95-9.59), respectively. Penta D may be associated with aggressive behaviors of schizophrenia. Allele 10 and genotypes 10-12 of Penta D may confer a risk for the disease.

  1. Huntingtin Haplotypes Provide Prioritized Target Panels for Allele-specific Silencing in Huntington Disease Patients of European Ancestry

    PubMed Central

    Kay, Chris; Collins, Jennifer A; Skotte, Niels H; Southwell, Amber L; Warby, Simon C; Caron, Nicholas S; Doty, Crystal N; Nguyen, Betty; Griguoli, Annamaria; Ross, Colin J; Squitieri, Ferdinando; Hayden, Michael R

    2015-01-01

    Huntington disease (HD) is a dominant neurodegenerative disorder caused by a CAG repeat expansion in the Huntingtin gene (HTT). Heterozygous polymorphisms in cis with the mutation allow for allele-specific suppression of the pathogenic HTT transcript as a therapeutic strategy. To prioritize target selection, precise heterozygosity estimates are needed across diverse HD patient populations. Here we present the first comprehensive investigation of all common target alleles across the HTT gene, using 738 reference haplotypes from the 1000 Genomes Project and 2364 haplotypes from HD patients and relatives in Canada, Sweden, France, and Italy. The most common HD haplotypes (A1, A2, and A3a) define mutually exclusive sets of polymorphisms for allele-specific therapy in the greatest number of patients. Across all four populations, a maximum of 80% are treatable using these three target haplotypes. We identify a novel deletion found exclusively on the A1 haplotype, enabling potent and selective silencing of mutant HTT in approximately 40% of the patients. Antisense oligonucleotides complementary to the deletion reduce mutant A1 HTT mRNA by 78% in patient cells while sparing wild-type HTT expression. By suppressing specific haplotypes on which expanded CAG occurs, we demonstrate a rational approach to the development of allele-specific therapy for a monogenic disorder. PMID:26201449

  2. Chromosome fragility at FRAXA in human cleavage stage embryos at risk for fragile X syndrome.

    PubMed

    Verdyck, Pieter; Berckmoes, Veerle; De Vos, Anick; Verpoest, Willem; Liebaers, Inge; Bonduelle, Maryse; De Rycke, Martine

    2015-10-01

    Fragile X syndrome (FXS), the most common inherited intellectual disability syndrome, is caused by expansion and hypermethylation of the CGG repeat in the 5' UTR of the FMR1 gene. This expanded repeat, also known as the rare fragile site FRAXA, causes X chromosome fragility in cultured cells from patients but only when induced by perturbing pyrimidine synthesis. We performed preimplantation genetic diagnosis (PGD) on 595 blastomeres biopsied from 442 cleavage stage embryos at risk for FXS using short tandem repeat (STR) markers. In six blastomeres, from five embryos an incomplete haplotype was observed with loss of all alleles telomeric to the CGG repeat. In all five embryos, the incomplete haplotype corresponded to the haplotype carrying the CGG repeat expansion. Subsequent analysis of additional blastomeres from three embryos by array comparative genomic hybridization (aCGH) confirmed the presence of a terminal deletion with a breakpoint close to the CGG repeat in two blastomeres from one embryo. A blastomere from another embryo showed the complementary duplication. We conclude that a CGG repeat expansion at FRAXA causes X chromosome fragility in early human IVF embryos at risk for FXS. © 2015 Wiley Periodicals, Inc.

  3. Androgen receptor CAG repeat polymorphisms in canine prostate cancer.

    PubMed

    Lai, C-L; L'Eplattenier, H; van den Ham, R; Verseijden, F; Jagtenberg, A; Mol, J A; Teske, E

    2008-01-01

    Relatively shorter lengths of the polymorphic polyglutamine repeat-1 of the androgen receptor (AR) have been associated with an increased risk of prostate cancer (PC) in humans. In the dog, there are 2 polymorphic CAG repeat (CAGr) regions. To investigate the relationship of CAGr length of the canine AR-gene and the development of PC. Thirty-two dogs with PC and 172 control dogs were used. DNA was extracted from blood. Both CAG repeats were amplified by polymerase chain reaction (PCR) and PCR products were sequenced. In dogs with PC, CAG-1 repeat length was shorter (P = .001) by an increased proportion of 10 repeats (P = .011) and no 12 repeats (P = .0017) than in the control dogs. No significant changes were found in CAG-3 length distribution. CAG-1 and CAG-3 polymorphisms proved not to be in linkage disequilibrium. Breed difference in allelic distribution was found in the control group. Of the prostate-disease sensitive breeds, a high percentage (64.5%) of the shortest haplotype 10/11 was found in the Doberman, whereas Beagles and German Pointers had higher haplotype 12/11 (47.1 and 50%). Bernese Mountain dogs and Bouvier dogs both shared a high percentage of 11 CAG-1 repeats and 13 CAG-3 repeats. Differences in (combined) allelic distributions among breeds were not significant. In this preliminary study, short CAG-1 repeats in the AR-gene were associated with an increased risk of developing canine PC. Although breed-specific differences in allelic distribution of CAG-1 and CAG-3 repeats were found, these could not be related to PC risk.

  4. Population Structure With Localized Haplotype Clusters

    PubMed Central

    Browning, Sharon R.; Weir, Bruce S.

    2010-01-01

    We propose a multilocus version of FST and a measure of haplotype diversity using localized haplotype clusters. Specifically, we use haplotype clusters identified with BEAGLE, which is a program implementing a hidden Markov model for localized haplotype clustering and performing several functions including inference of haplotype phase. We apply this methodology to HapMap phase 3 data. With this haplotype-cluster approach, African populations have highest diversity and lowest divergence from the ancestral population, East Asian populations have lowest diversity and highest divergence, and other populations (European, Indian, and Mexican) have intermediate levels of diversity and divergence. These relationships accord with expectation based on other studies and accepted models of human history. In contrast, the population-specific FST estimates obtained directly from single-nucleotide polymorphisms (SNPs) do not reflect such expected relationships. We show that ascertainment bias of SNPs has less impact on the proposed haplotype-cluster-based FST than on the SNP-based version, which provides a potential explanation for these results. Thus, these new measures of FST and haplotype-cluster diversity provide an important new tool for population genetic analysis of high-density SNP data. PMID:20457877

  5. TUMOR HAPLOTYPE ASSEMBLY ALGORITHMS FOR CANCER GENOMICS

    PubMed Central

    AGUIAR, DEREK; WONG, WENDY S.W.; ISTRAIL, SORIN

    2014-01-01

    The growing availability of inexpensive high-throughput sequence data is enabling researchers to sequence tumor populations within a single individual at high coverage. But, cancer genome sequence evolution and mutational phenomena like driver mutations and gene fusions are difficult to investigate without first reconstructing tumor haplotype sequences. Haplotype assembly of single individual tumor populations is an exceedingly difficult task complicated by tumor haplotype heterogeneity, tumor or normal cell sequence contamination, polyploidy, and complex patterns of variation. While computational and experimental haplotype phasing of diploid genomes has seen much progress in recent years, haplotype assembly in cancer genomes remains uncharted territory. In this work, we describe HapCompass-Tumor a computational modeling and algorithmic framework for haplotype assembly of copy number variable cancer genomes containing haplotypes at different frequencies and complex variation. We extend our polyploid haplotype assembly model and present novel algorithms for (1) complex variations, including copy number changes, as varying numbers of disjoint paths in an associated graph, (2) variable haplotype frequencies and contamination, and (3) computation of tumor haplotypes using simple cycles of the compass graph which constrain the space of haplotype assembly solutions. The model and algorithm are implemented in the software package HapCompass-Tumor which is available for download from http://www.brown.edu/Research/Istrail_Lab/. PMID:24297529

  6. Developmental Validation of Short Tandem Repeat Reagent Kit for Forensic DNA Profiling of Canine Biological Materials

    PubMed Central

    Dayton, Melody; Koskinen, Mikko T; Tom, Bradley K; Mattila, Anna-Maria; Johnston, Eric; Halverson, Joy; Fantin, Dennis; DeNise, Sue; Budowle, Bruce; Smith, David Glenn; Kanthaswamy, Sree

    2009-01-01

    Aim To develop a reagent kit that enables multiplex polymerase chain reaction (PCR) amplification of 18 short tandem repeats (STR) and the canine sex-determining Zinc Finger marker. Methods Validation studies to determine the robustness and reliability in forensic DNA typing of this multiplex assay included sensitivity testing, reproducibility studies, intra- and inter-locus color balance studies, annealing temperature and cycle number studies, peak height ratio determination, characterization of artifacts such as stutter percentages and dye blobs, mixture analyses, species-specificity, case type samples analyses and population studies. Results The kit robustly amplified domesticated dog samples and consistently generated full 19-locus profiles from as little as 125 pg of dog DNA. In addition, wolf DNA samples could be analyzed with the kit. Conclusion The kit, which produces robust, reliable, and reproducible results, will be made available for the forensic research community after modifications based on this study’s evaluation to comply with the quality standards expected for forensic casework. PMID:19480022

  7. Identification of Variable-Number Tandem-Repeat (VNTR) Sequences in Acinetobacter baumannii and Interlaboratory Validation of an Optimized Multiple-Locus VNTR Analysis Typing Scheme▿†

    PubMed Central

    Pourcel, Christine; Minandri, Fabrizia; Hauck, Yolande; D'Arezzo, Silvia; Imperi, Francesco; Vergnaud, Gilles; Visca, Paolo

    2011-01-01

    Acinetobacter baumannii is an important opportunistic pathogen responsible for nosocomial outbreaks, mostly occurring in intensive care units. Due to the multiplicity of infection sources, reliable molecular fingerprinting techniques are needed to establish epidemiological correlations among A. baumannii isolates. Multiple-locus variable-number tandem-repeat analysis (MLVA) has proven to be a fast, reliable, and cost-effective typing method for several bacterial species. In this study, an MLVA assay compatible with simple PCR- and agarose gel-based electrophoresis steps as well as with high-throughput automated methods was developed for A. baumannii typing. Preliminarily, 10 potential polymorphic variable-number tandem repeats (VNTRs) were identified upon bioinformatic screening of six annotated genome sequences of A. baumannii. A collection of 7 reference strains plus 18 well-characterized isolates, including unique types and representatives of the three international A. baumannii lineages, was then evaluated in a two-center study aimed at validating the MLVA assay and comparing it with other genotyping assays, namely, macrorestriction analysis with pulsed-field gel electrophoresis (PFGE) and PCR-based sequence group (SG) profiling. The results showed that MLVA can discriminate between isolates with identical PFGE types and SG profiles. A panel of eight VNTR markers was selected, all showing the ability to be amplified and good amounts of polymorphism in the majority of strains. Independently generated MLVA profiles, composed of an ordered string of allele numbers corresponding to the number of repeats at each VNTR locus, were concordant between centers. Typeability, reproducibility, stability, discriminatory power, and epidemiological concordance were excellent. A database containing information and MLVA profiles for several A. baumannii strains is available from http://mlva.u-psud.fr/. PMID:21147956

  8. Mitochondrial DNA haplotype distribution patterns in Pinus ponderosa (Pinaceae): range-wide evolutionary history and implications for conservation.

    PubMed

    Potter, Kevin M; Hipkins, Valerie D; Mahalovich, Mary F; Means, Robert E

    2013-08-01

    Ponderosa pine (Pinus ponderosa Douglas ex P. Lawson & C. Lawson) exhibits complicated patterns of morphological and genetic variation across its range in western North America. This study aims to clarify P. ponderosa evolutionary history and phylogeography using a highly polymorphic mitochondrial DNA marker, with results offering insights into how geographical and climatological processes drove the modern evolutionary structure of tree species in the region. We amplified the mtDNA nad1 second intron minisatellite region for 3,100 trees representing 104 populations, and sequenced all length variants. We estimated population-level haplotypic diversity and determined diversity partitioning among varieties, races and populations. After aligning sequences of minisatellite repeat motifs, we evaluated evolutionary relationships among haplotypes. The geographical structuring of the 10 haplotypes corresponded with division between Pacific and Rocky Mountain varieties. Pacific haplotypes clustered with high bootstrap support, and appear to have descended from Rocky Mountain haplotypes. A greater proportion of diversity was partitioned between Rocky Mountain races than between Pacific races. Areas of highest haplotypic diversity were the southern Sierra Nevada mountain range in California, northwestern California, and southern Nevada. Pinus ponderosa haplotype distribution patterns suggest a complex phylogeographic history not revealed by other genetic and morphological data, or by the sparse paleoecological record. The results appear consistent with long-term divergence between the Pacific and Rocky Mountain varieties, along with more recent divergences not well-associated with race. Pleistocene refugia may have existed in areas of high haplotypic diversity, as well as the Great Basin, Southwestern United States/northern Mexico, and the High Plains.

  9. [Reticulate evolution of parthenogenetic species of the Lacertidae rock lizards: inheritance of CLsat tandem repeats and anonymous RAPD markers].

    PubMed

    Chobanu, D; Rudykh, I A; Riabinina, N L; Grechko, V V; Kramerov, D A; Darevskiĭ, I S

    2002-01-01

    The genetic relatedness of several bisexual and of four unisexual "Lacerta saxicola complex" lizards was studied, using monomer sequences of the complex-specific CLsat tandem repeats and anonymous RAPD markers. Genomes of parthenospecies were shown to include different satellite monomers. The structure of each such monomer is specific for a certain pair of bisexual species. This fact might be interpreted in favor of co-dominant inheritance of these markers in bisexual species hybridogenesis. This idea is supported by the results obtained with RAPD markers; i.e., unisexual species genomes include only the loci characteristic of certain bisexual species. At the same time, in neither case parthenospecies possess specific, autoapomorphic loci that were not present in this or that bisexual species.

  10. Immunogenicity of a recombinant fusion protein of tandem repeat epitopes of foot-and-mouth disease virus type Asia 1 for guinea pigs.

    PubMed

    Zhang, Q; Yang, Y Q; Zhang, Z Y; Li, L; Yan, W Y; Jiang, W J; Xin, A G; Lei, C X; Zheng, Z X

    2002-01-01

    In this study, the sequences of capsid protein VPI regions of YNAs1.1 and YNAs1.2 isolates of foot-and-mouth disease virus (FMDV) were analyzed and a peptide containing amino acids (aa) 133-158 of VP1 and aa 20-34 of VP4 of FMDV type Asia I was assumed to contain B and T cell epitopes, because it is hypervariable and includes a cell attachment site RGD located in the G-H loop. The DNA fragments encoding aa 133-158 of VP1 and aa 20-34 of VP4 of FMDV type Asia 1 were chemically synthesized and ligated into a tandem repeat of aa 133-158-20 approximately 34-133-158. In order to enhance its immunogenicity, the tandem repeat was inserted downstream of the beta-galactosidase gene in the expression vector pWR590. This insertion yielded a recombinant expression vector pAS1 encoding the fusion protein. The latter reacted with sera from FMDV type Asia 1-infected animals in vitro and elicited high levels of neutralizing antibodies in guinea pigs. The T cell proliferation in immunized animals increased following stimulation with the fusion protein. It is reported for the first time that a recombinant fusion protein vaccine was produced using B and T cell epitopes of FMDV type Asia 1 and that this fusion protein was immunogenic. The fusion protein reported here can serve as a candidate of fusion epitopes for design of a vaccine against FMDV type Asia 1.

  11. Direct mapping of symbolic DNA sequence into frequency domain in global repeat map algorithm

    PubMed Central

    Glunčić, Matko; Paar, Vladimir

    2013-01-01

    The main feature of global repeat map (GRM) algorithm (www.hazu.hr/grm/software/win/grm2012.exe) is its ability to identify a broad variety of repeats of unbounded length that can be arbitrarily distant in sequences as large as human chromosomes. The efficacy is due to the use of complete set of a K-string ensemble which enables a new method of direct mapping of symbolic DNA sequence into frequency domain, with straightforward identification of repeats as peaks in GRM diagram. In this way, we obtain very fast, efficient and highly automatized repeat finding tool. The method is robust to substitutions and insertions/deletions, as well as to various complexities of the sequence pattern. We present several case studies of GRM use, in order to illustrate its capabilities: identification of α-satellite tandem repeats and higher order repeats (HORs), identification of Alu dispersed repeats and of Alu tandems, identification of Period 3 pattern in exons, implementation of ‘magnifying glass’ effect, identification of complex HOR pattern, identification of inter-tandem transitional dispersed repeat sequences and identification of long segmental duplications. GRM algorithm is convenient for use, in particular, in cases of large repeat units, of highly mutated and/or complex repeats, and of global repeat maps for large genomic sequences (chromosomes and genomes). PMID:22977183

  12. Sequence repeats and protein structure

    NASA Astrophysics Data System (ADS)

    Hoang, Trinh X.; Trovato, Antonio; Seno, Flavio; Banavar, Jayanth R.; Maritan, Amos

    2012-11-01

    Repeats are frequently found in known protein sequences. The level of sequence conservation in tandem repeats correlates with their propensities to be intrinsically disordered. We employ a coarse-grained model of a protein with a two-letter amino acid alphabet, hydrophobic (H) and polar (P), to examine the sequence-structure relationship in the realm of repeated sequences. A fraction of repeated sequences comprises a distinct class of bad folders, whose folding temperatures are much lower than those of random sequences. Imperfection in sequence repetition improves the folding properties of the bad folders while deteriorating those of the good folders. Our results may explain why nature has utilized repeated sequences for their versatility and especially to design functional proteins that are intrinsically unstructured at physiological temperatures.

  13. Classical sickle beta-globin haplotypes exhibit a high degree of long-range haplotype similarity in African and Afro-Caribbean populations.

    PubMed

    Hanchard, Neil; Elzein, Abier; Trafford, Clare; Rockett, Kirk; Pinder, Margaret; Jallow, Muminatou; Harding, Rosalind; Kwiatkowski, Dominic; McKenzie, Colin

    2007-08-10

    The sickle (betas) mutation in the beta-globin gene (HBB) occurs on five "classical" betas haplotype backgrounds in ethnic groups of African ancestry. Strong selection in favour of the betas allele - a consequence of protection from severe malarial infection afforded by heterozygotes - has been associated with a high degree of extended haplotype similarity. The relationship between classical betas haplotypes and long-range haplotype similarity may have both anthropological and clinical implications, but to date has not been explored. Here we evaluate the haplotype similarity of classical betas haplotypes over 400 kb in population samples from Jamaica, The Gambia, and among the Yoruba of Nigeria (Hapmap YRI). The most common betas sub-haplotype among Jamaicans and the Yoruba was the Benin haplotype, while in The Gambia the Senegal haplotype was observed most commonly. Both subtypes exhibited a high degree of long-range haplotype similarity extending across approximately 400 kb in all three populations. This long-range similarity was significantly greater than that seen for other haplotypes sampled in these populations (P < 0.001), and was independent of marker choice and marker density. Among the Yoruba, Benin haplotypes were highly conserved, with very strong linkage disequilibrium (LD) extending a megabase across the betas mutation. Two different classical betas haplotypes, sampled from different populations, exhibit comparable and extensive long-range haplotype similarity and strong LD. This LD extends across the adjacent recombination hotspot, and is discernable at distances in excess of 400 kb. Although the multi-centric geographic distribution of betas haplotypes indicates strong subdivision among early Holocene sub-Saharan populations, we find no evidence that selective pressures imposed by falciparum malaria varied in intensity or timing between these subpopulations. Our observations also suggest that cis-acting loci, which may influence outcomes in sickle

  14. Reverse Transcription Errors and RNA-DNA Differences at Short Tandem Repeats.

    PubMed

    Fungtammasan, Arkarachai; Tomaszkiewicz, Marta; Campos-Sánchez, Rebeca; Eckert, Kristin A; DeGiorgio, Michael; Makova, Kateryna D

    2016-10-01

    Transcript variation has important implications for organismal function in health and disease. Most transcriptome studies focus on assessing variation in gene expression levels and isoform representation. Variation at the level of transcript sequence is caused by RNA editing and transcription errors, and leads to nongenetically encoded transcript variants, or RNA-DNA differences (RDDs). Such variation has been understudied, in part because its detection is obscured by reverse transcription (RT) and sequencing errors. It has only been evaluated for intertranscript base substitution differences. Here, we investigated transcript sequence variation for short tandem repeats (STRs). We developed the first maximum-likelihood estimator (MLE) to infer RT error and RDD rates, taking next generation sequencing error rates into account. Using the MLE, we empirically evaluated RT error and RDD rates for STRs in a large-scale DNA and RNA replicated sequencing experiment conducted in a primate species. The RT error rates increased exponentially with STR length and were biased toward expansions. The RDD rates were approximately 1 order of magnitude lower than the RT error rates. The RT error rates estimated with the MLE from a primate data set were concordant with those estimated with an independent method, barcoded RNA sequencing, from a Caenorhabditis elegans data set. Our results have important implications for medical genomics, as STR allelic variation is associated with >40 diseases. STR nonallelic transcript variation can also contribute to disease phenotype. The MLE and empirical rates presented here can be used to evaluate the probability of disease-associated transcripts arising due to RDD. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  15. Multi-laboratory validation study of multilocus variable-number tandem repeat analysis (MLVA) for Salmonella enterica serovar Enteritidis, 2015

    PubMed Central

    Peters, Tansy; Bertrand, Sophie; Björkman, Jonas T; Brandal, Lin T; Brown, Derek J; Erdõsi, Tímea; Heck, Max; Ibrahem, Salha; Johansson, Karin; Kornschober, Christian; Kotila, Saara M; Le Hello, Simon; Lienemann, Taru; Mattheus, Wesley; Nielsen, Eva Møller; Ragimbeau, Catherine; Rumore, Jillian; Sabol, Ashley; Torpdahl, Mia; Trees, Eija; Tuohy, Alma; de Pinna, Elizabeth

    2017-01-01

    Multilocus variable-number tandem repeat analysis (MLVA) is a rapid and reproducible typing method that is an important tool for investigation, as well as detection, of national and multinational outbreaks of a range of food-borne pathogens. Salmonella enterica serovar Enteritidis is the most common Salmonella serovar associated with human salmonellosis in the European Union/European Economic Area and North America. Fourteen laboratories from 13 countries in Europe and North America participated in a validation study for MLVA of S. Enteritidis targeting five loci. Following normalisation of fragment sizes using a set of reference strains, a blinded set of 24 strains with known allele sizes was analysed by each participant. The S. Enteritidis 5-loci MLVA protocol was shown to produce internationally comparable results as more than 90% of the participants reported less than 5% discrepant MLVA profiles. All 14 participating laboratories performed well, even those where experience with this typing method was limited. The raw fragment length data were consistent throughout, and the inter-laboratory validation helped to standardise the conversion of raw data to repeat numbers with at least two countries updating their internal procedures. However, differences in assigned MLVA profiles remain between well-established protocols and should be taken into account when exchanging data. PMID:28277220

  16. Multi-laboratory validation study of multilocus variable-number tandem repeat analysis (MLVA) for Salmonella enterica serovar Enteritidis, 2015.

    PubMed

    Peters, Tansy; Bertrand, Sophie; Björkman, Jonas T; Brandal, Lin T; Brown, Derek J; Erdõsi, Tímea; Heck, Max; Ibrahem, Salha; Johansson, Karin; Kornschober, Christian; Kotila, Saara M; Le Hello, Simon; Lienemann, Taru; Mattheus, Wesley; Nielsen, Eva Møller; Ragimbeau, Catherine; Rumore, Jillian; Sabol, Ashley; Torpdahl, Mia; Trees, Eija; Tuohy, Alma; de Pinna, Elizabeth

    2017-03-02

    Multilocus variable-number tandem repeat analysis (MLVA) is a rapid and reproducible typing method that is an important tool for investigation, as well as detection, of national and multinational outbreaks of a range of food-borne pathogens. Salmonella enterica serovar Enteritidis is the most common Salmonella serovar associated with human salmonellosis in the European Union/European Economic Area and North America. Fourteen laboratories from 13 countries in Europe and North America participated in a validation study for MLVA of S. Enteritidis targeting five loci. Following normalisation of fragment sizes using a set of reference strains, a blinded set of 24 strains with known allele sizes was analysed by each participant. The S. Enteritidis 5-loci MLVA protocol was shown to produce internationally comparable results as more than 90% of the participants reported less than 5% discrepant MLVA profiles. All 14 participating laboratories performed well, even those where experience with this typing method was limited. The raw fragment length data were consistent throughout, and the inter-laboratory validation helped to standardise the conversion of raw data to repeat numbers with at least two countries updating their internal procedures. However, differences in assigned MLVA profiles remain between well-established protocols and should be taken into account when exchanging data. This article is copyright of The Authors, 2017.

  17. A Legionella pneumophila collagen-like protein encoded by a gene with a variable number of tandem repeats is involved in the adherence and invasion of host cells.

    PubMed

    Vandersmissen, Liesbeth; De Buck, Emmy; Saels, Veerle; Coil, David A; Anné, Jozef

    2010-05-01

    Legionella pneumophila is a Gram-negative, facultative intracellular pathogen and the causative agent of Legionnaires' disease, a severe pneumonia in humans. Analysis of the Legionella sequenced genomes revealed a gene with a variable number of tandem repeats (VNTRs), whose number varies between strains. We examined the strain distribution of this gene among a collection of 108 clinical, environmental and hot spring serotype I strains. Twelve variants were identified, but no correlation was observed between the number of repeat units and clinical and environmental strains. The encoded protein contains the C-terminal consensus motif of outer membrane proteins and has a large region of collagen-like repeats that is encoded by the VNTR region. We have therefore annotated this protein Lcl for Legionella collagen-like protein. Lcl was shown to contribute to the adherence and invasion of host cells and it was demonstrated that the number of repeat units present in lcl had an influence on these adhesion characteristics.

  18. How Have Self-Incompatibility Haplotypes Diversified? Generation of New Haplotypes during the Evolution of Self-Incompatibility from Self-Compatibility.

    PubMed

    Sakai, Satoki

    2016-08-01

    I developed a gametophytic self-incompatibility (SI) model to study the conditions leading to diversification in SI haplotypes. In the model, the SI system is assumed to be incomplete, and the pollen expressing a given specificity is not fully rejected by the pistils expressing the same specificity. I also assumed that mutations can occur that enhance the rejection of pollen by pistils with the same haplotype variant and reduce rejection by pistils with other variants in the same haplotype. I found that if such mutations occur, the new haplotypes (mutant variants) can stably coexist with the ancestral haplotype in which the mutant arose. This is because pollen bearing the new haplotype is most strongly rejected by pistils bearing the same new haplotype among the pistils in the population; hence, negative frequency-dependent selection prevents their fixation. I also performed simulations and found that the nearly complete SI system evolves from completely self-compatible populations and that SI haplotypes can increase to about 40-50 within a few thousand generations. On the basis of my findings, I propose that diversification of SI haplotypes occurred during the evolution of SI from self-compatibility.

  19. Multi-locus variable number tandem repeat analysis of 7th pandemic Vibrio cholerae

    PubMed Central

    2012-01-01

    Background Seven pandemics of cholera have been recorded since 1817, with the current and ongoing pandemic affecting almost every continent. Cholera remains endemic in developing countries and is still a significant public health issue. In this study we use multilocus variable number of tandem repeats (VNTRs) analysis (MLVA) to discriminate between isolates of the 7th pandemic clone of Vibrio cholerae. Results MLVA of six VNTRs selected from previously published data distinguished 66 V. cholerae isolates collected between 1961–1999 into 60 unique MLVA profiles. Only 4 MLVA profiles consisted of more than 2 isolates. The discriminatory power was 0.995. Phylogenetic analysis showed that, except for the closely related profiles, the relationships derived from MLVA profiles were in conflict with that inferred from Single Nucleotide Polymorphism (SNP) typing. The six SNP groups share consensus VNTR patterns and two SNP groups contained isolates which differed by only one VNTR locus. Conclusions MLVA is highly discriminatory in differentiating 7th pandemic V. cholerae isolates and MLVA data was most useful in resolving the genetic relationships among isolates within groups previously defined by SNPs. Thus MLVA is best used in conjunction with SNP typing in order to best determine the evolutionary relationships among the 7th pandemic V. cholerae isolates and for longer term epidemiological typing. PMID:22624829

  20. Use of Variable-Number Tandem Repeats To Examine Genetic Diversity of Neisseria meningitidis

    PubMed Central

    Yazdankhah, Siamak P.; Lindstedt, Bjørn-Arne; Caugant, Dominique A.

    2005-01-01

    Repetitive DNA motifs with potential variable-number tandem repeats (VNTR) were identified in the genome of Neisseria meningitidis and used to develop a typing method. A total of 146 meningococcal isolates recovered from carriers and patients were studied. These included 82 of the 107 N. meningitidis isolates previously used in the development of multilocus sequence typing (MLST), 45 isolates recovered from different counties in Norway in connection with local outbreaks, and 19 serogroup W135 isolates of sequence type 11 (ST-11), which were recovered in several parts of the world. The latter group comprised isolates related to the Hajj outbreak of 2000 and isolates recovered from outbreaks in Burkina Faso in 2001 and 2002. All isolates had been characterized previously by MLST or multilocus enzyme electrophoresis (MLEE). VNTR analysis showed that meningococcal isolates with similar MLST or MLEE types recovered from epidemiologically linked cases in a defined geographical area often presented similar VNTR patterns while isolates of the same MLST or MLEE types without an obvious epidemiological link showed variable VNTR patterns. Thus, VNTR analysis may be used for fine typing of meningococcal isolates after MLST or MLEE typing. The method might be especially valuable for differentiating among ST-11 strains, as shown by the VNTR analyses of serogroup W135 ST-11 meningococcal isolates recovered since the mid-1990s. PMID:15814988

  1. Ligand binding by repeat proteins: natural and designed

    PubMed Central

    Grove, Tijana Z; Cortajarena, Aitziber L; Regan, Lynne

    2012-01-01

    Repeat proteins contain tandem arrays of small structural motifs. As a consequence of this architecture, they adopt non-globular, extended structures that present large, highly specific surfaces for ligand binding. Here we discuss recent advances toward understanding the functional role of this unique modular architecture. We showcase specific examples of natural repeat proteins interacting with diverse ligands and also present examples of designed repeat protein–ligand interactions. PMID:18602006

  2. Classical sickle beta-globin haplotypes exhibit a high degree of long-range haplotype similarity in African and Afro-Caribbean populations

    PubMed Central

    Hanchard, Neil; Elzein, Abier; Trafford, Clare; Rockett, Kirk; Pinder, Margaret; Jallow, Muminatou; Harding, Rosalind; Kwiatkowski, Dominic; McKenzie, Colin

    2007-01-01

    Background The sickle (βs) mutation in the beta-globin gene (HBB) occurs on five "classical" βs haplotype backgrounds in ethnic groups of African ancestry. Strong selection in favour of the βs allele – a consequence of protection from severe malarial infection afforded by heterozygotes – has been associated with a high degree of extended haplotype similarity. The relationship between classical βs haplotypes and long-range haplotype similarity may have both anthropological and clinical implications, but to date has not been explored. Here we evaluate the haplotype similarity of classical βs haplotypes over 400 kb in population samples from Jamaica, The Gambia, and among the Yoruba of Nigeria (Hapmap YRI). Results The most common βs sub-haplotype among Jamaicans and the Yoruba was the Benin haplotype, while in The Gambia the Senegal haplotype was observed most commonly. Both subtypes exhibited a high degree of long-range haplotype similarity extending across approximately 400 kb in all three populations. This long-range similarity was significantly greater than that seen for other haplotypes sampled in these populations (P < 0.001), and was independent of marker choice and marker density. Among the Yoruba, Benin haplotypes were highly conserved, with very strong linkage disequilibrium (LD) extending a megabase across the βs mutation. Conclusion Two different classical βs haplotypes, sampled from different populations, exhibit comparable and extensive long-range haplotype similarity and strong LD. This LD extends across the adjacent recombination hotspot, and is discernable at distances in excess of 400 kb. Although the multi-centric geographic distribution of βs haplotypes indicates strong subdivision among early Holocene sub-Saharan populations, we find no evidence that selective pressures imposed by falciparum malaria varied in intensity or timing between these subpopulations. Our observations also suggest that cis-acting loci, which may influence

  3. Inter-laboratory comparison of multi-locus variable-number tandem repeat analysis (MLVA) for verocytotoxin-producing Escherichia coli O157 to facilitate data sharing.

    PubMed

    Holmes, A; Perry, N; Willshaw, G; Hanson, M; Allison, L

    2015-01-01

    Multi-locus variable number tandem repeat analysis (MLVA) is used in clinical and reference laboratories for subtyping verocytotoxin-producing Escherichia coli O157 (VTEC O157). However, as yet there is no common allelic or profile nomenclature to enable laboratories to easily compare data. In this study, we carried out an inter-laboratory comparison of an eight-loci MLVA scheme using a set of 67 isolates of VTEC O157. We found all but two isolates were identical in profile in the two laboratories, and repeat units were homogeneous in size but some were incomplete. A subset of the isolates (n = 17) were sequenced to determine the actual copy number of representative alleles, thereby enabling alleles to be named according to international consensus guidelines. This work has enabled us to realize the potential of MLVA as a portable, highly discriminatory and convenient subtyping method.

  4. Short tandem repeat DNA typing provides an international reference standard for authentication of human cell lines.

    PubMed

    Dirks, Wilhelm Gerhard; Faehnrich, Silke; Estella, Isabelle Annick Janine; Drexler, Hans Guenter

    2005-01-01

    Cell lines have wide applications as model systems in the medical and pharmaceutical industry. Much drug and chemical testing is now first carried out exhaustively on in vitro systems, reducing the need for complicated and invasive animal experiments. The basis for any research, development or production program involving cell lines is the choice of an authentic cell line. Microsatellites in the human genome that harbour short tandem repeat (STR) DNA markers allow individualisation of established cell lines at the DNA level. Fluorescence polymerase chain reaction amplification of eight highly polymorphic microsatellite STR loci plus gender determination was found to be the best tool to screen the uniqueness of DNA profiles in a fingerprint database. Our results demonstrate that cross-contamination and misidentification remain chronic problems in the use of human continuous cell lines. The combination of rapidly generated DNA types based on single-locus STR and their authentication or individualisation by screening the fingerprint database constitutes a highly reliable and robust method for the identification and verification of cell lines.

  5. The profile of repeat-associated histone lysine methylation states in the mouse epigenome

    PubMed Central

    Martens, Joost H A; O'Sullivan, Roderick J; Braunschweig, Ulrich; Opravil, Susanne; Radolf, Martin; Steinlein, Peter; Jenuwein, Thomas

    2005-01-01

    Histone lysine methylation has been shown to index silenced chromatin regions at, for example, pericentric heterochromatin or of the inactive X chromosome. Here, we examined the distribution of repressive histone lysine methylation states over the entire family of DNA repeats in the mouse genome. Using chromatin immunoprecipitation in a cluster analysis representing repetitive elements, our data demonstrate the selective enrichment of distinct H3-K9, H3-K27 and H4-K20 methylation marks across tandem repeats (e.g. major and minor satellites), DNA transposons, retrotransposons, long interspersed nucleotide elements and short interspersed nucleotide elements. Tandem repeats, but not the other repetitive elements, give rise to double-stranded (ds) RNAs that are further elevated in embryonic stem (ES) cells lacking the H3-K9-specific Suv39h histone methyltransferases. Importantly, although H3-K9 tri- and H4-K20 trimethylation appear stable at the satellite repeats, many of the other repeat-associated repressive marks vary in chromatin of differentiated ES cells or of embryonic trophoblasts and fibroblasts. Our data define a profile of repressive histone lysine methylation states for the repetitive complement of four distinct mouse epigenomes and suggest tandem repeats and dsRNA as primary triggers for more stable chromatin imprints. PMID:15678104

  6. Haplotyping for disease association: a combinatorial approach.

    PubMed

    Lancia, Giuseppe; Ravi, R; Rizzi, Romeo

    2008-01-01

    We consider a combinatorial problem derived from haplotyping a population with respect to a genetic disease, either recessive or dominant. Given a set of individuals, partitioned into healthy and diseased, and the corresponding sets of genotypes, we want to infer "bad'' and "good'' haplotypes to account for these genotypes and for the disease. Assume e.g. the disease is recessive. Then, the resolving haplotypes must consist of bad and good haplotypes, so that (i) each genotype belonging to a diseased individual is explained by a pair of bad haplotypes and (ii) each genotype belonging to a healthy individual is explained by a pair of haplotypes of which at least one is good. We prove that the associated decision problem is NP-complete. However, we also prove that there is a simple solution, provided the data satisfy a very weak requirement.

  7. Copy Number Heterogeneity, Large Origin Tandem Repeats, and Interspecies Recombination in Human Herpesvirus 6A (HHV-6A) and HHV-6B Reference Strains

    PubMed Central

    Roychoudhury, Pavitra; Makhsous, Negar; Hanson, Derek; Chase, Jill; Krueger, Gerhard; Xie, Hong; Huang, Meei-Li; Saunders, Lindsay; Ablashi, Dharam; Koelle, David M.; Cook, Linda; Jerome, Keith R.

    2018-01-01

    ABSTRACT Quantitative PCR is a diagnostic pillar for clinical virology testing, and reference materials are necessary for accurate, comparable quantitation between clinical laboratories. Accurate quantitation of human herpesvirus 6A/B (HHV-6A/B) is important for detection of viral reactivation and inherited chromosomally integrated HHV-6A/B in immunocompromised patients. Reference materials in clinical virology commonly consist of laboratory-adapted viral strains that may be affected by the culture process. We performed next-generation sequencing to make relative copy number measurements at single nucleotide resolution of eight candidate HHV-6A and seven HHV-6B reference strains and DNA materials from the HHV-6 Foundation and Advanced Biotechnologies Inc. Eleven of 17 (65%) HHV-6A/B candidate reference materials showed multiple copies of the origin of replication upstream of the U41 gene by next-generation sequencing. These large tandem repeats arose independently in culture-adapted HHV-6A and HHV-6B strains, measuring 1,254 bp and 983 bp, respectively. The average copy number measured was between 5 and 10 times the number of copies of the rest of the genome. We also report the first interspecies recombinant HHV-6A/B strain with a HHV-6A backbone and a >5.5-kb region from HHV-6B, from U41 to U43, that covered the origin tandem repeat. Specific HHV-6A reference strains demonstrated duplication of regions at U1/U2, U87, and U89, as well as deletion in the U12-to-U24 region and the U94/U95 genes. HHV-6A/B strains derived from cord blood mononuclear cells from different laboratories on different continents with fewer passages revealed no copy number differences throughout the viral genome. These data indicate that large origin tandem duplications are an adaptation of both HHV-6A and HHV-6B in culture and show interspecies recombination is possible within the Betaherpesvirinae. IMPORTANCE Anything in science that needs to be quantitated requires a standard unit of

  8. Interleukin-1 Receptor Antagonist and Interleukin-4 Genes Variable Number Tandem Repeats Are Associated with Adiposity in Malaysian Subjects

    PubMed Central

    Kok, Yung-Yean; Ong, Hing-Huat

    2017-01-01

    Interleukin-1 receptor antagonist (IL1RA) intron 2 86 bp repeat and interleukin-4 (IL4) intron 3 70 bp repeat are variable number tandem repeats (VNTRs) that have been associated with various diseases, but their role in obesity is elusive. The objective of this study was to investigate the association of IL1RA and IL4 VNTRs with obesity and adiposity in 315 Malaysian subjects (128 M/187 F; 23 Malays/251 ethnic Chinese/41 ethnic Indians). The allelic distributions of IL1RA and IL4 were significantly different among ethnicities, and the alleles were associated with total body fat (TBF) classes. Individuals with IL1RA I/II genotype or allele II had greater risk of having higher overall adiposity, relative to those having the I/I genotype or I allele, respectively, even after controlling for ethnicity [Odds Ratio (OR) of I/II genotype = 12.21 (CI = 2.54, 58.79; p = 0.002); II allele = 5.78 (CI = 1.73, 19.29; p = 0.004)]. However, IL4 VNTR B2 allele was only significantly associated with overall adiposity status before adjusting for ethnicity [OR = 1.53 (CI = 1.04, 2.23; p = 0.03)]. Individuals with IL1RA II allele had significantly higher TBF than those with I allele (31.79 ± 2.52 versus 23.51 ± 0.40; p = 0.005). Taken together, IL1RA intron 2 VNTR seems to be a genetic marker for overall adiposity status in Malaysian subjects. PMID:28293435

  9. Interleukin-1 Receptor Antagonist and Interleukin-4 Genes Variable Number Tandem Repeats Are Associated with Adiposity in Malaysian Subjects.

    PubMed

    Kok, Yung-Yean; Ong, Hing-Huat; Say, Yee-How

    2017-01-01

    Interleukin-1 receptor antagonist ( IL1RA ) intron 2 86 bp repeat and interleukin-4 ( IL4 ) intron 3 70 bp repeat are variable number tandem repeats (VNTRs) that have been associated with various diseases, but their role in obesity is elusive. The objective of this study was to investigate the association of IL1RA and IL4 VNTRs with obesity and adiposity in 315 Malaysian subjects (128 M/187 F; 23 Malays/251 ethnic Chinese/41 ethnic Indians). The allelic distributions of IL1RA and IL4 were significantly different among ethnicities, and the alleles were associated with total body fat (TBF) classes. Individuals with IL1RA I/II genotype or allele II had greater risk of having higher overall adiposity, relative to those having the I/I genotype or I allele, respectively, even after controlling for ethnicity [Odds Ratio (OR) of I/II genotype = 12.21 (CI = 2.54, 58.79; p = 0.002); II allele = 5.78 (CI = 1.73, 19.29; p = 0.004)]. However, IL4 VNTR B2 allele was only significantly associated with overall adiposity status before adjusting for ethnicity [OR = 1.53 (CI = 1.04, 2.23; p = 0.03)]. Individuals with IL1RA II allele had significantly higher TBF than those with I allele (31.79 ± 2.52 versus 23.51 ± 0.40; p = 0.005). Taken together, IL1RA intron 2 VNTR seems to be a genetic marker for overall adiposity status in Malaysian subjects.

  10. Factor IX gene haplotypes in Amerindians.

    PubMed

    Franco, R F; Araújo, A G; Zago, M A; Guerreiro, J F; Figueiredo, M S

    1997-02-01

    We have determined the haplotypes of the factor IX gene for 95 Indians from 5 Brazilian Amazon tribes: Wayampí, Wayana-Apalaí, Kayapó, Arára, and Yanomámi. Eight polymorphisms linked to the factor IX gene were investigated: MseI (at 5', nt -698), BamHI (at 5', nt -561), DdeI (intron 1), BamHI (intron 2), XmnI (intron 3), TaqI (intron 4), MspI (intron 4), and HhaI (at 3', approximately 8 kb). The results of the haplotype distribution and the allele frequencies for each of the factor IX gene polymorphisms in Amerindians were similar to the results reported for Asian populations but differed from results for other ethnic groups. Only five haplotypes were identified within the entire Amerindian study population, and the haplotype distribution was significantly different among the five tribes, with one (Arára) to four (Wayampí) haplotypes being found per tribe. These findings indicate a significant heterogeneity among the Indian tribes and contrast with the homogeneous distribution of the beta-globin gene cluster haplotypes but agree with our recent findings on the distribution of alpha-globin gene cluster haplotypes and the allele frequencies for six VNTRs in the same Amerindian tribes. Our data represent the first study of factor IX-associated polymorphisms in Amerindian populations and emphasizes the applicability of these genetic markers for population and human evolution studies.

  11. Multi-locus variable-number tandem repeat analysis for outbreak studies of Salmonella enterica serotype Enteritidis

    PubMed Central

    Malorny, Burkhard; Junker, Ernst; Helmuth, Reiner

    2008-01-01

    Background Salmonella enterica subsp. enterica serotype Enteritidis is known as an important and pathogenic clonal group which continues to cause worldwide sporadic cases and outbreaks in humans. Here a new multiple-locus variable-number tandem repeat analysis (MLVA) method is reported for highly-discriminative subtyping of Salmonella Enteritidis. Emphasis was given on the most predominant phage types PT4 and PT8. The method comprises multiplex PCR specifically amplifying repeated sequences from nine different loci followed by an automatic fragment size analysis using a multicolor capillary electrophoresis instrument. A total of 240 human, animal, food and environmental isolates of S. Enteritidis including 23 definite phage types were used for development and validation. Furthermore, the MLVA types were compared to the phage types of several isolates from two recent outbreaks to determine the concordance between both methods and to estimate their in vivo stability. The in vitro stability of the two MLVA types specifically for PT4 and PT8 strains were determined by multiple freeze-thaw cycles. Results Seventy-nine different MLVA types were identified in 240 S. Enteritidis strains. The Simpson's diversity index for the MLVA method was 0.919 and Nei diversity values for the nine VNTR loci ranged from 0.07 to 0.65. Twenty-four MLVA types could be assigned to 62 PT4 strains and 21 types to 81 PT8 strains. All outbreak isolates had an indistinguishable outbreak specific MLVA type. The in vitro stability experiments showed no changes of the MLVA type compared to the original isolate. Conclusion This MLVA method is useful to discriminate S. Enteritidis strains even within a single phage type. It is easy in use, fast, and cheap compared to other high-resolution molecular methods and therefore an important tool for surveillance and outbreak studies for S. Enteritidis. PMID:18513386

  12. A comprehensive Y-STR portrait of Yousafzai's population.

    PubMed

    Tabassum, Sadia; Ilyas, Muhammad; Ullah, Inam; Israr, Muhammad; Ahmad, Habib

    2017-09-01

    In the current study, 17 Y-Chromosomal short tandem repeats (Y-STRs) included in theAmpFlSTR Y-Filer amplification kit (Applied Biosystems, Foster City, USA) were investigated in 146 unrelated Yousafzai males residing in the Khyber Pakhtunkhwa Province of Pakistan. A total of 94 (89.52%) unique haplotypes were observed. Discrimination capacity was 71.92%. Haplotype diversity ranged from 0.354 (DYS456) to 0.663 (DYS458). Both Rst pairwise analysis and multidimensional scaling plot showed that the genetic structure of the Yousafzais is significantly different from neighbouring populations.

  13. Comprehensive mutation analysis of 17 Y-chromosomal short tandem repeat polymorphisms included in the AmpFlSTR Yfiler PCR amplification kit.

    PubMed

    Goedbloed, Miriam; Vermeulen, Mark; Fang, Rixun N; Lembring, Maria; Wollstein, Andreas; Ballantyne, Kaye; Lao, Oscar; Brauer, Silke; Krüger, Carmen; Roewer, Lutz; Lessig, Rüdiger; Ploski, Rafal; Dobosz, Tadeusz; Henke, Lotte; Henke, Jürgen; Furtado, Manohar R; Kayser, Manfred

    2009-11-01

    The Y-chromosomal short tandem repeat (Y-STR) polymorphisms included in the AmpFlSTR Yfiler polymerase chain reaction amplification kit have become widely used for forensic and evolutionary applications where a reliable knowledge on mutation properties is necessary for correct data interpretation. Therefore, we investigated the 17 Yfiler Y-STRs in 1,730-1,764 DNA-confirmed father-son pairs per locus and found 84 sequence-confirmed mutations among the 29,792 meiotic transfers covered. Of the 84 mutations, 83 (98.8%) were single-repeat changes and one (1.2%) was a double-repeat change (ratio, 1:0.01), as well as 43 (51.2%) were repeat gains and 41 (48.8%) repeat losses (ratio, 1:0.95). Medians from Bayesian estimation of locus-specific mutation rates ranged from 0.0003 for DYS448 to 0.0074 for DYS458, with a median rate across all 17 Y-STRs of 0.0025. The mean age (at the time of son's birth) of fathers with mutations was with 34.40 (+/-11.63) years higher than that of fathers without ones at 30.32 (+/-10.22) years, a difference that is highly statistically significant (p < 0.001). A Poisson-based modeling revealed that the Y-STR mutation rate increased with increasing father's age on a statistically significant level (alpha = 0.0294, 2.5% quantile = 0.0001). From combining our data with those previously published, considering all together 135,212 meiotic events and 331 mutations, we conclude for the Yfiler Y-STRs that (1) none had a mutation rate of >1%, 12 had mutation rates of >0.1% and four of <0.1%, (2) single-repeat changes were strongly favored over multiple-repeat ones for all loci but 1 and (3) considerable variation existed among loci in the ratio of repeat gains versus losses. Our finding of three Y-STR mutations in one father-son pair (and two pairs with two mutations each) has consequences for determining the threshold of allelic differences to conclude exclusion constellations in future applications of Y-STRs in paternity testing and pedigree analyses.

  14. Subtyping of a Large Collection of Historical Listeria monocytogenes Strains from Ontario, Canada, by an Improved Multilocus Variable-Number Tandem-Repeat Analysis (MLVA)

    PubMed Central

    Saleh-Lakha, S.; Allen, V. G.; Li, J.; Pagotto, F.; Odumeru, J.; Taboada, E.; Lombos, M.; Tabing, K. C.; Blais, B.; Ogunremi, D.; Downing, G.; Lee, S.; Gao, A.; Nadon, C.

    2013-01-01

    Listeria monocytogenes is responsible for severe and often fatal food-borne infections in humans. A collection of 2,421 L. monocytogenes isolates originating from Ontario's food chain between 1993 and 2010, along with Ontario clinical isolates collected from 2004 to 2010, was characterized using an improved multilocus variable-number tandem-repeat analysis (MLVA). The MLVA method was established based on eight primer pairs targeting seven variable-number tandem-repeat (VNTR) loci in two 4-plex fluorescent PCRs. Diversity indices and amplification rates of the individual VNTR loci ranged from 0.38 to 0.92 and from 0.64 to 0.99, respectively. MLVA types and pulsed-field gel electrophoresis (PFGE) patterns were compared using Comparative Partitions analysis involving 336 clinical and 99 food and environmental isolates. The analysis yielded Simpson's diversity index values of 0.998 and 0.992 for MLVA and PFGE, respectively, and adjusted Wallace coefficients of 0.318 when MLVA was used as a primary subtyping method and 0.088 when PFGE was a primary typing method. Statistical data analysis using BioNumerics allowed for identification of at least 8 predominant and persistent L. monocytogenes MLVA types in Ontario's food chain. The MLVA method correctly clustered epidemiologically related outbreak strains and separated unrelated strains in a subset analysis. An MLVA database was established for the 2,421 L. monocytogenes isolates, which allows for comparison of data among historical and new isolates of different sources. The subtyping method coupled with the MLVA database will help in effective monitoring/prevention approaches to identify environmental contamination by pathogenic strains of L. monocytogenes and investigation of outbreaks. PMID:23956391

  15. Diversity and evolution of centromere repeats in the maize genome.

    PubMed

    Bilinski, Paul; Distor, Kevin; Gutierrez-Lopez, Jose; Mendoza, Gabriela Mendoza; Shi, Jinghua; Dawe, R Kelly; Ross-Ibarra, Jeffrey

    2015-03-01

    Centromere repeats are found in most eukaryotes and play a critical role in kinetochore formation. Though centromere repeats exhibit considerable diversity both within and among species, little is understood about the mechanisms that drive centromere repeat evolution. Here, we use maize as a model to investigate how a complex history involving polyploidy, fractionation, and recent domestication has impacted the diversity of the maize centromeric repeat CentC. We first validate the existence of long tandem arrays of repeats in maize and other taxa in the genus Zea. Although we find considerable sequence diversity among CentC copies genome-wide, genetic similarity among repeats is highest within these arrays, suggesting that tandem duplications are the primary mechanism for the generation of new copies. Nonetheless, clustering analyses identify similar sequences among distant repeats, and simulations suggest that this pattern may be due to homoplasious mutation. Although the two ancestral subgenomes of maize have contributed nearly equal numbers of centromeres, our analysis shows that the majority of all CentC repeats derive from one of the parental genomes, with an even stronger bias when examining the largest assembled contiguous clusters. Finally, by comparing maize with its wild progenitor teosinte, we find that the abundance of CentC likely decreased after domestication, while the pericentromeric repeat Cent4 has drastically increased.

  16. Detecting structure of haplotypes and local ancestry

    USDA-ARS?s Scientific Manuscript database

    We present a two-layer hidden Markov model to detect the structure of haplotypes for unrelated individuals. This allows us to model two scales of linkage disequilibrium (one within a group of haplotypes and one between groups), thereby taking advantage of rich haplotype information to infer local an...

  17. Allele Frequencies for 15 Short Tandem Repeat Loci in Representative Sample of Croatian Population

    PubMed Central

    Projić, Petar; Škaro, Vedrana; Šamija, Ivana; Pojskić, Naris; Durmić-Pašić, Adaleta; Kovačević, Lejla; Bakal, Narcisa; Primorac, Dragan; Marjanović, Damir

    2007-01-01

    Aim To study the distribution of allele frequencies of 15 short tandem repeat (STR) loci in a representative sample of the Croatian population. Methods A total of 195 unrelated Caucasian individuals born in Croatia, from 14 counties and the City of Zagreb, were sampled for the analysis. All the tested individuals were voluntary donors. Buccal swab was used as the DNA source. AmpFlSTR® Identifiler® was applied to simultaneously amplify 15 STR loci. Total reaction volume was 12.5 μL. The polymerase chain reaction (PCR) amplification was carried out in PE Gene Amp PCR System Thermal Cycler. Electrophoresis of the amplification products was preformed on an ABI PRISM 3130 Genetic Analyzer. After PCR amplification and separation by electrophoresis, raw data were compiled, analyzed, and numerical allele designations of the profiles were obtained. Deviation from Hardy-Weinberg equilibrium, observed and expected heterozygosity, power of discrimination, and power of exclusion were calculated. Bonferroni’s correction was used before each comparative analysis. Results We compared Croatian data with those obtained from geographically neighboring European populations. The significant difference (at P<0.01) in allele frequencies was recorded only between the Croatian and Slovenian populations for vWA locus. There was no significant deviation from Hardy-Weinberg equilibrium for all the observed loci. Conclusion Obtained population data concurred with the expected “STR data frame” for this part of Europe. PMID:17696301

  18. Extended Islands of Tractability for Parsimony Haplotyping

    NASA Astrophysics Data System (ADS)

    Fleischer, Rudolf; Guo, Jiong; Niedermeier, Rolf; Uhlmann, Johannes; Wang, Yihui; Weller, Mathias; Wu, Xi

    Parsimony haplotyping is the problem of finding a smallest size set of haplotypes that can explain a given set of genotypes. The problem is NP-hard, and many heuristic and approximation algorithms as well as polynomial-time solvable special cases have been discovered. We propose improved fixed-parameter tractability results with respect to the parameter "size of the target haplotype set" k by presenting an O *(k 4k )-time algorithm. This also applies to the practically important constrained case, where we can only use haplotypes from a given set. Furthermore, we show that the problem becomes polynomial-time solvable if the given set of genotypes is complete, i.e., contains all possible genotypes that can be explained by the set of haplotypes.

  19. Expanded complexity of unstable repeat diseases

    PubMed Central

    Polak, Urszula; McIvor, Elizabeth; Dent, Sharon Y.R.; Wells, Robert D.; Napierala, Marek

    2015-01-01

    Unstable Repeat Diseases (URDs) share a common mutational phenomenon of changes in the copy number of short, tandemly repeated DNA sequences. More than 20 human neurological diseases are caused by instability, predominantly expansion, of microsatellite sequences. Changes in the repeat size initiate a cascade of pathological processes, frequently characteristic of a unique disease or a small subgroup of the URDs. Understanding of both the mechanism of repeat instability and molecular consequences of the repeat expansions is critical to developing successful therapies for these diseases. Recent technological breakthroughs in whole genome, transcriptome and proteome analyses will almost certainly lead to new discoveries regarding the mechanisms of repeat instability, the pathogenesis of URDs, and will facilitate development of novel therapeutic approaches. The aim of this review is to give a general overview of unstable repeats diseases, highlight the complexities of these diseases, and feature the emerging discoveries in the field. PMID:23233240

  20. Exploring the repeat protein universe through computational protein design

    DOE PAGES

    Brunette, TJ; Parmeggiani, Fabio; Huang, Po-Ssu; ...

    2015-12-16

    A central question in protein evolution is the extent to which naturally occurring proteins sample the space of folded structures accessible to the polypeptide chain. Repeat proteins composed of multiple tandem copies of a modular structure unit are widespread in nature and have critical roles in molecular recognition, signalling, and other essential biological processes. Naturally occurring repeat proteins have been re-engineered for molecular recognition and modular scaffolding applications. In this paper, we use computational protein design to investigate the space of folded structures that can be generated by tandem repeating a simple helix–loop–helix–loop structural motif. Eighty-three designs with sequences unrelatedmore » to known repeat proteins were experimentally characterized. Of these, 53 are monomeric and stable at 95 °C, and 43 have solution X-ray scattering spectra consistent with the design models. Crystal structures of 15 designs spanning a broad range of curvatures are in close agreement with the design models with root mean square deviations ranging from 0.7 to 2.5 Å. Finally, our results show that existing repeat proteins occupy only a small fraction of the possible repeat protein sequence and structure space and that it is possible to design novel repeat proteins with precisely specified geometries, opening up a wide array of new possibilities for biomolecular engineering.« less

  1. Deep landscape update of dispersed and tandem repeats in the genome model of the red jungle fowl, Gallus gallus, using a series of de novo investigating tools.

    PubMed

    Guizard, Sébastien; Piégu, Benoît; Arensburger, Peter; Guillou, Florian; Bigot, Yves

    2016-08-19

    The program RepeatMasker and the database Repbase-ISB are part of the most widely used strategy for annotating repeats in animal genomes. They have been used to show that avian genomes have a lower repeat content (8-12 %) than the sequenced genomes of many vertebrate species (30-55 %). However, the efficiency of such a library-based strategies is dependent on the quality and completeness of the sequences in the database that is used. An alternative to these library based methods are methods that identify repeats de novo. These alternative methods have existed for a least a decade and may be more powerful than the library based methods. We have used an annotation strategy involving several complementary de novo tools to determine the repeat content of the model genome galGal4 (1.04 Gbp), including identifying simple sequence repeats (SSRs), tandem repeats and transposable elements (TEs). We annotated over one Gbp. of the galGal4 genome and showed that it is composed of approximately 19 % SSRs and TEs repeats. Furthermore, we estimate that the actual genome of the red jungle fowl contains about 31-35 % repeats. We find that library-based methods tend to overestimate TE diversity. These results have a major impact on the current understanding of repeats distributions throughout chromosomes in the red jungle fowl. Our results are a proof of concept of the reliability of using de novo tools to annotate repeats in large animal genomes. They have also revealed issues that will need to be resolved in order to develop gold-standard methodologies for annotating repeats in eukaryote genomes.

  2. New Multilocus Variable-Number Tandem-Repeat Analysis Tool for Surveillance and Local Epidemiology of Bacterial Leaf Blight and Bacterial Leaf Streak of Rice Caused by Xanthomonas oryzae

    PubMed Central

    Poulin, L.; Grygiel, P.; Magne, M.; Rodriguez-R, L. M.; Forero Serna, N.; Zhao, S.; El Rafii, M.; Dao, S.; Tekete, C.; Wonni, I.; Koita, O.; Pruvost, O.; Verdier, V.; Vernière, C.

    2014-01-01

    Multilocus variable-number tandem-repeat analysis (MLVA) is efficient for routine typing and for investigating the genetic structures of natural microbial populations. Two distinct pathovars of Xanthomonas oryzae can cause significant crop losses in tropical and temperate rice-growing countries. Bacterial leaf streak is caused by X. oryzae pv. oryzicola, and bacterial leaf blight is caused by X. oryzae pv. oryzae. For the latter, two genetic lineages have been described in the literature. We developed a universal MLVA typing tool both for the identification of the three X. oryzae genetic lineages and for epidemiological analyses. Sixteen candidate variable-number tandem-repeat (VNTR) loci were selected according to their presence and polymorphism in 10 draft or complete genome sequences of the three X. oryzae lineages and by VNTR sequencing of a subset of loci of interest in 20 strains per lineage. The MLVA-16 scheme was then applied to 338 strains of X. oryzae representing different pathovars and geographical locations. Linkage disequilibrium between MLVA loci was calculated by index association on different scales, and the 16 loci showed linear Mantel correlation with MLSA data on 56 X. oryzae strains, suggesting that they provide a good phylogenetic signal. Furthermore, analyses of sets of strains for different lineages indicated the possibility of using the scheme for deeper epidemiological investigation on small spatial scales. PMID:25398857

  3. Substructure of a Tunisian Berber population as inferred from 15 autosomal short tandem repeat loci.

    PubMed

    Khodjet-El-Khil, Houssein; Fadhlaoui-Zid, Karima; Gusmão, Leonor; Alves, Cíntia; Benammar-Elgaaied, Amel; Amorim, Antonio

    2008-08-01

    Currently, language and cultural practices are the only criteria to distinguish between Berber autochthonous Tunisian populations. To evaluate these populations' possible genetic structure and differentiation, we have analyzed 15 autosomal short tandem repeat loci (CSF1PO, D3S1358, D5S818, D7S820, D8S1179, D13S317, D16S539, D18S51, D21S11, FGA, TH01, TPOX, VWA, D2S1338, and D19S433) in three southern Tunisian Berber groups: Sened, Matmata, and Chenini-Douiret. The exact test of population differentiation based on allele frequencies at the 15 loci shows significant P values at 7 loci between Chenini-Douiret and both Sened and Matmata, whereas just 5 loci show significant P values between Sened and Matmata. Comparative analyses between the three Berber groups based on genetic distances show that P values for F(ST) distances are significant between the three Berber groups. Population analysis performed using Structure shows a clear differentiation between these Berber groups, with strong genetic isolation of Chenini-Douiret. These results confirm at the autosomal level the high degree of heterogeneity of Tunisian Berber populations that had been previously reported for uniparental markers.

  4. Molecular typing of Argentinian Mycobacterium avium subsp. paratuberculosis isolates by multiple-locus variable number-tandem repeat analysis

    PubMed Central

    Gioffré, Andrea; Correa Muñoz, Magnolia; Alvarado Pinedo, María F.; Vaca, Roberto; Morsella, Claudia; Fiorentino, María Andrea; Paolicchi, Fernando; Ruybal, Paula; Zumárraga, Martín; Travería, Gabriel E.; Romano, María Isabel

    2015-01-01

    Multiple-locus variable number-tandem repeat analysis (MLVA) of Mycobacterium avium subspecies paratuberculosis (MAP) isolates may contribute to the knowledge of strain diversity in Argentina. Although the diversity of MAP has been previously investigated in Argentina using IS900-RFLP, a small number of isolates were employed, and a low discriminative power was reached. The aim of the present study was to test the genetic diversity among MAP isolates using an MLVA approach based on 8 repetitive loci. We studied 97 isolates from cattle, goat and sheep and could describe 7 different patterns: INMV1, INMV2, INMV11, INMV13, INMV16, INMV33 and one incomplete pattern. INMV1 and INMV2 were the most frequent patterns, grouping 76.3% of the isolates. We were also able to demonstrate the coexistence of genotypes in herds and co-infection at the organism level. This study shows that all the patterns described are common to those described in Europe, suggesting an epidemiological link between the continents. PMID:26273274

  5. A novel typing method for Listeria monocytogenes using high-resolution melting analysis (HRMA) of tandem repeat regions.

    PubMed

    Ohshima, Chihiro; Takahashi, Hajime; Iwakawa, Ai; Kuda, Takashi; Kimura, Bon

    2017-07-17

    Listeria monocytogenes, which is responsible for causing food poisoning known as listeriosis, infects humans and animals. Widely distributed in the environment, this bacterium is known to contaminate food products after being transmitted to factories via raw materials. To minimize the contamination of products by food pathogens, it is critical to identify and eliminate factory entry routes and pathways for the causative bacteria. High resolution melting analysis (HRMA) is a method that takes advantage of differences in DNA sequences and PCR product lengths that are reflected by the disassociation temperature. Through our research, we have developed a multiple locus variable-number tandem repeat analysis (MLVA) using HRMA as a simple and rapid method to differentiate L. monocytogenes isolates. While evaluating our developed method, the ability of MLVA-HRMA, MLVA using capillary electrophoresis, and multilocus sequence typing (MLST) was compared for their ability to discriminate between strains. The MLVA-HRMA method displayed greater discriminatory ability than MLST and MLVA using capillary electrophoresis, suggesting that the variation in the number of repeat units, along with mutations within the DNA sequence, was accurately reflected by the melting curve of HRMA. Rather than relying on DNA sequence analysis or high-resolution electrophoresis, the MLVA-HRMA method employs the same process as PCR until the analysis step, suggesting a combination of speed and simplicity. The result of MLVA-HRMA method is able to be shared between different laboratories. There are high expectations that this method will be adopted for regular inspections at food processing facilities in the near future. Copyright © 2017. Published by Elsevier B.V.

  6. Origin of the Outbreak in France of Pseudomonas syringae pv. actinidiae Biovar 3, the Causal Agent of Bacterial Canker of Kiwifruit, Revealed by a Multilocus Variable-Number Tandem-Repeat Analysis.

    PubMed

    Cunty, A; Cesbron, S; Poliakoff, F; Jacques, M-A; Manceau, C

    2015-10-01

    The first outbreaks of bacterial canker of kiwifruit caused by Pseudomonas syringae pv. actinidiae biovar 3 were detected in France in 2010. P. syringae pv. actinidiae causes leaf spots, dieback, and canker that sometimes lead to the death of the vine. P. syringae pv. actinidifoliorum, which is pathogenic on kiwi as well, causes only leaf spots. In order to conduct an epidemiological study to track the spread of the epidemics of these two pathogens in France, we developed a multilocus variable-number tandem-repeat (VNTR) analysis (MLVA). MLVA was conducted on 340 strains of P. syringae pv. actinidiae biovar 3 isolated in Chile, China, France, Italy, and New Zealand and on 39 strains of P. syringae pv. actinidifoliorum isolated in Australia, France, and New Zealand. Eleven polymorphic VNTR loci were identified in the genomes of P. syringae pv. actinidiae biovar 3 ICMP 18744 and of P. syringae pv. actinidifoliorum ICMP 18807. MLVA enabled the structuring of P. syringae pv. actinidiae biovar 3 and P. syringae pv. actinidifoliorum strains in 55 and 16 haplotypes, respectively. MLVA and discriminant analysis of principal components revealed that strains isolated in Chile, China, and New Zealand are genetically distinct from P. syringae pv. actinidiae strains isolated in France and in Italy, which appear to be closely related at the genetic level. In contrast, no structuring was observed for P. syringae pv. actinidifoliorum. We developed an MLVA scheme to explore the diversity within P. syringae pv. actinidiae biovar 3 and to trace the dispersal routes of epidemic P. syringae pv. actinidiae biovar 3 in Europe. We suggest using this MLVA scheme to trace the dispersal routes of P. syringae pv. actinidiae at a global level. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  7. Origin of the Outbreak in France of Pseudomonas syringae pv. actinidiae Biovar 3, the Causal Agent of Bacterial Canker of Kiwifruit, Revealed by a Multilocus Variable-Number Tandem-Repeat Analysis

    PubMed Central

    Cunty, A.; Cesbron, S.; Poliakoff, F.; Jacques, M.-A.

    2015-01-01

    The first outbreaks of bacterial canker of kiwifruit caused by Pseudomonas syringae pv. actinidiae biovar 3 were detected in France in 2010. P. syringae pv. actinidiae causes leaf spots, dieback, and canker that sometimes lead to the death of the vine. P. syringae pv. actinidifoliorum, which is pathogenic on kiwi as well, causes only leaf spots. In order to conduct an epidemiological study to track the spread of the epidemics of these two pathogens in France, we developed a multilocus variable-number tandem-repeat (VNTR) analysis (MLVA). MLVA was conducted on 340 strains of P. syringae pv. actinidiae biovar 3 isolated in Chile, China, France, Italy, and New Zealand and on 39 strains of P. syringae pv. actinidifoliorum isolated in Australia, France, and New Zealand. Eleven polymorphic VNTR loci were identified in the genomes of P. syringae pv. actinidiae biovar 3 ICMP 18744 and of P. syringae pv. actinidifoliorum ICMP 18807. MLVA enabled the structuring of P. syringae pv. actinidiae biovar 3 and P. syringae pv. actinidifoliorum strains in 55 and 16 haplotypes, respectively. MLVA and discriminant analysis of principal components revealed that strains isolated in Chile, China, and New Zealand are genetically distinct from P. syringae pv. actinidiae strains isolated in France and in Italy, which appear to be closely related at the genetic level. In contrast, no structuring was observed for P. syringae pv. actinidifoliorum. We developed an MLVA scheme to explore the diversity within P. syringae pv. actinidiae biovar 3 and to trace the dispersal routes of epidemic P. syringae pv. actinidiae biovar 3 in Europe. We suggest using this MLVA scheme to trace the dispersal routes of P. syringae pv. actinidiae at a global level. PMID:26209667

  8. Reconstruction of Haplotype-Blocks Selected during Experimental Evolution.

    PubMed

    Franssen, Susanne U; Barton, Nicholas H; Schlötterer, Christian

    2017-01-01

    The genetic analysis of experimentally evolving populations typically relies on short reads from pooled individuals (Pool-Seq). While this method provides reliable allele frequency estimates, the underlying haplotype structure remains poorly characterized. With small population sizes and adaptive variants that start from low frequencies, the interpretation of selection signatures in most Evolve and Resequencing studies remains challenging. To facilitate the characterization of selection targets, we propose a new approach that reconstructs selected haplotypes from replicated time series, using Pool-Seq data. We identify selected haplotypes through the correlated frequencies of alleles carried by them. Computer simulations indicate that selected haplotype-blocks of several Mb can be reconstructed with high confidence and low error rates, even when allele frequencies change only by 20% across three replicates. Applying this method to real data from D. melanogaster populations adapting to a hot environment, we identify a selected haplotype-block of 6.93 Mb. We confirm the presence of this haplotype-block in evolved populations by experimental haplotyping, demonstrating the power and accuracy of our haplotype reconstruction from Pool-Seq data. We propose that the combination of allele frequency estimates with haplotype information will provide the key to understanding the dynamics of adaptive alleles. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  9. Repeat-containing protein effectors of plant-associated organisms

    PubMed Central

    Mesarich, Carl H.; Bowen, Joanna K.; Hamiaux, Cyril; Templeton, Matthew D.

    2015-01-01

    Many plant-associated organisms, including microbes, nematodes, and insects, deliver effector proteins into the apoplast, vascular tissue, or cell cytoplasm of their prospective hosts. These effectors function to promote colonization, typically by altering host physiology or by modulating host immune responses. The same effectors however, can also trigger host immunity in the presence of cognate host immune receptor proteins, and thus prevent colonization. To circumvent effector-triggered immunity, or to further enhance host colonization, plant-associated organisms often rely on adaptive effector evolution. In recent years, it has become increasingly apparent that several effectors of plant-associated organisms are repeat-containing proteins (RCPs) that carry tandem or non-tandem arrays of an amino acid sequence or structural motif. In this review, we highlight the diverse roles that these repeat domains play in RCP effector function. We also draw attention to the potential role of these repeat domains in adaptive evolution with regards to RCP effector function and the evasion of effector-triggered immunity. The aim of this review is to increase the profile of RCP effectors from plant-associated organisms. PMID:26557126

  10. Repeat-containing protein effectors of plant-associated organisms.

    PubMed

    Mesarich, Carl H; Bowen, Joanna K; Hamiaux, Cyril; Templeton, Matthew D

    2015-01-01

    Many plant-associated organisms, including microbes, nematodes, and insects, deliver effector proteins into the apoplast, vascular tissue, or cell cytoplasm of their prospective hosts. These effectors function to promote colonization, typically by altering host physiology or by modulating host immune responses. The same effectors however, can also trigger host immunity in the presence of cognate host immune receptor proteins, and thus prevent colonization. To circumvent effector-triggered immunity, or to further enhance host colonization, plant-associated organisms often rely on adaptive effector evolution. In recent years, it has become increasingly apparent that several effectors of plant-associated organisms are repeat-containing proteins (RCPs) that carry tandem or non-tandem arrays of an amino acid sequence or structural motif. In this review, we highlight the diverse roles that these repeat domains play in RCP effector function. We also draw attention to the potential role of these repeat domains in adaptive evolution with regards to RCP effector function and the evasion of effector-triggered immunity. The aim of this review is to increase the profile of RCP effectors from plant-associated organisms.

  11. Rare Sequence Variation in the Genome Flanking a Short Tandem Repeat Locus Can Lead to a Question of “Nonmaternity”

    PubMed Central

    Deucher, Anne; Chiang, Tsoyu; Schrijver, Iris

    2010-01-01

    Typing of STR (short tandem repeat) alleles is used in a variety of applications in clinical molecular pathology, including evaluations for maternal cell contamination. Using a commercially available STR typing assay for maternal cell contamination performed in conjunction with prenatal diagnostic testing, we were posed with apparent nonmaternity when the two fetal samples did not demonstrate the expected maternal allele at one locus. By designing primers external to the region amplified by the primers from the commercial assay and by performing direct sequencing of the resulting amplicon, we were able to determine that a guanine to adenine sequence variation led to primer mismatch and allele dropout. This explained the apparent null allele shared between the maternal and fetal samples. Therefore, although rare, allele dropout must be considered whenever unexplained homozygosity at an STR locus is observed. PMID:20203001

  12. Hierarchical modeling of genome-wide Short Tandem Repeat (STR) markers infers native American prehistory.

    PubMed

    Lewis, Cecil M

    2010-02-01

    This study examines a genome-wide dataset of 678 Short Tandem Repeat loci characterized in 444 individuals representing 29 Native American populations as well as the Tundra Netsi and Yakut populations from Siberia. Using these data, the study tests four current hypotheses regarding the hierarchical distribution of neutral genetic variation in native South American populations: (1) the western region of South America harbors more variation than the eastern region of South America, (2) Central American and western South American populations cluster exclusively, (3) populations speaking the Chibchan-Paezan and Equatorial-Tucanoan language stock emerge as a group within an otherwise South American clade, (4) Chibchan-Paezan populations in Central America emerge together at the tips of the Chibchan-Paezan cluster. This study finds that hierarchical models with the best fit place Central American populations, and populations speaking the Chibchan-Paezan language stock, at a basal position or separated from the South American group, which is more consistent with a serial founder effect into South America than that previously described. Western (Andean) South America is found to harbor similar levels of variation as eastern (Equatorial-Tucanoan and Ge-Pano-Carib) South America, which is inconsistent with an initial west coast migration into South America. Moreover, in all relevant models, the estimates of genetic diversity within geographic regions suggest a major bottleneck or founder effect occurring within the North American subcontinent, before the peopling of Central and South America. 2009 Wiley-Liss, Inc.

  13. Analysis of short tandem repeat polymorphisms using infrared fluorescence with M18 tailed primers

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Oetting, W.S.; Wiesner, G.; Laken, S.

    The use of short tandem repeat polymorphisms (STRPs) are becoming increasingly important as markers for linkage analysis due to their large numbers of the human genome and their high degree of polymorphism. Fluorescence based detection of the STRP pattern using the LI-COR model 4000S automated DNA sequencer eliminates the need for radioactivity and produces a digitized image that can be used for the analysis of the polymorphisms. In an effort to reduce the cost of STRP analysis, we have synthesized primers with a 19 bp extension complementary to the sequence of the M13 primer on the 5{prime} end of onemore » of the two primers used in the amplification of the STRP instead of using primers with direct conjugation of the infrared fluorescent dye. Up to 5 primer pairs can be multiplexed together with the M13 primer-dye conjugate as the sole primer conjugated to the fluorescent dye. Comparisons between primers that have been directly conjugated to the fluor with those having the M13 sequence extension show no difference in the ability to determine the STRP pattern. At present, the entire Weber 4A set of STRP markers is available with the M13 5{prime} extension. We are currently using this technique for linkage analysis of familial breast cancer and asthma. The combination of STRP analysis using fluorescence detection will allow this technique to be fully automated for allele scoring and linkage analysis.« less

  14. Genetic variation of 'Candidatus Liberibacter solanacearum' haplotype C and identification of a novel haplotype from Trioza urticae and stinging nettle.

    PubMed

    Haapalainen, Minna L; Wang, Jinhui; Latvala, Satu; Lehtonen, Mikko T; Pirhonen, Minna; Nissinen, Anne I

    2018-03-30

    'Candidatus Liberibacter solanacearum' (CLso) haplotype C is associated with disease in carrots and transmitted by the carrot psyllid Trioza apicalis. To identify possible other sources and vectors of this pathogen in Finland, samples were taken of wild plants within and near the carrot fields, the psyllids feeding on these plants, parsnips growing next to carrots, and carrot seeds. For analyzing the genotype of the CLso positive samples, a multi-locus sequence typing (MLST) scheme was developed. CLso haplotype C was detected in 11% of the Trioza anthrisci samples, in 35% of the Anthriscus sylvestris plants with discoloration, and in parsnips showing leaf discoloration. MLST revealed that the CLso in T. anthrisci and most A. sylvestris plants represent different strains than the bacteria found in T. apicalis and the cultivated plants. CLso haplotype D was detected in two of the 34 carrot seed lots tested, but was not detected in the plants grown from these seeds. Phylogenetic analysis by UPGMA clustering suggested that the haplotype D is more closely related to the haplotype A than to C. A novel, sixth haplotype of CLso, most closely related to A and D, was found in the psyllid Trioza urticae and stinging nettle (Urtica dioica, Urticaceae), and named as haplotype U.

  15. A Large Population Genetic Study of 15 Autosomal Short Tandem Repeat Loci for Establishment of Korean DNA Profile Database

    PubMed Central

    Yoo, Seong Yeon; Cho, Nam Soo; Park, Myung Jin; Seong, Ki Min; Hwang, Jung Ho; Song, Seok Bean; Han, Myun Soo; Lee, Won Tae; Chung, Ki Wha

    2011-01-01

    Genotyping of highly polymorphic short tandem repeat (STR) markers is widely used for the genetic identification of individuals in forensic DNA analyses and in paternity disputes. The National DNA Profile Databank recently established by the DNA Identification Act in Korea contains the computerized STR DNA profiles of individuals convicted of crimes. For the establishment of a large autosomal STR loci population database, 1805 samples were obtained at random from Korean individuals and 15 autosomal STR markers were analyzed using the AmpFlSTR Identifiler PCR Amplification kit. For the 15 autosomal STR markers, no deviations from the Hardy-Weinberg equilibrium were observed. The most informative locus in our data set was the D2S1338 with a discrimination power of 0.9699. The combined matching probability was 1.521 × 10-17. This large STR profile dataset including atypical alleles will be important for the establishment of the Korean DNA database and for forensic applications. PMID:21597912

  16. A large-scale dataset of single and mixed-source short tandem repeat profiles to inform human identification strategies: PROVEDIt.

    PubMed

    Alfonse, Lauren E; Garrett, Amanda D; Lun, Desmond S; Duffy, Ken R; Grgicak, Catherine M

    2018-01-01

    DNA-based human identity testing is conducted by comparison of PCR-amplified polymorphic Short Tandem Repeat (STR) motifs from a known source with the STR profiles obtained from uncertain sources. Samples such as those found at crime scenes often result in signal that is a composite of incomplete STR profiles from an unknown number of unknown contributors, making interpretation an arduous task. To facilitate advancement in STR interpretation challenges we provide over 25,000 multiplex STR profiles produced from one to five known individuals at target levels ranging from one to 160 copies of DNA. The data, generated under 144 laboratory conditions, are classified by total copy number and contributor proportions. For the 70% of samples that were synthetically compromised, we report the level of DNA damage using quantitative and end-point PCR. In addition, we characterize the complexity of the signal by exploring the number of detected alleles in each profile. Copyright © 2017 Elsevier B.V. All rights reserved.

  17. A large population genetic study of 15 autosomal short tandem repeat loci for establishment of Korean DNA profile database.

    PubMed

    Yoo, Seong Yeon; Cho, Nam Soo; Park, Myung Jin; Seong, Ki Min; Hwang, Jung Ho; Song, Seok Bean; Han, Myun Soo; Lee, Won Tae; Chung, Ki Wha

    2011-07-01

    Genotyping of highly polymorphic short tandem repeat (STR) markers is widely used for the genetic identification of individuals in forensic DNA analyses and in paternity disputes. The National DNA Profile Databank recently established by the DNA Identification Act in Korea contains the computerized STR DNA profiles of individuals convicted of crimes. For the establishment of a large autosomal STR loci population database, 1805 samples were obtained at random from Korean individuals and 15 autosomal STR markers were analyzed using the AmpFlSTR Identifiler PCR Amplification kit. For the 15 autosomal STR markers, no deviations from the Hardy-Weinberg equilibrium were observed. The most informative locus in our data set was the D2S1338 with a discrimination power of 0.9699. The combined matching probability was 1.521 × 10(-17). This large STR profile dataset including atypical alleles will be important for the establishment of the Korean DNA database and for forensic applications.

  18. Characterization of Escherichia coli O157:H7 in New Zealand using multiple-locus variable-number tandem-repeat analysis.

    PubMed

    Dyet, K H; Robertson, I; Turbitt, E; Carter, P E

    2011-03-01

    Recently, multiple-locus variable-number tandem-repeat analysis (MLVA) has been proposed as an alternative to pulsed-field gel electrophoresis (PFGE) for characterization of Escherichia coli O157:H7. In this study we characterized 118 E. coli O157:H7 isolates from cases of gastrointestinal disease in New Zealand using XbaI PFGE profiles and a MLVA scheme that assessed variability in eight polymorphic loci. The 118 isolates characterized included all 80 E. coli O157:H7 referred to New Zealand's Enteric Reference Laboratory in 2006 and 29 phage-type 2 isolates from 2005. When applied to these isolates the discriminatory power of PFGE and MLVA was not significantly different. However, MLVA data may be more epidemiologically relevant as isolates from family clusters of disease had identical MLVA profiles, even when the XbaI PFGE profiles differed slightly. Furthermore, most isolates with indistinguishable XbaI PFGE profiles that did not appear to be epidemiologically related had distinct MLVA profiles.

  19. Multicolor-based discrimination of 21 short tandem repeats and amelogenin using four fluorescent universal primers.

    PubMed

    Asari, Masaru; Okuda, Katsuhiro; Hoshina, Chisato; Omura, Tomohiro; Tasaki, Yoshikazu; Shiono, Hiroshi; Matsubara, Kazuo; Shimizu, Keiko

    2016-02-01

    The aim of this study was to develop a cost-effective genotyping method using high-quality DNA for human identification. A total of 21 short tandem repeats (STRs) and amelogenin were selected, and fluorescent fragments at 22 loci were simultaneously amplified in a single-tube reaction using locus-specific primers with 24-base universal tails and four fluorescent universal primers. Several nucleotide substitutions in universal tails and fluorescent universal primers enabled the detection of specific fluorescent fragments from the 22 loci. Multiplex polymerase chain reaction (PCR) produced intense FAM-, VIC-, NED-, and PET-labeled fragments ranging from 90 to 400 bp, and these fragments were discriminated using standard capillary electrophoretic analysis. The selected 22 loci were also analyzed using two commercial kits (the AmpFLSTR Identifiler Kit and the PowerPlex ESX 17 System), and results for two loci (D19S433 and D16S539) were discordant between these kits due to mutations at the primer binding sites. All genotypes from the 100 samples were determined using 2.5 ng of DNA by our method, and the expected alleles were completely recovered. Multiplex 22-locus genotyping using four fluorescent universal primers effectively reduces the costs to less than 20% of genotyping using commercial kits, and our method would be useful to detect silent alleles from commercial kit analysis. Copyright © 2015 Elsevier Inc. All rights reserved.

  20. Evaluation of a highly discriminating multiplex multi-locus variable-number of tandem-repeats (MLVA) analysis for Vibrio cholerae.

    PubMed

    Olsen, Jaran S; Aarskaug, Tone; Skogan, Gunnar; Fykse, Else Marie; Ellingsen, Anette Bauer; Blatny, Janet M

    2009-09-01

    Vibrio cholerae is the etiological agent of cholera and may be used in bioterror actions due to the easiness of its dissemination, and the public fear for acquiring the cholera disease. A simple and highly discriminating method for connecting clinical and environmental isolates of V. cholerae is needed in microbial forensics. Twelve different loci containing variable numbers of tandem-repeats (VNTRs) were evaluated in which six loci were polymorphic. Two multiplex reactions containing PCR primers targeting these six VNTRs resulted in successful DNA amplification of 142 various environmental and clinical V. cholerae isolates. The genetic distribution inside the V. cholerae strain collection was used to evaluate the discriminating power (Simpsons Diversity Index=0.99) of this new MLVA analysis, showing that the assay have a potential to differentiate between various strains, but also to identify those isolates which are collected from a common V. cholerae outbreak. This work has established a rapid and highly discriminating MLVA assay useful for track back analyses and/or forensic studies of V. cholerae infections.

  1. Crystal structures of ryanodine receptor SPRY1 and tandem-repeat domains reveal a critical FKBP12 binding determinant

    NASA Astrophysics Data System (ADS)

    Yuchi, Zhiguang; Yuen, Siobhan M. Wong King; Lau, Kelvin; Underhill, Ainsley Q.; Cornea, Razvan L.; Fessenden, James D.; van Petegem, Filip

    2015-08-01

    Ryanodine receptors (RyRs) form calcium release channels located in the membranes of the sarcoplasmic and endoplasmic reticulum. RyRs play a major role in excitation-contraction coupling and other Ca2+-dependent signalling events, and consist of several globular domains that together form a large assembly. Here we describe the crystal structures of the SPRY1 and tandem-repeat domains at 1.2-1.5 Å resolution, which reveal several structural elements not detected in recent cryo-EM reconstructions of RyRs. The cryo-EM studies disagree on the position of SPRY domains, which had been proposed based on homology modelling. Computational docking of the crystal structures, combined with FRET studies, show that the SPRY1 domain is located next to FK506-binding protein (FKBP). Molecular dynamics flexible fitting and mutagenesis experiments suggest a hydrophobic cluster within SPRY1 that is crucial for FKBP binding. A RyR1 disease mutation, N760D, appears to directly impact FKBP binding through interfering with SPRY1 folding.

  2. Mitochondrial haplotypes are not associated with mice selectively bred for high voluntary wheel running.

    PubMed

    Wone, Bernard W M; Yim, Won C; Schutz, Heidi; Meek, Thomas H; Garland, Theodore

    2018-04-04

    Mitochondrial haplotypes have been associated with human and rodent phenotypes, including nonshivering thermogenesis capacity, learning capability, and disease risk. Although the mammalian mitochondrial D-loop is highly polymorphic, D-loops in laboratory mice are identical, and variation occurs elsewhere mainly between nucleotides 9820 and 9830. Part of this region codes for the tRNA Arg gene and is associated with mitochondrial densities and number of mtDNA copies. We hypothesized that the capacity for high levels of voluntary wheel-running behavior would be associated with mitochondrial haplotype. Here, we analyzed the mtDNA polymorphic region in mice from each of four replicate lines selectively bred for 54 generations for high voluntary wheel running (HR) and from four control lines (Control) randomly bred for 54 generations. Sequencing the polymorphic region revealed a variable number of adenine repeats. Single nucleotide polymorphisms (SNPs) varied from 2 to 3 adenine insertions, resulting in three haplotypes. We found significant genetic differentiations between the HR and Control groups (F st  = 0.779, p ≤ 0.0001), as well as among the replicate lines of mice within groups (F sc  = 0.757, p ≤ 0.0001). Haplotypes, however, were not strongly associated with voluntary wheel running (revolutions run per day), nor with either body mass or litter size. This system provides a useful experimental model to dissect the physiological processes linking mitochondrial, genomic SNPs, epigenetics, or nuclear-mitochondrial cross-talk to exercise activity. Copyright © 2018. Published by Elsevier B.V.

  3. New HLA haplotype frequency reference standards: high-resolution and large sample typing of HLA DR-DQ haplotypes in a sample of European Americans.

    PubMed

    Klitz, W; Maiers, M; Spellman, S; Baxter-Lowe, L A; Schmeckpeper, B; Williams, T M; Fernandez-Viña, M

    2003-10-01

    A collaborative study involving a large sample of European Americans was typed for the histocompatibility loci of the HLA DR-DQ region and subjected to intensive typing validation measures in order to accurately determine haplotype composition and frequency. The resulting tables have immediate application to HLA typing and allogeneic transplantation. The loci within the DR-DQ region are especially valuable for such an undertaking because of their tight linkage and high linkage disequilibrium. The 3798 haplotypes, derived from 1899 unrelated individuals, had a total of 75 distinct DRB1-DQA1-DQB1 haplotypes. The frequency distribution of the haplotypes was right skewed with haplotypes occurring at a frequency of less than 1% numbering 59 and yet constituting less than 12% of the total sample. Given DRB1 typing, it was possible to infer the exact DQA1 and DQB1 composition of a haplotype with high confidence (>90% likelihood) in 21 of the 35 high-resolution DRB1 alleles present in the sample. Of the DRB1 alleles without high reliability for DQ haplotype inference, only *0401, *0701 and *1302 were common, the remaining 11 DRB1 alleles constituting less than 5% of the total sample. This approach failed for the 13 serologically equivalent DR alleles in which only 33% of DQ haplotypes could be reliably inferred. The 36 DQA1-DQB1 haplotypes present in the total sample conformed to the known pattern of permissible heterodimers. Four DQA1-DQB1 haplotypes, all rare, are reported here for the first time. The haplotype frequency tables are suitable as a reference standard for HLA typing of the DR and DQ loci in European Americans.

  4. Variable Number of Tandem Repeats in Salmonella enterica subsp. enterica for Typing Purposes

    PubMed Central

    Ramisse, Vincent; Houssu, Perrine; Hernandez, Eric; Denoeud, France; Hilaire, Valérie; Lisanti, Olivier; Ramisse, Françoise; Cavallo, Jean-Didier; Vergnaud, Gilles

    2004-01-01

    The genomic sequences of Salmonella enterica subsp. enterica strains CT18, Ty2 (serovar Typhi), and LT2 (serovar Typhimurium) were analyzed for potential variable number tandem repeats (VNTRs). A multiple-locus VNTR analysis (MLVA) of 99 strains of S. enterica supsp. enterica based on 10 VNTRs distinguished 52 genotypes and placed them into four groups. All strains tested were independent human isolates from France and did not reflect isolates from outbreak episodes. Of these 10 VNTRs, 7 showed variability within serovar Typhi, whereas 1 showed variability within serovar Typhimurium. Four VNTRs showed high Nei's diversity indices (DIs) of 0.81 to 0.87 within serovar Typhi (n = 27). Additionally, three of these more variable VNTRs showed DIs of 0.18 to 0.58 within serovar Paratyphi A (n = 10). The VNTR polymorphic site within multidrug-resistant (MDR) serovar Typhimurium isolates (n = 39; resistance to ampicillin, chloramphenicol, spectinomycin, sulfonamides, and tetracycline) showed a DI of 0.81. Cluster analysis not only identified three genetically distinct groups consistent with the present serovar classification of salmonellae (serovars Typhi, Paratyphi A, and Typhimurium) but also discriminated 25 subtypes (93%) within serovar Typhi isolates. The analysis discriminated only eight subtypes within serovar Typhimurium isolates resistant to ampicillin, chloramphenicol, spectinomycin, sulfonamides, and tetracycline, possibly reflecting the emergence in the mid-1990s of the DT104 phage type, which often displays such an MDR spectrum. Coupled with the ongoing improvements in automated procedures offered by capillary electrophoresis, use of these markers is proposed in further investigations of the potential of MLVA in outbreaks of salmonellosis, especially outbreaks of typhoid fever. PMID:15583305

  5. Low expression of a Ddm7/Ldm7-hybrid mutant (D/Ldm7) in the novel haplotype H-2nc identified in atopic dermatitis model NC/Nga mice.

    PubMed

    Ohkusu-Tsukada, Kozo; Yamashita, Tadashi; Tsukada, Teruyo; Takahashi, Kimimasa

    2017-12-22

    Environmental factors and the major histocompatibility complex (MHC) are involved in the pathogenesis of atopic dermatitis (AD). However, MHC type (H2 haplotype) of AD model mice NC/Nga is poorly understood. Alloreactive CD8 + or CD4 + T cells in NC/Nga strongly responded to each antigen-presenting cells (A/J: H-2 a , C57BL/6: H-2 b , BALB/c: H-2 d , or C3H/HeJ: H-2 k ), suggesting that NC/Nga has other H2 haplotype. Polymorphic microsatellite (CA) n repeats in TNF-α gene differ based on the H2 haplotype at present. NC/Nga's (CA) n repeats (n = 19) were different from other examined strains, A/J (n = 14), BALB/c (n = 14), C3H/HeJ (n = 16), and C57BL/6 (n = 20). Using flow cytometry and genotyping, we demonstrated the NC/Nga H2 haplotype had a unique phenotype (K d , I-A k , and I-E k ) in which D d and L d lacked as protein despite sensitive mRNA detection. The loss of D d and L d was caused by forming a unique D dm7 /L dm7 -hybrid mutant (D/L dm7 ). We propose to call this novel H2 haplotype the "H-2 nc ," and provide the important information regarding the AD research using NC/Nga mice.

  6. Haplotype diversity in 11 candidate genes across four populations.

    PubMed

    Beaty, T H; Fallin, M D; Hetmanski, J B; McIntosh, I; Chong, S S; Ingersoll, R; Sheng, X; Chakraborty, R; Scott, A F

    2005-09-01

    Analysis of haplotypes based on multiple single-nucleotide polymorphisms (SNP) is becoming common for both candidate gene and fine-mapping studies. Before embarking on studies of haplotypes from genetically distinct populations, however, it is important to consider variation both in linkage disequilibrium (LD) and in haplotype frequencies within and across populations, as both vary. Such diversity will influence the choice of "tagging" SNPs for candidate gene or whole-genome association studies because some markers will not be polymorphic in all samples and some haplotypes will be poorly represented or completely absent. Here we analyze 11 genes, originally chosen as candidate genes for oral clefts, where multiple markers were genotyped on individuals from four populations. Estimated haplotype frequencies, measures of pairwise LD, and genetic diversity were computed for 135 European-Americans, 57 Chinese-Singaporeans, 45 Malay-Singaporeans, and 46 Indian-Singaporeans. Patterns of pairwise LD were compared across these four populations and haplotype frequencies were used to assess genetic variation. Although these populations are fairly similar in allele frequencies and overall patterns of LD, both haplotype frequencies and genetic diversity varied significantly across populations. Such haplotype diversity has implications for designing studies of association involving samples from genetically distinct populations.

  7. ACCA phosphopeptide recognition by the BRCT repeats of BRCA1.

    PubMed

    Ray, Hind; Moreau, Karen; Dizin, Eva; Callebaut, Isabelle; Venezia, Nicole Dalla

    2006-06-16

    The tumour suppressor gene BRCA1 encodes a 220 kDa protein that participates in multiple cellular processes. The BRCA1 protein contains a tandem of two BRCT repeats at its carboxy-terminal region. The majority of disease-associated BRCA1 mutations affect this region and provide to the BRCT repeats a central role in the BRCA1 tumour suppressor function. The BRCT repeats have been shown to mediate phospho-dependant protein-protein interactions. They recognize phosphorylated peptides using a recognition groove that spans both BRCT repeats. We previously identified an interaction between the tandem of BRCA1 BRCT repeats and ACCA, which was disrupted by germ line BRCA1 mutations that affect the BRCT repeats. We recently showed that BRCA1 modulates ACCA activity through its phospho-dependent binding to ACCA. To delineate the region of ACCA that is crucial for the regulation of its activity by BRCA1, we searched for potential phosphorylation sites in the ACCA sequence that might be recognized by the BRCA1 BRCT repeats. Using sequence analysis and structure modelling, we proposed the Ser1263 residue as the most favourable candidate among six residues, for recognition by the BRCA1 BRCT repeats. Using experimental approaches, such as GST pull-down assay with Bosc cells, we clearly showed that phosphorylation of only Ser1263 was essential for the interaction of ACCA with the BRCT repeats. We finally demonstrated by immunoprecipitation of ACCA in cells, that the whole BRCA1 protein interacts with ACCA when phosphorylated on Ser1263.

  8. Genotypes and Haplotypes of the Estrogen Receptor α Gene (ESR1) Are Associated With Female-to-Male Gender Dysphoria.

    PubMed

    Cortés-Cortés, Joselyn; Fernández, Rosa; Teijeiro, Nerea; Gómez-Gil, Esther; Esteva, Isabel; Almaraz, Mari Cruz; Guillamón, Antonio; Pásaro, Eduardo

    2017-03-01

    Gender dysphoria, a marked incongruence between one's experienced gender and biological sex, is commonly believed to arise from discrepant cerebral and genital sexual differentiation. With the discovery that estrogen receptor β is associated with female-to-male (FtM) but not with male-to-female (MtF) gender dysphoria, and given estrogen receptor α involvement in central nervous system masculinization, it was hypothesized that estrogen receptor α, encoded by the ESR1 gene, also might be implicated. To investigate whether ESR1 polymorphisms (TA)n-rs3138774, PvuII-rs2234693, and XbaI-rs9340799 and their haplotypes are associated with gender dysphoria in adults. Molecular analysis was performed in peripheral blood samples from 183 FtM subjects, 184 MtF subjects, and 394 sex- and ethnically-matched controls. Genotype and haplotype analyses of the (TA)n-rs3138774, PvuII-rs2234693, and XbaI-rs9340799 polymorphisms. Allele and genotype frequencies for the polymorphism XbaI were statistically significant only in FtM vs control XX subjects (P = .021 and P = .020). In XX individuals, the A/G genotype was associated with a low risk of gender dysphoria (odds ratio [OR] = 0.34; 95% CI = 0.16-0.74; P = .011); in XY individuals, the A/A genotype implied a low risk of gender dysphoria (OR = 0.39; 95% CI = 0.17-0.89; P = .008). Binary logistic regression showed partial effects for all three polymorphisms in FtM but not in MtF subjects. The three polymorphisms were in linkage disequilibrium: a small number of TA repeats was linked to the presence of PvuII and XbaI restriction sites (haplotype S-T-A), and a large number of TA repeats was linked to the absence of these restriction sites (haplotype L-C-G). In XX individuals, the presence of haplotype L-C-G carried a low risk of gender dysphoria (OR = 0.66; 95% CI = 0.44-0.99; P = .046), whereas the presence of haplotype L-C-A carried a high susceptibility to gender dysphoria (OR = 3.96; 95% CI = 1.04-15.02; P = .044

  9. Globally dispersed Y chromosomal haplotypes in wild and domestic sheep.

    PubMed

    Meadows, J R S; Hanotte, O; Drögemüller, C; Calvo, J; Godfrey, R; Coltman, D; Maddox, J F; Marzanov, N; Kantanen, J; Kijas, J W

    2006-10-01

    To date, investigations of genetic diversity and the origins of domestication in sheep have utilised autosomal microsatellites and variation in the mitochondrial genome. We present the first analysis of both domestic and wild sheep using genetic markers residing on the ovine Y chromosome. Analysis of a single nucleotide polymorphism (oY1) in the SRY promoter region revealed that allele A-oY1 was present in all wild bighorn sheep (Ovis canadensis), two subspecies of thinhorn sheep (Ovis dalli), European Mouflon (Ovis musimon) and the Barbary (Ammontragis lervia). A-oY1 also had the highest frequency (71.4%) within 458 domestic sheep drawn from 65 breeds sampled from Africa, Asia, Australia, the Caribbean, Europe, the Middle East and Central Asia. Sequence analysis of a second locus, microsatellite SRYM18, revealed a compound repeat array displaying fixed differences, which identified bighorn and thinhorn sheep as distinct from the European Mouflon and domestic animals. Combined genotypic data identified 11 male-specific haplotypes that represented at least two separate lineages. Investigation of the geographical distribution of each haplotype revealed that one (H6) was both very common and widespread in the global sample of domestic breeds. The remaining haplotypes each displayed more restricted and informative distributions. For example, H5 was likely founded following the domestication of European breeds and was used to trace the recent transportation of animals to both the Caribbean and Australia. A high rate of Y chromosomal dispersal appears to have taken place during the development of domestic sheep as only 12.9% of the total observed variation was partitioned between major geographical regions.

  10. The effect of using genealogy-based haplotypes for genomic prediction.

    PubMed

    Edriss, Vahid; Fernando, Rohan L; Su, Guosheng; Lund, Mogens S; Guldbrandtsen, Bernt

    2013-03-06

    Genomic prediction uses two sources of information: linkage disequilibrium between markers and quantitative trait loci, and additive genetic relationships between individuals. One way to increase the accuracy of genomic prediction is to capture more linkage disequilibrium by regression on haplotypes instead of regression on individual markers. The aim of this study was to investigate the accuracy of genomic prediction using haplotypes based on local genealogy information. A total of 4429 Danish Holstein bulls were genotyped with the 50K SNP chip. Haplotypes were constructed using local genealogical trees. Effects of haplotype covariates were estimated with two types of prediction models: (1) assuming that effects had the same distribution for all haplotype covariates, i.e. the GBLUP method and (2) assuming that a large proportion (π) of the haplotype covariates had zero effect, i.e. a Bayesian mixture method. About 7.5 times more covariate effects were estimated when fitting haplotypes based on local genealogical trees compared to fitting individuals markers. Genealogy-based haplotype clustering slightly increased the accuracy of genomic prediction and, in some cases, decreased the bias of prediction. With the Bayesian method, accuracy of prediction was less sensitive to parameter π when fitting haplotypes compared to fitting markers. Use of haplotypes based on genealogy can slightly increase the accuracy of genomic prediction. Improved methods to cluster the haplotypes constructed from local genealogy could lead to additional gains in accuracy.

  11. Allele frequency distribution for the variable number of tandem repeat locus D10S28 in Tamil Nadu (south India) population.

    PubMed

    Pandian, S K; Kumar, S; Krishnan, M; Dharmalingam, K; Damodaran, C

    1995-09-01

    Allele frequencies were determined in unrelated individuals of Tamil speaking population from the Madras City (Tamil Nadu, South India) area for the polymorphic DNA locus D10S28 using the probe TBQ7. Membranes hybridized with the probe YNH24 were subjected to deprobing and were subsequently hybridized with random priming - labeled, purified inserts of TBQ7. The sizes of the fragments were grouped to 100 bp as well as to arbitrary fixed bins (Federal Bureau of Investigation / Royal Canadian Mounted Police). There were 14 bins in the latter with the most common bin being 11 (1789-1924 bp) with a frequency of 9.8%. We observed a heterozygosity of 92% comparable to Caucasian populations. The data presented here can be used as the basis for utilizing this variable number of tandem repeats (TNTR) DNA marker for paternity determinations and forensic investigations.

  12. iXora: exact haplotype inferencing and trait association.

    PubMed

    Utro, Filippo; Haiminen, Niina; Livingstone, Donald; Cornejo, Omar E; Royaert, Stefan; Schnell, Raymond J; Motamayor, Juan Carlos; Kuhn, David N; Parida, Laxmi

    2013-06-06

    We address the task of extracting accurate haplotypes from genotype data of individuals of large F1 populations for mapping studies. While methods for inferring parental haplotype assignments on large F1 populations exist in theory, these approaches do not work in practice at high levels of accuracy. We have designed iXora (Identifying crossovers and recombining alleles), a robust method for extracting reliable haplotypes of a mapping population, as well as parental haplotypes, that runs in linear time. Each allele in the progeny is assigned not just to a parent, but more precisely to a haplotype inherited from the parent. iXora shows an improvement of at least 15% in accuracy over similar systems in literature. Furthermore, iXora provides an easy-to-use, comprehensive environment for association studies and hypothesis checking in populations of related individuals. iXora provides detailed resolution in parental inheritance, along with the capability of handling very large populations, which allows for accurate haplotype extraction and trait association. iXora is available for non-commercial use from http://researcher.ibm.com/project/3430.

  13. Ancestral Asian source(s) of new world Y-chromosome founder haplotypes.

    PubMed Central

    Karafet, T M; Zegura, S L; Posukh, O; Osipova, L; Bergen, A; Long, J; Goldman, D; Klitz, W; Harihara, S; de Knijff, P; Wiebe, V; Griffiths, R C; Templeton, A R; Hammer, M F

    1999-01-01

    Haplotypes constructed from Y-chromosome markers were used to trace the origins of Native Americans. Our sample consisted of 2,198 males from 60 global populations, including 19 Native American and 15 indigenous North Asian groups. A set of 12 biallelic polymorphisms gave rise to 14 unique Y-chromosome haplotypes that were unevenly distributed among the populations. Combining multiallelic variation at two Y-linked microsatellites (DYS19 and DXYS156Y) with the unique haplotypes results in a total of 95 combination haplotypes. Contra previous findings based on Y- chromosome data, our new results suggest the possibility of more than one Native American paternal founder haplotype. We postulate that, of the nine unique haplotypes found in Native Americans, haplotypes 1C and 1F are the best candidates for major New World founder haplotypes, whereas haplotypes 1B, 1I, and 1U may either be founder haplotypes and/or have arrived in the New World via recent admixture. Two of the other four haplotypes (YAP+ haplotypes 4 and 5) are probably present because of post-Columbian admixture, whereas haplotype 1G may have originated in the New World, and the Old World source of the final New World haplotype (1D) remains unresolved. The contrasting distribution patterns of the two major candidate founder haplotypes in Asia and the New World, as well as the results of a nested cladistic analysis, suggest the possibility of more than one paternal migration from the general region of Lake Baikal to the Americas. PMID:10053017

  14. Mitochondrial haplotype variation and phylogeography of Iberian brown trout populations.

    PubMed

    MacHordom, A; Suárez, J; Almodóvar, A; Bautista, J M

    2000-09-01

    The biogeographical distribution of brown trout mitochondrial DNA haplotypes throughout the Iberian Peninsula was established by polymerase chain reaction-restriction fragment polymorphism analysis. The study of 507 specimens from 58 localities representing eight widely separated Atlantic-slope (north and west Iberian coasts) and six Mediterranean drainage systems served to identify five main groups of mitochondrial haplotypes: (i) haplotypes corresponding to non-native, hatchery-reared brown trout that were widely distributed but also found in wild populations of northern Spain (Cantabrian slope); (ii) a widespread Atlantic haplotype group; (iii) a haplotype restricted to the Duero Basin; (iv) a haplotype shown by southern Iberian populations; and (v) a Mediterranean haplotype. The Iberian distribution of these haplotypes reflects both the current fishery management policy of introducing non-native brown trout, and Messinian palaeobiogeography. Our findings complement and extend previous allozyme studies on Iberian brown trout and improve present knowledge of glacial refugia and postglacial movement of brown trout lineages.

  15. Haplotype-Based Association Analysis via Variance-Components Score Test

    PubMed Central

    Tzeng, Jung-Ying ; Zhang, Daowen 

    2007-01-01

    Haplotypes provide a more informative format of polymorphisms for genetic association analysis than do individual single-nucleotide polymorphisms. However, the practical efficacy of haplotype-based association analysis is challenged by a trade-off between the benefits of modeling abundant variation and the cost of the extra degrees of freedom. To reduce the degrees of freedom, several strategies have been considered in the literature. They include (1) clustering evolutionarily close haplotypes, (2) modeling the level of haplotype sharing, and (3) smoothing haplotype effects by introducing a correlation structure for haplotype effects and studying the variance components (VC) for association. Although the first two strategies enjoy a fair extent of power gain, empirical evidence showed that VC methods may exhibit only similar or less power than the standard haplotype regression method, even in cases of many haplotypes. In this study, we report possible reasons that cause the underpowered phenomenon and show how the power of the VC strategy can be improved. We construct a score test based on the restricted maximum likelihood or the marginal likelihood function of the VC and identify its nontypical limiting distribution. Through simulation, we demonstrate the validity of the test and investigate the power performance of the VC approach and that of the standard haplotype regression approach. With suitable choices for the correlation structure, the proposed method can be directly applied to unphased genotypic data. Our method is applicable to a wide-ranging class of models and is computationally efficient and easy to implement. The broad coverage and the fast and easy implementation of this method make the VC strategy an effective tool for haplotype analysis, even in modern genomewide association studies. PMID:17924336

  16. The effect of using genealogy-based haplotypes for genomic prediction

    PubMed Central

    2013-01-01

    Background Genomic prediction uses two sources of information: linkage disequilibrium between markers and quantitative trait loci, and additive genetic relationships between individuals. One way to increase the accuracy of genomic prediction is to capture more linkage disequilibrium by regression on haplotypes instead of regression on individual markers. The aim of this study was to investigate the accuracy of genomic prediction using haplotypes based on local genealogy information. Methods A total of 4429 Danish Holstein bulls were genotyped with the 50K SNP chip. Haplotypes were constructed using local genealogical trees. Effects of haplotype covariates were estimated with two types of prediction models: (1) assuming that effects had the same distribution for all haplotype covariates, i.e. the GBLUP method and (2) assuming that a large proportion (π) of the haplotype covariates had zero effect, i.e. a Bayesian mixture method. Results About 7.5 times more covariate effects were estimated when fitting haplotypes based on local genealogical trees compared to fitting individuals markers. Genealogy-based haplotype clustering slightly increased the accuracy of genomic prediction and, in some cases, decreased the bias of prediction. With the Bayesian method, accuracy of prediction was less sensitive to parameter π when fitting haplotypes compared to fitting markers. Conclusions Use of haplotypes based on genealogy can slightly increase the accuracy of genomic prediction. Improved methods to cluster the haplotypes constructed from local genealogy could lead to additional gains in accuracy. PMID:23496971

  17. Multiple-locus variable-number tandem-repeat analysis of the swine dysentery pathogen, Brachyspira hyodysenteriae.

    PubMed

    Hidalgo, Alvaro; Carvajal, Ana; La, Tom; Naharro, Germán; Rubio, Pedro; Phillips, Nyree D; Hampson, David J

    2010-08-01

    The spirochete Brachyspira hyodysenteriae is the causative agent of swine dysentery, a severe colonic infection of pigs that has a considerable economic impact in many swine-producing countries. In spite of its importance, knowledge about the global epidemiology and population structure of B. hyodysenteriae is limited. Progress in this area has been hampered by the lack of a low-cost, portable, and discriminatory method for strain typing. The aim of the current study was to develop and test a multiple-locus variable-number tandem-repeat analysis (MLVA) method that could be used in basic veterinary diagnostic microbiology laboratories equipped with PCR technology or in more advanced laboratories with access to capillary electrophoresis. Based on eight loci, and when performed on isolates from different farms in different countries, as well as type and reference strains, the MLVA technique developed was highly discriminatory (Hunter and Gaston discriminatory index, 0.938 [95% confidence interval, 0.9175 to 0.9584]) while retaining a high phylogenetic value. Using the technique, the species was shown to be diverse (44 MLVA types from 172 isolates and strains), although isolates were stable in herds over time. The population structure appeared to be clonal. The finding of B. hyodysenteriae MLVA type 3 in piggeries in three European countries, as well as other, related, strains in different countries, suggests that spreading of the pathogen via carrier pigs is likely. MLVA overcame drawbacks associated with previous typing techniques for B. hyodysenteriae and was a powerful method for epidemiologic and population structure studies on this important pathogenic spirochete.

  18. Tandem betatron

    DOEpatents

    Keinigs, Rhonald K.

    1992-01-01

    Two betatrons are provided in tandem for alternately accelerating an electron beam to avoid the single flux swing limitation of conventional betatrons and to accelerate the electron beam to high energies. The electron beam is accelerated in a first betatron during a period of increasing magnetic flux. The eletron beam is extracted from the first betatron as a peak magnetic flux is reached and then injected into a second betatron at a time of minimum magnetic flux in the second betatron. The cycle may be repeated until the desired electron beam energy is obtained. In one embodiment, the second betatron is axially offset from the first betatron to provide for electron beam injection directly at the axial location of the beam orbit in the second betatron.

  19. Transcription of highly repetitive tandemly organized DNA in amphibians and birds: A historical overview and modern concepts.

    PubMed

    Trofimova, Irina; Krasikova, Alla

    2016-12-01

    Tandemly organized highly repetitive DNA sequences are crucial structural and functional elements of eukaryotic genomes. Despite extensive evidence, satellite DNA remains an enigmatic part of the eukaryotic genome, with biological role and significance of tandem repeat transcripts remaining rather obscure. Data on tandem repeats transcription in amphibian and avian model organisms is fragmentary despite their genomes being thoroughly characterized. Review systematically covers historical and modern data on transcription of amphibian and avian satellite DNA in somatic cells and during meiosis when chromosomes acquire special lampbrush form. We highlight how transcription of tandemly repetitive DNA sequences is organized in interphase nucleus and on lampbrush chromosomes. We offer LTR-activation hypotheses of widespread satellite DNA transcription initiation during oogenesis. Recent explanations are provided for the significance of high-yield production of non-coding RNA derived from tandemly organized highly repetitive DNA. In many cases the data on the transcription of satellite DNA can be extrapolated from lampbrush chromosomes to interphase chromosomes. Lampbrush chromosomes with applied novel technical approaches such as superresolution imaging, chromosome microdissection followed by high-throughput sequencing, dynamic observation in life-like conditions provide amazing opportunities for investigation mechanisms of the satellite DNA transcription.

  20. Transcription of highly repetitive tandemly organized DNA in amphibians and birds: A historical overview and modern concepts

    PubMed Central

    Krasikova, Alla

    2016-01-01

    ABSTRACT Tandemly organized highly repetitive DNA sequences are crucial structural and functional elements of eukaryotic genomes. Despite extensive evidence, satellite DNA remains an enigmatic part of the eukaryotic genome, with biological role and significance of tandem repeat transcripts remaining rather obscure. Data on tandem repeats transcription in amphibian and avian model organisms is fragmentary despite their genomes being thoroughly characterized. Review systematically covers historical and modern data on transcription of amphibian and avian satellite DNA in somatic cells and during meiosis when chromosomes acquire special lampbrush form. We highlight how transcription of tandemly repetitive DNA sequences is organized in interphase nucleus and on lampbrush chromosomes. We offer LTR-activation hypotheses of widespread satellite DNA transcription initiation during oogenesis. Recent explanations are provided for the significance of high-yield production of non-coding RNA derived from tandemly organized highly repetitive DNA. In many cases the data on the transcription of satellite DNA can be extrapolated from lampbrush chromosomes to interphase chromosomes. Lampbrush chromosomes with applied novel technical approaches such as superresolution imaging, chromosome microdissection followed by high-throughput sequencing, dynamic observation in life-like conditions provide amazing opportunities for investigation mechanisms of the satellite DNA transcription. PMID:27763817

  1. MHC Class II haplotypes of Colombian Amerindian tribes

    PubMed Central

    Yunis, Juan J.; Yunis, Edmond J.; Yunis, Emilio

    2013-01-01

    We analyzed 1041 individuals belonging to 17 Amerindian tribes of Colombia, Chimila, Bari and Tunebo (Chibcha linguistic family), Embera, Waunana (Choco linguistic family), Puinave and Nukak (Maku-Puinave linguistic families), Cubeo, Guanano, Tucano, Desano and Piratapuyo (Tukano linguistic family), Guahibo and Guayabero (Guayabero Linguistic Family), Curripaco and Piapoco (Arawak linguistic family) and Yucpa (Karib linguistic family). for MHC class II haplotypes (HLA-DRB1, DQA1, DQB1). Approximately 90% of the MHC class II haplotypes found among these tribes are haplotypes frequently encountered in other Amerindian tribes. Nonetheless, striking differences were observed among Chibcha and non-Chibcha speaking tribes. The DRB1*04:04, DRB1*04:11, DRB1*09:01 carrying haplotypes were frequently found among non-Chibcha speaking tribes, while the DRB1*04:07 haplotype showed significant frequencies among Chibcha speaking tribes, and only marginal frequencies among non-Chibcha speaking tribes. Our results suggest that the differences in MHC class II haplotype frequency found among Chibcha and non-Chibcha speaking tribes could be due to genetic differentiation in Mesoamerica of the ancestral Amerindian population into Chibcha and non-Chibcha speaking populations before they entered into South America. PMID:23885196

  2. Y-SNPs haplotype diversity in four Chinese cattle breeds.

    PubMed

    Zhang, Runfeng; Cheng, Ming; Li, Xiaofeng; Chen, Fuying; Zheng, Jing; Wang, Xiaofei; Meng, Quanke

    2013-01-01

    To investigate the genetic diversity of Chinese cattle, 96 male samples of 4 Chinese native cattle breeds were investigated using 5 single nucleotide polymorphisms specific to the bovine Y chromosome. Two previously described haplotypes (taurine Y2 and indicine Y3) were detected in 74 and 22 animals, respectively. The haplotype frequencies varied amongst the four native breeds. The taurine Y2 haplotype dominated in the Qinchuan, Dabieshan, and Yunba breeds. However, the indicine Y3 haplotype occurred in high frequency in the Enshi breed. Among the four native breeds, Yunba had the highest haplotype diversity (0.4330 ± 0.0750), followed by Qinchuan (0.2899 ± 0.1028) and Enshi (0.2222 ± 0.1662), Dabieshan was the least differentiated (0.1079 ± 0.0680). Compared with some foreign cattle breeds, the low level of haplotype diversity was detected in our breeds (0.2633 ± 0.1030).

  3. DNA fingerprinting of Shiga-toxin producing Escherichia coli O157 based on Multiple-Locus Variable-Number Tandem-Repeats Analysis (MLVA)

    PubMed Central

    Lindstedt, Bjørn-Arne; Heir, Even; Gjernes, Elisabet; Vardund, Traute; Kapperud, Georg

    2003-01-01

    Background The ability to react early to possible outbreaks of Escherichia coli O157:H7 and to trace possible sources relies on the availability of highly discriminatory and reliable techniques. The development of methods that are fast and has the potential for complete automation is needed for this important pathogen. Methods In all 73 isolates of shiga-toxin producing E. coli O157 (STEC) were used in this study. The two available fully sequenced STEC genomes were scanned for tandem repeated stretches of DNA, which were evaluated as polymorphic markers for isolate identification. Results The 73 E. coli isolates displayed 47 distinct patterns and the MLVA assay was capable of high discrimination between the E. coli O157 strains. The assay was fast and all the steps can be automated. Conclusion The findings demonstrate a novel high discriminatory molecular typing method for the important pathogen E. coli O157 that is fast, robust and offers many advantages compared to current methods. PMID:14664722

  4. Sparse Tensor Decomposition for Haplotype Assembly of Diploids and Polyploids.

    PubMed

    Hashemi, Abolfazl; Zhu, Banghua; Vikalo, Haris

    2018-03-21

    Haplotype assembly is the task of reconstructing haplotypes of an individual from a mixture of sequenced chromosome fragments. Haplotype information enables studies of the effects of genetic variations on an organism's phenotype. Most of the mathematical formulations of haplotype assembly are known to be NP-hard and haplotype assembly becomes even more challenging as the sequencing technology advances and the length of the paired-end reads and inserts increases. Assembly of haplotypes polyploid organisms is considerably more difficult than in the case of diploids. Hence, scalable and accurate schemes with provable performance are desired for haplotype assembly of both diploid and polyploid organisms. We propose a framework that formulates haplotype assembly from sequencing data as a sparse tensor decomposition. We cast the problem as that of decomposing a tensor having special structural constraints and missing a large fraction of its entries into a product of two factors, U and [Formula: see text]; tensor [Formula: see text] reveals haplotype information while U is a sparse matrix encoding the origin of erroneous sequencing reads. An algorithm, AltHap, which reconstructs haplotypes of either diploid or polyploid organisms by iteratively solving this decomposition problem is proposed. The performance and convergence properties of AltHap are theoretically analyzed and, in doing so, guarantees on the achievable minimum error correction scores and correct phasing rate are established. The developed framework is applicable to diploid, biallelic and polyallelic polyploid species. The code for AltHap is freely available from https://github.com/realabolfazl/AltHap . AltHap was tested in a number of different scenarios and was shown to compare favorably to state-of-the-art methods in applications to haplotype assembly of diploids, and significantly outperforms existing techniques when applied to haplotype assembly of polyploids.

  5. [Use of multiple locus variable number tandem repeats analysis for the Brucella systematization].

    PubMed

    Kulakov, Iu K; Kovalev, D A; Misetova, E N; Golovneva, S I; Liapustina, L V; Zheludkov, M M

    2012-01-01

    The methods of molecular-genetic differentiation to strain level acquire increasing significance in the current system of struggle with brucellosis. MLVA (multiple locus variable number tandem repeats analysis) was selected for molecular-genetic differentiation to strain level and simultaneous establishment of the genetic relationship of investigated Brucella strains. The goal of this work was MLVA typing of three pathogenic Brucella species strains with the analysis of stability of chosen loci, discrimination power and concordance to conventional phenotypic methods of the Brucella differentiation for use in systematization of brucellosis causing agents. Twenty six Brucella strains representing reference (n = 15), vaccine (n = 2) and field strains of three pathogenic Brucella species were tested: B. melitensis (n = 3), B. abortus (n = 2), B. suis (n = 2), and isolates (n = 2) with unidentified taxonomic position using MLVA with 9 pairs primers on known variable loci of Brucella genome. The analysis of the stability of chosen loci, discrimination power on Hunter-Gaston discrimination index (HGDI) and consistency to phenotypic methods of identification was performed. MLVA was confirmed for the results of phenotypic methods of identification, stability of the chosen loci in majority reference, and vaccine strains with a high index of variability HGDI 0.9969 for all loci. A dendrogram was plotted on the basis of MLVA data on distributed Brucella strains in related clusters according to its taxonomic species and biovar positions and construction of 25 genotypes. B. melitensis strains formed cluster related to the reference strain of B. melitensis 63/9 biovar 2. Australian isolates of Brucella 83-4 and Brucella 83-6 isolated from rodents formed a cluster distant from other strains of Brucella. MLVA is a promising method for differentiation of Brucella strains with known and unresolved taxonomic status for their systematization and creation of MLVA genotype catalogue that

  6. The upstream Variable Number Tandem Repeat polymorphism of the monoamine oxidase type A gene influences trigeminal pain-related evoked responses.

    PubMed

    Di Lorenzo, Cherubino; Daverio, Andrea; Pasqualetti, Patrizio; Coppola, Gianluca; Giannoudas, Ioannis; Barone, Ylenia; Grieco, Gaetano S; Niolu, Cinzia; Pascale, Esterina; Santorelli, Filippo M; Nicoletti, Ferdinando; Pierelli, Francesco; Siracusano, Alberto; Seri, Stefano; Di Lorenzo, Giorgio

    2014-02-01

    Monoamines have an important role in neural plasticity, a key factor in cortical pain processing that promotes changes in neuronal network connectivity. Monoamine oxidase type A (MAOA) is an enzyme that, due to its modulating role in monoaminergic activity, could play a role in cortical pain processing. The X-linked MAOA gene is characterized by an allelic variant of length, the MAOA upstream Variable Number Tandem Repeat (MAOA-uVNTR) region polymorphism. Two allelic variants of this gene are known, the high-activity MAOA (HAM) and low-activity MAOA (LAM). We investigated the role of MAOA-uVNTR in cortical pain processing in a group of healthy individuals measured by the trigeminal electric pain-related evoked potential (tPREP) elicited by repeated painful stimulation. A group of healthy volunteers was genotyped to detect MAOA-uVNTR polymorphism. Electrical tPREPs were recorded by stimulating the right supraorbital nerve with a concentric electrode. The N2 and P2 component amplitude and latency as well as the N2-P2 inter-peak amplitude were measured. The recording was divided into three blocks, each containing 10 consecutive stimuli and the N2-P2 amplitude was compared between blocks. Of the 67 volunteers, 37 were HAM and 30 were LAM. HAM subjects differed from LAM subjects in terms of amplitude of the grand-averaged and first-block N2-P2 responses (HAM>LAM). The N2-P2 amplitude decreased between the first and third block in HAM subjects but not LAM subjects. The MAOA-uVNTR polymorphism seemed to influence the brain response in a repeated tPREP paradigm and suggested a role of the MAOA as a modulator of neural plasticity related to cortical pain processing. © 2014 Federation of European Neuroscience Societies and John Wiley & Sons Ltd.

  7. First Worldwide Proficiency Study on Variable-Number Tandem-Repeat Typing of Mycobacterium tuberculosis Complex Strains

    PubMed Central

    de Beer, Jessica L.; Kremer, Kristin; Ködmön, Csaba; Supply, Philip

    2012-01-01

    Although variable-number tandem-repeat (VNTR) typing has gained recognition as the new standard for the DNA fingerprinting of Mycobacterium tuberculosis complex (MTBC) isolates, external quality control programs have not yet been developed. Therefore, we organized the first multicenter proficiency study on 24-locus VNTR typing. Sets of 30 DNAs of MTBC strains, including 10 duplicate DNA samples, were distributed among 37 participating laboratories in 30 different countries worldwide. Twenty-four laboratories used an in-house-adapted method with fragment sizing by gel electrophoresis or an automated DNA analyzer, nine laboratories used a commercially available kit, and four laboratories used other methods. The intra- and interlaboratory reproducibilities of VNTR typing varied from 0% to 100%, with averages of 72% and 60%, respectively. Twenty of the 37 laboratories failed to amplify particular VNTR loci; if these missing results were ignored, the number of laboratories with 100% interlaboratory reproducibility increased from 1 to 5. The average interlaboratory reproducibility of VNTR typing using a commercial kit was better (88%) than that of in-house-adapted methods using a DNA analyzer (70%) or gel electrophoresis (50%). Eleven laboratories using in-house-adapted manual typing or automated typing scored inter- and intralaboratory reproducibilities of 80% or higher, which suggests that these approaches can be used in a reliable way. In conclusion, this first multicenter study has documented the worldwide quality of VNTR typing of MTBC strains and highlights the importance of international quality control to improve genotyping in the future. PMID:22170917

  8. Repeatability and Reproducibility in Proteomic Identifications by Liquid Chromatography—Tandem Mass Spectrometry

    PubMed Central

    Tabb, David L.; Vega-Montoto, Lorenzo; Rudnick, Paul A.; Variyath, Asokan Mulayath; Ham, Amy-Joan L.; Bunk, David M.; Kilpatrick, Lisa E.; Billheimer, Dean D.; Blackman, Ronald K.; Cardasis, Helene L.; Carr, Steven A.; Clauser, Karl R.; Jaffe, Jacob D.; Kowalski, Kevin A.; Neubert, Thomas A.; Regnier, Fred E.; Schilling, Birgit; Tegeler, Tony J.; Wang, Mu; Wang, Pei; Whiteaker, Jeffrey R.; Zimmerman, Lisa J.; Fisher, Susan J.; Gibson, Bradford W.; Kinsinger, Christopher R.; Mesri, Mehdi; Rodriguez, Henry; Stein, Steven E.; Tempst, Paul; Paulovich, Amanda G.; Liebler, Daniel C.; Spiegelman, Cliff

    2009-01-01

    The complexity of proteomic instrumentation for LC-MS/MS introduces many possible sources of variability. Data-dependent sampling of peptides constitutes a stochastic element at the heart of discovery proteomics. Although this variation impacts the identification of peptides, proteomic identifications are far from completely random. In this study, we analyzed interlaboratory data sets from the NCI Clinical Proteomic Technology Assessment for Cancer to examine repeatability and reproducibility in peptide and protein identifications. Included data spanned 144 LC-MS/MS experiments on four Thermo LTQ and four Orbitrap instruments. Samples included yeast lysate, the NCI-20 defined dynamic range protein mix, and the Sigma UPS 1 defined equimolar protein mix. Some of our findings reinforced conventional wisdom, such as repeatability and reproducibility being higher for proteins than for peptides. Most lessons from the data, however, were more subtle. Orbitraps proved capable of higher repeatability and reproducibility, but aberrant performance occasionally erased these gains. Even the simplest protein digestions yielded more peptide ions than LC-MS/MS could identify during a single experiment. We observed that peptide lists from pairs of technical replicates overlapped by 35–60%, giving a range for peptide-level repeatability in these experiments. Sample complexity did not appear to affect peptide identification repeatability, even as numbers of identified spectra changed by an order of magnitude. Statistical analysis of protein spectral counts revealed greater stability across technical replicates for Orbitraps, making them superior to LTQ instruments for biomarker candidate discovery. The most repeatable peptides were those corresponding to conventional tryptic cleavage sites, those that produced intense MS signals, and those that resulted from proteins generating many distinct peptides. Reproducibility among different instruments of the same type lagged behind

  9. Independent movement, dimerization and stability of tandem repeats of chicken brain alpha-spectrin

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kusunoki, H.; Minasov, G.; Macdonald, R.I.

    Previous X-ray crystal structures have shown that linkers of five amino acid residues connecting pairs of chicken brain {alpha}-spectrin and human erythroid {beta}-spectrin repeats can undergo bending without losing their {alpha}-helical structure. To test whether bending at one linker can influence bending at an adjacent linker, the structures of two and three repeat fragments of chicken brain {alpha}-spectrin have been determined by X-ray crystallography. The structure of the three-repeat fragment clearly shows that bending at one linker can occur independently of bending at an adjacent linker. This observation increases the possible trajectories of modeled chains of spectrin repeats. Furthermore, themore » three-repeat molecule crystallized as an antiparallel dimer with a significantly smaller buried interfacial area than that of {alpha}-actinin, a spectrin-related molecule, but large enough and of a type indicating biological specificity. Comparison of the structures of the spectrin and {alpha}-actinin dimers supports weak association of the former, which could not be detected by analytical ultracentrifugation, versus strong association of the latter, which has been observed by others. To correlate features of the structure with solution properties and to test a previous model of stable spectrin and dystrophin repeats, the number of inter-helical interactions in each repeat of several spectrin structures were counted and compared to their thermal stabilities. Inter-helical interactions, but not all interactions, increased in parallel with measured thermal stabilities of each repeat and in agreement with the thermal stabilities of two and three repeats and also partial repeats of spectrin.« less

  10. Screening of repetitive motifs inside the genome of the flat oyster (Ostrea edulis): Transposable elements and short tandem repeats.

    PubMed

    Vera, Manuel; Bello, Xabier; Álvarez-Dios, Jose-Antonio; Pardo, Belen G; Sánchez, Laura; Carlsson, Jens; Carlsson, Jeanette E L; Bartolomé, Carolina; Maside, Xulio; Martinez, Paulino

    2015-12-01

    The flat oyster (Ostrea edulis) is one of the most appreciated molluscs in Europe, but its production has been greatly reduced by the parasite Bonamia ostreae. Here, new generation genomic resources were used to analyse the repetitive fraction of the oyster genome, with the aim of developing molecular markers to face this main oyster production challenge. The resulting oyster database, consists of two sets of 10,318 and 7159 unique contigs (4.8 Mbp and 6.8 Mbp in total length) representing the oyster's genome (WG) and haemocyte transcriptome (HT), respectively. A total of 1083 sequences were identified as TE-derived, which corresponded to 4.0% of WG and 1.1% of HT. They were clustered into 142 homology groups, most of which were assigned to the Penelope order of retrotransposons, and to the Helitron and TIR DNA-transposons. Simple repeats and rRNA pseudogenes, also made a significant contribution to the oyster's genome (0.5% and 0.3% of WG and HT, respectively).The most frequent short tandem repeats identified in WG were tetranucleotide motifs while trinucleotide motifs were in HT. Forty identified microsatellite loci, 20 from each database, were selected for technical validation. Success was much lower among WG than HT microsatellites (15% vs 55%), which could reflect higher variation in anonymous regions interfering with primer annealing. All microsatellites developed adjusted to Hardy-Weinberg proportions and represent a useful tool to support future breeding programmes and to manage genetic resources of natural flat oyster beds. Copyright © 2015 Elsevier B.V. All rights reserved.

  11. Prion gene haplotypes of U.S. cattle

    PubMed Central

    Clawson, Michael L; Heaton, Michael P; Keele, John W; Smith, Timothy PL; Harhay, Gregory P; Laegreid, William W

    2006-01-01

    Background Bovine spongiform encephalopathy (BSE) is a fatal neurological disorder characterized by abnormal deposits of a protease-resistant isoform of the prion protein. Characterizing linkage disequilibrium (LD) and haplotype networks within the bovine prion gene (PRNP) is important for 1) testing rare or common PRNP variation for an association with BSE and 2) interpreting any association of PRNP alleles with BSE susceptibility. The objective of this study was to identify polymorphisms and haplotypes within PRNP from the promoter region through the 3'UTR in a diverse sample of U.S. cattle genomes. Results A 25.2-kb genomic region containing PRNP was sequenced from 192 diverse U.S. beef and dairy cattle. Sequence analyses identified 388 total polymorphisms, of which 287 have not previously been reported. The polymorphism alleles define PRNP by regions of high and low LD. High LD is present between alleles in the promoter region through exon 2 (6.7 kb). PRNP alleles within the majority of intron 2, the entire coding sequence and the untranslated region of exon 3 are in low LD (18.0 kb). Two haplotype networks, one representing the region of high LD and the other the region of low LD yielded nineteen different combinations that represent haplotypes spanning PRNP. The haplotype combinations are tagged by 19 polymorphisms (htSNPS) which characterize variation within and across PRNP. Conclusion The number of polymorphisms in the prion gene region of U.S. cattle is nearly four times greater than previously described. These polymorphisms define PRNP haplotypes that may influence BSE susceptibility in cattle. PMID:17092337

  12. New paradigm in ankyrin repeats: Beyond protein-protein interaction module.

    PubMed

    Islam, Zeyaul; Nagampalli, Raghavendra Sashi Krishna; Fatima, Munazza Tamkeen; Ashraf, Ghulam Md

    2018-04-01

    Classically, ankyrin repeat (ANK) proteins are built from tandems of two or more repeats and form curved solenoid structures that are associated with protein-protein interactions. These are short, widespread structural motif of around 33 amino acids repeats in tandem, having a canonical helix-loop-helix fold, found individually or in combination with other domains. The multiplicity of structural pattern enables it to form assemblies of diverse sizes, required for their abilities to confer multiple binding and structural roles of proteins. Three-dimensional structures of these repeats determined to date reveal a degree of structural variability that translates into the considerable functional versatility of this protein superfamily. Recent work on the ANK has proposed novel structural information, especially protein-lipid, protein-sugar and protein-protein interaction. Self-assembly of these repeats was also shown to prevent the associated protein in forming filaments. In this review, we summarize the latest findings and how the new structural information has increased our understanding of the structural determinants of ANK proteins. We discussed latest findings on how these proteins participate in various interactions to diversify the ANK roles in numerous biological processes, and explored the emerging and evolving field of designer ankyrins and its framework for protein engineering emphasizing on biotechnological applications. Copyright © 2017 Elsevier B.V. All rights reserved.

  13. [A total of 362 HLA different haplotypes and HLA recombination haplotypes based on analysis of their family pedigree in Chinese partial Han populations].

    PubMed

    Gao, Su-Qing; Cheng, Xi; Li, Qian; Li, Yu-Zhu; Deng, Zhi-Hui

    2009-06-01

    This study was aimed to discover the novel HLA recombination haplotypes and investigate the distribution of haplotypes in Chinese Han population. Based on the HLA-A, B, DRB1 typing results of 179 family members, 791 haplotypes were assigned by the mode of inheritance. The results showed that a total of 4 novel recombinant haplotypes in HLA-DRB1 locus region were observed in 4 families, which ratio of paternal to maternal chromosomes was 3:1. The recombination ratio between HLA-DRB1 and HLA-A or B loci was 0.92% (4/433). There were a total of 362 kinds of HLA-A, -B, -DRB1 haplotypes to be confirmed in Chinese Han partial population. A33-B58-DR17, A2-B46-DR9, A30-B13-DR7, A11-B13-DR15, A11-B75-DR12 and A2-B46-DR14 were the most common haplotypes that was consistent with the distribution of HLA alleles in unrelated donors. There were A1-B63-DR12, A29-B46-DR15, A1-B61-DR10, A34-B35-DR9, A29-B54-DR4, A23-B13-DR16 and A34-B62-DR15 haplotypes and so on, which were rare haplotypes not yet reported in Chinese. It is concluded that the HLA-A-B-DRB1 haplotypes would be confirmed by analysis of their family pedigree. The results obtained in this study are basic data for study of Chinese anthropology, organ transplantation and disease correlation analysis.

  14. Alpha-globin gene haplotypes in South American Indians.

    PubMed

    Zago, M A; Melo Santos, E J; Clegg, J B; Guerreiro, J F; Martinson, J J; Norwich, J; Figueiredo, M S

    1995-08-01

    The haplotypes of the alpha-globin gene cluster were determined for 99 Indians from the Brazilian Amazon region who belong to 5 tribes: Wayampí, Wayana-Apalaí, Kayapó, Arára, and Yanomámi. Three predominant haplotypes were identified: Ia (present in 38.9% of chromosomes), IIIa (25.8%), and IIe (22.1%). The only alpha-globin gene rearrangement detected was alpha alpha alpha 3.7 I gene triplication associated with haplotype IIIa, found in high frequencies (5.6% and 10.6%) in two tribes and absent in the others. alpha-Globin gene deletions that cause alpha-thalassemia were not seen, supporting the argument that malaria was absent in these populations until recently. The heterogeneous distribution of alpha-globin gene haplotypes and rearrangements among the different tribes differs markedly from the homogeneous distribution of beta-globin gene cluster haplotypes and reflects the action of various genetic mechanisms (genetic drift, founder effect, consanguinity) on small isolated population groups with a complicated history of divergence-fusion events. The alpha-globin gene haplotype distribution has some similarities to distributions observed in Southeast Asian and Pacific Island populations, indicating that these populations have considerable genetic affinities. However, the absence of several features of the alpha-globin gene cluster that are consistently present among the Pacific Islanders suggests that the similarity of haplotypes between Brazilian Indians and people from Polynesia, Micronesia, and Melanesia is more likely to result of ancient common ancestry rather than the consequence of recent direct genetic contribution through immigration.

  15. Mimosoid legume plastome evolution: IR expansion, tandem repeat expansions, and accelerated rate of evolution in clpP.

    PubMed

    Dugas, Diana V; Hernandez, David; Koenen, Erik J M; Schwarz, Erika; Straub, Shannon; Hughes, Colin E; Jansen, Robert K; Nageswara-Rao, Madhugiri; Staats, Martijn; Trujillo, Joshua T; Hajrah, Nahid H; Alharbi, Njud S; Al-Malki, Abdulrahman L; Sabir, Jamal S M; Bailey, C Donovan

    2015-11-23

    The Leguminosae has emerged as a model for studying angiosperm plastome evolution because of its striking diversity of structural rearrangements and sequence variation. However, most of what is known about legume plastomes comes from few genera representing a subset of lineages in subfamily Papilionoideae. We investigate plastome evolution in subfamily Mimosoideae based on two newly sequenced plastomes (Inga and Leucaena) and two recently published plastomes (Acacia and Prosopis), and discuss the results in the context of other legume and rosid plastid genomes. Mimosoid plastomes have a typical angiosperm gene content and general organization as well as a generally slow rate of protein coding gene evolution, but they are the largest known among legumes. The increased length results from tandem repeat expansions and an unusual 13 kb IR-SSC boundary shift in Acacia and Inga. Mimosoid plastomes harbor additional interesting features, including loss of clpP intron1 in Inga, accelerated rates of evolution in clpP for Acacia and Inga, and dN/dS ratios consistent with neutral and positive selection for several genes. These new plastomes and results provide important resources for legume comparative genomics, plant breeding, and plastid genetic engineering, while shedding further light on the complexity of plastome evolution in legumes and angiosperms.

  16. Concerted evolution of the tandem array encoding primate U2 snRNA occurs in situ, without changing the cytological context of the RNU2 locus.

    PubMed Central

    Pavelitz, T; Rusché, L; Matera, A G; Scharf, J M; Weiner, A M

    1995-01-01

    In primates, the tandemly repeated genes encoding U2 small nuclear RNA evolve concertedly, i.e. the sequence of the U2 repeat unit is essentially homogeneous within each species but differs somewhat between species. Using chromosome painting and the NGFR gene as an outside marker, we show that the U2 tandem array (RNU2) has remained at the same chromosomal locus (equivalent to human 17q21) through multiple speciation events over > 35 million years leading to the Old World monkey and hominoid lineages. The data suggest that the U2 tandem repeat, once established in the primate lineage, contained sequence elements favoring perpetuation and concerted evolution of the array in situ, despite a pericentric inversion in chimpanzee, a reciprocal translocation in gorilla and a paracentric inversion in orang utan. Comparison of the 11 kb U2 repeat unit found in baboon and other Old World monkeys with the 6 kb U2 repeat unit in humans and other hominids revealed that an ancestral U2 repeat unit was expanded by insertion of a 5 kb retrovirus bearing 1 kb long terminal repeats (LTRs). Subsequent excision of the provirus by homologous recombination between the LTRs generated a 6 kb U2 repeat unit containing a solo LTR. Remarkably, both junctions between the human U2 tandem array and flanking chromosomal DNA at 17q21 fall within the solo LTR sequence, suggesting a role for the LTR in the origin or maintenance of the primate U2 array. Images PMID:7828589

  17. Revisiting the TALE repeat.

    PubMed

    Deng, Dong; Yan, Chuangye; Wu, Jianping; Pan, Xiaojing; Yan, Nieng

    2014-04-01

    Transcription activator-like (TAL) effectors specifically bind to double stranded (ds) DNA through a central domain of tandem repeats. Each TAL effector (TALE) repeat comprises 33-35 amino acids and recognizes one specific DNA base through a highly variable residue at a fixed position in the repeat. Structural studies have revealed the molecular basis of DNA recognition by TALE repeats. Examination of the overall structure reveals that the basic building block of TALE protein, namely a helical hairpin, is one-helix shifted from the previously defined TALE motif. Here we wish to suggest a structure-based re-demarcation of the TALE repeat which starts with the residues that bind to the DNA backbone phosphate and concludes with the base-recognition hyper-variable residue. This new numbering system is consistent with the α-solenoid superfamily to which TALE belongs, and reflects the structural integrity of TAL effectors. In addition, it confers integral number of TALE repeats that matches the number of bound DNA bases. We then present fifteen crystal structures of engineered dHax3 variants in complex with target DNA molecules, which elucidate the structural basis for the recognition of bases adenine (A) and guanine (G) by reported or uncharacterized TALE codes. Finally, we analyzed the sequence-structure correlation of the amino acid residues within a TALE repeat. The structural analyses reported here may advance the mechanistic understanding of TALE proteins and facilitate the design of TALEN with improved affinity and specificity.

  18. Haplotype Reconstruction in Large Pedigrees with Many Untyped Individuals

    NASA Astrophysics Data System (ADS)

    Li, Xin; Li, Jing

    Haplotypes, as they specify the linkage patterns between dispersed genetic variations, provide important information for understanding the genetics of human traits. However haplotypes are not directly available from current genotyping platforms, and hence there are extensive investigations of computational methods to recover such information. Two major computational challenges arising in current family-based disease studies are large family sizes and many ungenotyped family members. Traditional haplotyping methods can neither handle large families nor families with missing members. In this paper, we propose a method which addresses these issues by integrating multiple novel techniques. The method consists of three major components: pairwise identical-bydescent (IBD) inference, global IBD reconstruction and haplotype restoring. By reconstructing the global IBD of a family from pairwise IBD and then restoring the haplotypes based on the inferred IBD, this method can scale to large pedigrees, and more importantly it can handle families with missing members. Compared with existing methods, this method demonstrates much higher power to recover haplotype information, especially in families with many untyped individuals.

  19. Rapid Identification of Laboratory Contamination with Mycobacterium tuberculosis Using Variable Number Tandem Repeat Analysis

    PubMed Central

    Gascoyne-Binzi, Deborah M.; Barlow, Rachael E. L.; Frothingham, Richard; Robinson, Grant; Collyns, Timothy A.; Gelletlie, Ruth; Hawkey, Peter M.

    2001-01-01

    Compared with solid media, broth-based mycobacterial culture systems have increased sensitivity but also have higher false-positive rates due to cross-contamination. Systematic strain typing is rarely undertaken because the techniques are technically demanding and the data are difficult to organize. Variable number tandem repeat (VNTR) analysis by PCR is rapid and reproducible. The digital profile is easily manipulated in a database. We undertook a retrospective study of Mycobacterium tuberculosis isolates collected over an 18-month period following the introduction of the BACTEC MGIT 960 system. VNTR allele profiles were determined with early positive broth cultures and entered into a database with the specimen processing date and other specimen data. We found 36 distinct VNTR profiles in cultures from 144 patients. Three common VNTR profiles accounted for 45% of true-positive cases. By combining VNTR results with specimen data, we identified nine cross-contamination incidents, six of which were previously unsuspected. These nine incidents resulted in 34 false-positive cultures for 29 patients. False-positive cultures were identified for three patients who had previously been culture positive for tuberculosis and were receiving treatment. Identification of cross-contamination incidents requires careful documentation of specimen data and good communication between clinical and laboratory staff. Automated broth culture systems should be supplemented with molecular analysis to identify cross-contamination events. VNTR analysis is reproducible and provides timely results when applied to early positive broth cultures. This method should ensure that patients are not placed on unnecessary tuberculosis therapy or that cases are not falsely identified as treatment failures. In addition, areas where existing procedures may be improved can be identified. PMID:11136751

  20. Unrelated sequences at the 5' end of mouse LINE-1 repeated elements define two distinct subfamilies.

    PubMed Central

    Wincker, P; Jubier-Maurin, V; Roizès, G

    1987-01-01

    Some full length members of the mouse long interspersed repeated DNA family L1Md have been shown to be associated at their 5' end with a variable number of tandem repetitions, the A repeats, that have been suggested to be transcription controlling elements. We report that the other type of repeat, named F, found at the 5' end of a few L1 elements is also an integral part of full length L1 copies. Sequencing shows that the F repeats are GC rich, and organized in tandem. The L1 copies associated with either A or F repeats can be correlated with two different subsets of L1 sequences distinguished by a series of variant nucleotides specific to each and by unassociated but frequent restriction sites. These findings suggest that sequence replacement has occurred at least once in 5' of L1Md, and is related to the generation of specific subfamilies. Images PMID:3684566

  1. A parsimonious tree-grow method for haplotype inference.

    PubMed

    Li, Zhenping; Zhou, Wenfeng; Zhang, Xiang-Sun; Chen, Luonan

    2005-09-01

    Haplotype information has become increasingly important in analyzing fine-scale molecular genetics data, such as disease genes mapping and drug design. Parsimony haplotyping is one of haplotyping problems belonging to NP-hard class. In this paper, we aim to develop a novel algorithm for the haplotype inference problem with the parsimony criterion, based on a parsimonious tree-grow method (PTG). PTG is a heuristic algorithm that can find the minimum number of distinct haplotypes based on the criterion of keeping all genotypes resolved during tree-grow process. In addition, a block-partitioning method is also proposed to improve the computational efficiency. We show that the proposed approach is not only effective with a high accuracy, but also very efficient with the computational complexity in the order of O(m2n) time for n single nucleotide polymorphism sites in m individual genotypes. The software is available upon request from the authors, or from http://zhangroup.aporc.org/bioinfo/ptg/ chen@elec.osaka-sandai.ac.jp Supporting materials is available from http://zhangroup.aporc.org/bioinfo/ptg/bti572supplementary.pdf

  2. In Vivo Characterization of Human APOA5 Haplotypes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ahituv, Nadav; Akiyama, Jennifer; Chapman-Helleboid, Audrey

    2006-10-01

    Increased plasma triglycerides concentrations are an independent risk factor for cardiovascular disease. Numerous studies support a reproducible genetic association between two minor haplotypes in the human apolipoprotein A5 gene (APOA5) and increased plasma triglyceride concentrations. We thus sought to investigate the effect of these minor haplotypes (APOA5*2 and APOA5*3) on ApoAV plasma levels through the precise insertion of single-copy intact APOA5 haplotypes at a targeted location in the mouse genome. While we found no difference in the amount of human plasma ApoAV in mice containing the common APOA5*1 and minor APOA5*2 haplotype, the introduction of the single APOA5*3 defining allelemore » (19W) resulted in 3-fold lower ApoAV plasma levels consistent with existing genetic association studies. These results indicate that S19W polymorphism is likely to be functional and explain the strong association of this variant with plasma triglycerides supporting the value of sensitive in vivo assays to define the functional nature of human haplotypes.« less

  3. Detecting disease-predisposing variants: the haplotype method.

    PubMed Central

    Valdes, A M; Thomson, G

    1997-01-01

    For many HLA-associated diseases, multiple alleles-- and, in some cases, multiple loci--have been suggested as the causative agents. The haplotype method for identifying disease-predisposing amino acids in a genetic region is a stratification analysis. We show that, for each haplotype combination containing all the amino acid sites involved in the disease process, the relative frequencies of amino acid variants at sites not involved in disease but in linkage disequilibrium with the disease-predisposing sites are expected to be the same in patients and controls. The haplotype method is robust to mode of inheritance and penetrance of the disease and can be used to determine unequivocally whether all amino acid sites involved in the disease have not been identified. Using a resampling technique, we developed a statistical test that takes account of the nonindependence of the sites sampled. Further, when multiple sites in the genetic region are involved in disease, the test statistic gives a closer fit to the null expectation when some--compared with none--of the true predisposing factors are included in the haplotype analysis. Although the haplotype method cannot distinguish between very highly correlated sites in one population, ethnic comparisons may help identify the true predisposing factors. The haplotype method was applied to insulin-dependent diabetes mellitus (IDDM) HLA class II DQA1-DQB1 data from Caucasian, African, and Japanese populations. Our results indicate that the combination DQA1#52 (Arg predisposing) DQB1#57 (Asp protective), which has been proposed as an important IDDM agent, does not include all the predisposing elements. With rheumatoid arthritis HLA class II DRB1 data, the results were consistent with the shared-epitope hypothesis. PMID:9042931

  4. Evolution of short inverted repeat in cupressophytes, transfer of accD to nucleus in Sciadopitys verticillata and phylogenetic position of Sciadopityaceae.

    PubMed

    Li, Jia; Gao, Lei; Chen, Shanshan; Tao, Ke; Su, Yingjuan; Wang, Ting

    2016-02-11

    Sciadopitys verticillata is an evergreen conifer and an economically valuable tree used in construction, which is the only member of the family Sciadopityaceae. Acquisition of the S. verticillata chloroplast (cp) genome will be useful for understanding the evolutionary mechanism of conifers and phylogenetic relationships among gymnosperm. In this study, we have first reported the complete chloroplast genome of S. verticillata. The total genome is 138,284 bp in length, consisting of 118 unique genes. The S. verticillata cp genome has lost one copy of the canonical inverted repeats and shown distinctive genomic structure comparing with other cupressophytes. Fifty-three simple sequence repeat loci and 18 forward tandem repeats were identified in the S. verticillata cp genome. According to the rearrangement of cupressophyte cp genome, we proposed one mechanism for the formation of inverted repeat: tandem repeat occured first, then rearrangement divided the tandem repeat into inverted repeats located at different regions. Phylogenetic estimates inferred from 59-gene sequences and cpDNA organizations have both shown that S. verticillata was sister to the clade consisting of Cupressaceae, Taxaceae, and Cephalotaxaceae. Moreover, accD gene was found to be lost in the S. verticillata cp genome, and a nucleus copy was identified from two transcriptome data.

  5. Introgression of Neandertal- and Denisovan-like Haplotypes Contributes to Adaptive Variation in Human Toll-like Receptors

    PubMed Central

    Dannemann, Michael; Andrés, Aida M.; Kelso, Janet

    2016-01-01

    Pathogens and the diseases they cause have been among the most important selective forces experienced by humans during their evolutionary history. Although adaptive alleles generally arise by mutation, introgression can also be a valuable source of beneficial alleles. Archaic humans, who lived in Europe and Western Asia for more than 200,000 years, were probably well adapted to this environment and its local pathogens. It is therefore conceivable that modern humans entering Europe and Western Asia who admixed with them obtained a substantial immune advantage from the introgression of archaic alleles. Here we document a cluster of three Toll-like receptors (TLR6-TLR1-TLR10) in modern humans that carries three distinct archaic haplotypes, indicating repeated introgression from archaic humans. Two of these haplotypes are most similar to the Neandertal genome, and the third haplotype is most similar to the Denisovan genome. The Toll-like receptors are key components of innate immunity and provide an important first line of immune defense against bacteria, fungi, and parasites. The unusually high allele frequencies and unexpected levels of population differentiation indicate that there has been local positive selection on multiple haplotypes at this locus. We show that the introgressed alleles have clear functional effects in modern humans; archaic-like alleles underlie differences in the expression of the TLR genes and are associated with reduced microbial resistance and increased allergic disease in large cohorts. This provides strong evidence for recurrent adaptive introgression at the TLR6-TLR1-TLR10 locus, resulting in differences in disease phenotypes in modern humans. PMID:26748514

  6. Diversity and Plasticity of the Intracellular Plant Pathogen and Insect Symbiont “Candidatus Liberibacter asiaticus” as Revealed by Hypervariable Prophage Genes with Intragenic Tandem Repeats ▿ †

    PubMed Central

    Zhou, Lijuan; Powell, Charles A.; Hoffman, Michele T.; Li, Wenbin; Fan, Guocheng; Liu, Bo; Lin, Hong; Duan, Yongping

    2011-01-01

    “Candidatus Liberibacter asiaticus” is a psyllid-transmitted, phloem-limited alphaproteobacterium and the most prevalent species of “Ca. Liberibacter” associated with a devastating worldwide citrus disease known as huanglongbing (HLB). Two related and hypervariable genes (hyvI and hyvII) were identified in the prophage regions of the Psy62 “Ca. Liberibacter asiaticus” genome. Sequence analyses of the hyvI and hyvII genes in 35 “Ca. Liberibacter asiaticus” DNA isolates collected globally revealed that the hyvI gene contains up to 12 nearly identical tandem repeats (NITRs, 132 bp) and 4 partial repeats, while hyvII contains up to 2 NITRs and 4 partial repeats and shares homology with hyvI. Frequent deletions or insertions of these repeats within the hyvI and hyvII genes were observed, none of which disrupted the open reading frames. Sequence conservation within the individual repeats but an extensive variation in repeat numbers, rearrangement, and the sequences flanking the repeat region indicate the diversity and plasticity of “Ca. Liberibacter asiaticus” bacterial populations in the world. These differences were found not only in samples of distinct geographical origins but also in samples from a single origin and even from a single “Ca. Liberibacter asiaticus”-infected sample. This is the first evidence of different “Ca. Liberibacter asiaticus” populations coexisting in a single HLB-affected sample. The Florida “Ca. Liberibacter asiaticus” isolates contain both hyvI and hyvII, while all other global “Ca. Liberibacter asiaticus” isolates contain either one or the other. Interclade assignments of the putative HyvI and HyvII proteins from Florida isolates with other global isolates in phylogenetic trees imply multiple “Ca. Liberibacter asiaticus” populations in the world and a multisource introduction of the “Ca. Liberibacter asiaticus” bacterium into Florida. PMID:21784907

  7. A variable number of tandem repeats in the 3'-untranslated region of the dopamine transporter modulates striatal function during working memory updating across the adult age span.

    PubMed

    Sambataro, Fabio; Podell, Jamie E; Murty, Vishnu P; Das, Saumitra; Kolachana, Bhaskar; Goldberg, Terry E; Weinberger, Daniel R; Mattay, Venkata S

    2015-08-01

    Dopamine modulation of striatal function is critical for executive functions such as working memory (WM) updating. The dopamine transporter (DAT) regulates striatal dopamine signaling via synaptic reuptake. A variable number of tandem repeats in the 3'-untranslated region of SLC6A3 (DAT1-3'-UTR-VNTR) is associated with DAT expression, such that 9-repeat allele carriers tend to express lower levels (associated with higher extracellular dopamine concentrations) than 10-repeat homozygotes. Aging is also associated with decline of the dopamine system. The goal of the present study was to investigate the effects of aging and DAT1-3'-UTR-VNTR on the neural activity and functional connectivity of the striatum during WM updating. Our results showed both an age-related decrease in striatal activity and an effect of DAT1-3'-UTR-VNTR. Ten-repeat homozygotes showed reduced striatal activity and increased striatal-hippocampal connectivity during WM updating relative to the 9-repeat carriers. There was no age by DAT1-3'-UTR-VNTR interaction. These results suggest that, whereas striatal function during WM updating is modulated by both age and genetically determined DAT levels, the rate of the age-related decline in striatal function is similar across both DAT1-3'-UTR-VNTR genotype groups. They further suggest that, because of the baseline difference in striatal function based on DAT1-3'-UTR-VNTR polymorphism, 10-repeat homozygotes, who have lower levels of striatal function throughout the adult life span, may reach a threshold of decreased striatal function and manifest impairments in cognitive processes mediated by the striatum earlier in life than the 9-repeat carriers. Our data suggest that age and DAT1-3'-UTR-VNTR polymorphism independently modulate striatal function. Published 2015. This article is a U.S. Government work and is in the public domain in the USA.

  8. Relationship between Distinct African Cholera Epidemics Revealed via MLVA Haplotyping of 337 Vibrio cholerae Isolates.

    PubMed

    Moore, Sandra; Miwanda, Berthe; Sadji, Adodo Yao; Thefenne, Hélène; Jeddi, Fakhri; Rebaudet, Stanislas; de Boeck, Hilde; Bidjada, Bawimodom; Depina, Jean-Jacques; Bompangue, Didier; Abedi, Aaron Aruna; Koivogui, Lamine; Keita, Sakoba; Garnotel, Eric; Plisnier, Pierre-Denis; Ruimy, Raymond; Thomson, Nicholas; Muyembe, Jean-Jacques; Piarroux, Renaud

    2015-01-01

    Since cholera appeared in Africa during the 1970s, cases have been reported on the continent every year. In Sub-Saharan Africa, cholera outbreaks primarily cluster at certain hotspots including the African Great Lakes Region and West Africa. In this study, we applied MLVA (Multi-Locus Variable Number Tandem Repeat Analysis) typing of 337 Vibrio cholerae isolates from recent cholera epidemics in the Democratic Republic of the Congo (DRC), Zambia, Guinea and Togo. We aimed to assess the relationship between outbreaks. Applying this method, we identified 89 unique MLVA haplotypes across our isolate collection. MLVA typing revealed the short-term divergence and microevolution of these Vibrio cholerae populations to provide insight into the dynamics of cholera outbreaks in each country. Our analyses also revealed strong geographical clustering. Isolates from the African Great Lakes Region (DRC and Zambia) formed a closely related group, while West African isolates (Togo and Guinea) constituted a separate cluster. At a country-level scale our analyses revealed several distinct MLVA groups, most notably DRC 2011/2012, DRC 2009, Zambia 2012 and Guinea 2012. We also found that certain MLVA types collected in the DRC persisted in the country for several years, occasionally giving rise to expansive epidemics. Finally, we found that the six environmental isolates in our panel were unrelated to the epidemic isolates. To effectively combat the disease, it is critical to understand the mechanisms of cholera emergence and diffusion in a region-specific manner. Overall, these findings demonstrate the relationship between distinct epidemics in West Africa and the African Great Lakes Region. This study also highlights the importance of monitoring and analyzing Vibrio cholerae isolates.

  9. Kullback-Leibler divergence for detection of rare haplotype common disease association.

    PubMed

    Lin, Shili

    2015-11-01

    Rare haplotypes may tag rare causal variants of common diseases; hence, detection of such rare haplotypes may also contribute to our understanding of complex disease etiology. Because rare haplotypes frequently result from common single-nucleotide polymorphisms (SNPs), focusing on rare haplotypes is much more economical compared with using rare single-nucleotide variants (SNVs) from sequencing, as SNPs are available and 'free' from already amassed genome-wide studies. Further, associated haplotypes may shed light on the underlying disease causal mechanism, a feat unmatched by SNV-based collapsing methods. In recent years, data mining approaches have been adapted to detect rare haplotype association. However, as they rely on an assumed underlying disease model and require the specification of a null haplotype, results can be erroneous if such assumptions are violated. In this paper, we present a haplotype association method based on Kullback-Leibler divergence (hapKL) for case-control samples. The idea is to compare haplotype frequencies for the cases versus the controls by computing symmetrical divergence measures. An important property of such measures is that both the frequencies and logarithms of the frequencies contribute in parallel, thus balancing the contributions from rare and common, and accommodating both deleterious and protective, haplotypes. A simulation study under various scenarios shows that hapKL has well-controlled type I error rates and good power compared with existing data mining methods. Application of hapKL to age-related macular degeneration (AMD) shows a strong association of the complement factor H (CFH) gene with AMD, identifying several individual rare haplotypes with strong signals.

  10. 47 CFR 69.111 - Tandem-switched transport and tandem charge.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... 47 Telecommunication 3 2011-10-01 2011-10-01 false Tandem-switched transport and tandem charge. 69... SERVICES (CONTINUED) ACCESS CHARGES Computation of Charges § 69.111 Tandem-switched transport and tandem...-switched transport shall consist of two rate elements, a transmission charge and a tandem switching charge...

  11. Accurate quantification of chromosomal lesions via short tandem repeat analysis using minimal amounts of DNA

    PubMed Central

    Jann, Johann-Christoph; Nowak, Daniel; Nolte, Florian; Fey, Stephanie; Nowak, Verena; Obländer, Julia; Pressler, Jovita; Palme, Iris; Xanthopoulos, Christina; Fabarius, Alice; Platzbecker, Uwe; Giagounidis, Aristoteles; Götze, Katharina; Letsch, Anne; Haase, Detlef; Schlenk, Richard; Bug, Gesine; Lübbert, Michael; Ganser, Arnold; Germing, Ulrich; Haferlach, Claudia; Hofmann, Wolf-Karsten; Mossner, Maximilian

    2017-01-01

    Background Cytogenetic aberrations such as deletion of chromosome 5q (del(5q)) represent key elements in routine clinical diagnostics of haematological malignancies. Currently established methods such as metaphase cytogenetics, FISH or array-based approaches have limitations due to their dependency on viable cells, high costs or semi-quantitative nature. Importantly, they cannot be used on low abundance DNA. We therefore aimed to establish a robust and quantitative technique that overcomes these shortcomings. Methods For precise determination of del(5q) cell fractions, we developed an inexpensive multiplex-PCR assay requiring only nanograms of DNA that simultaneously measures allelic imbalances of 12 independent short tandem repeat markers. Results Application of this method to n=1142 samples from n=260 individuals revealed strong intermarker concordance (R²=0.77–0.97) and reproducibility (mean SD: 1.7%). Notably, the assay showed accurate quantification via standard curve assessment (R²>0.99) and high concordance with paired FISH measurements (R²=0.92) even with subnanogram amounts of DNA. Moreover, cytogenetic response was reliably confirmed in del(5q) patients with myelodysplastic syndromes treated with lenalidomide. While the assay demonstrated good diagnostic accuracy in receiver operating characteristic analysis (area under the curve: 0.97), we further observed robust correlation between bone marrow and peripheral blood samples (R²=0.79), suggesting its potential suitability for less-invasive clonal monitoring. Conclusions In conclusion, we present an adaptable tool for quantification of chromosomal aberrations, particularly in problematic samples, which should be easily applicable to further tumour entities. PMID:28600436

  12. Recommendation of short tandem repeat profiling for authenticating human cell lines, stem cells, and tissues.

    PubMed

    Barallon, Rita; Bauer, Steven R; Butler, John; Capes-Davis, Amanda; Dirks, Wilhelm G; Elmore, Eugene; Furtado, Manohar; Kline, Margaret C; Kohara, Arihiro; Los, Georgyi V; MacLeod, Roderick A F; Masters, John R W; Nardone, Mark; Nardone, Roland M; Nims, Raymond W; Price, Paul J; Reid, Yvonne A; Shewale, Jaiprakash; Sykes, Gregory; Steuer, Anton F; Storts, Douglas R; Thomson, Jim; Taraporewala, Zenobia; Alston-Roberts, Christine; Kerrigan, Liz

    2010-10-01

    Cell misidentification and cross-contamination have plagued biomedical research for as long as cells have been employed as research tools. Examples of misidentified cell lines continue to surface to this day. Efforts to eradicate the problem by raising awareness of the issue and by asking scientists voluntarily to take appropriate actions have not been successful. Unambiguous cell authentication is an essential step in the scientific process and should be an inherent consideration during peer review of papers submitted for publication or during review of grants submitted for funding. In order to facilitate proper identity testing, accurate, reliable, inexpensive, and standardized methods for authentication of cells and cell lines must be made available. To this end, an international team of scientists is, at this time, preparing a consensus standard on the authentication of human cells using short tandem repeat (STR) profiling. This standard, which will be submitted for review and approval as an American National Standard by the American National Standards Institute, will provide investigators guidance on the use of STR profiling for authenticating human cell lines. Such guidance will include methodological detail on the preparation of the DNA sample, the appropriate numbers and types of loci to be evaluated, and the interpretation and quality control of the results. Associated with the standard itself will be the establishment and maintenance of a public STR profile database under the auspices of the National Center for Biotechnology Information. The consensus standard is anticipated to be adopted by granting agencies and scientific journals as appropriate methodology for authenticating human cell lines, stem cells, and tissues.

  13. Recommendation of short tandem repeat profiling for authenticating human cell lines, stem cells, and tissues

    PubMed Central

    Barallon, Rita; Bauer, Steven R.; Butler, John; Capes-Davis, Amanda; Dirks, Wilhelm G.; Furtado, Manohar; Kline, Margaret C.; Kohara, Arihiro; Los, Georgyi V.; MacLeod, Roderick A. F.; Masters, John R. W.; Nardone, Mark; Nardone, Roland M.; Nims, Raymond W.; Price, Paul J.; Reid, Yvonne A.; Shewale, Jaiprakash; Sykes, Gregory; Steuer, Anton F.; Storts, Douglas R.; Thomson, Jim; Taraporewala, Zenobia; Alston-Roberts, Christine; Kerrigan, Liz

    2010-01-01

    Cell misidentification and cross-contamination have plagued biomedical research for as long as cells have been employed as research tools. Examples of misidentified cell lines continue to surface to this day. Efforts to eradicate the problem by raising awareness of the issue and by asking scientists voluntarily to take appropriate actions have not been successful. Unambiguous cell authentication is an essential step in the scientific process and should be an inherent consideration during peer review of papers submitted for publication or during review of grants submitted for funding. In order to facilitate proper identity testing, accurate, reliable, inexpensive, and standardized methods for authentication of cells and cell lines must be made available. To this end, an international team of scientists is, at this time, preparing a consensus standard on the authentication of human cells using short tandem repeat (STR) profiling. This standard, which will be submitted for review and approval as an American National Standard by the American National Standards Institute, will provide investigators guidance on the use of STR profiling for authenticating human cell lines. Such guidance will include methodological detail on the preparation of the DNA sample, the appropriate numbers and types of loci to be evaluated, and the interpretation and quality control of the results. Associated with the standard itself will be the establishment and maintenance of a public STR profile database under the auspices of the National Center for Biotechnology Information. The consensus standard is anticipated to be adopted by granting agencies and scientific journals as appropriate methodology for authenticating human cell lines, stem cells, and tissues. PMID:20614197

  14. Tandem repeat variation near the HIC1 (hypermethylated in cancer 1) promoter predicts outcome of oxaliplatin-based chemotherapy in patients with metastatic colorectal cancer.

    PubMed

    Okazaki, Satoshi; Schirripa, Marta; Loupakis, Fotios; Cao, Shu; Zhang, Wu; Yang, Dongyun; Ning, Yan; Berger, Martin D; Miyamoto, Yuji; Suenaga, Mitsukuni; Iqubal, Syma; Barzi, Afsaneh; Cremolini, Chiara; Falcone, Alfredo; Battaglin, Francesca; Salvatore, Lisa; Borelli, Beatrice; Helentjaris, Timothy G; Lenz, Heinz-Josef

    2017-11-15

    The hypermethylated in cancer 1/sirtuin 1 (HIC1/SIRT1) axis plays an important role in regulating the nucleotide excision repair pathway, which is the main oxaliplatin-induced damage-repair system. On the basis of prior evidence that the variable number of tandem repeat (VNTR) sequence located near the promoter lesion of HIC1 is associated with HIC1 gene expression, the authors tested the hypothesis that this VNTR is associated with clinical outcome in patients with metastatic colorectal cancer who receive oxaliplatin-based chemotherapy. Four independent cohorts were tested. Patients who received oxaliplatin-based chemotherapy served as the training cohort (n = 218), and those who received treatment without oxaliplatin served as the control cohort (n = 215). Two cohorts of patients who received oxaliplatin-based chemotherapy were used for validation studies (n = 176 and n = 73). The VNTR sequence near HIC1 was analyzed by polymerase chain reaction analysis and gel electrophoresis and was tested for associations with the response rate, progression-free survival, and overall survival. In the training cohort, patients who harbored at least 5 tandem repeats (TRs) in both alleles had a significantly shorter PFS compared with those who had fewer than 4 TRs in at least 1 allele (9.5 vs 11.6 months; hazard ratio, 1.93; P = .012), and these findings remained statistically significant after multivariate analysis (hazard ratio, 2.00; 95% confidence interval, 1.13-3.54; P = .018). This preliminary association was confirmed in the validation cohort, and patients who had at least 5 TRs in both alleles had a worse PFS compared with the other cohort (7.9 vs 9.8 months; hazard ratio, 1.85; P = .044). The current findings suggest that the VNTR sequence near HIC1 could be a predictive marker for oxaliplatin-based chemotherapy in patients with metastatic colorectal cancer. Cancer 2017;123:4506-14. © 2017 American Cancer Society. © 2017 American Cancer Society.

  15. HaploForge: a comprehensive pedigree drawing and haplotype visualization web application.

    PubMed

    Tekman, Mehmet; Medlar, Alan; Mozere, Monika; Kleta, Robert; Stanescu, Horia

    2017-12-15

    Haplotype reconstruction is an important tool for understanding the aetiology of human disease. Haplotyping infers the most likely phase of observed genotypes conditional on constraints imposed by the genotypes of other pedigree members. The results of haplotype reconstruction, when visualized appropriately, show which alleles are identical by descent despite the presence of untyped individuals. When used in concert with linkage analysis, haplotyping can help delineate a locus of interest and provide a succinct explanation for the transmission of the trait locus. Unfortunately, the design choices made by existing haplotype visualization programs do not scale to large numbers of markers. Indeed, following haplotypes from generation to generation requires excessive scrolling back and forth. In addition, the most widely used program for haplotype visualization produces inconsistent recombination artefacts for the X chromosome. To resolve these issues, we developed HaploForge, a novel web application for haplotype visualization and pedigree drawing. HaploForge takes advantage of HTML5 to be fast, portable and avoid the need for local installation. It can accurately visualize autosomal and X-linked haplotypes from both outbred and consanguineous pedigrees. Haplotypes are coloured based on identity by descent using a novel A* search algorithm and we provide a flexible viewing mode to aid visual inspection. HaploForge can currently process haplotype reconstruction output from Allegro, GeneHunter, Merlin and Simwalk. HaploForge is licensed under GPLv3 and is hosted and maintained via GitHub. https://github.com/mtekman/haploforge. r.kleta@ucl.ac.uk. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  16. Variable number of tandem repeat profiles and antimicrobial resistance patterns of Staphylococcus haemolyticus strains isolated from blood cultures in children.

    PubMed

    Hosseinkhani, Faride; Jabalameli, Fereshteh; Nodeh Farahani, Narges; Taherikalani, Morovat; van Leeuwen, Willem B; Emaneini, Mohammad

    2016-03-01

    Staphylococcus haemolyticus is a healthcare-associated pathogen and can cause a variety of lifethreatening infections. Additionally, multi-drug resistance (MDR), in particular methicillin-resistant S. haemolyticus (MRSH) isolates, have emerged. Dissemination of such strains can be of great concern in the hospital environment. A total number of 20S. haemolyticus isolates from blood cultures obtained from children were included in this study. A high prevalence of MDR-MRSH isolates with high MIC values to vancomycin was found and 35% of the isolates were intermediate resistant to vancomycin. Multilocus variable number of tandem repeats analysis (MLVF) revealed 5 MLVF types among 20 isolates of S. haemolyticus. Twelve isolates shared the same MLVF type and were isolated from different wards in a pediatric hospital in Iran. This is a serious alarm for infection control; i.e. in the absence of adequate infection diagnostics and infection control guidelines, these resistant strains can spread to other sectors of a hospital and possibly among the community. Copyright © 2015 Elsevier B.V. All rights reserved.

  17. Evolutionary Conservation of a Coding Function for D4Z4, the Tandem DNA Repeat Mutated in Facioscapulohumeral Muscular Dystrophy

    PubMed Central

    Clapp, Jannine ; Mitchell, Laura M. ; Bolland, Daniel J. ; Fantes, Judy ; Corcoran, Anne E. ; Scotting, Paul J. ; Armour, John A. L. ; Hewitt, Jane E. 

    2007-01-01

    Facioscapulohumeral muscular dystrophy (FSHD) is caused by deletions within the polymorphic DNA tandem array D4Z4. Each D4Z4 repeat unit has an open reading frame (ORF), termed “DUX4,” containing two homeobox sequences. Because there has been no evidence of a transcript from the array, these deletions are thought to cause FSHD by a position effect on other genes. Here, we identify D4Z4 homologues in the genomes of rodents, Afrotheria (superorder of elephants and related species), and other species and show that the DUX4 ORF is conserved. Phylogenetic analysis suggests that primate and Afrotherian D4Z4 arrays are orthologous and originated from a retrotransposed copy of an intron-containing DUX gene, DUXC. Reverse-transcriptase polymerase chain reaction and RNA fluorescence and tissue in situ hybridization data indicate transcription of the mouse array. Together with the conservation of the DUX4 ORF for >100 million years, this strongly supports a coding function for D4Z4 and necessitates re-examination of current models of the FSHD disease mechanism. PMID:17668377

  18. A new mathematical modeling for pure parsimony haplotyping problem.

    PubMed

    Feizabadi, R; Bagherian, M; Vaziri, H R; Salahi, M

    2016-11-01

    Pure parsimony haplotyping (PPH) problem is important in bioinformatics because rational haplotyping inference plays important roles in analysis of genetic data, mapping complex genetic diseases such as Alzheimer's disease, heart disorders and etc. Haplotypes and genotypes are m-length sequences. Although several integer programing models have already been presented for PPH problem, its NP-hardness characteristic resulted in ineffectiveness of those models facing the real instances especially instances with many heterozygous sites. In this paper, we assign a corresponding number to each haplotype and genotype and based on those numbers, we set a mixed integer programing model. Using numbers, instead of sequences, would lead to less complexity of the new model in comparison with previous models in a way that there are neither constraints nor variables corresponding to heterozygous nucleotide sites in it. Experimental results approve the efficiency of the new model in producing better solution in comparison to two state-of-the art haplotyping approaches. Copyright © 2016 Elsevier Inc. All rights reserved.

  19. Mutation rates at 42 Y chromosomal short tandem repeats in Chinese Han population in Eastern China.

    PubMed

    Wu, Weiwei; Ren, Wenyan; Hao, Honglei; Nan, Hailun; He, Xin; Liu, Qiuling; Lu, Dejian

    2018-01-31

    Mutation analysis of 42 Y chromosomal short tandem repeats (Y-STRs) loci was performed using a sample of 1160 father-son pairs from the Chinese Han population in Eastern China. The results showed that the average mutation rate across the 42 Y-STR loci was 0.0041 (95% CI 0.0036-0.0047) per locus per generation. The locus-specific mutation rates varied from 0.000 to 0.0190. No mutation was found at DYS388, DYS437, DYS448, DYS531, and GATA_H4. DYS627, DYS570, DYS576, and DYS449 could be classified as rapidly mutating Y-STRs, with mutation rates higher than 1.0 × 10 -2 . DYS458, DYS630, and DYS518 were moderately mutating Y-STRs, with mutation rates ranging from 8 × 10 -3 to 1 × 10 -2 . Although the characteristics of the Y-STR mutations were consistent with those in previous studies, mutation rate differences between our data and previous published data were found at some rapidly mutating Y-STRs. The single-copy loci located on the short arm of the Y chromosome (Yp) showed relatively higher mutation rates more frequently than the multi-copy loci. These results will not only extend the data for Y-STR mutations but also be important for kinship analysis, paternal lineage identification, and family relationship reconstruction in forensic Y-STR analysis.

  20. Phylogeography and population structure of the biologically invasive phytopathogen Erwinia amylovora inferred using minisatellites.

    PubMed

    Bühlmann, Andreas; Dreo, Tanja; Rezzonico, Fabio; Pothier, Joël F; Smits, Theo H M; Ravnikar, Maja; Frey, Jürg E; Duffy, Brion

    2014-07-01

    Erwinia amylovora causes a major disease of pome fruit trees worldwide, and is regulated as a quarantine organism in many countries. While some diversity of isolates has been observed, molecular epidemiology of this bacterium is hindered by a lack of simple molecular typing techniques with sufficiently high resolution. We report a molecular typing system of E. amylovora based on variable number of tandem repeats (VNTR) analysis. Repeats in the E. amylovora genome were identified with comparative genomic tools, and VNTR markers were developed and validated. A Multiple-Locus VNTR Analysis (MLVA) was applied to E. amylovora isolates from bacterial collections representing global and regional distribution of the pathogen. Based on six repeats, MLVA allowed the distinction of 227 haplotypes among a collection of 833 isolates of worldwide origin. Three geographically separated groups were recognized among global isolates using Bayesian clustering methods. Analysis of regional outbreaks confirmed presence of diverse haplotypes but also high representation of certain haplotypes during outbreaks. MLVA analysis is a practical method for epidemiological studies of E. amylovora, identifying previously unresolved population structure within outbreaks. Knowledge of such structure can increase our understanding on how plant diseases emerge and spread over a given geographical region. © 2013 Society for Applied Microbiology and John Wiley & Sons Ltd.

  1. Haplotype assembly in polyploid genomes and identical by descent shared tracts.

    PubMed

    Aguiar, Derek; Istrail, Sorin

    2013-07-01

    Genome-wide haplotype reconstruction from sequence data, or haplotype assembly, is at the center of major challenges in molecular biology and life sciences. For complex eukaryotic organisms like humans, the genome is vast and the population samples are growing so rapidly that algorithms processing high-throughput sequencing data must scale favorably in terms of both accuracy and computational efficiency. Furthermore, current models and methodologies for haplotype assembly (i) do not consider individuals sharing haplotypes jointly, which reduces the size and accuracy of assembled haplotypes, and (ii) are unable to model genomes having more than two sets of homologous chromosomes (polyploidy). Polyploid organisms are increasingly becoming the target of many research groups interested in the genomics of disease, phylogenetics, botany and evolution but there is an absence of theory and methods for polyploid haplotype reconstruction. In this work, we present a number of results, extensions and generalizations of compass graphs and our HapCompass framework. We prove the theoretical complexity of two haplotype assembly optimizations, thereby motivating the use of heuristics. Furthermore, we present graph theory-based algorithms for the problem of haplotype assembly using our previously developed HapCompass framework for (i) novel implementations of haplotype assembly optimizations (minimum error correction), (ii) assembly of a pair of individuals sharing a haplotype tract identical by descent and (iii) assembly of polyploid genomes. We evaluate our methods on 1000 Genomes Project, Pacific Biosciences and simulated sequence data. HapCompass is available for download at http://www.brown.edu/Research/Istrail_Lab/. Supplementary data are available at Bioinformatics online.

  2. Fifteen non-CODIS autosomal short tandem repeat loci multiplex data from nine population groups living in Taiwan.

    PubMed

    Hwa, Hsiao-Lin; Chang, Yih-Yuan; Lee, James Chun-I; Lin, Chun-Yen; Yin, Hsiang-Yi; Tseng, Li-Hui; Su, Yi-Ning; Ko, Tsang-Ming

    2012-07-01

    The analysis of autosomal short tandem repeat (STR) loci is a powerful tool in forensic genetics. We developed a multiplex system in which 15 non-Combined DNA Index System autosomal STRs (D3S1744, D4S2366, D8S1110, D10S2325, D12S1090, D13S765, D14S608, Penta E, D17S1294, D18S536, D18S1270, D20S470, D21S1437, Penta D, and D22S683) could be amplified in one single polymerase chain reaction. DNA samples from 1,098 unrelated subjects of nine population groups living in Taiwan, including Taiwanese Han, indigenous Taiwanese of Taiwan Island, Tao, mainland Chinese, Filipinos, Thais, Vietnamese, Indonesians, and Caucasians, were collected and analyzed using this system. The distributions of the allelic frequencies and the forensic parameters of each population group were presented. The combined discrimination power and the combined power of exclusion were high in all population groups tested in this study. A multidimensional scaling plot of these nine population groups based on the Reynolds' genetic distances calculated from 15 autosomal STRs was constructed, and the genetic substructure in this area was presented. In conclusion, this 15 autosomal STR multiplex system provides highly informative STR data and appears useful in forensic casework and parentage testing in different populations.

  3. Highly Effective DNA Extraction Method for Nuclear Short Tandem Repeat Testing of Skeletal Remains from Mass Graves

    PubMed Central

    Davoren, Jon; Vanek, Daniel; Konjhodzić, Rijad; Crews, John; Huffine, Edwin; Parsons, Thomas J.

    2007-01-01

    Aim To quantitatively compare a silica extraction method with a commonly used phenol/chloroform extraction method for DNA analysis of specimens exhumed from mass graves. Methods DNA was extracted from twenty randomly chosen femur samples, using the International Commission on Missing Persons (ICMP) silica method, based on Qiagen Blood Maxi Kit, and compared with the DNA extracted by the standard phenol/chloroform-based method. The efficacy of extraction methods was compared by real time polymerase chain reaction (PCR) to measure DNA quantity and the presence of inhibitors and by amplification with the PowerPlex 16 (PP16) multiplex nuclear short tandem repeat (STR) kit. Results DNA quantification results showed that the silica-based method extracted on average 1.94 ng of DNA per gram of bone (range 0.25-9.58 ng/g), compared with only 0.68 ng/g by the organic method extracted (range 0.0016-4.4880 ng/g). Inhibition tests showed that there were on average significantly lower levels of PCR inhibitors in DNA isolated by the organic method. When amplified with PP16, all samples extracted by silica-based method produced 16 full loci profiles, while only 75% of the DNA extracts obtained by organic technique amplified 16 loci profiles. Conclusions The silica-based extraction method showed better results in nuclear STR typing from degraded bone samples than a commonly used phenol/chloroform method. PMID:17696302

  4. BCL11A Enhancer Haplotypes and Fetal Hemoglobin in Sickle Cell Anemia

    PubMed Central

    Sebastiani, P.; Farrell, J.J.; Alsultan, A.; Wang, S.; Edward, H. L.; Shappell, H.; Bae, H.; Milton, J. N.; Baldwin, C.T.; Al-Rubaish, A.M.; Naserullah, Z.; Al-Muhanna, F.; Alsuliman, A.; Patra, P. K.; Farrer, L.A.; Ngo, D.; Vathipadiekal, V.; Chui, D.H.K.; Al-Ali, A.K.; Steinberg, M.H.

    2015-01-01

    Background Fetal hemoglobin (HbF) levels in sickle cell anemia patients vary. We genotyped polymorphisms in the erythroid-specific enhancer of BCL11A to see if they might account for the very high HbF associated with the Arab-Indian (AI) haplotype and Benin haplotype of sickle cell anemia. Methods and Results Six BCL112A enhancer SNPs and their haplotypes were studied in Saudi Arabs from the Eastern Province and Indian patients with AI haplotype (HbF ~20%), African Americans (HbF ~7%), and Saudi Arabs from the Southwestern Province (HbF ~12%). Four SNPs (rs1427407, rs6706648, rs6738440, and rs7606173) and their haplotypes were consistently associated with HbF levels. The distributions of haplotypes differ in the 3 cohorts but not their genetic effects: the haplotype TCAG was associated with the lowest HbF level and the haplotype GTAC was associated with the highest HbF level and differences in HbF levels between carriers of these haplotypes in all cohorts was approximately 6%. Conclusions Common HbF BCL11A enhancer haplotypes in patients with African origin and AI sickle cell anemia have similar effects on HbF but they do not explain their differences in HbF. PMID:25703683

  5. An Ultra-High Discrimination Y Chromosome Short Tandem Repeat Multiplex DNA Typing System

    PubMed Central

    Hanson, Erin K.; Ballantyne, Jack

    2007-01-01

    In forensic casework, Y chromosome short tandem repeat markers (Y-STRs) are often used to identify a male donor DNA profile in the presence of excess quantities of female DNA, such as is found in many sexual assault investigations. Commercially available Y-STR multiplexes incorporating 12–17 loci are currently used in forensic casework (Promega's PowerPlex® Y and Applied Biosystems' AmpFlSTR® Yfiler®). Despite the robustness of these commercial multiplex Y-STR systems and the ability to discriminate two male individuals in most cases, the coincidence match probabilities between unrelated males are modest compared with the standard set of autosomal STR markers. Hence there is still a need to develop new multiplex systems to supplement these for those cases where additional discriminatory power is desired or where there is a coincidental Y-STR match between potential male participants. Over 400 Y-STR loci have been identified on the Y chromosome. While these have the potential to increase the discrimination potential afforded by the commercially available kits, many have not been well characterized. In the present work, 91 loci were tested for their relative ability to increase the discrimination potential of the commonly used ‘core’ Y-STR loci. The result of this extensive evaluation was the development of an ultra high discrimination (UHD) multiplex DNA typing system that allows for the robust co-amplification of 14 non-core Y-STR loci. Population studies with a mixed African American and American Caucasian sample set (n = 572) indicated that the overall discriminatory potential of the UHD multiplex was superior to all commercial kits tested. The combined use of the UHD multiplex and the Applied Biosystems' AmpFlSTR® Yfiler® kit resulted in 100% discrimination of all individuals within the sample set, which presages its potential to maximally augment currently available forensic casework markers. It could also find applications in human evolutionary

  6. Multiple-locus variable-number tandem repeat analysis for molecular typing of Aspergillus fumigatus

    PubMed Central

    2010-01-01

    Background Multiple-locus variable-number tandem repeat (VNTR) analysis (MLVA) is a prominent subtyping method to resolve closely related microbial isolates to provide information for establishing genetic patterns among isolates and to investigate disease outbreaks. The usefulness of MLVA was recently demonstrated for the avian major pathogen Chlamydophila psittaci. In the present study, we developed a similar method for another pathogen of birds: the filamentous fungus Aspergillus fumigatus. Results We selected 10 VNTR markers located on 4 different chromosomes (1, 5, 6 and 8) of A. fumigatus. These markers were tested with 57 unrelated isolates from different hosts or their environment (53 isolates from avian species in France, China or Morocco, 3 isolates from humans collected at CHU Henri Mondor hospital in France and the reference strain CBS 144.89). The Simpson index for individual markers ranged from 0.5771 to 0.8530. A combined loci index calculated with all the markers yielded an index of 0.9994. In a second step, the panel of 10 markers was used in different epidemiological situations and tested on 277 isolates, including 62 isolates from birds in Guangxi province in China, 95 isolates collected in two duck farms in France and 120 environmental isolates from a turkey hatchery in France. A database was created with the results of the present study http://minisatellites.u-psud.fr/MLVAnet/. Three major clusters of isolates were defined by using the graphing algorithm termed Minimum Spanning Tree (MST). The first cluster comprised most of the avian isolates collected in the two duck farms in France, the second cluster comprised most of the avian isolates collected in poultry farms in China and the third one comprised most of the isolates collected in the turkey hatchery in France. Conclusions MLVA displayed excellent discriminatory power. The method showed a good reproducibility. MST analysis revealed an interesting clustering with a clear separation between

  7. The molecular epidemiology of Huntington disease is related to intermediate allele frequency and haplotype in the general population.

    PubMed

    Kay, Chris; Collins, Jennifer A; Wright, Galen E B; Baine, Fiona; Miedzybrodzka, Zosia; Aminkeng, Folefac; Semaka, Alicia J; McDonald, Cassandra; Davidson, Mark; Madore, Steven J; Gordon, Erynn S; Gerry, Norman P; Cornejo-Olivas, Mario; Squitieri, Ferdinando; Tishkoff, Sarah; Greenberg, Jacquie L; Krause, Amanda; Hayden, Michael R

    2018-04-01

    Huntington disease (HD) is the most common monogenic neurodegenerative disorder in populations of European ancestry, but occurs at lower prevalence in populations of East Asian or black African descent. New mutations for HD result from CAG repeat expansions of intermediate alleles (IAs), usually of paternal origin. The differing prevalence of HD may be related to the rate of new mutations in a population, but no comparative estimates of IA frequency or the HD new mutation rate are available. In this study, we characterize IA frequency and the CAG repeat distribution in fifteen populations of diverse ethnic origin. We estimate the HD new mutation rate in a series of populations using molecular IA expansion rates. The frequency of IAs was highest in Hispanic Americans and Northern Europeans, and lowest in black Africans and East Asians. The prevalence of HD correlated with the frequency of IAs by population and with the proportion of IAs found on the HD-associated A1 haplotype. The HD new mutation rate was estimated to be highest in populations with the highest frequency of IAs. In European ancestry populations, one in 5,372 individuals from the general population and 7.1% of individuals with an expanded CAG repeat in the HD range are estimated to have a molecular new mutation. Our data suggest that the new mutation rate for HD varies substantially between populations, and that IA frequency and haplotype are closely linked to observed epidemiological differences in the prevalence of HD across major ancestry groups in different countries. © 2018 Wiley Periodicals, Inc.

  8. Modeling haplotype block variation using Markov chains.

    PubMed

    Greenspan, G; Geiger, D

    2006-04-01

    Models of background variation in genomic regions form the basis of linkage disequilibrium mapping methods. In this work we analyze a background model that groups SNPs into haplotype blocks and represents the dependencies between blocks by a Markov chain. We develop an error measure to compare the performance of this model against the common model that assumes that blocks are independent. By examining data from the International Haplotype Mapping project, we show how the Markov model over haplotype blocks is most accurate when representing blocks in strong linkage disequilibrium. This contrasts with the independent model, which is rendered less accurate by linkage disequilibrium. We provide a theoretical explanation for this surprising property of the Markov model and relate its behavior to allele diversity.

  9. Modeling Haplotype Block Variation Using Markov Chains

    PubMed Central

    Greenspan, G.; Geiger, D.

    2006-01-01

    Models of background variation in genomic regions form the basis of linkage disequilibrium mapping methods. In this work we analyze a background model that groups SNPs into haplotype blocks and represents the dependencies between blocks by a Markov chain. We develop an error measure to compare the performance of this model against the common model that assumes that blocks are independent. By examining data from the International Haplotype Mapping project, we show how the Markov model over haplotype blocks is most accurate when representing blocks in strong linkage disequilibrium. This contrasts with the independent model, which is rendered less accurate by linkage disequilibrium. We provide a theoretical explanation for this surprising property of the Markov model and relate its behavior to allele diversity. PMID:16361244

  10. Evidence of triple mutant Pfdhps ISGNGA haplotype in Plasmodium falciparum isolates from North-east India: An analysis of sulfadoxine resistant haplotype selection.

    PubMed

    Das, Manuj K; Chetry, Sumi; Kalita, Mohan C; Dutta, Prafulla

    2016-12-01

    North-east region of India has consistent role in the spread of multi drug resistant Plasmodium (P.) falciparum to other parts of Southeast Asia. After rapid clinical treatment failure of Artemisinin based combination therapy-Sulphadoxine/Pyrimethamine (ACT-SP) chemoprophylaxis, Artemether-Lumefantrine (ACT-AL) combination therapy was introduced in the year 2012 in this region for the treatment of uncomplicated P. falciparum malaria. In a DNA sequencing based polymorphism analysis, seven codons of P. falciparum dihydropteroate synthetase ( Pf dhps) gene were screened in a total of 127 P. falciparum isolates collected from Assam, Arunachal Pradesh and Tripura of North-east India during the year 2014 and 2015 to document current sulfadoxine resistant haplotypes. Sequences were analyzed to rearrange both nucleotide and protein haplotypes. Molecular diversity indices were analyzed in DNA Sequence Polymorphism software (DnaSP) on the basis of Pf dhps gene sequences. Disappearance from selective neutrality was assessed based on the ratio of non-synonomous to synonomous nucleotide substitutions [dN/dS ratio]. Moreover, two-tailed Z test was performed in search of the significance for probability of rejecting null hypothesis of strict neutrality [dN = dS]. Presence of mutant P. falciparum multidrug resistance protein1 ( Pf mdr1) was also checked in those isolates that were present with new Pf dhps haplotypes. Phylogenetic relationship based on Pf dhps gene was reconstructed in Molecular Evolutionary Genetics Analysis (MEGA). Among eight different sulfadoxine resistant haplotypes found, IS GNG A haplotype was documented in a total of five isolates from Tripura with association of a new mutant M538 R allele. Sequence analysis of Pf mdr1 gene in these five isolates came to notice that not all but only one isolate was mutant at codon 86 (N86 Y ; Y YSND) in the multidrug resistance protein. Molecular diversity based on Pf dhps haplotypes revealed that P. falciparum

  11. Second generation subtyping: a proposed PulseNet protocol for multiple-locus variable-number tandem repeat analysis of Shiga toxin-producing Escherichia coli O157 (STEC O157).

    PubMed

    Hyytiä-Trees, Eija; Smole, Sandra C; Fields, Patricia A; Swaminathan, Bala; Ribot, Efrain M

    2006-01-01

    Most bacterial genomes contain tandem duplications of short DNA sequences, termed "variable-number tandem repeats" (VNTR). A subtyping method targeting these repeats, multiple-locus VNTR analysis (MLVA), has emerged as a powerful tool for characterization of clonal organisms such as Shiga toxin-producing Escherichia coli O157 (STEC O157). We modified and optimized a recently published MLVA scheme targeting 29 polymorphic VNTR regions of STEC O157 to render it suitable for routine use by public health laboratories that participate in PulseNet, the national and international molecular subtyping network for foodborne disease surveillance. Nine VNTR loci were included in the final protocol. They were amplified in three PCR reactions, after which the PCR products were sized using capillary electrophoresis. Two hundred geographically diverse, sporadic and outbreak- related STEC O157 isolates were characterized by MLVA and the results were compared with data obtained by pulsed-field gel electrophoresis (PFGE) using XbaI macrorestriction of genomic DNA. A total of 139 unique XbaI PFGE patterns and 162 MLVA types were identified. A subset of 100 isolates characterized by both XbaI and BlnI macrorestriction had 62 unique PFGE and MLVA types. Although the clustering of isolates by the two subtyping systems was generally in agreement, some discrepancies were observed. Importantly, MLVA was able to discriminate among some epidemiologically unrelated isolates which were indistinguishable by PFGE. However, among strains from three of the eight outbreaks included in the study, two single locus MLVA variants and one double locus variant were detected among epidemiologically implicated isolates that were indistinguishable by PFGE. Conversely, in three other outbreaks, isolates that were indistinguishable by MLVA displayed multiple PFGE types. An additional more extensive multi-laboratory validation of the MLVA protocol is in progress in order to address critical issues such as

  12. The structure of the protein phosphatase 2A PR65/A subunit reveals the conformation of its 15 tandemly repeated HEAT motifs.

    PubMed

    Groves, M R; Hanlon, N; Turowski, P; Hemmings, B A; Barford, D

    1999-01-08

    The PR65/A subunit of protein phosphatase 2A serves as a scaffolding molecule to coordinate the assembly of the catalytic subunit and a variable regulatory B subunit, generating functionally diverse heterotrimers. Mutations of the beta isoform of PR65 are associated with lung and colon tumors. The crystal structure of the PR65/Aalpha subunit, at 2.3 A resolution, reveals the conformation of its 15 tandemly repeated HEAT sequences, degenerate motifs of approximately 39 amino acids present in a variety of proteins, including huntingtin and importin beta. Individual motifs are composed of a pair of antiparallel alpha helices that assemble in a mainly linear, repetitive fashion to form an elongated molecule characterized by a double layer of alpha helices. Left-handed rotations at three interrepeat interfaces generate a novel left-hand superhelical conformation. The protein interaction interface is formed from the intrarepeat turns that are aligned to form a continuous ridge.

  13. Molecular characterization of Shiga-toxigenic Escherichia coli isolated from diverse sources from India by multi-locus variable number tandem repeat analysis (MLVA).

    PubMed

    Kumar, A; Taneja, N; Sharma, R K; Sharma, H; Ramamurthy, T; Sharma, M

    2014-12-01

    In a first study from India, a diverse collection of 140 environmental and clinical non-O157 Shiga-toxigenic Escherichia coli strains from a large geographical area in north India was typed by multi-locus variable number tandem repeat analysis (MLVA). The distribution of major virulence genes stx1, stx2 and eae was found to be 78%, 70% and 10%, respectively; 15 isolates were enterohaemorrhagic E. coli (stx1 +/stx2 + and eae +). By MLVA analysis, 44 different alleles were obtained. Dendrogram analysis revealed 104 different genotypes and 19 MLVA-type complexes divided into two main lineages, i.e. mutton and animal stool. Human isolates presented a statistically significant greater odds ratio for clustering with mutton samples compared to animal stool isolates. Five human isolates clustered with animal stool strains suggesting that some of the human infections may be from cattle, perhaps through milk, contact or the environment. Further epidemiological studies are required to explore these sources in context with occurrence of human cases.

  14. A multiple-locus variable-number tandem repeat analysis (MLVA) of Listeria monocytogenes isolated from Norwegian salmon-processing factories and from listeriosis patients.

    PubMed

    Lunestad, B T; Truong, T T T; Lindstedt, B-A

    2013-10-01

    The objective of this study was to characterize Listeria monocytogenes isolated from farmed Atlantic salmon (Salmo salar) and the processing environment in three different Norwegian factories, and compare these to clinical isolates by multiple-locus variable-number tandem repeat analysis (MLVA). The 65 L. monocytogenes isolates obtained gave 15 distinct MLVA profiles. There was great heterogeneity in the distribution of MLVA profiles in factories and within each factory. Nine of the 15 MLVA profiles found in the fish-associated isolates were found to match human profiles. The MLVA profile 07-07-09-10-06 was the most common strain in Norwegian listeriosis patients. L. monocytogenes with this profile has previously been associated with at least two known listeriosis outbreaks in Norway, neither determined to be due to fish consumption. However, since this profile was also found in fish and in the processing environment, fish should be considered as a possible food vehicle during sporadic cases and outbreaks of listeriosis.

  15. Mineralocorticoid receptor haplotype, oral contraceptives and emotional information processing.

    PubMed

    Hamstra, D A; de Kloet, E R; van Hemert, A M; de Rijk, R H; Van der Does, A J W

    2015-02-12

    Oral contraceptives (OCs) affect mood in some women and may have more subtle effects on emotional information processing in many more users. Female carriers of mineralocorticoid receptor (MR) haplotype 2 have been shown to be more optimistic and less vulnerable to depression. To investigate the effects of oral contraceptives on emotional information processing and a possible moderating effect of MR haplotype. Cross-sectional study in 85 healthy premenopausal women of West-European descent. We found significant main effects of oral contraceptives on facial expression recognition, emotional memory and decision-making. Furthermore, carriers of MR haplotype 1 or 3 were sensitive to the impact of OCs on the recognition of sad and fearful faces and on emotional memory, whereas MR haplotype 2 carriers were not. Different compounds of OCs were included. No hormonal measures were taken. Most naturally cycling participants were assessed in the luteal phase of their menstrual cycle. Carriers of MR haplotype 2 may be less sensitive to depressogenic side-effects of OCs. Copyright © 2015 IBRO. Published by Elsevier Ltd. All rights reserved.

  16. Accurate quantification of chromosomal lesions via short tandem repeat analysis using minimal amounts of DNA.

    PubMed

    Jann, Johann-Christoph; Nowak, Daniel; Nolte, Florian; Fey, Stephanie; Nowak, Verena; Obländer, Julia; Pressler, Jovita; Palme, Iris; Xanthopoulos, Christina; Fabarius, Alice; Platzbecker, Uwe; Giagounidis, Aristoteles; Götze, Katharina; Letsch, Anne; Haase, Detlef; Schlenk, Richard; Bug, Gesine; Lübbert, Michael; Ganser, Arnold; Germing, Ulrich; Haferlach, Claudia; Hofmann, Wolf-Karsten; Mossner, Maximilian

    2017-09-01

    Cytogenetic aberrations such as deletion of chromosome 5q (del(5q)) represent key elements in routine clinical diagnostics of haematological malignancies. Currently established methods such as metaphase cytogenetics, FISH or array-based approaches have limitations due to their dependency on viable cells, high costs or semi-quantitative nature. Importantly, they cannot be used on low abundance DNA. We therefore aimed to establish a robust and quantitative technique that overcomes these shortcomings. For precise determination of del(5q) cell fractions, we developed an inexpensive multiplex-PCR assay requiring only nanograms of DNA that simultaneously measures allelic imbalances of 12 independent short tandem repeat markers. Application of this method to n=1142 samples from n=260 individuals revealed strong intermarker concordance (R²=0.77-0.97) and reproducibility (mean SD: 1.7%). Notably, the assay showed accurate quantification via standard curve assessment (R²>0.99) and high concordance with paired FISH measurements (R²=0.92) even with subnanogram amounts of DNA. Moreover, cytogenetic response was reliably confirmed in del(5q) patients with myelodysplastic syndromes treated with lenalidomide. While the assay demonstrated good diagnostic accuracy in receiver operating characteristic analysis (area under the curve: 0.97), we further observed robust correlation between bone marrow and peripheral blood samples (R²=0.79), suggesting its potential suitability for less-invasive clonal monitoring. In conclusion, we present an adaptable tool for quantification of chromosomal aberrations, particularly in problematic samples, which should be easily applicable to further tumour entities. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  17. Clostridium botulinum group I strain genotyping by 15-locus multilocus variable-number tandem-repeat analysis.

    PubMed

    Fillo, Silvia; Giordani, Francesco; Anniballi, Fabrizio; Gorgé, Olivier; Ramisse, Vincent; Vergnaud, Gilles; Riehm, Julia M; Scholz, Holger C; Splettstoesser, Wolf D; Kieboom, Jasper; Olsen, Jaran-Strand; Fenicia, Lucia; Lista, Florigio

    2011-12-01

    Clostridium botulinum is a taxonomic designation that encompasses a broad variety of spore-forming, Gram-positive bacteria producing the botulinum neurotoxin (BoNT). C. botulinum is the etiologic agent of botulism, a rare but severe neuroparalytic disease. Fine-resolution genetic characterization of C. botulinum isolates of any BoNT type is relevant for both epidemiological studies and forensic microbiology. A 10-locus multiple-locus variable-number tandem-repeat analysis (MLVA) was previously applied to isolates of C. botulinum type A. The present study includes five additional loci designed to better address proteolytic B and F serotypes. We investigated 79 C. botulinum group I strains isolated from human and food samples in several European countries, including types A (28), B (36), AB (4), and F (11) strains, and 5 nontoxic Clostridium sporogenes. Additional data were deduced from in silico analysis of 10 available fully sequenced genomes. This 15-locus MLVA (MLVA-15) scheme identified 86 distinct genotypes that clustered consistently with the results of amplified fragment length polymorphism (AFLP) and MLVA genotyping in previous reports. An MLVA-7 scheme, a subset of the MLVA-15, performed on a lab-on-a-chip device using a nonfluorescent subset of primers, is also proposed as a first-line assay. The phylogenetic grouping obtained with the MLVA-7 does not differ significantly from that generated by the MLVA-15. To our knowledge, this report is the first to analyze genetic variability among all of the C. botulinum group I serotypes by MLVA. Our data provide new insights into the genetic variability of group I C. botulinum isolates worldwide and demonstrate that this group is genetically highly diverse.

  18. Population genetic study of 10 short tandem repeat loci from 600 domestic dogs in Korea.

    PubMed

    Moon, Seo Hyun; Jang, Yoon-Jeong; Han, Myun Soo; Cho, Myung-Haing

    2016-09-30

    Dogs have long shared close relationships with many humans. Due to the large number of dogs in human populations, they are often involved in crimes. Occasionally, canine biological evidence such as saliva, bloodstains and hairs can be found at crime scenes. Accordingly, canine DNA can be used as forensic evidence. The use of short tandem repeat (STR) loci from biological evidence is valuable for forensic investigations. In Korea, canine STR profiling-related crimes are being successfully analyzed, leading to diverse crimes such as animal cruelty, dog-attacks, murder, robbery, and missing and abandoned dogs being solved. However, the probability of random DNA profile matches cannot be analyzed because of a lack of canine STR data. Therefore, in this study, 10 STR loci were analyzed in 600 dogs in Korea (344 dogs belonging to 30 different purebreds and 256 crossbred dogs) to estimate canine forensic genetic parameters. Among purebred dogs, a separate statistical analysis was conducted for five major subgroups, 97 Maltese, 47 Poodles, 31 Shih Tzus, 32 Yorkshire Terriers, and 25 Pomeranians. Allele frequencies, expected (Hexp) and observed heterozygosity (Hobs), fixation index (F), probability of identity (P(ID)), probability of sibling identity (P(ID)sib) and probability of exclusion (PE) were then calculated. The Hexp values ranged from 0.901 (PEZ12) to 0.634 (FHC2079), while the P(ID)sib values were between 0.481 (FHC2079) and 0.304 (PEZ12) and the P(ID)sib was about 3.35 × 10(-)⁵ for the combination of all 10 loci. The results presented herein will strengthen the value of canine DNA to solving dog-related crimes.

  19. Development of new multilocus variable number of tandem repeat analysis (MLVA) for Listeria innocua and its application in a food processing plant.

    PubMed

    Takahashi, Hajime; Ohshima, Chihiro; Nakagawa, Miku; Thanatsang, Krittaporn; Phraephaisarn, Chirapiphat; Chaturongkasumrit, Yuphakhun; Keeratipibul, Suwimon; Kuda, Takashi; Kimura, Bon

    2014-01-01

    Listeria innocua is an important hygiene indicator bacterium in food industries because it behaves similar to Listeria monocytogenes, which is pathogenic to humans. PFGE is often used to characterize bacterial strains and to track contamination source. However, because PFGE is an expensive, complicated, time-consuming protocol, and poses difficulty in data sharing, development of a new typing method is necessary. MLVA is a technique that identifies bacterial strains on the basis of the number of tandem repeats present in the genome varies depending on the strains. MLVA has gained attention due to its high reproducibility and ease of data sharing. In this study, we developed a MLVA protocol to assess L. innocua and evaluated it by tracking the contamination source of L. innocua in an actual food manufacturing factory by typing the bacterial strains isolated from the factory. Three VNTR regions of the L. innocua genome were chosen for use in the MLVA. The number of repeat units in each VNTR region was calculated based on the results of PCR product analysis using capillary electrophoresis (CE). The calculated number of repetitions was compared with the results of the gene sequence analysis to demonstrate the accuracy of the CE repeat number analysis. The developed technique was evaluated using 60 L. innocua strains isolated from a food factory. These 60 strains were classified into 11 patterns using MLVA. Many of the strains were classified into ST-6, revealing that this MLVA strain type can contaminate each manufacturing process in the factory. The MLVA protocol developed in this study for L. innocua allowed rapid and easy analysis through the use of CE. This technique was found to be very useful in hygiene control in factories because it allowed us to track contamination sources and provided information regarding whether the bacteria were present in the factories.

  20. Ultraaccurate genome sequencing and haplotyping of single human cells.

    PubMed

    Chu, Wai Keung; Edge, Peter; Lee, Ho Suk; Bansal, Vikas; Bafna, Vineet; Huang, Xiaohua; Zhang, Kun

    2017-11-21

    Accurate detection of variants and long-range haplotypes in genomes of single human cells remains very challenging. Common approaches require extensive in vitro amplification of genomes of individual cells using DNA polymerases and high-throughput short-read DNA sequencing. These approaches have two notable drawbacks. First, polymerase replication errors could generate tens of thousands of false-positive calls per genome. Second, relatively short sequence reads contain little to no haplotype information. Here we report a method, which is dubbed SISSOR (single-stranded sequencing using microfluidic reactors), for accurate single-cell genome sequencing and haplotyping. A microfluidic processor is used to separate the Watson and Crick strands of the double-stranded chromosomal DNA in a single cell and to randomly partition megabase-size DNA strands into multiple nanoliter compartments for amplification and construction of barcoded libraries for sequencing. The separation and partitioning of large single-stranded DNA fragments of the homologous chromosome pairs allows for the independent sequencing of each of the complementary and homologous strands. This enables the assembly of long haplotypes and reduction of sequence errors by using the redundant sequence information and haplotype-based error removal. We demonstrated the ability to sequence single-cell genomes with error rates as low as 10 -8 and average 500-kb-long DNA fragments that can be assembled into haplotype contigs with N50 greater than 7 Mb. The performance could be further improved with more uniform amplification and more accurate sequence alignment. The ability to obtain accurate genome sequences and haplotype information from single cells will enable applications of genome sequencing for diverse clinical needs. Copyright © 2017 the Author(s). Published by PNAS.

  1. VNTR internal structure mapping at the {alpha}-globin 3{prime}HVR locus reveals a hierachy of related lineages in oceania

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Martinson, J.J.; Clegg, J.B.; Boyce, A.J.

    1994-09-01

    Analysis of the {alpha}-globin gene complex in Oceania has revealed many different rearrangements which remove one of the adult globin genes. Frequencies of these deletion chromosomes are elevated by malarial resistance conferred by the resulting {alpha}-thalassaemia. One particular deletion chromosome, designated -{alpha}{sup 3.7}III, is found at high levels in Melanesia and Polynesia: RFLP haplotype analysis shows that this deletion is always found on chromosomes bearing the IIIa haplotype and is likely to be the product of one single rearrangement event. A subset of the -{alpha}{sup 3.7}III chromosomes carries a more recent mutation which generates the haemoglobin variant HbJ{sup Tongariki}. Wemore » have characterized the allelic variation at the 3{prime}HVR VNTR locus located 6 kb from the globin genes in each of these groups of chromosomes. We have determined the internal structure of these alleles by RFLP mapping of PCR-amplified DNA: within each group, the allelic diversity results from the insertion and/or deletion of small {open_quotes}motifs{close_quotes} of up to 6 adjacent repeats. Mapping of 3{prime}HVR alleles associated with other haplotypes reveals that these are composed of repeat arrays that are substantially different to those derived from IIIa chromosomes, indicating that interchromosomal recombination between heterologous haplotypes does not account for any of the diversity seen to date. We have recently shown that allelic size variation at the two VNTR loci flanking the {alpha}-globin complex is very closely linked to the haplotypes known to be present at this locus. Here we show that, within a haplotype, VNTR alleles are very closely related to each other on the basis of internal structure and demonstrate that intrachromosomal mutation processes involving small numbers of tandem repeats are the main cause of variation at this locus.« less

  2. Haplotype Phasing and Inheritance of Copy Number Variants in Nuclear Families

    PubMed Central

    Palta, Priit; Kaplinski, Lauris; Nagirnaja, Liina; Veidenberg, Andres; Möls, Märt; Nelis, Mari; Esko, Tõnu; Metspalu, Andres; Laan, Maris; Remm, Maido

    2015-01-01

    DNA copy number variants (CNVs) that alter the copy number of a particular DNA segment in the genome play an important role in human phenotypic variability and disease susceptibility. A number of CNVs overlapping with genes have been shown to confer risk to a variety of human diseases thus highlighting the relevance of addressing the variability of CNVs at a higher resolution. So far, it has not been possible to deterministically infer the allelic composition of different haplotypes present within the CNV regions. We have developed a novel computational method, called PiCNV, which enables to resolve the haplotype sequence composition within CNV regions in nuclear families based on SNP genotyping microarray data. The algorithm allows to i) phase normal and CNV-carrying haplotypes in the copy number variable regions, ii) resolve the allelic copies of rearranged DNA sequence within the haplotypes and iii) infer the heritability of identified haplotypes in trios or larger nuclear families. To our knowledge this is the first program available that can deterministically phase null, mono-, di-, tri- and tetraploid genotypes in CNV loci. We applied our method to study the composition and inheritance of haplotypes in CNV regions of 30 HapMap Yoruban trios and 34 Estonian families. For 93.6% of the CNV loci, PiCNV enabled to unambiguously phase normal and CNV-carrying haplotypes and follow their transmission in the corresponding families. Furthermore, allelic composition analysis identified the co-occurrence of alternative allelic copies within 66.7% of haplotypes carrying copy number gains. We also observed less frequent transmission of CNV-carrying haplotypes from parents to children compared to normal haplotypes and identified an emergence of several de novo deletions and duplications in the offspring. PMID:25853576

  3. Haplotype phasing and inheritance of copy number variants in nuclear families.

    PubMed

    Palta, Priit; Kaplinski, Lauris; Nagirnaja, Liina; Veidenberg, Andres; Möls, Märt; Nelis, Mari; Esko, Tõnu; Metspalu, Andres; Laan, Maris; Remm, Maido

    2015-01-01

    DNA copy number variants (CNVs) that alter the copy number of a particular DNA segment in the genome play an important role in human phenotypic variability and disease susceptibility. A number of CNVs overlapping with genes have been shown to confer risk to a variety of human diseases thus highlighting the relevance of addressing the variability of CNVs at a higher resolution. So far, it has not been possible to deterministically infer the allelic composition of different haplotypes present within the CNV regions. We have developed a novel computational method, called PiCNV, which enables to resolve the haplotype sequence composition within CNV regions in nuclear families based on SNP genotyping microarray data. The algorithm allows to i) phase normal and CNV-carrying haplotypes in the copy number variable regions, ii) resolve the allelic copies of rearranged DNA sequence within the haplotypes and iii) infer the heritability of identified haplotypes in trios or larger nuclear families. To our knowledge this is the first program available that can deterministically phase null, mono-, di-, tri- and tetraploid genotypes in CNV loci. We applied our method to study the composition and inheritance of haplotypes in CNV regions of 30 HapMap Yoruban trios and 34 Estonian families. For 93.6% of the CNV loci, PiCNV enabled to unambiguously phase normal and CNV-carrying haplotypes and follow their transmission in the corresponding families. Furthermore, allelic composition analysis identified the co-occurrence of alternative allelic copies within 66.7% of haplotypes carrying copy number gains. We also observed less frequent transmission of CNV-carrying haplotypes from parents to children compared to normal haplotypes and identified an emergence of several de novo deletions and duplications in the offspring.

  4. Mineralocorticoid receptor haplotype, estradiol, progesterone and emotional information processing.

    PubMed

    Hamstra, Danielle A; de Kloet, E Ronald; Quataert, Ina; Jansen, Myrthe; Van der Does, Willem

    2017-02-01

    Carriers of MR-haplotype 1 and 3 (GA/CG; rs5522 and rs2070951) are more sensitive to the influence of oral contraceptives (OC) and menstrual cycle phase on emotional information processing than MR-haplotype 2 (CA) carriers. We investigated whether this effect is associated with estradiol (E2) and/or progesterone (P4) levels. Healthy MR-genotyped premenopausal women were tested twice in a counterbalanced design. Naturally cycling (NC) women were tested in the early-follicular and mid-luteal phase and OC-users during OC-intake and in the pill-free week. At both sessions E2 and P4 were assessed in saliva. Tests included implicit and explicit positive and negative affect, attentional blink accuracy, emotional memory, emotion recognition, and risky decision-making (gambling). MR-haplotype 2 homozygotes had higher implicit happiness scores than MR-haplotype 2 heterozygotes (p=0.031) and MR-haplotype 1/3 carriers (p<0.001). MR-haplotype 2 homozygotes also had longer reaction times to happy faces in an emotion recognition test than MR-haplotype 1/3 (p=0.001). Practice effects were observed for most measures. The pattern of correlations between information processing and P4 or E2 differed between sessions, as well as the moderating effects of the MR genotype. In the first session the MR-genotype moderated the influence of P4 on implicit anxiety (sr=-0.30; p=0.005): higher P4 was associated with reduction in implicit anxiety, but only in MR-haplotype 2 homozygotes (sr=-0.61; p=0.012). In the second session the MR-genotype moderated the influence of E2 on the recognition of facial expressions of happiness (sr=-0.21; p=0.035): only in MR-haplotype 1/3 higher E2 was correlated with happiness recognition (sr=0.29; p=0.005). In the second session higher E2 and P4 were negatively correlated with accuracy in lag2 trials of the attentional blink task (p<0.001). Thus NC women, compared to OC-users, performed worse on lag 2 trials (p=0.041). The higher implicit happiness scores of MR-haplotype

  5. Characterization of a tandemly repeated DNA sequence family originally derived by retroposition of tRNA(Glu) in the newt.

    PubMed

    Nagahashi, S; Endoh, H; Suzuki, Y; Okada, N

    1991-11-20

    A previous report from this laboratory showed that in vitro transcription of total genomic DNA of the newt Cynopus pyrrhogaster resulted in a discrete sized 8 S RNA, which represented highly repetitive and transcribable sequences with a glutamic acid tRNA-like structure in the newt genome. We isolated four independent clones from a newt genomic library and determined the complete sequences of three 2000 to 2400 base-pair PstI fragments spanning the 8 S RNA gene. The glutamic acid tRNA-related segment in the 8 S RNA gene contains the CCA sequence expected as the 3' terminus of a tRNA molecule. Further, the 11 nucleotides located 13 nucleotides upstream from one of the two transcription initiation sites of the 8 S RNA were found to be repeated in the region upstream from the termination site, suggesting that the original unit, which is shorter than the 8 S RNA, was retrotransposed via cDNA intermediates from the PolIII transcript. In the upstream region of the 8 S RNA gene, a 360 nucleotide unit containing the glutamic acid tRNA-related segment was found to be duplicated (clones NE1 and NE10) or triplicated (clone NE3). Except for the difference in the number of the 360 nucleotide unit, the three sequences of the 2000 to 2400 base-pair PstI fragment were essentially the same with only a few mutations and minor deletions. Inverse polymerase chain reaction and sequence determination of the products, together with a Southern hybridization experiment, demonstrated that the family consists of a tandemly repeated unit of 3300, 3700 or 4100 base-pairs. Thus during evolution, this family in the newt was created by retroposition via cDNA intermediates, followed by duplication or triplication of the 360 nucleotide unit and multiplication of the 3300 to 4100 base-pair region at the DNA level.

  6. A phased SNP-based classification of sickle cell anemia HBB haplotypes.

    PubMed

    Shaikho, Elmutaz M; Farrell, John J; Alsultan, Abdulrahman; Qutub, Hatem; Al-Ali, Amein K; Figueiredo, Maria Stella; Chui, David H K; Farrer, Lindsay A; Murphy, George J; Mostoslavsky, Gustavo; Sebastiani, Paola; Steinberg, Martin H

    2017-08-11

    Sickle cell anemia causes severe complications and premature death. Five common β-globin gene cluster haplotypes are each associated with characteristic fetal hemoglobin (HbF) levels. As HbF is the major modulator of disease severity, classifying patients according to haplotype is useful. The first method of haplotype classification used restriction fragment length polymorphisms (RFLPs) to detect single nucleotide polymorphisms (SNPs) in the β-globin gene cluster. This is labor intensive, and error prone. We used genome-wide SNP data imputed to the 1000 Genomes reference panel to obtain phased data distinguishing parental alleles. We successfully haplotyped 813 sickle cell anemia patients previously classified by RFLPs with a concordance >98%. Four SNPs (rs3834466, rs28440105, rs10128556, and rs968857) marking four different restriction enzyme sites unequivocally defined most haplotypes. We were able to assign a haplotype to 86% of samples that were either partially or misclassified using RFLPs. Phased data using only four SNPs allowed unequivocal assignment of a haplotype that was not always possible using a larger number of RFLPs. Given the availability of genome-wide SNP data, our method is rapid and does not require high computational resources.

  7. PopAffiliator: online calculator for individual affiliation to a major population group based on 17 autosomal short tandem repeat genotype profile.

    PubMed

    Pereira, Luísa; Alshamali, Farida; Andreassen, Rune; Ballard, Ruth; Chantratita, Wasun; Cho, Nam Soo; Coudray, Clotilde; Dugoujon, Jean-Michel; Espinoza, Marta; González-Andrade, Fabricio; Hadi, Sibte; Immel, Uta-Dorothee; Marian, Catalin; Gonzalez-Martin, Antonio; Mertens, Gerhard; Parson, Walther; Perone, Carlos; Prieto, Lourdes; Takeshita, Haruo; Rangel Villalobos, Héctor; Zeng, Zhaoshu; Zhivotovsky, Lev; Camacho, Rui; Fonseca, Nuno A

    2011-09-01

    Because of their sensitivity and high level of discrimination, short tandem repeat (STR) maker systems are currently the method of choice in routine forensic casework and data banking, usually in multiplexes up to 15-17 loci. Constraints related to sample amount and quality, frequently encountered in forensic casework, will not allow to change this picture in the near future, notwithstanding the technological developments. In this study, we present a free online calculator named PopAffiliator ( http://cracs.fc.up.pt/popaffiliator ) for individual population affiliation in the three main population groups, Eurasian, East Asian and sub-Saharan African, based on genotype profiles for the common set of STRs used in forensics. This calculator performs affiliation based on a model constructed using machine learning techniques. The model was constructed using a data set of approximately fifteen thousand individuals collected for this work. The accuracy of individual population affiliation is approximately 86%, showing that the common set of STRs routinely used in forensics provide a considerable amount of information for population assignment, in addition to being excellent for individual identification.

  8. Genomic evolution in domestic cattle: ancestral haplotypes and healthy beef.

    PubMed

    Williamson, Joseph F; Steele, Edward J; Lester, Susan; Kalai, Oscar; Millman, John A; Wolrige, Lindsay; Bayard, Dominic; McLure, Craig; Dawkins, Roger L

    2011-05-01

    We have identified numerous Ancestral Haplotypes encoding a 14-Mb region of Bota C19. Three are frequent in Simmental, Angus and Wagyu and have been conserved since common progenitor populations. Others are more relevant to the differences between these 3 breeds including fat content and distribution in muscle. SREBF1 and Growth Hormone, which have been implicated in the production of healthy beef, are included within these haplotypes. However, we conclude that alleles at these 2 loci are less important than other sequences within the haplotypes. Identification of breeds and hybrids is improved by using haplotypes rather than individual alleles. Copyright © 2010 Elsevier Inc. All rights reserved.

  9. Copy number and haplotype variation at the VRN-A1 and central FR-A2 loci are associated with frost tolerance in hexaploid wheat.

    PubMed

    Zhu, Jie; Pearce, Stephen; Burke, Adrienne; See, Deven Robert; Skinner, Daniel Z; Dubcovsky, Jorge; Garland-Campbell, Kimberly

    2014-05-01

    The interaction between VRN - A1 and FR - A2 largely affect the frost tolerance of hexaploid wheat. Frost tolerance is critical for wheat survival during cold winters. Natural variation for this trait is mainly associated with allelic differences at the VERNALIZATION 1 (VRN1) and FROST RESISTANCE 2 (FR2) loci. VRN1 regulates the transition between vegetative and reproductive stages and FR2, a locus including several tandemly duplicated C-REPEAT BINDING FACTOR (CBF) transcription factors, regulates the expression of Cold-regulated genes. We identified sequence and copy number variation at these two loci among winter and spring wheat varieties and characterized their association with frost tolerance. We identified two FR-A2 haplotypes-'FR-A2-S' and 'FR-A2-T'-distinguished by two insertion/deletions and ten single nucleotide polymorphisms within the CBF-A12 and CBF-A15 genes. Increased copy number of CBF-A14 was frequently associated with the FR-A2-T haplotype and with higher CBF14 transcript levels in response to cold. Factorial ANOVAs revealed significant interactions between VRN1 and FR-A2 for frost tolerance in both winter and spring panels suggesting a crosstalk between vernalization and cold acclimation pathways. The model including these two loci and their interaction explained 32.0 and 20.7 % of the variation in frost tolerance in the winter and spring panels, respectively. The interaction was validated in a winter wheat F 4:5 population segregating for both genes. Increased VRN-A1 copy number was associated with improved frost tolerance among varieties carrying the FR-A2-T allele but not among those carrying the FR-A2-S allele. These results suggest that selection of varieties carrying the FR-A2-T allele and three copies of the recessive vrn-A1 allele would be a good strategy to improve frost tolerance in wheat.

  10. Distribution of MICA alleles and haplotypes associated with HLA in the Korean population.

    PubMed

    Pyo, Chul-Woo; Hur, Seong-Suk; Kim, Yang-Kyum; Choi, Hee-Baeg; Kim, Tae-Yoon; Kim, Tai-Gyu

    2003-03-01

    The MICA (MHC class I chain-related gene A) is a polymorphic gene located 46 kb centromeric of the HLA-B gene, and is preferentially expressed in epithelial cells and intestinal mucosa. The MICA gene, similar to human leukocyte antigen (HLA) class I, displays a high degree of genetic polymorphism in exons 2, 3, 4, and 5, amounting to 54 alleles. In this study, we investigated the polymorphisms at exons coding for extracellular domains (exons 2, 3, and 4), and the GCT repeat polymorphism at the transmembrane (exon 5) of MICA in 199 unrelated healthy Koreans. Eight alleles were observed in the Korean population, with allele frequencies for MICA*010, MICA*00201, MICA*027, MICA*004, MICA*012, MICA*00801, MICA*00901, and MICA*00701 being 18.3%, 17.8%, 13.6%, 12.3%, 11.1%, 10.8%, 10.6%, and 3.3%, respectively. Strong linkage disequilibria were also observed between the MICA and HLA-B gene-MICA*00201-B58, MICA*004-B44, MICA*00701-B27, MICA*00801-B60, MICA*00901-B51, MICA*010-B62, MICA*012-B54, and MICA*027-B61. In the analysis of the haplotypes of HLA class I genes (HLA-A, B, and C) and the MICA, the most common haplotype was MICA*004-A33-B44-Cw*07, followed by MICA*00201-A2-B58-Cw*0302 and MICA*012-A2-B54-Cw*0102. The MICA null haplotype might be identified in the HLA-B48 homozygous individual. These results will provide an understanding of the role of MICA in transplantation, disease association, and population analyses in Koreans.

  11. [Discriminatory power of variable number on tandem repeats loci for genotyping Mycobacterium tuberculosis strains in China].

    PubMed

    Chen, H X; Cai, C; Liu, J Y; Zhang, Z G; Yuan, M; Jia, J N; Sun, Z G; Huang, H R; Gao, J M; Li, W M

    2017-06-10

    Objective: Using the standard genotype method, variable number of tandem repeats (VNTR), we constructed a VNTR database to cover all provinces and proposed a set of optimized VNTR loci combinations for each province, in order to improve the preventive and control programs on tuberculosis, in China. Methods: A total of 15 loci VNTR was used to analyze 4 116 Mycobacterium tuberculosis strains, isolated from national survey of Drug Resistant Tuberculosis, in 2007. Hunter-Gaston Index (HGI) was also used to analyze the discriminatory power of each VNTR site. A set combination of 12-VNTR, 10-VNTR, 8-VNTR and 5-VNTR was respectively constructed for each province, based on 1) epidemic characteristics of M. tuberculosis lineages in China, with high discriminatory power and genetic stability. Results: Through the completed 15 loci VNTR patterns of 3 966 strains under 96.36 % (3 966/4 116) coverage, we found seven high HGI loci (including QUB11b and MIRU26) as well as low stable loci (including QUB26, MIRU16, Mtub21 and QUB11b) in several areas. In all the 31 provinces, we found an optimization VNTR combination as 10-VNTR loci in Inner Mongolia, Chongqing and Heilongjiang, but with 8-VNTR combination shared in other provinces. Conclusions: It is necessary to not only use the VNTR database for tracing the source of infection and cluster of M. tuberculosis in the nation but also using the set of optimized VNTR combinations in monitoring those local epidemics and M. tuberculosis (genetics in local) population.

  12. Population genetics and new insight into range of CAG repeats of spinocerebellar ataxia type 3 in the Han Chinese population.

    PubMed

    Gan, Shi-Rui; Ni, Wang; Dong, Yi; Wang, Ning; Wu, Zhi-Ying

    2015-01-01

    Spinocerebellar ataxia type 3 (SCA3), also called Machado-Joseph disease (MJD), is one of the most common SCAs worldwide and caused by a CAG repeat expansion located in ATXN3 gene. Based on the CAG repeat numbers, alleles of ATXN3 can be divided into normal alleles (ANs), intermediate alleles (AIs) and expanded alleles (AEs). It was controversial whether the frequency of large normal alleles (large ANs) is related to the prevalence of SCA3 or not. And there were huge chaos in the comprehension of the specific numbers of the range of CAG repeats which is fundamental for genetic analysis of SCA3. To illustrate these issues, we made a novel CAG repeat ladder to detect CAG repeats of ATXN3 in 1003 unrelated Chinese normal individuals and studied haplotypes defined by three single nucleotide polymorphisms (SNPs) closed to ATXN3. We found that the number of CAG repeats ranged from 13 to 49, among them, 14 was the most common number. Positive skew, the highest frequency of large ANs and 4 AIs which had never been reported before were found. Also, AEs and large ANs shared the same haplotypes defined by the SNPs. Based on these data and other related studies, we presumed that de novo mutations of ATXN3 emerging from large ANs are at least one survival mechanisms of mutational ATXN3 and we can redefine the range of CAG repeats as: ANs≤44, 45 ≤AIs ≤49 and AEs≥50.

  13. TNF-alpha SNP haplotype frequencies in equidae.

    PubMed

    Brown, J J; Ollier, W E R; Thomson, W; Matthews, J B; Carter, S D; Binns, M; Pinchbeck, G; Clegg, P D

    2006-05-01

    Tumour necrosis factor alpha (TNF-alpha) is a pro-inflammatory cytokine that plays a crucial role in the regulation of inflammatory and immune responses. In all vertebrate species the genes encoding TNF-alpha are located within the major histocompatability complex. In the horse TNF-alpha has been ascribed a role in a variety of important disease processes. Previously two single nucleotide polymorphisms (SNPs) have been reported within the 5' un-translated region of the equine TNF-alpha gene. We have examined the equine TNF-alpha promoter region further for additional SNPs by analysing DNA from 131 horses (Equus caballus), 19 donkeys (E. asinus), 2 Grant's zebras (E. burchellii boehmi) and one onager (E. hemionus). Two further SNPs were identified at nucleotide positions 24 (T/G) and 452 (T/C) relative to the first nucleotide of the 522 bp polymerase chain reaction product. A sequence variant at position 51 was observed between equidae. SNaPSHOT genotyping assays for these and the two previously reported SNPs were performed on 457 horses comprising seven different breeds and 23 donkeys to determine the gene frequencies. SNP frequencies varied considerably between different horse breeds and also between the equine species. In total, nine different TNF-alpha promoter SNP haplotypes and their frequencies were established amongst the various equidae examined, with some haplotypes being found only in horses and others only in donkeys or zebras. The haplotype frequencies observed varied greatly between different horse breeds. Such haplotypes may relate to levels of TNF-alpha production and disease susceptibility and further investigation is required to identify associations between particular haplotypes and altered risk of disease.

  14. Beta-globin gene cluster haplotypes of Amerindian populations from the Brazilian Amazon region.

    PubMed

    Guerreiro, J F; Figueiredo, M S; Zago, M A

    1994-01-01

    We have determined the beta-globin cluster haplotypes for 80 Indians from four Brazilian Amazon tribes: Kayapó, Wayampí, Wayana-Apalaí, and Arára. The results are analyzed together with 20 Yanomámi previously studied. From 2 to 4 different haplotypes were identified for each tribe, and 7 of the possible 32 haplotypes were found in a sample of 172 chromosomes for which the beta haplotypes were directly determined or derived from family studies. The haplotype distribution does not differ significantly among the five populations. The two most common haplotypes in all tribes were haplotypes 2 and 6, with average frequencies of 0.843 and 0.122, respectively. The genetic affinities between Brazilian Indians and other human populations were evaluated by estimates of genetic distance based on haplotype data. The lowest values were observed in relation to Asians, especially Chinese, Polynesians, and Micronesians.

  15. A spatial haplotype copying model with applications to genotype imputation.

    PubMed

    Yang, Wen-Yun; Hormozdiari, Farhad; Eskin, Eleazar; Pasaniuc, Bogdan

    2015-05-01

    Ever since its introduction, the haplotype copy model has proven to be one of the most successful approaches for modeling genetic variation in human populations, with applications ranging from ancestry inference to genotype phasing and imputation. Motivated by coalescent theory, this approach assumes that any chromosome (haplotype) can be modeled as a mosaic of segments copied from a set of chromosomes sampled from the same population. At the core of the model is the assumption that any chromosome from the sample is equally likely to contribute a priori to the copying process. Motivated by recent works that model genetic variation in a geographic continuum, we propose a new spatial-aware haplotype copy model that jointly models geography and the haplotype copying process. We extend hidden Markov models of haplotype diversity such that at any given location, haplotypes that are closest in the genetic-geographic continuum map are a priori more likely to contribute to the copying process than distant ones. Through simulations starting from the 1000 Genomes data, we show that our model achieves superior accuracy in genotype imputation over the standard spatial-unaware haplotype copy model. In addition, we show the utility of our model in selecting a small personalized reference panel for imputation that leads to both improved accuracy as well as to a lower computational runtime than the standard approach. Finally, we show our proposed model can be used to localize individuals on the genetic-geographical map on the basis of their genotype data.

  16. APC Yin-Yang haplotype associated with colorectal cancer risk

    PubMed Central

    GARRE, P.; DE LA HOYA, M.; INIESTA, P.; ROMERA, A.; LLOVET, P.; GONZALEZ, S.; PEREZ-SEGURA, P.; CAPELLA, G.; DIAZ-RUBIO, E.; CALDES, T.

    2010-01-01

    The Yin-Yang haplotype is defined as two mismatched haplotypes (Yin and Yang) representing the majority of the existing haplotypes in a particular genomic region. The human adenomatous polyposis coli (APC) gene shows a Yin-Yang haplotype pattern accounting for 84% of all of the haplotypes existing in the Spanish population. Several association studies have been published regarding APC gene variants (SNPs and haplotypes) and colorectal cancer (CRC) risk. However, no studies concerning diplotype structure and CRC risk have been conducted. The aim of the present study was to investigate whether the APC Yin-Yang homozygote diplotype is over-represented in patients with sporadic CRC when compared to its distribution in controls, and its association with CRC risk. TaqMan® assays were used to genotype three tagSNPs selected across the APC Yin-Yang region. Frequencies of the APC Yin-Yang tagSNP alleles, haplotype and diplotype of 378 CRC cases and 642 controls were compared. Two Spanish CRC group samples were included [Hospital Clínico San Carlos in Madrid (HCSC) and Instituto Catalán de Oncología in Barcelona (ICO)]. Analysis of 157 consecutive CRC patients and 405 control subjects from HCSC showed a significative effect for the risk of CRC (OR=1.93; 95% CI 1.32–2.81; P=0.001). However, this effect was not confirmed in 221 CRC patients and 237 control subjects from ICO (OR=0.89; 95% CI 0.61–1.28; P=0.521). We found a significant association between the APC homozygote Yin-Yang diplotype and the risk of colorectal cancer in the HCSC samples. However, we did not observe this association in the ICO samples. These observations suggest that a study with a larger Spanish cohort is necessary to confirm the effects of the APC Yin-Yang diplotype on the risk of CRC. PMID:22993613

  17. APC Yin-Yang haplotype associated with colorectal cancer risk.

    PubMed

    Garre, P; DE LA Hoya, M; Iniesta, P; Romera, A; Llovet, P; Gonzalez, S; Perez-Segura, P; Capella, G; Diaz-Rubio, E; Caldes, T

    2010-09-01

    The Yin-Yang haplotype is defined as two mismatched haplotypes (Yin and Yang) representing the majority of the existing haplotypes in a particular genomic region. The human adenomatous polyposis coli (APC) gene shows a Yin-Yang haplotype pattern accounting for 84% of all of the haplotypes existing in the Spanish population. Several association studies have been published regarding APC gene variants (SNPs and haplotypes) and colorectal cancer (CRC) risk. However, no studies concerning diplotype structure and CRC risk have been conducted. The aim of the present study was to investigate whether the APC Yin-Yang homozygote diplotype is over-represented in patients with sporadic CRC when compared to its distribution in controls, and its association with CRC risk. TaqMan(®) assays were used to genotype three tagSNPs selected across the APC Yin-Yang region. Frequencies of the APC Yin-Yang tagSNP alleles, haplotype and diplotype of 378 CRC cases and 642 controls were compared. Two Spanish CRC group samples were included [Hospital Clínico San Carlos in Madrid (HCSC) and Instituto Catalán de Oncología in Barcelona (ICO)]. Analysis of 157 consecutive CRC patients and 405 control subjects from HCSC showed a significative effect for the risk of CRC (OR=1.93; 95% CI 1.32-2.81; P=0.001). However, this effect was not confirmed in 221 CRC patients and 237 control subjects from ICO (OR=0.89; 95% CI 0.61-1.28; P=0.521). We found a significant association between the APC homozygote Yin-Yang diplotype and the risk of colorectal cancer in the HCSC samples. However, we did not observe this association in the ICO samples. These observations suggest that a study with a larger Spanish cohort is necessary to confirm the effects of the APC Yin-Yang diplotype on the risk of CRC.

  18. Mathematical properties and bounds on haplotyping populations by pure parsimony.

    PubMed

    Wang, I-Lin; Chang, Chia-Yuan

    2011-06-01

    Although the haplotype data can be used to analyze the function of DNA, due to the significant efforts required in collecting the haplotype data, usually the genotype data is collected and then the population haplotype inference (PHI) problem is solved to infer haplotype data from genotype data for a population. This paper investigates the PHI problem based on the pure parsimony criterion (HIPP), which seeks the minimum number of distinct haplotypes to infer a given genotype data. We analyze the mathematical structure and properties for the HIPP problem, propose techniques to reduce the given genotype data into an equivalent one of much smaller size, and analyze the relations of genotype data using a compatible graph. Based on the mathematical properties in the compatible graph, we propose a maximal clique heuristic to obtain an upper bound, and a new polynomial-sized integer linear programming formulation to obtain a lower bound for the HIPP problem. Copyright © 2011 Elsevier Inc. All rights reserved.

  19. Variant Alleles, Triallelic Patterns, and Point Mutations Observed in Nuclear Short Tandem Repeat Typing of Populations in Bosnia and Serbia

    PubMed Central

    Huel, René L. M.; Bašić, Lara; Madacki-Todorović, Kamelija; Smajlović, Lejla; Eminović, Izet; Berbić, Irfan; Miloš, Ana; Parsons, Thomas J.

    2007-01-01

    Aim To present a compendium of off-ladder alleles and other genotyping irregularities relating to rare/unexpected population genetic variation, observed in a large short tandem repeat (STR) database from Bosnia and Serbia. Methods DNA was extracted from blood stain cards relating to reference samples from a population of 32 800 individuals from Bosnia and Serbia, and typed using Promega’s PowerPlex®16 STR kit. Results There were 31 distinct off-ladder alleles were observed in 10 of the 15 STR loci amplified from the PowerPlex®16 STR kit. Of these 31 alleles, 3 have not been previously reported. Furthermore, 16 instances of triallelic patterns were observed in 9 of the 15 loci. Primer binding site mismatches that affected amplification were observed in two loci, D5S818 and D8S1179. Conclusion Instances of deviations from manufacturer’s allelic ladders should be expected and caution taken to properly designate the correct alleles in large DNA databases. Particular care should be taken in kinship matching or paternity cases as incorrect designation of any of these deviations from allelic ladders could lead to false exclusions. PMID:17696304

  20. Multiple-Locus Variable-Number Tandem-Repeats Analysis of Escherichia coli O157 using PCR multiplexing and multi-colored capillary electrophoresis.

    PubMed

    Lindstedt, Bjørn-Arne; Vardund, Traute; Kapperud, Georg

    2004-08-01

    The Multiple-Locus Variable-Number Tandem-Repeats Analysis (MLVA) method is currently being used as the primary typing tool for Shiga-toxin-producing Escherichia coli (STEC) O157 isolates in our laboratory. The initial assay was performed using a single fluorescent dye and the different patterns were assigned using a gel image. Here, we present a significantly improved assay using multiple dye colors and enhanced PCR multiplexing to increase speed, and ease the interpretation of the results. The different MLVA patterns are now based on allele sizes entered as character values, thus removing the uncertainties introduced when analyzing band patterns from the gel image. We additionally propose an easy numbering scheme for the identification of separate isolates that will facilitate exchange of typing data. Seventy-two human and animal strains of Shiga-toxin-producing E. coli O157 were used for the development of the improved MLVA assay. The method is based on capillary separation of multiplexed PCR products of VNTR loci in the E. coli O157 genome labeled with multiple fluorescent dyes. The different alleles at each locus were then assigned to allele numbers, which were used for strain comparison.

  1. Application of Short Tandem Repeat markers in diagnosis of chromosomal aneuploidies and forensic DNA investigation in Pakistan.

    PubMed

    Chishti, Hafsah Muhammad; Ansar, Muhammad; Ajmal, Muhammad; Hameed, Abdul

    2014-09-15

    Short Tandem Repeat (STR) genetic markers hold great potential in forensic investigations, molecular diagnostics and molecular genetics research. AmpFlSTR® Identifiler™ PCR amplification kit is a multiplex system for co-amplification of 15 STR markers used worldwide in forensic investigations. This study attempts to assess forensic validity of these STRs in Pakistani population and to investigate its applicability in quick and simultaneous diagnosis and tracing parental source of common chromosomal aneuploidies. Samples from 554 healthy Pakistani individuals from 5 different ethnicities were analyzed for forensic parameters using Identifiler STRs and 74 patients' samples with different aneuploidies were evaluated for diagnostic strengths of these markers. All STRs hold sufficient forensic applicability in Pakistani population with paternity index between 1.5 and 3.5, polymorphic information content from 0.63 to 0.87 and discrimination power ≥0.9 (except TPOX locus). Variation from Hardy-Weinberg equilibrium was observed at some loci reflecting selective breeding and intermarriages trend in Pakistan. Among aneuploidic samples, all trisomies were precisely detectable while aneuploidies involving sex chromosomes or missing chromosomes were not clearly detectable using Identifiler STRs. Parental origin of aneuploidy was traceable in 92.54% patients. The studied STR markers are valuable tools for forensic application in Pakistan and utilizable for quick and simultaneous identification of some common trisomic conditions. Adding more sex chromosome specific STR markers can immensely increase the diagnostic and forensic potential of this system. Copyright © 2014 Elsevier B.V. All rights reserved.

  2. Genetic mapping of 15 human X chromosomal forensic short tandem repeat (STR) loci by means of multi-core parallelization.

    PubMed

    Diegoli, Toni Marie; Rohde, Heinrich; Borowski, Stefan; Krawczak, Michael; Coble, Michael D; Nothnagel, Michael

    2016-11-01

    Typing of X chromosomal short tandem repeat (X STR) markers has become a standard element of human forensic genetic analysis. Joint consideration of many X STR markers at a time increases their discriminatory power but, owing to physical linkage, requires inter-marker recombination rates to be accurately known. We estimated the recombination rates between 15 well established X STR markers using genotype data from 158 families (1041 individuals) and following a previously proposed likelihood-based approach that allows for single-step mutations. To meet the computational requirements of this family-based type of analysis, we modified a previous implementation so as to allow multi-core parallelization on a high-performance computing system. While we obtained recombination rate estimates larger than zero for all but one pair of adjacent markers within the four previously proposed linkage groups, none of the three X STR pairs defining the junctions of these groups yielded a recombination rate estimate of 0.50. Corroborating previous studies, our results therefore argue against a simple model of independent X chromosomal linkage groups. Moreover, the refined recombination fraction estimates obtained in our study will facilitate the appropriate joint consideration of all 15 investigated markers in forensic analysis. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  3. Investigation of Salmonella Enteritidis outbreaks in South Africa using multi-locus variable-number tandem-repeats analysis, 2013-2015.

    PubMed

    Muvhali, Munyadziwa; Smith, Anthony Marius; Rakgantso, Andronica Moipone; Keddy, Karen Helena

    2017-10-02

    Salmonella enterica serovar Enteritidis (Salmonella Enteritidis) has become a significant pathogen in South Africa, and the need for improved molecular surveillance of this pathogen has become important. Over the years, multi-locus variable-number tandem-repeats analysis (MLVA) has become a valuable molecular subtyping technique for Salmonella, particularly for highly homogenic serotypes such as Salmonella Enteritidis. This study describes the use of MLVA in the molecular epidemiological investigation of outbreak isolates in South Africa. Between the years 2013 and 2015, the Centre for Enteric Diseases (CED) received 39 Salmonella Enteritidis isolates from seven foodborne illness outbreaks, which occurred in six provinces. MLVA was performed on all isolates. Three MLVA profiles (MLVA profiles 21, 22 and 28) were identified among the 39 isolates. MLVA profile 28 accounted for 77% (30/39) of the isolates. Isolates from a single outbreak were grouped into a single MLVA profile. A minimum spanning tree (MST) created from the MLVA data showed a close relationship between MLVA profiles 21, 22 and 28, with a single VNTR locus difference between them. MLVA has proven to be a reliable method for the molecular epidemiological investigation of Salmonella Enteritidis outbreaks in South Africa. These foodborne outbreaks emphasize the importance of the One Health approach as an essential component for combating the spread of zoonotic pathogens such as Salmonella Enteritidis.

  4. Multiple-locus variable number of tandem repeat analysis (MLVA) of Irish verocytotoxigenic Escherichia coli O157 from feedlot cattle: uncovering strain dissemination routes.

    PubMed

    Murphy, Mary; Minihan, Donal; Buckley, James F; O'Mahony, Micheál; Whyte, Paul; Fanning, Séamus

    2008-01-24

    The identification of the routes of dissemination of Escherichia coli (E. coli) O157 through a cohort of cattle is a critical step to control this pathogen at farm level. The aim of this study was to identify potential routes of dissemination of E. coli O157 using Multiple-Locus Variable number of tandem repeat Analysis (MLVA). Thirty-eight environmental and sixteen cattle faecal isolates, which were detected in four adjacent pens over a four-month period were sub-typed. MLVA could separate these isolates into broadly defined clusters consisting of twelve MLVA types. Strain diversity was observed within pens, individual cattle and the environment. Application of MLVA is a broadly useful and convenient tool when applied to uncover the dissemination of E. coli O157 in the environment and in supporting improved on-farm management of this important pathogen. These data identified diverse strain types based on amplification of VNTR markers in each case.

  5. Application of a multilocus variable number of tandem repeats analysis to regional outbreak surveillance of Enterohemorrhagic Escherichia coli O157:H7 infections.

    PubMed

    Konno, Takayuki; Yatsuyanagi, Jun; Saito, Shioko

    2011-01-01

    A total of 18 strains of EHEC O157:H7 were isolated from distinct cases in Akita Prefecture, Japan from July to September 2007. The genetic relatedness of these isolates was investigated by performing a multilocus variable number of tandem repeats analysis (MLVA) and a pulsed-field gel electrophoresis (PFGE) analysis using XbaI. The PFGE analyses allowed us to group these 18 isolates into three major clusters. The MLVA results correlated closely with those obtained by PFGE, although some variants were found within the clusters obtained by PFGE, thus highlighting the utility of this technique for determining a precise classification when it is difficult to differentiate between isolates with indistinguishable or very similar PFGE patterns. In addition, MLVA is a much easier and more rapid method than PFGE for analysis of the genetic relatedness of strains. Thus, as a second molecular epidemiological subtyping method, MLVA is useful for the regional outbreak surveillance of EHEC O157:H7 infections.

  6. Mechanical unfolding of an ankyrin repeat protein.

    PubMed

    Serquera, David; Lee, Whasil; Settanni, Giovanni; Marszalek, Piotr E; Paci, Emanuele; Itzhaki, Laura S

    2010-04-07

    Ankryin repeat proteins comprise tandem arrays of a 33-residue, predominantly alpha-helical motif that stacks roughly linearly to produce elongated and superhelical structures. They function as scaffolds mediating a diverse range of protein-protein interactions, and some have been proposed to play a role in mechanical signal transduction processes in the cell. Here we use atomic force microscopy and molecular-dynamics simulations to investigate the natural 7-ankyrin repeat protein gankyrin. We find that gankyrin unfolds under force via multiple distinct pathways. The reactions do not proceed in a cooperative manner, nor do they always involve fully stepwise unfolding of one repeat at a time. The peeling away of half an ankyrin repeat, or one or more ankyrin repeats, occurs at low forces; however, intermediate species are formed that are resistant to high forces, and the simulations indicate that in some instances they are stabilized by nonnative interactions. The unfolding of individual ankyrin repeats generates a refolding force, a feature that may be more easily detected in these proteins than in globular proteins because the refolding of a repeat involves a short contraction distance and incurs a low entropic cost. We discuss the origins of the differences between the force- and chemical-induced unfolding pathways of ankyrin repeat proteins, as well as the differences between the mechanics of natural occurring ankyrin repeat proteins and those of designed consensus ankyin repeat and globular proteins. Copyright (c) 2010 Biophysical Society. Published by Elsevier Inc. All rights reserved.

  7. Origin and Diversification Dynamics of Self-Incompatibility Haplotypes

    PubMed Central

    Gervais, Camille E.; Castric, Vincent; Ressayre, Adrienne; Billiard, Sylvain

    2011-01-01

    Self-incompatibility (SI) is a genetic system found in some hermaphrodite plants. Recognition of pollen by pistils expressing cognate specificities at two linked genes leads to rejection of self pollen and pollen from close relatives, i.e., to avoidance of self-fertilization and inbred matings, and thus increased outcrossing. These genes generally have many alleles, yet the conditions allowing the evolution of new alleles remain mysterious. Evolutionary changes are clearly necessary in both genes, since any mutation affecting only one of them would result in a nonfunctional self-compatible haplotype. Here, we study diversification at the S-locus (i.e., a stable increase in the total number of SI haplotypes in the population, through the incorporation of new SI haplotypes), both deterministically (by investigating analytically the fate of mutations in an infinite population) and by simulations of finite populations. We show that the conditions allowing diversification are far less stringent in finite populations with recurrent mutations of the pollen and pistil genes, suggesting that diversification is possible in a panmictic population. We find that new SI haplotypes emerge fastest in populations with few SI haplotypes, and we discuss some implications for empirical data on S-alleles. However, allele numbers in our simulations never reach values as high as observed in plants whose SI systems have been studied, and we suggest extensions of our models that may reconcile the theory and data. PMID:21515570

  8. Ancestral inference from haplotypes and mutations.

    PubMed

    Griffiths, Robert C; Tavaré, Simon

    2018-04-25

    We consider inference about the history of a sample of DNA sequences, conditional upon the haplotype counts and the number of segregating sites observed at the present time. After deriving some theoretical results in the coalescent setting, we implement rejection sampling and importance sampling schemes to perform the inference. The importance sampling scheme addresses an extension of the Ewens Sampling Formula for a configuration of haplotypes and the number of segregating sites in the sample. The implementations include both constant and variable population size models. The methods are illustrated by two human Y chromosome datasets. Copyright © 2018. Published by Elsevier Inc.

  9. PWHATSHAP: efficient haplotyping for future generation sequencing.

    PubMed

    Bracciali, Andrea; Aldinucci, Marco; Patterson, Murray; Marschall, Tobias; Pisanti, Nadia; Merelli, Ivan; Torquati, Massimo

    2016-09-22

    Haplotype phasing is an important problem in the analysis of genomics information. Given a set of DNA fragments of an individual, it consists of determining which one of the possible alleles (alternative forms of a gene) each fragment comes from. Haplotype information is relevant to gene regulation, epigenetics, genome-wide association studies, evolutionary and population studies, and the study of mutations. Haplotyping is currently addressed as an optimisation problem aiming at solutions that minimise, for instance, error correction costs, where costs are a measure of the confidence in the accuracy of the information acquired from DNA sequencing. Solutions have typically an exponential computational complexity. WHATSHAP is a recent optimal approach which moves computational complexity from DNA fragment length to fragment overlap, i.e., coverage, and is hence of particular interest when considering sequencing technology's current trends that are producing longer fragments. Given the potential relevance of efficient haplotyping in several analysis pipelines, we have designed and engineered PWHATSHAP, a parallel, high-performance version of WHATSHAP. PWHATSHAP is embedded in a toolkit developed in Python and supports genomics datasets in standard file formats. Building on WHATSHAP, PWHATSHAP exhibits the same complexity exploring a number of possible solutions which is exponential in the coverage of the dataset. The parallel implementation on multi-core architectures allows for a relevant reduction of the execution time for haplotyping, while the provided results enjoy the same high accuracy as that provided by WHATSHAP, which increases with coverage. Due to its structure and management of the large datasets, the parallelisation of WHATSHAP posed demanding technical challenges, which have been addressed exploiting a high-level parallel programming framework. The result, PWHATSHAP, is a freely available toolkit that improves the efficiency of the analysis of genomics

  10. The repeat organizer, a specialized insulator element within the intergenic spacer of the Xenopus rRNA genes.

    PubMed Central

    Robinett, C C; O'Connor, A; Dunaway, M

    1997-01-01

    We have identified a novel activity for the region of the intergenic spacer of the Xenopus laevis rRNA genes that contains the 35- and 100-bp repeats. We devised a new assay for this region by constructing DNA plasmids containing a tandem repeat of rRNA reporter genes that were separated by the 35- and 100-bp repeat region and a rRNA gene enhancer. When the 35- and 100-bp repeat region is present in its normal position and orientation at the 3' end of the rRNA reporter genes, the enhancer activates the adjacent downstream promoter but not the upstream rRNA promoter on the same plasmid. Because this element can restrict the range of an enhancer's activity in the context of tandem genes, we have named it the repeat organizer (RO). The ability to restrict enhancer action is a feature of insulator elements, but unlike previously described insulator elements the RO does not block enhancer action in a simple enhancer-blocking assay. Instead, the activity of the RO requires that it be in its normal position and orientation with respect to the other sequence elements of the rRNA genes. The enhancer-binding transcription factor xUBF also binds to the repetitive sequences of the RO in vitro, but these sequences do not activate transcription in vivo. We propose that the RO is a specialized insulator element that organizes the tandem array of rRNA genes into single-gene expression units by promoting activation of a promoter by its proximal enhancers. PMID:9111359

  11. Maximal oxygen uptake is associated with allele -202 A of insulin-like growth factor binding protein-3 (IGFBP3) promoter polymorphism and (CA)n tandem repeats of insulin-like growth factor IGF1 in Caucasians from Poland.

    PubMed

    Gronek, Piotr; Holdys, Joanna; Kryściak, Jakub; Wieliński, Dariusz; Słomski, Ryszard

    2014-01-01

    Physical fitness is a trait determined by multiple genes, and its genetic basis is modified by numerous environmental factors. The present study examines the effects of the (CA)n tandem repeats polymorphism in IGFI gene and SNP Alw21I restriction site -202 A>C polymorphism in IGF1BP3 on VO2max--a physiological index of aerobic capacity of high heritability. The study sample consisted of 239 (154 male and 85 female) students of the University School of Physical Education in Poznań and athletes practicing various sports, including members of the Polish national team. An association was found between -202 A/C polymorphism of IGFBP3 gene with VO2max in men. Higher VO2max values were attained by men with CC genotype, especially male athletes practicing endurance sports and sports featuring energy metabolism of aerobic/anaerobic character. A statistically significant influence of allele 188 and genotype 188/188 of tandem repeats (CA)n polymorphism of IGF1 gene on VO2max was found in women. Also, lower values of maximal oxygen uptake were noted in individuals with allele 186 or genotype 186/186, and higher VO2max values in athletes with allele 194.

  12. Optimization of sequence alignment for simple sequence repeat regions.

    PubMed

    Jighly, Abdulqader; Hamwieh, Aladdin; Ogbonnaya, Francis C

    2011-07-20

    Microsatellites, or simple sequence repeats (SSRs), are tandemly repeated DNA sequences, including tandem copies of specific sequences no longer than six bases, that are distributed in the genome. SSR has been used as a molecular marker because it is easy to detect and is used in a range of applications, including genetic diversity, genome mapping, and marker assisted selection. It is also very mutable because of slipping in the DNA polymerase during DNA replication. This unique mutation increases the insertion/deletion (INDELs) mutation frequency to a high ratio - more than other types of molecular markers such as single nucleotide polymorphism (SNPs).SNPs are more frequent than INDELs. Therefore, all designed algorithms for sequence alignment fit the vast majority of the genomic sequence without considering microsatellite regions, as unique sequences that require special consideration. The old algorithm is limited in its application because there are many overlaps between different repeat units which result in false evolutionary relationships. To overcome the limitation of the aligning algorithm when dealing with SSR loci, a new algorithm was developed using PERL script with a Tk graphical interface. This program is based on aligning sequences after determining the repeated units first, and the last SSR nucleotides positions. This results in a shifting process according to the inserted repeated unit type.When studying the phylogenic relations before and after applying the new algorithm, many differences in the trees were obtained by increasing the SSR length and complexity. However, less distance between different linage had been observed after applying the new algorithm. The new algorithm produces better estimates for aligning SSR loci because it reflects more reliable evolutionary relations between different linages. It reduces overlapping during SSR alignment, which results in a more realistic phylogenic relationship.

  13. MR-Tandem: parallel X!Tandem using Hadoop MapReduce on Amazon Web Services.

    PubMed

    Pratt, Brian; Howbert, J Jeffry; Tasman, Natalie I; Nilsson, Erik J

    2012-01-01

    MR-Tandem adapts the popular X!Tandem peptide search engine to work with Hadoop MapReduce for reliable parallel execution of large searches. MR-Tandem runs on any Hadoop cluster but offers special support for Amazon Web Services for creating inexpensive on-demand Hadoop clusters, enabling search volumes that might not otherwise be feasible with the compute resources a researcher has at hand. MR-Tandem is designed to drop in wherever X!Tandem is already in use and requires no modification to existing X!Tandem parameter files, and only minimal modification to X!Tandem-based workflows. MR-Tandem is implemented as a lightly modified X!Tandem C++ executable and a Python script that drives Hadoop clusters including Amazon Web Services (AWS) Elastic Map Reduce (EMR), using the modified X!Tandem program as a Hadoop Streaming mapper and reducer. The modified X!Tandem C++ source code is Artistic licensed, supports pluggable scoring, and is available as part of the Sashimi project at http://sashimi.svn.sourceforge.net/viewvc/sashimi/trunk/trans_proteomic_pipeline/extern/xtandem/. The MR-Tandem Python script is Apache licensed and available as part of the Insilicos Cloud Army project at http://ica.svn.sourceforge.net/viewvc/ica/trunk/mr-tandem/. Full documentation and a windows installer that configures MR-Tandem, Python and all necessary packages are available at this same URL. brian.pratt@insilicos.com

  14. C9orf72 repeat expansions in rapid eye movement sleep behaviour disorder.

    PubMed

    Daoud, Hussein; Postuma, Ronald B; Bourassa, Cynthia V; Rochefort, Daniel; Gauthier, Maude Turcotte; Montplaisir, Jacques; Gagnon, Jean-Francois; Arnulf, Isabelle; Dauvilliers, Yves; Charley, Christelle Monaca; Inoue, Yuichi; Sasai, Taeko; Högl, Birgit; Desautels, Alex; Frauscher, Birgit; Cochen De Cock, Valérie; Rouleau, Guy A; Dion, Patrick A

    2014-11-01

    A large hexanucleotide repeat expansion in C9orf72 has been identified as the most common genetic cause in familial amyotrophic lateral sclerosis and frontotemporal dementia. Rapid Eye Movement Sleep Behavior Disorder (RBD) is a sleep disorder that has been strongly linked to synuclein-mediated neurodegeneration. The aim of this study was to evaluate the role of the C9orf72 expansions in the pathogenesis of RBD. We amplified the C9orf72 repeat expansion in 344 patients with RBD by a repeat-primed polymerase chain reaction assay. We identified two RBD patients carrying the C9orf72 repeat expansion. Most interestingly, these patients have the same C9orf72 associated-risk haplotype identified in 9p21-linked amyotrophic lateral sclerosis and frontotemporal dementia families. Our study enlarges the phenotypic spectrum associated with the C9orf72 hexanucleotide repeat expansions and suggests that, although rare, this expansion may play a role in the pathogenesis of RBD.

  15. Recent Advances in Experimental Whole Genome Haplotyping Methods

    PubMed Central

    Huang, Mengting; Lu, Zuhong

    2017-01-01

    Haplotype plays a vital role in diverse fields; however, the sequencing technologies cannot resolve haplotype directly. Pioneers demonstrated several approaches to resolve haplotype in the early years, which was extensively reviewed. Since then, numerous methods have been developed recently that have significantly improved phasing performance. Here, we review experimental methods that have emerged mainly over the past five years, and categorize them into five classes according to their maximum scale of contiguity: (i) encapsulation, (ii) 3D structure capture and construction, (iii) compartmentalization, (iv) fluorography, (v) long-read sequencing. Several subsections of certain methods are attached to each class as instances. We also discuss the relative advantages and disadvantages of different classes and make comparisons among representative methods of each class. PMID:28891974

  16. MR-Tandem: parallel X!Tandem using Hadoop MapReduce on Amazon Web Services

    PubMed Central

    Pratt, Brian; Howbert, J. Jeffry; Tasman, Natalie I.; Nilsson, Erik J.

    2012-01-01

    Summary: MR-Tandem adapts the popular X!Tandem peptide search engine to work with Hadoop MapReduce for reliable parallel execution of large searches. MR-Tandem runs on any Hadoop cluster but offers special support for Amazon Web Services for creating inexpensive on-demand Hadoop clusters, enabling search volumes that might not otherwise be feasible with the compute resources a researcher has at hand. MR-Tandem is designed to drop in wherever X!Tandem is already in use and requires no modification to existing X!Tandem parameter files, and only minimal modification to X!Tandem-based workflows. Availability and implementation: MR-Tandem is implemented as a lightly modified X!Tandem C++ executable and a Python script that drives Hadoop clusters including Amazon Web Services (AWS) Elastic Map Reduce (EMR), using the modified X!Tandem program as a Hadoop Streaming mapper and reducer. The modified X!Tandem C++ source code is Artistic licensed, supports pluggable scoring, and is available as part of the Sashimi project at http://sashimi.svn.sourceforge.net/viewvc/sashimi/trunk/trans_proteomic_pipeline/extern/xtandem/. The MR-Tandem Python script is Apache licensed and available as part of the Insilicos Cloud Army project at http://ica.svn.sourceforge.net/viewvc/ica/trunk/mr-tandem/. Full documentation and a windows installer that configures MR-Tandem, Python and all necessary packages are available at this same URL. Contact: brian.pratt@insilicos.com PMID:22072385

  17. De novo assembly of a haplotype-resolved human genome.

    PubMed

    Cao, Hongzhi; Wu, Honglong; Luo, Ruibang; Huang, Shujia; Sun, Yuhui; Tong, Xin; Xie, Yinlong; Liu, Binghang; Yang, Hailong; Zheng, Hancheng; Li, Jian; Li, Bo; Wang, Yu; Yang, Fang; Sun, Peng; Liu, Siyang; Gao, Peng; Huang, Haodong; Sun, Jing; Chen, Dan; He, Guangzhu; Huang, Weihua; Huang, Zheng; Li, Yue; Tellier, Laurent C A M; Liu, Xiao; Feng, Qiang; Xu, Xun; Zhang, Xiuqing; Bolund, Lars; Krogh, Anders; Kristiansen, Karsten; Drmanac, Radoje; Drmanac, Snezana; Nielsen, Rasmus; Li, Songgang; Wang, Jian; Yang, Huanming; Li, Yingrui; Wong, Gane Ka-Shu; Wang, Jun

    2015-06-01

    The human genome is diploid, and knowledge of the variants on each chromosome is important for the interpretation of genomic information. Here we report the assembly of a haplotype-resolved diploid genome without using a reference genome. Our pipeline relies on fosmid pooling together with whole-genome shotgun strategies, based solely on next-generation sequencing and hierarchical assembly methods. We applied our sequencing method to the genome of an Asian individual and generated a 5.15-Gb assembled genome with a haplotype N50 of 484 kb. Our analysis identified previously undetected indels and 7.49 Mb of novel coding sequences that could not be aligned to the human reference genome, which include at least six predicted genes. This haplotype-resolved genome represents the most complete de novo human genome assembly to date. Application of our approach to identify individual haplotype differences should aid in translating genotypes to phenotypes for the development of personalized medicine.

  18. Honey bee-inspired algorithms for SNP haplotype reconstruction problem

    NASA Astrophysics Data System (ADS)

    PourkamaliAnaraki, Maryam; Sadeghi, Mehdi

    2016-03-01

    Reconstructing haplotypes from SNP fragments is an important problem in computational biology. There have been a lot of interests in this field because haplotypes have been shown to contain promising data for disease association research. It is proved that haplotype reconstruction in Minimum Error Correction model is an NP-hard problem. Therefore, several methods such as clustering techniques, evolutionary algorithms, neural networks and swarm intelligence approaches have been proposed in order to solve this problem in appropriate time. In this paper, we have focused on various evolutionary clustering techniques and try to find an efficient technique for solving haplotype reconstruction problem. It can be referred from our experiments that the clustering methods relying on the behaviour of honey bee colony in nature, specifically bees algorithm and artificial bee colony methods, are expected to result in more efficient solutions. An application program of the methods is available at the following link. http://www.bioinf.cs.ipm.ir/software/haprs/

  19. HERC1 polymorphisms: population-specific variations in haplotype composition.

    PubMed

    Yuasa, Isao; Umetsu, Kazuo; Nishimukai, Hiroaki; Fukumori, Yasuo; Harihara, Shinji; Saitou, Naruya; Jin, Feng; Chattopadhyay, Prasanta K; Henke, Lotte; Henke, Jürgen

    2009-08-01

    Human HERC1 is one of six HERC proteins and may play an important role in intracellular membrane trafficking. The human HERC1 gene is suggested to have been affected by local positive selection. To assess the global frequency distributions of coding and non-coding single nucleotide polymorphisms (SNPs) in the HERC1 gene, we developed a new simultaneous genotyping method for four SNPs, and applied this method to investigate 1213 individuals from 12 global populations. The results confirmed remarked differences in the allele and haplotype frequencies between East Asian and non-East Asian populations. One of the three common haplotypes observed was found to be characteristic of East Asians, who showed a relatively uniform distribution of haplotypes. Information on haplotypes would be useful for testing the function of polymorphisms in the HERC1 gene. This is the first study to investigate the distribution of HERC1 polymorphisms in various populations. (c) 2009 John Wiley & Sons, Ltd.

  20. Short communication: casein haplotype variability in sicilian dairy goat breeds.

    PubMed

    Gigli, I; Maizon, D O; Riggio, V; Sardina, M T; Portolano, B

    2008-09-01

    In the Mediterranean region, goat milk production is an important economic activity. In the present study, 4 casein genes were genotyped in 5 Sicilian goat breeds to 1) identify casein haplotypes present in the Argentata dell'Etna, Girgentana, Messinese, Derivata di Siria, and Maltese goat breeds; and 2) describe the structure of the Sicilian goat breeds based on casein haplotypes and allele frequencies. In a sample of 540 dairy goats, 67 different haplotypes with frequency >or=0.01 and 27 with frequency >or=0.03 were observed. The most common CSN1S1-CSN2-CSN1S2-CSN3 haplotype for Derivata di Siria and Maltese was FCFB (0.17 and 0.22, respectively), whereas for Argentata dell'Etna, Girgentana and Messinese was ACAB (0.06, 0.23, and 0.10, respectively). According to the haplotype reconstruction, Argentata dell'Etna, Girgentana, and Messinese breeds presented the most favorable haplotype for cheese production, because the casein concentration in milk of these breeds might be greater than that in Derivata di Siria and Maltese breeds. Based on a cluster analysis, the breeds formed 2 main groups: Derivata di Siria, and Maltese in one group, and Argentata dell'Etna and Messinese in the other; the Girgentana breed was between these groups but closer to the latter.

  1. Mineralocorticoid receptor haplotypes sex-dependently moderate depression susceptibility following childhood maltreatment.

    PubMed

    Vinkers, Christiaan H; Joëls, Marian; Milaneschi, Yuri; Gerritsen, Lotte; Kahn, René S; Penninx, Brenda W J H; Boks, Marco P M

    2015-04-01

    The MR is an important regulator of the hypothalamic-pituitary-adrenal (HPA) axis and a prime target for corticosteroids. There is increasing evidence from both clinical and preclinical studies that the MR has different effects on behavior and mood in males and females. To investigate the hypothesis that the MR sex-dependently influences the relation between childhood maltreatment and depression, we investigated three common and functional MR haplotypes (GA, CA, and CG haplotype, based on rs5522 and rs2070951) in a population-based cohort (N = 665) and an independent clinical cohort from the Netherlands Study of Depression and Anxiety (NESDA) (N = 1639). The CA haplotype sex-dependently moderated the relation between childhood maltreatment and depressive symptoms both in the population-based sample (sex × maltreatment × haplotype: β = -4.07, P = 0.029) and in the clinical sample (sex × maltreatment × haplotype, β = -2.40, P = 0.011). Specifically, female individuals in the population-based sample were protected (β = -4.58, P = 2.0 e(-5)), whereas males in the clinical sample were at increased risk (β = 2.54, P = 0.0022). In line with these results, female GA haplotype carriers displayed increased vulnerability in the population-based sample (β = 4.58, P = 7.5 e(-5)) whereas male CG-carriers showed increased resilience in the clinical sample (β = -2.71, P = 0.016). Consistently, we found a decreased lifetime MDD risk for male GA haplotype carriers following childhood maltreatment but an increased risk for male CA haplotype carriers in the clinical sample. In both samples, sex-dependent effects were observed for GA-GA diplotype carriers. In summary, sex plays an important role in determining whether functional genetic variation in MR is beneficial or detrimental, with an apparent female advantage for the CA haplotype but male advantage for the GA and CG haplotype. These sex-dependent effects of MR on depression susceptibility following childhood

  2. Short tandem repeat profiling: part of an overall strategy for reducing the frequency of cell misidentification.

    PubMed

    Nims, Raymond W; Sykes, Greg; Cottrill, Karin; Ikonomi, Pranvera; Elmore, Eugene

    2010-12-01

    The role of cell authentication in biomedical science has received considerable attention, especially within the past decade. This quality control attribute is now beginning to be given the emphasis it deserves by granting agencies and by scientific journals. Short tandem repeat (STR) profiling, one of a few DNA profiling technologies now available, is being proposed for routine identification (authentication) of human cell lines, stem cells, and tissues. The advantage of this technique over methods such as isoenzyme analysis, karyotyping, human leukocyte antigen typing, etc., is that STR profiling can establish identity to the individual level, provided that the appropriate number and types of loci are evaluated. To best employ this technology, a standardized protocol and a data-driven, quality-controlled, and publically searchable database will be necessary. This public STR database (currently under development) will enable investigators to rapidly authenticate human-based cultures to the individual from whom the cells were sourced. Use of similar approaches for non-human animal cells will require developing other suitable loci sets. While implementing STR analysis on a more routine basis should significantly reduce the frequency of cell misidentification, additional technologies may be needed as part of an overall authentication paradigm. For instance, isoenzyme analysis, PCR-based DNA amplification, and sequence-based barcoding methods enable rapid confirmation of a cell line's species of origin while screening against cross-contaminations, especially when the cells present are not recognized by the species-specific STR method. Karyotyping may also be needed as a supporting tool during establishment of an STR database. Finally, good cell culture practices must always remain a major component of any effort to reduce the frequency of cell misidentification.

  3. Association of STin2 Variable Number of Tandem Repeat (VNTR) Polymorphism of Serotonin Transporter Gene with Lifelong Premature Ejaculation: A Case-Control Study in Han Chinese Subjects

    PubMed Central

    Huang, Yuanyuan; Zhang, Xiansheng; Gao, Jingjing; Tang, Dongdong; Gao, Pan; Peng, Dangwei; Liang, Chaozhao

    2016-01-01

    Background The STin2 VNTR polymorphism has a variable number of tandem repeats in intron 2 of the serotonin transporter gene. We aimed to explore the relationship between STin2 VNTR polymorphism and lifelong premature ejaculation (LPE). Material/Methods We recruited a total of 115 outpatients who complained of ejaculating prematurely and who were diagnosed as LPE, and 101 controls without PE complaint. Allelic variations of STin2 VNTR were genotyped using PCR-based technology. We evaluated the associations between STin2 VNTR allelic and genotypic frequencies and LPE, as well as the intravaginal ejaculation latency time (IELT) of different STin2 VNTR genotypes among LPE patients. Results The patients and controls did not differ significantly in terms of any characteristic except age. A significantly higher frequency of STin2.12/12 genotype was found among LPE patients versus controls (P=0.026). Frequency of patients carrying at least 1 copy of the 10-repeat allele was significantly lower compared to the control group (28.3% vs. 41.8%, OR=0.55; 95%CI=0.31–0.97, P=0.040). In the LPE group, the mean IELT showed significant difference in STin2.12/12 genotype when compared to those with STin2.12/10 and STin2.10/10 genotypes. The mean IELT in10-repeat allele carriers was 50% longer compared to homozygous carriers of the STin2.12 allele. Conclusions Our results indicate the presence of STin2.10 allele is a protective factor for LPE. Men carrying the higher expression genotype STin2. 12/12 have shorter IELT than 10-repeat allele carriers. PMID:27713390

  4. New Multilocus Variable-Number Tandem-Repeat Analysis (MLVA) Scheme for Fine-Scale Monitoring and Microevolution-Related Study of Ralstonia pseudosolanacearum Phylotype I Populations

    PubMed Central

    Guinard, Jérémy; Latreille, Anne; Guérin, Fabien; Poussier, Stéphane

    2016-01-01

    ABSTRACT Bacterial wilt caused by the Ralstonia solanacearum species complex (RSSC) is considered one of the most harmful plant diseases in the world. Special attention should be paid to R. pseudosolanacearum phylotype I due to its large host range, its worldwide distribution, and its high evolutionary potential. So far, the molecular epidemiology and population genetics of this bacterium are poorly understood. Until now, the genetic structure of the RSSC has been analyzed on the worldwide and regional scales. Emerging questions regarding evolutionary forces in RSSC adaptation to hosts now require genetic markers that are able to monitor RSSC field populations. In this study, we aimed to evaluate the multilocus variable-number tandem-repeat analysis (MLVA) approach for its ability to discriminate genetically close phylotype I strains and for population genetics studies. We developed a new MLVA scheme (MLVA-7) allowing us to genotype 580 R. pseudosolanacearum phylotype I strains extracted from susceptible and resistant hosts and from different habitats (stem, soil, and rhizosphere). Based on specificity, polymorphism, and the amplification success rate, we selected seven fast-evolving variable-number tandem-repeat (VNTR) markers. The newly developed MLVA-7 scheme showed higher discriminatory power than the previously published MLVA-13 scheme when applied to collections sampled from the same location on different dates and to collections from different locations on very small scales. Our study provides a valuable tool for fine-scale monitoring and microevolution-related study of R. pseudosolanacearum phylotype I populations. IMPORTANCE Understanding the evolutionary dynamics of adaptation of plant pathogens to new hosts or ecological niches has become a key point for the development of innovative disease management strategies, including durable resistance. Whereas the molecular mechanisms underlying virulence or pathogenicity changes have been studied thoroughly, the

  5. Next Generation Sequencing Plus (NGS+) with Y-chromosomal Markers for Forensic Pedigree Searches.

    PubMed

    Qian, Xiaoqin; Hou, Jiayi; Wang, Zheng; Ye, Yi; Lang, Min; Gao, Tianzhen; Liu, Jing; Hou, Yiping

    2017-09-12

    There is high demand for forensic pedigree searches with Y-chromosome short tandem repeat (Y-STR) profiling in large-scale crime investigations. However, when two Y-STR haplotypes have a few mismatched loci, it is difficult to determine if they are from the same male lineage because of the high mutation rate of Y-STRs. Here we design a new strategy to handle cases in which none of pedigree samples shares identical Y-STR haplotype. We combine next generation sequencing (NGS), capillary electrophoresis and pyrosequencing under the term 'NGS+' for typing Y-STRs and Y-chromosomal single nucleotide polymorphisms (Y-SNPs). The high-resolution Y-SNP haplogroup and Y-STR haplotype can be obtained with NGS+. We further developed a new data-driven decision rule, FSindex, for estimating the likelihood for each retrieved pedigree. Our approach enables positive identification of pedigree from mismatched Y-STR haplotypes. It is envisaged that NGS+ will revolutionize forensic pedigree searches, especially when the person of interest was not recorded in forensic DNA database.

  6. A versatile palindromic amphipathic repeat coding sequence horizontally distributed among diverse bacterial and eucaryotic microbes

    PubMed Central

    2010-01-01

    Background Intragenic tandem repeats occur throughout all domains of life and impart functional and structural variability to diverse translation products. Repeat proteins confer distinctive surface phenotypes to many unicellular organisms, including those with minimal genomes such as the wall-less bacterial monoderms, Mollicutes. One such repeat pattern in this clade is distributed in a manner suggesting its exchange by horizontal gene transfer (HGT). Expanding genome sequence databases reveal the pattern in a widening range of bacteria, and recently among eucaryotic microbes. We examined the genomic flux and consequences of the motif by determining its distribution, predicted structural features and association with membrane-targeted proteins. Results Using a refined hidden Markov model, we document a 25-residue protein sequence motif tandemly arrayed in variable-number repeats in ORFs lacking assigned functions. It appears sporadically in unicellular microbes from disparate bacterial and eucaryotic clades, representing diverse lifestyles and ecological niches that include host parasitic, marine and extreme environments. Tracts of the repeats predict a malleable configuration of recurring domains, with conserved hydrophobic residues forming an amphipathic secondary structure in which hydrophilic residues endow extensive sequence variation. Many ORFs with these domains also have membrane-targeting sequences that predict assorted topologies; others may comprise reservoirs of sequence variants. We demonstrate expressed variants among surface lipoproteins that distinguish closely related animal pathogens belonging to a subgroup of the Mollicutes. DNA sequences encoding the tandem domains display dyad symmetry. Moreover, in some taxa the domains occur in ORFs selectively associated with mobile elements. These features, a punctate phylogenetic distribution, and different patterns of dispersal in genomes of related taxa, suggest that the repeat may be disseminated by

  7. Fitchi: haplotype genealogy graphs based on the Fitch algorithm.

    PubMed

    Matschiner, Michael

    2016-04-15

    : In population genetics and phylogeography, haplotype genealogy graphs are important tools for the visualization of population structure based on sequence data. In this type of graph, node sizes are often drawn in proportion to haplotype frequencies and edge lengths represent the minimum number of mutations separating adjacent nodes. I here present Fitchi, a new program that produces publication-ready haplotype genealogy graphs based on the Fitch algorithm. http://www.evoinformatics.eu/fitchi.htm : michaelmatschiner@mac.com Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  8. Polymorphism at Expressed DQ and DR Loci in Five Common Equine MHC Haplotypes

    PubMed Central

    Miller, Donald; Tallmadge, Rebecca L.; Binns, Matthew; Zhu, Baoli; Mohamoud, Yasmin Ali; Ahmed, Ayeda; Brooks, Samantha A.; Antczak, Douglas F.

    2016-01-01

    The polymorphism of Major Histocompatibility Complex (MHC) class II DQ and DR genes in five common Equine Leukocyte Antigen (ELA) haplotypes was determined through sequencing of mRNA transcripts isolated from lymphocytes of eight ELA homozygous horses. Ten expressed MHC class II genes were detected in horses of the ELA-A3 haplotype carried by the donor horses of the equine Bacterial Artificial Chromosome (BAC) library and the reference genome sequence: four DR genes and six DQ genes. The other four ELA haplotypes contained at least eight expressed polymorphic MHC class II loci. Next Generation Sequencing (NGS) of genomic DNA of these four MHC haplotypes revealed stop codons in the DQA3 gene in the ELA-A2, ELA-A5, and ELA-A9 haplotypes. Few NGS reads were obtained for the other MHC class II genes that were not amplified in these horses. The amino acid sequences across haplotypes contained locus-specific residues, and the locus clusters produced by phylogenetic analysis were well supported. The MHC class II alleles within the five tested haplotypes were largely non-overlapping between haplotypes. The complement of equine MHC class II DQ and DR genes appears to be well conserved between haplotypes, in contrast to the recently described variation in class I gene loci between equine MHC haplotypes. The identification of allelic series of equine MHC class II loci will aid comparative studies of mammalian MHC conservation and evolution and may also help to interpret associations between the equine MHC class II region and diseases of the horse. PMID:27889800

  9. Association of HLA haplotype with alopecia areata in Chinese Hans.

    PubMed

    Xiao, F-L; Ye, D-Q; Yang, S; Zhou, F-S; Zhou, S-M; Zhu, Y-G; Liang, Y-H; Ren, Y-Q; Zhang, X-J

    2006-11-01

    Some studies have shown discrepancies in human leucocyte antigen (HLA) associated with alopecia areata (AA) between different ethnic populations. To investigate whether HLA-I, -DQA1 and -DQB1 alleles and the HLA haplotype are associated with AA, and the correlation between the HLA haplotype profile, age of onset and severity of AA in Chinese Hans. The polymerase chain reaction-sequence specific primer (PCR-SSP) method was used to analyse the frequencies of HLA class I, -DQA1 and -DQB1 alleles in 192 patients with AA and 252 controls in Chinese Hans. The linkage disequilibrium was calculated using the 2 x 2 table. The 24 two-locus haplotypes [including A*02-B*18, A*02-B*27, A*02-B*52, A*02-Cw*0704, A*02-DQA1*0104, A*02-DQB1*0604, A*02-DQB1*0606, B*18-Cw*0704, B*18-DQA1*0104, B*18-DQA1*0302, B*18-DQB1*0606, B*27-Cw*0704, B*27-DQA1*0104, B*27-DQA1*0302, B*52-Cw*0704, B*52-DQA1*0104, B*52-DQA1*0302, B52-DQB1*0606, Cw*0704-DQA1*0104, Cw*0704-DQA1*0302, Cw*0704-DQB1*0606, DQA1*0104-DQB1*0604, DQA1*0104-DQB1*0606, DQA1*0302-DQB1*0606 (P<0.05)] were associated with AA, while eight extended haplotypes (A*02-B*18-DQA1*0104, A*02-B*27-DQA1*0104, A*02-B*52-DQA1*0104, A*02-B*52-DQA1*0302, A*02-B*52-DQB1*0606, B*52-Cw*0704-DQA1*0104, B*52-Cw*0704-DQA1*0302, A*02-B*52-DQA1*0302-DQB1*0606) were found to be related to AA in Chinese Hans. Through stratified analysis, we found that the extended haplotype B*52-Cw*0704-DQA1*0302 was related to early onset of AA, and no haplotype was only associated with severe AA. This is the first detailed report to elucidate HLA haplotypes associated with AA and that demonstrates the significant HLA haplotypes in Chinese Hans AA. The haplotype B*52-Cw*0704-DQA1*0302 was identified to be related to early onset of AA. Our results provide some information for future research on predisposing genes in HLA regions in Chinese Hans.

  10. Dimensional Anxiety Mediates Linkage of GABRA2 Haplotypes With Alcoholism

    PubMed Central

    Enoch, Mary-Anne; Schwartz, Lori; Albaugh, Bernard; Virkkunen, Matti; Goldman, David

    2015-01-01

    The GABAAα2 receptor gene (GABRA2) modulates anxiety and stress response. Three recent association studies implicate GABRA2 in alcoholism, however in these papers both common, opposite-configuration haplotypes in the region distal to intron3 predict risk. We have now replicated the GABRA2 association with alcoholism in 331 Plains Indian men and women and 461 Finnish Caucasian men. Using a dimensional measure of anxiety, harm avoidance (HA), we also found that the association with alcoholism is mediated, or moderated, by anxiety. Nine SNPs were genotyped revealing two haplotype blocks. Within the previously implicated block 2 region, we identified the two common, opposite-configuration risk haplotypes, A and B. Their frequencies differed markedly in Finns and Plains Indians. In both populations, most block 2 SNPs were significantly associated with alcoholism. The associations were due to increased frequencies of both homozygotes in alcoholics, indicating the possibility of alcoholic subtypes with opposite genotypes. Congruently, there was no significant haplotype association. Using HA as an indicator variable for anxiety, we found haplotype linkage to alcoholism with high and low dimensional anxiety, and to HA itself, in both populations. High HA alcoholics had the highest frequency of the more abundant haplotype (A in Finns, B in Plains Indians); low HA alcoholics had the highest frequency of the less abundant haplotype (B in Finns, A in Plains Indians) (Finns: P α0.007, OR α2.1, Plains Indians: P α0.040, OR α1.9). Non-alcoholics had intermediate frequencies. Our results suggest that within the distal GABRA2 region is a functional locus or loci that may differ between populations but that alters risk for alcoholism via the mediating action of anxiety. PMID:16874763

  11. HLA-G Haplotypes Are Differentially Associated with Asthmatic Features.

    PubMed

    Ribeyre, Camille; Carlini, Federico; René, Céline; Jordier, François; Picard, Christophe; Chiaroni, Jacques; Abi-Rached, Laurent; Gouret, Philippe; Marin, Grégory; Molinari, Nicolas; Chanez, Pascal; Paganini, Julien; Gras, Delphine; Di Cristofaro, Julie

    2018-01-01

    Human leukocyte antigen (HLA)-G, a HLA class Ib molecule, interacts with receptors on lymphocytes such as T cells, B cells, and natural killer cells to influence immune responses. Unlike classical HLA molecules, HLA-G expression is not found on all somatic cells, but restricted to tissue sites, including human bronchial epithelium cells (HBEC). Individual variation in HLA-G expression is linked to its genetic polymorphism and has been associated with many pathological situations such as asthma, which is characterized by epithelium abnormalities and inflammatory cell activation. Studies reported both higher and equivalent soluble HLA-G (sHLA-G) expression in different cohorts of asthmatic patients. In particular, we recently described impaired local expression of HLA-G and abnormal profiles for alternatively spliced isoforms in HBEC from asthmatic patients. sHLA-G dosage is challenging because of its many levels of polymorphism (dimerization, association with β2-microglobulin, and alternative splicing), thus many clinical studies focused on HLA-G single-nucleotide polymorphisms as predictive biomarkers, but few analyzed HLA-G haplotypes. Here, we aimed to characterize HLA-G haplotypes and describe their association with asthmatic clinical features and sHLA-G peripheral expression and to describe variations in transcription factor (TF) binding sites and alternative splicing sites. HLA - G haplotypes were differentially distributed in 330 healthy and 580 asthmatic individuals. Furthermore, HLA-G haplotypes were associated with asthmatic clinical features showed. However, we did not confirm an association between sHLA-G and genetic, biological, or clinical parameters. HLA-G haplotypes were phylogenetically split into distinct groups, with each group displaying particular variations in TF binding or RNA splicing sites that could reflect differential HLA-G qualitative or quantitative expression, with tissue-dependent specificities. Our results, based on a multicenter

  12. HLA-G Haplotypes Are Differentially Associated with Asthmatic Features

    PubMed Central

    Ribeyre, Camille; Carlini, Federico; René, Céline; Jordier, François; Picard, Christophe; Chiaroni, Jacques; Abi-Rached, Laurent; Gouret, Philippe; Marin, Grégory; Molinari, Nicolas; Chanez, Pascal; Paganini, Julien; Gras, Delphine; Di Cristofaro, Julie

    2018-01-01

    Human leukocyte antigen (HLA)-G, a HLA class Ib molecule, interacts with receptors on lymphocytes such as T cells, B cells, and natural killer cells to influence immune responses. Unlike classical HLA molecules, HLA-G expression is not found on all somatic cells, but restricted to tissue sites, including human bronchial epithelium cells (HBEC). Individual variation in HLA-G expression is linked to its genetic polymorphism and has been associated with many pathological situations such as asthma, which is characterized by epithelium abnormalities and inflammatory cell activation. Studies reported both higher and equivalent soluble HLA-G (sHLA-G) expression in different cohorts of asthmatic patients. In particular, we recently described impaired local expression of HLA-G and abnormal profiles for alternatively spliced isoforms in HBEC from asthmatic patients. sHLA-G dosage is challenging because of its many levels of polymorphism (dimerization, association with β2-microglobulin, and alternative splicing), thus many clinical studies focused on HLA-G single-nucleotide polymorphisms as predictive biomarkers, but few analyzed HLA-G haplotypes. Here, we aimed to characterize HLA-G haplotypes and describe their association with asthmatic clinical features and sHLA-G peripheral expression and to describe variations in transcription factor (TF) binding sites and alternative splicing sites. HLA-G haplotypes were differentially distributed in 330 healthy and 580 asthmatic individuals. Furthermore, HLA-G haplotypes were associated with asthmatic clinical features showed. However, we did not confirm an association between sHLA-G and genetic, biological, or clinical parameters. HLA-G haplotypes were phylogenetically split into distinct groups, with each group displaying particular variations in TF binding or RNA splicing sites that could reflect differential HLA-G qualitative or quantitative expression, with tissue-dependent specificities. Our results, based on a multicenter

  13. The development and application of a multiplex short tandem repeat (STR) system for identifying subspecies, individuals and sex in tigers.

    PubMed

    Zou, Zheng-Ting; Uphyrkina, Olga V; Fomenko, Pavel; Luo, Shu-Jin

    2015-07-01

    Poaching and trans-boundary trafficking of tigers and body parts are threatening the world's last remaining wild tigers. Development of an efficient molecular genetic assay for tracing the origins of confiscated specimens will assist in law enforcement and wildlife forensics for this iconic flagship species. We developed a multiplex genotyping system "tigrisPlex" to simultaneously assess 22 short tandem repeat (STR, or microsatellite) loci and a gender-identifying SRY gene, all amplified in 4 reactions using as little as 1 ng of template DNA. With DNA samples used for between-run calibration, the system generates STR genotypes that are directly compatible with voucher tiger subspecies genetic profiles, hence making it possible to identify subspecies via bi-parentally inherited markers. We applied "tigrisPlex" to 12 confiscated specimens from Russia and identified 6 individuals (3 females and 3 males), each represented by duplicated samples and all designated as Amur tigers (Panthera tigris altaica) with high confidence. This STR multiplex system can serve as an effective and versatile approach for genetic profiling of both wild and captive tigers as well as confiscated tiger products, fulfilling various conservation needs for identifying the origins of tiger samples. © 2015 International Society of Zoological Sciences, Institute of Zoology/Chinese Academy of Sciences and Wiley Publishing Asia Pty Ltd.

  14. Variable number of tandem repeats and pulsed-field gel electrophoresis cluster analysis of enterohemorrhagic Escherichia coli serovar O157 strains.

    PubMed

    Yokoyama, Eiji; Uchimura, Masako

    2007-11-01

    Ninety-five enterohemorrhagic Escherichia coli serovar O157 strains, including 30 strains isolated from 13 intrafamily outbreaks and 14 strains isolated from 3 mass outbreaks, were studied by pulsed-field gel electrophoresis (PFGE) and variable number of tandem repeats (VNTR) typing, and the resulting data were subjected to cluster analysis. Cluster analysis of the VNTR typing data revealed that 57 (60.0%) of 95 strains, including all epidemiologically linked strains, formed clusters with at least 95% similarity. Cluster analysis of the PFGE patterns revealed that 67 (70.5%) of 95 strains, including all but 1 of the epidemiologically linked strains, formed clusters with 90% similarity. The number of epidemiologically unlinked strains forming clusters was significantly less by VNTR cluster analysis than by PFGE cluster analysis. The congruence value between PFGE and VNTR cluster analysis was low and did not show an obvious correlation. With two-step cluster analysis, the number of clustered epidemiologically unlinked strains by PFGE cluster analysis that were divided by subsequent VNTR cluster analysis was significantly higher than the number by VNTR cluster analysis that were divided by subsequent PFGE cluster analysis. These results indicate that VNTR cluster analysis is more efficient than PFGE cluster analysis as an epidemiological tool to trace the transmission of enterohemorrhagic E. coli O157.

  15. β3 Integrin Haplotype Influences Gene Regulation and Plasma von Willebrand Factor Activity

    PubMed Central

    Payne, Katie E; Bray, Paul F; Grant, Peter J; Carter, Angela M

    2008-01-01

    The Leu33Pro polymorphism of the gene encoding β3 integrin (ITGB3) is associated with acute coronary syndromes and influences platelet aggregation. Three common promoter polymorphisms have also been identified. The aims of this study were to (1) investigate the influence of the ITGB3 −400C/A, −425A/C and −468G/A promoter polymorphisms on reporter gene expression and nuclear protein binding and (2) determine genotype and haplotype associations with platelet αIIbβ3 receptor density. Promoter haplotypes were introduced into an ITGB3 promoter-pGL3 construct by site directed mutagenesis and luciferase reporter gene expression analysed in HEL and HMEC-1 cells. Binding of nuclear proteins was assessed by electrophoretic mobility shift assay. The association of ITGB3 haplotype with platelet αIIbβ3 receptor density was determined in 223 subjects. Species conserved motifs were identified in the ITGB3 promoter in the vicinity of the 3 polymorphisms. The GAA, GCC, AAC, AAA and ACC constructs induced ~50% increased luciferase expression relative to the GAC construct in both cell types. Haplotype analysis including Leu33Pro indicated 5 common haplotypes; no associations between ITGB3 haplotypes and receptor density were found. However, the GCC-Pro33 haplotype was associated with significantly higher vWF activity (128.6 [112.1–145.1]%) compared with all other haplotypes (107.1 [101.2–113.0]%, p=0.02). In conclusion, the GCC-Pro33 haplotype was associated with increased vWF activity but not with platelet αIIbβ3 receptor density, which may indicate ITGB3 haplotype influences endothelial function. PMID:18045606

  16. Two novel monoclonal antibodies against the MUC4 tandem repeat reacting with an antigen overexpressed by lung cancer.

    PubMed

    Botti, C; Seregni, E; Ménard, S; Collini, P; Tagliabue, E; Campiglio, M; Vergani, B; Ghirelli, C; Aiello, P; Pilotti, S; Bombardieri, E

    2000-01-01

    In this study we investigated the immunochemical and cytochemical reactivity of two monoclonal antibodies against the 16-amino acid tandem repeat of MUC4 to demonstrate a possible variation of the mucin core peptide expression related to lung cancer. The immunocytochemical anti-MUC4 reactivity was analyzed in four lung cancer cell lines (Calu-1, Calu-3, H460, SKMES) and in other tumor cell lines, as well as in frozen materials from 21 lung adenocarcinomas (ACs), including five bronchioloalveolar carcinomas (BACs), and 11 squamous cell lung carcinomas (SqCCs). A weak fluorescence anti-MUC4 positivity (range: 10.3-16.2) was observed only in acetone-fixed lung cancer cell lines Calu-1, Calu-3 and H460. These three lung cancer cell lines also showed a cytoplasmic immunoperoxidase reactivity. The immunostaining in lung cancer tissues showed a granular cytoplasmic reactivity: 15/21 (71%) and 17/21 (80%) ACs were positive with BC-LuC18.2 and BC-LuCF12, respectively. All BACs were positive. Moderate to strong reactivity was present in well-differentiated ACs. In the normal lung parenchyma counterparts weak reactivity was found only in bronchiolar cells. All SqCCs were negative. Anti-MUC4 reactivity was also observed in the alveolar mucus. In conclusion, our anti-MUC4 MAbs detect a secretion product present in mucus and this product is elaborated by lung cancer cells and overexpressed in well-differentiated lung ACs.

  17. Genetic data and de novo mutation rates in father-son pairs of 23 Y-STR loci in Southern Brazil population.

    PubMed

    Da Fré, Nicole Nascimento; Rodenbusch, Rodrigo; Gastaldo, André Zoratto; Hanson, Erin; Ballantyne, Jack; Alho, Clarice Sampaio

    2015-11-01

    We evaluated haplotype and allele frequencies, as well as statistical forensic parameters, for 23 Y-chromosome short tandem repeats (STRs) loci of the PowerPlex®Y23 system (DYS19, DYS385a/b, DYS389I/II, DYS390, DYS391, DYS392, DYS393, DYS437, DYS438, DYS439, DYS448, DYS456, DYS458, DYS635, Y-GATA-H4, DYS481, DYS533, DYS549, DYS570, DYS576, DYS643) in a sample of 150 apparently healthy males, resident in South Brazil. A total of 150 different haplotypes were identified. The highest gene diversity (GD) was observed for the single locus marker DYS570 (GD = 0.7888) and for a two-locus system DYS385 (GD = 0.9009). We also examined 150 father-son pairs by the same system, and a total of 13 mutations were identified in the 3450 father-son allelic transfers, with an overall mutation rate across the 23 loci of 3.768 × 10(-3) (95% CI: 3.542 × 10(-3) to 3.944 × 10(-3)). In all cases there was only one locus mutated with gain/loss of repeats in the son (5 one-repeat gains, and 7 one-repeat and 1 two-repeat losses); we observed no instances of mutations involving a non-integral number of repeats.

  18. Two different size classes of 5S rDNA units coexisting in the same tandem array in the razor clam Ensis macha: is this region suitable for phylogeographic studies?

    PubMed

    Fernández-Tajes, Juan; Méndez, Josefina

    2009-12-01

    For a study of 5S ribosomal genes (rDNA) in the razor clam Ensis macha, the 5S rDNA region was amplified and sequenced. Two variants, so-called type I or short repeat (approximately 430 bp) and type II or long repeat (approximately 735 bp), appeared to be the main components of the 5S rDNA of this species. Their spacers differed markedly, both in length and nucleotide composition. The organization of the two variants was investigated by amplifying the genomic DNA with primers based on the sequence of the type I and type II spacers. PCR amplification products with primers EMLbF and EMSbR showed that the long and short repeats are associated within the same tandem array, suggesting an intermixed arrangement of both spacers. Nevertheless, amplifications carried out with inverse primers EMSinvF/R and EMLinvF/R revealed that some short and long repeats are contiguous in the same tandem array. This is the first report of the coexistence of two variable spacers in the same tandem array in bivalve mollusks.

  19. Development of a Tandem Repeat-Based Polymerase Chain Displacement Reaction Method for Highly Sensitive Detection of 'Candidatus Liberibacter asiaticus'.

    PubMed

    Lou, Binghai; Song, Yaqin; RoyChowdhury, Moytri; Deng, Chongling; Niu, Ying; Fan, Qijun; Tang, Yan; Zhou, Changyong

    2018-02-01

    Huanglongbing (HLB) is one of the most destructive diseases in citrus production worldwide. Early detection of HLB pathogens can facilitate timely removal of infected citrus trees in the field. However, low titer and uneven distribution of HLB pathogens in host plants make reliable detection challenging. Therefore, the development of effective detection methods with high sensitivity is imperative. This study reports the development of a novel method, tandem repeat-based polymerase chain displacement reaction (TR-PCDR), for the detection of 'Candidatus Liberibacter asiaticus', a widely distributed HLB-associated bacterium. A uniquely designed primer set (TR2-PCDR-F/TR2-PCDR-1R) and a thermostable Taq DNA polymerase mutant with strand displacement activity were used for TR-PCDR amplification. Performed in a regular thermal cycler, TR-PCDR could produce more than two amplicons after each amplification cycle. Sensitivity of the developed TR-PCDR was 10 copies of target DNA fragment. The sensitive level was proven to be 100× higher than conventional PCR and similar to real-time PCR. Data from the detection of 'Ca. L. asiaticus' with filed samples using the above three methods also showed similar results. No false-positive TR-PCDR amplification was observed from healthy citrus samples and water controls. These results thereby illustrated that the developed TR-PCDR method can be applied to the reliable, highly sensitive, and cost-effective detection of 'Ca. L. asiaticus'.

  20. Histone and ribosomal RNA repetitive gene clusters of the boll weevil are linked in a tandem array.

    PubMed

    Roehrdanz, R; Heilmann, L; Senechal, P; Sears, S; Evenson, P

    2010-08-01

    Histones are the major protein component of chromatin structure. The histone family is made up of a quintet of proteins, four core histones (H2A, H2B, H3 & H4) and the linker histones (H1). Spacers are found between the coding regions. Among insects this quintet of genes is usually clustered and the clusters are tandemly repeated. Ribosomal DNA contains a cluster of the rRNA sequences 18S, 5.8S and 28S. The rRNA genes are separated by the spacers ITS1, ITS2 and IGS. This cluster is also tandemly repeated. We found that the ribosomal RNA repeat unit of at least two species of Anthonomine weevils, Anthonomus grandis and Anthonomus texanus (Coleoptera: Curculionidae), is interspersed with a block containing the histone gene quintet. The histone genes are situated between the rRNA 18S and 28S genes in what is known as the intergenic spacer region (IGS). The complete reiterated Anthonomus grandis histone-ribosomal sequence is 16,248 bp.

  1. Polymorphism of 11 Y Chromosome Short Tandem Repeat Markers among Malaysian Aborigines.

    PubMed

    Mohd Yussup, Sofia Sakina; Marzukhi, Marlia; Md-Zain, Badrul Munir; Mamat, Kamaruddin; Mohd Yusof, Farida Zuraina

    2017-01-01

    The conventional technique such as patrilocality suggests some substantial effects on population diversity. With that, this particular study investigated the paternal line, specifically Scientific Working Group on DNA Analysis Methods (SWGDAM)-recommended Y-STR markers, namely, DYS19, DYS385, DYS389I/II, DYS390, DYS391, DYS392, DYS393, DYS438, and DYS439. These markers were tested to compare 184 Orang Asli individuals from 3 tribes found in Peninsular Malaysia. As a result, the haplotype diversity and the discrimination capacity obtained were 0.9987 and 0.9076, respectively. Besides, the most diverse marker was DYS385b, whereas the least was DYS391. Furthermore, the Senoi and Proto-Malay tribes were found to be the most distant, whereas the Senoi and Negrito clans were almost similar to each other. In addition, the analysis of molecular variance analysis revealed 82% of variance within the population, but only 18% of difference between the tribes. Finally, the phylogenetic trees constructed using Neighbour Joining and UPGMA (Unweighted Pair Group Method with Arithmetic Mean) displayed several clusters that were tribe specific. With that, future studies are projected to analyse individuals based on more specific sub-tribes.

  2. Novel strategies to mine alcoholism-related haplotypes and genes by combining existing knowledge framework.

    PubMed

    Zhang, RuiJie; Li, Xia; Jiang, YongShuai; Liu, GuiYou; Li, ChuanXing; Zhang, Fan; Xiao, Yun; Gong, BinSheng

    2009-02-01

    High-throughout single nucleotide polymorphism detection technology and the existing knowledge provide strong support for mining the disease-related haplotypes and genes. In this study, first, we apply four kinds of haplotype identification methods (Confidence Intervals, Four Gamete Tests, Solid Spine of LD and fusing method of haplotype block) into high-throughout SNP genotype data to identify blocks, then use cluster analysis to verify the effectiveness of the four methods, and select the alcoholism-related SNP haplotypes through risk analysis. Second, we establish a mapping from haplotypes to alcoholism-related genes. Third, we inquire NCBI SNP and gene databases to locate the blocks and identify the candidate genes. In the end, we make gene function annotation by KEGG, Biocarta, and GO database. We find 159 haplotype blocks, which relate to the alcoholism most possibly on chromosome 1 approximately 22, including 227 haplotypes, of which 102 SNP haplotypes may increase the risk of alcoholism. We get 121 alcoholism-related genes and verify their reliability by the functional annotation of biology. In a word, we not only can handle the SNP data easily, but also can locate the disease-related genes precisely by combining our novel strategies of mining alcoholism-related haplotypes and genes with existing knowledge framework.

  3. Association Between Chloroplast DNA and Mitochondrial DNA Haplotypes in Prunus spinosa L. (Rosaceae) Populations across Europe

    PubMed Central

    MOHANTY, APARAJITA; MARTÍN, JUAN PEDRO; GONZÁLEZ, LUIS MIGUEL; AGUINAGALDE, ITZIAR

    2003-01-01

    Chloroplast DNA (cpDNA) and mitochondrial DNA (mtDNA) were studied in 24 populations of Prunus spinosa sampled across Europe. The cpDNA and mtDNA fragments were amplified using universal primers and subsequently digested with restriction enzymes to obtain the polymorphisms. Combinations of all the polymorphisms resulted in 33 cpDNA haplotypes and two mtDNA haplotypes. Strict association between the cpDNA haplotypes and the mtDNA haplotypes was detected in most cases, indicating conjoint inheritance of the two genomes. The most frequent and abundant cpDNA haplotype (C20; frequency, 51 %) is always associated with the more frequent and abundant mtDNA haplotype (M1; frequency, 84 %). All but two of the cpDNA haplotypes associated with the less frequent mtDNA haplotype (M2) are private haplotypes. These private haplotypes are phylogenetically related but geographically unrelated. They form a separate cluster on the minimum‐length spanning tree. PMID:14534199

  4. Identification and genetic effect of haplotype in the bovine BMP7 gene.

    PubMed

    Huang, Yong-Zhen; Wang, Xin-Lei; He, Hua; Lan, Xian-Yong; Lei, Chu-Zhao; Zhang, Chun-Lei; Chen, Hong

    2013-12-15

    Bone morphogenetic proteins (BMPs) are peptide growth factors belonging to the transforming growth factor-beta (TGF-β) superfamily, and some members of the BMP family support white adipocyte differentiation. In this study, we focused on the BMP7 which singularly promotes the differentiation of brown preadipocytes. Haplotypes involving 5 single nucleotide polymorphism (SNP) sites in the bovine BMP7 gene were identified and their effect on body weight was analyzed. 16 haplotypes and 18 combined haplotypes were revealed and the linkage disequilibrium was assessed in the cattle population with 602 individuals representing three main cattle breeds from China. The results showed that haplotypes 3, 10 and 14 were predominant and accounted for 75.64%, 69.85%, and 83.36% in Nanyang, Qinchuan and Jiaxian cattle breeds, respectively. The statistical analyses indicated that the SNP 1, 4, and 5 are associated with the body weight, body length, and heart girth at 12 and 24 months in Nanyang cattle population (P<0.05), whereas there is no significant association between their 16 haplotypes and 18 combined haplotypes. Our results provide evidence that some SNPs and haplotypes in BMP7 are associated with growth traits, and may be utilized as a genetic marker in marker-assisted selection for beef cattle breeding programs. Copyright © 2013. Published by Elsevier B.V.

  5. β-globin gene cluster haplotypes in ethnic minority populations of southwest China

    PubMed Central

    Sun, Hao; Liu, Hongxian; Huang, Kai; Lin, Keqin; Huang, Xiaoqin; Chu, Jiayou; Ma, Shaohui; Yang, Zhaoqing

    2017-01-01

    The genetic diversity and relationships among ethnic minority populations of southwest China were investigated using seven polymorphic restriction enzyme sites in the β-globin gene cluster. The haplotypes of 1392 chromosomes from ten ethnic populations living in southwest China were determined. Linkage equilibrium and recombination hotspot were found between the 5′ sites and 3′ sites of the β-globin gene cluster. 5′ haplotypes 2 (+−−−), 6 (−++−+), 9 (−++++) and 3′ haplotype FW3 (−+) were the predominant haplotypes. Notably, haplotype 9 frequency was significantly high in the southwest populations, indicating their difference with other Chinese. The interpopulation differentiation of southwest Chinese minority populations is less than those in populations of northern China and other continents. Phylogenetic analysis shows that populations sharing same ethnic origin or language clustered to each other, indicating current β-globin cluster diversity in the Chinese populations reflects their ethnic origin and linguistic affiliations to a great extent. This study characterizes β-globin gene cluster haplotypes in southwest Chinese minorities for the first time, and reveals the genetic variability and affinity of these populations using β-globin cluster haplotype frequencies. The results suggest that ethnic origin plays an important role in shaping variations of the β-globin gene cluster in the southwestern ethnic populations of China. PMID:28205625

  6. Ancient mitochondrial haplotypes and evidence for intragenic recombination in a gynodioecious plant.

    PubMed

    Städler, Thomas; Delph, Lynda F

    2002-09-03

    Because of their extremely low nucleotide mutation rates, plant mitochondrial genes are generally not expected to show variation within species. Remarkably, we found nine distinct cytochrome b sequence haplotypes in the gynodioecious alpine plant Silene acaulis, with two or more haplotypes coexisting locally in each of three sampled regions. Moreover, there is evidence for intragenic recombination in the history of the haplotype sample, implying at least transient heteroplasmy of mitochondrial DNA (mtDNA). Heteroplasmy might be achieved by one of two potential mechanisms, either continuous coexistence of subgenomic fragments in low stoichiometry, or occasional paternal leakage of mtDNA. On the basis of levels of synonymous nucleotide substitutions, the average divergence time between haplotypes is estimated to be at least 15 million years. Ancient coalescence of extant haplotypes is further indicated by the paucity of fixed differences in haplotypes obtained from related species, a pattern expected under trans-specific evolution. Our data are consistent with models of frequency-dependent selection on linked cytoplasmic male-sterility factors, the putative molecular basis of females in gynodioecious populations. However, associations between marker loci and the inferred male-sterility genes can be maintained only with very low rates of recombination. Heteroplasmy and recombination between divergent haplotypes imply unexplored consequences for the evolutionary dynamics of gynodioecy, a widespread plant breeding system.

  7. Geographic distribution of haplotype diversity at the bovine casein locus

    PubMed Central

    Jann, Oliver C; Ibeagha-Awemu, Eveline M; Özbeyaz, Ceyhan; Zaragoza, Pilar; Williams, John L; Ajmone-Marsan, Paolo; Lenstra, Johannes A; Moazami-Goudarzi, Katy; Erhardt, Georg

    2004-01-01

    The genetic diversity of the casein locus in cattle was studied on the basis of haplotype analysis. Consideration of recently described genetic variants of the casein genes which to date have not been the subject of diversity studies, allowed the identification of new haplotypes. Genotyping of 30 cattle breeds from four continents revealed a geographically associated distribution of haplotypes, mainly defined by frequencies of alleles at CSN1S1 and CSN3. The genetic diversity within taurine breeds in Europe was found to decrease significantly from the south to the north and from the east to the west. Such geographic patterns of cattle genetic variation at the casein locus may be a result of the domestication process of modern cattle as well as geographically differentiated natural or artificial selection. The comparison of African Bos taurus and Bos indicus breeds allowed the identification of several Bos indicus specific haplotypes (CSN1S1*C-CSN2*A2-CSN3*AI/CSN3*H) that are not found in pure taurine breeds. The occurrence of such haplotypes in southern European breeds also suggests that an introgression of indicine genes into taurine breeds could have contributed to the distribution of the genetic variation observed. PMID:15040901

  8. Multi-locus variable-number tandem repeat analysis of Chinese Brucella strains isolated from 1953 to 2013.

    PubMed

    Tian, Guo-Zhong; Cui, Bu-Yun; Piao, Dong-Ri; Zhao, Hong-Yan; Li, Lan-Yu; Liu, Xi; Xiao, Pei; Zhao, Zhong-Zhi; Xu, Li-Qing; Jiang, Hai; Li, Zhen-Jun

    2017-05-02

    Brucellosis was a common human and livestock disease caused by Brucella strains, the category B priority pathogens by the US Center for Disease Control (CDC). Identified as a priority disease in human and livestock populations, the increasing incidence in recent years in China needs urgent control measures for this disease but the molecular background important for monitoring the epidemiology of Brucella strains at the national level is still lacking. A total of 600 Brucella isolates collected during 60 years (from 1953 to 2013) in China were genotyped by multiple locus variable-number tandem repeat analysis (MLVA) and the variation degree of MLVA11 loci was calculated by the Hunter Gaston Diversity Index (HGDI) values. The charts and map were processed by Excel 2013, and cluster analysis and epidemiological distribution was performed using BioNumerics (version 5.1). The 600 representative Brucella isolates fell into 104 genotypes with 58 singleton genotypes by the MLVA11 assay, including B. melitensis biovars 2 and 3 (five main genotypes), B. abortus biovars 1 and 3 (two main genotypes), B. suis biovars 1 and 3 (three main genotypes), and B. canis (two main genotypes) respectively. While most B. suis biovar 1 and biovar 3 were respectively found in northern provinces and southern provinces, B. melitensis and B. abortus strains were dominant in China. Canine Brucellosis was only found in animals without any human cases reported. Eight Brucellosis epidemic peaks emerged during the 60 years between 1953 and 2013: 1955 - 1959, 1962 - 1969, 1971 - 1975, 1977 - 1983, 1985 - 1989, 1992 - 1997, 2000 - 2008 and 2010 - 2013 in China. Brucellosis has its unique molecular epidemiological patterns with specific spatial and temporal distribution according to MLVA. IDOP-D-16-00101.

  9. Linkage analysis with multiplexed short tandem repeat polymorphisms using infrared fluorescence and M13 tailed primers

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Oetting, W.S.; Lee, H.K.; Flanders, D.J.

    The use of short tandem repeat polymorphisms (STRPs) as marker loci for linkage analysis is becoming increasingly important due to their large numbers in the human genome and their high degree of polymorphism. Fluorescence-based detection of the STRP pattern with an automated DNA sequencer has improved the efficiency of this technique by eliminating the need for radioactivity and producing a digitized autoradiogram-like image that can be used for computer analysis. In an effort to simplify the procedure and to reduce the cost of fluorescence STRP analysis, we have developed a technique known as multiplexing STRPs with tailed primers (MSTP) usingmore » primers that have a 19-bp extension, identical to the sequence of an M13 sequencing primer, on the 5{prime} end of the forward primer in conjunction with multiplexing several primer pairs in a single polymerase chain reaction (PCR) amplification. The banding pattern is detected with the addition of the M13 primer-dye conjugate as the sole primer conjugated to the fluorescent dye, eliminating the need for direct conjugation of the infrared fluorescent dye to the STRP primers. The use of MSTP for linkage analysis greatly reduces the number of PCR reactions. Up to five primer pairs can be multiplexed together in the same reaction. At present, a set of 148 STRP markers spaced at an average genetic distance of 28 cM throughout the autosomal genome can be analyzed in 37 sets of multiplexed amplification reactions. We have automated the analysis of these patterns for linkage using software that both detects the STRP banding pattern and determines their sizes. This information can then be exported in a user-defined format from a database manager for linkage analysis. 15 refs., 2 figs., 4 tabs.« less

  10. Discovery, evaluation and distribution of haplotypes of the wheat Ppd-D1 gene.

    PubMed

    Guo, Zhiai; Song, Yanxia; Zhou, Ronghua; Ren, Zhenglong; Jia, Jizeng

    2010-02-01

    Ppd-D1 is one of the most potent genes affecting the photoperiod response of wheat (Triticum aestivum). Only two alleles, insensitive Ppd-D1a and sensitive Ppd-D1b, were known previously, and these did not adequately explain the broad adaptation of wheat to photoperiod variation. In this study, five diagnostic molecular markers were employed to identify Ppd-D1 haplotypes in 492 wheat varieties from diverse geographic locations and 55 accessions of Aegilops tauschii, the D genome donor species of wheat. Six Ppd-D1 haplotypes, designated I-VI, were identified. Types II, V and VI were considered to be more ancient and types I, III and IV were considered to be derived from type II. The transcript abundances of the Ppd-D1 haplotypes showed continuous variation, being highest for haplotype I, lowest for haplotype III, and correlating negatively with varietal differences in heading time. These haplotypes also significantly affected other agronomic traits. The distribution frequency of Ppd-D1 haplotypes showed partial correlations with both latitudes and altitudes of wheat cultivation regions. The evolution, expression and distribution of Ppd-D1 haplotypes were consistent evidentially with each other. What was regarded as a pair of alleles in the past can now be considered a series of alleles leading to continuous variation.

  11. Multilocus variable-number tandem repeat analysis for molecular typing and phylogenetic analysis of Shigella flexneri

    PubMed Central

    2009-01-01

    Background Shigella flexneri is one of the causative agents of shigellosis, a major cause of childhood mortality in developing countries. Multilocus variable-number tandem repeat (VNTR) analysis (MLVA) is a prominent subtyping method to resolve closely related bacterial isolates for investigation of disease outbreaks and provide information for establishing phylogenetic patterns among isolates. The present study aimed to develop an MLVA method for S. flexneri and the VNTR loci identified were tested on 242 S. flexneri isolates to evaluate their variability in various serotypes. The isolates were also analyzed by pulsed-field gel electrophoresis (PFGE) to compare the discriminatory power and to evaluate the usefulness of MLVA as a tool for phylogenetic analysis of S. flexneri. Results Thirty-six VNTR loci were identified by exploring the repeat sequence loci in genomic sequences of Shigella species and by testing the loci on nine isolates of different subserotypes. The VNTR loci in different serotype groups differed greatly in their variability. The discriminatory power of an MLVA assay based on four most variable VNTR loci was higher, though not significantly, than PFGE for the total isolates, a panel of 2a isolates, which were relatively diverse, and a panel of 4a/Y isolates, which were closely-related. Phylogenetic groupings based on PFGE patterns and MLVA profiles were considerably concordant. The genetic relationships among the isolates were correlated with serotypes. The phylogenetic trees constructed using PFGE patterns and MLVA profiles presented two distinct clusters for the isolates of serotype 3 and one distinct cluster for each of the serotype groups, 1a/1b/NT, 2a/2b/X/NT, 4a/Y, and 6. Isolates that had different serotypes but had closer genetic relatedness than those with the same serotype were observed between serotype Y and subserotype 4a, serotype X and subserotype 2b, subserotype 1a and 1b, and subserotype 3a and 3b. Conclusions The 36 VNTR loci

  12. Spectrum of Phenylalanine Hydroxylase Gene Mutations in Hamadan and Lorestan Provinces of Iran and Their Associations with Variable Number of Tandem Repeat Alleles.

    PubMed

    Alibakhshi, Reza; Moradi, Keivan; Biglari, Mostafa; Shafieenia, Samaneh

    2018-05-01

    Phenylketonuria (PKU) is one of the most common known inherited metabolic diseases. The present study aimed to investigate the status of molecular defects in phenylalanine hydroxylase ( PAH ) gene in western Iranian PKU patients (predominantly from Kermanshah, Hamadan, and Lorestan provinces) during 2014-2016. Additionally, the results were compared with similar studies in Iran. Nucleotide sequence analysis of all 13 exons and their flanking intronic regions of the PAH gene was performed in 18 western Iranian PKU patients. Moreover, a variable number of tandem repeat (VNTR) located in the PAH gene was studied. The results revealed a mutational spectrum encompassing 11 distinct mutations distributed along the PAH gene sequence on 34 of the 36 mutant alleles (diagnostic efficiency of 94.4%). Also, four PAH VNTR alleles (with repeats of 3, 7, 8 and 9) were detected. The three most frequent mutations were IVS9+5G>A, IVS7-5T>C, and p.P281L with the frequency of 27.8%, 11%, and 11%, respectively. The results showed that there is not only a consanguineous relation, but also a difference in PAH characters of mutations between Kermanshah and the other two parts of western Iran (Hamadan and Lorestan). Also, it seems that the spectrum of mutations in western Iran is relatively distinct from other parts of the country, suggesting that this region might be a special PAH gene distribution region. Moreover, our findings can be useful in the identification of genotype to phenotype relationship in patients, and provide future abilities for confirmatory diagnostic testing, prognosis, and predict the severity of PKU patients.

  13. Haplotype estimation using sequencing reads.

    PubMed

    Delaneau, Olivier; Howie, Bryan; Cox, Anthony J; Zagury, Jean-François; Marchini, Jonathan

    2013-10-03

    High-throughput sequencing technologies produce short sequence reads that can contain phase information if they span two or more heterozygote genotypes. This information is not routinely used by current methods that infer haplotypes from genotype data. We have extended the SHAPEIT2 method to use phase-informative sequencing reads to improve phasing accuracy. Our model incorporates the read information in a probabilistic model through base quality scores within each read. The method is primarily designed for high-coverage sequence data or data sets that already have genotypes called. One important application is phasing of single samples sequenced at high coverage for use in medical sequencing and studies of rare diseases. Our method can also use existing panels of reference haplotypes. We tested the method by using a mother-father-child trio sequenced at high-coverage by Illumina together with the low-coverage sequence data from the 1000 Genomes Project (1000GP). We found that use of phase-informative reads increases the mean distance between switch errors by 22% from 274.4 kb to 328.6 kb. We also used male chromosome X haplotypes from the 1000GP samples to simulate sequencing reads with varying insert size, read length, and base error rate. When using short 100 bp paired-end reads, we found that using mixtures of insert sizes produced the best results. When using longer reads with high error rates (5-20 kb read with 4%-15% error per base), phasing performance was substantially improved. Copyright © 2013 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  14. Congruence as a measurement of extended haplotype structure across the genome

    PubMed Central

    2012-01-01

    Background Historically, extended haplotypes have been defined using only a few data points, such as alleles for several HLA genes in the MHC. High-density SNP data, and the increasing affordability of whole genome SNP typing, creates the opportunity to define higher resolution extended haplotypes. This drives the need for new tools that support quantification and visualization of extended haplotypes as defined by as many as 2000 SNPs. Confronted with high-density SNP data across the major histocompatibility complex (MHC) for 2,300 complete families, compiled by the Type 1 Diabetes Genetics Consortium (T1DGC), we developed software for studying extended haplotypes. Methods The software, called ExHap (Extended Haplotype), uses a similarity measurement we term congruence to identify and quantify long-range allele identity. Using ExHap, we analyzed congruence in both the T1DGC data and family-phased data from the International HapMap Project. Results Congruent chromosomes from the T1DGC data have between 96.5% and 99.9% allele identity over 1,818 SNPs spanning 2.64 megabases of the MHC (HLA-DRB1 to HLA-A). Thirty-three of 132 DQ-DR-B-A defined haplotype groups have > 50% congruent chromosomes in this region. For example, 92% of chromosomes within the DR3-B8-A1 haplotype are congruent from HLA-DRB1 to HLA-A (99.8% allele identity). We also applied ExHap to all 22 autosomes for both CEU and YRI cohorts from the International HapMap Project, identifying multiple candidate extended haplotypes. Conclusions Long-range congruence is not unique to the MHC region. Patterns of allele identity on phased chromosomes provide a simple, straightforward approach to visually and quantitatively inspect complex long-range structural patterns in the genome. Such patterns aid the biologist in appreciating genetic similarities and differences across cohorts, and can lead to hypothesis generation for subsequent studies. PMID:22369243

  15. MGMT DNA repair gene promoter/enhancer haplotypes alter transcription factor binding and gene expression.

    PubMed

    Xu, Meixiang; Cross, Courtney E; Speidel, Jordan T; Abdel-Rahman, Sherif Z

    2016-10-01

    The O 6 -methylguanine-DNA methyltransferase (MGMT) protein removes O 6 -alkyl-guanine adducts from DNA. MGMT expression can thus alter the sensitivity of cells and tissues to environmental and chemotherapeutic alkylating agents. Previously, we defined the haplotype structure encompassing single nucleotide polymorphisms (SNPs) in the MGMT promoter/enhancer (P/E) region and found that haplotypes, rather than individual SNPs, alter MGMT promoter activity. The exact mechanism(s) by which these haplotypes exert their effect on MGMT promoter activity is currently unknown, but we noted that many of the SNPs comprising the MGMT P/E haplotypes are located within or in close proximity to putative transcription factor binding sites. Thus, these haplotypes could potentially affect transcription factor binding and, subsequently, alter MGMT promoter activity. In this study, we test the hypothesis that MGMT P/E haplotypes affect MGMT promoter activity by altering transcription factor (TF) binding to the P/E region. We used a promoter binding TF profiling array and a reporter assay to evaluate the effect of different P/E haplotypes on TF binding and MGMT expression, respectively. Our data revealed a significant difference in TF binding profiles between the different haplotypes evaluated. We identified TFs that consistently showed significant haplotype-dependent binding alterations (p ≤ 0.01) and revealed their role in regulating MGMT expression using siRNAs and a dual-luciferase reporter assay system. The data generated support our hypothesis that promoter haplotypes alter the binding of TFs to the MGMT P/E and, subsequently, affect their regulatory function on MGMT promoter activity and expression level.

  16. Y-STR haplotypes of Native American populations from the Brazilian Amazon region.

    PubMed

    Palha, Teresinha Jesus Brabo Ferreira; Rodrigues, Elzemar Martins Ribeiro; dos Santos, Sidney Emanuel Batista

    2010-10-01

    The allele and haplotype frequencies of nine Y-STRs (DYS19, DYS389 I, DYS389 II, DYS390, DYS391, DYS392, DYS393, DYS385 I/II) were determined in a sample of six native tribes from the Brazilian Amazon (Tiriyó, Awa-Guajá, Waiãpi, Urubu-Kaapor, Zoé and Parakanã). Forty-eight different haplotypes were identified, 28 of which unique. Five haplotypes are very frequent and were shared by over 10 individuals. The estimated haplotype diversity (0.9114) was very low compared to other geographic groups, including Africans, Europeans and Asians. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.

  17. Multiple-locus, variable number of tandem repeat analysis (MLVA) of the fish-pathogen Francisella noatunensis

    PubMed Central

    2011-01-01

    Background Since Francisella noatunensis was first isolated from cultured Atlantic cod in 2004, it has emerged as a global fish pathogen causing disease in both warm and cold water species. Outbreaks of francisellosis occur in several important cultured fish species making a correct management of this disease a matter of major importance. Currently there are no vaccines or treatments available. A strain typing system for use in studies of F. noatunensis epizootics would be an important tool for disease management. However, the high genetic similarity within the Francisella spp. makes strain typing difficult, but such typing of the related human pathogen Francisella tullarensis has been performed successfully by targeting loci with higher genetic variation than the traditional signature sequences. These loci are known as Variable Numbers of Tandem Repeat (VNTR). The aim of this study is to identify possible useful VNTRs in the genome of F. noatunensis. Results Seven polymorphic VNTR loci were identified in the preliminary genome sequence of F. noatunensis ssp. noatunensis GM2212 isolate. These VNTR-loci were sequenced in F. noatunensis isolates collected from Atlantic cod (Gadus morhua) from Norway (n = 21), Three-line grunt (Parapristipoma trilineatum) from Japan (n = 1), Tilapia (Oreochromis spp.) from Indonesia (n = 3) and Atlantic salmon (Salmo salar) from Chile (n = 1). The Norwegian isolates presented in this study show both nine allelic profiles and clades, and that the majority of the farmed isolates belong in two clades only, while the allelic profiles from wild cod are unique. Conclusions VNTRs can be used to separate isolates belonging to both subspecies of F. noatunensis. Low allelic diversity in F. noatunensis isolates from outbreaks in cod culture compared to isolates wild cod, indicate that transmission of these isolates may be a result of human activity. The sequence based MLVA system presented in this study should provide a good starting point for

  18. [Polymorphism analysis of 20 autosomal short-tandem repeat loci in southern Chinese Han population].

    PubMed

    Chen, Ling; Lu, Hui-Jie; DU, Wei-An; Qiu, Ping-Ming; Liu, Chao

    2016-02-20

    To evaluate the value of PowerPlex ® 21 System (Promega) and study the genetic polymorphism of its 20 short-tandem repeat (STR) loci in southern Chinese Han population. We conducted genotyping experiments using PowerPlex ® 21 System on 20 autosomal STR loci (D3S1358, D1S1656, D6S1043, D13S317, Penta E, D16S539, D18S51, D2S1338, CSF1PO, Penta D, TH01, vWA, D21S11, D7S820, D5S818, TPOX, D8S1179, D12S391, D19S433 and FGA) in 2367 unrelated Chinese Han individuals living in South China. The allele frequencies and parameters commonly used in forensic science were statistically analyzed in these individuals and compared with the reported data of other populations. The PowerPlex ® 21 System had a power of discrimination (PD) ranging from 0.7839 to 0.9852 and a power of exclusion (PE) ranging from 0.2974 to 0.8099 for the 20 loci. No significant deviation from Hardy-Weinberg expectations was found for all the loci except for D5S818. This southern Chinese Han population had significant differences in the allele frequencies from 8 ethnic groups reported in China, and showed significant differences at 8 to 20 STR foci from 5 foreign populations. The allele frequency at the locus D1S1656 in this southern Chinese Han population differed significantly from those in the 5 foreign populations and from 3 reported Han populations in Beijing, Zhejiang Province and Fujian Province of China. The neighbor-joining phylogenetictree showed clustering of all the Asian populations in one branch, while the northern Italian and Argentina populations clustered in a separate branch. This southern Chinese Han population had the nearest affinity with the Yi ethnic population in Yunnan Province of China. The 20 STR loci are highly polymorphic in this southern Chinese Han population, suggesting the value of this set of STR loci in forensic personal identification, paternity testing and anthropological study.

  19. A TNF region haplotype offers protection from typhoid fever in Vietnamese patients

    PubMed Central

    2009-01-01

    The genomic region surrounding the TNF locus on human chromosome 6 has previously been associated with typhoid fever in Vietnam. We used a haplotypic approach to understand this association further. Eighty single nucleotide polymorphisms (SNPs) spanning a 150 kb region were genotyped in 95 Vietnamese individuals (typhoid case/mother/father trios). A subset of data from 33 SNPs with a minor allele frequency of >4.3% was used to construct haplotypes. Fifteen SNPs, which tagged the 42 constructed haplotypes were selected. The haplotype tagging SNPs (T1-T15) were genotyped in 380 confirmed typhoid cases and 380 Vietnamese ethnically matched controls. Allelic frequencies of seven SNPs (T1, T2, T3, T5, T6, T7, T8) were significantly different between typhoid cases and controls. Logistic regression results support the hypothesis that there is just one signal associated with disease at this locus. Haplotype-based analysis of the tag SNPs provided positive evidence of association with typhoid (posterior probability 0.821). The analysis highlighted a low-risk cluster of haplotypes that each carry the minor allele of T1 or T7, but not both, and otherwise carry the combination of alleles *12122*1111 at T1-T11, further supporting the one associated signal hypothesis. Finally, individuals that carry the typhoid fever protective haplotype *12122*1111 also produce a relatively low TNF-α response to LPS. PMID:17503085

  20. Clostridium botulinum Group I Strain Genotyping by 15-Locus Multilocus Variable-Number Tandem-Repeat Analysis ▿ †

    PubMed Central

    Fillo, Silvia; Giordani, Francesco; Anniballi, Fabrizio; Gorgé, Olivier; Ramisse, Vincent; Vergnaud, Gilles; Riehm, Julia M.; Scholz, Holger C.; Splettstoesser, Wolf D.; Kieboom, Jasper; Olsen, Jaran-Strand; Fenicia, Lucia; Lista, Florigio

    2011-01-01

    Clostridium botulinum is a taxonomic designation that encompasses a broad variety of spore-forming, Gram-positive bacteria producing the botulinum neurotoxin (BoNT). C. botulinum is the etiologic agent of botulism, a rare but severe neuroparalytic disease. Fine-resolution genetic characterization of C. botulinum isolates of any BoNT type is relevant for both epidemiological studies and forensic microbiology. A 10-locus multiple-locus variable-number tandem-repeat analysis (MLVA) was previously applied to isolates of C. botulinum type A. The present study includes five additional loci designed to better address proteolytic B and F serotypes. We investigated 79 C. botulinum group I strains isolated from human and food samples in several European countries, including types A (28), B (36), AB (4), and F (11) strains, and 5 nontoxic Clostridium sporogenes. Additional data were deduced from in silico analysis of 10 available fully sequenced genomes. This 15-locus MLVA (MLVA-15) scheme identified 86 distinct genotypes that clustered consistently with the results of amplified fragment length polymorphism (AFLP) and MLVA genotyping in previous reports. An MLVA-7 scheme, a subset of the MLVA-15, performed on a lab-on-a-chip device using a nonfluorescent subset of primers, is also proposed as a first-line assay. The phylogenetic grouping obtained with the MLVA-7 does not differ significantly from that generated by the MLVA-15. To our knowledge, this report is the first to analyze genetic variability among all of the C. botulinum group I serotypes by MLVA. Our data provide new insights into the genetic variability of group I C. botulinum isolates worldwide and demonstrate that this group is genetically highly diverse. PMID:22012011

  1. Multiple-locus variable-number tandem-repeats analysis of Listeria monocytogenes using multicolour capillary electrophoresis and comparison with pulsed-field gel electrophoresis typing.

    PubMed

    Lindstedt, Bjørn-Arne; Tham, Wilhelm; Danielsson-Tham, Marie-Louise; Vardund, Traute; Helmersson, Seved; Kapperud, Georg

    2008-02-01

    The multiple-locus variable-number tandem-repeats analysis (MLVA) method for genotyping has proven to be a fast and reliable typing tool in several bacterial species. MLVA is in our laboratory the routine typing method for Salmonella enterica subsp. enterica serovar Typhimurium and Escherichia coli O157. The gram-positive bacteria Listeria monocytogenes, while not isolated as frequent as S. Typhimurium and E. coli, causes severe illness with an overall mortality rate of 30%. Thus, it is important that any outbreak of this pathogen is detected early and a fast trace to the source can be performed. In view of this, we have used the information provided by two fully sequenced L. monocytogenes strains to develop a MLVA assay coupled with high-resolution capillary electrophoresis and compared it to pulsed-field gel electrophoresis (PFGE) in two sets of isolates, one Norwegian (79 isolates) and one Swedish (61 isolates) set. The MLVA assay could resolve all of the L. monocytogenes serotypes tested, and was slightly more discriminatory than PFGE for the Norwegian isolates (28 MLVA profiles and 24 PFGE profiles) and opposite for the Swedish isolates (42 MLVA profiles and 43 PFGE profiles).

  2. Modeling coverage gaps in haplotype frequencies via Bayesian inference to improve stem cell donor selection.

    PubMed

    Louzoun, Yoram; Alter, Idan; Gragert, Loren; Albrecht, Mark; Maiers, Martin

    2018-05-01

    Regardless of sampling depth, accurate genotype imputation is limited in regions of high polymorphism which often have a heavy-tailed haplotype frequency distribution. Many rare haplotypes are thus unobserved. Statistical methods to improve imputation by extending reference haplotype distributions using linkage disequilibrium patterns that relate allele and haplotype frequencies have not yet been explored. In the field of unrelated stem cell transplantation, imputation of highly polymorphic human leukocyte antigen (HLA) genes has an important application in identifying the best-matched stem cell donor when searching large registries totaling over 28,000,000 donors worldwide. Despite these large registry sizes, a significant proportion of searched patients present novel HLA haplotypes. Supporting this observation, HLA population genetic models have indicated that many extant HLA haplotypes remain unobserved. The absent haplotypes are a significant cause of error in haplotype matching. We have applied a Bayesian inference methodology for extending haplotype frequency distributions, using a model where new haplotypes are created by recombination of observed alleles. Applications of this joint probability model offer significant improvement in frequency distribution estimates over the best existing alternative methods, as we illustrate using five-locus HLA frequency data from the National Marrow Donor Program registry. Transplant matching algorithms and disease association studies involving phasing and imputation of rare variants may benefit from this statistical inference framework.

  3. A Genome-Wide Scan for Breast Cancer Risk Haplotypes among African American Women

    PubMed Central

    Song, Chi; Chen, Gary K.; Millikan, Robert C.; Ambrosone, Christine B.; John, Esther M.; Bernstein, Leslie; Zheng, Wei; Hu, Jennifer J.; Ziegler, Regina G.; Nyante, Sarah; Bandera, Elisa V.; Ingles, Sue A.; Press, Michael F.; Deming, Sandra L.; Rodriguez-Gil, Jorge L.; Chanock, Stephen J.; Wan, Peggy; Sheng, Xin; Pooler, Loreall C.; Van Den Berg, David J.; Le Marchand, Loic; Kolonel, Laurence N.; Henderson, Brian E.; Haiman, Chris A.; Stram, Daniel O.

    2013-01-01

    Genome-wide association studies (GWAS) simultaneously investigating hundreds of thousands of single nucleotide polymorphisms (SNP) have become a powerful tool in the investigation of new disease susceptibility loci. Haplotypes are sometimes thought to be superior to SNPs and are promising in genetic association analyses. The application of genome-wide haplotype analysis, however, is hindered by the complexity of haplotypes themselves and sophistication in computation. We systematically analyzed the haplotype effects for breast cancer risk among 5,761 African American women (3,016 cases and 2,745 controls) using a sliding window approach on the genome-wide scale. Three regions on chromosomes 1, 4 and 18 exhibited moderate haplotype effects. Furthermore, among 21 breast cancer susceptibility loci previously established in European populations, 10p15 and 14q24 are likely to harbor novel haplotype effects. We also proposed a heuristic of determining the significance level and the effective number of independent tests by the permutation analysis on chromosome 22 data. It suggests that the effective number was approximately half of the total (7,794 out of 15,645), thus the half number could serve as a quick reference to evaluating genome-wide significance if a similar sliding window approach of haplotype analysis is adopted in similar populations using similar genotype density. PMID:23468962

  4. The JAK2 GGCC (46/1) Haplotype in Myeloproliferative Neoplasms: Causal or Random?

    PubMed

    Anelli, Luisa; Zagaria, Antonella; Specchia, Giorgina; Albano, Francesco

    2018-04-11

    The germline JAK2 haplotype known as "GGCC or 46/1 haplotype" (haplotype GGCC_46/1 ) consists of a combination of single nucleotide polymorphisms (SNPs) mapping in a region of about 250 kb, extending from the JAK2 intron 10 to the Insulin-like 4 ( INLS4 ) gene. Four main SNPs (rs3780367, rs10974944, rs12343867, and rs1159782) generating a "GGCC" combination are more frequently indicated to represent the JAK2 haplotype. These SNPs are inherited together and are frequently associated with the onset of myeloproliferative neoplasms (MPN) positive for both JAK2 V617 and exon 12 mutations. The association between the JAK2 haplotype GGCC_46/1 and mutations in other genes, such as thrombopoietin receptor ( MPL ) and calreticulin ( CALR ), or the association with triple negative MPN, is still controversial. This review provides an overview of the frequency and the role of the JAK2 haplotype GGCC_46/1 in the pathogenesis of different myeloid neoplasms and describes the hypothetical mechanisms at the basis of the association with JAK2 gene mutations. Moreover, possible clinical implications are discussed, as different papers reported contrasting data about the correlation between the JAK2 haplotype GGCC_46/1 and blood cell count, survival, or disease progression.

  5. HLA-A*02 allele frequencies and haplotypic associations in Koreans.

    PubMed

    Park, M H; Whang, D H; Kang, S J; Han, K S

    2000-03-01

    We have investigated the frequencies of HLA-A*02 alleles and their haplotypic associations with HLA-B and -DRB1 loci in 439 healthy unrelated Koreans, including 214 parents from 107 families. All of the 227 samples (51.7%) typed as A2 by serology were analyzed for A*02 alleles using polymerase chain reaction (PCR)-low ionic strength-single-strand conformation polymorphism (LIS-SSCP) method. A total of six different A*02 alleles were detected (A*02 allele frequency 29.6%): A*0201/9 (16.6%), *0203 (0.5%), *0206 (9.3%), *0207 (3.0%), and one each case of *0210 and *02 undetermined type. Two characteristic haplotypes showing the strongest linkage disequilibrium were A*0203-B38-DRB]*1502 and A*0207-B46-DRB1*0803. Besides these strong associations, significant two-locus associations (P<0.001) were observed for A*0201 with B61, DRB1*0901 and DRB1*1401, and for A*0206 with B48 and B61. HLA haplotypes carrying HLA-A2 showed a variable distribution of A*02 alleles, and all of the eight most common A2-B-DR haplotypes occurring at frequencies of > or =1% were variably associated with two different A*02 alleles. These results demonstrate that substantial heterogeneity is present in the distribution of HLA-A*02 alleles and related haplotypes in Koreans.

  6. The Impact of Multilocus Variable-Number Tandem-Repeat Analysis on PulseNet Canada Escherichia coli O157:H7 Laboratory Surveillance and Outbreak Support, 2008-2012.

    PubMed

    Rumore, Jillian Leigh; Tschetter, Lorelee; Nadon, Celine

    2016-05-01

    The lack of pattern diversity among pulsed-field gel electrophoresis (PFGE) profiles for Escherichia coli O157:H7 in Canada does not consistently provide optimal discrimination, and therefore, differentiating temporally and/or geographically associated sporadic cases from potential outbreak cases can at times impede investigations. To address this limitation, DNA sequence-based methods such as multilocus variable-number tandem-repeat analysis (MLVA) have been explored. To assess the performance of MLVA as a supplemental method to PFGE from the Canadian perspective, a retrospective analysis of all E. coli O157:H7 isolated in Canada from January 2008 to December 2012 (inclusive) was conducted. A total of 2285 E. coli O157:H7 isolates and 63 clusters of cases (by PFGE) were selected for the study. Based on the qualitative analysis, the addition of MLVA improved the categorization of cases for 60% of clusters and no change was observed for ∼40% of clusters investigated. In such situations, MLVA serves to confirm PFGE results, but may not add further information per se. The findings of this study demonstrate that MLVA data, when used in combination with PFGE-based analyses, provide additional resolution to the detection of clusters lacking PFGE diversity as well as demonstrate good epidemiological concordance. In addition, MLVA is able to identify cluster-associated isolates with variant PFGE pattern combinations that may have been previously missed by PFGE alone. Optimal laboratory surveillance in Canada is achieved with the application of PFGE and MLVA in tandem for routine surveillance, cluster detection, and outbreak response.

  7. Patterns of linkage disequilibrium and haplotype distribution in disease candidate genes.

    PubMed

    Long, Ji-Rong; Zhao, Lan-Juan; Liu, Peng-Yuan; Lu, Yan; Dvornyk, Volodymyr; Shen, Hui; Liu, Yong-Jun; Zhang, Yuan-Yuan; Xiong, Dong-Hai; Xiao, Peng; Deng, Hong-Wen

    2004-05-24

    The adequacy of association studies for complex diseases depends critically on the existence of linkage disequilibrium (LD) between functional alleles and surrounding SNP markers. We examined the patterns of LD and haplotype distribution in eight candidate genes for osteoporosis and/or obesity using 31 SNPs in 1,873 subjects. These eight genes are apolipoprotein E (APOE), type I collagen alpha1 (COL1A1), estrogen receptor-alpha (ER-alpha), leptin receptor (LEPR), parathyroid hormone (PTH)/PTH-related peptide receptor type 1 (PTHR1), transforming growth factor-beta1 (TGF-beta1), uncoupling protein 3 (UCP3), and vitamin D (1,25-dihydroxyvitamin D3) receptor (VDR). Yin yang haplotypes, two high-frequency haplotypes composed of completely mismatching SNP alleles, were examined. To quantify LD patterns, two common measures of LD, D' and r2, were calculated for the SNPs within the genes. The haplotype distribution varied in the different genes. Yin yang haplotypes were observed only in PTHR1 and UCP3. D' ranged from 0.020 to 1.000 with the average of 0.475, whereas the average r2 was 0.158 (ranging from 0.000 to 0.883). A decay of LD was observed as the intermarker distance increased, however, there was a great difference in LD characteristics of different genes or even in different regions within gene. The differences in haplotype distributions and LD patterns among the genes underscore the importance of characterizing genomic regions of interest prior to association studies.

  8. Highly diverse variable number tandem repeat loci in the E. coli O157:H7 and O55:H7 genomes for high-resolution molecular typing.

    PubMed

    Keys, C; Kemper, S; Keim, P

    2005-01-01

    Evaluation of the Escherichia coli genome for variable number tandem repeat (VNTR) loci in order to provide a subtyping tool with greater discrimination and more efficient capacity. Twenty-nine putative VNTR loci were identified from the E. coli genomic sequence. Their variability was validated by characterizing the number of repeats at each locus in a set of 56 E. coli O157:H7/HN and O55:H7 isolates. An optimized multiplex assay system was developed to facility high capacity analysis. Locus diversity values ranged from 0.23 to 0.95 while the number of alleles ranged from two to 29. This multiple-locus VNTR analysis (MLVA) data was used to describe genetic relationships among these isolates and was compared with PFGE (pulse field gel electrophoresis) data from a subset of the same strains. Genetic similarity values were highly correlated between the two approaches, through MLVA was capable of discrimination amongst closely related isolates when PFGE similar values were equal to 1.0. Highly variable VNTR loci exist in the E. coli O157:H7 genome and are excellent estimators of genetic relationships, in particular for closely related isolates. Escherichia coli O157:H7 MLVA offers a complimentary analysis to the more traditional PFGE approach. Application of MLVA to an outbreak cluster could generate superior molecular epidemiology and result in a more effective public health response.

  9. Selection and Validation of a Multilocus Variable-Number Tandem-Repeat Analysis Panel for Typing Shigella spp.▿ †

    PubMed Central

    Gorgé, Olivier; Lopez, Stéphanie; Hilaire, Valérie; Lisanti, Olivier; Ramisse, Vincent; Vergnaud, Gilles

    2008-01-01

    The Shigella genus has historically been separated into four species, based on biochemical assays. The classification within each species relies on serotyping. Recently, genome sequencing and DNA assays, in particular the multilocus sequence typing (MLST) approach, greatly improved the current knowledge of the origin and phylogenetic evolution of Shigella spp. The Shigella and Escherichia genera are now considered to belong to a unique genomospecies. Multilocus variable-number tandem-repeat (VNTR) analysis (MLVA) provides valuable polymorphic markers for genotyping and performing phylogenetic analyses of highly homogeneous bacterial pathogens. Here, we assess the capability of MLVA for Shigella typing. Thirty-two potentially polymorphic VNTRs were selected by analyzing in silico five Shigella genomic sequences and subsequently evaluated. Eventually, a panel of 15 VNTRs was selected (i.e., MLVA15 analysis). MLVA15 analysis of 78 strains or genome sequences of Shigella spp. and 11 strains or genome sequences of Escherichia coli distinguished 83 genotypes. Shigella population cluster analysis gave consistent results compared to MLST. MLVA15 analysis showed capabilities for E. coli typing, providing classification among pathogenic and nonpathogenic E. coli strains included in the study. The resulting data can be queried on our genotyping webpage (http://mlva.u-psud.fr). The MLVA15 assay is rapid, highly discriminatory, and reproducible for Shigella and Escherichia strains, suggesting that it could significantly contribute to epidemiological trace-back analysis of Shigella infections and pathogenic Escherichia outbreaks. Typing was performed on strains obtained mostly from collections. Further studies should include strains of much more diverse origins, including all pathogenic E. coli types. PMID:18216214

  10. Variation and Evolution in the Glutamine-Rich Repeat Region of Drosophila Argonaute-2

    PubMed Central

    Palmer, William H.; Obbard, Darren J.

    2016-01-01

    RNA interference pathways mediate biological processes through Argonaute-family proteins, which bind small RNAs as guides to silence complementary target nucleic acids . In insects and crustaceans Argonaute-2 silences viral nucleic acids, and therefore acts as a primary effector of innate antiviral immunity. Although the function of the major Argonaute-2 domains, which are conserved across most Argonaute-family proteins, are known, many invertebrate Argonaute-2 homologs contain a glutamine-rich repeat (GRR) region of unknown function at the N-terminus . Here we combine long-read amplicon sequencing of Drosophila Genetic Reference Panel (DGRP) lines with publicly available sequence data from many insect species to show that this region evolves extremely rapidly and is hyper-variable within species. We identify distinct GRR haplotype groups in Drosophila melanogaster, and suggest that one of these haplotype groups has recently risen to high frequency in a North American population. Finally, we use published data from genome-wide association studies of viral resistance in D. melanogaster to test whether GRR haplotypes are associated with survival after virus challenge. We find a marginally significant association with survival after challenge with Drosophila C Virus in the DGRP, but we were unable to replicate this finding using lines from the Drosophila Synthetic Population Resource panel. PMID:27317784

  11. A Near-Complete Haplotype-Phased Genome of the Dikaryotic Wheat Stripe Rust Fungus Puccinia striiformis f. sp. tritici Reveals High Interhaplotype Diversity

    PubMed Central

    Sperschneider, Jana; Garnica, Diana P.; Miller, Marisa E.; Taylor, Jennifer M.; Dodds, Peter N.; Park, Robert F.

    2018-01-01

    ABSTRACT A long-standing biological question is how evolution has shaped the genomic architecture of dikaryotic fungi. To answer this, high-quality genomic resources that enable haplotype comparisons are essential. Short-read genome assemblies for dikaryotic fungi are highly fragmented and lack haplotype-specific information due to the high heterozygosity and repeat content of these genomes. Here, we present a diploid-aware assembly of the wheat stripe rust fungus Puccinia striiformis f. sp. tritici based on long reads using the FALCON-Unzip assembler. Transcriptome sequencing data sets were used to infer high-quality gene models and identify virulence genes involved in plant infection referred to as effectors. This represents the most complete Puccinia striiformis f. sp. tritici genome assembly to date (83 Mb, 156 contigs, N50 of 1.5 Mb) and provides phased haplotype information for over 92% of the genome. Comparisons of the phase blocks revealed high interhaplotype diversity of over 6%. More than 25% of all genes lack a clear allelic counterpart. When we investigated genome features that potentially promote the rapid evolution of virulence, we found that candidate effector genes are spatially associated with conserved genes commonly found in basidiomycetes. Yet, candidate effectors that lack an allelic counterpart are more distant from conserved genes than allelic candidate effectors and are less likely to be evolutionarily conserved within the P. striiformis species complex and Pucciniales. In summary, this haplotype-phased assembly enabled us to discover novel genome features of a dikaryotic plant-pathogenic fungus previously hidden in collapsed and fragmented genome assemblies. PMID:29463659

  12. VNTR alleles associated with the {alpha}-globin locus are haplotype and population related

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Martinson, J.J.; Clegg, J.B.; Boyce, A.J.

    1994-09-01

    The human {alpha}-globin complex contains several polymorphic restriction-enzyme sites (i.e., RFLPs) linked to form haplotypes and is flanked by two hypervariable VNTR loci, the 5{prime} hypervariable region (HVR) and the more highly polymorphic 3{prime}HVR. Using a combination of RFLP analysis and PCR, the authors have characterized the 5{prime}HVR and 3{prime}HVR alleles associated with the {alpha}-globin haplotypes of 133 chromosomes, and they here show that specific {alpha}-globin haplotypes are each associated with discrete subsets of the alleles observed at these two VNTR loci. This statistically highly significant association is observed over a region spanning {approximately} 100 kb. With the exception ofmore » closely related haplotypes, different haplotypes do not share identically sized 3{prime}HVR alleles. Earlier studies have shown that {alpha}-globin haplotype distributions differ between populations; the current findings also reveal extensive population substructure in the repertoire of {alpha}-globin VNTRs. If similar features are characteristic of other VNTR loci, this will have important implications for forensic and anthropological studies. 42 refs., 5 figs., 5 tabs.« less

  13. Genetic considerations in human sex-mate selection: partners share human leukocyte antigen but not short-tandem-repeat identity markers.

    PubMed

    Israeli, Moshe; Kristt, Don; Nardi, Yuval; Klein, Tirza

    2014-05-01

    Previous studies support a role for MHC on mating preference, yet it remains unsettled as to whether mating occurs preferentially between individuals sharing human leukocyte antigen (HLA) determinants or not. Investigating sex-mate preferences in the contemporary Israeli population is of further curiosity being a population with distinct genetic characteristics, where multifaceted cultural considerations influence mate selection. Pairs of male-female sex partners were evaluated in three groups. Two groups represented unmarried (n = 1002) or married (n = 308) couples and a control group of fictitious male-female couples. HLA and short-tandem-repeat (STR) genetic identification markers were assessed for the frequency of shared antigens and alleles. Human leukocyte antigen results showed that Class I and/ or Class II single antigen as well as double antigen sharing was more common in sex partners than in control group couples (P < 0.001). Married versus unmarried pairs were not distinguishable. In contrast, STR-DNA markers failed to differentiate between sex-mates and controls (P = 0.78). Sex partnerships shared HLA determinants more frequently than randomly constituted male-female pairs. The observed phenomenon does not reflect a syngenetic background between sex-mates as STR markers were not selectively shared. Thus, sex-mate selection in man may contravene the evolutionary pressure for genetic diversity in regard to HLA. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  14. Locus-specific mutational events in a multilocus variable-number tandem repeat analysis of Escherichia coli O157:H7.

    PubMed

    Noller, Anna C; McEllistrem, M Catherine; Shutt, Kathleen A; Harrison, Lee H

    2006-02-01

    Multilocus variable-number tandem repeat analysis (MLVA) is a validated molecular subtyping method for detecting and evaluating Escherichia coli O157:H7 outbreaks. In a previous study, five outbreaks with a total of 21 isolates were examined by MLVA. Nearly 20% of the epidemiologically linked strains were single-locus variants (SLV) of their respective predominant outbreak clone. This result prompted an investigation into the mutation rates of the seven MLVA loci (TR1 to TR7). With an outbreak strain that was an SLV at the TR1 locus of the predominant clone, parallel and serial batch culture experiments were performed. In a parallel experiment, none (0/384) of the strains analyzed had mutations at the seven MLVA loci. In contrast, in the two 5-day serial experiments, 4.3% (41/960) of the strains analyzed had a significant variation in at least one of these loci (P < 0.001). The TR2 locus accounted for 85.3% (35/41) of the mutations, with an average mutation rate of 3.5 x 10(-3); the mutations rates for TR1 and TR5 were 10-fold lower. Single additions accounted for 77.1% (27/35) of the mutation events in TR2 and all (6/6) of the additions in TR1 and TR5. The remaining four loci had no slippage events detected. The mutation rates were locus specific and may impact the interpretation of MLVA data for epidemiologic investigations.

  15. A powerful approach reveals numerous expression quantitative trait haplotypes in multiple tissues.

    PubMed

    Ying, Dingge; Li, Mulin Jun; Sham, Pak Chung; Li, Miaoxin

    2018-04-26

    Recently many studies showed single nucleotide polymorphisms (SNPs) affect gene expression and contribute to development of complex traits/diseases in a tissue context-dependent manner. However, little is known about haplotype's influence on gene expression and complex traits, which reflects the interaction effect between SNPs. In the present study, we firstly proposed a regulatory region guided eQTL haplotype association analysis approach, and then systematically investigate the expression quantitative trait loci (eQTL) haplotypes in 20 different tissues by the approach. The approach has a powerful design of reducing computational burden by the utilization of regulatory predictions for candidate SNP selection and multiple testing corrections on non-independent haplotypes. The application results in multiple tissues showed that haplotype-based eQTLs not only increased the number of eQTL genes in a tissue specific manner, but were also enriched in loci that associated with complex traits in a tissue-matched manner. In addition, we found that tag SNPs of eQTL haplotypes from whole blood were selectively enriched in certain combination of regulatory elements (e.g. promoters and enhancers) according to predicted chromatin states. In summary, this eQTL haplotype detection approach, together with the application results, shed insights into synergistic effect of sequence variants on gene expression and their susceptibility to complex diseases. The executable application "eHaplo" is implemented in Java and is publicly available at http://grass.cgs.hku.hk/limx/ehaplo/. jonsonfox@gmail.com, limiaoxin@mail.sysu.edu.cn. Supplementary data are available at Bioinformatics online.

  16. Disease-associated repeat instability and mismatch repair.

    PubMed

    Schmidt, Monika H M; Pearson, Christopher E

    2016-02-01

    Expanded tandem repeat sequences in DNA are associated with at least 40 human genetic neurological, neurodegenerative, and neuromuscular diseases. Repeat expansion can occur during parent-to-offspring transmission, and arise at variable rates in specific tissues throughout the life of an affected individual. Since the ongoing somatic repeat expansions can affect disease age-of-onset, severity, and progression, targeting somatic expansion holds potential as a therapeutic target. Thus, understanding the factors that regulate this mutation is crucial. DNA repair, in particular mismatch repair (MMR), is the major driving force of disease-associated repeat expansions. In contrast to its anti-mutagenic roles, mammalian MMR curiously drives the expansion mutations of disease-associated (CAG)·(CTG) repeats. Recent advances have broadened our knowledge of both the MMR proteins involved in disease repeat expansions, including: MSH2, MSH3, MSH6, MLH1, PMS2, and MLH3, as well as the types of repeats affected by MMR, now including: (CAG)·(CTG), (CGG)·(CCG), and (GAA)·(TTC) repeats. Mutagenic slipped-DNA structures have been detected in patient tissues, and the size of the slip-out and their junction conformation can determine the involvement of MMR. Furthermore, the formation of other unusual DNA and R-loop structures is proposed to play a key role in MMR-mediated instability. A complex correlation is emerging between tissues showing varying amounts of repeat instability and MMR expression levels. Notably, naturally occurring polymorphic variants of DNA repair genes can have dramatic effects upon the levels of repeat instability, which may explain the variation in disease age-of-onset, progression and severity. An increasing grasp of these factors holds prognostic and therapeutic potential. Copyright © 2015 Elsevier B.V. All rights reserved.

  17. Haplotype-based approach to known MS-associated regions increases the amount of explained risk

    PubMed Central

    Khankhanian, Pouya; Gourraud, Pierre-Antoine; Lizee, Antoine; Goodin, Douglas S

    2015-01-01

    Genome-wide association studies (GWAS), using single nucleotide polymorphisms (SNPs), have yielded 110 non-human leucocyte antigen genomic regions that are associated with multiple sclerosis (MS). Despite this large number of associations, however, only 28% of MS-heritability can currently be explained. Here we compare the use of multi-SNP-haplotypes to the use of single-SNPs as alternative methods to describe MS genetic risk. SNP-haplotypes (of various lengths from 1 up to 15 contiguous SNPs) were constructed at each of the 110 previously identified, MS-associated, genomic regions. Even after correcting for the larger number of statistical comparisons made when using the haplotype-method, in 32 of the regions, the SNP-haplotype based model was markedly more significant than the single-SNP based model. By contrast, in no region was the single-SNP based model similarly more significant than the SNP-haplotype based model. Moreover, when we included the 932 MS-associated SNP-haplotypes (that we identified from 102 regions) as independent variables into a logistic linear model, the amount of MS-heritability, as assessed by Nagelkerke's R-squared, was 38%, which was considerably better than 29%, which was obtained by using only single-SNPs. This study demonstrates that SNP-haplotypes can be used to fine-map the genetic associations within regions of interest previously identified by single-SNP GWAS. Moreover, the amount of the MS genetic risk explained by the SNP-haplotype associations in the 110 MS-associated genomic regions was considerably greater when using SNP-haplotypes than when using single-SNPs. Also, the use of SNP-haplotypes can lead to the discovery of new regions of interest, which have not been identified by a single-SNP GWAS. PMID:26185143

  18. Mapping of HLA- DQ haplotypes in a group of Danish patients with celiac disease.

    PubMed

    Lund, Flemming; Hermansen, Mette N; Pedersen, Merete F; Hillig, Thore; Toft-Hansen, Henrik; Sölétormos, György

    2015-10-01

    A cost-effective identification of HLA- DQ risk haplotypes using the single nucleotide polymorphism (SNP) technique has recently been applied in the diagnosis of celiac disease (CD) in four European populations. The objective of the study was to map risk HLA- DQ haplotypes in a group of Danish CD patients using the SNP technique. Cohort A: Among 65 patients with gastrointestinal symptoms we compared the HLA- DQ2 and HLA- DQ8 risk haplotypes obtained by the SNP technique (method 1) with results based on a sequence specific primer amplification technique (method 2) and a technique used in an assay from BioDiagene (method 3). Cohort B: 128 patients with histologically verified CD were tested for CD risk haplotypes (method 1). Patients with negative results were further tested for sub-haplotypes of HLA- DQ2 (methods 2 and 3). Cohort A: The three applied methods provided the same HLA- DQ2 and HLA- DQ8 results among 61 patients. Four patients were negative for the HLA- DQ2 and HLA- DQ8 haplotypes (method 1) but were positive for the HLA- DQ2.5-trans and HLA- DQ2.2 haplotypes (methods 2 and 3). Cohort B: A total of 120 patients were positive for the HLA- DQ2.5-cis and HLA- DQ8 haplotypes (method 1). The remaining seven patients were positive for HLA- DQ2.5-trans or HLA- DQ2.2 haplotypes (methods 2 and 3). One patient was negative with all three HLA methods. The HLA- DQ risk haplotypes were detected in 93.8% of the CD patients using the SNP technique (method 1). The sensitivity increased to 99.2% by combining methods 1 - 3.

  19. The RNase P RNA from cyanobacteria: short tandemly repeated repetitive (STRR) sequences are present within the RNase P RNA gene in heterocyst-forming cyanobacteria.

    PubMed Central

    Vioque, A

    1997-01-01

    The RNase P RNA gene (rnpB) from 10 cyanobacteria has been characterized. These new RNAs, together with the previously available ones, provide a comprehensive data set of RNase P RNA from diverse cyanobacterial lineages. All heterocystous cyanobacteria, but none of the non-heterocystous strains analyzed, contain short tandemly repeated repetitive (STRR) sequences that increase the length of helix P12. Site-directed mutagenesis experiments indicate that the STRR sequences are not required for catalytic activity in vitro. STRR sequences seem to have recently and independently invaded the RNase P RNA genes in heterocyst-forming cyanobacteria because closely related strains contain unrelated STRR sequences. Most cyanobacteria RNase P RNAs lack the sequence GGU in the loop connecting helices P15 and P16 that has been established to interact with the 3'-end CCA in precursor tRNA substrates in other bacteria. This character is shared with plastid RNase P RNA. Helix P6 is longer than usual in most cyanobacteria as well as in plastid RNase P RNA. PMID:9254706

  20. Assessing transmission of ‘Candidatus Liberibacter solanacearum’ haplotypes through seed potato

    USDA-ARS?s Scientific Manuscript database

    Conflicting data has previously been reported concerning the impact of zebra chip disease transmission through seed tubers. These discrepancies may be due to the experimental design of each study, whereby different pathogen haplotypes, insect vector haplotypes, and potato plant varieties were used....

  1. COMT haplotypes, catecholamine metabolites in plasma and clinical response in schizophrenic and bipolar patients.

    PubMed

    Zumárraga, Mercedes; Arrúe, Aurora; Basterreche, Nieves; Macías, Isabel; Catalán, Ana; Madrazo, Arantza; Bustamante, Sonia; Zamalloa, María I; Erkoreka, Leire; Gordo, Estibaliz; Arnaiz, Ainara; Olivas, Olga; Arroita, Ariane; Marín, Elena; González-Torres, Miguel A

    2016-06-01

    We examined the association of COMT haplotypes and plasma metabolites of catecholamines in relation to the clinical response to antipsychotics in schizophrenic and bipolar patients. We studied 165 patients before and after four weeks of treatment, and 163 healthy controls. We assessed four COMT haplotypes and the plasma concentrations of HVA, DOPAC and MHPG. Bipolar patients: haplotypes are associated with age at onset and clinical evolution. In schizophrenic patients, an haplotype previously associated with increased risk, is related to better response of negative symptoms. Haplotypes would be good indicators of the clinical status and the treatment response in bipolar and schizophrenic patients. Larger studies are required to elucidate the clinical usefulness of these findings.

  2. Two families from New England with usher syndrome type IC with distinct haplotypes.

    PubMed

    DeAngelis, M M; McGee, T L; Keats, B J; Slim, R; Berson, E L; Dryja, T P

    2001-03-01

    To search for patients with Usher syndrome type IC among those with Usher syndrome type I who reside in New England. Genotype analysis of microsatellite markers closely linked to the USH1C locus was done using the polymerase chain reaction. We compared the haplotype of our patients who were homozygous in the USH1C region with the haplotypes found in previously reported USH1C Acadian families who reside in southwestern Louisiana and from a single family residing in Lebanon. Of 46 unrelated cases of Usher syndrome type I residing in New England, two were homozygous at genetic markers in the USH1C region. Of these, one carried the Acadian USH1C haplotype and had Acadian ancestors (that is, from Nova Scotia) who did not participate in the 1755 migration of Acadians to Louisiana. The second family had a haplotype that proved to be the same as that of a family with USH1C residing in Lebanon. Each of the two families had haplotypes distinct from the other. This is the first report that some patients residing in New England have Usher syndrome type IC. Patients with Usher syndrome type IC can have the Acadian haplotype or the Lebanese haplotype compatible with the idea that at least two independently arising pathogenic mutations have occurred in the yet-to-be identified USH1C gene.

  3. Fetal hemoglobin in sickle cell anemia: The Arab-Indian haplotype and new therapeutic agents.

    PubMed

    Habara, Alawi H; Shaikho, Elmutaz M; Steinberg, Martin H

    2017-11-01

    Fetal hemoglobin (HbF) has well-known tempering effects on the symptoms of sickle cell disease and its levels vary among patients with different haplotypes of the sickle hemoglobin gene. Compared with sickle cell anemia haplotypes found in patients of African descent, HbF levels in Saudi and Indian patients with the Arab-Indian (AI) haplotype exceed that in any other haplotype by nearly twofold. Genetic association studies have identified some loci associated with high HbF in the AI haplotype but these observations require functional confirmation. Saudi patients with the Benin haplotype have HbF levels almost twice as high as African patients with this haplotype but this difference is unexplained. Hydroxyurea is still the only FDA approved drug for HbF induction in sickle cell disease. While most patients treated with hydroxyurea have an increase in HbF and some clinical improvement, 10 to 20% of adults show little response to this agent. We review the genetic basis of HbF regulation focusing on sickle cell anemia in Saudi Arabia and discuss new drugs that can induce increased levels of HbF. © 2017 Wiley Periodicals, Inc.

  4. Two Orangutan Species Have Evolved Different KIR Alleles and Haplotypes1

    PubMed Central

    Guethlein, Lisbeth A.; Norman, Paul J.; Heijmans, Corinne M. C.; de Groot, Natasja G.; Hilton, Hugo G.; Babrzadeh, Farbod; Abi-Rached, Laurent; Bontrop, Ronald E.; Parham, Peter

    2017-01-01

    The immune and reproductive functions of human Natural Killer (NK) cells are regulated by interactions of the C1 and C2 epitopes of HLA-C with C1-specific and C2-specific lineage III killer cell immunoglobulin-like receptors (KIR). This rapidly evolving and diverse system of ligands and receptors is restricted to humans and great apes. In this context, the orangutan has particular relevance because it represents an evolutionary intermediate, one having the C1 epitope and corresponding KIR, but lacking the C2 epitope. Through a combination of direct sequencing, KIR genotyping and data mining from the Great Ape Genome Project (GAGP) we characterized the KIR alleles and haplotypes for panels of ten Bornean orangutans and 19 Sumatran orangutans. The orangutan KIR haplotypes have between five and ten KIR genes. The seven orangutan lineage III KIR genes all locate to the centromeric region of the KIR locus, whereas their human counterparts also populate the telomeric region. One lineage III KIR gene is Bornean-specific, one is Sumatran-specific and five are shared. Of twelve KIR gene-content haplotypes five are Bornean-specific, five are Sumatran-specific and two are shared. The haplotypes have different combinations of genes encoding activating and inhibitory C1 receptors that can be of higher or lower affinity. All haplotypes encode an inhibitory C1 receptor, but only some haplotypes encode an activating C1 receptor. Of 130 KIR alleles, 55 are Bornean-specific, 65 are Sumatran specific and ten are shared. PMID:28264973

  5. Native and European haplotypes of Phragmites Australis (common reed) in the central Platte River, Nebraska

    USGS Publications Warehouse

    Larson, D.L.; Galatowitsch, S.M.; Larson, J.L.

    2011-01-01

    Phragmites australis (common reed) is known to have occurred along the Platte River historically, but recent rapid increases in both distribution and density have begun to impact habitat for migrating sandhill cranes and nesting piping plovers and least terns. Invasiveness in Phragmites has been associated with the incursion of a European genotype (haplotype M) in other areas; determining the genotype of Phragmites along the central Platte River has implications for proper management of the river system. In 2008 we sampled Phragmites patches along the central Platte River from Lexington to Chapman, NE, stratified by bridge segments, to determine the current distribution of haplotype E (native) and haplotype M genotypes. In addition, we did a retrospective analysis of historical Phragmites collections from the central Platte watershed (1902-2006) at the Bessey Herbarium. Fresh tissue from the 2008 survey and dried tissue from the herbarium specimens were classified as haplotype M or E using the restriction fragment length polymorphism procedure. The European haplotype was predominant in the 2008 samples: only 14 Phragmites shoots were identified as native haplotype E; 224 were non-native haplotype M. The retrospective analysis revealed primarily native haplotype individuals. Only collections made in Lancaster County, near Lincoln, NE, were haplotype M, and the earliest of these was collected in 1973. ?? 2011 Copyright by the Center for Great Plains Studies, University of Nebraska-Lincoln.

  6. A strategy of gene overexpression based on tandem repetitive promoters in Escherichia coli.

    PubMed

    Li, Mingji; Wang, Junshu; Geng, Yanping; Li, Yikui; Wang, Qian; Liang, Quanfeng; Qi, Qingsheng

    2012-02-06

    For metabolic engineering, many rate-limiting steps may exist in the pathways of accumulating the target metabolites. Increasing copy number of the desired genes in these pathways is a general method to solve the problem, for example, the employment of the multi-copy plasmid-based expression system. However, this method may bring genetic instability, structural instability and metabolic burden to the host, while integrating of the desired gene into the chromosome may cause inadequate transcription or expression. In this study, we developed a strategy for obtaining gene overexpression by engineering promoter clusters consisted of multiple core-tac-promoters (MCPtacs) in tandem. Through a uniquely designed in vitro assembling process, a series of promoter clusters were constructed. The transcription strength of these promoter clusters showed a stepwise enhancement with the increase of tandem repeats number until it reached the critical value of five. Application of the MCPtacs promoter clusters in polyhydroxybutyrate (PHB) production proved that it was efficient. Integration of the phaCAB genes with the 5CPtacs promoter cluster resulted in an engineered E.coli that can accumulate 23.7% PHB of the cell dry weight in batch cultivation. The transcription strength of the MCPtacs promoter cluster can be greatly improved by increasing the tandem repeats number of the core-tac-promoter. By integrating the desired gene together with the MCPtacs promoter cluster into the chromosome of E. coli, we can achieve high and stale overexpression with only a small size. This strategy has an application potential in many fields and can be extended to other bacteria.

  7. Haplotype-Based Genome-Wide Prediction Models Exploit Local Epistatic Interactions Among Markers

    PubMed Central

    Jiang, Yong; Schmidt, Renate H.; Reif, Jochen C.

    2018-01-01

    Genome-wide prediction approaches represent versatile tools for the analysis and prediction of complex traits. Mostly they rely on marker-based information, but scenarios have been reported in which models capitalizing on closely-linked markers that were combined into haplotypes outperformed marker-based models. Detailed comparisons were undertaken to reveal under which circumstances haplotype-based genome-wide prediction models are superior to marker-based models. Specifically, it was of interest to analyze whether and how haplotype-based models may take local epistatic effects between markers into account. Assuming that populations consisted of fully homozygous individuals, a marker-based model in which local epistatic effects inside haplotype blocks were exploited (LEGBLUP) was linearly transformable into a haplotype-based model (HGBLUP). This theoretical derivation formally revealed that haplotype-based genome-wide prediction models capitalize on local epistatic effects among markers. Simulation studies corroborated this finding. Due to its computational efficiency the HGBLUP model promises to be an interesting tool for studies in which ultra-high-density SNP data sets are studied. Applying the HGBLUP model to empirical data sets revealed higher prediction accuracies than for marker-based models for both traits studied using a mouse panel. In contrast, only a small subset of the traits analyzed in crop populations showed such a benefit. Cases in which higher prediction accuracies are observed for HGBLUP than for marker-based models are expected to be of immediate relevance for breeders, due to the tight linkage a beneficial haplotype will be preserved for many generations. In this respect the inheritance of local epistatic effects very much resembles the one of additive effects. PMID:29549092

  8. Haplotype-Based Genome-Wide Prediction Models Exploit Local Epistatic Interactions Among Markers.

    PubMed

    Jiang, Yong; Schmidt, Renate H; Reif, Jochen C

    2018-05-04

    Genome-wide prediction approaches represent versatile tools for the analysis and prediction of complex traits. Mostly they rely on marker-based information, but scenarios have been reported in which models capitalizing on closely-linked markers that were combined into haplotypes outperformed marker-based models. Detailed comparisons were undertaken to reveal under which circumstances haplotype-based genome-wide prediction models are superior to marker-based models. Specifically, it was of interest to analyze whether and how haplotype-based models may take local epistatic effects between markers into account. Assuming that populations consisted of fully homozygous individuals, a marker-based model in which local epistatic effects inside haplotype blocks were exploited (LEGBLUP) was linearly transformable into a haplotype-based model (HGBLUP). This theoretical derivation formally revealed that haplotype-based genome-wide prediction models capitalize on local epistatic effects among markers. Simulation studies corroborated this finding. Due to its computational efficiency the HGBLUP model promises to be an interesting tool for studies in which ultra-high-density SNP data sets are studied. Applying the HGBLUP model to empirical data sets revealed higher prediction accuracies than for marker-based models for both traits studied using a mouse panel. In contrast, only a small subset of the traits analyzed in crop populations showed such a benefit. Cases in which higher prediction accuracies are observed for HGBLUP than for marker-based models are expected to be of immediate relevance for breeders, due to the tight linkage a beneficial haplotype will be preserved for many generations. In this respect the inheritance of local epistatic effects very much resembles the one of additive effects. Copyright © 2018 Jiang et al.

  9. Performance of Single Nucleotide Polymorphisms versus Haplotypes for Genome-Wide Association Analysis in Barley

    PubMed Central

    Jannink, Jean-Luc

    2010-01-01

    Genome-wide association studies (GWAS) may benefit from utilizing haplotype information for making marker-phenotype associations. Several rationales for grouping single nucleotide polymorphisms (SNPs) into haplotype blocks exist, but any advantage may depend on such factors as genetic architecture of traits, patterns of linkage disequilibrium in the study population, and marker density. The objective of this study was to explore the utility of haplotypes for GWAS in barley (Hordeum vulgare) to offer a first detailed look at this approach for identifying agronomically important genes in crops. To accomplish this, we used genotype and phenotype data from the Barley Coordinated Agricultural Project and constructed haplotypes using three different methods. Marker-trait associations were tested by the efficient mixed-model association algorithm (EMMA). When QTL were simulated using single SNPs dropped from the marker dataset, a simple sliding window performed as well or better than single SNPs or the more sophisticated methods of blocking SNPs into haplotypes. Moreover, the haplotype analyses performed better 1) when QTL were simulated as polymorphisms that arose subsequent to marker variants, and 2) in analysis of empirical heading date data. These results demonstrate that the information content of haplotypes is dependent on the particular mutational and recombinational history of the QTL and nearby markers. Analysis of the empirical data also confirmed our intuition that the distribution of QTL alleles in nature is often unlike the distribution of marker variants, and hence utilizing haplotype information could capture associations that would elude single SNPs. We recommend routine use of both single SNP and haplotype markers for GWAS to take advantage of the full information content of the genotype data. PMID:21124933

  10. Genetic analysis of autoimmune regulator haplotypes in alopecia areata.

    PubMed

    Wengraf, D A; McDonagh, A J G; Lovewell, T R J; Vasilopoulos, Y; Macdonald-Hull, S P; Cork, M J; Messenger, A G; Tazi-Ahnini, R

    2008-03-01

    Alopecia areata is an immune-mediated disorder, occurring with the highest observed frequency in the rare recessive autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED) syndrome caused by mutations of the autoimmune regulator (AIRE) gene on chromosome 21q22.3. We have previously detected association between alopecia areata and a single nucleotide polymorphism (SNP) in the AIRE gene in patients without APECED, and we now report the findings of an extended examination of the association of alopecia areata with haplotype analysis including six SNPs in the AIRE gene: C-103T, C4144G, T5238C, G6528A, T7215C and T11787C. In Caucasian groups of 295 patients and 363 controls, we found strong association between the AIRE 7215C allele and AA [P = 3.8 x 10(-8), OR (95% CI): 2.69 (1.8-4.0)]. The previously reported association between AA and the AIRE 4144G allele was no longer significant on correction for multiple testing. The AIRE haplotypes CCTGCT and CGTGCC showed a highly significant association with AA [P = 6.05 x 10(-6), 9.47 (2.91-30.8) and P = 0.001, 3.51 (1.55-7.95), respectively]. To select the haplotypes most informative for analysis, we tagged the polymorphisms using SNPTag software. Employing AIRE C-103T, G6528A, T7215C and T11787C as tag SNPs, two haplotypes were associated with AA; AIRE CGCT and AIRE CGCC [P = 3.84 x 10(-7), 11.40 (3.53-36.9) and P = 3.94 x 10(-4), 2.13 (1.39-3.24) respectively]. The AIRE risk haplotypes identified in this study potentially account for a major component of the genetic risk of developing alopecia areata.

  11. The "Sardinian" HLA-A30,B18,DR3,DQw2 haplotype constantly lacks the 21-OHA and C4B genes. Is it an ancestral haplotype without duplication?

    PubMed

    Contu, L; Carcassi, C; Dausset, J

    1989-01-01

    The C4 and 21-OH loci of the class III HLA have been studied by specific DNA probes and the restriction enzyme Taq 1 in 24 unrelated Sardinian individuals selected from completely HLA-typed families. All 24 individuals had the HLA extended haplotype A30,Cw5,B18, BfF1,DR3,DRw52,DQw2, named "Sardinian" in the present paper because of its frequency of 15% in the Sardinian population. Eighteen of these were homozygous for the entire haplotype, and six were heterozygous at the A locus and blank (or homozygous) at all the other loci. In all completely homozygous cells and in four heterozygous cells at the A locus, the restriction fragments of the 21-OHA (3.2 kb) and C4B (5.8 kb or 5.4 kb) genes were absent, and the fragments of the C4A (7.0 kb) and 21-OHB (3.7 kb) genes were present. It is suggested that the "Sardinian" haplotype is an ancestral haplotype without duplication of the C4 and 21-OH genes, practically always identical in its structure, also in unrelated individuals. The diversity of this haplotype in the class III region (about 30 kb less) may be at least partially responsible for its misalignment with most haplotypes, which have duplicated C4 and 21-OH genes, and therefore also for its decreased probability to recombine. This can help explain its high stability and frequency in the Sardinian population. The same conclusion can be suggested for the Caucasian extended haplotype A1,B8,DR3 that always seems to lack the C4A and 21-OHA genes.

  12. Haplotypes of CYP3A4 and their close linkage with CYP3A5 haplotypes in a Japanese population.

    PubMed

    Fukushima-Uesaka, Hiromi; Saito, Yoshiro; Watanabe, Hidemi; Shiseki, Kisho; Saeki, Mayumi; Nakamura, Takahiro; Kurose, Kouichi; Sai, Kimie; Komamura, Kazuo; Ueno, Kazuyuki; Kamakura, Shiro; Kitakaze, Masafumi; Hanai, Sotaro; Nakajima, Toshiharu; Matsumoto, Kenji; Saito, Hirohisa; Goto, Yu-ichi; Kimura, Hideo; Katoh, Masaaki; Sugai, Kenji; Minami, Narihiro; Shirao, Kuniaki; Tamura, Tomohide; Yamamoto, Noboru; Minami, Hironobu; Ohtsu, Atsushi; Yoshida, Teruhiko; Saijo, Nagahiro; Kitamura, Yutaka; Kamatani, Naoyuki; Ozawa, Shogo; Sawada, Jun-ichi

    2004-01-01

    In order to identify single nucleotide polymorphisms (SNPs) and haplotype frequencies of CYP3A4 in a Japanese population, the distal enhancer and proximal promoter regions, all exons, and the surrounding introns were sequenced from genomic DNA of 416 Japanese subjects. We found 24 SNPs, including 17 novel ones: two in the distal enhancer, four in the proximal promoter, one in the 5'-untranslated region (UTR), seven in the introns, and three in the 3'-UTR. The most common SNP was c.1026+12G>A (IVS10+12G>A), with a 0.249 frequency. Four non-synonymous SNPs, c.554C>G (p.T185S, CYP3A4(*)16), c.830_831insA (p.E277fsX8, (*)6), c.878T>C (p.L293P, (*)18), and c.1088 C>T (p.T363M, (*)11) were found with frequencies of 0.014, 0.001, 0.028, and 0.002, respectively. No SNP was found in the known nuclear transcriptional factor-binding sites in the enhancer and promoter regions. Using these 24 SNPs, 16 haplotypes were unambiguously identified, and nine haplotypes were inferred by aid of an expectation-maximization-based program. In addition, using data from 186 subjects enabled a close linkage to be found between CYP3A4 and CYP3A5 SNPs, especially among the SNPs at c.1026+12 in CYP3A4 and c.219-237 (IVS3-237, a key SNP site for CYP3A5(*)3), c.865+77 (IVS9+77) and c.1523 in CYP3A5. This result suggested that CYP3A4 and CYP3A5 are within the same gene block. Haplotype analysis between CYP3A4 and CYP3A5 revealed several major haplotype combinations in the CYP3A4-CYP3A5 block. Our findings provide fundamental and useful information for genotyping CYP3A4 (and CYP3A5) in the Japanese, and probably Asian populations. Copyright 2003 Wiley-Liss, Inc.

  13. Blocks of limited haplotype diversity revealed by high-resolution scanning of human chromosome 21.

    PubMed

    Patil, N; Berno, A J; Hinds, D A; Barrett, W A; Doshi, J M; Hacker, C R; Kautzer, C R; Lee, D H; Marjoribanks, C; McDonough, D P; Nguyen, B T; Norris, M C; Sheehan, J B; Shen, N; Stern, D; Stokowski, R P; Thomas, D J; Trulson, M O; Vyas, K R; Frazer, K A; Fodor, S P; Cox, D R

    2001-11-23

    Global patterns of human DNA sequence variation (haplotypes) defined by common single nucleotide polymorphisms (SNPs) have important implications for identifying disease associations and human traits. We have used high-density oligonucleotide arrays, in combination with somatic cell genetics, to identify a large fraction of all common human chromosome 21 SNPs and to directly observe the haplotype structure defined by these SNPs. This structure reveals blocks of limited haplotype diversity in which more than 80% of a global human sample can typically be characterized by only three common haplotypes.

  14. Effects of IL-10 haplotype and atomic bomb radiation exposure on gastric cancer risk.

    PubMed

    Hayashi, Tomonori; Ito, Reiko; Cologne, John; Maki, Mayumi; Morishita, Yukari; Nagamura, Hiroko; Sasaki, Keiko; Hayashi, Ikue; Imai, Kazue; Yoshida, Kengo; Kajimura, Junko; Kyoizumi, Seishi; Kusunoki, Yoichiro; Ohishi, Waka; Fujiwara, Saeko; Akahoshi, Masazumi; Nakachi, Kei

    2013-07-01

    Gastric cancer (GC) is one of the cancers that reveal increased risk of mortality and incidence in atomic bomb survivors. The incidence of gastric cancer in the Life Span Study cohort of the Radiation Effects Research Foundation (RERF) increased with radiation dose (gender-averaged excess relative risk per Gy = 0.28) and remains high more than 65 years after exposure. To assess a possible role of gene-environment interaction, we examined the dose response for gastric cancer incidence based on immunosuppression-related IL-10 genotype, in a cohort study with 200 cancer cases (93 intestinal, 96 diffuse and 11 other types) among 4,690 atomic bomb survivors participating in an immunological substudy. Using a single haplotype block composed of four haplotype-tagging SNPs (comprising the major haplotype allele IL-10-ATTA and the minor haplotype allele IL-10-GGCG, which are categorized by IL-10 polymorphisms at -819A>G and -592T>G, +1177T>C and +1589A>G), multiplicative and additive models for joint effects of radiation and this IL-10 haplotyping were examined. The IL-10 minor haplotype allele(s) was a risk factor for intestinal type gastric cancer but not for diffuse type gastric cancer. Radiation was not associated with intestinal type gastric cancer. In diffuse type gastric cancer, the haplotype-specific excess relative risk (ERR) for radiation was statistically significant only in the major homozygote category of IL-10 (ERR = 0.46/Gy, P = 0.037), whereas estimated ERR for radiation with the minor IL-10 homozygotes was close to 0 and nonsignificant. Thus, the minor IL-10 haplotype might act to reduce the radiation related risk of diffuse-type gastric cancer. The results suggest that this IL-10 haplotyping might be involved in development of radiation-associated gastric cancer of the diffuse type, and that IL-10 haplotypes may explain individual differences in the radiation-related risk of gastric cancer. © 2013 by Radiation Research Society

  15. Missing data imputation and haplotype phase inference for genome-wide association studies

    PubMed Central

    Browning, Sharon R.

    2009-01-01

    Imputation of missing data and the use of haplotype-based association tests can improve the power of genome-wide association studies (GWAS). In this article, I review methods for haplotype inference and missing data imputation, and discuss their application to GWAS. I discuss common features of the best algorithms for haplotype phase inference and missing data imputation in large-scale data sets, as well as some important differences between classes of methods, and highlight the methods that provide the highest accuracy and fastest computational performance. PMID:18850115

  16. Association between endothelin type A receptor haplotypes and mortality in coronary heart disease.

    PubMed

    Ellis, Katrina L; Pilbrow, Anna P; Potter, Howard C; Frampton, Chris M; Doughty, Rob N; Whalley, Gillian A; Ellis, Chris J; Palmer, Barry R; Skelton, Lorraine; Yandle, Tim G; Troughton, Richard W; Richards, A Mark; A Cameron, Vicky

    2012-05-01

    The endothelin type A receptor, encoded by EDNRA, mediates the effects of endothelin-1 to promote vasoconstriction, vascular cell growth, adhesion, fibrosis and thrombosis. We investigated the association between EDNRA haplotype and cardiovascular outcomes in patients with coronary artery disease. Coronary disease patients (n = 1007) were genotyped for the His323His (rs5333) variant and one tag SNP from each of the major EDNRA haplotype blocks (rs6537484, rs1568136, rs5335 and rs10003447). EDNRA haplotype associations with clinical history, natriuretic peptides cardiac function and cardiovascular outcomes were tested over a median 3.8 years. Univariate analysis identified a 'low-risk' EDNRA haplotype associated with later age of Type 2 diabetes onset (p = 0.004) smaller BMI (p = 0.021), and reduced mortality (log rank p = 0.001). Cox proportional hazards analysis including established cardiovascular risk factors revealed an independent association between haplotype and mortality (p < 0.0001). These data highlight the potential importance of the endothelin system, and in particular EDNRA in coronary disease.

  17. Novel protein domains and repeats in Drosophila melanogaster: insights into structure, function, and evolution.

    PubMed

    Ponting, C P; Mott, R; Bork, P; Copley, R R

    2001-12-01

    Sequence database searching methods such as BLAST, are invaluable for predicting molecular function on the basis of sequence similarities among single regions of proteins. Searches of whole databases however, are not optimized to detect multiple homologous regions within a single polypeptide. Here we have used the prospero algorithm to perform self-comparisons of all predicted Drosophila melanogaster gene products. Predicted repeats, and their homologs from all species, were analyzed further to detect hitherto unappreciated evolutionary relationships. Results included the identification of novel tandem repeats in the human X-linked retinitis pigmentosa type-2 gene product, repeated segments in cystinosin, associated with a defect in cystine transport, and 'nested' homologous domains in dysferlin, whose gene is mutated in limb girdle muscular dystrophy. Novel signaling domain families were found that may regulate the microtubule-based cytoskeleton and ubiquitin-mediated proteolysis, respectively. Two families of glycosyl hydrolases were shown to contain internal repetitions that hint at their evolution via a piecemeal, modular approach. In addition, three examples of fruit fly genes were detected with tandem exons that appear to have arisen via internal duplication. These findings demonstrate how completely sequenced genomes can be exploited to further understand the relationships between molecular structure, function, and evolution.

  18. Better ILP models for haplotype assembly.

    PubMed

    Etemadi, Maryam; Bagherian, Mehri; Chen, Zhi-Zhong; Wang, Lusheng

    2018-02-19

    The haplotype assembly problem for diploid is to find a pair of haplotypes from a given set of aligned Single Nucleotide Polymorphism (SNP) fragments (reads). It has many applications in association studies, drug design, and genetic research. Since this problem is computationally hard, both heuristic and exact algorithms have been designed for it. Although exact algorithms are much slower, they are still of great interest because they usually output significantly better solutions than heuristic algorithms in terms of popular measures such as the Minimum Error Correction (MEC) score, the number of switch errors, and the QAN50 score. Exact algorithms are also valuable because they can be used to witness how good a heuristic algorithm is. The best known exact algorithm is based on integer linear programming (ILP) and it is known that ILP can also be used to improve the output quality of every heuristic algorithm with a little decline in speed. Therefore, faster ILP models for the problem are highly demanded. As in previous studies, we consider not only the general case of the problem but also its all-heterozygous case where we assume that if a column of the input read matrix contains at least one 0 and one 1, then it corresponds to a heterozygous SNP site. For both cases, we design new ILP models for the haplotype assembly problem which aim at minimizing the MEC score. The new models are theoretically better because they contain significantly fewer constraints. More importantly, our experimental results show that for both simulated and real datasets, the new model for the all-heterozygous (respectively, general) case can usually be solved via CPLEX (an ILP solver) at least 5 times (respectively, twice) faster than the previous bests. Indeed, the running time can sometimes be 41 times better. This paper proposes a new ILP model for the haplotype assembly problem and its all-heterozygous case, respectively. Experiments with both real and simulated datasets show that the

  19. Analysis of MHC class I genes across horse MHC haplotypes

    PubMed Central

    Tallmadge, Rebecca L.; Campbell, Julie A.; Miller, Donald C.; Antczak, Douglas F.

    2010-01-01

    The genomic sequences of 15 horse Major Histocompatibility Complex (MHC) class I genes and a collection of MHC class I homozygous horses of five different haplotypes were used to investigate the genomic structure and polymorphism of the equine MHC. A combination of conserved and locus-specific primers was used to amplify horse MHC class I genes with classical and non-classical characteristics. Multiple clones from each haplotype identified three to five classical sequences per homozygous animal, and two to three non-classical sequences. Phylogenetic analysis was applied to these sequences and groups were identified which appear to be allelic series, but some sequences were left ungrouped. Sequences determined from MHC class I heterozygous horses and previously described MHC class I sequences were then added, representing a total of ten horse MHC haplotypes. These results were consistent with those obtained from the MHC homozygous horses alone, and 30 classical sequences were assigned to four previously confirmed loci and three new provisional loci. The non-classical genes had few alleles and the classical genes had higher levels of allelic polymorphism. Alleles for two classical loci with the expected pattern of polymorphism were found in the majority of haplotypes tested, but alleles at two other commonly detected loci had more variation outside of the hypervariable region than within. Our data indicate that the equine Major Histocompatibility Complex is characterized by variation in the complement of class I genes expressed in different haplotypes in addition to the expected allelic polymorphism within loci. PMID:20099063

  20. Covalently Linked Tandem Lesions in DNA

    PubMed Central

    Patrzyc, Helen B.; Dawidzik, Jean B.; Budzinski, Edwin E.; Freund, Harold G.; Wilton, John H.; Box, Harold C.

    2013-01-01

    Reactive oxygen species (ROS) generate a type of DNA damage called tandem lesions, two adjacent nucleotides both modified. A subcategory of tandem lesions consists of adjacent nucleotides linked by a covalent bond. Covalently linked tandem lesions generate highly characteristic liquid chromotography-tandem mass spectrometry (LC-MS/MS) elution profiles. We have used this property to comprehensively survey X-irradiated DNA for covalently linked tandem lesions. A total of 15 tandem lesions were detected in DNA irradiated in deoxygenated aqueous solution, five tandem lesions were detected in DNA that was irradiated in oxygenated solution. PMID:23106212

  1. CRISPRcompar: a website to compare clustered regularly interspaced short palindromic repeats.

    PubMed

    Grissa, Ibtissem; Vergnaud, Gilles; Pourcel, Christine

    2008-07-01

    Clustered regularly interspaced short palindromic repeat (CRISPR) elements are a particular family of tandem repeats present in prokaryotic genomes, in almost all archaea and in about half of bacteria, and which participate in a mechanism of acquired resistance against phages. They consist in a succession of direct repeats (DR) of 24-47 bp separated by similar sized unique sequences (spacers). In the large majority of cases, the direct repeats are highly conserved, while the number and nature of the spacers are often quite diverse, even among strains of a same species. Furthermore, the acquisition of new units (DR + spacer) was shown to happen almost exclusively on one side of the locus. Therefore, the CRISPR presents an interesting genetic marker for comparative and evolutionary analysis of closely related bacterial strains. CRISPRcompar is a web service created to assist biologists in the CRISPR typing process. Two tools facilitates the in silico investigation: CRISPRcomparison and CRISPRtionary. This website is freely accessible at http://crispr.u-psud.fr/CRISPRcompar/.

  2. Molecular pathology and haplotype analysis of Wilson disease in Mediterranean populations

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Figus, A.; Farcia, A.M.G.; Nurchi, A.

    1995-12-01

    We analyzed mutations and defined the chromosomal haplotype in 127 patients of Mediterranean descent who were affected in Wilson disease (WD): 39 Sardinians, 49 Italians, 33 Turks, and 6 Albanians. Haplotypes were derived by use of the microsatellite markers D13S301, D13S296, D13S297, and D13S298, which are linked to the WD locus. There were five common haplotypes in Sardinians, three in Italians, and two in Turks, which accounted for 85%, 32%, and 30% of the WD chromosomes, respectively. We identified 16 novel mutations: 8 frameshifts, 7 missense mutations, and 1 splicing defect. In addition, we detected the previously described mutations: 2302insC,more » 3404delC, Arg1320ter, Gly944Ser, and His1070Gin. Of the new mutations detected, two, the 1515insT on haplotype I and 2464delC on haplotype XVI, accounted for 6% and 13%, respectively, of the mutations in WD chromsomes in the Sardinian populations. Mutations H1070Q, 2302insC, and 2533delA represented 13%, 8%, and 8%, respectively, of the mutations in WD chromsomes in other Mediterranean populations. The remaining mutations were rare and limited to one or two patients from different populations. Thus, WD results from some frequent mutations and many rare defects. 28 refs., 1 fig., 3 tabs.« less

  3. Haplotype analysis of the apolipoprotein gene cluster on human chromosome 11

    PubMed Central

    Olivier, Michael; Wang, Xujing; Cole, Regina; Gau, Brian; Kim, Jessica; Rubin, Edward M.; Pennacchio, Len A.

    2009-01-01

    Members of the apolipoprotein gene cluster (APOA1/C3/A4/A5) on human chromosome 11q23 play an important role in lipid metabolism. Polymorphisms in both APOA5 and APOC3 are strongly associated with plasma triglyceride concentrations. The close genomic locations of these two genes as well as their functional similarity have hindered efforts to define whether each gene independently influences human triglyceride concentrations. In this study, we examined the linkage disequilibrium and haplotype structure of 49 SNPs in a 150-kb region spanning the gene cluster. We identified a total of five common APOA5 haplotypes with a frequency of greater than 8% in samples of northern European origin. The APOA5 haplotype block did not extend past the 7 SNPs in the gene and was separated from the other apolipoprotein gene in the cluster by a region of significantly increased recombination. Furthermore, one previously identified triglyceride risk haplotype of APOA5 (APOA5*3) showed no association with three APOC3 SNPs previously associated with triglyceride concentrations, in contrast to the other risk haplotype (APOA5*2), which was associated with all three minor APOC3 SNP alleles. These results highlight the complex genetic relationship between APOA5 and APOC3 and support the notion that APOA5 represents an independent risk gene affecting plasma triglyceride concentrations in humans. PMID:15081120

  4. The JAK2 GGCC (46/1) Haplotype in Myeloproliferative Neoplasms: Causal or Random?

    PubMed Central

    Anelli, Luisa; Zagaria, Antonella; Specchia, Giorgina

    2018-01-01

    The germline JAK2 haplotype known as “GGCC or 46/1 haplotype” (haplotypeGGCC_46/1) consists of a combination of single nucleotide polymorphisms (SNPs) mapping in a region of about 250 kb, extending from the JAK2 intron 10 to the Insulin-like 4 (INLS4) gene. Four main SNPs (rs3780367, rs10974944, rs12343867, and rs1159782) generating a “GGCC” combination are more frequently indicated to represent the JAK2 haplotype. These SNPs are inherited together and are frequently associated with the onset of myeloproliferative neoplasms (MPN) positive for both JAK2 V617 and exon 12 mutations. The association between the JAK2 haplotypeGGCC_46/1 and mutations in other genes, such as thrombopoietin receptor (MPL) and calreticulin (CALR), or the association with triple negative MPN, is still controversial. This review provides an overview of the frequency and the role of the JAK2 haplotypeGGCC_46/1 in the pathogenesis of different myeloid neoplasms and describes the hypothetical mechanisms at the basis of the association with JAK2 gene mutations. Moreover, possible clinical implications are discussed, as different papers reported contrasting data about the correlation between the JAK2 haplotypeGGCC_46/1 and blood cell count, survival, or disease progression. PMID:29641446

  5. Rapid and high resolution genotyping of all Escherichia coli serotypes using 10 genomic repeat-containing loci.

    PubMed

    Løbersli, Inger; Haugum, Kjersti; Lindstedt, Bjørn-Arne

    2012-01-01

    Our laboratory has previously published two multiple-locus variable-number tandem-repeats analysis (MLVA) methods for rapid genotyping of Escherichia coli (E. coli), which are now in routine use for surveillance and outbreak detection. The first assay developed was specific for E. coli O157:H7; however this assay was not suitable for genotyping other E. coli serotypes. A new generic MLVA-assay was then developed with the capability of genotyping all E. coli serotypes. This generic E. coli MLVA (GECM7) was based on polymorphism in seven variable number of tandem repeats (VNTR) loci. GECM7 worked well with the majority of E. coli serotypes; however we wanted to increase the resolution for this method based in part of comparison with PFGE typing of E. coli O26:H11, where PFGE appeared to display higher resolution. The GECM7 method was improved by adding three new repeat-loci to a total of ten (GECM10), and a considerable increase in resolution was observed (from 296 to 507 genotypes on the same set of strains). Copyright © 2011 Elsevier B.V. All rights reserved.

  6. Mendel-GPU: haplotyping and genotype imputation on graphics processing units

    PubMed Central

    Chen, Gary K.; Wang, Kai; Stram, Alex H.; Sobel, Eric M.; Lange, Kenneth

    2012-01-01

    Motivation: In modern sequencing studies, one can improve the confidence of genotype calls by phasing haplotypes using information from an external reference panel of fully typed unrelated individuals. However, the computational demands are so high that they prohibit researchers with limited computational resources from haplotyping large-scale sequence data. Results: Our graphics processing unit based software delivers haplotyping and imputation accuracies comparable to competing programs at a fraction of the computational cost and peak memory demand. Availability: Mendel-GPU, our OpenCL software, runs on Linux platforms and is portable across AMD and nVidia GPUs. Users can download both code and documentation at http://code.google.com/p/mendel-gpu/. Contact: gary.k.chen@usc.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:22954633

  7. [Usefulness of the variable numbers of tandem repeats (VNTR) analysis for complex infections of Mycobacterium avium and Mycobacterium intracellulare].

    PubMed

    Tsunematsu, Noriko; Goto, Mieko; Saiki, Yumiko; Baba, Michiko; Udagawa, Tadashi; Kazumi, Yuko

    2008-09-01

    The bacilli which were isolated from a patient suspected of the mixed infections with Mycobacterium avium and Mycobacterium intracellulare, were analyzed. The genotypes of M. avium in the sedimented fractions of treated sputum and in some colonies isolated from Ogawa medium were compared by the Variable Numbers of Tandem Repeats (VNTR). A woman, aged 57. Mycobacterial species isolated from some colonies by culture in 2004 and 2006 and from the treated sputum in 2006, were determined by DNA sequencing analysis of the 16S rRNA gene. Also, by using VNTR, the genotype of mycobacteria was analyzed. [Results] (1) The colony isolated from Ogawa medium in 2004 was monoclonal M. avium. (2) By VNTR analyses of specimens in 2006, multiple acid-fast bacteria were found in the sputum sediment and in isolated bacteria from Ogawa medium. (3) By analyses of 16S rRNA DNA sequence, M. avium and M. intracellulare were found in the colonies isolated from the sputum sediment and the Ogawa medium in 2006. (4) The same VNTR patterns were obtained in M. avium in 2004 and 2006 when single colony was analyzed. (5) From the showerhead and culvert of the bathroom in the patient's house, M. avium was not detected. By VNTR analyses, it was considered that the mixed infections of M. avium and M. intracellulare had been generated during treatment in this case. Therefore, in the case of suspected complex infection, VNTR analysis would be a useful genotyping method in M. avium complex infection.

  8. DXYS156: a multi-purpose short tandem repeat locus for determination of sex, paternal and maternal geographic origins and DNA fingerprinting.

    PubMed

    Calì, Francesco; Forster, P; Kersting, Christian; Mirisola, Mario G; D'Anna, Rosalba; De Leo, Giacomo; Romano, Valentino

    2002-06-01

    In forensic science and in legal medicine Y chromosomal typing is indispensable for sex determination, for paternity testing in the absence of the father and for distinguishing males in multiple rape cases. Another potential application is the estimation of paternal geographic origin or family name from a crime stain to narrow down the range of suspects and thus reduce costs of mass screenings. However, Y typing alone cannot provide a sufficiently resolved DNA fingerprint as required for court convictions. Thus, there is a dilemma whether or not to sacrifice valuable material for the sake of extensive Y chromosomal investigations when stain DNA is limited (typically allowing only few PCR amplifications). We here describe a Y-chromosome-specific nucleotide insertion in the duplicate short tandem repeat (STR) locus DXYS156 which allows us to distinguish males from females as does the commonly used amelogenin system, but with the advantage that this locus is multi-allelic, thus substantially contributing towards DNA fingerprinting of a sample and furthermore enabling the detection of sample contamination. Yet another bonus is that both the X and the Y copies of DXYS156 have alleles specific to different parts of the world, offering separate estimates of maternal and paternal descent of that sample. We therefore recommend the inclusion of DXYS156 in standard multiplexing kits for forensic, archaeological and genealogical applications.

  9. Genetic polymorphism of the 26 short tandem repeat loci in the Chinese Hebei Han population using two commercial forensic kits.

    PubMed

    Lei, Liang; Xu, Jie; Du, Qingqing; Fu, Lihong; Zhang, Xiaojing; Yu, Feng; Ma, Chunling; Cong, Bin; Li, Shujin

    2015-01-01

    We determined the allele frequencies and forensic parameters for the 26 short tandem repeat (STR) autosomal markers in two commercial kits (the Investigator HDplex and AmpFLSTR(®) Identifiler(®) systems) for 183 unrelated individuals from the Han population of the Hebei Province of China. The 26 STRs were all in Hardy-Weinberg equilibrium. No linkage disequilibrium was detected between any pair of loci. The combined power of discrimination and the combined power of exclusion for the 26 STR loci were 1-7.74E-31 and 1-1.21E-11, respectively. Six rare alleles of D10S2325 were identified and named 20, 21, 22, 23, 24, and 31. All the length of the six rare alleles were out of the range of allelic ladder. We calculated the population pairwise genetic distance based on the allele frequencies, using published population data including German, central Polish, south Dutch, northeastern Polish, south Brazilian, Korean, Sichuan Han of China, and Shanghai Han of China. Also we examined the population pairwise genetic distance of loci included in Identifiler system between Hebei Han and other ethnic population of China. These 26 autosomal STR loci could provide highly informative polymorphic data for paternity testing and forensic identification in the Hebei Han population in China. Because they are all in linkage equilibrium, they could be used together to solve deficient kinship cases or cases with mutations.

  10. Infrared fluorescent automated detection of thirteen short tandem repeat polymorphisms and one gender-determining system of the CODIS core system.

    PubMed

    Ricci, U; Sani, I; Guarducci, S; Biondi, C; Pelagatti, S; Lazzerini, V; Brusaferri, A; Lapini, M; Andreucci, E; Giunti, L; Giovannucci Uzielli, M L

    2000-11-01

    We used an infrared (IR) automated fluorescence monolaser sequencer for the analysis of 13 autosomal short tandem repeat (STR) systems (TPOX, D3S1358, FGA, CSF1PO, D5S818, D7S820, D8S1179, TH01, vWA, D13S317, D16S359, D18S51, D21S11) and the X-Y homologous gene amelogenin system. These two systems represent the core of the combined DNA index systems (CODIS). Four independent multiplex reactions, based on the polymerase chain reaction (PCR) technique and on the direct labeling of the forward primer of every primer pair, with a new molecule (IRDye800), were set up, permitting the exact characterization of the alleles by comparison with ladders of specific sequenced alleles. This is the first report of the whole analysis of the STRs of the CODIS core using an IR automated DNA sequencer. The protocol was used to solve paternity/maternity tests and for population studies. The electrophoretic system also proved useful for the correct typing of those loci differing in size by only 2 bp. A sensibility study demonstrated that the test can detect an average of 10 pg of undegraded human DNA. We also performed a preliminary study analyzing some forensic samples and mixed stains, which suggested the usefulness of using this analytical system for human identification as well as for forensic purposes.

  11. Whole-genome sequencing in patients with ciliopathies uncovers a novel recurrent tandem duplication in IFT140.

    PubMed

    Geoffroy, Véronique; Stoetzel, Corinne; Scheidecker, Sophie; Schaefer, Elise; Perrault, Isabelle; Bär, Séverine; Kröll, Ariane; Delbarre, Marion; Antin, Manuela; Leuvrey, Anne-Sophie; Henry, Charline; Blanché, Hélène; Decker, Eva; Kloth, Katja; Klaus, Günter; Mache, Christoph; Martin-Coignard, Dominique; McGinn, Steven; Boland, Anne; Deleuze, Jean-François; Friant, Sylvie; Saunier, Sophie; Rozet, Jean-Michel; Bergmann, Carsten; Dollfus, Hélène; Muller, Jean

    2018-04-24

    Ciliopathies represent a wide spectrum of rare diseases with overlapping phenotypes and a high genetic heterogeneity. Among those, IFT140 is implicated in a variety of phenotypes ranging from isolated retinis pigmentosa to more syndromic cases. Using whole-genome sequencing in patients with uncharacterized ciliopathies, we identified a novel recurrent tandem duplication of exon 27-30 (6.7 kb) in IFT140, c.3454-488_4182+2588dup p.(Tyr1152_Thr1394dup), missed by whole-exome sequencing. Pathogenicity of the mutation was assessed on the patients' skin fibroblasts. Several hundreds of patients with a ciliopathy phenotype were screened and biallelic mutations were identified in 11 families representing 12 pathogenic variants of which seven are novel. Among those unrelated families especially with a Mainzer-Saldino syndrome, eight carried the same tandem duplication (two at the homozygous state and six at the heterozygous state). In conclusion, we demonstrated the implication of structural variations in IFT140-related diseases expanding its mutation spectrum. We also provide evidences for a unique genomic event mediated by an Alu-Alu recombination occurring on a shared haplotype. We confirm that whole-genome sequencing can be instrumental in the ability to detect structural variants for genomic disorders. © 2018 Wiley Periodicals, Inc.

  12. Two independent apolipoprotein A5 haplotypes influence human plasma triglyceride levels.

    PubMed

    Pennacchio, Len A; Olivier, Michael; Hubacek, Jaroslav A; Krauss, Ronald M; Rubin, Edward M; Cohen, Jonathan C

    2002-11-15

    The recently identified apolipoprotein A5 gene (APOA5) has been shown to play an important role in determining plasma triglyceride concentrations in humans and mice. We previously identified an APOA5 haplotype (designated APOA5*2) that is present in approximately 16% of Caucasians and is associated with increased plasma triglyceride concentrations. In this report we describe another APOA5 haplotype (APOA5*3) containing the rare allele of the single nucleotide polymorphism c.56C>G that changes serine to tryptophan at codon 19 and is independently associated with high plasma triglyceride levels in three different populations. In a sample of 264 Caucasian men and women with plasma triglyceride concentrations above the 90th percentile or below the 10th percentile, the APOA5*3 haplotype was more than three-fold more common in the group with high plasma triglyceride levels. In a second independently ascertained sample of Caucasian men and women (n=419) who were studied while consuming their self-selected diets as well as after high-carbohydrate diets and high-fat diets, the APOA5*3 haplotype was associated with increased plasma triglyceride levels on all three dietary regimens. In a third population comprising 2660 randomly selected individuals, the APOA5*3 haplotype was found in 12% of Caucasians, 14% of African-Americans and 28% of Hispanics and was associated with increased plasma triglyceride levels in both men and women in each ethnic group. These findings establish that the APOA5 locus contributes significantly to inter-individual variation in plasma triglyceride levels in humans. Together, the APOA5*2 and APOA5*3 haplotypes are found in 25-50% of African-Americans, Hispanics and Caucasians and support the contribution of common human variation to quantitative phenotypes in the general population.

  13. Two independent apolipoprotein a5 Haplotypes influence human plasma triglyceride levels

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Pennacchio, Len A.; Olivier, Michael; Hubacek, Jaroslav A.

    2002-09-16

    The recently identified apolipoprotein A5 gene (APOA5) has been shown to play an important role in determining plasma triglyceride concentrations in humans and mice. We previously identified an APOA5 haplotype (designated APOA5*2) that is present in {approx}16 percent of Caucasians and is associated with increased plasma triglyceride concentrations. In this report we describe another APOA5 haplotype (APOA5*3) containing the rare allele of the single nucleotide polymorphism c.56C>G that changes serine to tryptophan at codon 19 and is independently associated with high plasma triglyceride levels in three different populations. In a sample of 264 Caucasian men and women with plasma triglyceridemore » concentrations above the 90th percentile or below the 10th percentile, the APOA5*3 haplotype was more than three-fold more common in the group with high plasma triglyceride levels. In a second independently ascertained sample of Caucasian men and women (n 1/4 419) who were studied while consuming their self-selected diets as well as after high-carbohydrate diets and high-fat diets, the APOA5*3 haplotype was associated with increased plasma triglyceride levels on all three dietary regimens. In a third population comprising 2660 randomly selected individuals, the APOA5*3 haplotype was found in 12 percent of Caucasians, 14 percent of African-Americans and 28 percent of Hispanics and was associated with increased plasma triglyceride levels in both men and women in each ethnic group. These findings establish that the APOA5 locus contributes significantly to inter-individual variation in plasma triglyceride levels in humans. Together, the APOA5*2 and APOA5*3 haplotypes are found in 25 50 percent of African-Americans, Hispanics and Caucasians and support the contribution of common human variation to quantitative phenotypes in the general population.« less

  14. A new family of dispersed repeats from Brassica nigra: characterization and localization.

    PubMed

    Kapila, R; Negi, M S; This, P; Delseny, M; Srivastava, P S; Lakshmikumaran, M

    1996-11-01

    The 459-bp HindIII (pBN-4) and the 1732-bp Eco RI (pBNE8) fragments from the Brassica nigra genome were cloned and shown to be members of a dispersed repeat family. Of the three major diploid Brassica species, the repeat pBN-4 was found to be highly specific for the B. nigra genome. The family also hybridized to Sinapis arvensis showing that B. nigra had a closer relationship with the S. arvensis genome than with B. oleracea or B. campestris. The clone pBNE8 showed homology to a number of tRNA species indicating that this family of repeats may have originated from a tRNA sequence. The species-specific 459-bp repeat pBN-4 was localized on the B. nigra chromosomes using monosomic addition lines. In addition to the localization of pBN-4, the chromosomal distribution of two other species-specific repeats, pBN34 and pBNBH35 (reported earlier), was studied. The dispersed repeats pBN-4 and pBNBH35 were found to be present on all of the chromosomes, whereas the tandem repeat pBN34 was localized on two chromosomes.

  15. High-Density SNP Genotyping to Define β-Globin Locus Haplotypes

    PubMed Central

    Liu, Li; Muralidhar, Shalini; Singh, Manisha; Sylvan, Caprice; Kalra, Inderdeep S.; Quinn, Charles T.; Onyekwere, Onyinye C.; Pace, Betty S.

    2014-01-01

    Five major β-globin locus haplotypes have been established in individuals with sickle cell disease (SCD) from the Benin, Bantu, Senegal, Cameroon, and Arab-Indian populations. Historically, β-haplotypes were established using restriction fragment length polymorphism (RFLP) analysis across the β-locus, which consists of five functional β-like globin genes located on chromosome 11. Previous attempts to correlate these haplotypes as robust predictors of clinical phenotypes observed in SCD have not been successful. We speculate that the coverage and distribution of the RFLP sites located proximal to or within the globin genes are not sufficiently dense to accurately reflect the complexity of this region. To test our hypothesis, we performed RFLP analysis and high-density single nucleotide polymorphism (SNP) genotyping across the β-locus using DNA samples from either healthy African Americans with normal hemoglobin A (HbAA) or individuals with homozygous SS (HbSS) disease. Using the genotyping data from 88 SNPs and Haploview analysis, we generated a greater number of haplotypes than that observed with RFLP analysis alone. Furthermore, a unique pattern of long-range linkage disequilibrium between the locus control region and the β-like globin genes was observed in the HbSS group. Interestingly, we observed multiple SNPs within the HindIII restriction site located in the Gγ-globin intervening sequence II which produced the same RFLP pattern. These findings illustrated the inability of RFLP analysis to decipher the complexity of sequence variations that impacts genomic structure in this region. Our data suggest that high density SNP mapping may be required to accurately define β-haplotypes that correlate with the different clinical phenotypes observed in SCD. PMID:18829352

  16. Hypercontrols in genotype-phenotype analysis reveal ancestral haplotypes associated with essential hypertension.

    PubMed

    Balam-Ortiz, Eros; Esquivel-Villarreal, Adolfo; Huerta-Hernandez, David; Fernandez-Lopez, Juan Carlos; Alfaro-Ruiz, Luis; Muñoz-Monroy, Omar; Gutierrez, Ruth; Figueroa-Genis, Enrique; Carrillo, Karol; Elizalde, Adela; Hidalgo, Alfredo; Rodriguez, Mauricio; Urushihara, Maki; Kobori, Hiroyuki; Jimenez-Sanchez, Gerardo

    2012-04-01

    The angiotensinogen gene locus has been associated with essential hypertension in most populations analyzed to date. Increased plasma angiotensinogen levels have been proposed as an underlying cause of essential hypertension in whites; however, differences in the genetic regulation of plasma angiotensinogen levels have also been reported for other populations. The aim of this study was to analyze the relationship between angiotensinogen gene polymorphisms and haplotypes with plasma angiotensinogen levels and the risk of essential hypertension in the Mexican population. We genotyped 9 angiotensinogen gene polymorphisms in 706 individuals. Four polymorphisms, A-6, C4072, C6309, and G12775, were associated with increased risk, and the strongest association was found for the C6309 allele (χ(2)=23.9; P=0.0000009), which resulted in an odds ratio of 3.0 (95% CI: 1.8-4.9; P=0.000006) in the recessive model. Two polymorphisms, A-20C (P=0.003) and C3389T (P=0.0001), were associated with increased plasma angiotensinogen levels but did not show association with essential hypertension. The haplotypes H1 (χ(2)=8.1; P=0.004) and H5 (χ(2)=5.1; P=0.02) were associated with essential hypertension. Using phylogenetic analysis, we found that haplotypes 1 and 5 are the human ancestral haplotypes. Our results suggest that the positive association between angiotensinogen gene polymorphisms and haplotypes with essential hypertension is not simply explained by an increase in plasma angiotensinogen concentration. Complex interactions between risk alleles suggest that these haplotypes act as "superalleles."

  17. Hypercontrols in Genotype-Phenotype Analysis Reveal Ancestral Haplotypes Associated With Essential Hypertension

    PubMed Central

    Balam-Ortiz, Eros; Esquivel-Villarreal, Adolfo; Huerta-Hernandez, David; Fernandez-Lopez, Juan Carlos; Alfaro-Ruiz, Luis; Muñoz-Monroy, Omar; Gutierrez, Ruth; Figueroa-Genis, Enrique; Carrillo, Karol; Elizalde, Adela; Hidalgo, Alfredo; Rodriguez, Mauricio; Urushihara, Maki; Kobori, Hiroyuki; Jimenez-Sanchez, Gerardo

    2012-01-01

    The angiotensinogen gene locus has been associated with essential hypertension in most populations analyzed to date. Increased plasma angiotensinogen levels have been proposed as an underlying cause of essential hypertension in whites; however, differences in the genetic regulation of plasma angiotensinogen levels have also been reported for other populations. The aim of this study was to analyze the relationship between angiotensinogen gene polymorphisms and haplotypes with plasma angiotensinogen levels and the risk of essential hypertension in the Mexican population. We genotyped 9 angiotensinogen gene polymorphisms in 706 individuals. Four polymorphisms, A-6, C4072, C6309, and G12775, were associated with increased risk, and the strongest association was found for the C6309 allele (χ2 = 23.9; P = 0.0000009), which resulted in an odds ratio of 3.0 (95% CI: 1.8–4.9; P = 0.000006) in the recessive model. Two polymorphisms, A-20C (P = 0.003) and C3389T (P = 0.0001), were associated with increased plasma angiotensinogen levels but did not show association with essential hypertension. The haplotypes H1 (χ2 = 8.1; P = 0.004) and H5 (χ2 = 5.1; P = 0.02) were associated with essential hypertension. Using phylogenetic analysis, we found that haplotypes 1 and 5 are the human ancestral haplotypes. Our results suggest that the positive association between angiotensinogen gene polymorphisms and haplotypes with essential hypertension is not simply explained by an increase in plasma angiotensinogen concentration. Complex interactions between risk alleles suggest that these haplotypes act as “superalleles.” PMID:22371359

  18. The HLA-DRB9 gene and the origin of HLA-DR haplotypes.

    PubMed

    Gongora, R; Figueroa, F; Klein, J

    1996-11-01

    HLA-DRB9 is a gene fragment consisting of exon 2 and flanking intron sequences. It is located at the extreme end of the DRB subregion, whose other end is demarcated by the DRB1 locus. We sequenced approximately 1400 base pairs of the segment encompassing the DRB9 locus from eight human haplotypes (DR1, DR10, DR2, DR3, DR5, DR6, DR8, and DR9, the DR4 and DR7 having been sequenced by others earlier), as well as two chimpanzee, five gorillas, one orangutan and one macaque haplotype. The analysis of these sequences indicates that the DRB9 locus, which we estimate to be more than 58 million years (my) old, has been coevolving with the DRB1 locus for the last 4.2 my. As a consequence of this coevolution, the human DRB9 alleles fall into groups that correlate with the DRB1 allelic groups and with the gene organization of the human haplotypes. This observation implies that the present-day HLA-DR haplotype groups (DR1, DR51, DR52, DR8, and DR53) were founded more than 4 my ago and have remained intact (barring minor internal rearrangements that did not recombine the DRB1 and DRB9 genes) for this period of time. The haplotypes have been transmitted during speciations from ancestral to emerging species just like allelic lineages at the DRB1 locus. Thus not only allelic but also haplotype polymorphism evolves trans-specifically.

  19. COI haplotype groups in Mesocriconema (Nematoda: Criconematidae) and their morphospecies associations.

    PubMed

    Powers, T O; Bernard, E C; Harris, T; Higgins, R; Olson, M; Lodema, M; Mullin, P; Sutton, L; Powers, K S

    2014-07-03

    Without applying an a priori bias for species boundaries, specimen identities in the plant-parasitic nematode genus Mesocriconema were evaluated by examining mitochondrial COI nucleotide sequences, morphology, and biogeography. A total of 242 specimens that morphologically conformed to the genus were individually photographed, measured, and amplified by a PCR primer set to preserve the linkage between specimen morphology and a specific DNA barcode sequence. Specimens were extracted from soil samples representing 45 locations across 23 ecoregions in North America. Dendrograms constructed by neighbor-joining, maximum likelihood, and Bayesian Inference using a 721-bp COI barcode were used to group COI haplotypes. Each tree-building approach resulted in 24 major haplotype groups within the dataset. The distinctiveness of these groups was evaluated by node support, genetic distance, absence of intermediates, and several measures of distinctiveness included in software used for the exploration of species boundaries. Five of the 24 COI haplotype groups corresponded to morphologically characterized, Linnaean species. Morphospecies conforming to M. discus, Discocriconemella inarata, M. rusticum, M. onoense, and M. kirjanovae were represented by groups composed of multiple closely related or identical COI haplotypes. In other cases, morphospecies names could be equally applied to multiple haplotype groups that were genetically distant from each other. Identification based on morphology alone resulted in M. curvatum and M. ornatum species designations applied to seven and three groups, respectively. Morphological characters typically used for species level identification were demonstrably variable within haplotype groups, suggesting caution in assigning species names based on published compendia that solely consider morphological characters. Morphospecies classified as M. xenoplax formed a monophyletic group composed of seven genetically distinct COI subgroups. The species

  20. β-globin haplotypes in normal and hemoglobinopathic individuals from Reconcavo Baiano, State of Bahia, Brazil.

    PubMed

    Dos Santos Silva, Wellington; de Nazaré Klautau-Guimarães, Maria; Grisolia, Cesar Koppe

    2010-07-01

    Five restriction site polymorphisms in the β-globin gene cluster (HincII-5' ε, HindIII-(G) γ, HindIII-(A) γ, HincII- ψβ1 and HincII-3' ψβ1) were analyzed in three populations (n = 114) from Reconcavo Baiano, State of Bahia, Brazil. The groups included two urban populations from the towns of Cachoeira and Maragojipe and one rural Afro-descendant population, known as the "quilombo community", from Cachoeira municipality. The number of haplotypes found in the populations ranged from 10 to 13, which indicated higher diversity than in the parental populations. The haplotypes 2 (+ - - - -), 3 (- - - - +), 4 (- + - - +) and 6 (- + + - +) on the β(A) chromosomes were the most common, and two haplotypes, 9 (- + + + +) and 14 (+ + - - +), were found exclusively in the Maragojipe population. The other haplotypes (1, 5, 9, 11, 12, 13, 14 and 16) had lower frequencies. Restriction site analysis and the derived haplotypes indicated homogeneity among the populations. Thirty-two individuals with hemoglobinopathies (17 sickle cell disease, 12 HbSC disease and 3 HbCC disease) were also analyzed. The haplotype frequencies of these patients differed significantly from those of the general population. In the sickle cell disease subgroup, the predominant haplotypes were BEN (Benin) and CAR (Central African Republic), with frequencies of 52.9% and 32.4%, respectively. The high frequency of the BEN haplotype agreed with the historical origin of the afro-descendant population in the state of Bahia. However, this frequency differed from that of Salvador, the state capital, where the CAR and BEN haplotypes have similar frequencies, probably as a consequence of domestic slave trade and subsequent internal migrations to other regions of Brazil.

  1. β-globin haplotypes in normal and hemoglobinopathic individuals from Reconcavo Baiano, State of Bahia, Brazil

    PubMed Central

    2010-01-01

    Five restriction site polymorphisms in the β-globin gene cluster (HincII-5‘ ε, HindIII-G γ, HindIII-A γ, HincII- ψβ1 and HincII-3‘ ψβ1) were analyzed in three populations (n = 114) from Reconcavo Baiano, State of Bahia, Brazil. The groups included two urban populations from the towns of Cachoeira and Maragojipe and one rural Afro-descendant population, known as the “quilombo community”, from Cachoeira municipality. The number of haplotypes found in the populations ranged from 10 to 13, which indicated higher diversity than in the parental populations. The haplotypes 2 (+ - - - -), 3 (- - - - +), 4 (- + - - +) and 6 (- + + - +) on the βA chromosomes were the most common, and two haplotypes, 9 (- + + + +) and 14 (+ + - - +), were found exclusively in the Maragojipe population. The other haplotypes (1, 5, 9, 11, 12, 13, 14 and 16) had lower frequencies. Restriction site analysis and the derived haplotypes indicated homogeneity among the populations. Thirty-two individuals with hemoglobinopathies (17 sickle cell disease, 12 HbSC disease and 3 HbCC disease) were also analyzed. The haplotype frequencies of these patients differed significantly from those of the general population. In the sickle cell disease subgroup, the predominant haplotypes were BEN (Benin) and CAR (Central African Republic), with frequencies of 52.9% and 32.4%, respectively. The high frequency of the BEN haplotype agreed with the historical origin of the afro-descendant population in the state of Bahia. However, this frequency differed from that of Salvador, the state capital, where the CAR and BEN haplotypes have similar frequencies, probably as a consequence of domestic slave trade and subsequent internal migrations to other regions of Brazil. PMID:21637405

  2. Interrelationships between Amerindian tribes of lower Amazonia as manifest by HLA haplotype disequilibria.

    PubMed

    Black, F L

    1984-11-01

    HLA B-C haplotypes exhibit common disequilibria in populations drawn from four continents, indicating that they are subject to broadly active selective forces. However, the A-B and A-C associations we have examined show no consistent disequilibrium pattern, leaving open the possibility that these disequilibria are due to descent from common progenitors. By examining HLA haplotype distributions, I have explored the implications that would follow from the hypothesis that biological selection played no role in determining A-C disequilibria in 10 diverse tribes of the lower Amazon Basin. Certain haplotypes are in strong positive disequilibria across a broad geographic area, suggesting that members of diverse tribes descend from common ancestors. On the basis of the extent of diffusion of the components of these haplotypes, one can estimate that the progenitors lived less than 6,000 years ago. One widely encountered lineage entered the area within the last 1,200 years. When haplotype frequencies are used in genetic distance measurements, they give a pattern of relationships very similar to that obtained by conventional chord measurements based on several genetic markers; but more than that, when individual haplotype disequilibria in the several tribes are compared, multiple origins of a single tribe are discernible and relationships are revealed that correlate more closely to geographic and linguistic patterns than do the genetic distance measurements.

  3. Three Novel Haplotypes of Theileria bicornis in Black and White Rhinoceros in Kenya.

    PubMed

    Otiende, M Y; Kivata, M W; Jowers, M J; Makumi, J N; Runo, S; Obanda, V; Gakuya, F; Mutinda, M; Kariuki, L; Alasaad, S

    2016-02-01

    Piroplasms, especially those in the genera Babesia and Theileria, have been found to naturally infect rhinoceros. Due to natural or human-induced stress factors such as capture and translocations, animals often develop fatal clinical piroplasmosis, which causes death if not treated. This study examines the genetic diversity and occurrence of novel Theileria species infecting both black and white rhinoceros in Kenya. Samples collected opportunistically during routine translocations and clinical interventions from 15 rhinoceros were analysed by polymerase chain reaction (PCR) using a nested amplification of the small subunit ribosomal RNA (18S rRNA) gene fragments of Babesia and Theileria. Our study revealed for the first time in Kenya the presence of Theileria bicornis in white (Ceratotherium simum simum) and black (Diceros bicornis michaeli) rhinoceros and the existence of three new haplotypes: haplotypes H1 and H3 were present in white rhinoceros, while H2 was present in black rhinoceros. No specific haplotype was correlated to any specific geographical location. The Bayesian inference 50% consensus phylogram recovered the three haplotypes monophyleticly, and Theileria bicornis had very high support (BPP: 0.98). Furthermore, the genetic p-uncorrected distances and substitutions between T. bicornis and the three haplotypes were the same in all three haplotypes, indicating a very close genetic affinity. This is the first report of the occurrence of Theileria species in white and black rhinoceros from Kenya. The three new haplotypes reported here for the first time have important ecological and conservational implications, especially for population management and translocation programs and as a means of avoiding the transport of infected animals into non-affected areas. © 2014 Blackwell Verlag GmbH.

  4. Intricacies in arrangement of SNP haplotypes suggest "Great Admixture" that created modern humans.

    PubMed

    Dutta, Rajib; Mainsah, Joseph; Yatskiv, Yuriy; Chakrabortty, Sharmistha; Brennan, Patrick; Khuder, Basil; Qiu, Shuhao; Fedorova, Larisa; Fedorov, Alexei

    2017-06-05

    Inferring history from genomic sequences is challenging and problematic because chromosomes are mosaics of thousands of small Identicalby-descent (IBD) fragments, each of them having their own unique story. However, the main events in recent evolution might be deciphered from comparative analysis of numerous loci. A paradox of why humans, whose effective population size is only 10 4 , have nearly three million frequent SNPs is formulated and examined. We studied 5398 loci evenly covering all human autosomes. Common haplotypes built from frequent SNPs that are present in people from various populations have been examined. We demonstrated highly non-random arrangement of alleles in common haplotypes. Abundance of mutually exclusive pairs of common haplotypes that have different alleles at every polymorphic position (so-called Yin/Yang haplotypes) was found in 56% of loci. A novel widely spread category of common haplotypes named Mosaic has been described. Mosaic consists of numerous pieces of Yin/Yang haplotypes and represents an ancestral stage of one of them. Scenarios of possible appearance of large number of frequent human SNPs and their habitual arrangement in Yin/Yang common haplotypes have been evaluated with an advanced genomic simulation algorithm. Computer modeling demonstrated that the observed arrangement of 2.9 million frequent SNPs could not originate from a sole stand-alone population. A "Great Admixture" event has been proposed that can explain peculiarities with frequent SNP distributions. This Great Admixture presumably occurred 100-300 thousand years ago between two ancestral populations that had been separated from each other about a million years ago. Our programs and algorithms can be applied to other species to perform evolutionary and comparative genomics.

  5. FamLBL: detecting rare haplotype disease association based on common SNPs using case-parent triads.

    PubMed

    Wang, Meng; Lin, Shili

    2014-09-15

    In recent years, there has been an increasing interest in using common single-nucleotide polymorphisms (SNPs) amassed in genome-wide association studies to investigate rare haplotype effects on complex diseases. Evidence has suggested that rare haplotypes may tag rare causal single-nucleotide variants, making SNP-based rare haplotype analysis not only cost effective, but also more valuable for detecting causal variants. Although a number of methods for detecting rare haplotype association have been proposed in recent years, they are population based and thus susceptible to population stratification. We propose family-triad-based logistic Bayesian Lasso (famLBL) for estimating effects of haplotypes on complex diseases using SNP data. By choosing appropriate prior distribution, effect sizes of unassociated haplotypes can be shrunk toward zero, allowing for more precise estimation of associated haplotypes, especially those that are rare, thereby achieving greater detection power. We evaluate famLBL using simulation to gauge its type I error and power. Compared with its population counterpart, LBL, highlights famLBL's robustness property in the presence of population substructure. Further investigation by comparing famLBL with Family-Based Association Test (FBAT) reveals its advantage for detecting rare haplotype association. famLBL is implemented as an R-package available at http://www.stat.osu.edu/∼statgen/SOFTWARE/LBL/. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  6. The putative oncogene Pim-1 in the mouse: its linkage and variation among t haplotypes.

    PubMed

    Nadeau, J H; Phillips, S J

    1987-11-01

    Pim-1, a putative oncogene involved in T-cell lymphomagenesis, was mapped between the pseudo-alpha globin gene Hba-4ps and the alpha-crystallin gene Crya-1 on mouse chromosome 17 and therefore within the t complex. Pim-1 restriction fragment variants were identified among t haplotypes. Analysis of restriction fragment sizes obtained with 12 endonucleases demonstrated that the Pim-1 genes in some t haplotypes were indistinguishable from the sizes for the Pim-1b allele in BALB/c inbred mice. There are now three genes, Pim-1, Crya-1 and H-2 I-E, that vary among independently derived t haplotypes and that have indistinguishable alleles in t haplotypes and inbred strains. These genes are closely linked within the distal inversion of the t complex. Because it is unlikely that these variants arose independently in t haplotypes and their wild-type homologues, we propose that an exchange of chromosomal segments, probably through double crossingover, was responsible for indistinguishable Pim-1 genes shared by certain t haplotypes and their wild-type homologues. There was, however, no apparent association between variant alleles of these three genes among t haplotypes as would be expected if a single exchange introduced these alleles into t haplotypes. If these variant alleles can be shown to be identical to the wild-type allele, then lack of association suggests that multiple exchanges have occurred during the evolution of the t complex.

  7. Whole genome sequencing of Salmonella Typhimurium illuminates distinct outbreaks caused by an endemic multi-locus variable number tandem repeat analysis type in Australia, 2014.

    PubMed

    Phillips, Anastasia; Sotomayor, Cristina; Wang, Qinning; Holmes, Nadine; Furlong, Catriona; Ward, Kate; Howard, Peter; Octavia, Sophie; Lan, Ruiting; Sintchenko, Vitali

    2016-09-15

    Salmonella Typhimurium (STM) is an important cause of foodborne outbreaks worldwide. Subtyping of STM remains critical to outbreak investigation, yet current techniques (e.g. multilocus variable number tandem repeat analysis, MLVA) may provide insufficient discrimination. Whole genome sequencing (WGS) offers potentially greater discriminatory power to support infectious disease surveillance. We performed WGS on 62 STM isolates of a single, endemic MLVA type associated with two epidemiologically independent, food-borne outbreaks along with sporadic cases in New South Wales, Australia, during 2014. Genomes of case and environmental isolates were sequenced using HiSeq (Illumina) and the genetic distance between them was assessed by single nucleotide polymorphism (SNP) analysis. SNP analysis was compared to the epidemiological context. The WGS analysis supported epidemiological evidence and genomes of within-outbreak isolates were nearly identical. Sporadic cases differed from outbreak cases by a small number of SNPs, although their close relationship to outbreak cases may represent an unidentified common food source that may warrant further public health follow up. Previously unrecognised mini-clusters were detected. WGS of STM can discriminate foodborne community outbreaks within a single endemic MLVA clone. Our findings support the translation of WGS into public health laboratory surveillance of salmonellosis.

  8. A comprehensive literature review of haplotyping software and methods for use with unrelated individuals.

    PubMed

    Salem, Rany M; Wessel, Jennifer; Schork, Nicholas J

    2005-03-01

    Interest in the assignment and frequency analysis of haplotypes in samples of unrelated individuals has increased immeasurably as a result of the emphasis placed on haplotype analyses by, for example, the International HapMap Project and related initiatives. Although there are many available computer programs for haplotype analysis applicable to samples of unrelated individuals, many of these programs have limitations and/or very specific uses. In this paper, the key features of available haplotype analysis software for use with unrelated individuals, as well as pooled DNA samples from unrelated individuals, are summarised. Programs for haplotype analysis were identified through keyword searches on PUBMED and various internet search engines, a review of citations from retrieved papers and personal communications, up to June 2004. Priority was given to functioning computer programs, rather than theoretical models and methods. The available software was considered in light of a number of factors: the algorithm(s) used, algorithm accuracy, assumptions, the accommodation of genotyping error, implementation of hypothesis testing, handling of missing data, software characteristics and web-based implementations. Review papers comparing specific methods and programs are also summarised. Forty-six haplotyping programs were identified and reviewed. The programs were divided into two groups: those designed for individual genotype data (a total of 43 programs) and those designed for use with pooled DNA samples (a total of three programs). The accuracy of programs using various criteria are assessed and the programs are categorised and discussed in light of: algorithm and method, accuracy, assumptions, genotyping error, hypothesis testing, missing data, software characteristics and web implementation. Many available programs have limitations (eg some cannot accommodate missing data) and/or are designed with specific tasks in mind (eg estimating haplotype frequencies rather than

  9. Characterization of swine leukocyte antigen alleles and haplotypes on a novel miniature pig line, Microminipig.

    PubMed

    Ando, A; Imaeda, N; Ohshima, S; Miyamoto, A; Kaneko, N; Takasu, M; Shiina, T; Kulski, J K; Inoko, H; Kitagawa, H

    2014-12-01

    Microminipigs are extremely small-sized, novel miniature pigs that were recently developed for medical research. The inbred Microminipigs with defined swine leukocyte antigen (SLA) haplotypes are expected to be useful for allo- and xenotransplantation studies and also for association analyses between SLA haplotypes and immunological traits. To establish SLA-defined Microminipig lines, we characterized the polymorphic SLA alleles for three class I (SLA-1, SLA-2 and SLA-3) and two class II (SLA-DRB1 and SLA-DQB1) genes of 14 parental Microminipigs using a high-resolution nucleotide sequence-based typing method. Eleven class I and II haplotypes, including three recombinant haplotypes, were found in the offspring of the parental Microminipigs. Two class I and class II haplotypes, Hp-31.0 (SLA-1*1502-SLA-3*070102-SLA-2*1601) and Hp-0.37 (SLA-DRB1*0701-SLA-DQB1*0502), are novel and have not so far been reported in other pig breeds. Crossover regions were defined by the analysis of 22 microsatellite markers within the SLA class III region of three recombinant haplotypes. The SLA allele and haplotype information of Microminipigs in this study will be useful to establish SLA homozygous lines including three recombinants for transplantation and immunological studies. © 2014 Stichting International Foundation for Animal Genetics.

  10. Massively parallel haplotyping on microscopic beads for the high-throughput phase analysis of single molecules.

    PubMed

    Boulanger, Jérôme; Muresan, Leila; Tiemann-Boege, Irene

    2012-01-01

    In spite of the many advances in haplotyping methods, it is still very difficult to characterize rare haplotypes in tissues and different environmental samples or to accurately assess the haplotype diversity in large mixtures. This would require a haplotyping method capable of analyzing the phase of single molecules with an unprecedented throughput. Here we describe such a haplotyping method capable of analyzing in parallel hundreds of thousands single molecules in one experiment. In this method, multiple PCR reactions amplify different polymorphic regions of a single DNA molecule on a magnetic bead compartmentalized in an emulsion drop. The allelic states of the amplified polymorphisms are identified with fluorescently labeled probes that are then decoded from images taken of the arrayed beads by a microscope. This method can evaluate the phase of up to 3 polymorphisms separated by up to 5 kilobases in hundreds of thousands single molecules. We tested the sensitivity of the method by measuring the number of mutant haplotypes synthesized by four different commercially available enzymes: Phusion, Platinum Taq, Titanium Taq, and Phire. The digital nature of the method makes it highly sensitive to detecting haplotype ratios of less than 1:10,000. We also accurately quantified chimera formation during the exponential phase of PCR by different DNA polymerases.

  11. Single nucleotide polymorphisms and microsatellites in the canine glutathione S-transferase pi 1 (GSTP1) gene promoter.

    PubMed

    Sacco, James; Mann, Sarah; Toral, Keller

    2017-01-01

    Genetic polymorphisms within the glutathione S-transferase P1 ( GSTP1 ) gene affect the elimination of toxic xenobiotics by the GSTP1 enzyme. In dogs, exposure to environmental chemicals that may be GSTP1 substrates is associated with cancer. The objectives of this study were to investigate the genetic variability in the GSTP1 promoter in a diverse population of 278 purebred dogs, compare the incidence of any variants found between breeds, and predict their effects on gene expression. To provide information on ancestral alleles, a number of wolves, coyotes, and foxes were also sequenced. Fifteen single nucleotide polymorphisms (SNPs) and two microsatellites were discovered. Three of these loci were only polymorphic in dogs while three other SNPs were unique to wolves and coyotes. The major allele at c.-46 is T in dogs but is C in the wild canids. The c.-185 delT variant was unique to dogs. The microsatellite located in the 5' untranslated region (5'UTR) was a highly polymorphic GCC tandem repeat, consisting of simple and compound alleles that varied in size from 10 to 22-repeat units. The most common alleles consisted of 11, 16, and 17-repeats. The 11-repeat allele was found in 10% of dogs but not in the other canids. Unequal recombination and replication slippage between similar and distinct alleles may be the mechanism for the multiple microsatellites observed. Twenty-eight haplotypes were constructed in the dog, and an additional 8 were observed in wolves and coyotes. While the most common haplotype acrossbreeds was the wild-type *1A(17), other prevalent haplotypes included *3A(11) in Greyhounds, *6A(16) in Labrador Retrievers, *9A(16) in Golden Retrievers, and *8A(19) in Standard Poodles. Boxers and Siberian Huskies exhibited minimal haplotypic diversity. Compared to the simple 16*1 allele, the compound 16*2 allele (found in 12% of dogs) may interfere with transcription factor binding and/or the stability of the GSTP1 transcript. Dogs and other canids exhibit

  12. Haplotype analysis of the apolipoprotein A5 gene in obese pediatric patients.

    PubMed

    Horvatovich, Katalin; Bokor, Szilvia; Baráth, Akos; Maász, Anita; Kisfali, Péter; Járomi, Luca; Polgár, Noémi; Tóth, Dénes; Répásy, Judit; Endreffy, Emoke; Molnár, Dénes; Melegh, Béla

    2011-06-01

    Apolipoprotein A5 (APOA5) gene variants have been shown to be associated with elevated TG levels; the T-1131C (rs662799) variant has been reported to confer risk for the metabolic syndrome in adult populations. Little is known about the APOA5 variants in pediatric population, no such information is available for pediatric obesity at all. Here we examined four haplotype-tagging polymorphisms (T-1131C, IVS3 + G476A [rs2072560], T1259C [rs2266788] and C56G [rs3135506]) and studied also the frequency of major naturally occurring haplotypes of APOA5 in obese children. The polymorphisms were analyzed in 232 obese children, and in 137 healthy, normal weight controls, using PCR-RFLP methods. In the pediatric patients we could confirm the already known adult subjects based association of -1131C, IVS3 + 476A and 1259C variants with elevated triglyceride concentrations, both in obese patients and in the controls. The prevalence of the APOA5*2 haplotype (containing the minor allele of T-1131C, IVS3 + G476A and T1259C SNPs together) was 15.5% in obese children, and 5.80% in the controls (p<0.001); multiple logistic regression analysis revealed that this haplotype confers susceptibility for development of obesity (OR=2.87; 95% CI: 1.29-6.37; p≤0.01). By contrast, the APOA5*4 haplotype (with -1131C alone) did not show similar associations. Our findings also suggest that the APOA5*5 haplotype (1259C alone) can be protective against obesity (OR=0.25; 95% CI: 0.07-0.80; p<0.05). While previous studies in adults demonstrated, that the APOA5 -1131C minor allele confers risk for adult metabolic syndrome, here we show, that the susceptibility nature of this SNP restricted to the APOA5*2 haplotype in pediatric obese subjects.

  13. Ruminant Rhombencephalitis-Associated Listeria monocytogenes Alleles Linked to a Multilocus Variable-Number Tandem-Repeat Analysis Complex ▿ †

    PubMed Central

    Balandyté, Lina; Brodard, Isabelle; Frey, Joachim; Oevermann, Anna; Abril, Carlos

    2011-01-01

    Listeria monocytogenes is among the most important food-borne pathogens and is well adapted to persist in the environment. To gain insight into the genetic relatedness and potential virulence of L. monocytogenes strains causing central nervous system (CNS) infections, we used multilocus variable-number tandem-repeat analysis (MLVA) to subtype 183 L. monocytogenes isolates, most from ruminant rhombencephalitis and some from human patients, food, and the environment. Allelic-profile-based comparisons grouped L. monocytogenes strains mainly into three clonal complexes and linked single-locus variants (SLVs). Clonal complex A essentially consisted of isolates from human and ruminant brain samples. All but one rhombencephalitis isolate from cattle were located in clonal complex A. In contrast, food and environmental isolates mainly clustered into clonal complex C, and none was classified as clonal complex A. Isolates of the two main clonal complexes (A and C) obtained by MLVA were analyzed by PCR for the presence of 11 virulence-associated genes (prfA, actA, inlA, inlB, inlC, inlD, inlE, inlF, inlG, inlJ, and inlC2H). Virulence gene analysis revealed significant differences in the actA, inlF, inlG, and inlJ allelic profiles between clinical isolates (complex A) and nonclinical isolates (complex C). The association of particular alleles of actA, inlF, and newly described alleles of inlJ with isolates from CNS infections (particularly rhombencephalitis) suggests that these virulence genes participate in neurovirulence of L. monocytogenes. The overall absence of inlG in clinical complex A and its presence in complex C isolates suggests that the InlG protein is more relevant for the survival of L. monocytogenes in the environment. PMID:21984240

  14. Global selection on sucrose synthase haplotypes during a century of wheat breeding.

    PubMed

    Hou, Jian; Jiang, Qiyan; Hao, Chenyang; Wang, Yuquan; Zhang, Hongna; Zhang, Xueyong

    2014-04-01

    Spike number per unit area, number of grains per spike, and thousand kernel weight (TKW) are important yield components. In China, increases in wheat (Triticum aestivum) yields are mainly due to increases in grain number per spike and TKW. TKW mainly depends on starch content, as starch accounts for about 70% of the grain endosperm. Sucrose synthase catalysis is the first step in the conversion of sucrose to starch, that is, the conversion of sucrose to fructose and UDP-glucose by the wheat sucrose synthase genes (TaSus1 and TaSus2) that are located on chromosomes 7A/7B/7D and 2A/2B/2D, respectively. A total of 1,520 wheat accessions were genotyped at the six loci. Two, two, five, and two haplotypes were identified at the TaSus2-2A, TaSus2-2B, TaSus1-7A, and TaSus1-7B loci, respectively. Their main variations were detected within the introns. Significant differences between the haplotypes correlated with TKW differences among 348 modern Chinese cultivars from the core collection. Frequency changes for favored haplotypes showed gradual increases in cultivars released since beginning of the last century in China, Europe, and North America. Geographic distributions and time changes of favored haplotypes were characterized in six major wheat production regions worldwide. Strong selection bottlenecks to haplotype variations occurred at polyploidization and domestication and during breeding of wheat. Genetic-effect differences between haplotypes at the same locus influence the selection time and intensity. This work shows that the endosperm starch synthesis pathway is a major target of indirect selection in global wheat breeding for higher yield.

  15. Orthogonal tandem catalysis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lohr, Tracy L.; Marks, Tobin J.

    2015-05-20

    Tandem catalysis is a growing field that is beginning to yield important scientific and technological advances toward new and more efficient catalytic processes. 'One-pot' tandem reactions, where multiple catalysts and reagents, combined in a single reaction vessel undergo a sequence of precisely staged catalytic steps, are highly attractive from the standpoint of reducing both waste and time. Orthogonal tandem catalysis is a subset of one-pot reactions in which more than one catalyst is used to promote two or more mechanistically distinct reaction steps. This Perspective summarizes and analyses some of the recent developments and successes in orthogonal tandem catalysis, withmore » particular focus on recent strategies to address catalyst incompatibility. We also highlight the concept of thermodynamic leveraging by coupling multiple catalyst cycles to effect challenging transformations not observed in single-step processes, and to encourage application of this technique to energetically unfavourable or demanding reactions.« less

  16. Forensic and population genetic analysis of Xinjiang Uyghur population on 21 short tandem repeat loci of 6-dye GlobalFiler™ PCR Amplification kit.

    PubMed

    Zhang, Honghua; Xia, Mingying; Qi, Lijie; Dong, Lei; Song, Shuang; Ma, Teng; Yang, Shuping; Jin, Li; Li, Liming; Li, Shilin

    2016-05-01

    Estimating the allele frequencies and forensic statistical parameters of commonly used short tandem repeat (STR) loci of the Uyghur population, which is the fifth largest group in China, provides a more precise reference database for forensic investigation. The 6-dye GlobalFiler™ Express PCR Amplification kit incorporates 21 autosomal STRs, which have been proven that could provide reliable DNA typing results and enhance the power of discrimination. Here we analyzed the GlobalFiler STR loci on 1962 unrelated individuals from Chinese Uyghur population of Xinjiang, China. No significant deviations from Hardy-Weinberg equilibrium and linkage disequilibrium were detected within and between the GlobalFiler STR loci. SE33 showed the greatest power of discrimination in Uyghur population, whereas TPOX showed the lowest. The combined power of discrimination was 99.999999999999999999999998746%. No significant difference was observed between Uyghur and the other two Uyghur populations at all tested STRs, as well as Dai and Mongolian. Significant differences were only observed between Uyghur and other Chinese populations at TH01, as well as Central-South Asian at D13S317, East Asian at TH01 and VWA. The phylogenetic analysis showed that Uyghur is genetically close to Chinese populations, as well as East Asian and Central-South Asian. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  17. Aldehyde dehydrogenase-2 genotypes and HLA haplotypes in Japanese patients with esophageal cancer.

    PubMed

    Watanabe, Seishiro; Sasahara, Katsuyuki; Kinekawa, Fumihiko; Uchida, Naohito; Masaki, Tsutomu; Kurokohchi, Kazutaka; Murota, Masayuki; Touge, Tetsuo; Kawauchi, Kazuyoshi; Oda, Syuji; Kuriyama, Shigeki

    2002-01-01

    The aim of this study was to examine how aldehyde dehydrogenase-2 (ALDH2) genotypes and human leukocyte antigen (HLA) haplotypes contribute to the risk for esophageal cancer. We examined ALDH2 genotypes and HLA haplotypes in 29 Japanese patients with esophageal cancer. The ratio of patients who experienced current or former intense vasodilatation upon consuming alcohol (flushing type) was much higher in individuals with the inactive form of ALDH2 encoded by the ALDH2(2)/2(2) or ALDH2(1)/2(2) genotype than in those with the active form of ALDH2 encoded by the ALDH2(1)/2(1) genotype. The ratio of inactive ALDH2 was significantly higher in patients with esophageal cancer than in control normal subjects, suggesting that alcoholics with inactive ALDH2 were susceptible to esophageal cancer. HLA haplotypes A24, A26, B54, B61 and DR9 were prevalent in patients with esophageal cancer (82.8, 24.1, 34.5, 37.9 and 44.8%, respectively). HLA haplotype of A24 and inactive ALDH2 were simultaneously found in 58.6% of patients with esophageal cancer. Furthermore, we found other primary malignancies in 6 of 29 (20.7%) patients with esophageal cancer, and 4 of these 6 patients had both the inactive form of ALDH2 and the HLA A24 haplotype. The present study showed the high prevalence of the inactive form of ALDH2 and HLA haplotypes A24, A26, B54, B61 and DR9 in Japanese patients with esophageal cancer. Therefore, the examination of genotypes of ALDH2 loci and HLA haplotypes may allow the early detection of esophageal cancer in the Japanese population.

  18. Association between β2-adrenoceptor (ADRB2) haplotypes and insulin resistance in PCOS.

    PubMed

    Tellechea, Mariana L; Muzzio, Damián O; Iglesias Molli, Andrea E; Belli, Susana H; Graffigna, Mabel N; Levalle, Oscar A; Frechtel, Gustavo D; Cerrone, Gloria E

    2013-04-01

    The aim of this study was to explore β2-adrenoceptor (ADRB2) haplotype associations with phenotypes and quantitative traits related to insulin resistance (IR) and the metabolic syndrome (MS) in a polycystic ovary syndrome (PCOS) population. A secondary purpose was to assess the association between ADRB2 haplotype and PCOS. Genetic polymorphism analysis. Cross-sectional case-control association study. Medical University Hospital and research laboratory. One hundred and sixty-five unrelated women with PCOS and 116 unrelated women without PCOS (control sample). Clinical and biochemical measurements, and ADRB2 genotyping in PCOS patients and control subjects. ADRB2 haplotypes (comprising rs1042711, rs1801704, rs1042713 and rs1042714 in that order), genotyping and statistical analysis to evaluate associations with continuous variables and traits related to IR and MS in a PCOS population. Associations between ADRB2 haplotypes and PCOS were also assessed. We observed an age-adjusted association between ADRB2 haplotype CCGG and lower insulin (P = 0·018) and HOMA (P = 0·008) in the PCOS sample. Interestingly, the expected differences in surrogate measures of IR between cases and controls were not significant in CCGG/CCGG carriers. In the case-control study, genotype CCGG/CCGG was associated with a 14% decrease in PCOS risk (P = 0·043), taking into account confounding variables. Haplotype I (CCGG) has a protective role for IR and MS in PCOS. © 2012 Blackwell Publishing Ltd.

  19. RENT+: an improved method for inferring local genealogical trees from haplotypes with recombination

    PubMed Central

    Mirzaei, Sajad; Wu, Yufeng

    2017-01-01

    Abstract Motivation: Haplotypes from one or multiple related populations share a common genealogical history. If this shared genealogy can be inferred from haplotypes, it can be very useful for many population genetics problems. However, with the presence of recombination, the genealogical history of haplotypes is complex and cannot be represented by a single genealogical tree. Therefore, inference of genealogical history with recombination is much more challenging than the case of no recombination. Results: In this paper, we present a new approach called RENT+ for the inference of local genealogical trees from haplotypes with the presence of recombination. RENT+ builds on a previous genealogy inference approach called RENT, which infers a set of related genealogical trees at different genomic positions. RENT+ represents a significant improvement over RENT in the sense that it is more effective in extracting information contained in the haplotype data about the underlying genealogy than RENT. The key components of RENT+ are several greatly enhanced genealogy inference rules. Through simulation, we show that RENT+ is more efficient and accurate than several existing genealogy inference methods. As an application, we apply RENT+ in the inference of population demographic history from haplotypes, which outperforms several existing methods. Availability and Implementation: RENT+ is implemented in Java, and is freely available for download from: https://github.com/SajadMirzaei/RentPlus. Contacts: sajad@engr.uconn.edu or ywu@engr.uconn.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:28065901

  20. Inheritance of Hetero-Diploid Pollen S-Haplotype in Self-Compatible Tetraploid Chinese Cherry (Prunus pseudocerasus Lindl)

    PubMed Central

    Gu, Chao; Liu, Qing-Zhong; Yang, Ya-Nan; Zhang, Shu-Jun; Khan, Muhammad Awais; Wu, Jun; Zhang, Shao-Ling

    2013-01-01

    The breakdown of self-incompatibility, which could result from the accumulation of non-functional S-haplotypes or competitive interaction between two different functional S-haplotypes, has been studied extensively at the molecular level in tetraploid Rosaceae species. In this study, two tetraploid Chinese cherry (Prunus pseudocerasus) cultivars and one diploid sweet cherry (Prunus avium) cultivar were used to investigate the ploidy of pollen grains and inheritance of pollen-S alleles. Genetic analysis of the S-genotypes of two intercross-pollinated progenies showed that the pollen grains derived from Chinese cherry cultivars were hetero-diploid, and that the two S-haplotypes were made up of every combination of two of the four possible S-haplotypes. Moreover, the distributions of single S-haplotypes expressed in self- and intercross-pollinated progenies were in disequilibrium. The number of individuals of the two different S-haplotypes was unequal in two self-pollinated and two intercross-pollinated progenies. Notably, the number of individuals containing two different S-haplotypes (S1- and S5-, S5- and S8-, S1- and S4-haplotype) was larger than that of other individuals in the two self-pollinated progenies, indicating that some of these hetero-diploid pollen grains may have the capability to inactivate stylar S-RNase inside the pollen tube and grow better into the ovaries. PMID:23596519

  1. African gene flow to north Brazil as revealed by HBB*S gene haplotype analysis.

    PubMed

    Lemos Cardoso, Greice; Farias Guerreiro, João

    2006-01-01

    Haplotypes linked to the HBB*S gene were analyzed in a sample of 260 chromosomes of Brazilian sickle cell anemia patients from the population of Belém, state of Pará, to evaluate if the present-day haplotype frequencies correlate as well as expected with historical information on the geographic origin of African slaves sent directly to Northern Brazil. The HBB*S gene haplotype distribution (66% Bantu, 21.8% Benin, 10.9% Senegal, and 1.3% Cameroon) is in agreement with those observed for other Brazilian populations regarding the highest proportion of the Bantu type, followed by the Benin type, but it differs significantly concerning the Senegal type as this haplotype is rare or absent in samples from other Brazilian regions already studied. In addition, our results are in accordance with historical records that establish that about 90% of the slaves sent to Northern Brazil were from Angola, Congo, and Mozambique, where the Bantu haplotype predominates, in contrast to 10% of slaves from Senegambia, Guine-Bissau, and Cape Verde, where the Senegal haplotype is the most common. On the other hand, the observed frequency of the Benin haplotype in Belém was much higher than that expected by historical data. This fact corroborates the suggestion that the high prevalence of the Benin type in Belém is due to domestic slave trade and later internal migrations, mainly from the Northeast, since there are no historical records of direct slave trade from Central West Africa to North Brazil. Am. J. Hum. Biol. 18:93-98, 2006. (c) 2005 Wiley-Liss, Inc.

  2. Practical interpretation of CYP2D6 haplotypes: Comparison and integration of automated and expert calling.

    PubMed

    Ruaño, Gualberto; Kocherla, Mohan; Graydon, James S; Holford, Theodore R; Makowski, Gregory S; Goethe, John W

    2016-05-01

    We describe a population genetic approach to compare samples interpreted with expert calling (EC) versus automated calling (AC) for CYP2D6 haplotyping. The analysis represents 4812 haplotype calls based on signal data generated by the Luminex xMap analyzers from 2406 patients referred to a high-complexity molecular diagnostics laboratory for CYP450 testing. DNA was extracted from buccal swabs. We compared the results of expert calls (EC) and automated calls (AC) with regard to haplotype number and frequency. The ratio of EC to AC was 1:3. Haplotype frequencies from EC and AC samples were convergent across haplotypes, and their distribution was not statistically different between the groups. Most duplications required EC, as only expansions with homozygous or hemizygous haplotypes could be automatedly called. High-complexity laboratories can offer equivalent interpretation to automated calling for non-expanded CYP2D6 loci, and superior interpretation for duplications. We have validated scientific expert calling specified by scoring rules as standard operating procedure integrated with an automated calling algorithm. The integration of EC with AC is a practical strategy for CYP2D6 clinical haplotyping. Copyright © 2016 Elsevier B.V. All rights reserved.

  3. Multi-locus variable-number tandem repeat analysis of Bordetella pertussis isolates circulating in Poland in the period 1959-2013.

    PubMed

    Mosiej, Ewa; Krysztopa-Grzybowska, Katarzyna; Polak, Maciej; Prygiel, Marta; Lutyńska, Anna

    2017-06-01

    Despite the long history of pertussis vaccination and high vaccination coverage in Poland and many other developed countries, pertussis incidence rates have increased substantially, making whooping cough one of the most prevalent vaccine-preventable diseases. Among the factors potentially involved in pertussis resurgence, the adaptation of the Bordetella pertussis population to country-specific vaccine-induced immunity through selection of non-vaccine-type strains still needs detailed studies. Multi-locus variable-number tandem repeat analysis (MLVA), also linked to MLST and PFGE profiling, was applied to trace the genetic changes in the B. pertussis population circulating in Poland in the period 1959-2013 versus country-specific vaccine strains. Generally, among 174 B. pertussis isolates, 31 MLVA types were detected, of which 11 were not described previously. The predominant MLVA types of recent isolates in Poland were different from those of the typical isolates circulating in other European countries. The MT27 type, currently predominant in Europe, was rarely seen and detected in only five isolates among all studied. The features of the vaccine strains used for production of the pertussis component of a national whole-cell diphtheria-tetanus-pertussis (DTP) vaccine, as studied by MLVA and MLST tools, were found to not match those observed in the currently circulating B. pertussis isolates in Poland. Differences traced by MLVA in relation to the MLST and PFGE profiling confirmed that the B. pertussis strain types currently observed elsewhere in Europe, even if appearing in Poland, were not able to successfully disseminate within a human population in Poland that has been vaccinated with a whole-cell pertussis vaccine not used in other countries.

  4. The EmsB Tandemly Repeated Multilocus Microsatellite: a New Tool To Investigate Genetic Diversity of Echinococcus granulosus Sensu Lato▿

    PubMed Central

    Maillard, S.; Gottstein, B.; Haag, K. L.; Ma, S.; Colovic, I.; Benchikh-Elfegoun, M. C.; Knapp, J.; Piarroux, R.

    2009-01-01

    Cystic echinococcosis (CE) is a widespread and severe zoonotic disease caused by infection with the larval stage of the eucestode Echinococcus granulosus sensu lato. The polymorphism exhibited by nuclear and mitochondrial markers conventionally used for the genotyping of different parasite species and strains does not reach the level necessary for the identification of genetic variants linked to restricted geographical areas. EmsB is a tandemly repeated multilocus microsatellite that proved its usefulness for the study of genetic polymorphisms within the species E. multilocularis, the causative agent of alveolar echinococcosis. In the present study, EmsB was used to characterize E. granulosus sensu lato samples collected from different host species (sheep, cattle, dromedaries, dogs, and human patients) originating from six different countries (Algeria, Mauritania, Romania, Serbia, Brazil, and the People's Republic of China). The conventional mitochondrial cox1 and nad1 markers identified genotypes G1, G3, G5, G6, and G7, which are clustered into three groups corresponding to the species E. granulosus sensu stricto, E. ortleppi, and E. canadensis. With the same samples, EmsB provided a higher degree of genetic discrimination and identified variations that correlated with the relatively small-scale geographic origins of the samples. In addition, one of the Brazilian single hydatid cysts presented a hybrid genotypic profile that suggested genetic exchanges between E. granulosus sensu stricto and E. ortleppi. In summary, the EmsB microsatellite exhibits an interesting potential for the elaboration of a detailed map of the distribution of genetic variants and therefore for the determination and tracking of the source of CE. PMID:19741078

  5. BAsE-Seq: a method for obtaining long viral haplotypes from short sequence reads.

    PubMed

    Hong, Lewis Z; Hong, Shuzhen; Wong, Han Teng; Aw, Pauline P K; Cheng, Yan; Wilm, Andreas; de Sessions, Paola F; Lim, Seng Gee; Nagarajan, Niranjan; Hibberd, Martin L; Quake, Stephen R; Burkholder, William F

    2014-01-01

    We present a method for obtaining long haplotypes, of over 3 kb in length, using a short-read sequencer, Barcode-directed Assembly for Extra-long Sequences (BAsE-Seq). BAsE-Seq relies on transposing a template-specific barcode onto random segments of the template molecule and assembling the barcoded short reads into complete haplotypes. We applied BAsE-Seq on mixed clones of hepatitis B virus and accurately identified haplotypes occurring at frequencies greater than or equal to 0.4%, with >99.9% specificity. Applying BAsE-Seq to a clinical sample, we obtained over 9,000 viral haplotypes, which provided an unprecedented view of hepatitis B virus population structure during chronic infection. BAsE-Seq is readily applicable for monitoring quasispecies evolution in viral diseases.

  6. Software for peak finding and elemental composition assignment for glycosaminoglycan tandem mass spectra.

    PubMed

    Hogan, John D; Klein, Joshua A; Wu, Jiandong; Chopra, Pradeep; Boons, Geert-Jan; Carvalho, Luis; Lin, Cheng; Zaia, Joseph

    2018-04-03

    Glycosaminoglycans (GAGs) covalently linked to proteoglycans (PGs) are characterized by repeating disaccharide units and variable sulfation patterns along the chain. GAG length and sulfation patterns impact disease etiology, cellular signaling, and structural support for cells. We and others have demonstrated the usefulness of tandem mass spectrometry (MS2) for assigning the structures of GAG saccharides; however, manual interpretation of tandem mass spectra is time-consuming, so computational methods must be employed. In the proteomics domain, the identification of monoisotopic peaks and charge states relies on algorithms that use averagine, or the average building block of the compound class being analyzed. While these methods perform well for protein and peptide spectra, they perform poorly on GAG tandem mass spectra, due to the fact that a single average building block does not characterize the variable sulfation of GAG disaccharide units. In addition, it is necessary to assign product ion isotope patterns in order to interpret the tandem mass spectra of GAG saccharides. To address these problems, we developed GAGfinder, the first tandem mass spectrum peak finding algorithm developed specifically for GAGs. We define peak finding as assigning experimental isotopic peaks directly to a given product ion composition, as opposed to deconvolution or peak picking, which are terms more accurately describing the existing methods previously mentioned. GAGfinder is a targeted, brute force approach to spectrum analysis that utilizes precursor composition information to generate all theoretical fragments. GAGfinder also performs peak isotope composition annotation, which is typically a subsequent step for averagine-based methods. Data are available via ProteomeXchange with identifier PXD009101. Published under license by The American Society for Biochemistry and Molecular Biology, Inc.

  7. Phylogeny and Haplotype Analysis of Fungi Within the Fusarium incarnatum-equiseti Species Complex.

    PubMed

    Ramdial, H; Latchoo, R K; Hosein, F N; Rampersad, S N

    2017-01-01

    Fusarium spp. are ranked among the top 10 most economically and scientifically important plant-pathogenic fungi in the world and are associated with plant diseases that include fruit decay of a number of crops. Fusarium isolates infecting bell pepper in Trinidad were identified based on sequence comparisons of the translation elongation factor gene (EF-1a) with sequences of Fusarium incarnatum-equiseti species complex (FIESC) verified in the FUSARIUM-ID database. Eighty-two isolates were identified as belonging to one of four phylogenetic species within the subclades FIESC-1, FIESC-15, FIESC-16, and FIESC-26, with the majority of isolates belonging to FIESC-15. A comparison of the level of DNA polymorphism and phylogenetic inference for sequences of the internal transcribed spacer region (ITS1-5.8S-ITS2) and EF-1a sequences for Trinidad and FUSARIUM-ID type species was carried out. The ITS sequences were less informative, had lower haplotype diversity and restricted haplotype distribution, and resulted in poor resolution and taxa placement in the consensus maximum-likelihood tree. EF-1a sequences enabled strongly supported phylogenetic inference with highly resolved branching patterns of the 30 phylogenetic species within the FIESC and placement of representative Trinidad isolates. Therefore, global phylogeny was inferred from EF-1a sequences representing 11 countries, and separation into distinct Incarnatum and Equiseti clades was again evident. In total, 42 haplotypes were identified: 12 were shared and the remaining were unique haplotypes. The most diverse haplotype was represented by sequences from China, Indonesia, Malaysia, and Trinidad and consisted exclusively of F. incarnatum isolates. Spain had the highest haplotype diversity, perhaps because both F. equiseti and F. incarnatum sequences were represented; followed by the United States, which contributed both F. equiseti and F. incarnatum sequences to the data set; then by countries representing Southeast

  8. Occurrence of 15 Haplotypes of Linepithema micans (Hymenoptera: Formicidae) in Southern Brazil.

    PubMed

    Ramalho, Manuela Oliveira; Martins, C; Campos, T; Nondillo, A; Botton, M; Bueno, O C

    2017-08-01

    The ant genus Linepithema is widely known, thanks to the pest species Linepithema humile (Mayr), which is easily mistaken for Linepithema micans (Forel) due to their morphological similarity. Like L. humile, L. micans is associated to the main grapevine pest in Brazil, Eurhizococcus brasiliensis (Wille), also known as ground pearl. Therefore, the present study uses mtDNA fragments to expand the knowledge of haplotype diversity and distribution of L. micans in the state of Rio Grande do Sul (Brazil), to understand the genetic differences of the populations identified in this study. We identified 15 haplotypes of L. micans spread across different localities. Twelve of these haplotypes were new for the species. The high haplotype diversity uncovered in Rio Grande do Sul (Brazil) for this species was predictable, as L. micans is in its native environment. Additional studies that take gene flow into account may reveal interesting aspects of diversity in these populations. © The Authors 2017. Published by Oxford University Press on behalf of Entomological Society of America. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  9. Novel Harmful Recessive Haplotypes Identified for Fertility Traits in Nordic Holstein Cattle

    PubMed Central

    Sahana, Goutam; Nielsen, Ulrik Sander; Aamand, Gert Pedersen; Lund, Mogens Sandø; Guldbrandtsen, Bernt

    2013-01-01

    Using genomic data, lethal recessives may be discovered from haplotypes that are common in the population but never occur in the homozygote state in live animals. This approach only requires genotype data from phenotypically normal (i.e. live) individuals and not from the affected embryos that die. A total of 7,937 Nordic Holstein animals were genotyped with BovineSNP50 BeadChip and haplotypes including 25 consecutive markers were constructed and tested for absence of homozygotes states. We have identified 17 homozygote deficient haplotypes which could be loosely clustered into eight genomic regions harboring possible recessive lethal alleles. Effects of the identified haplotypes were estimated on two fertility traits: non-return rates and calving interval. Out of the eight identified genomic regions, six regions were confirmed as having an effect on fertility. The information can be used to avoid carrier-by-carrier mattings in practical animal breeding. Further, identification of causative genes/polymorphisms responsible for lethal effects will lead to accurate testing of the individuals carrying a lethal allele. PMID:24376603

  10. Integrating sequence and array data to create an improved 1000 Genomes Project haplotype reference panel.

    PubMed

    Delaneau, Olivier; Marchini, Jonathan

    2014-06-13

    A major use of the 1000 Genomes Project (1000 GP) data is genotype imputation in genome-wide association studies (GWAS). Here we develop a method to estimate haplotypes from low-coverage sequencing data that can take advantage of single-nucleotide polymorphism (SNP) microarray genotypes on the same samples. First the SNP array data are phased to build a backbone (or 'scaffold') of haplotypes across each chromosome. We then phase the sequence data 'onto' this haplotype scaffold. This approach can take advantage of relatedness between sequenced and non-sequenced samples to improve accuracy. We use this method to create a new 1000 GP haplotype reference set for use by the human genetic community. Using a set of validation genotypes at SNP and bi-allelic indels we show that these haplotypes have lower genotype discordance and improved imputation performance into downstream GWAS samples, especially at low-frequency variants.

  11. RENT+: an improved method for inferring local genealogical trees from haplotypes with recombination.

    PubMed

    Mirzaei, Sajad; Wu, Yufeng

    2017-04-01

    : Haplotypes from one or multiple related populations share a common genealogical history. If this shared genealogy can be inferred from haplotypes, it can be very useful for many population genetics problems. However, with the presence of recombination, the genealogical history of haplotypes is complex and cannot be represented by a single genealogical tree. Therefore, inference of genealogical history with recombination is much more challenging than the case of no recombination. : In this paper, we present a new approach called RENT+  for the inference of local genealogical trees from haplotypes with the presence of recombination. RENT+  builds on a previous genealogy inference approach called RENT , which infers a set of related genealogical trees at different genomic positions. RENT+  represents a significant improvement over RENT in the sense that it is more effective in extracting information contained in the haplotype data about the underlying genealogy than RENT . The key components of RENT+  are several greatly enhanced genealogy inference rules. Through simulation, we show that RENT+  is more efficient and accurate than several existing genealogy inference methods. As an application, we apply RENT+  in the inference of population demographic history from haplotypes, which outperforms several existing methods. : RENT+  is implemented in Java, and is freely available for download from: https://github.com/SajadMirzaei/RentPlus . : sajad@engr.uconn.edu or ywu@engr.uconn.edu. : Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  12. Italian familial defective apolipoprotein B patients share a unique haplotype with other Caucasian patients.

    PubMed

    Cefalù, A B; Barbagallo, C M; Sesti, E; Caldarella, R; Polizzi, F; Marino, G; Noto, D; Rolleri, M; Travali, S; Scalisi, G; Notarbartolo, A; Corsini, A; Bertolini, S; Averna, M R

    2001-09-01

    Familial defective apolipoprotein (apo) B-100 together with familial hypercholesterolemia are the two common genetic conditions that cause hypercholesterolemia. Familial defective apolipoprotein B-100 is due to mutations around codon 3500 of the apo B gene. The most-characterized mutation is a G>A transition at nucleotide 10,708 that results in the substitution of arginine by glutamine at codon 3500 (Apo B Arg3500Gln). Two other mutations are caused by a C>T transition, one at nucleotide 10,800 (Apo B Arg3531Cys) and the other at nucleotide 10,707 (apo B Arg3500Trp). In the present study we describe three new Italian cases of familial defective apolipoprotein B-100 (Apo B Arg3500Gln), one from the Liguria region and two from Sicily, and the haplotype of the apo B gene co-segregating with the mutation. By screening two groups of probands, clinically diagnosed as having Familial Hypercholesterolemia (700 from mainland Italy and 305 from Sicily), the prevalence of familial defective apolipoprotein B-100 due to Arg3500Gln was found to be very low (0.28% and 0.65%, respectively). The Arg3531Cys mutation was not detected in any proband. In the three new families with Arg3500Gln mutation in the present study and in one previously described in Italy, the mutation was associated with a unique apo B haplotype, which is consistent with data previously reported for Caucasian patients [XbaI-, MspI+, EcoRI-, presence of the 5' signal peptide insertion (Ins) allele, and the 49-repeat allele of the 3'-VNTR].

  13. Neuropsychiatric systemic lupus erythematosus is associated with imbalance in interleukin 10 promoter haplotypes

    PubMed Central

    Rood, M; Keijsers, V; van der Linden, M W; Tong, T; Borggreve, S; Verweij, C; Breedveld, F; Huizinga, T

    1999-01-01

    OBJECTIVE—To investigate the association of interleukin 10 (IL10) promoter polymorphisms and neuropsychiatric manifestations of systemic lupus erythematosus (SLE).
METHODS—IL10 haplotypes of 11 healthy volunteers were cloned to confirm that in the Dutch population, only the three common haplotypes (-1082/-819/-592) GCC, ACC and ATA exist. The IL10 promoter polymorphisms of 92 SLE patients and 162 healthy controls were determined. The medical records of the SLE patients were screened for the presence of neuropsychiatric involvement.
RESULTS—All cloned haplotypes were either GCC, ACC or ATA. Forty two SLE patients had suffered from neuropsychiatric manifestations (NP-SLE). In NP-SLE patients, the frequency of the ATA haplotype is 30% versus 18% in the controls and 17% in the non-NP-SLE group (odds ratios 1.9, p=0.02, and 2.1, p=0.04, respectively), whereas the GCC haplotype frequency is lower in the NP-SLE group compared with controls and non-NP-SLE patients (40% versus 55% and 61%, odds ratios 0.6, p=0.02 and 0.4 p=0.006). The odds ratio for the presence of NP-SLE is inversely proportional to the number of GCC haplotypes per genotype when the NP-SLE group is compared with non-NP-SLE patients.
CONCLUSIONS—The IL10 locus is associated with neuropsychiatric manifestations in SLE. This suggests that IL10 is implicated in the immunopathogenesis of neuropsychiatric manifestations in SLE.

 Keywords: systemic lupus erythematosus; neuropsychiatric manifestations; genetics; interleukin 10 promoter haplotypes PMID:10343522

  14. Mutation Analysis in Classical Phenylketonuria Patients Followed by Detecting Haplotypes Linked to Some PAH Mutations.

    PubMed

    Dehghanian, Fatemeh; Silawi, Mohammad; Tabei, Seyed M B

    2017-02-01

    Deficiency of phenylalanine hydroxylase (PAH) enzyme and elevation of phenylalanine in body fluids cause phenylketonuria (PKU). The gold standard for confirming PKU and PAH deficiency is detecting causal mutations by direct sequencing of the coding exons and splicing involved sequences of the PAH gene. Furthermore, haplotype analysis could be considered as an auxiliary approach for detecting PKU causative mutations before direct sequencing of the PAH gene by making comparisons between prior detected mutation linked-haplotypes and new PKU case haplotypes with undetermined mutations. In this study, 13 unrelated classical PKU patients took part in the study detecting causative mutations. Mutations were identified by polymerase chain reaction (PCR) and direct sequencing in all patients. After that, haplotype analysis was performed by studying VNTR and PAHSTR markers (linked genetic markers of the PAH gene) through application of PCR and capillary electrophoresis (CE). Mutation analysis was performed successfully and the detected mutations were as follows: c.782G>A, c.754C>T, c.842C>G, c.113-115delTCT, c.688G>A, and c.696A>G. Additionally, PAHSTR/VNTR haplotypes were detected to discover haplotypes linked to each mutation. Mutation detection is the best approach for confirming PAH enzyme deficiency in PKU patients. Due to the relatively large size of the PAH gene and high cost of the direct sequencing in developing countries, haplotype analysis could be used before DNA sequencing and mutation detection for a faster and cheaper way via identifying probable mutated exons.

  15. Divergence at the casein haplotypes in dairy and meat goat breeds.

    PubMed

    Küpper, Julia; Chessa, Stefania; Rignanese, Daniela; Caroli, Anna; Erhardt, Georg

    2010-02-01

    Casein genes have been proved to have an influence on milk properties, and are in addition appropriate for phylogeny studies. A large number of casein polymorphisms exist in goats, making their analysis quite complex. The four casein loci were analyzed by molecular techniques for genetic polymorphism detection in the two dairy goat breeds Bunte Deutsche Edelziege (BDE; n=96), Weisse Deutsche Edelziege (WDE; n=91), and the meat goat breed Buren (n=75). Of the 35 analyzed alleles, 18 were found in BDE, and 17 in Buren goats and WDE. In addition, a new allele was identified at the CSN1S1 locus in the BDE, showing a frequency of 0.05. This variant, named CSN1S1*A', is characterized by a t-->c transversion in intron 9. Linkage disequilibrium was found at the casein haplotype in all three breeds. A total of 30 haplotypes showed frequencies higher than 0.01. In the Buren breed only one haplotype showed a frequency higher than 0.1. The ancestral haplotype B-A-A-B (in the order: CSN1S1-CSN2-CSN1S2-CSN3) occurred in all three breeds, showing a very high frequency (>0.8) in the Buren.

  16. Gene Conversion Violates the Stepwise Mutation Model for Microsatellites in Y-Chromosomal Palindromic Repeats

    PubMed Central

    Balaresque, Patricia; King, Turi E; Parkin, Emma J; Heyer, Evelyne; Carvalho-Silva, Denise; Kraaijenbrink, Thirsa; de Knijff, Peter; Tyler-Smith, Chris; Jobling, Mark A

    2014-01-01

    The male-specific region of the human Y chromosome (MSY) contains eight large inverted repeats (palindromes), in which high-sequence similarity between repeat arms is maintained by gene conversion. These palindromes also harbor microsatellites, considered to evolve via a stepwise mutation model (SMM). Here, we ask whether gene conversion between palindrome microsatellites contributes to their mutational dynamics. First, we study the duplicated tetranucleotide microsatellite DYS385a,b lying in palindrome P4. We show, by comparing observed data with simulated data under a SMM within haplogroups, that observed heteroallelic combinations in which the modal repeat number difference between copies was large, can give rise to homoallelic combinations with zero-repeats difference, equivalent to many single-step mutations. These are unlikely to be generated under a strict SMM, suggesting the action of gene conversion. Second, we show that the intercopy repeat number difference for a large set of duplicated microsatellites in all palindromes in the MSY reference sequence is significantly reduced compared with that for nonpalindrome-duplicated microsatellites, suggesting that the former are characterized by unusual evolutionary dynamics. These observations indicate that gene conversion violates the SMM for microsatellites in palindromes, homogenizing copies within individual Y chromosomes, but increasing overall haplotype diversity among chromosomes within related groups. PMID:24610746

  17. Haplotype Frequency Distribution in Northeastern European Saduria entomon (Crustacea: Isopoda) Populations. A Phylogeographic Approach

    NASA Astrophysics Data System (ADS)

    Sell, Jerzy

    2003-11-01

    The distribution pattern of mtDNA haplotypes in distinct populations of the glacial relict crustacean Saduria entomon was examined to assess phylogeographic relationships among them. Populations from the Baltic, the White Sea and the Barents Sea were screened for mtDNA variation using PCR-based RFLP analysis of a 1150 bp fragment containing part of the CO I and CO II genes. Five mtDNA haplotypes were recorded. An analysis of geographical heterogeneity in haplotype frequency distributions revealed significant differences among populations. The isolated populations of S. entomon have diverged since the retreat of the last glaciation. The geographical pattern of variation is most likely the result of stochastic (founder effect, genetic drift) mechanisms and suggests that the haplotype differentiation observed is probably older than the isolation of the Baltic and Arctic seas.

  18. Tyms double (2R) and triple repeat (3R) confers risk for human oral squamous cell carcinoma.

    PubMed

    Bezerra, Alexandre Medeiros; Sant'Ana, Thalita Araújo; Gomes, Adriana Vieira; de Lacerda Vidal, Aurora Karla; Muniz, Maria Tereza Cartaxo

    2014-12-01

    The oral cancer is responsible for approximately 3 % of cases of cancer in Brazil. Epidemiological studies have associated low folate intake with an increased risk of epithelial cancers, including oral cancer. Folic acid has a key role in DNA synthesis, repair, methylation and this is the basis of explanations for a putative role for folic acid in cancer prevention. The role of folic acid in carcinogenesis may be modulated by polymorphism C677T in MTHFR and tandem repeats 2R/3R in the promoter site of TYMS gene that are related to decreased enzymatic activity and quantity and availability of the enzyme, respectively. These events cause a decrease in the synthesis, repair and DNA methylation, which can lead to a disruption in the expression of tumor suppressor genes as TP53. The objective of this study was investigate the distribution of polymorphisms C677T and tandem repeats 2R/3R associated with the development of oral squamous cell carcinoma (OSCC). 53 paraffin-embedded samples from patients who underwent surgery but are no longer at the institution and 43 samples collected by method of oral exfoliation by cytobrush were selected. 132 healthy subjects were selected by specialists at the dental clinics of the Faculdade de Odontologia de Pernambuco-FOP. The MTHFR genotyping was performed by PCR-RFLP, and the TYMS genotyping was performed by conventional PCR. Fisher's Exact test at significant level of 5 %. Odds ratios (ORs) and 95 % confidence intervals (CIs) were used to measure the strength of association between genotype frequency and OSCC development. The results were statistically significant for the tandem repeats of the TYMS gene (p = 0.015). The TYMS 2R3R genotype was significantly associated with the development of OSCC (OR = 3.582; 95 % CI 1.240-10.348; p = 0.0262) and also the genotype 3R3R (OR = 3.553; 95 % CI 1.293-9.760; p = 0.0345). When analyzed together, the TYMS 2R3R + 3R3R genotypes also showed association (OR = 3.518; 95 % CI 11.188-10.348; p

  19. Achieving 15% Tandem Polymer Solar Cells

    DTIC Science & Technology

    2015-06-23

    solar cell structures – both polymer only and hybrid tandem cells to constantly pushing the envelope of solution processed solar cell ...performance – 11.6% polymer tandem cell , 7% transparent tandem polymer cell , and over 10% PCE hybrid tandem solar cells were achieved. In addition, AFOSR’s...final support also enabled us to explore novel hybrid perovskite solar cells in depth. For example, single junction cell efficiency

  20. MSDB: A Comprehensive Database of Simple Sequence Repeats

    PubMed Central

    Avvaru, Akshay Kumar; Saxena, Saketh; Mishra, Rakesh Kumar

    2017-01-01

    Abstract Microsatellites, also known as Simple Sequence Repeats (SSRs), are short tandem repeats of 1–6 nt motifs present in all genomes, particularly eukaryotes. Besides their usefulness as genome markers, SSRs have been shown to perform important regulatory functions, and variations in their length at coding regions are linked to several disorders in humans. Microsatellites show a taxon-specific enrichment in eukaryotic genomes, and some may be functional. MSDB (Microsatellite Database) is a collection of >650 million SSRs from 6,893 species including Bacteria, Archaea, Fungi, Plants, and Animals. This database is by far the most exhaustive resource to access and analyze SSR data of multiple species. In addition to exploring data in a customizable tabular format, users can view and compare the data of multiple species simultaneously using our interactive plotting system. MSDB is developed using the Django framework and MySQL. It is freely available at http://tdb.ccmb.res.in/msdb. PMID:28854643