Sample records for gene cluster haplotypes

  1. β-globin gene cluster haplotypes in ethnic minority populations of southwest China

    PubMed Central

    Sun, Hao; Liu, Hongxian; Huang, Kai; Lin, Keqin; Huang, Xiaoqin; Chu, Jiayou; Ma, Shaohui; Yang, Zhaoqing

    2017-01-01

    The genetic diversity and relationships among ethnic minority populations of southwest China were investigated using seven polymorphic restriction enzyme sites in the β-globin gene cluster. The haplotypes of 1392 chromosomes from ten ethnic populations living in southwest China were determined. Linkage equilibrium and recombination hotspot were found between the 5′ sites and 3′ sites of the β-globin gene cluster. 5′ haplotypes 2 (+−−−), 6 (−++−+), 9 (−++++) and 3′ haplotype FW3 (−+) were the predominant haplotypes. Notably, haplotype 9 frequency was significantly high in the southwest populations, indicating their difference with other Chinese. The interpopulation differentiation of southwest Chinese minority populations is less than those in populations of northern China and other continents. Phylogenetic analysis shows that populations sharing same ethnic origin or language clustered to each other, indicating current β-globin cluster diversity in the Chinese populations reflects their ethnic origin and linguistic affiliations to a great extent. This study characterizes β-globin gene cluster haplotypes in southwest Chinese minorities for the first time, and reveals the genetic variability and affinity of these populations using β-globin cluster haplotype frequencies. The results suggest that ethnic origin plays an important role in shaping variations of the β-globin gene cluster in the southwestern ethnic populations of China. PMID:28205625

  2. Haplotype analysis of the apolipoprotein gene cluster on human chromosome 11

    PubMed Central

    Olivier, Michael; Wang, Xujing; Cole, Regina; Gau, Brian; Kim, Jessica; Rubin, Edward M.; Pennacchio, Len A.

    2009-01-01

    Members of the apolipoprotein gene cluster (APOA1/C3/A4/A5) on human chromosome 11q23 play an important role in lipid metabolism. Polymorphisms in both APOA5 and APOC3 are strongly associated with plasma triglyceride concentrations. The close genomic locations of these two genes as well as their functional similarity have hindered efforts to define whether each gene independently influences human triglyceride concentrations. In this study, we examined the linkage disequilibrium and haplotype structure of 49 SNPs in a 150-kb region spanning the gene cluster. We identified a total of five common APOA5 haplotypes with a frequency of greater than 8% in samples of northern European origin. The APOA5 haplotype block did not extend past the 7 SNPs in the gene and was separated from the other apolipoprotein gene in the cluster by a region of significantly increased recombination. Furthermore, one previously identified triglyceride risk haplotype of APOA5 (APOA5*3) showed no association with three APOC3 SNPs previously associated with triglyceride concentrations, in contrast to the other risk haplotype (APOA5*2), which was associated with all three minor APOC3 SNP alleles. These results highlight the complex genetic relationship between APOA5 and APOC3 and support the notion that APOA5 represents an independent risk gene affecting plasma triglyceride concentrations in humans. PMID:15081120

  3. Factor IX gene haplotypes in Amerindians.

    PubMed

    Franco, R F; Araújo, A G; Zago, M A; Guerreiro, J F; Figueiredo, M S

    1997-02-01

    We have determined the haplotypes of the factor IX gene for 95 Indians from 5 Brazilian Amazon tribes: Wayampí, Wayana-Apalaí, Kayapó, Arára, and Yanomámi. Eight polymorphisms linked to the factor IX gene were investigated: MseI (at 5', nt -698), BamHI (at 5', nt -561), DdeI (intron 1), BamHI (intron 2), XmnI (intron 3), TaqI (intron 4), MspI (intron 4), and HhaI (at 3', approximately 8 kb). The results of the haplotype distribution and the allele frequencies for each of the factor IX gene polymorphisms in Amerindians were similar to the results reported for Asian populations but differed from results for other ethnic groups. Only five haplotypes were identified within the entire Amerindian study population, and the haplotype distribution was significantly different among the five tribes, with one (Arára) to four (Wayampí) haplotypes being found per tribe. These findings indicate a significant heterogeneity among the Indian tribes and contrast with the homogeneous distribution of the beta-globin gene cluster haplotypes but agree with our recent findings on the distribution of alpha-globin gene cluster haplotypes and the allele frequencies for six VNTRs in the same Amerindian tribes. Our data represent the first study of factor IX-associated polymorphisms in Amerindian populations and emphasizes the applicability of these genetic markers for population and human evolution studies.

  4. Alpha-globin gene haplotypes in South American Indians.

    PubMed

    Zago, M A; Melo Santos, E J; Clegg, J B; Guerreiro, J F; Martinson, J J; Norwich, J; Figueiredo, M S

    1995-08-01

    The haplotypes of the alpha-globin gene cluster were determined for 99 Indians from the Brazilian Amazon region who belong to 5 tribes: Wayampí, Wayana-Apalaí, Kayapó, Arára, and Yanomámi. Three predominant haplotypes were identified: Ia (present in 38.9% of chromosomes), IIIa (25.8%), and IIe (22.1%). The only alpha-globin gene rearrangement detected was alpha alpha alpha 3.7 I gene triplication associated with haplotype IIIa, found in high frequencies (5.6% and 10.6%) in two tribes and absent in the others. alpha-Globin gene deletions that cause alpha-thalassemia were not seen, supporting the argument that malaria was absent in these populations until recently. The heterogeneous distribution of alpha-globin gene haplotypes and rearrangements among the different tribes differs markedly from the homogeneous distribution of beta-globin gene cluster haplotypes and reflects the action of various genetic mechanisms (genetic drift, founder effect, consanguinity) on small isolated population groups with a complicated history of divergence-fusion events. The alpha-globin gene haplotype distribution has some similarities to distributions observed in Southeast Asian and Pacific Island populations, indicating that these populations have considerable genetic affinities. However, the absence of several features of the alpha-globin gene cluster that are consistently present among the Pacific Islanders suggests that the similarity of haplotypes between Brazilian Indians and people from Polynesia, Micronesia, and Melanesia is more likely to result of ancient common ancestry rather than the consequence of recent direct genetic contribution through immigration.

  5. Population Structure With Localized Haplotype Clusters

    PubMed Central

    Browning, Sharon R.; Weir, Bruce S.

    2010-01-01

    We propose a multilocus version of FST and a measure of haplotype diversity using localized haplotype clusters. Specifically, we use haplotype clusters identified with BEAGLE, which is a program implementing a hidden Markov model for localized haplotype clustering and performing several functions including inference of haplotype phase. We apply this methodology to HapMap phase 3 data. With this haplotype-cluster approach, African populations have highest diversity and lowest divergence from the ancestral population, East Asian populations have lowest diversity and highest divergence, and other populations (European, Indian, and Mexican) have intermediate levels of diversity and divergence. These relationships accord with expectation based on other studies and accepted models of human history. In contrast, the population-specific FST estimates obtained directly from single-nucleotide polymorphisms (SNPs) do not reflect such expected relationships. We show that ascertainment bias of SNPs has less impact on the proposed haplotype-cluster-based FST than on the SNP-based version, which provides a potential explanation for these results. Thus, these new measures of FST and haplotype-cluster diversity provide an important new tool for population genetic analysis of high-density SNP data. PMID:20457877

  6. Haplotypes in the APOA1-C3-A4-A5 gene cluster affect plasma lipids in both humans and baboons

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wang, Qian-fei; Liu, Xin; O'Connell, Jeff

    2003-09-15

    Genetic studies in non-human primates serve as a potential strategy for identifying genomic intervals where polymorphisms impact upon human disease-related phenotypes. It remains unclear, however, whether independently arising polymorphisms in orthologous regions of non-human primates leads to similar variation in a quantitative trait found in both species. To explore this paradigm, we studied a baboon apolipoprotein gene cluster (APOA1/C3/A4/A5) for which the human gene orthologs have well established roles in influencing plasma HDL-cholesterol and triglyceride concentrations. Our extensive polymorphism analysis of this 68 kb gene cluster in 96 pedigreed baboons identified several haplotype blocks each with limited diversity, consistent withmore » haplotype findings in humans. To determine whether baboons, like humans, also have particular haplotypes associated with lipid phenotypes, we genotyped 634 well characterized baboons using 16 haplotype tagging SNPs. Genetic analysis of single SNPs, as well as haplotypes, revealed an association of APOA5 and APOC3 variants with HDL cholesterol and triglyceride concentrations, respectively. Thus, independent variation in orthologous genomic intervals does associate with similar quantitative lipid traits in both species, supporting the possibility of uncovering human QTL genes in a highly controlled non-human primate model.« less

  7. Beta-globin gene cluster haplotypes of Amerindian populations from the Brazilian Amazon region.

    PubMed

    Guerreiro, J F; Figueiredo, M S; Zago, M A

    1994-01-01

    We have determined the beta-globin cluster haplotypes for 80 Indians from four Brazilian Amazon tribes: Kayapó, Wayampí, Wayana-Apalaí, and Arára. The results are analyzed together with 20 Yanomámi previously studied. From 2 to 4 different haplotypes were identified for each tribe, and 7 of the possible 32 haplotypes were found in a sample of 172 chromosomes for which the beta haplotypes were directly determined or derived from family studies. The haplotype distribution does not differ significantly among the five populations. The two most common haplotypes in all tribes were haplotypes 2 and 6, with average frequencies of 0.843 and 0.122, respectively. The genetic affinities between Brazilian Indians and other human populations were evaluated by estimates of genetic distance based on haplotype data. The lowest values were observed in relation to Asians, especially Chinese, Polynesians, and Micronesians.

  8. Submegabase Clusters of Unstable Tandem Repeats Unique to the Tla Region of Mouse T Haplotypes

    PubMed Central

    Uehara, H.; Ebersole, T.; Bennett, D.; Artzt, K.

    1990-01-01

    We describe here the identification and genomic organization of mouse t haplotype-specific elements (TSEs) 7.8 and 5.8 kb in length. The TSEs exist as submegabase-long clusters of tandem repeats localized in the Tla region of the major histocompatibility complex of all t haplotype chromosomes examined. In contrast, no such clusters were detected among 12 inbred strains of Mus musculus and other Mus species; thus, clusters of TSEs represent the first absolutely qualitative difference between t haplotypes and wild-type chromosomes. Pulsed field gel electrophoresis shows that the number of clusters, and the number of repeats in each cluster are extremely variable. Dramatic quantitative differences of TSEs uniquely distinguish every independent t haplotype from any other. The complete nucleotide sequence of one 7.8-kb TSE reveals significant homology to the ETn (a major transcript in the early embryo of the mouse), and some homologies to intracisternal A-particles and the mammary tumor virus env gene. Apart from the diagnostic relevance to t haplotypes, evolutionary and functional significances are discussed with respect to chromosome structure and genetic recombination. PMID:2076812

  9. The effects of old and recent migration waves in the distribution of HBB*S globin gene haplotypes

    PubMed Central

    Lindenau, Juliana D.; Wagner, Sandrine C.; de Castro, Simone M.; Hutz, Mara H.

    2016-01-01

    Abstract Sickle cell hemoglobin is the result of a mutation at the sixth amino acid position of the beta (β) globin chain. The HBB*S gene is in linkage disequilibrium with five main haplotypes in the β-globin-like gene cluster named according to their ethnic and geographic origins: Bantu (CAR), Benin (BEN), Senegal (SEN), Cameroon (CAM) and Arabian-Indian (ARAB). These haplotypes demonstrated that the sickle cell mutation arose independently at least five times in human history. The distribution of βS haplotypes among Brazilian populations showed a predominance of the CAR haplotype. American populations were clustered in two groups defined by CAR or BEN haplotype frequencies. This scenario is compatible with historical records about the slave trade in the Americas. When all world populations where the sickle cell gene occurs were analyzed, three clusters were disclosed based on CAR, BEN or ARAB haplotype predominance. These patterns may change in the next decades due to recent migrations waves. Since these haplotypes show different clinical characteristics, these recent migrations events raise the necessity to develop optimized public health programs for sickle cell disease screening and management. PMID:27706371

  10. Novel strategies to mine alcoholism-related haplotypes and genes by combining existing knowledge framework.

    PubMed

    Zhang, RuiJie; Li, Xia; Jiang, YongShuai; Liu, GuiYou; Li, ChuanXing; Zhang, Fan; Xiao, Yun; Gong, BinSheng

    2009-02-01

    High-throughout single nucleotide polymorphism detection technology and the existing knowledge provide strong support for mining the disease-related haplotypes and genes. In this study, first, we apply four kinds of haplotype identification methods (Confidence Intervals, Four Gamete Tests, Solid Spine of LD and fusing method of haplotype block) into high-throughout SNP genotype data to identify blocks, then use cluster analysis to verify the effectiveness of the four methods, and select the alcoholism-related SNP haplotypes through risk analysis. Second, we establish a mapping from haplotypes to alcoholism-related genes. Third, we inquire NCBI SNP and gene databases to locate the blocks and identify the candidate genes. In the end, we make gene function annotation by KEGG, Biocarta, and GO database. We find 159 haplotype blocks, which relate to the alcoholism most possibly on chromosome 1 approximately 22, including 227 haplotypes, of which 102 SNP haplotypes may increase the risk of alcoholism. We get 121 alcoholism-related genes and verify their reliability by the functional annotation of biology. In a word, we not only can handle the SNP data easily, but also can locate the disease-related genes precisely by combining our novel strategies of mining alcoholism-related haplotypes and genes with existing knowledge framework.

  11. Mechanisms of haplotype divergence at the RGA08 nucleotide-binding leucine-rich repeat gene locus in wild banana (Musa balbisiana).

    PubMed

    Baurens, Franc-Christophe; Bocs, Stéphanie; Rouard, Mathieu; Matsumoto, Takashi; Miller, Robert N G; Rodier-Goud, Marguerite; MBéguié-A-MBéguié, Didier; Yahiaoui, Nabila

    2010-07-16

    Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M. balbisiana haplotypes. A

  12. Mechanisms of haplotype divergence at the RGA08 nucleotide-binding leucine-rich repeat gene locus in wild banana (Musa balbisiana)

    PubMed Central

    2010-01-01

    Background Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Results Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M

  13. Frequency and origin of haplotypes associated with the beta-globin gene cluster in individuals with trait and sickle cell anemia in the Atlantic and Pacific coastal regions of Colombia

    PubMed Central

    Fong, Cristian; Lizarralde-Iragorri, María Alejandra; Rojas-Gallardo, Diana; Barreto, Guillermo

    2013-01-01

    Sickle cell anemia is a genetic disease with high prevalence in people of African descent. There are five typical haplotypes associated with this disease and the haplotypes associated with the beta-globin gene cluster have been used to establish the origin of African-descendant people in America. In this work, we determined the frequency and the origin of haplotypes associated with hemoglobin S in a sample of individuals with sickle cell anemia (HbSS) and sickle cell hemoglobin trait (HbAS) in coastal regions of Colombia. Blood samples from 71 HbAS and 79 HbSS individuals were obtained. Haplotypes were determined based on the presence of variable restriction sites within the β-globin gene cluster. On the Pacific coast of Colombia the most frequent haplotype was Benin, while on the Atlantic coast Bantu was marginally higher than Benin. Eight atypical haplotypes were observed on both coasts, being more diverse in the Atlantic than in the Pacific region. These results suggest a differential settlement of the coasts, dependent on where slaves were brought from, either from the Gulf of Guinea or from Angola, where the haplotype distributions are similar. Atypical haplotypes probably originated from point mutations that lost or gained a restriction site and/or by recombination events. PMID:24385850

  14. Variation analysis and gene annotation of eight MHC haplotypes: The MHC Haplotype Project

    PubMed Central

    Horton, Roger; Gibson, Richard; Coggill, Penny; Miretti, Marcos; Allcock, Richard J.; Almeida, Jeff; Forbes, Simon; Gilbert, James G. R.; Halls, Karen; Harrow, Jennifer L.; Hart, Elizabeth; Howe, Kevin; Jackson, David K.; Palmer, Sophie; Roberts, Anne N.; Sims, Sarah; Stewart, C. Andrew; Traherne, James A.; Trevanion, Steve; Wilming, Laurens; Rogers, Jane; de Jong, Pieter J.; Elliott, John F.; Sawcer, Stephen; Todd, John A.; Trowsdale, John

    2008-01-01

    The human major histocompatibility complex (MHC) is contained within about 4 Mb on the short arm of chromosome 6 and is recognised as the most variable region in the human genome. The primary aim of the MHC Haplotype Project was to provide a comprehensively annotated reference sequence of a single, human leukocyte antigen-homozygous MHC haplotype and to use it as a basis against which variations could be assessed from seven other similarly homozygous cell lines, representative of the most common MHC haplotypes in the European population. Comparison of the haplotype sequences, including four haplotypes not previously analysed, resulted in the identification of >44,000 variations, both substitutions and indels (insertions and deletions), which have been submitted to the dbSNP database. The gene annotation uncovered haplotype-specific differences and confirmed the presence of more than 300 loci, including over 160 protein-coding genes. Combined analysis of the variation and annotation datasets revealed 122 gene loci with coding substitutions of which 97 were non-synonymous. The haplotype (A3-B7-DR15; PGF cell line) designated as the new MHC reference sequence, has been incorporated into the human genome assembly (NCBI35 and subsequent builds), and constitutes the largest single-haplotype sequence of the human genome to date. The extensive variation and annotation data derived from the analysis of seven further haplotypes have been made publicly available and provide a framework and resource for future association studies of all MHC-associated diseases and transplant medicine. PMID:18193213

  15. Haplotype diversity in 11 candidate genes across four populations.

    PubMed

    Beaty, T H; Fallin, M D; Hetmanski, J B; McIntosh, I; Chong, S S; Ingersoll, R; Sheng, X; Chakraborty, R; Scott, A F

    2005-09-01

    Analysis of haplotypes based on multiple single-nucleotide polymorphisms (SNP) is becoming common for both candidate gene and fine-mapping studies. Before embarking on studies of haplotypes from genetically distinct populations, however, it is important to consider variation both in linkage disequilibrium (LD) and in haplotype frequencies within and across populations, as both vary. Such diversity will influence the choice of "tagging" SNPs for candidate gene or whole-genome association studies because some markers will not be polymorphic in all samples and some haplotypes will be poorly represented or completely absent. Here we analyze 11 genes, originally chosen as candidate genes for oral clefts, where multiple markers were genotyped on individuals from four populations. Estimated haplotype frequencies, measures of pairwise LD, and genetic diversity were computed for 135 European-Americans, 57 Chinese-Singaporeans, 45 Malay-Singaporeans, and 46 Indian-Singaporeans. Patterns of pairwise LD were compared across these four populations and haplotype frequencies were used to assess genetic variation. Although these populations are fairly similar in allele frequencies and overall patterns of LD, both haplotype frequencies and genetic diversity varied significantly across populations. Such haplotype diversity has implications for designing studies of association involving samples from genetically distinct populations.

  16. Estimating the age of Hb G-Coushatta [β22(B4)Glu→Ala] mutation by haplotypes of β-globin gene cluster in Denizli, Turkey.

    PubMed

    Ozturk, Onur; Arikan, Sanem; Atalay, Ayfer; Atalay, Erol O

    2018-05-01

    Hb G-Coushatta variant was reported from various populations' parts of the world such as Thai, Korea, Algeria, Thailand, China, Japan and Turkey. In our study, we aimed to discuss the possible historical relationships of the Hb G-Coushatta mutation with the possible migration routes of the world. For this purpose, associated haplotypes were determined using polymorphic loci in the beta globin gene cluster of hemoglobin G-Coushatta and normal populations in Denizli, Turkey. We performed statistical analysis such as haplotype analysis, Hardy-Weinberg equilibrium, measurement of genetic diversity and population differentiation parameters, analysis of molecular variance using F-statistics, historical-demographic analyses, mismatch distribution analysis of both populations and applied the test statistics in Arlequin ver. 3.5 software program. The diversity of haplotypes has been shown to indicate different genetic origins for two populations. However, AMOVA results, molecular diversity parameters and population demographic expansion times showed that the Hb G-Coushatta mutation develops on the normal population gene pool. Our estimated τ values showed the average time since the demographic expansion for normal and Hb G-Coushatta populations ranged from approximately 42,000 to 38,000 ybp, respectively. Our data suggest that Hb G-Coushatta population originate in normal population in Denizli, Turkey. These results support the hypothesis that the multiple origin of Hb G-Coushatta and indicate that mutation may have been triggered the formation of new variants on beta globin haplotypes. © 2018 The Authors. Molecular Genetics & Genomic Medicine published by Wiley Periodicals, Inc.

  17. Cluster analysis of European Y-chromosomal STR haplotypes using the discrete Laplace method.

    PubMed

    Andersen, Mikkel Meyer; Eriksen, Poul Svante; Morling, Niels

    2014-07-01

    The European Y-chromosomal short tandem repeat (STR) haplotype distribution has previously been analysed in various ways. Here, we introduce a new way of analysing population substructure using a new method based on clustering within the discrete Laplace exponential family that models the probability distribution of the Y-STR haplotypes. Creating a consistent statistical model of the haplotypes enables us to perform a wide range of analyses. Previously, haplotype frequency estimation using the discrete Laplace method has been validated. In this paper we investigate how the discrete Laplace method can be used for cluster analysis to further validate the discrete Laplace method. A very important practical fact is that the calculations can be performed on a normal computer. We identified two sub-clusters of the Eastern and Western European Y-STR haplotypes similar to results of previous studies. We also compared pairwise distances (between geographically separated samples) with those obtained using the AMOVA method and found good agreement. Further analyses that are impossible with AMOVA were made using the discrete Laplace method: analysis of the homogeneity in two different ways and calculating marginal STR distributions. We found that the Y-STR haplotypes from e.g. Finland were relatively homogeneous as opposed to the relatively heterogeneous Y-STR haplotypes from e.g. Lublin, Eastern Poland and Berlin, Germany. We demonstrated that the observed distributions of alleles at each locus were similar to the expected ones. We also compared pairwise distances between geographically separated samples from Africa with those obtained using the AMOVA method and found good agreement. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  18. Prion gene haplotypes of U.S. cattle

    PubMed Central

    Clawson, Michael L; Heaton, Michael P; Keele, John W; Smith, Timothy PL; Harhay, Gregory P; Laegreid, William W

    2006-01-01

    Background Bovine spongiform encephalopathy (BSE) is a fatal neurological disorder characterized by abnormal deposits of a protease-resistant isoform of the prion protein. Characterizing linkage disequilibrium (LD) and haplotype networks within the bovine prion gene (PRNP) is important for 1) testing rare or common PRNP variation for an association with BSE and 2) interpreting any association of PRNP alleles with BSE susceptibility. The objective of this study was to identify polymorphisms and haplotypes within PRNP from the promoter region through the 3'UTR in a diverse sample of U.S. cattle genomes. Results A 25.2-kb genomic region containing PRNP was sequenced from 192 diverse U.S. beef and dairy cattle. Sequence analyses identified 388 total polymorphisms, of which 287 have not previously been reported. The polymorphism alleles define PRNP by regions of high and low LD. High LD is present between alleles in the promoter region through exon 2 (6.7 kb). PRNP alleles within the majority of intron 2, the entire coding sequence and the untranslated region of exon 3 are in low LD (18.0 kb). Two haplotype networks, one representing the region of high LD and the other the region of low LD yielded nineteen different combinations that represent haplotypes spanning PRNP. The haplotype combinations are tagged by 19 polymorphisms (htSNPS) which characterize variation within and across PRNP. Conclusion The number of polymorphisms in the prion gene region of U.S. cattle is nearly four times greater than previously described. These polymorphisms define PRNP haplotypes that may influence BSE susceptibility in cattle. PMID:17092337

  19. Patterns of linkage disequilibrium and haplotype distribution in disease candidate genes.

    PubMed

    Long, Ji-Rong; Zhao, Lan-Juan; Liu, Peng-Yuan; Lu, Yan; Dvornyk, Volodymyr; Shen, Hui; Liu, Yong-Jun; Zhang, Yuan-Yuan; Xiong, Dong-Hai; Xiao, Peng; Deng, Hong-Wen

    2004-05-24

    The adequacy of association studies for complex diseases depends critically on the existence of linkage disequilibrium (LD) between functional alleles and surrounding SNP markers. We examined the patterns of LD and haplotype distribution in eight candidate genes for osteoporosis and/or obesity using 31 SNPs in 1,873 subjects. These eight genes are apolipoprotein E (APOE), type I collagen alpha1 (COL1A1), estrogen receptor-alpha (ER-alpha), leptin receptor (LEPR), parathyroid hormone (PTH)/PTH-related peptide receptor type 1 (PTHR1), transforming growth factor-beta1 (TGF-beta1), uncoupling protein 3 (UCP3), and vitamin D (1,25-dihydroxyvitamin D3) receptor (VDR). Yin yang haplotypes, two high-frequency haplotypes composed of completely mismatching SNP alleles, were examined. To quantify LD patterns, two common measures of LD, D' and r2, were calculated for the SNPs within the genes. The haplotype distribution varied in the different genes. Yin yang haplotypes were observed only in PTHR1 and UCP3. D' ranged from 0.020 to 1.000 with the average of 0.475, whereas the average r2 was 0.158 (ranging from 0.000 to 0.883). A decay of LD was observed as the intermarker distance increased, however, there was a great difference in LD characteristics of different genes or even in different regions within gene. The differences in haplotype distributions and LD patterns among the genes underscore the importance of characterizing genomic regions of interest prior to association studies.

  20. Genetic differences in the two main groups of the Japanese population based on autosomal SNPs and haplotypes.

    PubMed

    Yamaguchi-Kabata, Yumi; Tsunoda, Tatsuhiko; Kumasaka, Natsuhiko; Takahashi, Atsushi; Hosono, Naoya; Kubo, Michiaki; Nakamura, Yusuke; Kamatani, Naoyuki

    2012-05-01

    Although the Japanese population has a rather low genetic diversity, we recently confirmed the presence of two main clusters (the Hondo and Ryukyu clusters) through principal component analysis of genome-wide single-nucleotide polymorphism (SNP) genotypes. Understanding the genetic differences between the two main clusters requires further genome-wide analyses based on a dense SNP set and comparison of haplotype frequencies. In the present study, we determined haplotypes for the Hondo cluster of the Japanese population by detecting SNP homozygotes with 388,591 autosomal SNPs from 18,379 individuals and estimated the haplotype frequencies. Haplotypes for the Ryukyu cluster were inferred by a statistical approach using the genotype data from 504 individuals. We then compared the haplotype frequencies between the Hondo and Ryukyu clusters. In most genomic regions, the haplotype frequencies in the Hondo and Ryukyu clusters were very similar. However, in addition to the human leukocyte antigen region on chromosome 6, other genomic regions (chromosomes 3, 4, 5, 7, 10 and 12) showed dissimilarities in haplotype frequency. These regions were enriched for genes involved in the immune system, cell-cell adhesion and the intracellular signaling cascade. These differentiated genomic regions between the Hondo and Ryukyu clusters are of interest because they (1) should be examined carefully in association studies and (2) likely contain genes responsible for morphological or physiological differences between the two groups.

  1. A phased SNP-based classification of sickle cell anemia HBB haplotypes.

    PubMed

    Shaikho, Elmutaz M; Farrell, John J; Alsultan, Abdulrahman; Qutub, Hatem; Al-Ali, Amein K; Figueiredo, Maria Stella; Chui, David H K; Farrer, Lindsay A; Murphy, George J; Mostoslavsky, Gustavo; Sebastiani, Paola; Steinberg, Martin H

    2017-08-11

    Sickle cell anemia causes severe complications and premature death. Five common β-globin gene cluster haplotypes are each associated with characteristic fetal hemoglobin (HbF) levels. As HbF is the major modulator of disease severity, classifying patients according to haplotype is useful. The first method of haplotype classification used restriction fragment length polymorphisms (RFLPs) to detect single nucleotide polymorphisms (SNPs) in the β-globin gene cluster. This is labor intensive, and error prone. We used genome-wide SNP data imputed to the 1000 Genomes reference panel to obtain phased data distinguishing parental alleles. We successfully haplotyped 813 sickle cell anemia patients previously classified by RFLPs with a concordance >98%. Four SNPs (rs3834466, rs28440105, rs10128556, and rs968857) marking four different restriction enzyme sites unequivocally defined most haplotypes. We were able to assign a haplotype to 86% of samples that were either partially or misclassified using RFLPs. Phased data using only four SNPs allowed unequivocal assignment of a haplotype that was not always possible using a larger number of RFLPs. Given the availability of genome-wide SNP data, our method is rapid and does not require high computational resources.

  2. Haplotype analysis of sucrose synthase gene family in three Saccharum species

    PubMed Central

    2013-01-01

    . spontanuem, respectively. Phylogenetic analysis showed three separate clusters composed of SbSuSy1 and SbSuSy2, SbSuSy3 and SbSuSy5, and SbSuSy4. Conclusions The five members of the SuSy gene family evolved before the divergence of the genera in the tribe Andropogoneae at least 12 MYA. Each ScSuSy gene showed at least one non-synonymous substitution in SNP haplotypes. The SNP frequency is the lowest in S. officinarum, intermediate in S. robustum, and the highest in S. spontaneum, which may reflect the timing of the two rounds of whole genome duplication in these octoploids. The higher rate of shared SNP frequency between S. officinarum and S. robustum than between S. officinarum and in S. spontaneum confirmed that the speciation event separating S. officinarum and S. robustum occurred after their common ancestor diverged from S. spontaneum. The SNP and haplotype frequencies in three Saccharum species provide fundamental information for designing strategies to sequence these autopolyploid genomes. PMID:23663250

  3. Haplotype analysis of sucrose synthase gene family in three Saccharum species.

    PubMed

    Zhang, Jisen; Arro, Jie; Chen, Youqiang; Ming, Ray

    2013-05-10

    , respectively. Phylogenetic analysis showed three separate clusters composed of SbSuSy1 and SbSuSy2, SbSuSy3 and SbSuSy5, and SbSuSy4. The five members of the SuSy gene family evolved before the divergence of the genera in the tribe Andropogoneae at least 12 MYA. Each ScSuSy gene showed at least one non-synonymous substitution in SNP haplotypes. The SNP frequency is the lowest in S. officinarum, intermediate in S. robustum, and the highest in S. spontaneum, which may reflect the timing of the two rounds of whole genome duplication in these octoploids. The higher rate of shared SNP frequency between S. officinarum and S. robustum than between S. officinarum and in S. spontaneum confirmed that the speciation event separating S. officinarum and S. robustum occurred after their common ancestor diverged from S. spontaneum. The SNP and haplotype frequencies in three Saccharum species provide fundamental information for designing strategies to sequence these autopolyploid genomes.

  4. Analysis of MHC class I genes across horse MHC haplotypes

    PubMed Central

    Tallmadge, Rebecca L.; Campbell, Julie A.; Miller, Donald C.; Antczak, Douglas F.

    2010-01-01

    The genomic sequences of 15 horse Major Histocompatibility Complex (MHC) class I genes and a collection of MHC class I homozygous horses of five different haplotypes were used to investigate the genomic structure and polymorphism of the equine MHC. A combination of conserved and locus-specific primers was used to amplify horse MHC class I genes with classical and non-classical characteristics. Multiple clones from each haplotype identified three to five classical sequences per homozygous animal, and two to three non-classical sequences. Phylogenetic analysis was applied to these sequences and groups were identified which appear to be allelic series, but some sequences were left ungrouped. Sequences determined from MHC class I heterozygous horses and previously described MHC class I sequences were then added, representing a total of ten horse MHC haplotypes. These results were consistent with those obtained from the MHC homozygous horses alone, and 30 classical sequences were assigned to four previously confirmed loci and three new provisional loci. The non-classical genes had few alleles and the classical genes had higher levels of allelic polymorphism. Alleles for two classical loci with the expected pattern of polymorphism were found in the majority of haplotypes tested, but alleles at two other commonly detected loci had more variation outside of the hypervariable region than within. Our data indicate that the equine Major Histocompatibility Complex is characterized by variation in the complement of class I genes expressed in different haplotypes in addition to the expected allelic polymorphism within loci. PMID:20099063

  5. Application of the Linux cluster for exhaustive window haplotype analysis using the FBAT and Unphased programs.

    PubMed

    Mishima, Hiroyuki; Lidral, Andrew C; Ni, Jun

    2008-05-28

    Genetic association studies have been used to map disease-causing genes. A newly introduced statistical method, called exhaustive haplotype association study, analyzes genetic information consisting of different numbers and combinations of DNA sequence variations along a chromosome. Such studies involve a large number of statistical calculations and subsequently high computing power. It is possible to develop parallel algorithms and codes to perform the calculations on a high performance computing (HPC) system. However, most existing commonly-used statistic packages for genetic studies are non-parallel versions. Alternatively, one may use the cutting-edge technology of grid computing and its packages to conduct non-parallel genetic statistical packages on a centralized HPC system or distributed computing systems. In this paper, we report the utilization of a queuing scheduler built on the Grid Engine and run on a Rocks Linux cluster for our genetic statistical studies. Analysis of both consecutive and combinational window haplotypes was conducted by the FBAT (Laird et al., 2000) and Unphased (Dudbridge, 2003) programs. The dataset consisted of 26 loci from 277 extended families (1484 persons). Using the Rocks Linux cluster with 22 compute-nodes, FBAT jobs performed about 14.4-15.9 times faster, while Unphased jobs performed 1.1-18.6 times faster compared to the accumulated computation duration. Execution of exhaustive haplotype analysis using non-parallel software packages on a Linux-based system is an effective and efficient approach in terms of cost and performance.

  6. Application of the Linux cluster for exhaustive window haplotype analysis using the FBAT and Unphased programs

    PubMed Central

    Mishima, Hiroyuki; Lidral, Andrew C; Ni, Jun

    2008-01-01

    Background Genetic association studies have been used to map disease-causing genes. A newly introduced statistical method, called exhaustive haplotype association study, analyzes genetic information consisting of different numbers and combinations of DNA sequence variations along a chromosome. Such studies involve a large number of statistical calculations and subsequently high computing power. It is possible to develop parallel algorithms and codes to perform the calculations on a high performance computing (HPC) system. However, most existing commonly-used statistic packages for genetic studies are non-parallel versions. Alternatively, one may use the cutting-edge technology of grid computing and its packages to conduct non-parallel genetic statistical packages on a centralized HPC system or distributed computing systems. In this paper, we report the utilization of a queuing scheduler built on the Grid Engine and run on a Rocks Linux cluster for our genetic statistical studies. Results Analysis of both consecutive and combinational window haplotypes was conducted by the FBAT (Laird et al., 2000) and Unphased (Dudbridge, 2003) programs. The dataset consisted of 26 loci from 277 extended families (1484 persons). Using the Rocks Linux cluster with 22 compute-nodes, FBAT jobs performed about 14.4–15.9 times faster, while Unphased jobs performed 1.1–18.6 times faster compared to the accumulated computation duration. Conclusion Execution of exhaustive haplotype analysis using non-parallel software packages on a Linux-based system is an effective and efficient approach in terms of cost and performance. PMID:18541045

  7. Polymorphism at Expressed DQ and DR Loci in Five Common Equine MHC Haplotypes

    PubMed Central

    Miller, Donald; Tallmadge, Rebecca L.; Binns, Matthew; Zhu, Baoli; Mohamoud, Yasmin Ali; Ahmed, Ayeda; Brooks, Samantha A.; Antczak, Douglas F.

    2016-01-01

    The polymorphism of Major Histocompatibility Complex (MHC) class II DQ and DR genes in five common Equine Leukocyte Antigen (ELA) haplotypes was determined through sequencing of mRNA transcripts isolated from lymphocytes of eight ELA homozygous horses. Ten expressed MHC class II genes were detected in horses of the ELA-A3 haplotype carried by the donor horses of the equine Bacterial Artificial Chromosome (BAC) library and the reference genome sequence: four DR genes and six DQ genes. The other four ELA haplotypes contained at least eight expressed polymorphic MHC class II loci. Next Generation Sequencing (NGS) of genomic DNA of these four MHC haplotypes revealed stop codons in the DQA3 gene in the ELA-A2, ELA-A5, and ELA-A9 haplotypes. Few NGS reads were obtained for the other MHC class II genes that were not amplified in these horses. The amino acid sequences across haplotypes contained locus-specific residues, and the locus clusters produced by phylogenetic analysis were well supported. The MHC class II alleles within the five tested haplotypes were largely non-overlapping between haplotypes. The complement of equine MHC class II DQ and DR genes appears to be well conserved between haplotypes, in contrast to the recently described variation in class I gene loci between equine MHC haplotypes. The identification of allelic series of equine MHC class II loci will aid comparative studies of mammalian MHC conservation and evolution and may also help to interpret associations between the equine MHC class II region and diseases of the horse. PMID:27889800

  8. African gene flow to north Brazil as revealed by HBB*S gene haplotype analysis.

    PubMed

    Lemos Cardoso, Greice; Farias Guerreiro, João

    2006-01-01

    Haplotypes linked to the HBB*S gene were analyzed in a sample of 260 chromosomes of Brazilian sickle cell anemia patients from the population of Belém, state of Pará, to evaluate if the present-day haplotype frequencies correlate as well as expected with historical information on the geographic origin of African slaves sent directly to Northern Brazil. The HBB*S gene haplotype distribution (66% Bantu, 21.8% Benin, 10.9% Senegal, and 1.3% Cameroon) is in agreement with those observed for other Brazilian populations regarding the highest proportion of the Bantu type, followed by the Benin type, but it differs significantly concerning the Senegal type as this haplotype is rare or absent in samples from other Brazilian regions already studied. In addition, our results are in accordance with historical records that establish that about 90% of the slaves sent to Northern Brazil were from Angola, Congo, and Mozambique, where the Bantu haplotype predominates, in contrast to 10% of slaves from Senegambia, Guine-Bissau, and Cape Verde, where the Senegal haplotype is the most common. On the other hand, the observed frequency of the Benin haplotype in Belém was much higher than that expected by historical data. This fact corroborates the suggestion that the high prevalence of the Benin type in Belém is due to domestic slave trade and later internal migrations, mainly from the Northeast, since there are no historical records of direct slave trade from Central West Africa to North Brazil. Am. J. Hum. Biol. 18:93-98, 2006. (c) 2005 Wiley-Liss, Inc.

  9. β3 Integrin Haplotype Influences Gene Regulation and Plasma von Willebrand Factor Activity

    PubMed Central

    Payne, Katie E; Bray, Paul F; Grant, Peter J; Carter, Angela M

    2008-01-01

    The Leu33Pro polymorphism of the gene encoding β3 integrin (ITGB3) is associated with acute coronary syndromes and influences platelet aggregation. Three common promoter polymorphisms have also been identified. The aims of this study were to (1) investigate the influence of the ITGB3 −400C/A, −425A/C and −468G/A promoter polymorphisms on reporter gene expression and nuclear protein binding and (2) determine genotype and haplotype associations with platelet αIIbβ3 receptor density. Promoter haplotypes were introduced into an ITGB3 promoter-pGL3 construct by site directed mutagenesis and luciferase reporter gene expression analysed in HEL and HMEC-1 cells. Binding of nuclear proteins was assessed by electrophoretic mobility shift assay. The association of ITGB3 haplotype with platelet αIIbβ3 receptor density was determined in 223 subjects. Species conserved motifs were identified in the ITGB3 promoter in the vicinity of the 3 polymorphisms. The GAA, GCC, AAC, AAA and ACC constructs induced ~50% increased luciferase expression relative to the GAC construct in both cell types. Haplotype analysis including Leu33Pro indicated 5 common haplotypes; no associations between ITGB3 haplotypes and receptor density were found. However, the GCC-Pro33 haplotype was associated with significantly higher vWF activity (128.6 [112.1–145.1]%) compared with all other haplotypes (107.1 [101.2–113.0]%, p=0.02). In conclusion, the GCC-Pro33 haplotype was associated with increased vWF activity but not with platelet αIIbβ3 receptor density, which may indicate ITGB3 haplotype influences endothelial function. PMID:18045606

  10. The role of the JAK2 GGCC haplotype and the TET2 gene in familial myeloproliferative neoplasms

    PubMed Central

    Olcaydu, Damla; Rumi, Elisa; Harutyunyan, Ashot; Passamonti, Francesco; Pietra, Daniela; Pascutto, Cristiana; Berg, Tiina; Jäger, Roland; Hammond, Emma; Cazzola, Mario; Kralovics, Robert

    2011-01-01

    Background Myeloproliferative neoplasms constitute a group of diverse chronic myeloid malignancies that share pathogenic features such as acquired mutations in the JAK2, TET2, CBL and MPL genes. There are recent reports that a JAK2 gene haplotype (GGCC or 46/1) confers susceptibility to JAK2 mutation-positive myeloproliferative neoplasms. The aim of this study was to examine the role of the JAK2 GGCC haplotype and germline mutations of TET2, CBL and MPL in familial myeloproliferative neoplasms. Design and Methods We investigated patients with familial (n=88) or sporadic (n=684) myeloproliferative neoplasms, and a control population (n=203) from the same demographic area in Italy. Association analysis was performed using tagged single nucleotide polymorphisms (rs10974944 and rs12343867) of the JAK2 haplotype. Sequence analysis of TET2, CBL and MPL was conducted in the 88 patients with familial myeloproliferative neoplasms. Results Association analysis revealed no difference in haplotype frequency between familial and sporadic cases of myeloproliferative neoplasms (P=0.6529). No germline mutations in TET2, CBL or MPL that segregate with the disease phenotype were identified. As we observed variability in somatic mutations in the affected members of a pedigree with myeloproliferative neoplasms, we postulated that somatic mutagenesis is increased in familial myeloproliferative neoplasms. Accordingly, we compared the incidence of malignant disorders between sporadic and familial patients. Although the overall incidence of malignant disorders did not differ significantly between cases of familial and sporadic myeloproliferative neoplasms, malignancies were more frequent in patients with familial disease aged between 50 to 70 years (P=0.0198) than in patients in the same age range with sporadic myeloproliferative neoplasms. Conclusions We conclude that the JAK2 GGCC haplotype and germline mutations of TET2, CBL or MPL do not explain familial clustering of

  11. The HLA-DRB9 gene and the origin of HLA-DR haplotypes.

    PubMed

    Gongora, R; Figueroa, F; Klein, J

    1996-11-01

    HLA-DRB9 is a gene fragment consisting of exon 2 and flanking intron sequences. It is located at the extreme end of the DRB subregion, whose other end is demarcated by the DRB1 locus. We sequenced approximately 1400 base pairs of the segment encompassing the DRB9 locus from eight human haplotypes (DR1, DR10, DR2, DR3, DR5, DR6, DR8, and DR9, the DR4 and DR7 having been sequenced by others earlier), as well as two chimpanzee, five gorillas, one orangutan and one macaque haplotype. The analysis of these sequences indicates that the DRB9 locus, which we estimate to be more than 58 million years (my) old, has been coevolving with the DRB1 locus for the last 4.2 my. As a consequence of this coevolution, the human DRB9 alleles fall into groups that correlate with the DRB1 allelic groups and with the gene organization of the human haplotypes. This observation implies that the present-day HLA-DR haplotype groups (DR1, DR51, DR52, DR8, and DR53) were founded more than 4 my ago and have remained intact (barring minor internal rearrangements that did not recombine the DRB1 and DRB9 genes) for this period of time. The haplotypes have been transmitted during speciations from ancestral to emerging species just like allelic lineages at the DRB1 locus. Thus not only allelic but also haplotype polymorphism evolves trans-specifically.

  12. Acute chest syndrome is associated with single nucleotide polymorphism-defined beta globin cluster haplotype in children with sickle cell anaemia

    PubMed Central

    Bean, Christopher J.; Boulet, Sheree L.; Yang, Genyan; Payne, Amanda B.; Ghaji, Nafisa; Pyle, Meredith E.; Hooper, W. Craig; Bhatnagar, Pallav; Keefer, Jeffrey; Barron-Casella, Emily A.; Casella, James F.; DeBaun, Michael R.

    2013-01-01

    Summary Genetic diversity at the human β-globin locus has been implicated as a modifier of sickle cell anaemia (SCA) severity. However, haplotypes defined by restriction fragment length polymorphism sites across the β-globin locus have not been consistently associated with clinical phenotypes. To define the genetic structure at the β-globin locus more thoroughly, we performed high-density single nucleotide polymorphism (SNP) mapping in 820 children who were homozygous for the sickle cell mutation (HbSS). Genotyping results revealed very high linkage disequilibrium across a large region spanning the locus control region and the HBB (β-globin gene) cluster. We identified three predominant haplotypes accounting for 96% of the βS-carrying chromosomes in this population that could be distinguished using a minimal set of common SNPs. Consistent with previous studies, fetal haemoglobin level was significantly associated with βS-haplotypes. After controlling for covariates, an association was detected between haplotype and rate of hospitalization for acute chest syndrome (ACS) (incidence rate ratio 0.51, 95% confidence interval 0.29–0.89) but not incidence rate of vaso-occlusive pain or presence of silent cerebral infarct (SCI). Our results suggest that these SNP-defined βS-haplotypes may be associated with ACS, but not pain or SCI in a study population of children with SCA. PMID:23952145

  13. Whole-genome sequencing suggests a chemokine gene cluster that modifies age at onset in familial Alzheimer's disease

    PubMed Central

    Lalli, M A; Bettcher, B M; Arcila, M L; Garcia, G; Guzman, C; Madrigal, L; Ramirez, L; Acosta-Uribe, J; Baena, A; Wojta, K J; Coppola, G; Fitch, R; de Both, M D; Huentelman, M J; Reiman, E M; Brunkow, M E; Glusman, G; Roach, J C; Kao, A W; Lopera, F; Kosik, K S

    2015-01-01

    We have sequenced the complete genomes of 72 individuals affected with early-onset familial Alzheimer's disease caused by an autosomal dominant, highly penetrant mutation in the presenilin-1 (PSEN1) gene, and performed genome-wide association testing to identify variants that modify age at onset (AAO) of Alzheimer's disease. Our analysis identified a haplotype of single-nucleotide polymorphisms (SNPs) on chromosome 17 within a chemokine gene cluster associated with delayed onset of mild-cognitive impairment and dementia. Individuals carrying this haplotype had a mean AAO of mild-cognitive impairment at 51.0±5.2 years compared with 41.1±7.4 years for those without these SNPs. This haplotype thus appears to modify Alzheimer's AAO, conferring a large (~10 years) protective effect. The associated locus harbors several chemokines including eotaxin-1 encoded by CCL11, and the haplotype includes a missense polymorphism in this gene. Validating this association, we found plasma eotaxin-1 levels were correlated with disease AAO in an independent cohort from the University of California San Francisco Memory and Aging Center. In this second cohort, the associated haplotype disrupted the typical age-associated increase of eotaxin-1 levels, suggesting a complex regulatory role for this haplotype in the general population. Altogether, these results suggest eotaxin-1 as a novel modifier of Alzheimer's disease AAO and open potential avenues for therapy. PMID:26324103

  14. Whole-genome sequencing suggests a chemokine gene cluster that modifies age at onset in familial Alzheimer's disease.

    PubMed

    Lalli, M A; Bettcher, B M; Arcila, M L; Garcia, G; Guzman, C; Madrigal, L; Ramirez, L; Acosta-Uribe, J; Baena, A; Wojta, K J; Coppola, G; Fitch, R; de Both, M D; Huentelman, M J; Reiman, E M; Brunkow, M E; Glusman, G; Roach, J C; Kao, A W; Lopera, F; Kosik, K S

    2015-11-01

    We have sequenced the complete genomes of 72 individuals affected with early-onset familial Alzheimer's disease caused by an autosomal dominant, highly penetrant mutation in the presenilin-1 (PSEN1) gene, and performed genome-wide association testing to identify variants that modify age at onset (AAO) of Alzheimer's disease. Our analysis identified a haplotype of single-nucleotide polymorphisms (SNPs) on chromosome 17 within a chemokine gene cluster associated with delayed onset of mild-cognitive impairment and dementia. Individuals carrying this haplotype had a mean AAO of mild-cognitive impairment at 51.0 ± 5.2 years compared with 41.1 ± 7.4 years for those without these SNPs. This haplotype thus appears to modify Alzheimer's AAO, conferring a large (~10 years) protective effect. The associated locus harbors several chemokines including eotaxin-1 encoded by CCL11, and the haplotype includes a missense polymorphism in this gene. Validating this association, we found plasma eotaxin-1 levels were correlated with disease AAO in an independent cohort from the University of California San Francisco Memory and Aging Center. In this second cohort, the associated haplotype disrupted the typical age-associated increase of eotaxin-1 levels, suggesting a complex regulatory role for this haplotype in the general population. Altogether, these results suggest eotaxin-1 as a novel modifier of Alzheimer's disease AAO and open potential avenues for therapy.

  15. Specific CAPN10 gene haplotypes influence the clinical profile of polycystic ovary patients.

    PubMed

    Gonzalez, Alejandro; Abril, Eduardo; Roca, Alfredo; Aragón, Maria José; Figueroa, Maria José; Velarde, Pilar; Ruiz, Rocío; Fayez, Omar; Galán, José Jorge; Herreros, José Antonio; Real, Luis Miguel; Ruiz, Agustín

    2003-11-01

    Recently, several research groups have evaluated CAPN10 gene in polycystic ovarian syndrome (PCOS) patients and other phenotypes, including hirsutism or intermediate phenotypes of PCOS. Molecular genetic analysis of CAPN10 gene indicates that different alleles may play a role in PCOS susceptibility and could be associated with idiopathic hirsutism. However, these observations are not exempt from controversy, because independent studies cannot replicate these preliminary findings. We present a haplotype-phenotype correlation study of CAPN10 haplotypes in 148 women showing ecographically detected polycystic ovaries (PCO) combined with one or more of these clinical symptoms: amenorrhea or severe oligomenorrhea, hyperandrogenism, and anovulatory infertility, as well as 93 unrelated controls. We have reconstructed and analyzed 482 CAPN10 haplotypes in patients and controls. We detected the association of UCSNP-44 allele with PCO phenotype in the Spanish population (P = 0.02). In addition, we identified several CAPN10 alleles associated to phenotypic differences observed between PCO patients, such as the presence of hypercholesterolemia (haplotype 1121, P = 0.005), presence of hyperandrogenic features (P = 0.05), and familial cancer incidence (haplotype 1111, P = 0.0005). Our results confirm the association of UCSNP-44 allele with PCO phenotype in the Spanish population. Moreover, we have identified novel candidate risk alleles and genotypes, within CAPN10 gene, that could be associated with important phenotypic and prognosis differences observed in PCOS patients.

  16. IDP-ASE: haplotyping and quantifying allele-specific expression at the gene and gene isoform level by hybrid sequencing

    PubMed Central

    Deonovic, Benjamin; Wang, Yunhao; Weirather, Jason; Wang, Xiu-Jie; Au, Kin Fai

    2017-01-01

    Abstract Allele-specific expression (ASE) is a fundamental problem in studying gene regulation and diploid transcriptome profiles, with two key challenges: (i) haplotyping and (ii) estimation of ASE at the gene isoform level. Existing ASE analysis methods are limited by a dependence on haplotyping from laborious experiments or extra genome/family trio data. In addition, there is a lack of methods for gene isoform level ASE analysis. We developed a tool, IDP-ASE, for full ASE analysis. By innovative integration of Third Generation Sequencing (TGS) long reads with Second Generation Sequencing (SGS) short reads, the accuracy of haplotyping and ASE quantification at the gene and gene isoform level was greatly improved as demonstrated by the gold standard data GM12878 data and semi-simulation data. In addition to methodology development, applications of IDP-ASE to human embryonic stem cells and breast cancer cells indicate that the imbalance of ASE and non-uniformity of gene isoform ASE is widespread, including tumorigenesis relevant genes and pluripotency markers. These results show that gene isoform expression and allele-specific expression cooperate to provide high diversity and complexity of gene regulation and expression, highlighting the importance of studying ASE at the gene isoform level. Our study provides a robust bioinformatics solution to understand ASE using RNA sequencing data only. PMID:27899656

  17. Lack of haplotype structuring for two candidate genes for trypanotolerance in cattle.

    PubMed

    Álvarez, I; Pérez-Pardal, L; Traoré, A; Fernández, I; Goyache, F

    2016-04-01

    Bovine trypanotolerance is a heritable trait associated to the ability of the individuals to control parasitaemia and anaemia. The INHBA (BTA4) and TICAM1 (BTA7) genes are strong candidates for trypanotolerance-related traits. The coding sequence of both genes (3951 bp in total) were analysed in a panel including 79 Asian, African and European cattle (Bos taurus and B. indicus) to identify naturally occurring polymorphisms on both genes. In general, the genetic diversity was low. Nineteen of the 33 mutations identified were found just one time. Seventeen different haplotypes were defined for the TICAM1 gene, and 9 and 12 were defined for the exon 1 and the exon 2 of the INHBA gene, respectively. There was no clear separation between cattle groups. The most frequent haplotypes identified in West African taurine samples were also identified in other cattle groups including Asian zebu and European cattle. Phylogenetic trees and principal component analysis confirmed that divergence among the cattle groups analysed was poor, particularly for the INHBA sequences. The European cattle subset had the lowest values of haplotype diversity for both the exon1 (monomorphic) and the exon2 (0.077 ± 0.066) of the INHBA gene. Neutrality tests, in general, did not suggest that the analysed genes were under positive selection. The assessed scenario would be consistent with the identification of recent mutations in evolutionary terms. © 2015 Blackwell Verlag GmbH.

  18. Haplotypes and gene expression implicate the MAPT region for Parkinson disease

    PubMed Central

    Tobin, J.E.; Latourelle, J.C.; Lew, M.F.; Klein, C.; Suchowersky, O.; Shill, H.A.; Golbe, L.I.; Mark, M.H.; Growdon, J.H.; Wooten, G.F.; Racette, B.A.; Perlmutter, J.S.; Watts, R.; Guttman, M.; Baker, K.B.; Goldwurm, S.; Pezzoli, G.; Singer, C.; Saint-Hilaire, M.H.; Hendricks, A.E.; Williamson, S.; Nagle, M.W.; Wilk, J.B.; Massood, T.; Laramie, J.M.; DeStefano, A.L.; Litvan, I.; Nicholson, G.; Corbett, A.; Isaacson, S.; Burn, D.J.; Chinnery, P.F.; Pramstaller, P.P.; Sherman, S.; Al-hinti, J.; Drasby, E.; Nance, M.; Moller, A.T.; Ostergaard, K.; Roxburgh, R.; Snow, B.; Slevin, J.T.; Cambi, F.; Gusella, J.F.; Myers, R.H.

    2009-01-01

    Background Microtubule-associated protein tau (MAPT) has been associated with several neurodegenerative disorders including forms of parkinsonism and Parkinson disease (PD). We evaluated the association of the MAPT region with PD in a large cohort of familial PD cases recruited by the GenePD Study. In addition, postmortem brain samples from patients with PD and neurologically normal controls were used to evaluate whether the expression of the 3-repeat and 4-repeat isoforms of MAPT, and neighboring genes Saitohin (STH) and KIAA1267, are altered in PD cerebellum. Methods Twenty-one single-nucleotide polymorphisms (SNPs) in the region of MAPT on chromosome 17q21 were genotyped in the GenePD Study. Single SNPs and haplotypes, including the H1 haplotype, were evaluated for association to PD. Relative quantification of gene expression was performed using real-time RT-PCR. Results After adjusting for multiple comparisons, SNP rs1800547 was significantly associated with PD affection. While the H1 haplotype was associated with a significantly increased risk for PD, a novel H1 subhaplotype was identified that predicted a greater increased risk for PD. The expression of 4-repeat MAPT, STH, and KIAA1267 was significantly increased in PD brains relative to controls. No difference in expression was observed for 3-repeat MAPT. Conclusions This study supports a role for MAPT in the pathogenesis of familial and idiopathic Parkinson disease (PD). Interestingly, the results of the gene expression studies suggest that other genes in the vicinity of MAPT, specifically STH and KIAA1267, may also have a role in PD and suggest complex effects for the genes in this region on PD risk. PMID:18509094

  19. The "Sardinian" HLA-A30,B18,DR3,DQw2 haplotype constantly lacks the 21-OHA and C4B genes. Is it an ancestral haplotype without duplication?

    PubMed

    Contu, L; Carcassi, C; Dausset, J

    1989-01-01

    The C4 and 21-OH loci of the class III HLA have been studied by specific DNA probes and the restriction enzyme Taq 1 in 24 unrelated Sardinian individuals selected from completely HLA-typed families. All 24 individuals had the HLA extended haplotype A30,Cw5,B18, BfF1,DR3,DRw52,DQw2, named "Sardinian" in the present paper because of its frequency of 15% in the Sardinian population. Eighteen of these were homozygous for the entire haplotype, and six were heterozygous at the A locus and blank (or homozygous) at all the other loci. In all completely homozygous cells and in four heterozygous cells at the A locus, the restriction fragments of the 21-OHA (3.2 kb) and C4B (5.8 kb or 5.4 kb) genes were absent, and the fragments of the C4A (7.0 kb) and 21-OHB (3.7 kb) genes were present. It is suggested that the "Sardinian" haplotype is an ancestral haplotype without duplication of the C4 and 21-OH genes, practically always identical in its structure, also in unrelated individuals. The diversity of this haplotype in the class III region (about 30 kb less) may be at least partially responsible for its misalignment with most haplotypes, which have duplicated C4 and 21-OH genes, and therefore also for its decreased probability to recombine. This can help explain its high stability and frequency in the Sardinian population. The same conclusion can be suggested for the Caucasian extended haplotype A1,B8,DR3 that always seems to lack the C4A and 21-OHA genes.

  20. Longitudinal analysis of haplotypes and polymorphisms of the APOA5 and APOC3 genes associated with variation in serum triglyceride levels: the Bogalusa Heart Study.

    PubMed

    Hallman, D Michael; Srinivasan, Sathanur R; Chen, Wei; Boerwinkle, Eric; Berenson, Gerald S

    2006-12-01

    Polymorphisms in the APOC3 and APOA5 genes, from the APOA1/APOC3/APOA4/APOA5 gene cluster on chromosome 11q23, have been associated with interindividual variation in plasma triglycerides. APOA5 polymorphisms implicated include 2 in the promoter region (-1131 T/C and -3 A/G) and 1 in exon 2 (+56 C/G). APOC3 polymorphisms implicated include 1 (SstI) in the 3' untranslated region and 1 (-2854 G/T) in the APOC3-APOA4 intergenic region. We analyzed the associations of haplotypes and multilocus genotypes of these polymorphisms on longitudinal serum triglyceride profiles in 360 African American and 823 white subjects from the Bogalusa Heart Study. Subjects were examined from 2 to 8 times (mean +/- SD, 5.4 +/- 1.3) between 1973 and 1996, at ages ranging from 4 to 38 years, with 1978 observations in African Americans and 4465 in whites. Serum triglycerides were significantly higher among whites across all ages. Allele frequencies differed significantly between African Americans and whites at all but the APOA5 +56 C/G locus. Linkage disequilibrium among the loci was higher in whites and haplotype diversity lower: 6 haplotypes had estimated frequencies of more than 1% in African Americans, 5 in whites. Individually, all polymorphisms except APOC3 -2854 G/T showed significant associations with triglyceride levels in the full sample. However, genotype models including all 5 loci showed significant triglyceride associations for only 3 (APOC3 SstI, APOA5 -1131 T/C, and APOA5 +56 C/G); significant interactions among them indicated their effects were not independent. Neither APOC3 -2854 G/T nor APOA5 -3 A/G had significant effects when the other 3 loci were in the models. The EM algorithm was used to estimate haplotype frequencies and assign haplotype probabilities to individuals, which is conditional on their genotypes; individuals' haplotype probability vectors were then used as predictors in multilevel mixed models of longitudinal triglyceride profiles. Of haplotypes comprising

  1. Catechol-O-methyltransferase gene haplotypes in Mexican and Spanish patients with fibromyalgia

    PubMed Central

    Vargas-Alarcón, Gilberto; Fragoso, José-Manuel; Cruz-Robles, David; Vargas, Angélica; Vargas, Alfonso; Lao-Villadóniga, José-Ignacio; García-Fructuoso, Ferrán; Ramos-Kuri, Manuel; Hernández, Fernando; Springall, Rashidi; Bojalil, Rafael; Vallejo, Maite; Martínez-Lavín, Manuel

    2007-01-01

    Autonomic dysfunction is frequent in patients with fibromyalgia (FM). Heart rate variability analyses have demonstrated signs of ongoing sympathetic hyperactivity. Catecholamines are sympathetic neurotransmitters. Catechol-O-methyltransferase (COMT), an enzyme, is the major catecholamine-clearing pathway. There are several single-nucleotide polymorphisms (SNPs) in the COMT gene associated with the different catecholamine-clearing abilities of the COMT enzyme. These SNPs are in linkage disequilibrium and segregate as 'haplotypes'. Healthy females with a particular COMT gene haplotype (ACCG) producing a defective enzyme are more sensitive to painful stimuli. The objective of our study was to define whether women with FM, from two different countries (Mexico and Spain), have the COMT gene haplotypes that have been previously associated with greater sensitivity to pain. All the individuals in the study were female. Fifty-seven Mexican patients and 78 Spanish patients were compared with their respective healthy control groups. All participants filled out the Fibromyalgia Impact Questionnaire (FIQ). Six COMT SNPs (rs2097903, rs6269, rs4633, rs4818, rs4680, and rs165599) were genotyped from peripheral blood DNA. In Spanish patients, there was a significant association between three SNPs (rs6269, rs4818, and rs4680) and the presence of FM when compared with healthy controls. Moreover, in Spanish patients with the 'high pain sensitivity' haplotype (ACCG), the disease, as assessed by the FIQ, was more severe. By contrast, Mexican patients displayed only a weak association between rs6269 and rs165599, and some FIQ subscales. In our group of Spanish patients, there was an association between FM and the COMT haplotype previously associated with high pain sensitivity. This association was not observed in Mexican patients. Studies with a larger sample size are needed in order to verify or amend these preliminary results. PMID:17961261

  2. Specific haplotypes of the CALPAIN-5 gene are associated with polycystic ovary syndrome.

    PubMed

    González, A; Sáez, M E; Aragón, M J; Galán, J J; Vettori, P; Molina, L; Rubio, C; Real, L M; Ruiz, Agustín; Ramírez-Lorca, R

    2006-04-01

    Polycystic ovary syndrome (PCOS) is a common endocrine disorder in women of reproductive age. The aim of the present study was to investigate the role of CALPAIN-5 (CAPN5) gene in PCOS susceptibility. We analysed four intronic polymorphisms of the CAPN5 gene in 148 well-characterized women with PCOS and 606 unrelated controls. We performed a case-control study and an intracohort analysis of clinical characteristics associated with PCOS. Analysis of haplotypes distribution between PCOS population compared to controls showed a strong deviation (P = 0.00029). The haplotypes GGCA and GGTG were overrepresented in PCOS patients (P = 0.009 and P = 0.001, respectively). In addition, we identified several CAPN5 haplotypes associated with phenotypic differences observed between PCOS patients, such as the presence of obesity (P = 0.02), cardiovascular complications (P = 0.02), familial antecedents of obesity (P = 0.003) and of hypertension (P = 0.007) and type 2 diabetes mellitus aggregation (P = 0.04). These results suggest a role of CAPN5 gene in PCOS susceptibility in humans. Moreover, novel candidate risk alleles have been identified, within CAPN5 gene, which could be associated with important phenotypic and prognosis differences observed in PCOS patients.

  3. A Haplotype Information Theory Method Reveals Genes of Evolutionary Interest in European vs. Asian Pigs.

    PubMed

    Hudson, Nicholas J; Naval-Sánchez, Marina; Porto-Neto, Laercio; Pérez-Enciso, Miguel; Reverter, Antonio

    2018-06-05

    Asian and European wild boars were independently domesticated ca. 10,000 years ago. Since the 17th century, Chinese breeds have been imported to Europe to improve the genetics of European animals by introgression of favourable alleles, resulting in a complex mosaic of haplotypes. To interrogate the structure of these haplotypes further, we have run a new haplotype segregation analysis based on information theory, namely compression efficiency (CE). We applied the approach to sequence data from individuals from each phylogeographic region (n = 23 from Asia and Europe) including a number of major pig breeds. Our genome-wide CE is able to discriminate the breeds in a manner reflecting phylogeography. Furthermore, 24,956 non-overlapping sliding windows (each comprising 1,000 consecutive SNP) were quantified for extent of haplotype sharing within and between Asia and Europe. The genome-wide distribution of extent of haplotype sharing was quite different between groups. Unlike European pigs, Asian pigs haplotype sharing approximates a normal distribution. In line with this, we found the European breeds possessed a number of genomic windows of dramatically higher haplotype sharing than the Asian breeds. Our CE analysis of sliding windows capture some of the genomic regions reported to contain signatures of selection in domestic pigs. Prominent among these regions, we highlight the role of a gene encoding the mitochondrial enzyme LACTB which has been associated with obesity, and the gene encoding MYOG a fundamental transcriptional regulator of myogenesis. The origin of these regions likely reflects either a population bottleneck in European animals, or selective targets on commercial phenotypes reducing allelic diversity in particular genes and/or regulatory regions.

  4. Linkage Disequilibrium and Haplotype Diversity in the Genes of the Renin–Angiotensin System: Findings From the Family Blood Pressure Program

    PubMed Central

    Zhu, Xiaofeng; Yan, Denise; Cooper, Richard S.; Luke, Amy; Ikeda, Morna A.; Chang, Yen-Pei C.; Weder, Alan; Chakravarti, Aravinda

    2003-01-01

    Association studies of candidate genes with complex traits have generally used one or a few single nucleotide polymorphisms (SNPs), although variation in the extent of linkage disequilibrium (LD) within genes markedly influences the sensitivity and precision of association studies. The extent of LD and the underlying haplotype structure for most candidate genes are still unavailable. We sampled 193 blacks (African-Americans) and 160 whites (European-Americans) and estimated the intragenic LD and the haplotype structure in four genes of the renin–angiotensin system. We genotyped 25 SNPs, with all but one of the pairs spaced between 1 and 20 kb, thus providing resolution at small scale. The pattern of LD within a gene was very heterogeneous. Using a robust method to define haplotype blocks, blocks of limited haplotype diversity were identified at each locus; between these blocks, LD was lost owing to the history of recombination events. As anticipated, there was less LD among blacks, the number of haplotypes was substantially larger, and shorter haplotype segments were found, compared with whites. These findings have implications for candidate-gene association studies and indicate that variation between populations of European and African origin in haplotype diversity is characteristic of most genes. [The sequence data described in this paper are available in GenBank under the following accession nos: AGT, MIM 106150; Renin, MIM 179820; ACE, MIM 106180; Angiotensin receptor I, MIM 106165. Supplementary material is available online at http://www.genome.org.] PMID:12566395

  5. Identification and genetic effect of haplotype in the bovine BMP7 gene.

    PubMed

    Huang, Yong-Zhen; Wang, Xin-Lei; He, Hua; Lan, Xian-Yong; Lei, Chu-Zhao; Zhang, Chun-Lei; Chen, Hong

    2013-12-15

    Bone morphogenetic proteins (BMPs) are peptide growth factors belonging to the transforming growth factor-beta (TGF-β) superfamily, and some members of the BMP family support white adipocyte differentiation. In this study, we focused on the BMP7 which singularly promotes the differentiation of brown preadipocytes. Haplotypes involving 5 single nucleotide polymorphism (SNP) sites in the bovine BMP7 gene were identified and their effect on body weight was analyzed. 16 haplotypes and 18 combined haplotypes were revealed and the linkage disequilibrium was assessed in the cattle population with 602 individuals representing three main cattle breeds from China. The results showed that haplotypes 3, 10 and 14 were predominant and accounted for 75.64%, 69.85%, and 83.36% in Nanyang, Qinchuan and Jiaxian cattle breeds, respectively. The statistical analyses indicated that the SNP 1, 4, and 5 are associated with the body weight, body length, and heart girth at 12 and 24 months in Nanyang cattle population (P<0.05), whereas there is no significant association between their 16 haplotypes and 18 combined haplotypes. Our results provide evidence that some SNPs and haplotypes in BMP7 are associated with growth traits, and may be utilized as a genetic marker in marker-assisted selection for beef cattle breeding programs. Copyright © 2013. Published by Elsevier B.V.

  6. A Candidate Trans-acting Modulator of Fetal Hemoglobin Gene Expression in the Arab-Indian Haplotype of Sickle Cell Anemia

    PubMed Central

    Vathipadiekal, Vinod; Farrell, John J.; Wang, Shuai; Edward, Heather L.; Shappell, Heather; Al-Rubaish, A.M.; Al-Muhanna, Fahad; Naserullah, Z.; Alsuliman, A.; Qutub, Hatem Othman; Simkin, Irene; Farrer, Lindsay A.; Jiang, Zhihua; Luo, Hong-Yuan; Huang, Shengwen; Mostoslavsky, Gustavo; Murphy, George J.; Patra, Pradeep.K.; Chui, David H.K.; Alsultan, Abdulrahman; Al-Ali, Amein K.; Sebastiani, Paola.; Steinberg, Martin. H.

    2016-01-01

    Fetal hemoglobin (HbF) levels are higher in the Arab-Indian (AI) β-globin gene haplotype of sickle cell anemia compared with African-origin haplotypes. To study genetic elements that effect HbF expression in the AI haplotype we completed whole genome sequencing in 14 Saudi AI haplotype sickle hemoglobin homozygotes—seven selected for low HbF (8.2±1.3%) and seven selected for high HbF (23.5±.2.6%). An intronic single nucleotide polymorphism (SNP) in ANTXR1, an anthrax toxin receptor (chromosome 2p13), was associated with HbF. These results were replicated in two independent Saudi AI haplotype cohorts of 120 and 139 patients, but not in 76 Saudi Benin haplotype, 894 African origin haplotype and 44 Arab Indian haplotype patients of Indian descent, suggesting that this association is effective only in the Saudi AI haplotype background. ANTXR1 variants explained 10% of the HbF variability compared with 8% for BCL11A. These two genes had independent, additive effects on HbF and together explained about 15% of HbF variability in Saudi AI sickle cell anemia patients. ANTXR1 was expressed at mRNA and protein levels in erythroid progenitors derived from induced pluripotent stem cells (iPSCs) and CD34+ cells. As CD34+ cells matured and their HbF decreased ANTXR1 expression increased; as iPSCs differentiated and their HbF increased, ANTXR1 expression decreased. Along with elements in cis to the HbF genes, ANTXR1 contributes to the variation in HbF in Saudi AI haplotype sickle cell anemia and is the first gene in trans to HBB that is associated with HbF only in carriers of the Saudi AI haplotype. PMID:27501013

  7. Haplotype Analysis in Multiple Crosses to Identify a QTL Gene

    PubMed Central

    Wang, Xiaosong; Korstanje, Ron; Higgins, David; Paigen, Beverly

    2004-01-01

    Identifying quantitative trait locus (QTL) genes is a challenging task. Herein, we report using a two-step process to identify Apoa2 as the gene underlying Hdlq5, a QTL for plasma high-density lipoprotein cholesterol (HDL) levels on mouse chromosome 1. First, we performed a sequence analysis of the Apoa2 coding region in 46 genetically diverse mouse strains and found five different APOA2 protein variants, which we named APOA2a to APOA2e. Second, we conducted a haplotype analysis of the strains in 21 crosses that have so far detected HDL QTLs; we found that Hdlq5 was detected only in the nine crosses where one parent had the APOA2b protein variant characterized by an Ala61-to-Val61 substitution. We then found that strains with the APOA2b variant had significantly higher (P ≤ 0.002) plasma HDL levels than those with either the APOA2a or the APOA2c variant. These findings support Apoa2 as the underlying Hdlq5 gene and suggest the Apoa2 polymorphisms responsible for the Hdlq5 phenotype. Therefore, haplotype analysis in multiple crosses can be used to support a candidate QTL gene. PMID:15310659

  8. Haplotype analysis in multiple crosses to identify a QTL gene.

    PubMed

    Wang, Xiaosong; Korstanje, Ron; Higgins, David; Paigen, Beverly

    2004-09-01

    Identifying quantitative trait locus (QTL) genes is a challenging task. Herein, we report using a two-step process to identify Apoa2 as the gene underlying Hdlq5, a QTL for plasma high-density lipoprotein cholesterol (HDL) levels on mouse chromosome 1. First, we performed a sequence analysis of the Apoa2 coding region in 46 genetically diverse mouse strains and found five different APOA2 protein variants, which we named APOA2a to APOA2e. Second, we conducted a haplotype analysis of the strains in 21 crosses that have so far detected HDL QTLs; we found that Hdlq5 was detected only in the nine crosses where one parent had the APOA2b protein variant characterized by an Ala61-to-Val61 substitution. We then found that strains with the APOA2b variant had significantly higher (P < or = 0.002) plasma HDL levels than those with either the APOA2a or the APOA2c variant. These findings support Apoa2 as the underlying Hdlq5 gene and suggest the Apoa2 polymorphisms responsible for the Hdlq5 phenotype. Therefore, haplotype analysis in multiple crosses can be used to support a candidate QTL gene.

  9. Evidence of triple mutant Pfdhps ISGNGA haplotype in Plasmodium falciparum isolates from North-east India: An analysis of sulfadoxine resistant haplotype selection.

    PubMed

    Das, Manuj K; Chetry, Sumi; Kalita, Mohan C; Dutta, Prafulla

    2016-12-01

    North-east region of India has consistent role in the spread of multi drug resistant Plasmodium (P.) falciparum to other parts of Southeast Asia. After rapid clinical treatment failure of Artemisinin based combination therapy-Sulphadoxine/Pyrimethamine (ACT-SP) chemoprophylaxis, Artemether-Lumefantrine (ACT-AL) combination therapy was introduced in the year 2012 in this region for the treatment of uncomplicated P. falciparum malaria. In a DNA sequencing based polymorphism analysis, seven codons of P. falciparum dihydropteroate synthetase ( Pf dhps) gene were screened in a total of 127 P. falciparum isolates collected from Assam, Arunachal Pradesh and Tripura of North-east India during the year 2014 and 2015 to document current sulfadoxine resistant haplotypes. Sequences were analyzed to rearrange both nucleotide and protein haplotypes. Molecular diversity indices were analyzed in DNA Sequence Polymorphism software (DnaSP) on the basis of Pf dhps gene sequences. Disappearance from selective neutrality was assessed based on the ratio of non-synonomous to synonomous nucleotide substitutions [dN/dS ratio]. Moreover, two-tailed Z test was performed in search of the significance for probability of rejecting null hypothesis of strict neutrality [dN = dS]. Presence of mutant P. falciparum multidrug resistance protein1 ( Pf mdr1) was also checked in those isolates that were present with new Pf dhps haplotypes. Phylogenetic relationship based on Pf dhps gene was reconstructed in Molecular Evolutionary Genetics Analysis (MEGA). Among eight different sulfadoxine resistant haplotypes found, IS GNG A haplotype was documented in a total of five isolates from Tripura with association of a new mutant M538 R allele. Sequence analysis of Pf mdr1 gene in these five isolates came to notice that not all but only one isolate was mutant at codon 86 (N86 Y ; Y YSND) in the multidrug resistance protein. Molecular diversity based on Pf dhps haplotypes revealed that P. falciparum

  10. Genomic association for sexual precocity in beef heifers using pre-selection of genes and haplotype reconstruction

    PubMed Central

    Barbero, Marina M. D.; Oliveira, Henrique N.; de Camargo, Gregório M. F.; Fernandes Júnior, Gerardo A.; Aspilcueta-Borquis, Rusbel R.; Souza, Fabio R. P.; Boligon, Arione A.; Melo, Thaise P.; Regatieri, Inaê C.; Feitosa, Fabieli L. B.; Fonseca, Larissa F. S.; Magalhães, Ana F. B.; Costa, Raphael B.; Albuquerque, Lucia G.

    2018-01-01

    Reproductive traits are of the utmost importance for any livestock farming, but are difficult to measure and to interpret since they are influenced by various factors. The objective of this study was to detect associations between known polymorphisms in candidate genes related to sexual precocity in Nellore heifers, which could be used in breeding programs. Records of 1,689 precocious and non-precocious heifers from farms participating in the Conexão Delta G breeding program were analyzed. A subset of single nucleotide polymorphisms (SNP) located in the region of the candidate genes at a distance of up to 5 kb from the boundaries of each gene, were selected from the panel of 777,000 SNPs of the High-Density Bovine SNP BeadChip. Linear mixed models were used for statistical analysis of early heifer pregnancy, relating the trait with isolated SNPs or with haplotype groups. The model included the contemporary group (year and month of birth) as fixed effect and parent of the animal (sire effect) as random effect. The fastPHASE® and GenomeStudio® were used for reconstruction of the haplotypes and for analysis of linkage disequilibrium based on r2 statistics. A total of 125 candidate genes and 2,024 SNPs forming haplotypes were analyzed. Statistical analysis after Bonferroni correction showed that nine haplotypes exerted a significant effect (p<0.05) on sexual precocity. Four of these haplotypes were located in the Pregnancy-associated plasma protein-A2 gene (PAPP-A2), two in the Estrogen-related receptor gamma gene (ESRRG), and one each in the Pregnancy-associated plasma protein-A gene (PAPP-A), Kell blood group complex subunit-related family (XKR4) and mannose-binding lectin genes (MBL-1) genes. Although the present results indicate that the PAPP-A2, PAPP-A, XKR4, MBL-1 and ESRRG genes influence sexual precocity in Nellore heifers, further studies are needed to evaluate their possible use in breeding programs. PMID:29293544

  11. Genomic association for sexual precocity in beef heifers using pre-selection of genes and haplotype reconstruction.

    PubMed

    Takada, Luciana; Barbero, Marina M D; Oliveira, Henrique N; de Camargo, Gregório M F; Fernandes Júnior, Gerardo A; Aspilcueta-Borquis, Rusbel R; Souza, Fabio R P; Boligon, Arione A; Melo, Thaise P; Regatieri, Inaê C; Feitosa, Fabieli L B; Fonseca, Larissa F S; Magalhães, Ana F B; Costa, Raphael B; Albuquerque, Lucia G

    2018-01-01

    Reproductive traits are of the utmost importance for any livestock farming, but are difficult to measure and to interpret since they are influenced by various factors. The objective of this study was to detect associations between known polymorphisms in candidate genes related to sexual precocity in Nellore heifers, which could be used in breeding programs. Records of 1,689 precocious and non-precocious heifers from farms participating in the Conexão Delta G breeding program were analyzed. A subset of single nucleotide polymorphisms (SNP) located in the region of the candidate genes at a distance of up to 5 kb from the boundaries of each gene, were selected from the panel of 777,000 SNPs of the High-Density Bovine SNP BeadChip. Linear mixed models were used for statistical analysis of early heifer pregnancy, relating the trait with isolated SNPs or with haplotype groups. The model included the contemporary group (year and month of birth) as fixed effect and parent of the animal (sire effect) as random effect. The fastPHASE® and GenomeStudio® were used for reconstruction of the haplotypes and for analysis of linkage disequilibrium based on r2 statistics. A total of 125 candidate genes and 2,024 SNPs forming haplotypes were analyzed. Statistical analysis after Bonferroni correction showed that nine haplotypes exerted a significant effect (p<0.05) on sexual precocity. Four of these haplotypes were located in the Pregnancy-associated plasma protein-A2 gene (PAPP-A2), two in the Estrogen-related receptor gamma gene (ESRRG), and one each in the Pregnancy-associated plasma protein-A gene (PAPP-A), Kell blood group complex subunit-related family (XKR4) and mannose-binding lectin genes (MBL-1) genes. Although the present results indicate that the PAPP-A2, PAPP-A, XKR4, MBL-1 and ESRRG genes influence sexual precocity in Nellore heifers, further studies are needed to evaluate their possible use in breeding programs.

  12. Two haplotype clusters of Echinococcus granulosus sensu stricto in northern Iraq (Kurdistan region) support the hypothesis of a parasite cradle in the Middle East.

    PubMed

    Hassan, Zuber Ismael; Meerkhan, Azad Abdullah; Boufana, Belgees; Hama, Abdullah A; Ahmed, Bayram Dawod; Mero, Wijdan Mohammed Salih; Orsten, Serra; Interisano, Maria; Pozio, Edoardo; Casulli, Adriano

    2017-08-01

    Human cystic echinococcosis (CE) caused by Echinococcus granulosus s.s. is a major public health problem in Iraqi Kurdistan with a reported surgical incidence of 6.3 per 100,000 Arbil inhabitants. A total of 125 Echinococcus isolates retrieved from sheep, goats and cattle were used in this study. Our aim was to determine species/genotypes infecting livestock in Iraqi Kurdistan and examine intraspecific variation and population structure of Echinococcus granulosus s.s. in this region and relate it to that of other regions worldwide. Using nucleotide sequences of the mitochondrial cytochrome c oxidase subunit 1 (cox 1) we identified E. granulosus s.s. as the cause of hydatidosis in all examined animals. The haplotype network displayed a double-clustered topology with two main E. granulosus s.s. haplotypes, (KU05) and (KU33). The 'founder' haplotype (KU05) confirmed the presence of a common lineage of non-genetically differentiated populations as inferred by the low non-significant fixation index values. Overall diversity and neutrality indices indicated demographic expansion. We used E. granulosus s.s. nucleotide sequences from GenBank to draw haplotype networks for the Middle East (Iran, Jordan and Turkey), Europe (Albania, Greece, Italy, Romania and Spain), China, Mongolia, Russia, South America (Argentina, Brazil, Chile and Mexico) and Tunisia. Networks with two haplotype clusters like that reported here for Iraqi Kurdistan were seen for the Middle East, Europe, Mongolia, Russia and Tunisia using both 827bp and 1609bp cox1 nucleotide sequences, whereas a star-like network was observed for China and South America. We hypothesize that the double clustering seen at what is generally assumed to be the cradle of domestication may have emerged independently and dispersed from the Middle East to other regions and that haplotype (KU33) may be the main haplotype within a second cluster in the Middle East from where it has spread into Europe, Mongolia, Russia and North

  13. Discovery, evaluation and distribution of haplotypes of the wheat Ppd-D1 gene.

    PubMed

    Guo, Zhiai; Song, Yanxia; Zhou, Ronghua; Ren, Zhenglong; Jia, Jizeng

    2010-02-01

    Ppd-D1 is one of the most potent genes affecting the photoperiod response of wheat (Triticum aestivum). Only two alleles, insensitive Ppd-D1a and sensitive Ppd-D1b, were known previously, and these did not adequately explain the broad adaptation of wheat to photoperiod variation. In this study, five diagnostic molecular markers were employed to identify Ppd-D1 haplotypes in 492 wheat varieties from diverse geographic locations and 55 accessions of Aegilops tauschii, the D genome donor species of wheat. Six Ppd-D1 haplotypes, designated I-VI, were identified. Types II, V and VI were considered to be more ancient and types I, III and IV were considered to be derived from type II. The transcript abundances of the Ppd-D1 haplotypes showed continuous variation, being highest for haplotype I, lowest for haplotype III, and correlating negatively with varietal differences in heading time. These haplotypes also significantly affected other agronomic traits. The distribution frequency of Ppd-D1 haplotypes showed partial correlations with both latitudes and altitudes of wheat cultivation regions. The evolution, expression and distribution of Ppd-D1 haplotypes were consistent evidentially with each other. What was regarded as a pair of alleles in the past can now be considered a series of alleles leading to continuous variation.

  14. Haplotype diversity of the myostatin gene among beef cattle breeds

    PubMed Central

    Dunner, Susana; Miranda, M Eugenia; Amigues, Yves; Cañón, Javier; Georges, Michel; Hanset, Roger; Williams, John; Ménissier, François

    2003-01-01

    A total of 678 individuals from 28 European bovine breeds were both phenotyped and analysed at the myostatin locus by the Single Strand Conformation Polymorphism (SSCP) method. Seven new mutations were identified which contribute to the high polymorphism (1 SNP every 100 bp) present in this small gene; twenty haplotypes were described and a genotyping method was set up using the Oligonucleotide Ligation Assay (OLA) method. Some haplotypes appeared to be exclusive to a particular breed; this was the case for 5 in the Charolaise (involving mutation Q204X) and 7 in the Maine-Anjou (involving mutation E226X). The relationships between the different haplotypes were studied, thus allowing to test the earlier hypothesis on the origin of muscular hypertrophy in Europe: muscular hypertrophy (namely nt821(del11)) was mainly spread in different waves from northern Europe milk purpose populations in most breeds; however, other mutations (mostly disruptive) arose in a single breed, were highly selected and have since scarcely evolved to other populations. PMID:12605853

  15. [Association between the methylenetetrahydrofolate reductase gene polymorphisms and haplotype with toxicity response of high dose methotrexate chemotherapy].

    PubMed

    Liao, Qing-Chuan; Li, Xiao-Lei; Liu, Si-Ting; Zhang, Yong; Li, Tian-Yuan; Qiu, Jin-Chun

    2012-07-01

    To investigate the association between single nucleotide polymorphisms (SNP) and its haplotypes of methylenetetrahydrofolate reductase (MTHFR) gene with high dose methotrexate (HDMTX)-induced toxicity in children with acute lymphoblastic leukemia (ALL). HDMTX-treated children with ALL (1.2 to 14-years old) were selected from inpatient and followed for a retrospective study. The toxicity response of HDMTX chemotherapy was evaluated using WHO common toxicity criteria. Sixty-one patients with therapy-related toxicity and 36 patients without therapy-related toxicity were genotyped for 2 SNP (677C > T and 1298A > C) of the MTHFR gene by polymerase chain reaction-restriction fragment length polymorphism. Frequency of haplotypes and linkage disequilibrium of MTHFR gene were analyzed by SHEsis program. The distribution of MTHFR gene 677C > T polymorphism did not appeare different between groups with or without toxicity response (χ(2) = 4.609, P = 0.100), but the 1298A > C polymorphism was significantly different (χ(2) = 10.192, P = 0.006). Individuals who carried C allele (AC + CC genotype) had a decreased risk of toxicity response compared to AA genotype (OR = 0.245, 95%CI: 0.099 - 0.607, P = 0.002). 677C > T and 1298A > C polymorphisms showed strong linkage disequilibrium (D' = 0.895). The CC haplotype was significantly associated with decreased risk of toxicity response (OR = 0.338, 95%CI: 0.155 - 0.738, P = 0.005), while the TA haplotype was significantly associated with the increased risk of toxicity response (OR = 1.907, 95%CI: 1.045 - 3.482, P = 0.035). MTHFR gene 1298C allele and CC haplotype might serve as protective factors while TA haplotype as a risk factor for the susceptibility to toxicity response of HDMTX chemotherapy in children with ALL.

  16. Unraveling Haplotype Diversity of the Apical Membrane Antigen-1 Gene in Plasmodium falciparum Populations in Thailand

    PubMed Central

    Lumkul, Lalita; Sawaswong, Vorthon; Simpalipan, Phumin; Kaewthamasorn, Morakot; Harnyuttanakorn, Pongchai; Pattaradilokrat, Sittiporn

    2018-01-01

    Development of an effective vaccine is critically needed for the prevention of malaria. One of the key antigens for malaria vaccines is the apical membrane antigen 1 (AMA-1) of the human malaria parasite Plasmodium falciparum, the surface protein for erythrocyte invasion of the parasite. The gene encoding AMA-1 has been sequenced from populations of P. falciparum worldwide, but the haplotype diversity of the gene in P. falciparum populations in the Greater Mekong Subregion (GMS), including Thailand, remains to be characterized. In the present study, the AMA-1 gene was PCR amplified and sequenced from the genomic DNA of 65 P. falciparum isolates from 5 endemic areas in Thailand. The nearly full-length 1,848 nucleotide sequence of AMA-1 was subjected to molecular analyses, including nucleotide sequence diversity, haplotype diversity and deduced amino acid sequence diversity and neutrality tests. Phylogenetic analysis and pairwise population differentiation (Fst indices) were performed to infer the population structure. The analyses identified 60 single nucleotide polymorphic loci, predominately located in domain I of AMA-1. A total of 31 unique AMA-1 haplotypes were identified, which included 11 novel ones. The phylogenetic tree of the AMA-1 haplotypes revealed multiple clades of AMA-1, each of which contained parasites of multiple geographical origins, consistent with the Fst indices indicating genetic homogeneity or gene flow among geographically distinct populations of P. falciparum in Thailand’s borders with Myanmar, Laos and Cambodia. In summary, the study revealed novel haplotypes and population structure needed for the further advancement of AMA-1-based malaria vaccines in the GMS. PMID:29742870

  17. Mice, humans and haplotypes--the hunt for disease genes in SLE.

    PubMed

    Rigby, R J; Fernando, M M A; Vyse, T J

    2006-09-01

    Defining the polymorphisms that contribute to the development of complex genetic disease traits is a challenging, although increasingly tractable problem. Historically, the technical difficulties in conducting association studies across the entire human genome are such that murine models have been used to generate candidate genes for analysis in human complex diseases, such as SLE. In this article we discuss the advantages and disadvantages of this approach and specifically address some assumptions made in the transition from studying one species to another, using lupus as an example. These issues include differences in genetic structure and genetic organisation which are a reflection on the population history. Clearly there are major differences in the histories of the human population and inbred laboratory strains of mice. Both human and murine genomes do exhibit structure at the genetic level. That is to say, they comprise haplotypes which are genomic regions that carry runs of polymorphisms that are not independently inherited. Haplotypes therefore reduce the number of combinations of the polymorphisms in the DNA in that region and facilitate the identification of disease susceptibility genes in both mice and humans. There are now novel means of generating candidate genes in SLE using mutagenesis (with ENU) in mice and identifying mice that generate antinuclear autoimmunity. In addition, murine models still provide a valuable means of exploring the functional consequences of genetic variation. However, advances in technology are such that human geneticists can now screen large fractions of the human genome for disease associations using microchip technologies that provide information on upwards of 100,000 different polymorphisms. These approaches are aimed at identifying haplotypes that carry disease susceptibility mutations and rely less on the generation of candidate genes.

  18. Variants and Haplotypes in Angiotensinogen Gene Are Associated With Plasmatic Angiotensinogen Level in Mexican Population

    PubMed Central

    Balam-Ortiz, Eros; Esquivel-Villarreal, Adolfo; Alfaro-Ruiz, Luis; Carrillo, Karol; Elizalde, Adela; Gil, Trinidad; Urushihara, Maki; Kobori, Hiroyuki; Jimenez-Sanchez, Gerardo

    2011-01-01

    Introduction The plasmatic angiotensinogen (AGT) level has been associated with essential hypertension. Linkage analysis has found a relationship between the AGT gene locus and hypertension in the Mexican-American population, but studies have failed to identify genetic variants associated with hypertension or plasma AGT levels. This study analyzes the relationship between polymorphisms in the AGT gene and plasmatic AGT levels in Mexican population. Methods Nine polymorphisms in AGT gene were genotyped, and plasma AGT level was determined by enzyme-linked immunosorbent assay. Results Differences in AGT plasma levels were associated with 2 polymorphisms: T-20G, TT = 25.3 ± 8.3 versus TG + GG = 21.6 ± 8.8 μg/mL; P = 0.008 and C3389T (T174M), CC = 25.8 ± 9.9 versus TC + TT = 20.5 ± 5.4 μg/mL; P = 0.0002. Haplotype 2 was associated with low plasma AGT (−5.1 μg/mL [95% confidence interval: −8.6 to −1.6], P = 0.004) and Haplotype 8 was associated with high plasma AGT (6.5 μg/mL [95% confidence interval: 2.5 to 10.6], P = 0.001). This association remained after adjustment for covariates. A Likelihood Ratio Test for haplotype-phenotype association adjusted for covariates resulted in χ2 = 38.9, P = 0.0005. The total effect of the haplotypes on plasma AGT level variance was 19.5%. No association was identified between haplotypes and quantitative traits of blood pressure. Conclusions Two polymorphisms (T-20G and C3389T) and 2 haplotypes (H2 and H8) showed an association with plasma AGT levels in Mexican population. PMID:21629041

  19. Mutations in the von Hippel-Lindau (VHL) tumor suppressor gene and VHL-haplotype analysis in patients with presumable congenital erythrocytosis.

    PubMed

    Cario, Holger; Schwarz, Klaus; Jorch, Norbert; Kyank, Ulrike; Petrides, Petro E; Schneider, Dominik T; Uhle, Renate; Debatin, Klaus-Michael; Kohne, Elisabeth

    2005-01-01

    Congenital erythrocytoses or polycythemias are rare and heterogeneous. A homozygous mutation (C598T->Arg200Trp) in the von Hippel-Lindau (VHL) gene was originally identified as the cause of the endemic Chuvash polycythemia. Subsequently this and other mutations in the VHL gene were also detected in several patients of different ethnic origin. Haplotype analyses of the VHL gene suggested a common origin for the Chuvash-type mutation. Thirty-four patients with presumable congenital erythrocytosis due to an unknown underlying disorder were examined for VHL gene mutations and VHL region haplotypes. Four patients were homozygous and one patient heterozygous for the Chuvash-type mutation. One additional patient presented a previously not described heterozygous mutation G311->T VHL in exon 1. The haplotype analyses were in agreement with recently published data for three of the four patients with homozygous mutations as well as for the patient with a heterozygous Chuvash-type mutation. One patient of Turkish origin with homozygous Chuvash-type mutation had a haplotype not previously found in individuals with Chuvash-type mutation. These results confirm that mutations in the VHL gene are responsible for a substantial proportion of patients with congenital erythrocytoses. Erythrocytoses due to a C598->T mutation of the VHL gene are not geographically restricted. The majority of patients with Chuvash polycythemia share a common VHL gene haplotype. The different haplotype in one of the patients with Chuvash-type mutation indicates that this mutation was not spread only from a single founder but developed independently in other individuals.

  20. The advantages of haplotype analysis of the promoter region of the human apolipoprotein E gene.

    PubMed

    McLaughlin, D P; Sharma, A; McGinley, A; Samra, G S

    2001-12-15

    Polymorphisms in the regulatory region of the human apolipoprotein E gene (gene, APOE; protein, apoE) have been implicated in Alzheimer's disease. Here we describe in detail the advantages of a simple method for haplotype analysis of this region (at -491 and -427 bases relative to the transcription start site of the gene). The promoter region of the APOE gene was amplified by polymerase chain reaction (PCR) and this fragment was then used as a template for PCR with "nested" primers to generate a 228-bp product incorporating both the -491 and the -427 loci. PCR products were then digested with DraI and AluI together and subjected to polyacrylamide gel electrophoresis. The distinct pattern of bands appearing on the gel was then used to ascribe [-491,-427] haplotypes to each subject, from which -491 and -427 genotypes were inferred. -491 and -427 genotypes were also confirmed by digestion with DraI alone or AluI alone. Haplotype analysis was successful in all 20 samples analyzed and was 100% consistent with genotyping. We suggest that this is a reliable, time-saving method that the will be useful in large-scale APOE promoter genotyping studies. (c)2001 Elsevier Science.

  1. PTCH1 gene haplotype association with basal cell carcinoma after transplantation.

    PubMed

    Begnini, A; Tessari, G; Turco, A; Malerba, G; Naldi, L; Gotti, E; Boschiero, L; Forni, A; Rugiu, C; Piaserico, S; Fortina, A B; Brunello, A; Cascone, C; Girolomoni, G; Gomez Lira, M

    2010-08-01

    Basal cell carcinoma (BCC) is 10 times more frequent in organ transplant recipients (OTRs) than in the general population. Factors in OTRs conferring increased susceptibility to BCC include ultraviolet radiation exposure, immunosuppression, viral infections such as human papillomavirus, phototype and genetic predisposition. The PTCH1 gene is a negative regulator of the hedgehog pathway, that provides mitogenic signals to basal cells in skin. PTCH1 gene mutations cause naevoid BCC syndrome, and contribute to the development of sporadic BCC and other types of cancers. Associations have been reported between PTCH1 polymorphisms and BCC susceptibility in nontransplanted individuals. To search for novel common polymorphisms in the proximal 5' regulatory region upstream of PTCH1 gene exon 1B, and to investigate the possible association of PTCH1 polymorphisms and haplotypes with BCC risk after organ transplantation. Three PTCH1 single nucleotide polymorphisms (rs2297086, rs2066836 and rs357564) were analysed by restriction fragment length polymorphism analysis in 161 northern Italian OTRs (56 BCC cases and 105 controls). Two regions of the PTCH1 gene promoter were screened by heteroduplex analysis in 30 cases and 30 controls. Single locus analysis showed no significant association. Haplotype T(1686)-T(3944) appeared to confer a significantly higher risk for BCC development (odds ratio 2.98, 95% confidence interval 2.55-3.48; P = 0.001). Two novel rare polymorphisms were identified at positions 176 and 179 of the 5'UTR. Two novel alleles of the -4 (CGG)(n) microsatellite were identified. No association of this microsatellite with BCC was observed. Haplotypes containing T(1686)-T(3944) alleles were shown to be associated with an increased BCC risk in our study population. These data appear to be of great interest for further investigations in a larger group of transplant individuals. Our results do not support the hypothesis that common polymorphisms in the proximal 5

  2. Transcriptome analysis reveals the same 17 S-locus F-box genes in two haplotypes of the self-incompatibility locus of Petunia inflata.

    PubMed

    Williams, Justin S; Der, Joshua P; dePamphilis, Claude W; Kao, Teh-Hui

    2014-07-01

    Petunia possesses self-incompatibility, by which pistils reject self-pollen but accept non-self-pollen for fertilization. Self-/non-self-recognition between pollen and pistil is regulated by the pistil-specific S-RNase gene and by multiple pollen-specific S-locus F-box (SLF) genes. To date, 10 SLF genes have been identified by various methods, and seven have been shown to be involved in pollen specificity. For a given S-haplotype, each SLF interacts with a subset of its non-self S-RNases, and an as yet unknown number of SLFs are thought to collectively mediate ubiquitination and degradation of all non-self S-RNases to allow cross-compatible pollination. To identify a complete suite of SLF genes of P. inflata, we used a de novo RNA-seq approach to analyze the pollen transcriptomes of S2-haplotype and S3-haplotype, as well as the leaf transcriptome of the S3S3 genotype. We searched for genes that fit several criteria established from the properties of the known SLF genes and identified the same seven new SLF genes in S2-haplotype and S3-haplotype, suggesting that a total of 17 SLF genes constitute pollen specificity in each S-haplotype. This finding lays the foundation for understanding how multiple SLF genes evolved and the biochemical basis for differential interactions between SLF proteins and S-RNases. © 2014 American Society of Plant Biologists. All rights reserved.

  3. Genetic variants on apolipoprotein gene cluster influence triglycerides with a risk of coronary artery disease among Indians.

    PubMed

    AshokKumar, Manickaraj; Subhashini, Navaneethan Gnana Veera; SaiBabu, Ramineni; Ramesh, Arabandi; Cherian, Kotturathu Mammen; Emmanuel, Cyril

    2010-01-01

    Apolipoprotein C3 and apolipoprotien A5 are proteins coded from the APOA1/C3/A4/A5 gene cluster. Sst I polymorphism on apolipoprotein C3 and -1131C polymorphism of apolipoprotien A5 are key variants involved in triglyceride metabolism and cause a significant cardio-metabolic risk. Here, we have evaluated these two variants for their roles in coronary artery disease in patients of the Indian population. The apolipoprotein gene cluster variants were analysed in 416 angiographically determined coronary artery disease patients and matched 416 controls using polymerase chain reaction-restriction fragment length polymorphism. The characteristics of the study subjects were analyzed statistically for their association with the polymorphisms. The alleles were combined as haplotypes and their combined risks were evaluated. The minor allele genotypes of both apolipoprotein C3 (S2) and apolipoprotien A5 (C) had a significant risk for coronary artery disease. The S2 allele genotyped patients had a significantly increased triglyceride level (P < 0.001) and increased triglycerides were observed among both patient and control CC genotype carriers. We identified the haplotype S2/C with a significant increased risk (P < 0.001) to coronary artery disease with increased levels of circulating triglycerides compared to other haplotypes in patients. We conclude that the variants on apolipoprotein C3 and apolipoprotien A5 modulate serum triglyceride levels and increase the risk of coronary artery disease.

  4. Clustering cancer gene expression data by projective clustering ensemble

    PubMed Central

    Yu, Xianxue; Yu, Guoxian

    2017-01-01

    Gene expression data analysis has paramount implications for gene treatments, cancer diagnosis and other domains. Clustering is an important and promising tool to analyze gene expression data. Gene expression data is often characterized by a large amount of genes but with limited samples, thus various projective clustering techniques and ensemble techniques have been suggested to combat with these challenges. However, it is rather challenging to synergy these two kinds of techniques together to avoid the curse of dimensionality problem and to boost the performance of gene expression data clustering. In this paper, we employ a projective clustering ensemble (PCE) to integrate the advantages of projective clustering and ensemble clustering, and to avoid the dilemma of combining multiple projective clusterings. Our experimental results on publicly available cancer gene expression data show PCE can improve the quality of clustering gene expression data by at least 4.5% (on average) than other related techniques, including dimensionality reduction based single clustering and ensemble approaches. The empirical study demonstrates that, to further boost the performance of clustering cancer gene expression data, it is necessary and promising to synergy projective clustering with ensemble clustering. PCE can serve as an effective alternative technique for clustering gene expression data. PMID:28234920

  5. Diametrical clustering for identifying anti-correlated gene clusters.

    PubMed

    Dhillon, Inderjit S; Marcotte, Edward M; Roshan, Usman

    2003-09-01

    Clustering genes based upon their expression patterns allows us to predict gene function. Most existing clustering algorithms cluster genes together when their expression patterns show high positive correlation. However, it has been observed that genes whose expression patterns are strongly anti-correlated can also be functionally similar. Biologically, this is not unintuitive-genes responding to the same stimuli, regardless of the nature of the response, are more likely to operate in the same pathways. We present a new diametrical clustering algorithm that explicitly identifies anti-correlated clusters of genes. Our algorithm proceeds by iteratively (i). re-partitioning the genes and (ii). computing the dominant singular vector of each gene cluster; each singular vector serving as the prototype of a 'diametric' cluster. We empirically show the effectiveness of the algorithm in identifying diametrical or anti-correlated clusters. Testing the algorithm on yeast cell cycle data, fibroblast gene expression data, and DNA microarray data from yeast mutants reveals that opposed cellular pathways can be discovered with this method. We present systems whose mRNA expression patterns, and likely their functions, oppose the yeast ribosome and proteosome, along with evidence for the inverse transcriptional regulation of a number of cellular systems.

  6. Myopia and Late-Onset Progressive Cone Dystrophy Associate to LVAVA/MVAVA Exon 3 Interchange Haplotypes of Opsin Genes on Chromosome X.

    PubMed

    Orosz, Orsolya; Rajta, István; Vajas, Attila; Takács, Lili; Csutak, Adrienne; Fodor, Mariann; Kolozsvári, Bence; Resch, Miklós; Sényi, Katalin; Lesch, Balázs; Szabó, Viktória; Berta, András; Balogh, István; Losonczy, Gergely

    2017-03-01

    Rare interchange haplotypes in exon 3 of the OPN1LW and OPN1MW opsin genes cause X-linked myopia, color vision defect, and cone dysfunction. The severity of the disease varies on a broad scale from nonsyndromic high myopia to blue cone monochromatism. Here, we describe a new genotype-phenotype correlation attributed to rare exon 3 interchange haplotypes simultaneously present in the long- and middle-wavelength sensitive opsin genes (L- and M-opsin genes). A multigenerational family with X-linked high myopia and cone dystrophy was investigated. Affected male patients had infantile onset myopia with normal visual acuity and color vision until their forties. Visual acuity decreased thereafter, along with the development of severe protan and deutan color vision defects. A mild decrease in electroretinography response of cone photoreceptors was detected in childhood, which further deteriorated in middle-aged patients. Rods were also affected, however, to a lesser extent than cones. Clinical exome sequencing identified the LVAVA and MVAVA toxic haplotypes in the OPN1LW and OPN1MW opsin genes, respectively. Here, we show that LVAVA haplotype of the OPN1LW gene and MVAVA haplotype of the OPN1MW gene cause apparently nonsyndromic high myopia in young patients but lead to progressive cone-rod dystrophy with deuteranopia and protanopia in middle-aged patients corresponding to a previously unknown disease course. To the best of our knowledge, this is the first report on the joint effect of these toxic haplotypes in the two opsin genes on chromosome X.

  7. Haplotype combination of the bovine INSIG1 gene sequence variants and association with growth traits in Nanyang cattle.

    PubMed

    Sun, Jiajie; Gao, Yuan; Liu, Dong; Ma, Wei; Xue, Jing; Zhang, Chunlei; Lan, Xianyong; Lei, Chuzhao; Chen, Hong

    2012-06-01

    The insulin-induced gene 1 (INSIG1) gene encodes a protein that blocks proteolytic activation of sterol regulatory element binding proteins, which are transcription factors that activate genes that regulate cholesterol, fatty acid, and glucose metabolism. However, similar research for the bovine INSIG1 gene is lacking. Therefore, in this study, polymorphisms of the bovine INSIG1 gene were detected in 643 individuals from four cattle breeds by DNA pooling, forced PCR-RFLP, PCR-SSCP, and DNA sequencing methods. Only 10 novel SNPs were identified, which included four mutations in the coding region and the others in the introns. In Nanyang individuals, seven common haplotypes were identified based on four coding region SNPs. The haplotype GACT, with a frequency of 75.4%, was the most prevalent haplotypes and SNPs formed two linkage disequilibrium blocks with strong multi-allelic D' (D' = 1). Additionally, association analysis between mutations of the bovine INSIG1 gene and growth traits in Nanyang cattle at 6, 12, 18, and 24 months old was performed, and the results indicated that the polymorphisms were not significantly associated with body mass.

  8. Short communication: casein haplotype variability in sicilian dairy goat breeds.

    PubMed

    Gigli, I; Maizon, D O; Riggio, V; Sardina, M T; Portolano, B

    2008-09-01

    In the Mediterranean region, goat milk production is an important economic activity. In the present study, 4 casein genes were genotyped in 5 Sicilian goat breeds to 1) identify casein haplotypes present in the Argentata dell'Etna, Girgentana, Messinese, Derivata di Siria, and Maltese goat breeds; and 2) describe the structure of the Sicilian goat breeds based on casein haplotypes and allele frequencies. In a sample of 540 dairy goats, 67 different haplotypes with frequency >or=0.01 and 27 with frequency >or=0.03 were observed. The most common CSN1S1-CSN2-CSN1S2-CSN3 haplotype for Derivata di Siria and Maltese was FCFB (0.17 and 0.22, respectively), whereas for Argentata dell'Etna, Girgentana and Messinese was ACAB (0.06, 0.23, and 0.10, respectively). According to the haplotype reconstruction, Argentata dell'Etna, Girgentana, and Messinese breeds presented the most favorable haplotype for cheese production, because the casein concentration in milk of these breeds might be greater than that in Derivata di Siria and Maltese breeds. Based on a cluster analysis, the breeds formed 2 main groups: Derivata di Siria, and Maltese in one group, and Argentata dell'Etna and Messinese in the other; the Girgentana breed was between these groups but closer to the latter.

  9. Haplotypes and effects on growth traits of bovine Wnt7a gene in Chinese Qinchuan cattle.

    PubMed

    Xue, Jing; Sun, Yujia; Guo, Wenjiao; Yang, Ziqi; Tian, Huibin; Zhang, Chunlei; Lei, Chuzhao; Lan, Xianyong; Chen, Hong

    2013-07-25

    Wnt7a is a member of the WNT gene family, which encodes secreted signaling proteins and responds to many biological processes. Specifically Wnt7a influences satellite stem cells and regulates the regenerative potential of the muscle. However, similar researches about the bovine Wnt7a gene are lacking. Therefore, in this study, polymorphisms of the bovine Wnt7a gene were detected in 488 individuals from Chinese Qinchuan cattle by DNA pooling, forced PCR-RFLP, and DNA sequencing methods. 3 novel SNPs were identified, two SNPs (g.T4926C and g.A21943G) were in the intron and the last one (g.C63777T) was in the exon. Five haplotypes involved in these three variant sites in the Wnt7a gene were identified and their effects on growth traits were analyzed. The results revealed that haplotype 1 had the highest haplotype frequencies and was highly significantly associated with body height (P<0.01), body weight (P<0.05), chest width (P<0.05) and height at hip cross (P<0.01) respectively. Copyright © 2013 Elsevier B.V. All rights reserved.

  10. Eda haplotypes in three-spined stickleback are associated with variation in immune gene expression

    PubMed Central

    Robertson, Shaun; Bradley, Janette E.; MacColl, Andrew D. C.

    2017-01-01

    Haplotypes underlying local adaptation and speciation are predicted to have numerous phenotypic effects, but few genes involved have been identified, with much work to date concentrating on visible, morphological, phenotypes. The link between genes controlling these adaptive morphological phenotypes and the immune system has seldom been investigated, even though changes in the immune system could have profound adaptive consequences. The Eda gene in three-spined stickleback is one of the best studied major adaptation genes; it directly controls bony plate architecture and has been associated with additional aspects of adaptation to freshwater. Here, we exposed F2 hybrids, used to separate Eda genotype from genetic background, to contrasting conditions in semi-natural enclosures. We demonstrate an association between the Eda haplotype block and the expression pattern of key immune system genes. Furthermore, low plated fish grew less and experienced higher burdens of a common ectoparasite with fitness consequences. Little is currently known about the role of the immune system in facilitating adaptation to novel environments, but this study provides an indication of its potential importance. PMID:28195171

  11. Haplotype combination of the bovine CFL2 gene sequence variants and association with growth traits in Qinchuan cattle.

    PubMed

    Sun, Yujia; Lan, Xianyong; Lei, Chuzhao; Zhang, Chunlei; Chen, Hong

    2015-06-01

    The aim of this study was to examine the association of cofilin2 (CFL2) gene polymorphisms with growth traits in Chinese Qinchuan cattle. Three single nucleotide polymorphisms (SNPs) were identified in the bovine CFL2 gene using DNA sequencing and (forced) PCR-RFLP methods. These polymorphisms included a missense mutation (NC_007319.5: g. C 2213 G) in exon 4, one synonymous mutation (NC_007319.5: g. T 1694 A) in exon 4, and a mutation (NC_007319.5: g. G 1500 A) in intron 2, respectively. In addition, we evaluated the haplotype frequency and linkage disequilibrium coefficient of three sequence variants in 488 individuals in QC cattle. All the three SNPs in QC cattle belonged to an intermediate level of genetic diversity (0.25Haplotype analysis of three SNPs showed that 8 different haplotypes were identified in all, but only 5 haplotypes were listed except for those with a frequency of <0.03. Hap4 (-GTC-) had the highest haplotype frequencies (34.70%). However in the three SNPs there were no significant associations between the 13 combined genotypes of the CFL2 gene and growth traits. LD analysis showed that the SNP T 1694 A and C 2213 G loci had a strong linkage (r(2)>0.33). Association analysis indicated that SNP G 1500 A, T 1694 A and C 2213 G were significantly associated with growth traits in the QC population. The results of our study suggest that the CFL2 gene may be a strong candidate gene that affects growth traits in the QC cattle breeding program. Copyright © 2015 Elsevier B.V. All rights reserved.

  12. Honey bee-inspired algorithms for SNP haplotype reconstruction problem

    NASA Astrophysics Data System (ADS)

    PourkamaliAnaraki, Maryam; Sadeghi, Mehdi

    2016-03-01

    Reconstructing haplotypes from SNP fragments is an important problem in computational biology. There have been a lot of interests in this field because haplotypes have been shown to contain promising data for disease association research. It is proved that haplotype reconstruction in Minimum Error Correction model is an NP-hard problem. Therefore, several methods such as clustering techniques, evolutionary algorithms, neural networks and swarm intelligence approaches have been proposed in order to solve this problem in appropriate time. In this paper, we have focused on various evolutionary clustering techniques and try to find an efficient technique for solving haplotype reconstruction problem. It can be referred from our experiments that the clustering methods relying on the behaviour of honey bee colony in nature, specifically bees algorithm and artificial bee colony methods, are expected to result in more efficient solutions. An application program of the methods is available at the following link. http://www.bioinf.cs.ipm.ir/software/haprs/

  13. A novel haplotype of spinocerebellar ataxia type 6 contributes to the highest prevalence in western Japan.

    PubMed

    Terasawa, Hideo; Oda, Masaya; Morino, Hiroyuki; Miyachi, Takafumi; Izumi, Yuishin; Maruyama, Hirofumi; Matsumoto, Masayasu; Kawakami, Hideshi

    2004-03-25

    The highest prevalence rate of spinocerebellar ataxia type 6 (SCA6) in the worldwide population is in the Chugoku and Kansai areas of Western Japan, but the reason of this geographic characteristics is unclear. We investigated the predisposing haplotypes and their geographic distribution. Genotyping of five microsatellite markers and three single nucleotide polymorphisms linked to the CACNA1A gene in 150 Japanese SCA6 patients from unrelated 118 families revealed three major haplotypes, carrying a pool of one common haplotype core. A founder chromosome was thought to have historically diverged into at least three types. One of the major haplotypes newly identified showed a strong geographical cluster around the Seto Inland Sea in the Chugoku and Kansai areas of Western Japan, whereas the others were widely distributed throughout Japan. The distribution of predisposing haplotypes contributes to the geographical differences in prevalence of SCA6.

  14. Haplotypes and Sequence Variation in the Ovine Adiponectin Gene (ADIPOQ)

    PubMed Central

    An, Qing-Ming; Zhou, Hui-Tong; Hu, Jiang; Luo, Yu-Zhu; Hickford, Jon G. H.

    2015-01-01

    The adiponectin gene (ADIPOQ) plays an important role in energy homeostasis. In this study five separate regions (regions 1 to 5) of ovine ADIPOQ were analysed using PCR-SSCP. Four different PCR-SSCP patterns (A1-D1, A2-D2) were detected in region-1 and region-2, respectively, with seven and six SNPs being revealed. In region-3, three different patterns (A3-C3) and three SNPs were observed. Two patterns (A4-B4, A5-B5) and two and one SNPs were observed in region-4 and region-5, respectively. In total, nineteen SNPs were detected, with five of them in the coding region and two (c.46T/C and c.515G/A) putatively resulting in amino acid changes (p.Tyr16His and p.Lys172Arg). In region-1, -2 and -3 of 316 sheep from eight New Zealand breeds, variants A1, A2 and A3 were the most common, although variant frequencies differed in the eight breeds. Across region-1 and region-3, nine haplotypes were identified and haplotypes A1-A3, A1-C3, B1-A3 and B1-C3 were most common. These results indicate that the ADIPOQ gene is polymorphic and suggest that further analysis is required to see if the variation in the gene is associated with animal production traits. PMID:26610572

  15. HLA Type Inference via Haplotypes Identical by Descent

    NASA Astrophysics Data System (ADS)

    Setty, Manu N.; Gusev, Alexander; Pe'Er, Itsik

    The Human Leukocyte Antigen (HLA) genes play a major role in adaptive immune response and are used to differentiate self antigens from non self ones. HLA genes are hyper variable with nearly every locus harboring over a dozen alleles. This variation plays an important role in susceptibility to multiple autoimmune diseases and needs to be matched on for organ transplantation. Unfortunately, HLA typing by serological methods is time consuming and expensive compared to high throughput Single Nucleotide Polymorphism (SNP) data. We present a new computational method to infer per-locus HLA types using shared segments Identical By Descent (IBD), inferred from SNP genotype data. IBD information is modeled as graph where shared haplotypes are explored among clusters of individuals with known and unknown HLA types to identify the latter. We analyze performance of the method in a previously typed subset of the HapMap population, achieving accuracy of 96% in HLA-A, 94% in HLA-B, 95% in HLA-C, 77% in HLA-DR1, 93% in HLA-DQA1 and 90% in HLA-DQB1 genes. We compare our method to a tag SNP based approach and demonstrate higher sensitivity and specificity. Our method demonstrates the power of using shared haplotype segments for large-scale imputation at the HLA locus.

  16. Haplotype analysis of the apolipoprotein A5 gene in obese pediatric patients.

    PubMed

    Horvatovich, Katalin; Bokor, Szilvia; Baráth, Akos; Maász, Anita; Kisfali, Péter; Járomi, Luca; Polgár, Noémi; Tóth, Dénes; Répásy, Judit; Endreffy, Emoke; Molnár, Dénes; Melegh, Béla

    2011-06-01

    Apolipoprotein A5 (APOA5) gene variants have been shown to be associated with elevated TG levels; the T-1131C (rs662799) variant has been reported to confer risk for the metabolic syndrome in adult populations. Little is known about the APOA5 variants in pediatric population, no such information is available for pediatric obesity at all. Here we examined four haplotype-tagging polymorphisms (T-1131C, IVS3 + G476A [rs2072560], T1259C [rs2266788] and C56G [rs3135506]) and studied also the frequency of major naturally occurring haplotypes of APOA5 in obese children. The polymorphisms were analyzed in 232 obese children, and in 137 healthy, normal weight controls, using PCR-RFLP methods. In the pediatric patients we could confirm the already known adult subjects based association of -1131C, IVS3 + 476A and 1259C variants with elevated triglyceride concentrations, both in obese patients and in the controls. The prevalence of the APOA5*2 haplotype (containing the minor allele of T-1131C, IVS3 + G476A and T1259C SNPs together) was 15.5% in obese children, and 5.80% in the controls (p<0.001); multiple logistic regression analysis revealed that this haplotype confers susceptibility for development of obesity (OR=2.87; 95% CI: 1.29-6.37; p≤0.01). By contrast, the APOA5*4 haplotype (with -1131C alone) did not show similar associations. Our findings also suggest that the APOA5*5 haplotype (1259C alone) can be protective against obesity (OR=0.25; 95% CI: 0.07-0.80; p<0.05). While previous studies in adults demonstrated, that the APOA5 -1131C minor allele confers risk for adult metabolic syndrome, here we show, that the susceptibility nature of this SNP restricted to the APOA5*2 haplotype in pediatric obese subjects.

  17. Relationship of polymorphisms and haplotype in interleukin-16 and adiponectin gene with late-onset Alzheimer’s disease risk

    PubMed Central

    Yin, Honglei; Zhang, Yuzhen; Hua, Linlin; Li, Jinfeng; Zeng, Zhilei; Yang, Xiaopeng; Gong, Bin; Geng, Shuang; Liu, Yajun; Zhang, Hui; Liu, Yanqiu; Zhao, Jing; Wang, Yunliang

    2017-01-01

    Aims To investigate the impact of Interleukin-16 (IL- 16) and Adiponectin (ANP) gene single nucleotide polymorphisms (SNPs), gene- gene interactions and haplotype on late-onset Alzheimer’s disease (LOAD) risk. Methods Hardy-Weinberg equilibrium (HWE), haplotype and pairwise linkage disequilibrium (LD) analysis were investigated by using SNPstats (available online at http://bioinfo.iconcologia.net/SNPstats). Generalized multifactor dimensionality reduction (GMDR) was used to examine interaction among 4 SNPs, odds ratio (OR) and 95% confident interval (95%CI) were calculated by logistic regression model. Results LOAD risk was significantly higher in carriers of rs266729- G allele than those with CC genotype (CG+ GG versus CC), OR (95%CI) =1.61 (1.26-1.96), and higher in carriers of rs1501299- T allele, OR (95%CI) = 1.62 (1.32-2.12), lower in carriers of rs4072111- T allele, adjusted OR (95%CI) =0.65 (0.44-0.93). We also found a significant gene- gene interaction between rs266729 and rs4072111. Participants with CG or GG of rs266729 and CC of rs4072111 genotype have the highest LOAD risk, OR (95%CI) = 2.62 (1.64 -3.58). Haplotype containing the rs266729- G and rs1501299- T alleles were associated with increased LOAD risk, OR (95%CI)= 1.83 (1.32- 2.43), and haplotype containing the rs1131445- C and rs4072111- T alleles were associated with decreased LOAD risk, OR (95%CI)= 0.53 (0.18- 0.95). Conclusions We concluded that rs266729 and rs1501299 minor alleles were associated with increased LOAD risk, but rs4072111 minor allele was associated with decreased LOAD risk. We also found that interaction involving rs266729 and rs4072111, and haplotype combinations were associated with LOAD risk. PMID:29108295

  18. The effect of using genealogy-based haplotypes for genomic prediction.

    PubMed

    Edriss, Vahid; Fernando, Rohan L; Su, Guosheng; Lund, Mogens S; Guldbrandtsen, Bernt

    2013-03-06

    Genomic prediction uses two sources of information: linkage disequilibrium between markers and quantitative trait loci, and additive genetic relationships between individuals. One way to increase the accuracy of genomic prediction is to capture more linkage disequilibrium by regression on haplotypes instead of regression on individual markers. The aim of this study was to investigate the accuracy of genomic prediction using haplotypes based on local genealogy information. A total of 4429 Danish Holstein bulls were genotyped with the 50K SNP chip. Haplotypes were constructed using local genealogical trees. Effects of haplotype covariates were estimated with two types of prediction models: (1) assuming that effects had the same distribution for all haplotype covariates, i.e. the GBLUP method and (2) assuming that a large proportion (π) of the haplotype covariates had zero effect, i.e. a Bayesian mixture method. About 7.5 times more covariate effects were estimated when fitting haplotypes based on local genealogical trees compared to fitting individuals markers. Genealogy-based haplotype clustering slightly increased the accuracy of genomic prediction and, in some cases, decreased the bias of prediction. With the Bayesian method, accuracy of prediction was less sensitive to parameter π when fitting haplotypes compared to fitting markers. Use of haplotypes based on genealogy can slightly increase the accuracy of genomic prediction. Improved methods to cluster the haplotypes constructed from local genealogy could lead to additional gains in accuracy.

  19. Novel Harmful Recessive Haplotypes Identified for Fertility Traits in Nordic Holstein Cattle

    PubMed Central

    Sahana, Goutam; Nielsen, Ulrik Sander; Aamand, Gert Pedersen; Lund, Mogens Sandø; Guldbrandtsen, Bernt

    2013-01-01

    Using genomic data, lethal recessives may be discovered from haplotypes that are common in the population but never occur in the homozygote state in live animals. This approach only requires genotype data from phenotypically normal (i.e. live) individuals and not from the affected embryos that die. A total of 7,937 Nordic Holstein animals were genotyped with BovineSNP50 BeadChip and haplotypes including 25 consecutive markers were constructed and tested for absence of homozygotes states. We have identified 17 homozygote deficient haplotypes which could be loosely clustered into eight genomic regions harboring possible recessive lethal alleles. Effects of the identified haplotypes were estimated on two fertility traits: non-return rates and calving interval. Out of the eight identified genomic regions, six regions were confirmed as having an effect on fertility. The information can be used to avoid carrier-by-carrier mattings in practical animal breeding. Further, identification of causative genes/polymorphisms responsible for lethal effects will lead to accurate testing of the individuals carrying a lethal allele. PMID:24376603

  20. Effects of Polymorphisms in APOA4-APOA5-ZNF259-BUD13 Gene Cluster on Plasma Levels of Triglycerides and Risk of Coronary Heart Disease in a Chinese Han Population

    PubMed Central

    Su, Li; Zhang, Mingjun; Wang, Long; Jing, Jinjin; Zhou, Li

    2015-01-01

    Background/Aim Recent genome-wide association studies have identified several loci influencing lipid levels. The present study focused on the triglycerides (TG)-associated locus, the APOA4-APOA5-ZNF259-BUD13 gene cluster on chromosome 11, to explore the role of genetic variants in this gene cluster in the development of increasing TG levels and coronary heart disease (CHD). Methodology/Principal Findings Six single nucleotide polymorphisms (SNPs), rs4417316, rs651821, rs6589566, rs7396835, rs964184 and rs17119975, in the APOA4-APOA5-ZNF259-BUD13 gene cluster were selected and genotyped in 5374 healthy Chinese subjects. There were strong significant associations between the six SNPs and TG levels (P<1.0×10−8). Moreover, a weighted genotype score was found to be associated with TG levels (P = 3.28×10−13). The frequencies of three common haplotypes were observed to be significantly different between the high TG group and the low TG group (P<0.05). However, no significant effects were found for the SNPs regarding susceptibility to CHD in the Chinese case-control populations. Conclusions/Significance This study highlights the genotypes, genotype scores and haplotypes of the APOA4-APOA5-ZNF259-BUD13 gene cluster that were associated with TG levels in a Chinese population; however, the genetic variants in this gene cluster did not increase the risk of CHD in the Chinese population. PMID:26397108

  1. Polymorphisms in IL-1 gene cluster and its association with the risk of perinatal HIV transmission, in an Indian cohort.

    PubMed

    Ahir, Swati; Chaudhari, Deepali; Chavan, Vijay; Samant-Mavani, Padmaja; Nanavati, Ruchi; Mehta, Preeti; Mania-Pramanik, Jayanti

    2013-06-01

    Host genetic diversity plays a very important role in protecting infants exposed to HIV-1 through their mothers. IL-1 family genes are key mediators of inflammatory responses and no studies are available on its association with perinatal HIV transmission. We aimed to evaluate if single nucleotide polymorphisms in IL-1 family genes are associated with perinatal HIV transmission. Infants of HIV positive women were genotyped for five polymorphic loci in IL1 gene cluster namely; IL1R1 (rs2234650), IL1A (rs1800587), IL1B (rs16944), IL1B (rs1143634), and IL1RN (rs315952) using polymerase chain reaction with sequence specific primers (PCR-SSP) method. Haplotype block structure was determined using Haploview and statistical analysis was done using PyPop. In this cohort based observational study significantly increased frequency of CT genotype in IL1R1 (rs2234650) was observed in positive vs. negative children (76.4% vs. 42.2%, p = 0.023), while CC genotype was significantly (p = 0.022) high in exposed uninfected children compared to infected ones (51.1% vs. 17.6%). These significances, however, did not stand the Bonferroni corrections. Haplotypic analysis demonstrated that the TCCCT haplotype was significantly associated (p = 0.002) with HIV transmission and remained significant even after Bonferroni correction. The children who had the protective CC genotype at IL1R1 (rs2234650) and were still positive had the TTC haplotype for IL1A (rs1800587):IL1B (rs1143634):IL1R1 (rs2234650). In contrast, 16 out of 19 (84.2%) children who had the CT genotype and were still negative had the protective CTC haplotype for IL1A (rs1800587):IL1B (rs16944):IL1B (rs1143634). IL1R1 (rs2234650) polymorphisms CT/CC along the specific haplotypes of the IL-1 gene family can be exploited as possible markers for prediction of perinatal HIV transmission. Copyright © 2013 Elsevier B.V. All rights reserved.

  2. Biological impact of α genes, β haplotypes, and G6PD activity in sickle cell anemia at baseline and with hydroxyurea

    PubMed Central

    Arnaud, Cécile; Kamdem, Annie; Hau, Isabelle; Lelong, Françoise; Epaud, Ralph; Pondarré, Corinne; Pissard, Serge

    2018-01-01

    Sickle cell anemia (SCA), albeit monogenic, has heterogeneous phenotypic expression, mainly related to the level of hemoglobin F (HbF). No large cohort studies have ever compared biological parameters in patients with major β-globin haplotypes; ie, Senegal (SEN), Benin (BEN), and Bantu/Central African Republic (CAR). The aim of this study was to evaluate the biological impact of α genes, β haplotypes, and glucose-6-phosphate dehydrogenase (G6PD) activity at baseline and with hydroxyurea (HU). Homozygous HbS patients from the Créteil pediatric cohort with available α-gene and β-haplotype data were included (n = 580; 301 females and 279 males) in this retrospective study. Homozygous β-haplotype patients represented 74% of cases (37.4% CAR/CAR, 24.3% BEN/BEN, and 12.1% SEN/SEN). HU was given to 168 cohort SCA children. Hematological parameters were recorded when HbF was maximal, and changes (ΔHU-T0) were calculated. At baseline, CAR-haplotype and α-gene numbers were independently and negatively correlated with Hb and positively correlated with lactate dehydrogenase. HbF was negatively correlated with CAR-haplotype numbers and positively with BEN- and SEN-haplotype numbers. The BCL11A/rs1427407 “T” allele, which is favorable for HbF expression, was positively correlated with BEN- and negatively correlated with CAR-haplotype numbers. With HU treatment, Δ and HbF values were positively correlated with the BEN-haplotype number. BEN/BEN patients had higher HbF and Hb levels than CAR/CAR and SEN/SEN patients. In conclusion, we show that BEN/BEN patients have the best response on HU and suggest that this could be related to the higher prevalence of the favorable BCL11A/rs1427407/T/allele for HbF expression in these patients. PMID:29555644

  3. The effect of using genealogy-based haplotypes for genomic prediction

    PubMed Central

    2013-01-01

    Background Genomic prediction uses two sources of information: linkage disequilibrium between markers and quantitative trait loci, and additive genetic relationships between individuals. One way to increase the accuracy of genomic prediction is to capture more linkage disequilibrium by regression on haplotypes instead of regression on individual markers. The aim of this study was to investigate the accuracy of genomic prediction using haplotypes based on local genealogy information. Methods A total of 4429 Danish Holstein bulls were genotyped with the 50K SNP chip. Haplotypes were constructed using local genealogical trees. Effects of haplotype covariates were estimated with two types of prediction models: (1) assuming that effects had the same distribution for all haplotype covariates, i.e. the GBLUP method and (2) assuming that a large proportion (π) of the haplotype covariates had zero effect, i.e. a Bayesian mixture method. Results About 7.5 times more covariate effects were estimated when fitting haplotypes based on local genealogical trees compared to fitting individuals markers. Genealogy-based haplotype clustering slightly increased the accuracy of genomic prediction and, in some cases, decreased the bias of prediction. With the Bayesian method, accuracy of prediction was less sensitive to parameter π when fitting haplotypes compared to fitting markers. Conclusions Use of haplotypes based on genealogy can slightly increase the accuracy of genomic prediction. Improved methods to cluster the haplotypes constructed from local genealogy could lead to additional gains in accuracy. PMID:23496971

  4. Asian population frequencies and haplotype distribution of killer cell immunoglobulin-like receptor (KIR) genes among Chinese, Malay, and Indian in Singapore.

    PubMed

    Lee, Yi Chuan; Chan, Soh Ha; Ren, Ee Chee

    2008-11-01

    Killer cell immunoglobulin-like receptors (KIR) gene frequencies have been shown to be distinctly different between populations and contribute to functional variation in the immune response. We have investigated KIR gene frequencies in 370 individuals representing three Asian populations in Singapore and report here the distribution of 14 KIR genes (2DL1, 2DL2, 2DL3, 2DL4, 2DL5, 2DS1, 2DS2, 2DS3, 2DS4, 2DS5, 3DL1, 3DL2, 3DL3, 3DS1) with two pseudogenes (2DP1, 3DP1) among Singapore Chinese (n = 210); Singapore Malay (n = 80), and Singapore Indian (n = 80). Four framework genes (KIR3DL3, 3DP1, 2DL4, 3DL2) and a nonframework pseudogene 2DP1 were detected in all samples while KIR2DS2, 2DL2, 2DL5, and 2DS5 had the greatest significant variation across the three populations. Fifteen significant linkage patterns, consistent with associations between genes of A and B haplotypes, were observed. Eighty-four distinct KIR profiles were determined in our populations, 38 of which had not been described in other populations. KIR haplotype studies were performed using nine Singapore Chinese families comprising 34 individuals. All genotypes could be resolved into corresponding pairs of existing haplotypes with eight distinct KIR genotypes and eight different haplotypes. The haplotype A2 with frequency of 63.9% was dominant in Singapore Chinese, comparable to that reported in Korean and Chinese Han. The A haplotypes predominate in Singapore Chinese, with ratio of A to B haplotypes of approximately 3:1. Comparison with KIR frequencies in other populations showed that Singapore Chinese shared similar distributions with Chinese Han, Japanese, and Korean; Singapore Indian was found to be comparable with North Indian Hindus while Singapore Malay resembled the Thai.

  5. Single-nucleotide polymorphisms and haplotypes of non-coding area in the CP gene are correlated with Parkinson's disease.

    PubMed

    Zhao, Na; Xiao, Jianqiu; Zheng, Zhiyong; Fei, Guoqiang; Zhang, Feng; Jin, Lirong; Zhong, Chunjiu

    2015-04-01

    Our previous studies have demonstrated that ceruloplasmin (CP) dysmetabolism is correlated with Parkinson's disease (PD). However, the causes of decreased serum CP levels in PD patients remain to be clarified. This study aimed to explore the potential association between genetic variants of the CP gene and PD. Clinical features, serum CP levels, and the CP gene (both promoter and coding regions) were analyzed in 60 PD patients and 50 controls. A luciferase reporter system was used to investigate the function of promoter single-nucleotide polymorphisms (SNPs). High-density comparative genomic hybridization microarrays were also used to detect large-scale copy-number variations in CP and an additional 47 genes involved in PD and/or copper/iron metabolism. The frequencies of eight SNPs (one intronic SNP and seven promoter SNPs of the CP gene) and their haplotypes were significantly different between PD patients, especially those with lowered serum CP levels, and controls. However, the luciferase reporter system revealed no significant effect of the risk haplotype on promoter activity of the CP gene. Neither these SNPs nor their haplotypes were correlated with the Hoehn and Yahr staging of PD. The results of this study suggest that common genetic variants of CP are associated with PD and further investigation is needed to explore their functions in PD.

  6. Association of paraoxonase gene cluster polymorphisms with ALS in France, Quebec, and Sweden.

    PubMed

    Valdmanis, P N; Kabashi, E; Dyck, A; Hince, P; Lee, J; Dion, P; D'Amour, M; Souchon, F; Bouchard, J-P; Salachas, F; Meininger, V; Andersen, P M; Camu, W; Dupré, N; Rouleau, G A

    2008-08-12

    The paraoxonase gene cluster on chromosome 7 comprising the PON1-3 genes is an attractive candidate for association in amyotrophic lateral sclerosis (ALS) given the role of paraoxonase genes during the response to oxidative stress and their contribution to the enzymatic break down of nerve toxins. Oxidative stress is considered one of the mechanisms involved in ALS pathogenesis. Evidence for this includes the fact that mutations of SOD1, which normally reduce the production of toxic superoxide anion, account for 12% to 23% of familial cases in ALS. In addition, PON variants were shown to be associated with susceptibility to ALS in several North American and European populations. We extended this analysis to examine 20 single nucleotide polymorphisms (SNPs) across the PON gene cluster in a set of patients from France (480 cases, 475 controls), Quebec (159 cases, 95 controls), and Sweden (558 cases, 506 controls). Although individual SNPs were not considered associated on their own, a haplotype of SNPs at the C-terminal portion of PON2 that includes the PON2 C311S amino acid change was significant in the French (p value 0.0075) and Quebec (p value 0.026) populations as well as all three populations combined (p value 1.69 x 10(-6)). Stratification of the samples showed that this variation was pertinent to ALS susceptibility as a whole, and not to a particular subset of patients. These findings contribute to the increasing weight of evidence that genetic variants in the paraoxonase gene cluster are associated with amyotrophic lateral sclerosis.

  7. Annotated definition of BCL11A and HMIP-2 haplotypes through the analysis of sicilian β-thalassemia patients with high levels of fetal hemoglobin.

    PubMed

    Buccheri, Maria A; Spina, Sonia; Ruberto, Concetta; Lombardo, Turi; Labie, Dominique; Ragusa, And Angela

    2013-01-01

    Fetal hemoglobin (Hb F) is the principal ameliorating factor of β-thalassemia (β-thal) and sickle cell disease. Persistent production in adult life is a quantitative trait regulated by loci inside or outside the β-globin gene cluster. From genome-wide association studies, principal quantitative trait loci (QTL) (accounting for 50.0% of Hb F variability in different populations) have been identified in the BCL11A gene, HBS1L-MYB intergenic polymorphism and the β-globin gene cluster itself. In this study, we analyzed quantitative trait haplotypes in two Sicilian families with extremely mild β-thal and unusually high Hb F expression, in order to examine possible genetic background variations in a similar β-thalassemic phenotype. This study redefines the linkage disequilibrium blocks at these loci, but also shows slight differences between probands in haplotype combinations which could reflect different mechanisms of high Hb F production in patients with β-thal. We proposed a haplotype-based approach as a useful tool for the understanding of β-thal phenotype variation in patients with similar β-thalassemic backgrounds in an attempt to answer the recurring question of why patients with the same β-thalassemic genotype show different phenotypes.

  8. Sex differences in TTC12/ANKK1 haplotype associations with daily tobacco smoking in Black and White Americans.

    PubMed

    David, Sean P; Mezuk, Briana; Zandi, Peter P; Strong, David; Anthony, James C; Niaura, Raymond; Uhl, George R; Eaton, William W

    2010-03-01

    The 11q23.1 genomic region has been associated with nicotine dependence in Black and White Americans. By conducting linkage disequilibrium analyses of 7 informative single nucleotide polymorphisms (SNPs) within the tetratricopeptide repeat domain 12 (TTC12)/ankyrin repeat and kinase containing 1 (ANKK1)/dopamine (D2) receptor gene cluster, we identified haplotype block structures in 270 Black and 368 White (n = 638) participants, from the Baltimore Epidemiologic Catchment Area cohort study, spanning the TTC12 and ANKK1 genes consisting of three SNPs (rs2303380-rs4938015-rs11604671). Informative haplotypes were examined for sex-specific associations with daily tobacco smoking initiation and cessation using longitudinal data from 1993-1994 and 2004-2005 interviews. There was a Haplotype x Sex interaction such that Black men possessing the GTG haplotype who were smokers in 1993-2004 were more likely to have stopped smoking by 2004-2005 (55.6% GTG vs. 22.0% other haplotypes), while Black women were less likely to have quit smoking if they possessed the GTG (20.8%) versus other haplotypes (24.0%; p = .028). In Whites, the GTG haplotype (vs. other haplotypes) was associated with lifetime history of daily smoking (smoking initiation; odds ratio = 1.6; 95% CI = 1.1-2.4; p = .013). Moreover, there was a Haplotype x Sex interaction such that there was higher prevalence of smoking initiation with GTG (77.6%) versus other haplotypes (57.0%; p = .043). In 2 different ethnic American populations, we observed man-woman variation in the influence of the rs2303380-rs4938015-rs11604671 GTG haplotype on smoking initiation and cessation. These results should be replicated in larger cohorts to establish the relationship among the rs2303380-rs4938015-rs11604671 haplotype block, sex, and smoking behavior.

  9. MGMT DNA repair gene promoter/enhancer haplotypes alter transcription factor binding and gene expression.

    PubMed

    Xu, Meixiang; Cross, Courtney E; Speidel, Jordan T; Abdel-Rahman, Sherif Z

    2016-10-01

    The O 6 -methylguanine-DNA methyltransferase (MGMT) protein removes O 6 -alkyl-guanine adducts from DNA. MGMT expression can thus alter the sensitivity of cells and tissues to environmental and chemotherapeutic alkylating agents. Previously, we defined the haplotype structure encompassing single nucleotide polymorphisms (SNPs) in the MGMT promoter/enhancer (P/E) region and found that haplotypes, rather than individual SNPs, alter MGMT promoter activity. The exact mechanism(s) by which these haplotypes exert their effect on MGMT promoter activity is currently unknown, but we noted that many of the SNPs comprising the MGMT P/E haplotypes are located within or in close proximity to putative transcription factor binding sites. Thus, these haplotypes could potentially affect transcription factor binding and, subsequently, alter MGMT promoter activity. In this study, we test the hypothesis that MGMT P/E haplotypes affect MGMT promoter activity by altering transcription factor (TF) binding to the P/E region. We used a promoter binding TF profiling array and a reporter assay to evaluate the effect of different P/E haplotypes on TF binding and MGMT expression, respectively. Our data revealed a significant difference in TF binding profiles between the different haplotypes evaluated. We identified TFs that consistently showed significant haplotype-dependent binding alterations (p ≤ 0.01) and revealed their role in regulating MGMT expression using siRNAs and a dual-luciferase reporter assay system. The data generated support our hypothesis that promoter haplotypes alter the binding of TFs to the MGMT P/E and, subsequently, affect their regulatory function on MGMT promoter activity and expression level.

  10. Diet and Colorectal Cancer: Analysis of a Candidate Pathway Using SNPS, Haplotypes, and Multi-Gene Assessment

    PubMed Central

    Slattery, Martha L.; Lundgreen, Abbie; Herrick, Jennifer S.; Caan, Bette J.; Potter, John D.; Wolff, Roger K.

    2012-01-01

    There is considerable biologic plausibility to the hypothesis that genetic variability in pathways involved in insulin signaling and energy homeostasis may modulate dietary risk associated with colorectal cancer. We utilized data from 2 population-based case-control studies of colon (n = 1,574 cases, 1,970 controls) and rectal (n = 791 cases, 999 controls) cancer to evaluate genetic variation in candidate SNPs identified from 9 genes in a candidate pathway: PDK1, RP6KA1, RPS6KA2, RPS6KB1, RPS6KB2, PTEN, FRAP1 (mTOR), TSC1, TSC2, Akt1, PIK3CA, and PRKAG2 with dietary intake of total energy, carbohydrates, fat, and fiber. We employed SNP, haplotype, and multiple-gene analysis to evaluate associations. PDK1 interacted with dietary fat for both colon and rectal cancer and with dietary carbohydrates for colon cancer. Statistically significant interaction with dietary carbohydrates and rectal cancer was detected by haplotype analysis of PDK1. Evaluation of dietary interactions with multiple genes in this candidate pathway showed several interactions with pairs of genes: Akt1 and PDK1, PDK1 and PTEN, PDK1 and TSC1, and PRKAG2 and PTEN. Analyses show that genetic variation influences risk of colorectal cancer associated with diet and illustrate the importance of evaluating dietary interactions beyond the level of single SNPs or haplotypes when a biologically relevant candidate pathway is examined. PMID:21999454

  11. Interactions Between Serotonin Transporter Gene Haplotypes and Quality of Mothers’ Parenting Predict the Development of Children’s Noncompliance

    PubMed Central

    Sulik, Michael J.; Eisenberg, Nancy; Lemery-Chalfant, Kathryn; Spinrad, Tracy L.; Silva, Kassondra M.; Eggum, Natalie D.; Betkowski, Jennifer A.; Kupfer, Anne; Smith, Cynthia L.; Gaertner, Bridget; Stover, Daryn A.; Verrelli, Brian C.

    2012-01-01

    The LPR and STin2 polymorphisms of the serotonin transporter gene (SLC6A4) were combined into haplotypes that, together with quality of maternal parenting, were used to predict initial levels and linear change in children’s (N = 138) noncompliance and aggression from age 18 –54 months. Quality of mothers’ parenting behavior was observed when children were 18 months old, and nonparental caregivers’ reports of noncompliance and aggression were collected annually from 18 to 54 months of age. Quality of early parenting was negatively related to the slope of noncompliance only for children with the LPR-S/STin2-10 haplotype and to 18-month noncompliance only for children with haplotypes that did not include LPR-S. The findings support the notion that SLC6A4 haplotypes index differential susceptibility to variability in parenting quality, with certain haplotypes showing greater reactivity to both supportive and unsupportive environments. These different genetic backgrounds likely reflect an evolutionary response to variation in the parenting environment. PMID:22059451

  12. A candidate gene study in low HDL-cholesterol families provides evidence for the involvement of the APOA2 gene and the APOA1C3A4 gene cluster.

    PubMed

    Lilja, Heidi E; Soro, Aino; Ylitalo, Kati; Nuotio, Ilpo; Viikari, Jorma S A; Salomaa, Veikko; Vartiainen, Erkki; Taskinen, Marja-Riitta; Peltonen, Leena; Pajukanta, Päivi

    2002-09-01

    In patients with premature coronary heart disease, the most common lipoprotein abnormality is high-density lipoprotein (HDL) deficiency. To assess the genetic background of the low HDL-cholesterol trait, we performed a candidate gene study in 25 families with low HDL, collected from the genetically isolated population of Finland. We studied 21 genes encoding essential proteins involved in the HDL metabolism by genotyping intragenic and flanking markers for these genes. We found suggestive evidence for linkage in two candidate regions: Marker D1S2844, in the apolipoprotein A-II (APOA2) region, yielded a LOD score of 2.14 and marker D11S939 flanking the apolipoprotein A-I/C-III/A-IV gene cluster (APOA1C3A4) produced a LOD score of 1.69. Interestingly, we identified potential shared haplotypes in these two regions in a subset of low HDL families. These families also contributed to the obtained positive LOD scores, whereas the rest of the families produced negative LOD scores. None of the remaining candidate regions provided any evidence for linkage. Since only a limited number of loci were tested in this candidate gene study, these LOD scores suggest significant involvement of the APOA2 gene and the APOA1C3A4 gene cluster, or loci in their immediate vicinity, in the pathogenesis of low HDL.

  13. Haplotypes identified by 10 DNA restriction fragment length polymorphisms at the human low density lipoprotein receptor gene locus.

    PubMed Central

    Kotze, M J; Langenhoven, E; Retief, A E; Seftel, H C; Henderson, H E; Weich, H F

    1989-01-01

    Ten useful two allele restriction fragment length polymorphisms of the low density lipoprotein receptor gene were used for haplotype analysis in 45 unrelated familial hypercholesterolaemic (FH) patients, 60 normal controls, and 32 FH homozygotes, all of whom were white Afrikaners. Pedigree analysis in 27 informative heterozygous FH and 23 normal families has shown the segregation of at least 17 haplotypes in the normal population (111 chromosomes) compared to a predominant association of two of these haplotypes with the disease in the FH subjects. This association was further confirmed in 32 FH homozygotes, indicating at least two 'founder' members for the disease in the Afrikaner population. Recombination events were not detected in any of the families studied and we thus conclude that the haplotypes associated with FH function as specific markers for the disease and will allow presymptomatic diagnosis in affected families. PMID:2565980

  14. [Construction of haplotype and haplotype block based on tag single nucleotide polymorphisms and their applications in association studies].

    PubMed

    Gu, Ming-liang; Chu, Jia-you

    2007-12-01

    Human genome has structures of haplotype and haplotype block which provide valuable information on human evolutionary history and may lead to the development of more efficient strategies to identify genetic variants that increase susceptibility to complex diseases. Haplotype block can be divided into discrete blocks of limited haplotype diversity. In each block, a small fraction of ptag SNPsq can be used to distinguish a large fraction of the haplotypes. These tag SNPs can be potentially useful for construction of haplotype and haplotype block, and association studies in complex diseases. There are two general classes of methods to construct haplotype and haplotype blocks based on genotypes on large pedigrees and statistical algorithms respectively. The author evaluate several construction methods to assess the power of different association tests with a variety of disease models and block-partitioning criteria. The advantages, limitations and applications of each method and the application in the association studies are discussed equitably. With the completion of the HapMap and development of statistical algorithms for addressing haplotype reconstruction, ideas of construction of haplotype based on combination of mathematics, physics, and computer science etc will have profound impacts on population genetics, location and cloning for susceptible genes in complex diseases, and related domain with life science etc.

  15. Mapping a New Spontaneous Preterm Birth Susceptibility Gene, IGF1R, Using Linkage, Haplotype Sharing, and Association Analysis

    PubMed Central

    Luukkonen, Aino; Teramo, Kari; Puttonen, Hilkka; Ojaniemi, Marja; Varilo, Teppo; Chaudhari, Bimal P.; Plunkett, Jevon; Murray, Jeffrey C.; McCarroll, Steven A.; Muglia, Louis J.; Palotie, Aarno; Hallman, Mikko

    2011-01-01

    Preterm birth is the major cause of neonatal death and serious morbidity. Most preterm births are due to spontaneous onset of labor without a known cause or effective prevention. Both maternal and fetal genomes influence the predisposition to spontaneous preterm birth (SPTB), but the susceptibility loci remain to be defined. We utilized a combination of unique population structures, family-based linkage analysis, and subsequent case-control association to identify a susceptibility haplotype for SPTB. Clinically well-characterized SPTB families from northern Finland, a subisolate founded by a relatively small founder population that has subsequently experienced a number of bottlenecks, were selected for the initial discovery sample. Genome-wide linkage analysis using a high-density single-nucleotide polymorphism (SNP) array in seven large northern Finnish non-consanginous families identified a locus on 15q26.3 (HLOD 4.68). This region contains the IGF1R gene, which encodes the type 1 insulin-like growth factor receptor IGF-1R. Haplotype segregation analysis revealed that a 55 kb 12-SNP core segment within the IGF1R gene was shared identical-by-state (IBS) in five families. A follow-up case-control study in an independent sample representing the more general Finnish population showed an association of a 6-SNP IGF1R haplotype with SPTB in the fetuses, providing further evidence for IGF1R as a SPTB predisposition gene (frequency in cases versus controls 0.11 versus 0.05, P = 0.001, odds ratio 2.3). This study demonstrates the identification of a predisposing, low-frequency haplotype in a multifactorial trait using a well-characterized population and a combination of family and case-control designs. Our findings support the identification of the novel susceptibility gene IGF1R for predisposition by the fetal genome to being born preterm. PMID:21304894

  16. Multiconstrained gene clustering based on generalized projections

    PubMed Central

    2010-01-01

    Background Gene clustering for annotating gene functions is one of the fundamental issues in bioinformatics. The best clustering solution is often regularized by multiple constraints such as gene expressions, Gene Ontology (GO) annotations and gene network structures. How to integrate multiple pieces of constraints for an optimal clustering solution still remains an unsolved problem. Results We propose a novel multiconstrained gene clustering (MGC) method within the generalized projection onto convex sets (POCS) framework used widely in image reconstruction. Each constraint is formulated as a corresponding set. The generalized projector iteratively projects the clustering solution onto these sets in order to find a consistent solution included in the intersection set that satisfies all constraints. Compared with previous MGC methods, POCS can integrate multiple constraints from different nature without distorting the original constraints. To evaluate the clustering solution, we also propose a new performance measure referred to as Gene Log Likelihood (GLL) that considers genes having more than one function and hence in more than one cluster. Comparative experimental results show that our POCS-based gene clustering method outperforms current state-of-the-art MGC methods. Conclusions The POCS-based MGC method can successfully combine multiple constraints from different nature for gene clustering. Also, the proposed GLL is an effective performance measure for the soft clustering solutions. PMID:20356386

  17. A prospective study of the APOA1 XmnI and APOC3 SstI polymorphisms in the APOA1/C3/A4 gene cluster and risk of incident myocardial infarction in men.

    PubMed

    Liu, Simin; Song, Yiqing; Hu, Frank B; Niu, Tianhua; Ma, Jing; Gaziano, Michael; Stampfer, Meir J

    2004-11-01

    Apolipoproteins AI/CIII/AIV play important roles in the metabolism of triglycerides (TG) and high-density lipoprotein (HDL) cholesterol. However, whether genetic variations in the APOA1/C3/A4 gene cluster are associated with the risk of myocardial infarction (MI) remains uncertain and prospective data are sparse. In a prospective nested case-control study of 385 incident cases of MI and 373 age- and smoking-matched controls from the Physicians' Health Study, we examined the relationship between 2 common single nucleotide polymorphisms (APOA1 XmnI and APOC3 SstI) in the APOA1/C3/A4 gene cluster and haplotypes defined by these SNPs and risk of incident MI. No significant differences in allele or genotype frequency for the APOA1 XmnI and APOC3 SstI polymorphisms were detected between cases and controls. After adjusting for non-lipid coronary risk factors, the relative risks for incident MI were 1.00 (95% CI 0.68-1.47) for men carrying the X2 allele compared with those homozygous for the X1 allele in the APOA1 XmnI site and 1.07 (95% CI 0.69-1.64) for men carrying the S2 versus those homozygous for the S1 allele in the APOC3 SstI site. Moreover, we did not observe any effect modification by HDL or TG levels for the associations of these APOA1 and APOC3 genotypes with MI risk. There were significant differences in TG levels among men carrying different haplotypes (P=0.01) and men carrying the X1-S2 haplotype had higher levels of TG than those carrying the X2-S1 haplotype (202 mg/dl versus 157 mg/dl, P=0.03); however, haplotype frequencies defined by these two polymorphisms did not differ significantly between cases and controls. In this prospective study of apparently healthy middle-aged US men, carriers of the X1-S2 haplotype in the APOA1 XmnI and APOC3 SstI variants across the APOA1/C3/A4 gene cluster had higher TG levels, but there was no evidence for significant associations between these two common variants or haplotypes defined by them and risk of incident MI in

  18. Sublocalization of an Ataxia-Telangiectasia Gene Distal to D11S384 by Ancestral Haplotyping In Costa Rican Families

    PubMed Central

    Uhrhammer, Nancy; Lange, Ethan; Porras, Oscar; Naeim, Arash; Chen, Xiaoguang; Sheikhavandi, Sepideh; Chiplunkar, Sujata; Yang, Lan; Dandekar, Sugandha; Liang, Teresa; Patel, Nima; Teraoka, Sharon; Udar, Nitin; Calvo, Nidia; Concannon, Patrick; Lange, Kenneth; Gatti, Richard A.

    1995-01-01

    In an effort to localize a gene for ataxia-telangiectasia (A-T), we have genotyped 27 affected Costa Rican families, with 13 markers, in the chromosome 11q22-23 region. Significant linkage disequilibrium was detected for 9/13 markers between D11S1816 and D11S1391. Recombination events observed in these pedigrees places A-T between D11S1819 and D11S1960. One ancestral haplotype is common to 24/54 affected chromosomes and roughly two-thirds of the families. Inferred (ancestral) recombination events involving this common haplotype in earlier generations suggest that A-T is distal to D11S384 and proximal to D11S1960. Several other common haplotypes were identified, consistent with multiple mutations in a single gene. When considered together with all other evidence, this study further sublocalizes the major A-T locus to ≈200 kb, between markers S384 and S535. ImagesFigure 5 PMID:7611278

  19. TUMOR HAPLOTYPE ASSEMBLY ALGORITHMS FOR CANCER GENOMICS

    PubMed Central

    AGUIAR, DEREK; WONG, WENDY S.W.; ISTRAIL, SORIN

    2014-01-01

    The growing availability of inexpensive high-throughput sequence data is enabling researchers to sequence tumor populations within a single individual at high coverage. But, cancer genome sequence evolution and mutational phenomena like driver mutations and gene fusions are difficult to investigate without first reconstructing tumor haplotype sequences. Haplotype assembly of single individual tumor populations is an exceedingly difficult task complicated by tumor haplotype heterogeneity, tumor or normal cell sequence contamination, polyploidy, and complex patterns of variation. While computational and experimental haplotype phasing of diploid genomes has seen much progress in recent years, haplotype assembly in cancer genomes remains uncharted territory. In this work, we describe HapCompass-Tumor a computational modeling and algorithmic framework for haplotype assembly of copy number variable cancer genomes containing haplotypes at different frequencies and complex variation. We extend our polyploid haplotype assembly model and present novel algorithms for (1) complex variations, including copy number changes, as varying numbers of disjoint paths in an associated graph, (2) variable haplotype frequencies and contamination, and (3) computation of tumor haplotypes using simple cycles of the compass graph which constrain the space of haplotype assembly solutions. The model and algorithm are implemented in the software package HapCompass-Tumor which is available for download from http://www.brown.edu/Research/Istrail_Lab/. PMID:24297529

  20. Origin and Diversification Dynamics of Self-Incompatibility Haplotypes

    PubMed Central

    Gervais, Camille E.; Castric, Vincent; Ressayre, Adrienne; Billiard, Sylvain

    2011-01-01

    Self-incompatibility (SI) is a genetic system found in some hermaphrodite plants. Recognition of pollen by pistils expressing cognate specificities at two linked genes leads to rejection of self pollen and pollen from close relatives, i.e., to avoidance of self-fertilization and inbred matings, and thus increased outcrossing. These genes generally have many alleles, yet the conditions allowing the evolution of new alleles remain mysterious. Evolutionary changes are clearly necessary in both genes, since any mutation affecting only one of them would result in a nonfunctional self-compatible haplotype. Here, we study diversification at the S-locus (i.e., a stable increase in the total number of SI haplotypes in the population, through the incorporation of new SI haplotypes), both deterministically (by investigating analytically the fate of mutations in an infinite population) and by simulations of finite populations. We show that the conditions allowing diversification are far less stringent in finite populations with recurrent mutations of the pollen and pistil genes, suggesting that diversification is possible in a panmictic population. We find that new SI haplotypes emerge fastest in populations with few SI haplotypes, and we discuss some implications for empirical data on S-alleles. However, allele numbers in our simulations never reach values as high as observed in plants whose SI systems have been studied, and we suggest extensions of our models that may reconcile the theory and data. PMID:21515570

  1. Finding approximate gene clusters with Gecko 3.

    PubMed

    Winter, Sascha; Jahn, Katharina; Wehner, Stefanie; Kuchenbecker, Leon; Marz, Manja; Stoye, Jens; Böcker, Sebastian

    2016-11-16

    Gene-order-based comparison of multiple genomes provides signals for functional analysis of genes and the evolutionary process of genome organization. Gene clusters are regions of co-localized genes on genomes of different species. The rapid increase in sequenced genomes necessitates bioinformatics tools for finding gene clusters in hundreds of genomes. Existing tools are often restricted to few (in many cases, only two) genomes, and often make restrictive assumptions such as short perfect conservation, conserved gene order or monophyletic gene clusters. We present Gecko 3, an open-source software for finding gene clusters in hundreds of bacterial genomes, that comes with an easy-to-use graphical user interface. The underlying gene cluster model is intuitive, can cope with low degrees of conservation as well as misannotations and is complemented by a sound statistical evaluation. To evaluate the biological benefit of Gecko 3 and to exemplify our method, we search for gene clusters in a dataset of 678 bacterial genomes using Synechocystis sp. PCC 6803 as a reference. We confirm detected gene clusters reviewing the literature and comparing them to a database of operons; we detect two novel clusters, which were confirmed by publicly available experimental RNA-Seq data. The computational analysis is carried out on a laptop computer in <40 min. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  2. A Metabolic Gene Cluster in the Wheat W1 and the Barley Cer-cqu Loci Determines β-Diketone Biosynthesis and Glaucousness

    PubMed Central

    Lee, Wing-Sham; Malitsky, Sergey; Almekias-Siegl, Efrat; Levy, Matan; Ben-Zvi, Gil; Alkan, Noam; Uauy, Cristobal; Jetter, Reinhard

    2016-01-01

    The glaucous appearance of wheat (Triticum aestivum) and barley (Hordeum vulgare) plants, that is the light bluish-gray look of flag leaf, stem, and spike surfaces, results from deposition of cuticular β-diketone wax on their surfaces; this phenotype is associated with high yield, especially under drought conditions. Despite extensive genetic and biochemical characterization, the molecular genetic basis underlying the biosynthesis of β-diketones remains unclear. Here, we discovered that the wheat W1 locus contains a metabolic gene cluster mediating β-diketone biosynthesis. The cluster comprises genes encoding proteins of several families including type-III polyketide synthases, hydrolases, and cytochrome P450s related to known fatty acid hydroxylases. The cluster region was identified in both genetic and physical maps of glaucous and glossy tetraploid wheat, demonstrating entirely different haplotypes in these accessions. Complementary evidence obtained through gene silencing in planta and heterologous expression in bacteria supports a model for a β-diketone biosynthesis pathway involving members of these three protein families. Mutations in homologous genes were identified in the barley eceriferum mutants defective in β-diketone biosynthesis, demonstrating a gene cluster also in the β-diketone biosynthesis Cer-cqu locus in barley. Hence, our findings open new opportunities to breed major cereal crops for surface features that impact yield and stress response. PMID:27225753

  3. A Metabolic Gene Cluster in the Wheat W1 and the Barley Cer-cqu Loci Determines β-Diketone Biosynthesis and Glaucousness.

    PubMed

    Hen-Avivi, Shelly; Savin, Orna; Racovita, Radu C; Lee, Wing-Sham; Adamski, Nikolai M; Malitsky, Sergey; Almekias-Siegl, Efrat; Levy, Matan; Vautrin, Sonia; Bergès, Hélène; Friedlander, Gilgi; Kartvelishvily, Elena; Ben-Zvi, Gil; Alkan, Noam; Uauy, Cristobal; Kanyuka, Kostya; Jetter, Reinhard; Distelfeld, Assaf; Aharoni, Asaph

    2016-06-01

    The glaucous appearance of wheat (Triticum aestivum) and barley (Hordeum vulgare) plants, that is the light bluish-gray look of flag leaf, stem, and spike surfaces, results from deposition of cuticular β-diketone wax on their surfaces; this phenotype is associated with high yield, especially under drought conditions. Despite extensive genetic and biochemical characterization, the molecular genetic basis underlying the biosynthesis of β-diketones remains unclear. Here, we discovered that the wheat W1 locus contains a metabolic gene cluster mediating β-diketone biosynthesis. The cluster comprises genes encoding proteins of several families including type-III polyketide synthases, hydrolases, and cytochrome P450s related to known fatty acid hydroxylases. The cluster region was identified in both genetic and physical maps of glaucous and glossy tetraploid wheat, demonstrating entirely different haplotypes in these accessions. Complementary evidence obtained through gene silencing in planta and heterologous expression in bacteria supports a model for a β-diketone biosynthesis pathway involving members of these three protein families. Mutations in homologous genes were identified in the barley eceriferum mutants defective in β-diketone biosynthesis, demonstrating a gene cluster also in the β-diketone biosynthesis Cer-cqu locus in barley. Hence, our findings open new opportunities to breed major cereal crops for surface features that impact yield and stress response. © 2016 American Society of Plant Biologists. All rights reserved.

  4. HERC1 polymorphisms: population-specific variations in haplotype composition.

    PubMed

    Yuasa, Isao; Umetsu, Kazuo; Nishimukai, Hiroaki; Fukumori, Yasuo; Harihara, Shinji; Saitou, Naruya; Jin, Feng; Chattopadhyay, Prasanta K; Henke, Lotte; Henke, Jürgen

    2009-08-01

    Human HERC1 is one of six HERC proteins and may play an important role in intracellular membrane trafficking. The human HERC1 gene is suggested to have been affected by local positive selection. To assess the global frequency distributions of coding and non-coding single nucleotide polymorphisms (SNPs) in the HERC1 gene, we developed a new simultaneous genotyping method for four SNPs, and applied this method to investigate 1213 individuals from 12 global populations. The results confirmed remarked differences in the allele and haplotype frequencies between East Asian and non-East Asian populations. One of the three common haplotypes observed was found to be characteristic of East Asians, who showed a relatively uniform distribution of haplotypes. Information on haplotypes would be useful for testing the function of polymorphisms in the HERC1 gene. This is the first study to investigate the distribution of HERC1 polymorphisms in various populations. (c) 2009 John Wiley & Sons, Ltd.

  5. De novo intrachromosomal gene conversion from OPN1MW to OPN1LW in the male germline results in Blue Cone Monochromacy

    PubMed Central

    Buena-Atienza, Elena; Rüther, Klaus; Baumann, Britta; Bergholz, Richard; Birch, David; De Baere, Elfride; Dollfus, Helene; Greally, Marie T.; Gustavsson, Peter; Hamel, Christian P.; Heckenlively, John R.; Leroy, Bart P.; Plomp, Astrid S.; Pott, Jan Willem R.; Rose, Katherine; Rosenberg, Thomas; Stark, Zornitza; Verheij, Joke B. G. M.; Weleber, Richard; Zobor, Ditta; Weisschuh, Nicole; Kohl, Susanne; Wissinger, Bernd

    2016-01-01

    X-linked cone dysfunction disorders such as Blue Cone Monochromacy and X-linked Cone Dystrophy are characterized by complete loss (of) or reduced L- and M- cone function due to defects in the OPN1LW/OPN1MW gene cluster. Here we investigated 24 affected males from 16 families with either a structurally intact gene cluster or at least one intact single (hybrid) gene but harbouring rare combinations of common SNPs in exon 3 in single or multiple OPN1LW and OPN1MW gene copies. We assessed twelve different OPN1LW/MW exon 3 haplotypes by semi-quantitative minigene splicing assay. Nine haplotypes resulted in aberrant splicing of ≥20% of transcripts including the known pathogenic haplotypes (i.e. ‘LIAVA’, ‘LVAVA’) with absent or minute amounts of correctly spliced transcripts, respectively. De novo formation of the ‘LIAVA’ haplotype derived from an ancestral less deleterious ‘LIAVS’ haplotype was observed in one family with strikingly different phenotypes among affected family members. We could establish intrachromosomal gene conversion in the male germline as underlying mechanism. Gene conversion in the OPN1LW/OPN1MW genes has been postulated, however, we are first to demonstrate a de novo gene conversion within the lineage of a pedigree. PMID:27339364

  6. β-globin haplotypes in normal and hemoglobinopathic individuals from Reconcavo Baiano, State of Bahia, Brazil.

    PubMed

    Dos Santos Silva, Wellington; de Nazaré Klautau-Guimarães, Maria; Grisolia, Cesar Koppe

    2010-07-01

    Five restriction site polymorphisms in the β-globin gene cluster (HincII-5' ε, HindIII-(G) γ, HindIII-(A) γ, HincII- ψβ1 and HincII-3' ψβ1) were analyzed in three populations (n = 114) from Reconcavo Baiano, State of Bahia, Brazil. The groups included two urban populations from the towns of Cachoeira and Maragojipe and one rural Afro-descendant population, known as the "quilombo community", from Cachoeira municipality. The number of haplotypes found in the populations ranged from 10 to 13, which indicated higher diversity than in the parental populations. The haplotypes 2 (+ - - - -), 3 (- - - - +), 4 (- + - - +) and 6 (- + + - +) on the β(A) chromosomes were the most common, and two haplotypes, 9 (- + + + +) and 14 (+ + - - +), were found exclusively in the Maragojipe population. The other haplotypes (1, 5, 9, 11, 12, 13, 14 and 16) had lower frequencies. Restriction site analysis and the derived haplotypes indicated homogeneity among the populations. Thirty-two individuals with hemoglobinopathies (17 sickle cell disease, 12 HbSC disease and 3 HbCC disease) were also analyzed. The haplotype frequencies of these patients differed significantly from those of the general population. In the sickle cell disease subgroup, the predominant haplotypes were BEN (Benin) and CAR (Central African Republic), with frequencies of 52.9% and 32.4%, respectively. The high frequency of the BEN haplotype agreed with the historical origin of the afro-descendant population in the state of Bahia. However, this frequency differed from that of Salvador, the state capital, where the CAR and BEN haplotypes have similar frequencies, probably as a consequence of domestic slave trade and subsequent internal migrations to other regions of Brazil.

  7. β-globin haplotypes in normal and hemoglobinopathic individuals from Reconcavo Baiano, State of Bahia, Brazil

    PubMed Central

    2010-01-01

    Five restriction site polymorphisms in the β-globin gene cluster (HincII-5‘ ε, HindIII-G γ, HindIII-A γ, HincII- ψβ1 and HincII-3‘ ψβ1) were analyzed in three populations (n = 114) from Reconcavo Baiano, State of Bahia, Brazil. The groups included two urban populations from the towns of Cachoeira and Maragojipe and one rural Afro-descendant population, known as the “quilombo community”, from Cachoeira municipality. The number of haplotypes found in the populations ranged from 10 to 13, which indicated higher diversity than in the parental populations. The haplotypes 2 (+ - - - -), 3 (- - - - +), 4 (- + - - +) and 6 (- + + - +) on the βA chromosomes were the most common, and two haplotypes, 9 (- + + + +) and 14 (+ + - - +), were found exclusively in the Maragojipe population. The other haplotypes (1, 5, 9, 11, 12, 13, 14 and 16) had lower frequencies. Restriction site analysis and the derived haplotypes indicated homogeneity among the populations. Thirty-two individuals with hemoglobinopathies (17 sickle cell disease, 12 HbSC disease and 3 HbCC disease) were also analyzed. The haplotype frequencies of these patients differed significantly from those of the general population. In the sickle cell disease subgroup, the predominant haplotypes were BEN (Benin) and CAR (Central African Republic), with frequencies of 52.9% and 32.4%, respectively. The high frequency of the BEN haplotype agreed with the historical origin of the afro-descendant population in the state of Bahia. However, this frequency differed from that of Salvador, the state capital, where the CAR and BEN haplotypes have similar frequencies, probably as a consequence of domestic slave trade and subsequent internal migrations to other regions of Brazil. PMID:21637405

  8. Preliminary report: BGLIIA-BGLIIB haplotype of growth hormone cluster is associated with glucose intolerance in non-insulin-dependent diabetes mellitus and with growth hormone deficit in growth retardation.

    PubMed

    Bottini, E; Lucarelli, P; Amante, A; Saccucci, P; Gloria-Bottini, F

    2002-01-01

    We studied 101 growth-retarded children from the population of Ancona (Italy). Plasma growth hormone (GH) levels at the end of insulin and clonidine tests were considered for classification of children into 3 categories according to severity of GH deficit: total deficit of GH (TD), partial deficit (PD, and familiar short stature (FSS; no deficit of GH). The BGLIIA*2/BGLIIB*1 haplotype of GH cluster that was previously found to be negatively associated with severe glucose intolerance in non-insulin-dependent diabetes mellitus (NIDDM) is negatively associated with GH deficit in growth-retarded children. The hypothesis that intrauterine growth retardation and glucose intolerance in adult life could be phenotypes of the same underlying genotype has been recently put forward. The present observation suggests that genes influencing both growth and glucose tolerance are encoded in the GH cluster. Copyright 2002 by W.B. Saunders Company

  9. Genetic variation of 'Candidatus Liberibacter solanacearum' haplotype C and identification of a novel haplotype from Trioza urticae and stinging nettle.

    PubMed

    Haapalainen, Minna L; Wang, Jinhui; Latvala, Satu; Lehtonen, Mikko T; Pirhonen, Minna; Nissinen, Anne I

    2018-03-30

    'Candidatus Liberibacter solanacearum' (CLso) haplotype C is associated with disease in carrots and transmitted by the carrot psyllid Trioza apicalis. To identify possible other sources and vectors of this pathogen in Finland, samples were taken of wild plants within and near the carrot fields, the psyllids feeding on these plants, parsnips growing next to carrots, and carrot seeds. For analyzing the genotype of the CLso positive samples, a multi-locus sequence typing (MLST) scheme was developed. CLso haplotype C was detected in 11% of the Trioza anthrisci samples, in 35% of the Anthriscus sylvestris plants with discoloration, and in parsnips showing leaf discoloration. MLST revealed that the CLso in T. anthrisci and most A. sylvestris plants represent different strains than the bacteria found in T. apicalis and the cultivated plants. CLso haplotype D was detected in two of the 34 carrot seed lots tested, but was not detected in the plants grown from these seeds. Phylogenetic analysis by UPGMA clustering suggested that the haplotype D is more closely related to the haplotype A than to C. A novel, sixth haplotype of CLso, most closely related to A and D, was found in the psyllid Trioza urticae and stinging nettle (Urtica dioica, Urticaceae), and named as haplotype U.

  10. Single nucleotide polymorphisms and haplotypes associated with feed efficiency in beef cattle

    PubMed Central

    2013-01-01

    Background General, breed- and diet-dependent associations between feed efficiency in beef cattle and single nucleotide polymorphisms (SNPs) or haplotypes were identified on a population of 1321 steers using a 50 K SNP panel. Genomic associations with traditional two-step indicators of feed efficiency – residual feed intake (RFI), residual average daily gain (RADG), and residual intake gain (RIG) – were compared to associations with two complementary one-step indicators of feed efficiency: efficiency of intake (EI) and efficiency of gain (EG). Associations uncovered in a training data set were evaluated on independent validation data set. A multi-SNP model was developed to predict feed efficiency. Functional analysis of genes harboring SNPs significantly associated with feed efficiency and network visualization aided in the interpretation of the results. Results For the five feed efficiency indicators, the numbers of general, breed-dependent, and diet-dependent associations with SNPs (P-value < 0.0001) were 31, 40, and 25, and with haplotypes were six, ten, and nine, respectively. Of these, 20 SNP and six haplotype associations overlapped between RFI and EI, and five SNP and one haplotype associations overlapped between RADG and EG. This result confirms the complementary value of the one and two-step indicators. The multi-SNP models included 89 SNPs and offered a precise prediction of the five feed efficiency indicators. The associations of 17 SNPs and 7 haplotypes with feed efficiency were confirmed on the validation data set. Nine clusters of Gene Ontology and KEGG pathway categories (mean P-value < 0.001) including, 9nucleotide binding; ion transport, phosphorous metabolic process, and the MAPK signaling pathway were overrepresented among the genes harboring the SNPs associated with feed efficiency. Conclusions The general SNP associations suggest that a single panel of genomic variants can be used regardless of breed and diet. The breed- and diet

  11. Plasmodium falciparum sulfadoxine resistance is geographically and genetically clustered within the DR Congo

    PubMed Central

    Taylor, Steve M.; Antonia, Alejandro L.; Parobek, Christian M.; Juliano, Jonathan J.; Janko, Mark; Emch, Michael; Alam, Md Tauqeer; Udhayakumar, Venkatachalam; Tshefu, Antoinette K.; Meshnick, Steven R.

    2013-01-01

    Understanding the spatial clustering of Plasmodium falciparum populations can assist efforts to contain drug-resistant parasites and maintain the efficacy of future drugs. We sequenced single nucleotide polymorphisms (SNPs) in the dihydropteroate synthase gene (dhps) associated with sulfadoxine resistance and 5 microsatellite loci flanking dhps in order to investigate the genetic backgrounds, genetic relatedness, and geographic clustering of falciparum parasites in the Democratic Republic of the Congo (DRC). Resistant haplotypes were clustered into subpopulations: one in the northeast DRC, and the other in the balance of the DRC. Network and clonal lineage analyses of the flanking microsatellites indicate that geographically-distinct mutant dhps haplotypes derive from separate lineages. The DRC is therefore a watershed for haplotypes associated with sulfadoxine resistance. Given the importance of central Africa as a corridor for the spread of antimalarial resistance, the identification of the mechanisms of this transit can inform future policies to contain drug-resistant parasite strains. PMID:23372922

  12. Association of polymorphisms and haplotypes in the cytochrome P450 1B1 gene with uterine leiomyoma: A case control study

    PubMed Central

    SALIMI, SAEEDEH; KHODAMIAN, MARYAM; NAROOIE-NEJAD, MEHRNAZ; HAJIZADEH, AZAM; FAZELI, KIMIA; NAMAZI, LIDA; YAGHMAEI, MINOO

    2015-01-01

    Uterine leiomyoma (UL) is an estrogen-dependent neoplasm of the uterus and estrogen metabolizing enzymes affect its promotion and progression. The aim of the present study was to evaluate the association between four single-nucleotide polymorphisms (SNPs) of the cytochrome P450 1B1 (CYP1B1) gene and UL risk. Four SNPs of the CYP1B1 gene in 105 UL patients and 112 unrelated healthy controls were genotyped using a direct sequencing method. Haplotype analyses were performed with UNPHASED software and linkage disequilibrium (LD) was assessed by Haploview software. There were no associations between Leu432Val (rs1056836), Asp449Asp (rs1056837) and Asn453Ser (rs1800440) polymorphisms of the CYP1B1 gene and UL. Although the genotypic frequencies of the Arg368His (rs79204362) polymorphism did not differ between the two groups, the frequency of A (His) allele was significantly higher in UL females (P=0.02). In addition, the frequency of GTAA haplotype was significantly higher in the controls and played a protective role in UL susceptibility. A strong LD between the three common SNPs (rs1056836, rs1056837 and rs1800440) in the CYP1B1 gene was observed in the population. In conclusion, a higher frequency of the CYP1B1 368His (A) allele was observed in UL females. The frequency of the GTAA haplotype was significantly higher in healthy females and this haplotype played a protective role in UL susceptibility. PMID:26075073

  13. Pre-harvest sprouting resistance and haplotype variation of ThVp-1 gene in the collection of wheat-wheatgrass hybrids

    PubMed Central

    Kocheshkova, A. A.; Kroupin, P. Yu.; Bazhenov, M. S.; Karlov, G. I.; Pochtovyy, A. A.; Upelniek, V. P.; Belov, V. I.

    2017-01-01

    The germplasm collection of 87 wheat-wheatgrass hybrids developed in Tsitisin Main Botanical Garden (Russia, Moscow) was evaluated for resistance to pre-harvest sprouting (PHS) by spike sprouting (SS) and germination index (GI) assays as well as for spike and grain features. The PHS resistance variation and haplotype polymorphism of the wheatgrass ThVp-1 and wheat TaVp-1B genes orthologues of Vp-1 was revealed in the studied collection. Four haplotypes of ThVp-1 were revealed: ThVp-1a (41% of the entries), ThVp-1b (13%), ThVp-1c (29%), and ThVp-1d (15%). The association between the allelic state of ThVp-1 and PHS resistance in the wheat-wheatgrass hybrids was shown: haplotype ThVp-1d of the wheatgrass Vp-1 gene is significantly associated with reduced PHS in the wheat-wheatgrass hybrids (mean SS 0.33, mean GI 0.64). The resistant entries may be perspective as a source of PHS resistance in the development of commercial cultivars of perennial wheat. PMID:29131854

  14. RNA sequencing demonstrates large-scale temporal dysregulation of gene expression in stimulated macrophages derived from MHC-defined chicken haplotypes.

    PubMed

    Irizarry, Kristopher J L; Downs, Eileen; Bryden, Randall; Clark, Jory; Griggs, Lisa; Kopulos, Renee; Boettger, Cynthia M; Carr, Thomas J; Keeler, Calvin L; Collisson, Ellen; Drechsler, Yvonne

    2017-01-01

    Discovering genetic biomarkers associated with disease resistance and enhanced immunity is critical to developing advanced strategies for controlling viral and bacterial infections in different species. Macrophages, important cells of innate immunity, are directly involved in cellular interactions with pathogens, the release of cytokines activating other immune cells and antigen presentation to cells of the adaptive immune response. IFNγ is a potent activator of macrophages and increased production has been associated with disease resistance in several species. This study characterizes the molecular basis for dramatically different nitric oxide production and immune function between the B2 and the B19 haplotype chicken macrophages.A large-scale RNA sequencing approach was employed to sequence the RNA of purified macrophages from each haplotype group (B2 vs. B19) during differentiation and after stimulation. Our results demonstrate that a large number of genes exhibit divergent expression between B2 and B19 haplotype cells both prior and after stimulation. These differences in gene expression appear to be regulated by complex epigenetic mechanisms that need further investigation.

  15. Association of polyunsaturated fatty acids in breast milk with fatty acid desaturase gene polymorphisms among Chinese lactating mothers.

    PubMed

    Ding, Zhen; Liu, Guo-Liang; Li, Xiang; Chen, Xue-Yan; Wu, Yi-Xia; Cui, Can-Can; Zhang, Xi; Yang, Guang; Xie, Lin

    2016-06-01

    The fatty acid desaturase (FADS) controls polyunsaturated fatty acid (PUFA) synthesis in human tissues and breast milk. Evaluate the influence of 10 single nucleotide polymorphisms (SNPs) and various haplotypes in the FADS gene cluster (FADS1, FADS2, FADS3) on PUFA concentration in the breast milk of 209 healthy Chinese women. PUFA concentrations were measured in breast milk using gas chromatography and genotyping was performed using the Sequenom Mass Array system. A SNP (rs1535) and 2-locus haplotypes (rs3834458-rs1535, rs1535-rs174575) in the FADS2 gene were associated with concentrations of γ-linoleic acid (GLA) and arachidonic acid (AA) in breast milk. Likewise, in the FADS1 gene, a 2-locus constructed haplotype (rs174547-rs174553) also affected GLA and AA concentration (P<0.05 for all). Minor allele carriers of the SNP and haplotypes described above had lower concentrations of GLA and AA. In the FADS2 gene, the 3-locus haplotype rs3834458-rs1535-rs174575, significantly affected concentrations of GLA but not AA. Pairwise comparison showed that individuals major homozygous for the SNP rs1000778 in the FADS3 gene had lower concentrations of ALA and linoleic acid (LA) in their breast milk. Polymorphisms in the FADS gene cluster influence PUFA concentrations in the breast milk of Chinese Han lactating women. Copyright © 2016 Elsevier Ltd. All rights reserved.

  16. The putative oncogene Pim-1 in the mouse: its linkage and variation among t haplotypes.

    PubMed

    Nadeau, J H; Phillips, S J

    1987-11-01

    Pim-1, a putative oncogene involved in T-cell lymphomagenesis, was mapped between the pseudo-alpha globin gene Hba-4ps and the alpha-crystallin gene Crya-1 on mouse chromosome 17 and therefore within the t complex. Pim-1 restriction fragment variants were identified among t haplotypes. Analysis of restriction fragment sizes obtained with 12 endonucleases demonstrated that the Pim-1 genes in some t haplotypes were indistinguishable from the sizes for the Pim-1b allele in BALB/c inbred mice. There are now three genes, Pim-1, Crya-1 and H-2 I-E, that vary among independently derived t haplotypes and that have indistinguishable alleles in t haplotypes and inbred strains. These genes are closely linked within the distal inversion of the t complex. Because it is unlikely that these variants arose independently in t haplotypes and their wild-type homologues, we propose that an exchange of chromosomal segments, probably through double crossingover, was responsible for indistinguishable Pim-1 genes shared by certain t haplotypes and their wild-type homologues. There was, however, no apparent association between variant alleles of these three genes among t haplotypes as would be expected if a single exchange introduced these alleles into t haplotypes. If these variant alleles can be shown to be identical to the wild-type allele, then lack of association suggests that multiple exchanges have occurred during the evolution of the t complex.

  17. Conversion events in gene clusters

    PubMed Central

    2011-01-01

    Background Gene clusters containing multiple similar genomic regions in close proximity are of great interest for biomedical studies because of their associations with inherited diseases. However, such regions are difficult to analyze due to their structural complexity and their complicated evolutionary histories, reflecting a variety of large-scale mutational events. In particular, conversion events can mislead inferences about the relationships among these regions, as traced by traditional methods such as construction of phylogenetic trees or multi-species alignments. Results To correct the distorted information generated by such methods, we have developed an automated pipeline called CHAP (Cluster History Analysis Package) for detecting conversion events. We used this pipeline to analyze the conversion events that affected two well-studied gene clusters (α-globin and β-globin) and three gene clusters for which comparative sequence data were generated from seven primate species: CCL (chemokine ligand), IFN (interferon), and CYP2abf (part of cytochrome P450 family 2). CHAP is freely available at http://www.bx.psu.edu/miller_lab. Conclusions These studies reveal the value of characterizing conversion events in the context of studying gene clusters in complex genomes. PMID:21798034

  18. Association of methylenetetrahydrofolate reductase gene-gene interaction and haplotype with susceptibility to acute lymphoblastic leukemia in Chinese children.

    PubMed

    Xia, Xiaojun; Duan, Yun; Cui, Jie; Jiang, Junfeng; Lin, Li; Peng, Xiaojuan; Wang, YuHong; Guo, Bingtao; Liu, Shouhai; Lei, Xudong

    2017-08-01

    The aim of this study was to investigate the association of methylenetetrahydrofolate reductase (MTHFR) gene polymorphism and additional gene-gene interaction with acute lymphoblastic leukemia (ALL) risk. Logistic regression was performed to investigate the association between two single nucleotide polymorphisms (SNPs) within MTHFR gene and ALL risk and additional gene-gene interaction between rs1801133 and rs1801131. The minor allele of rs1801133 and rs1801131 is associated with decreased ALL risk, OR (95% CI) were 0.61 (0.38-0.89), and 0.68 (0.50-0.96), respectively. We also found a significantly interaction between the two SNPs, participants with rs1801133 - CT or TT and rs1801131 - AC or CC genotype have the lowest ALL risk, compared with participants with rs1801133 - CC and rs1801131 - AA genotype, OR (95% CI) was 0.32 (0.12-0.63). We did not find any haplotype between the rs1801133 and rs1801131 associated with ALL risk. rs1801133 and rs1801131 within MTHFR gene and their interaction were both associated with ALL risk in Chinese children.

  19. Two Orangutan Species Have Evolved Different KIR Alleles and Haplotypes1

    PubMed Central

    Guethlein, Lisbeth A.; Norman, Paul J.; Heijmans, Corinne M. C.; de Groot, Natasja G.; Hilton, Hugo G.; Babrzadeh, Farbod; Abi-Rached, Laurent; Bontrop, Ronald E.; Parham, Peter

    2017-01-01

    The immune and reproductive functions of human Natural Killer (NK) cells are regulated by interactions of the C1 and C2 epitopes of HLA-C with C1-specific and C2-specific lineage III killer cell immunoglobulin-like receptors (KIR). This rapidly evolving and diverse system of ligands and receptors is restricted to humans and great apes. In this context, the orangutan has particular relevance because it represents an evolutionary intermediate, one having the C1 epitope and corresponding KIR, but lacking the C2 epitope. Through a combination of direct sequencing, KIR genotyping and data mining from the Great Ape Genome Project (GAGP) we characterized the KIR alleles and haplotypes for panels of ten Bornean orangutans and 19 Sumatran orangutans. The orangutan KIR haplotypes have between five and ten KIR genes. The seven orangutan lineage III KIR genes all locate to the centromeric region of the KIR locus, whereas their human counterparts also populate the telomeric region. One lineage III KIR gene is Bornean-specific, one is Sumatran-specific and five are shared. Of twelve KIR gene-content haplotypes five are Bornean-specific, five are Sumatran-specific and two are shared. The haplotypes have different combinations of genes encoding activating and inhibitory C1 receptors that can be of higher or lower affinity. All haplotypes encode an inhibitory C1 receptor, but only some haplotypes encode an activating C1 receptor. Of 130 KIR alleles, 55 are Bornean-specific, 65 are Sumatran specific and ten are shared. PMID:28264973

  20. TNF-alpha SNP haplotype frequencies in equidae.

    PubMed

    Brown, J J; Ollier, W E R; Thomson, W; Matthews, J B; Carter, S D; Binns, M; Pinchbeck, G; Clegg, P D

    2006-05-01

    Tumour necrosis factor alpha (TNF-alpha) is a pro-inflammatory cytokine that plays a crucial role in the regulation of inflammatory and immune responses. In all vertebrate species the genes encoding TNF-alpha are located within the major histocompatability complex. In the horse TNF-alpha has been ascribed a role in a variety of important disease processes. Previously two single nucleotide polymorphisms (SNPs) have been reported within the 5' un-translated region of the equine TNF-alpha gene. We have examined the equine TNF-alpha promoter region further for additional SNPs by analysing DNA from 131 horses (Equus caballus), 19 donkeys (E. asinus), 2 Grant's zebras (E. burchellii boehmi) and one onager (E. hemionus). Two further SNPs were identified at nucleotide positions 24 (T/G) and 452 (T/C) relative to the first nucleotide of the 522 bp polymerase chain reaction product. A sequence variant at position 51 was observed between equidae. SNaPSHOT genotyping assays for these and the two previously reported SNPs were performed on 457 horses comprising seven different breeds and 23 donkeys to determine the gene frequencies. SNP frequencies varied considerably between different horse breeds and also between the equine species. In total, nine different TNF-alpha promoter SNP haplotypes and their frequencies were established amongst the various equidae examined, with some haplotypes being found only in horses and others only in donkeys or zebras. The haplotype frequencies observed varied greatly between different horse breeds. Such haplotypes may relate to levels of TNF-alpha production and disease susceptibility and further investigation is required to identify associations between particular haplotypes and altered risk of disease.

  1. Epistatic interaction between haplotypes of the ghrelin ligand and receptor genes influence susceptibility to myocardial infarction and coronary artery disease.

    PubMed

    Baessler, Andrea; Fischer, Marcus; Mayer, Bjoern; Koehler, Martina; Wiedmann, Silke; Stark, Klaus; Doering, Angela; Erdmann, Jeanette; Riegger, Guenter; Schunkert, Heribert; Kwitek, Anne E; Hengstenberg, Christian

    2007-04-15

    Data from both experimental models and humans provide evidence that ghrelin and its receptor, the growth hormone secretagogue receptor (ghrelin receptor, GHSR), possess a variety of cardiovascular effects. Thus, we hypothesized that genetic variants within the ghrelin system (ligand ghrelin and its receptor GHSR) are associated with susceptibility to myocardial infarction (MI) and coronary artery disease (CAD). Seven single nucleotide polymorphisms (SNPs) covering the GHSR region as well as eight SNPs across the ghrelin gene (GHRL) region were genotyped in index MI patients (864 Caucasians, 'index MI cases') from the German MI family study and in matched controls without evidence of CAD (864 Caucasians, 'controls', MONICA Augsburg). In addition, siblings of these MI patients with documented severe CAD (826 'affected sibs') were matched likewise with controls (n = 826 Caucasian 'controls') and used for verification. The effect of interactions between genetic variants of both genes of the ghrelin system was explored by conditional classification tree models. We found association of several GHSR SNPs with MI [best SNP odds ratio (OR) 1.7 (1.2-2.5); P = 0.002] using a recessive model. Moreover, we identified a common GHSR haplotype which significantly increases the risk for MI [multivariate adjusted OR for homozygous carriers 1.6 (1.1-2.5) and CAD OR 1.6 (1.1-2.5)]. In contrast, no relationship between genetic variants and the disease could be revealed for GHRL. However, the increase in MI/CAD frequency related to the susceptible GHSR haplotype was abolished when it coincided with a common GHRL haplotype. Multivariate adjustments as well as permutation-based methods conveyed the same results. These data are the first to demonstrate an association of SNPs and haplotypes within important genes of the ghrelin system and the susceptibility to MI, whereas association with MI/CAD could be identified for genetic variants across GHSR, no relationship could be revealed for GHRL

  2. Linkage Study Revealed Complex Haplotypes in a Multifamily due to Different Mutations in CAPN3 Gene in an Iranian Ethnic Group.

    PubMed

    Mojbafan, Marzieh; Tonekaboni, Seyed Hassan; Abiri, Maryam; Kianfar, Soudeh; Sarhadi, Ameneh; Nilipour, Yalda; Tavakkoly-Bazzaz, Javad; Zeinali, Sirous

    2016-07-01

    Calpainopathy is an autosomal recessive form of limb girdle muscular dystrophies which is caused by mutation in CAPN3 gene. In the present study, co-segregation of this disorder was analyzed with four short tandem repeat markers linked to the CAPN3 gene. Three apparently unrelated Iranian families with same ethnicity were investigated. Haplotype analysis and sequencing of the CAPN3 gene were performed. DNA sample from one of the patients was simultaneously sent for next-generation sequencing. DNA sequencing identified two mutations. It was seen as a homozygous c.2105C>T in exon 19 in one family, a homozygous novel mutation c.380G>A in exon 3 in another family, and a compound heterozygote form of these two mutations in the third family. Next-generation sequencing also confirmed our results. It was expected that, due to the rare nature of limb girdle muscular dystrophies, affected individuals from the same ethnic group share similar mutations. Haplotype analysis showed two different homozygote patterns in two families, yet a compound heterozygote pattern in the third family as seen in the mutation analysis. This study shows that haplotype analysis would help in determining presence of different founders.

  3. Comparative sequence analysis of the potato cyst nematode resistance locus H1 reveals a major lack of co-linearity between three haplotypes in potato (Solanum tuberosum ssp.).

    PubMed

    Finkers-Tomczak, Anna; Bakker, Erin; de Boer, Jan; van der Vossen, Edwin; Achenbach, Ute; Golas, Tomasz; Suryaningrat, Suwardi; Smant, Geert; Bakker, Jaap; Goverse, Aska

    2011-02-01

    The H1 locus confers resistance to the potato cyst nematode Globodera rostochiensis pathotypes 1 and 4. It is positioned at the distal end of chromosome V of the diploid Solanum tuberosum genotype SH83-92-488 (SH) on an introgression segment derived from S. tuberosum ssp. andigena. Markers from a high-resolution genetic map of the H1 locus (Bakker et al. in Theor Appl Genet 109:146-152, 2004) were used to screen a BAC library to construct a physical map covering a 341-kb region of the resistant haplotype coming from SH. For comparison, physical maps were also generated of the two haplotypes from the diploid susceptible genotype RH89-039-16 (S. tuberosum ssp. tuberosum/S. phureja), spanning syntenic regions of 700 and 319 kb. Gene predictions on the genomic segments resulted in the identification of a large cluster consisting of variable numbers of the CC-NB-LRR type of R genes for each haplotype. Furthermore, the regions were interspersed with numerous transposable elements and genes coding for an extensin-like protein and an amino acid transporter. Comparative analysis revealed a major lack of gene order conservation in the sequences of the three closely related haplotypes. Our data provide insight in the evolutionary mechanisms shaping the H1 locus and will facilitate the map-based cloning of the H1 resistance gene.

  4. Haplotype analysis indicates an association between the DOPA decarboxylase (DDC) gene and nicotine dependence.

    PubMed

    Ma, Jennie Z; Beuten, Joke; Payne, Thomas J; Dupont, Randolph T; Elston, Robert C; Li, Ming D

    2005-06-15

    DOPA decarboxylase (DDC; also known as L-amino acid decarboxylase; AADC) is involved in the synthesis of dopamine, norepinephrine and serotonin. Because the mesolimbic dopaminergic system is implicated in the reinforcing effects of many drugs, including nicotine, the DDC gene is considered a plausible candidate for involvement in the development of vulnerability to nicotine dependence (ND). Further, this gene is located within the 7p11 region that showed a 'suggestive linkage' to ND in our previous genome-wide scan in the Framingham Heart Study population. In the present study, we tested eight single nucleotide polymorphisms (SNPs) within DDC for association with ND, which was assessed by smoking quantity (SQ), the heaviness of smoking index (HSI) and the Fagerstrom test for ND (FTND) score, in a total of 2037 smokers and non-smokers from 602 nuclear families of African- or European-American (AA or EA, respectively) ancestry. Association analysis for individual SNPs using the PBAT-GEE program indicated that SNP rs921451 was significantly associated with two of the three adjusted ND measures in the EA sample (P=0.01-0.04). Haplotype-based association analysis revealed a protective T-G-T-G haplotype for rs921451-rs3735273-rs1451371-rs2060762 in the AA sample, which was significantly associated with all three adjusted ND measures after correction for multiple testing (min Z=-2.78, P=0.006 for HSI). In contrast, we found a high-risk T-G-T-G haplotype for a different SNP combination in the EA sample, rs921451-rs3735273-rs1451371-rs3757472, which showed a significant association after Bonferroni correction with the SQ and FTND score (max Z=2.73, P=0.005 for FTND). In summary, our findings provide the first evidence for the involvement of DDC in the susceptibility to ND and, further, reveal the racial specificity of its impact.

  5. Molecular characterization of a long range haplotype affecting protein yield and mastitis susceptibility in Norwegian Red cattle.

    PubMed

    Sodeland, Marte; Grove, Harald; Kent, Matthew; Taylor, Simon; Svendsen, Morten; Hayes, Ben J; Lien, Sigbjørn

    2011-08-11

    of α(S2)-casein in milk. Highest test scores for CM were found in the region 89-91 Mb on BTA6, very close to a cluster of genes coding for CXC chemokines. Expression levels of some of these CXC chemokines have previously been shown to increase in bovine mammary gland cell lines after exposure to bacterial cell wall components. Molecular characterization of the long range haplotype from the Holstein-Friesian bull 1606 Frasse, imported into NRC in the 1970s, revealed polymorphisms that could affect transcription or translation of the casein gene CSN1S2. Sires with this haplotype had daughters with significantly elevated milk protein content and selection for milk production traits is likely to have increased the frequency of this haplotype in the NRC population. The haplotype was also associated with increased mastitis susceptibility, which might explain parts of the genetic correlation between PY and CM in NRC.

  6. Association analysis of calpain 10 gene variants/haplotypes with gestational diabetes mellitus among Mexican women.

    PubMed

    Castro-Martínez, Anna Gabriela; Sánchez-Corona, José; Vázquez-Vargas, Adriana Patricia; García-Zapién, Alejandra Guadalupe; López-Quintero, Andres; Villalpando-Velazco, Héctor Javier; Flores-Martínez, Silvia Esperanza

    2018-02-28

    Gestational diabetes mellitus (GDM) is a metabolically complex disease with major genetic determinants. GDM has been associated with insulin resistance and dysfunction of pancreatic beta cells, so the GDM candidate genes are those that encode proteins modulating the function and secretion of insulin, such as that for calpain 10 (CAPN10). This study aimed to assess whether single nucleotide polymorphism (SNP)-43, SNP-44, SNP-63, and the indel-19 variant, and specific haplotypes of the CAPN10 gene were associated with gestational diabetes mellitus. We studied 116 patients with gestational diabetes mellitus and 83 women with normal glucose tolerance. Measurements of anthropometric and biochemical parameters were performed. SNP-43, SNP-44, and SNP-63 were identified by polymerase chain reaction (PCR)-restriction fragment length polymorphisms, while the indel-19 variant was detected by TaqMan qPCR assays.  The allele, genotype, and haplotype frequencies of the four variants did not differ significantly between women with gestational diabetes mellitus and controls. However, in women with gestational diabetes mellitus, glucose levels were significantly higher bearing the 3R/3R genotype than in carriers of the 3R/2R genotype of the indel-19 variant (p = 0.006). In conclusion, the 3R/3R genotype of the indel-19 variant of the CAPN-10 gene influenced increased glucose levels in these Mexican women with gestational diabetes mellitus.

  7. OAS single-nucleotide polymorphisms and haplotypes are associated with variations in immune responses to rubella vaccine

    PubMed Central

    Haralambieva, Iana H.; Dhiman, Neelam; Ovsyannikova, Inna G.; Vierkant, Robert A.; Pankratz, V. Shane; Jacobson, Robert M.; Poland, Gregory A.

    2010-01-01

    Interferon (IFN)-induced antiviral genes are crucial players in innate antiviral defense and potential determinants of immune response heterogeneity. We selected 114 candidate SNPs from 12 antiviral genes using an LD tagSNP selection approach and genotyped them in a cohort of 738 schoolchildren immunized with two doses of rubella vaccine. Associations between SNPs/haplotypes and rubella virus-specific immune measures were assessed using linear regression methodologies. We identified 23 significant associations (p<0.05) between polymorphisms within the 2′-5′-oligoadenylate synthetase (OAS) gene cluster, and rubella virus-specific IL-2, IL-10, IL-6 secretion and antibody levels. The minor allele variants of three OAS1 SNPs (rs3741981/Ser162Gly, rs1051042/Thr361Arg, rs2660), located in a linkage disequilibrium block of functional importance, were significantly associated with an increase in rubella virus-specific IL-2/Th1 response (p≤0.024). Seven OAS1 and OAS3 promoter/regulatory SNPs were similarly associated with IL-2 secretion. Importantly, two SNPs (rs3741981 and rs10774670), independently cross-regulated rubella virus-specific IL-10 secretion levels (p≤0.031). Furthermore, both global tests and individual haplotype analyses revealed significant associations between OAS1 haplotypes and rubella virus-specific cytokine secretion. Our results suggest that innate immunity and OAS genetic variations are likely involved in modulating the magnitude and quality of the adaptive immune responses to live attenuated rubella vaccine. PMID:20079393

  8. Association of pro-ghrelin and GHS-R1A gene polymorphisms and haplotypes with heavy alcohol use and body mass.

    PubMed

    Landgren, Sara; Jerlhag, Elisabet; Zetterberg, Henrik; Gonzalez-Quintela, Arturo; Campos, Joaquin; Olofsson, Ulrica; Nilsson, Staffan; Blennow, Kaj; Engel, Jörgen A

    2008-12-01

    Ghrelin, an orexigenic peptide, acts on growth hormone secretagogue receptors (GHS-R1A), expressed in the hypothalamus as well as in important reward nodes such as the ventral tegmental area. Interestingly, ghrelin has been found to activate an important part of the reward systems, i.e., the cholinergic-dopaminergic reward link. Additionally, the rewarding and neurochemical properties of alcohol are, at least in part, mediated via this reward link. There is comorbidity between alcohol dependence and eating disorders. Thus, plasma levels of ghrelin are altered in patients with addictive behaviors such as alcohol and nicotine dependence and in binge eating disorder. This overlap prompted as to investigate the pro-ghrelin and GHS-R1A genes in a haplotype analysis of heavy alcohol-using individuals. A total of 417 Spanish individuals (abstainers, moderate, and heavy alcohol drinkers) were investigated in a haplotype analysis of the pro-ghrelin and GHS-R1A genes. Tag SNPs were chosen using HapMap data and the Tagger and Haploview softwares. These SNPs were then genotyped using TaqMan Allelic Discrimination. SNP rs2232165 of the GHS-R1A gene was associated with heavy alcohol consumption and SNP rs2948694 of the same gene as well as haplotypes of both the pro-ghrelin and the GHS-R1A genes were associated with body mass in heavy alcohol consuming individuals. The present findings are the first to disclose an association between the pro-ghrelin and GHS-R1A genes and heavy alcohol use, further strengthening the role of the ghrelin system in addictive behaviors and brain reward.

  9. Classical sickle beta-globin haplotypes exhibit a high degree of long-range haplotype similarity in African and Afro-Caribbean populations.

    PubMed

    Hanchard, Neil; Elzein, Abier; Trafford, Clare; Rockett, Kirk; Pinder, Margaret; Jallow, Muminatou; Harding, Rosalind; Kwiatkowski, Dominic; McKenzie, Colin

    2007-08-10

    The sickle (betas) mutation in the beta-globin gene (HBB) occurs on five "classical" betas haplotype backgrounds in ethnic groups of African ancestry. Strong selection in favour of the betas allele - a consequence of protection from severe malarial infection afforded by heterozygotes - has been associated with a high degree of extended haplotype similarity. The relationship between classical betas haplotypes and long-range haplotype similarity may have both anthropological and clinical implications, but to date has not been explored. Here we evaluate the haplotype similarity of classical betas haplotypes over 400 kb in population samples from Jamaica, The Gambia, and among the Yoruba of Nigeria (Hapmap YRI). The most common betas sub-haplotype among Jamaicans and the Yoruba was the Benin haplotype, while in The Gambia the Senegal haplotype was observed most commonly. Both subtypes exhibited a high degree of long-range haplotype similarity extending across approximately 400 kb in all three populations. This long-range similarity was significantly greater than that seen for other haplotypes sampled in these populations (P < 0.001), and was independent of marker choice and marker density. Among the Yoruba, Benin haplotypes were highly conserved, with very strong linkage disequilibrium (LD) extending a megabase across the betas mutation. Two different classical betas haplotypes, sampled from different populations, exhibit comparable and extensive long-range haplotype similarity and strong LD. This LD extends across the adjacent recombination hotspot, and is discernable at distances in excess of 400 kb. Although the multi-centric geographic distribution of betas haplotypes indicates strong subdivision among early Holocene sub-Saharan populations, we find no evidence that selective pressures imposed by falciparum malaria varied in intensity or timing between these subpopulations. Our observations also suggest that cis-acting loci, which may influence outcomes in sickle

  10. A powerful approach reveals numerous expression quantitative trait haplotypes in multiple tissues.

    PubMed

    Ying, Dingge; Li, Mulin Jun; Sham, Pak Chung; Li, Miaoxin

    2018-04-26

    Recently many studies showed single nucleotide polymorphisms (SNPs) affect gene expression and contribute to development of complex traits/diseases in a tissue context-dependent manner. However, little is known about haplotype's influence on gene expression and complex traits, which reflects the interaction effect between SNPs. In the present study, we firstly proposed a regulatory region guided eQTL haplotype association analysis approach, and then systematically investigate the expression quantitative trait loci (eQTL) haplotypes in 20 different tissues by the approach. The approach has a powerful design of reducing computational burden by the utilization of regulatory predictions for candidate SNP selection and multiple testing corrections on non-independent haplotypes. The application results in multiple tissues showed that haplotype-based eQTLs not only increased the number of eQTL genes in a tissue specific manner, but were also enriched in loci that associated with complex traits in a tissue-matched manner. In addition, we found that tag SNPs of eQTL haplotypes from whole blood were selectively enriched in certain combination of regulatory elements (e.g. promoters and enhancers) according to predicted chromatin states. In summary, this eQTL haplotype detection approach, together with the application results, shed insights into synergistic effect of sequence variants on gene expression and their susceptibility to complex diseases. The executable application "eHaplo" is implemented in Java and is publicly available at http://grass.cgs.hku.hk/limx/ehaplo/. jonsonfox@gmail.com, limiaoxin@mail.sysu.edu.cn. Supplementary data are available at Bioinformatics online.

  11. Finding gene clusters for a replicated time course study

    PubMed Central

    2014-01-01

    Background Finding genes that share similar expression patterns across samples is an important question that is frequently asked in high-throughput microarray studies. Traditional clustering algorithms such as K-means clustering and hierarchical clustering base gene clustering directly on the observed measurements and do not take into account the specific experimental design under which the microarray data were collected. A new model-based clustering method, the clustering of regression models method, takes into account the specific design of the microarray study and bases the clustering on how genes are related to sample covariates. It can find useful gene clusters for studies from complicated study designs such as replicated time course studies. Findings In this paper, we applied the clustering of regression models method to data from a time course study of yeast on two genotypes, wild type and YOX1 mutant, each with two technical replicates, and compared the clustering results with K-means clustering. We identified gene clusters that have similar expression patterns in wild type yeast, two of which were missed by K-means clustering. We further identified gene clusters whose expression patterns were changed in YOX1 mutant yeast compared to wild type yeast. Conclusions The clustering of regression models method can be a valuable tool for identifying genes that are coordinately transcribed by a common mechanism. PMID:24460656

  12. No association between polymorphisms/haplotypes of the vascular endothelial growth factor gene and preeclampsia

    PubMed Central

    2011-01-01

    Background Preeclampsia (PE) is the first worldwide cause of death in pregnant women, intra-uterine growth retardation, and fetal prematurity. Some vascular endothelial grown factor gene (VEGF) polymorphisms have been associated to PE and other pregnancy disturbances. We evaluated the associations between VEGF genotypes/haplotypes and PE in Mexican women. Methods 164 pregnant women were enrolled in a case-control study (78 cases and 86 normotensive pregnant controls). The rs699947 (-2578C/A), rs1570360 (-1154G/A), rs2010963 (+405G/C), and rs25648 (-7C/T), VEGF variants were discriminated using Polymerase Chain Reaction - Restriction Fragment Length Polymorphism (PCR-RFLP) methods or Taqman single nucleotide polymorphism (SNP) assays. Results The proportions of the minor allele for rs699947, rs1570360, rs2010963, and rs25648 VEGF SNPs were 0.33, 0.2, 0.39, and 0.17 in controls, and 0.39, 0.23, 0.41, and 0.15 in cases, respectively (P values > 0.05). The most frequent haplotypes of rs699947, rs1570360, rs2010963, and rs25648 VEGF SNPs, were C-G-C-C and C-G-G-C with frequencies of 0.39, 0.21 in cases and 0.37, 0.25 in controls, respectively (P values > 0.05) Conclusion There was no evidence of an association between VEGF alleles, genotypes, or haplotypes frequencies and PE in our study. PMID:21575227

  13. Detecting local haplotype sharing and haplotype association

    USDA-ARS?s Scientific Manuscript database

    A novel haplotype association method is presented, and its power is demonstrated. Relying on a statistical model for linkage disequilibrium (LD), the method first infers ancestral haplotypes and their loadings at each marker for each individual. The loadings are then used to quantify local haplotype...

  14. Constrained clusters of gene expression profiles with pathological features.

    PubMed

    Sese, Jun; Kurokawa, Yukinori; Monden, Morito; Kato, Kikuya; Morishita, Shinichi

    2004-11-22

    Gene expression profiles should be useful in distinguishing variations in disease, since they reflect accurately the status of cells. The primary clustering of gene expression reveals the genotypes that are responsible for the proximity of members within each cluster, while further clustering elucidates the pathological features of the individual members of each cluster. However, since the first clustering process and the second classification step, in which the features are associated with clusters, are performed independently, the initial set of clusters may omit genes that are associated with pathologically meaningful features. Therefore, it is important to devise a way of identifying gene expression clusters that are associated with pathological features. We present the novel technique of 'itemset constrained clustering' (IC-Clustering), which computes the optimal cluster that maximizes the interclass variance of gene expression between groups, which are divided according to the restriction that only divisions that can be expressed using common features are allowed. This constraint automatically labels each cluster with a set of pathological features which characterize that cluster. When applied to liver cancer datasets, IC-Clustering revealed informative gene expression clusters, which could be annotated with various pathological features, such as 'tumor' and 'man', or 'except tumor' and 'normal liver function'. In contrast, the k-means method overlooked these clusters.

  15. Haplotypes of heparin-binding epidermal-growth-factor-like growth factor gene are associated with pre-eclampsia.

    PubMed

    Harendra, Galhenagey Gayani; Jayasekara, Rohan W; Dissanayake, Vajira H W

    2012-01-01

    Heparin-binding epidermal-growth-factor-like growth factor (HBEGF) plays an important role in placentation, including impaired placentation, the primary defect seen in pre-eclampsia. We carried out a case-control disease-association study to examine the association of single nucleotide polymorphisms (SNP) in the HBEGF gene and haplotypes defined by them with pre-eclampsia in a Sinhalese population in Sri Lanka. A total of 175 women with pre-eclampsia and 171 matched normotensive controls were genotyped for six SNP selected in silico as having putative functional effects using mass array Sequenom iplex methodology and a newly designed polymerase chain reaction-restriction fragment length polymorphism assay. The individual SNP were not associated with pre-eclampsia. The haplotypes defined by them, however, showed both predisposing (rs13385T,rs2074613G,rs2237076G,rs2074611C,rs4150196A,rs1862176A; odds ratio,1.65; 95% confidence interval1.04-2.60; P=0.032) and protective (rs13385C,rs2074613G,rs2237076A,rs2074611C,rs4150196A,rs1862176A; odds ratio,0.20; 95% confidence interval, 0.04-0.89; P=0.034) effects. These results confirm that polymorphisms in the HGEGF gene are associated with pre-eclampsia. The haplotypes are likely to exert their effects through the numerous transcription regulation factors binding to the polymorphic sites, namely GATA-1, GATA-3, MZF-1 and AML-1a. © 2011 The Authors. Journal of Obstetrics and Gynaecology Research © 2011 Japan Society of Obstetrics and Gynecology.

  16. Haplotype analysis of the HFE gene among populations of Northern Eurasia, in patients with metabolic disorders or stomach cancer, and in long-lived people.

    PubMed

    Mikhailova, S V; Babenko, V N; Ivanoshchuk, D E; Gubina, M A; Maksimov, V N; Solovjova, I G; Voevoda, M I

    2016-06-17

    Previously, it was shown that the HFE gene (associated with human hereditary hemochromatosis) has several haplotypes of intronic polymorphisms. Some haplotype frequencies are race specific and hence can be used in phylogenetic analysis. We assumed that analysis of Caucasoid patients-living now in Western Siberia and having diseases associated with dietary habits and metabolic rate-will allow us to understand the processes of possible selection during settling of the northern part of Asia. Haplotype analysis of Northern Eurasian native and recently settled ethnic groups was performed on polymorphisms rs1799945, rs1800730, rs1800562, rs2071303, rs1800708, rs1572982, rs2794719, rs807209, and rs2032451 of this gene. The CCA haplotype of the rs2071303, rs1800708, and rs1572982 was found to be associated with HLA-A2 (39 %) in Asian populations. Haplotype analysis for the rs1799945, rs1800730, rs1800562, rs2071303, rs1800708, and rs1572982 was performed on Russian patients with some metabolic disorders or stomach cancer and among long-lived people. Decreased frequencies of the TTA haplotype (T in rs2071303, T in rs1800708, and A in rs1572982) were observed in the groups of patients with diseases associated with overweight (fatty liver disease, type 2 diabetes mellitus, or metabolic syndrome + arterial hypertension) as compared with the control sample. We detected significant differences in this haplotype's frequency between the patients with type 2 diabetes mellitus and Russian adolescents, elderly citizens, and long-lived people (χ(2) P value = 0.003, 0.010, and 0.015, respectively). No significant differences in frequencies of the alleles with mutations in coding regions of the HFE gene (C282Y, H63D, and S65C) were detected between the analyzed patients (with stomach cancer, metabolic syndrome, fatty liver disease, or type 2 diabetes mellitus) and the control Caucasoid sample. Monophyletic origin of H63D (rs1799945) was confirmed in Caucasoids and Northern

  17. Population-Specific Haplotype Association of the Postsynaptic Density Gene DLG4 with Schizophrenia, in Family-Based Association Studies

    PubMed Central

    Balan, Shabeesh; Yamada, Kazuo; Hattori, Eiji; Iwayama, Yoshimi; Toyota, Tomoko; Ohnishi, Tetsuo; Maekawa, Motoko; Toyoshima, Manabu; Iwata, Yasuhide; Suzuki, Katsuaki; Kikuchi, Mitsuru; Yoshikawa, Takeo

    2013-01-01

    The post-synaptic density (PSD) of glutamatergic synapses harbors a multitude of proteins critical for maintaining synaptic dynamics. Alteration of protein expression levels in this matrix is a marked phenomenon of neuropsychiatric disorders including schizophrenia, where cognitive functions are impaired. To investigate the genetic relationship of genes expressed in the PSD with schizophrenia, a family-based association analysis of genetic variants in PSD genes such as DLG4, DLG1, PICK1 and MDM2, was performed, using Japanese samples (124 pedigrees, n = 376 subjects). Results showed a significant association of the rs17203281 variant from the DLG4 gene, with preferential transmission of the C allele (p = 0.02), although significance disappeared after correction for multiple testing. Replication analysis of this variant, found no association in a Chinese schizophrenia cohort (293 pedigrees, n = 1163 subjects) or in a Japanese case-control sample (n = 4182 subjects). The DLG4 expression levels between postmortem brain samples from schizophrenia patients showed no significant changes from controls. Interestingly, a five marker haplotype in DLG4, involving rs2242449, rs17203281, rs390200, rs222853 and rs222837, was enriched in a population specific manner, where the sequences A-C-C-C-A and G-C-C-C-A accumulated in Japanese (p = 0.0009) and Chinese (p = 0.0007) schizophrenia pedigree samples, respectively. However, this could not be replicated in case-control samples. None of the variants in other examined candidate genes showed any significant association in these samples. The current study highlights a putative role for DLG4 in schizophrenia pathogenesis, evidenced by haplotype association, and warrants further dense screening for variants within these haplotypes. PMID:23936182

  18. The JAK2 GGCC (46/1) Haplotype in Myeloproliferative Neoplasms: Causal or Random?

    PubMed

    Anelli, Luisa; Zagaria, Antonella; Specchia, Giorgina; Albano, Francesco

    2018-04-11

    The germline JAK2 haplotype known as "GGCC or 46/1 haplotype" (haplotype GGCC_46/1 ) consists of a combination of single nucleotide polymorphisms (SNPs) mapping in a region of about 250 kb, extending from the JAK2 intron 10 to the Insulin-like 4 ( INLS4 ) gene. Four main SNPs (rs3780367, rs10974944, rs12343867, and rs1159782) generating a "GGCC" combination are more frequently indicated to represent the JAK2 haplotype. These SNPs are inherited together and are frequently associated with the onset of myeloproliferative neoplasms (MPN) positive for both JAK2 V617 and exon 12 mutations. The association between the JAK2 haplotype GGCC_46/1 and mutations in other genes, such as thrombopoietin receptor ( MPL ) and calreticulin ( CALR ), or the association with triple negative MPN, is still controversial. This review provides an overview of the frequency and the role of the JAK2 haplotype GGCC_46/1 in the pathogenesis of different myeloid neoplasms and describes the hypothetical mechanisms at the basis of the association with JAK2 gene mutations. Moreover, possible clinical implications are discussed, as different papers reported contrasting data about the correlation between the JAK2 haplotype GGCC_46/1 and blood cell count, survival, or disease progression.

  19. CPm gene diversity in field isolates of Citrus tristeza virus from Colombia.

    PubMed

    Oliveros-Garay, Oscar Arturo; Martinez-Salazar, Natalhie; Torres-Ruiz, Yanneth; Acosta, Orlando

    2009-01-01

    The nucleotide sequence diversity of the CPm gene from 28 field isolates of Citrus tristeza virus (CTV) was assessed by SSCP and sequence analyses. These isolates showed two major shared haplotypes, which differed in distribution: A1 was the major haplotype in 23 isolates from different geographic regions, whereas R1 was found in isolates from a discrete region. Phylogenetic reconstruction clustered A1 within an independent group, while R1 was grouped with mild isolates T30 from Florida and T385 from Spain. Some isolates contained several minor haplotypes, which were very similar to, and associated with, the major haplotype.

  20. Genome sequence, comparative analysis and haplotype structure of the domestic dog.

    PubMed

    Lindblad-Toh, Kerstin; Wade, Claire M; Mikkelsen, Tarjei S; Karlsson, Elinor K; Jaffe, David B; Kamal, Michael; Clamp, Michele; Chang, Jean L; Kulbokas, Edward J; Zody, Michael C; Mauceli, Evan; Xie, Xiaohui; Breen, Matthew; Wayne, Robert K; Ostrander, Elaine A; Ponting, Chris P; Galibert, Francis; Smith, Douglas R; DeJong, Pieter J; Kirkness, Ewen; Alvarez, Pablo; Biagi, Tara; Brockman, William; Butler, Jonathan; Chin, Chee-Wye; Cook, April; Cuff, James; Daly, Mark J; DeCaprio, David; Gnerre, Sante; Grabherr, Manfred; Kellis, Manolis; Kleber, Michael; Bardeleben, Carolyne; Goodstadt, Leo; Heger, Andreas; Hitte, Christophe; Kim, Lisa; Koepfli, Klaus-Peter; Parker, Heidi G; Pollinger, John P; Searle, Stephen M J; Sutter, Nathan B; Thomas, Rachael; Webber, Caleb; Baldwin, Jennifer; Abebe, Adal; Abouelleil, Amr; Aftuck, Lynne; Ait-Zahra, Mostafa; Aldredge, Tyler; Allen, Nicole; An, Peter; Anderson, Scott; Antoine, Claudel; Arachchi, Harindra; Aslam, Ali; Ayotte, Laura; Bachantsang, Pasang; Barry, Andrew; Bayul, Tashi; Benamara, Mostafa; Berlin, Aaron; Bessette, Daniel; Blitshteyn, Berta; Bloom, Toby; Blye, Jason; Boguslavskiy, Leonid; Bonnet, Claude; Boukhgalter, Boris; Brown, Adam; Cahill, Patrick; Calixte, Nadia; Camarata, Jody; Cheshatsang, Yama; Chu, Jeffrey; Citroen, Mieke; Collymore, Alville; Cooke, Patrick; Dawoe, Tenzin; Daza, Riza; Decktor, Karin; DeGray, Stuart; Dhargay, Norbu; Dooley, Kimberly; Dooley, Kathleen; Dorje, Passang; Dorjee, Kunsang; Dorris, Lester; Duffey, Noah; Dupes, Alan; Egbiremolen, Osebhajajeme; Elong, Richard; Falk, Jill; Farina, Abderrahim; Faro, Susan; Ferguson, Diallo; Ferreira, Patricia; Fisher, Sheila; FitzGerald, Mike; Foley, Karen; Foley, Chelsea; Franke, Alicia; Friedrich, Dennis; Gage, Diane; Garber, Manuel; Gearin, Gary; Giannoukos, Georgia; Goode, Tina; Goyette, Audra; Graham, Joseph; Grandbois, Edward; Gyaltsen, Kunsang; Hafez, Nabil; Hagopian, Daniel; Hagos, Birhane; Hall, Jennifer; Healy, Claire; Hegarty, Ryan; Honan, Tracey; Horn, Andrea; Houde, Nathan; Hughes, Leanne; Hunnicutt, Leigh; Husby, M; Jester, Benjamin; Jones, Charlien; Kamat, Asha; Kanga, Ben; Kells, Cristyn; Khazanovich, Dmitry; Kieu, Alix Chinh; Kisner, Peter; Kumar, Mayank; Lance, Krista; Landers, Thomas; Lara, Marcia; Lee, William; Leger, Jean-Pierre; Lennon, Niall; Leuper, Lisa; LeVine, Sarah; Liu, Jinlei; Liu, Xiaohong; Lokyitsang, Yeshi; Lokyitsang, Tashi; Lui, Annie; Macdonald, Jan; Major, John; Marabella, Richard; Maru, Kebede; Matthews, Charles; McDonough, Susan; Mehta, Teena; Meldrim, James; Melnikov, Alexandre; Meneus, Louis; Mihalev, Atanas; Mihova, Tanya; Miller, Karen; Mittelman, Rachel; Mlenga, Valentine; Mulrain, Leonidas; Munson, Glen; Navidi, Adam; Naylor, Jerome; Nguyen, Tuyen; Nguyen, Nga; Nguyen, Cindy; Nguyen, Thu; Nicol, Robert; Norbu, Nyima; Norbu, Choe; Novod, Nathaniel; Nyima, Tenchoe; Olandt, Peter; O'Neill, Barry; O'Neill, Keith; Osman, Sahal; Oyono, Lucien; Patti, Christopher; Perrin, Danielle; Phunkhang, Pema; Pierre, Fritz; Priest, Margaret; Rachupka, Anthony; Raghuraman, Sujaa; Rameau, Rayale; Ray, Verneda; Raymond, Christina; Rege, Filip; Rise, Cecil; Rogers, Julie; Rogov, Peter; Sahalie, Julie; Settipalli, Sampath; Sharpe, Theodore; Shea, Terrance; Sheehan, Mechele; Sherpa, Ngawang; Shi, Jianying; Shih, Diana; Sloan, Jessie; Smith, Cherylyn; Sparrow, Todd; Stalker, John; Stange-Thomann, Nicole; Stavropoulos, Sharon; Stone, Catherine; Stone, Sabrina; Sykes, Sean; Tchuinga, Pierre; Tenzing, Pema; Tesfaye, Senait; Thoulutsang, Dawa; Thoulutsang, Yama; Topham, Kerri; Topping, Ira; Tsamla, Tsamla; Vassiliev, Helen; Venkataraman, Vijay; Vo, Andy; Wangchuk, Tsering; Wangdi, Tsering; Weiand, Michael; Wilkinson, Jane; Wilson, Adam; Yadav, Shailendra; Yang, Shuli; Yang, Xiaoping; Young, Geneva; Yu, Qing; Zainoun, Joanne; Zembek, Lisa; Zimmer, Andrew; Lander, Eric S

    2005-12-08

    Here we report a high-quality draft genome sequence of the domestic dog (Canis familiaris), together with a dense map of single nucleotide polymorphisms (SNPs) across breeds. The dog is of particular interest because it provides important evolutionary information and because existing breeds show great phenotypic diversity for morphological, physiological and behavioural traits. We use sequence comparison with the primate and rodent lineages to shed light on the structure and evolution of genomes and genes. Notably, the majority of the most highly conserved non-coding sequences in mammalian genomes are clustered near a small subset of genes with important roles in development. Analysis of SNPs reveals long-range haplotypes across the entire dog genome, and defines the nature of genetic diversity within and across breeds. The current SNP map now makes it possible for genome-wide association studies to identify genes responsible for diseases and traits, with important consequences for human and companion animal health.

  1. Haplotypes in SLC24A5 Gene as Ancestry Informative Markers in Different Populations

    PubMed Central

    Giardina, Emiliano; Pietrangeli, Ilenia; Martínez-Labarga, Cristina; Martone, Claudia; de Angelis, Flavio; Spinella, Aldo; De Stefano, Gianfranco; Rickards, Olga; Novelli, Giuseppe

    2008-01-01

    Ancestry informative markers (AIMs) are human polymorphisms that exhibit substantially allele frequency differences among populations. These markers can be useful to provide information about ancestry of samples which may be useful in predicting a perpetrator’s ethnic origin to aid criminal investigations. Variations in human pigmentation are the most obvious phenotypes to distinguish individuals. It has been recently shown that the variation of a G in an A allele of the coding single-nucleotide polymorphism (SNP) rs1426654 within SLC24A5 gene varies in frequency among several population samples according to skin pigmentation. Because of these observations, the SLC24A5 locus has been evaluated as Ancestry Informative Region (AIR) by typing rs1426654 together with two additional intragenic markers (rs2555364 and rs16960620) in 471 unrelated individuals originating from three different continents (Africa, Asia and Europe). This study further supports the role of human SLC24A5 gene in skin pigmentation suggesting that variations in SLC24A5 haplotypes can correlate with human migration and ancestry. Furthermore, our data do reveal the utility of haplotype and combined unphased genotype analysis of SLC24A5 in predicting ancestry and provide a good example of usefulness of genetic characterization of larger regions, in addition to single polymorphisms, as candidates for population-specific sweeps in the ancestral population. PMID:19440451

  2. Haplotype-Based Association Analysis via Variance-Components Score Test

    PubMed Central

    Tzeng, Jung-Ying ; Zhang, Daowen 

    2007-01-01

    Haplotypes provide a more informative format of polymorphisms for genetic association analysis than do individual single-nucleotide polymorphisms. However, the practical efficacy of haplotype-based association analysis is challenged by a trade-off between the benefits of modeling abundant variation and the cost of the extra degrees of freedom. To reduce the degrees of freedom, several strategies have been considered in the literature. They include (1) clustering evolutionarily close haplotypes, (2) modeling the level of haplotype sharing, and (3) smoothing haplotype effects by introducing a correlation structure for haplotype effects and studying the variance components (VC) for association. Although the first two strategies enjoy a fair extent of power gain, empirical evidence showed that VC methods may exhibit only similar or less power than the standard haplotype regression method, even in cases of many haplotypes. In this study, we report possible reasons that cause the underpowered phenomenon and show how the power of the VC strategy can be improved. We construct a score test based on the restricted maximum likelihood or the marginal likelihood function of the VC and identify its nontypical limiting distribution. Through simulation, we demonstrate the validity of the test and investigate the power performance of the VC approach and that of the standard haplotype regression approach. With suitable choices for the correlation structure, the proposed method can be directly applied to unphased genotypic data. Our method is applicable to a wide-ranging class of models and is computationally efficient and easy to implement. The broad coverage and the fast and easy implementation of this method make the VC strategy an effective tool for haplotype analysis, even in modern genomewide association studies. PMID:17924336

  3. Classical sickle beta-globin haplotypes exhibit a high degree of long-range haplotype similarity in African and Afro-Caribbean populations

    PubMed Central

    Hanchard, Neil; Elzein, Abier; Trafford, Clare; Rockett, Kirk; Pinder, Margaret; Jallow, Muminatou; Harding, Rosalind; Kwiatkowski, Dominic; McKenzie, Colin

    2007-01-01

    Background The sickle (βs) mutation in the beta-globin gene (HBB) occurs on five "classical" βs haplotype backgrounds in ethnic groups of African ancestry. Strong selection in favour of the βs allele – a consequence of protection from severe malarial infection afforded by heterozygotes – has been associated with a high degree of extended haplotype similarity. The relationship between classical βs haplotypes and long-range haplotype similarity may have both anthropological and clinical implications, but to date has not been explored. Here we evaluate the haplotype similarity of classical βs haplotypes over 400 kb in population samples from Jamaica, The Gambia, and among the Yoruba of Nigeria (Hapmap YRI). Results The most common βs sub-haplotype among Jamaicans and the Yoruba was the Benin haplotype, while in The Gambia the Senegal haplotype was observed most commonly. Both subtypes exhibited a high degree of long-range haplotype similarity extending across approximately 400 kb in all three populations. This long-range similarity was significantly greater than that seen for other haplotypes sampled in these populations (P < 0.001), and was independent of marker choice and marker density. Among the Yoruba, Benin haplotypes were highly conserved, with very strong linkage disequilibrium (LD) extending a megabase across the βs mutation. Conclusion Two different classical βs haplotypes, sampled from different populations, exhibit comparable and extensive long-range haplotype similarity and strong LD. This LD extends across the adjacent recombination hotspot, and is discernable at distances in excess of 400 kb. Although the multi-centric geographic distribution of βs haplotypes indicates strong subdivision among early Holocene sub-Saharan populations, we find no evidence that selective pressures imposed by falciparum malaria varied in intensity or timing between these subpopulations. Our observations also suggest that cis-acting loci, which may influence

  4. eNOS gene haplotype is indirectly associated with the recovery of cardiovascular autonomic modulation from exercise.

    PubMed

    Silva, Bruno M; Barbosa, Thales C; Neves, Fabricia J; Sales, Allan K; Rocha, Natalia G; Medeiros, Renata F; Pereira, Felipe S; Garcia, Vinicius P; Cardoso, Fabiane T; Nobrega, Antonio C L

    2014-12-01

    Polymorphisms in the endothelial nitric oxide synthase (eNOS) gene decrease expression and activation of eNOS in vitro, which is associated with lower post-exercise increase in vasodilator reactivity in vivo. However, it is unknown whether such polymorphisms are associated with other eNOS-related phenotypes during recovery from exercise. Therefore, we investigated the impact of an eNOS haplotype containing polymorphic alleles at loci -786 and 894 on the recovery of cardiovascular autonomic function from exercise. Sedentary, non-obese, healthy subjects were enrolled [n = 107, age 32 ± 1 years (mean ± SEM)]. Resting autonomic modulation (heart rate variability, systolic blood pressure variability, and spontaneous baroreflex sensitivity) and vascular reactivity (forearm hyperemic response post-ischemia) were assessed at baseline, 10, 60, and 120 min after a maximal cardiopulmonary exercise test. Besides, autonomic function was assessed by heart rate recovery (HRR) immediately after peak exercise. Haplotype analysis showed that vagal modulation (i.e., HF n.u.) was significantly higher, combined sympathetic and vagal modulation (i.e., LF/HF) was significantly lower and total blood pressure variability was significantly lower post-exercise in a haplotype containing polymorphic alleles (H2) compared to a haplotype with wild type alleles (H1). HRR was similar between groups. Corroborating previous evidence, H2 had significantly lower post-exercise increase in vasodilator reactivity than H1. In conclusion, a haplotype containing polymorphic alleles at loci -786 and 894 had enhanced recovery of autonomic modulation from exercise, along with unchanged HRR, and attenuated vasodilator reactivity. Then, these results suggest an autonomic compensatory response of a direct deleterious effect of eNOS polymorphisms on the vascular function. Copyright © 2014 Elsevier B.V. All rights reserved.

  5. Interactions between Serotonin Transporter Gene Haplotypes and Quality of Mothers' Parenting Predict the Development of Children's Noncompliance

    ERIC Educational Resources Information Center

    Sulik, Michael J.; Eisenberg, Nancy; Lemery-Chalfant, Kathryn; Spinrad, Tracy L.; Silva, Kassondra M.; Eggum, Natalie D.; Betkowski, Jennifer A.; Kupfer, Anne; Smith, Cynthia L.; Gaertner, Bridget; Stover, Daryn A.; Verrelli, Brian C.

    2012-01-01

    The LPR and STin2 polymorphisms of the serotonin transporter gene (SLC6A4) were combined into haplotypes that, together with quality of maternal parenting, were used to predict initial levels and linear change in children's (N = 138) noncompliance and aggression from age 18-54 months. Quality of mothers' parenting behavior was observed when…

  6. Relationship of the APOA5/A4/C3/A1 gene cluster and APOB gene polymorphisms with dyslipidemia.

    PubMed

    Ou, H J; Huang, G; Liu, W; Ma, X L; Wei, Y; Zhou, T; Pan, Z M

    2015-08-10

    We determined the alleles of ten single nucleotide poly-morphisms (SNPs) in the APOA5/A4/C3/A1 gene cluster and in APOB in Han Chinese from Xinjiang Shihezi, China using MALDI-TOF mass spectrometry, and explored the correlation between these SNPs and dyslipidemia through a case-control study design with 250 pa-tients and 250 normal controls. All SNPs except for APOA5 rs2072560 conformed to Hardy-Weinberg equilibrium (all P > 0.05). APOA5 rs651821, APOA4 rs5104, APOC3 rs734104, and APOC3 rs5128 geno-type and allele frequencies were significantly different between groups (all P < 0.01). For rs651821, the risks of dyslipidemia for the CC or CC+CT genotypes were 9.917 or 1.859 times that of TT, and the risk of the C vs T allele was 2.027. For rs5104, the AG, GG, or AG+GG risks were 1.797, 1.861, and 1.809 times AA, and the G vs A risk was 1.427. For rs734104, the CT, CC, or CC+CT risks were 1.851, 2.570, and 1.958 times TT, and the C vs T risk was 1.610. For rs5128, the GC or CC+GC risks were 1.738 or 1.749 times GG, and the C vs G risk was 1.477. Compared with the wild-type haplotype TATG, the risks of dyslipidemia with CGCC, TGCC, or CATG haplotypes (odds ratios = 2.434, 1.503, and 2.740, respectively) were significantly higher. Our results suggested that these four SNPs were significantly associated with dyslipidemia in Xinjiang Shihezi Han Chinese, and might serve as risk factors for dyslipidemia. Individuals carrying the CGCC, TGCC, or CATG haplotypes were prone to dyslipidemia.

  7. Genomic analyses of bacterial porin-cytochrome gene clusters

    DOE PAGES

    Shi, Liang; Fredrickson, James K.; Zachara, John M.

    2014-11-26

    In this study, the porin-cytochrome (Pcc) protein complex is responsible for trans-outer membrane electron transfer during extracellular reduction of Fe(III) by the dissimilatory metal-reducing bacterium Geobacter sulfurreducens PCA. The identified and characterized Pcc complex of G. sulfurreducens PCA consists of a porin-like outer-membrane protein, a periplasmic 8-heme c type cytochrome (c-Cyt) and an outer-membrane 12-heme c-Cyt, and the genes encoding the Pcc proteins are clustered in the same regions of genome (i.e., the pcc gene clusters) of G. sulfurreducens PCA. A survey of additionally microbial genomes has identified the pcc gene clusters in all sequenced Geobacter spp. and other bacteriamore » from six different phyla, including Anaeromyxobacter dehalogenans 2CP-1, A. dehalogenans 2CP-C, Anaeromyxobacter sp. K, Candidatus Kuenenia stuttgartiensis, Denitrovibrio acetiphilus DSM 12809, Desulfurispirillum indicum S5, Desulfurivibrio alkaliphilus AHT2, Desulfurobacterium thermolithotrophum DSM 11699, Desulfuromonas acetoxidans DSM 684, Ignavibacterium album JCM 16511, and Thermovibrio ammonificans HB-1. The numbers of genes in the pcc gene clusters vary, ranging from two to nine. Similar to the metal-reducing (Mtr) gene clusters of other Fe(III)-reducing bacteria, such as Shewanella spp., additional genes that encode putative c-Cyts with predicted cellular localizations at the cytoplasmic membrane, periplasm and outer membrane often associate with the pcc gene clusters. This suggests that the Pcc-associated c-Cyts may be part of the pathways for extracellular electron transfer reactions. The presence of pcc gene clusters in the microorganisms that do not reduce solid-phase Fe(III) and Mn(IV) oxides, such as D. alkaliphilus AHT2 and I. album JCM 16511, also suggests that some of the pcc gene clusters may be involved in extracellular electron transfer reactions with the substrates other than Fe(III) and Mn(IV) oxides.« less

  8. Haplotypic Analysis of Wellcome Trust Case Control Consortium Data

    PubMed Central

    Browning, Brian L.; Browning, Sharon R.

    2008-01-01

    We applied a recently developed multilocus association testing method (localized haplotype clustering) to Wellcome Trust Case Control Consortium data (14,000 cases of seven common diseases and 3,000 shared controls genotyped on the Affymetrix 500K array). After rigorous data quality filtering, we identified three disease-associated loci with strong statistical support from localized haplotype cluster tests but with only marginal significance in single marker tests. These loci are chromosomes 10p15.1 with type 1 diabetes (p = 5.1 × 10-9), 12q15 with type 2 diabetes (p = 1.9 × 10-7) and 15q26.2 with hypertension (p = 2.8 × 10-8). We also detected the association of chromosome 9p21.3 with type 2 diabetes (p = 2.8 × 10-8), although this locus did not pass our stringent genotype quality filters. The association of 10p15.1 with type 1 diabetes and 9p21.3 with type 2 diabetes have both been replicated in other studies using independent data sets. Overall, localized haplotype cluster analysis had better success detecting disease associated variants than a previous single-marker analysis of imputed HapMap SNPs. We found that stringent application of quality score thresholds to genotype data substantially reduced false-positive results arising from genotype error. In addition, we demonstrate that it is possible to simultaneously phase 16,000 individuals genotyped on genome-wide data (450K markers) using the Beagle software package. PMID:18224336

  9. Prokaryotic Gene Clusters: A Rich Toolbox for Synthetic Biology

    PubMed Central

    Fischbach, Michael; Voigt, Christopher A.

    2014-01-01

    Bacteria construct elaborate nanostructures, obtain nutrients and energy from diverse sources, synthesize complex molecules, and implement signal processing to react to their environment. These complex phenotypes require the coordinated action of multiple genes, which are often encoded in a contiguous region of the genome, referred to as a gene cluster. Gene clusters sometimes contain all of the genes necessary and sufficient for a particular function. As an evolutionary mechanism, gene clusters facilitate the horizontal transfer of the complete function between species. Here, we review recent work on a number of clusters whose functions are relevant to biotechnology. Engineering these clusters has been hindered by their regulatory complexity, the need to balance the expression of many genes, and a lack of tools to design and manipulate DNA at this scale. Advances in synthetic biology will enable the large-scale bottom-up engineering of the clusters to optimize their functions, wake up cryptic clusters, or to transfer them between organisms. Understanding and manipulating gene clusters will move towards an era of genome engineering, where multiple functions can be “mixed-and-matched” to create a designer organism. PMID:21154668

  10. Hypercontrols in genotype-phenotype analysis reveal ancestral haplotypes associated with essential hypertension.

    PubMed

    Balam-Ortiz, Eros; Esquivel-Villarreal, Adolfo; Huerta-Hernandez, David; Fernandez-Lopez, Juan Carlos; Alfaro-Ruiz, Luis; Muñoz-Monroy, Omar; Gutierrez, Ruth; Figueroa-Genis, Enrique; Carrillo, Karol; Elizalde, Adela; Hidalgo, Alfredo; Rodriguez, Mauricio; Urushihara, Maki; Kobori, Hiroyuki; Jimenez-Sanchez, Gerardo

    2012-04-01

    The angiotensinogen gene locus has been associated with essential hypertension in most populations analyzed to date. Increased plasma angiotensinogen levels have been proposed as an underlying cause of essential hypertension in whites; however, differences in the genetic regulation of plasma angiotensinogen levels have also been reported for other populations. The aim of this study was to analyze the relationship between angiotensinogen gene polymorphisms and haplotypes with plasma angiotensinogen levels and the risk of essential hypertension in the Mexican population. We genotyped 9 angiotensinogen gene polymorphisms in 706 individuals. Four polymorphisms, A-6, C4072, C6309, and G12775, were associated with increased risk, and the strongest association was found for the C6309 allele (χ(2)=23.9; P=0.0000009), which resulted in an odds ratio of 3.0 (95% CI: 1.8-4.9; P=0.000006) in the recessive model. Two polymorphisms, A-20C (P=0.003) and C3389T (P=0.0001), were associated with increased plasma angiotensinogen levels but did not show association with essential hypertension. The haplotypes H1 (χ(2)=8.1; P=0.004) and H5 (χ(2)=5.1; P=0.02) were associated with essential hypertension. Using phylogenetic analysis, we found that haplotypes 1 and 5 are the human ancestral haplotypes. Our results suggest that the positive association between angiotensinogen gene polymorphisms and haplotypes with essential hypertension is not simply explained by an increase in plasma angiotensinogen concentration. Complex interactions between risk alleles suggest that these haplotypes act as "superalleles."

  11. Hypercontrols in Genotype-Phenotype Analysis Reveal Ancestral Haplotypes Associated With Essential Hypertension

    PubMed Central

    Balam-Ortiz, Eros; Esquivel-Villarreal, Adolfo; Huerta-Hernandez, David; Fernandez-Lopez, Juan Carlos; Alfaro-Ruiz, Luis; Muñoz-Monroy, Omar; Gutierrez, Ruth; Figueroa-Genis, Enrique; Carrillo, Karol; Elizalde, Adela; Hidalgo, Alfredo; Rodriguez, Mauricio; Urushihara, Maki; Kobori, Hiroyuki; Jimenez-Sanchez, Gerardo

    2012-01-01

    The angiotensinogen gene locus has been associated with essential hypertension in most populations analyzed to date. Increased plasma angiotensinogen levels have been proposed as an underlying cause of essential hypertension in whites; however, differences in the genetic regulation of plasma angiotensinogen levels have also been reported for other populations. The aim of this study was to analyze the relationship between angiotensinogen gene polymorphisms and haplotypes with plasma angiotensinogen levels and the risk of essential hypertension in the Mexican population. We genotyped 9 angiotensinogen gene polymorphisms in 706 individuals. Four polymorphisms, A-6, C4072, C6309, and G12775, were associated with increased risk, and the strongest association was found for the C6309 allele (χ2 = 23.9; P = 0.0000009), which resulted in an odds ratio of 3.0 (95% CI: 1.8–4.9; P = 0.000006) in the recessive model. Two polymorphisms, A-20C (P = 0.003) and C3389T (P = 0.0001), were associated with increased plasma angiotensinogen levels but did not show association with essential hypertension. The haplotypes H1 (χ2 = 8.1; P = 0.004) and H5 (χ2 = 5.1; P = 0.02) were associated with essential hypertension. Using phylogenetic analysis, we found that haplotypes 1 and 5 are the human ancestral haplotypes. Our results suggest that the positive association between angiotensinogen gene polymorphisms and haplotypes with essential hypertension is not simply explained by an increase in plasma angiotensinogen concentration. Complex interactions between risk alleles suggest that these haplotypes act as “superalleles.” PMID:22371359

  12. A Gene-Oriented Haplotype Comparison Reveals Recently Selected Genomic Regions in Temperate and Tropical Maize Germplasm

    PubMed Central

    Zhang, Jie; Li, Yongxiang; Zheng, Jun; Zhang, Hongwei; Yang, Xiaohong; Wang, Jianhua; Wang, Guoying

    2017-01-01

    The extensive genetic variation present in maize (Zea mays) germplasm makes it possible to detect signatures of positive artificial selection that occurred during temperate and tropical maize improvement. Here we report an analysis of 532,815 polymorphisms from a maize association panel consisting of 368 diverse temperate and tropical inbred lines. We developed a gene-oriented approach adapting exonic polymorphisms to identify recently selected alleles by comparing haplotypes across the maize genome. This analysis revealed evidence of selection for more than 1100 genomic regions during recent improvement, and included regulatory genes and key genes with visible mutant phenotypes. We find that selected candidate target genes in temperate maize are enriched in biosynthetic processes, and further examination of these candidates highlights two cases, sucrose flux and oil storage, in which multiple genes in a common pathway can be cooperatively selected. Finally, based on available parallel gene expression data, we hypothesize that some genes were selected for regulatory variations, resulting in altered gene expression. PMID:28099470

  13. Deletion analysis of male sterility effects of t-haplotypes in the mouse.

    PubMed

    Bennett, D; Artzt, K

    1990-01-01

    We present data on the effects of three chromosome 17 deletions on transmission ratio distortion (TRD) and sterility of several t-haplotypes. All three deletions have similar effects on male TRD: that is, Tdel/tcomplete genotypes all transmit their t-haplotype in very high proportion. However, each deletion has different effects on sterility of heterozygous males, with TOr/t being fertile, Thp/t less fertile, and TOrl/t still less fertile. These data suggest that wild-type genes on chromosomes homologous to t-haplotypes can be important regulators of both TRD and fertility in males, and that the wild-type genes concerned with TRD and fertility are at least to some extent different. The data also provide a rough map of the positions of these genes.

  14. Genotypes and Haplotypes of the Estrogen Receptor α Gene (ESR1) Are Associated With Female-to-Male Gender Dysphoria.

    PubMed

    Cortés-Cortés, Joselyn; Fernández, Rosa; Teijeiro, Nerea; Gómez-Gil, Esther; Esteva, Isabel; Almaraz, Mari Cruz; Guillamón, Antonio; Pásaro, Eduardo

    2017-03-01

    Gender dysphoria, a marked incongruence between one's experienced gender and biological sex, is commonly believed to arise from discrepant cerebral and genital sexual differentiation. With the discovery that estrogen receptor β is associated with female-to-male (FtM) but not with male-to-female (MtF) gender dysphoria, and given estrogen receptor α involvement in central nervous system masculinization, it was hypothesized that estrogen receptor α, encoded by the ESR1 gene, also might be implicated. To investigate whether ESR1 polymorphisms (TA)n-rs3138774, PvuII-rs2234693, and XbaI-rs9340799 and their haplotypes are associated with gender dysphoria in adults. Molecular analysis was performed in peripheral blood samples from 183 FtM subjects, 184 MtF subjects, and 394 sex- and ethnically-matched controls. Genotype and haplotype analyses of the (TA)n-rs3138774, PvuII-rs2234693, and XbaI-rs9340799 polymorphisms. Allele and genotype frequencies for the polymorphism XbaI were statistically significant only in FtM vs control XX subjects (P = .021 and P = .020). In XX individuals, the A/G genotype was associated with a low risk of gender dysphoria (odds ratio [OR] = 0.34; 95% CI = 0.16-0.74; P = .011); in XY individuals, the A/A genotype implied a low risk of gender dysphoria (OR = 0.39; 95% CI = 0.17-0.89; P = .008). Binary logistic regression showed partial effects for all three polymorphisms in FtM but not in MtF subjects. The three polymorphisms were in linkage disequilibrium: a small number of TA repeats was linked to the presence of PvuII and XbaI restriction sites (haplotype S-T-A), and a large number of TA repeats was linked to the absence of these restriction sites (haplotype L-C-G). In XX individuals, the presence of haplotype L-C-G carried a low risk of gender dysphoria (OR = 0.66; 95% CI = 0.44-0.99; P = .046), whereas the presence of haplotype L-C-A carried a high susceptibility to gender dysphoria (OR = 3.96; 95% CI = 1.04-15.02; P = .044

  15. The JAK2 GGCC (46/1) Haplotype in Myeloproliferative Neoplasms: Causal or Random?

    PubMed Central

    Anelli, Luisa; Zagaria, Antonella; Specchia, Giorgina

    2018-01-01

    The germline JAK2 haplotype known as “GGCC or 46/1 haplotype” (haplotypeGGCC_46/1) consists of a combination of single nucleotide polymorphisms (SNPs) mapping in a region of about 250 kb, extending from the JAK2 intron 10 to the Insulin-like 4 (INLS4) gene. Four main SNPs (rs3780367, rs10974944, rs12343867, and rs1159782) generating a “GGCC” combination are more frequently indicated to represent the JAK2 haplotype. These SNPs are inherited together and are frequently associated with the onset of myeloproliferative neoplasms (MPN) positive for both JAK2 V617 and exon 12 mutations. The association between the JAK2 haplotypeGGCC_46/1 and mutations in other genes, such as thrombopoietin receptor (MPL) and calreticulin (CALR), or the association with triple negative MPN, is still controversial. This review provides an overview of the frequency and the role of the JAK2 haplotypeGGCC_46/1 in the pathogenesis of different myeloid neoplasms and describes the hypothetical mechanisms at the basis of the association with JAK2 gene mutations. Moreover, possible clinical implications are discussed, as different papers reported contrasting data about the correlation between the JAK2 haplotypeGGCC_46/1 and blood cell count, survival, or disease progression. PMID:29641446

  16. Differential fine-tuning of gene expression regulation in coffee leaves by CcDREB1D promoter haplotypes under water deficit.

    PubMed

    Alves, Gabriel Sergio Costa; Torres, Luana Ferreira; Déchamp, Eveline; Breitler, Jean-Christophe; Joët, Thierry; Gatineau, Frédéric; Andrade, Alan Carvalho; Bertrand, Benoît; Marraccini, Pierre; Etienne, Hervé

    2017-05-17

    Despite the importance of the DREB1D gene (also known as CBF4) in plant responses to water deficit and cold stress, studies analysing its regulation by transgenic approaches are lacking. In the current work, a functional study of three CcDREB1D promoter haplotypes (named HP15, HP16 and HP17) isolated from drought-tolerant and drought-sensitive clones of Coffea canephora was carried out in plants of C. arabica stably transformed by Agrobacterium tumefaciens by analysing their ability to regulate the expression of the uidA reporter gene in response to water deficit mimicked by polyethylene glycol (-2.0 MPa) and low relative humidity treatments. A deletion analysis of their corresponding 5'-upstream regions revealed increased specificity of β-glucuronidase activity in the polyethylene glycol and low relative humidity treatments, with high expression in leaf mesophyll and guard cells in full-length constructs. RT-qPCR assays also revealed that the HP16 haplotype (specific to clone tolerant to water deficit) had stronger and earlier activity compared with the HP15 and HP17 haplotypes. As most of the cis-regulatory elements involved in ABA-dependent and -independent networks, tissue specificity and light regulation are common to these haplotypes, we propose that their organization, as well as the nucleic acid polymorphisms present outside these boxes, may play a role in modulating activities of DREB1D promoters in guard cells. © The Author 2017. Published by Oxford University Press on behalf of the Society for Experimental Biology.

  17. Gene Cluster Encoding Cholate Catabolism in Rhodococcus spp.

    PubMed Central

    Wilbrink, Maarten H.; Casabon, Israël; Stewart, Gordon R.; Liu, Jie; van der Geize, Robert; Eltis, Lindsay D.

    2012-01-01

    Bile acids are highly abundant steroids with important functions in vertebrate digestion. Their catabolism by bacteria is an important component of the carbon cycle, contributes to gut ecology, and has potential commercial applications. We found that Rhodococcus jostii RHA1 grows well on cholate, as well as on its conjugates, taurocholate and glycocholate. The transcriptome of RHA1 growing on cholate revealed 39 genes upregulated on cholate, occurring in a single gene cluster. Reverse transcriptase quantitative PCR confirmed that selected genes in the cluster were upregulated 10-fold on cholate versus on cholesterol. One of these genes, kshA3, encoding a putative 3-ketosteroid-9α-hydroxylase, was deleted and found essential for growth on cholate. Two coenzyme A (CoA) synthetases encoded in the cluster, CasG and CasI, were heterologously expressed. CasG was shown to transform cholate to cholyl-CoA, thus initiating side chain degradation. CasI was shown to form CoA derivatives of steroids with isopropanoyl side chains, likely occurring as degradation intermediates. Orthologous gene clusters were identified in all available Rhodococcus genomes, as well as that of Thermomonospora curvata. Moreover, Rhodococcus equi 103S, Rhodococcus ruber Chol-4 and Rhodococcus erythropolis SQ1 each grew on cholate. In contrast, several mycolic acid bacteria lacking the gene cluster were unable to grow on cholate. Our results demonstrate that the above-mentioned gene cluster encodes cholate catabolism and is distinct from a more widely occurring gene cluster encoding cholesterol catabolism. PMID:23024343

  18. Sequence Variants and Haplotype Analysis of Cat ERBB2 Gene: A Survey on Spontaneous Cat Mammary Neoplastic and Non-Neoplastic Lesions

    PubMed Central

    Santos, Sara; Bastos, Estela; Baptista, Cláudia S.; Sá, Daniela; Caloustian, Christophe; Guedes-Pinto, Henrique; Gärtner, Fátima; Gut, Ivo G.; Chaves, Raquel

    2012-01-01

    The human ERBB2 proto-oncogene is widely considered a key gene involved in human breast cancer onset and progression. Among spontaneous tumors, mammary tumors are the most frequent cause of cancer death in cats and second most frequent in humans. In fact, naturally occurring tumors in domestic animals, more particularly cat mammary tumors, have been proposed as a good model for human breast cancer, but critical genetic and molecular information is still scarce. The aims of this study include the analysis of the cat ERBB2 gene partial sequences (between exon 17 and 20) in order to characterize a normal and a mammary lesion heterogeneous populations. Cat genomic DNA was extracted from normal frozen samples (n = 16) and from frozen and formalin-fixed paraffin-embedded mammary lesion samples (n = 41). We amplified and sequenced two cat ERBB2 DNA fragments comprising exons 17 to 20. It was possible to identify five sequence variants and six haplotypes in the total population. Two sequence variants and two haplotypes show to be specific for cat mammary tumor samples. Bioinformatics analysis predicts that four of the sequence variants can produce alternative transcripts or activate cryptic splicing sites. Also, a possible association was identified between clinicopathological traits and the variant haplotypes. As far as we know, this is the first attempt to examine ERBB2 genetic variations in cat mammary genome and its possible association with the onset and progression of cat mammary tumors. The demonstration of a possible association between primary tumor size (one of the two most important prognostic factors) and the number of masses with the cat ERBB2 variant haplotypes reveal the importance of the analysis of this gene in veterinary medicine. PMID:22489125

  19. More powerful haplotype sharing by accounting for the mode of inheritance.

    PubMed

    Ziegler, Andreas; Ewhida, Adel; Brendel, Michael; Kleensang, André

    2009-04-01

    The concept of haplotype sharing (HS) has received considerable attention recently, and several haplotype association methods have been proposed. Here, we extend the work of Beckmann and colleagues [2005 Hum. Hered. 59:67-78] who derived an HS statistic (BHS) as special case of Mantel's space-time clustering approach. The Mantel-type HS statistic correlates genetic similarity with phenotypic similarity across pairs of individuals. While phenotypic similarity is measured as the mean-corrected cross product of phenotypes, we propose to incorporate information of the underlying genetic model in the measurement of the genetic similarity. Specifically, for the recessive and dominant modes of inheritance we suggest the use of the minimum and maximum of shared length of haplotypes around a marker locus for pairs of individuals. If the underlying genetic model is unknown, we propose a model-free HS Mantel statistic using the max-test approach. We compare our novel HS statistics to BHS using simulated case-control data and illustrate its use by re-analyzing data from a candidate region of chromosome 18q from the Rheumatoid Arthritis (RA) Consortium. We demonstrate that our approach is point-wise valid and superior to BHS. In the re-analysis of the RA data, we identified three regions with point-wise P-values<0.005 containing six known genes (PMIP1, MC4R, PIGN, KIAA1468, TNFRSF11A and ZCCHC2) which might be worth follow-up.

  20. Comparative sequence analysis of the potato cyst nematode resistance locus H1 reveals a major lack of co-linearity between three haplotypes in potato (Solanum tuberosum ssp.)

    PubMed Central

    Bakker, Erin; de Boer, Jan; van der Vossen, Edwin; Achenbach, Ute; Golas, Tomasz; Suryaningrat, Suwardi; Smant, Geert; Bakker, Jaap; Goverse, Aska

    2010-01-01

    The H1 locus confers resistance to the potato cyst nematode Globodera rostochiensis pathotypes 1 and 4. It is positioned at the distal end of chromosome V of the diploid Solanum tuberosum genotype SH83-92-488 (SH) on an introgression segment derived from S. tuberosum ssp. andigena. Markers from a high-resolution genetic map of the H1 locus (Bakker et al. in Theor Appl Genet 109:146–152, 2004) were used to screen a BAC library to construct a physical map covering a 341-kb region of the resistant haplotype coming from SH. For comparison, physical maps were also generated of the two haplotypes from the diploid susceptible genotype RH89-039-16 (S. tuberosum ssp. tuberosum/S. phureja), spanning syntenic regions of 700 and 319 kb. Gene predictions on the genomic segments resulted in the identification of a large cluster consisting of variable numbers of the CC-NB-LRR type of R genes for each haplotype. Furthermore, the regions were interspersed with numerous transposable elements and genes coding for an extensin-like protein and an amino acid transporter. Comparative analysis revealed a major lack of gene order conservation in the sequences of the three closely related haplotypes. Our data provide insight in the evolutionary mechanisms shaping the H1 locus and will facilitate the map-based cloning of the H1 resistance gene. Electronic supplementary material The online version of this article (doi:10.1007/s00122-010-1472-9) contains supplementary material, which is available to authorized users. PMID:21049265

  1. A comprehensively molecular haplotype-resolved genome of a European individual

    PubMed Central

    Suk, Eun-Kyung; McEwen, Gayle K.; Duitama, Jorge; Nowick, Katja; Schulz, Sabrina; Palczewski, Stefanie; Schreiber, Stefan; Holloway, Dustin T.; McLaughlin, Stephen; Peckham, Heather; Lee, Clarence; Huebsch, Thomas; Hoehe, Margret R.

    2011-01-01

    Independent determination of both haplotype sequences of an individual genome is essential to relate genetic variation to genome function, phenotype, and disease. To address the importance of phase, we have generated the most complete haplotype-resolved genome to date, “Max Planck One” (MP1), by fosmid pool-based next generation sequencing. Virtually all SNPs (>99%) and 80,000 indels were phased into haploid sequences of up to 6.3 Mb (N50 ∼1 Mb). The completeness of phasing allowed determination of the concrete molecular haplotype pairs for the vast majority of genes (81%) including potential regulatory sequences, of which >90% were found to be constituted by two different molecular forms. A subset of 159 genes with potentially severe mutations in either cis or trans configurations exemplified in particular the role of phase for gene function, disease, and clinical interpretation of personal genomes (e.g., BRCA1). Extended genomic regions harboring manifold combinations of physically and/or functionally related genes and regulatory elements were resolved into their underlying “haploid landscapes,” which may define the functional genome. Moreover, the majority of genes and functional sequences were found to contain individual or rare SNPs, which cannot be phased from population data alone, emphasizing the importance of molecular phasing for characterizing a genome in its molecular individuality. Our work provides the foundation to understand that the distinction of molecular haplotypes is essential to resolve the (inherently individual) biology of genes, genomes, and disease, establishing a reference point for “phase-sensitive” personal genomics. MP1's annotated haploid genomes are available as a public resource. PMID:21813624

  2. Association Between Chloroplast DNA and Mitochondrial DNA Haplotypes in Prunus spinosa L. (Rosaceae) Populations across Europe

    PubMed Central

    MOHANTY, APARAJITA; MARTÍN, JUAN PEDRO; GONZÁLEZ, LUIS MIGUEL; AGUINAGALDE, ITZIAR

    2003-01-01

    Chloroplast DNA (cpDNA) and mitochondrial DNA (mtDNA) were studied in 24 populations of Prunus spinosa sampled across Europe. The cpDNA and mtDNA fragments were amplified using universal primers and subsequently digested with restriction enzymes to obtain the polymorphisms. Combinations of all the polymorphisms resulted in 33 cpDNA haplotypes and two mtDNA haplotypes. Strict association between the cpDNA haplotypes and the mtDNA haplotypes was detected in most cases, indicating conjoint inheritance of the two genomes. The most frequent and abundant cpDNA haplotype (C20; frequency, 51 %) is always associated with the more frequent and abundant mtDNA haplotype (M1; frequency, 84 %). All but two of the cpDNA haplotypes associated with the less frequent mtDNA haplotype (M2) are private haplotypes. These private haplotypes are phylogenetically related but geographically unrelated. They form a separate cluster on the minimum‐length spanning tree. PMID:14534199

  3. Clustering Algorithms: Their Application to Gene Expression Data

    PubMed Central

    Oyelade, Jelili; Isewon, Itunuoluwa; Oladipupo, Funke; Aromolaran, Olufemi; Uwoghiren, Efosa; Ameh, Faridah; Achas, Moses; Adebiyi, Ezekiel

    2016-01-01

    Gene expression data hide vital information required to understand the biological process that takes place in a particular organism in relation to its environment. Deciphering the hidden patterns in gene expression data proffers a prodigious preference to strengthen the understanding of functional genomics. The complexity of biological networks and the volume of genes present increase the challenges of comprehending and interpretation of the resulting mass of data, which consists of millions of measurements; these data also inhibit vagueness, imprecision, and noise. Therefore, the use of clustering techniques is a first step toward addressing these challenges, which is essential in the data mining process to reveal natural structures and identify interesting patterns in the underlying data. The clustering of gene expression data has been proven to be useful in making known the natural structure inherent in gene expression data, understanding gene functions, cellular processes, and subtypes of cells, mining useful information from noisy data, and understanding gene regulation. The other benefit of clustering gene expression data is the identification of homology, which is very important in vaccine design. This review examines the various clustering algorithms applicable to the gene expression data in order to discover and provide useful knowledge of the appropriate clustering technique that will guarantee stability and high degree of accuracy in its analysis procedure. PMID:27932867

  4. Mutation Analysis in Classical Phenylketonuria Patients Followed by Detecting Haplotypes Linked to Some PAH Mutations.

    PubMed

    Dehghanian, Fatemeh; Silawi, Mohammad; Tabei, Seyed M B

    2017-02-01

    Deficiency of phenylalanine hydroxylase (PAH) enzyme and elevation of phenylalanine in body fluids cause phenylketonuria (PKU). The gold standard for confirming PKU and PAH deficiency is detecting causal mutations by direct sequencing of the coding exons and splicing involved sequences of the PAH gene. Furthermore, haplotype analysis could be considered as an auxiliary approach for detecting PKU causative mutations before direct sequencing of the PAH gene by making comparisons between prior detected mutation linked-haplotypes and new PKU case haplotypes with undetermined mutations. In this study, 13 unrelated classical PKU patients took part in the study detecting causative mutations. Mutations were identified by polymerase chain reaction (PCR) and direct sequencing in all patients. After that, haplotype analysis was performed by studying VNTR and PAHSTR markers (linked genetic markers of the PAH gene) through application of PCR and capillary electrophoresis (CE). Mutation analysis was performed successfully and the detected mutations were as follows: c.782G>A, c.754C>T, c.842C>G, c.113-115delTCT, c.688G>A, and c.696A>G. Additionally, PAHSTR/VNTR haplotypes were detected to discover haplotypes linked to each mutation. Mutation detection is the best approach for confirming PAH enzyme deficiency in PKU patients. Due to the relatively large size of the PAH gene and high cost of the direct sequencing in developing countries, haplotype analysis could be used before DNA sequencing and mutation detection for a faster and cheaper way via identifying probable mutated exons.

  5. APC Yin-Yang haplotype associated with colorectal cancer risk

    PubMed Central

    GARRE, P.; DE LA HOYA, M.; INIESTA, P.; ROMERA, A.; LLOVET, P.; GONZALEZ, S.; PEREZ-SEGURA, P.; CAPELLA, G.; DIAZ-RUBIO, E.; CALDES, T.

    2010-01-01

    The Yin-Yang haplotype is defined as two mismatched haplotypes (Yin and Yang) representing the majority of the existing haplotypes in a particular genomic region. The human adenomatous polyposis coli (APC) gene shows a Yin-Yang haplotype pattern accounting for 84% of all of the haplotypes existing in the Spanish population. Several association studies have been published regarding APC gene variants (SNPs and haplotypes) and colorectal cancer (CRC) risk. However, no studies concerning diplotype structure and CRC risk have been conducted. The aim of the present study was to investigate whether the APC Yin-Yang homozygote diplotype is over-represented in patients with sporadic CRC when compared to its distribution in controls, and its association with CRC risk. TaqMan® assays were used to genotype three tagSNPs selected across the APC Yin-Yang region. Frequencies of the APC Yin-Yang tagSNP alleles, haplotype and diplotype of 378 CRC cases and 642 controls were compared. Two Spanish CRC group samples were included [Hospital Clínico San Carlos in Madrid (HCSC) and Instituto Catalán de Oncología in Barcelona (ICO)]. Analysis of 157 consecutive CRC patients and 405 control subjects from HCSC showed a significative effect for the risk of CRC (OR=1.93; 95% CI 1.32–2.81; P=0.001). However, this effect was not confirmed in 221 CRC patients and 237 control subjects from ICO (OR=0.89; 95% CI 0.61–1.28; P=0.521). We found a significant association between the APC homozygote Yin-Yang diplotype and the risk of colorectal cancer in the HCSC samples. However, we did not observe this association in the ICO samples. These observations suggest that a study with a larger Spanish cohort is necessary to confirm the effects of the APC Yin-Yang diplotype on the risk of CRC. PMID:22993613

  6. APC Yin-Yang haplotype associated with colorectal cancer risk.

    PubMed

    Garre, P; DE LA Hoya, M; Iniesta, P; Romera, A; Llovet, P; Gonzalez, S; Perez-Segura, P; Capella, G; Diaz-Rubio, E; Caldes, T

    2010-09-01

    The Yin-Yang haplotype is defined as two mismatched haplotypes (Yin and Yang) representing the majority of the existing haplotypes in a particular genomic region. The human adenomatous polyposis coli (APC) gene shows a Yin-Yang haplotype pattern accounting for 84% of all of the haplotypes existing in the Spanish population. Several association studies have been published regarding APC gene variants (SNPs and haplotypes) and colorectal cancer (CRC) risk. However, no studies concerning diplotype structure and CRC risk have been conducted. The aim of the present study was to investigate whether the APC Yin-Yang homozygote diplotype is over-represented in patients with sporadic CRC when compared to its distribution in controls, and its association with CRC risk. TaqMan(®) assays were used to genotype three tagSNPs selected across the APC Yin-Yang region. Frequencies of the APC Yin-Yang tagSNP alleles, haplotype and diplotype of 378 CRC cases and 642 controls were compared. Two Spanish CRC group samples were included [Hospital Clínico San Carlos in Madrid (HCSC) and Instituto Catalán de Oncología in Barcelona (ICO)]. Analysis of 157 consecutive CRC patients and 405 control subjects from HCSC showed a significative effect for the risk of CRC (OR=1.93; 95% CI 1.32-2.81; P=0.001). However, this effect was not confirmed in 221 CRC patients and 237 control subjects from ICO (OR=0.89; 95% CI 0.61-1.28; P=0.521). We found a significant association between the APC homozygote Yin-Yang diplotype and the risk of colorectal cancer in the HCSC samples. However, we did not observe this association in the ICO samples. These observations suggest that a study with a larger Spanish cohort is necessary to confirm the effects of the APC Yin-Yang diplotype on the risk of CRC.

  7. CORM: An R Package Implementing the Clustering of Regression Models Method for Gene Clustering

    PubMed Central

    Shi, Jiejun; Qin, Li-Xuan

    2014-01-01

    We report a new R package implementing the clustering of regression models (CORM) method for clustering genes using gene expression data and provide data examples illustrating each clustering function in the package. The CORM package is freely available at CRAN from http://cran.r-project.org. PMID:25452684

  8. A parsimonious tree-grow method for haplotype inference.

    PubMed

    Li, Zhenping; Zhou, Wenfeng; Zhang, Xiang-Sun; Chen, Luonan

    2005-09-01

    Haplotype information has become increasingly important in analyzing fine-scale molecular genetics data, such as disease genes mapping and drug design. Parsimony haplotyping is one of haplotyping problems belonging to NP-hard class. In this paper, we aim to develop a novel algorithm for the haplotype inference problem with the parsimony criterion, based on a parsimonious tree-grow method (PTG). PTG is a heuristic algorithm that can find the minimum number of distinct haplotypes based on the criterion of keeping all genotypes resolved during tree-grow process. In addition, a block-partitioning method is also proposed to improve the computational efficiency. We show that the proposed approach is not only effective with a high accuracy, but also very efficient with the computational complexity in the order of O(m2n) time for n single nucleotide polymorphism sites in m individual genotypes. The software is available upon request from the authors, or from http://zhangroup.aporc.org/bioinfo/ptg/ chen@elec.osaka-sandai.ac.jp Supporting materials is available from http://zhangroup.aporc.org/bioinfo/ptg/bti572supplementary.pdf

  9. In Vivo Characterization of Human APOA5 Haplotypes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ahituv, Nadav; Akiyama, Jennifer; Chapman-Helleboid, Audrey

    2006-10-01

    Increased plasma triglycerides concentrations are an independent risk factor for cardiovascular disease. Numerous studies support a reproducible genetic association between two minor haplotypes in the human apolipoprotein A5 gene (APOA5) and increased plasma triglyceride concentrations. We thus sought to investigate the effect of these minor haplotypes (APOA5*2 and APOA5*3) on ApoAV plasma levels through the precise insertion of single-copy intact APOA5 haplotypes at a targeted location in the mouse genome. While we found no difference in the amount of human plasma ApoAV in mice containing the common APOA5*1 and minor APOA5*2 haplotype, the introduction of the single APOA5*3 defining allelemore » (19W) resulted in 3-fold lower ApoAV plasma levels consistent with existing genetic association studies. These results indicate that S19W polymorphism is likely to be functional and explain the strong association of this variant with plasma triglycerides supporting the value of sensitive in vivo assays to define the functional nature of human haplotypes.« less

  10. A susceptible haplotype within APOE gene influences BMD and intensifies the osteoporosis risk in postmenopausal women of Northwest India.

    PubMed

    Singh, Monica; Singh, Puneetpal; Singh, Surinder; Juneja, Pawan Kumar; Kaur, Taranpal

    2010-11-01

    The association of apolipoprotein E (APOE) genotypes with bone mineral density (BMD) and risk of osteoporosis have remained unclear. The influence of APOE gene polymorphisms on BMD as genetic mediators of osteoporosis risk needs to be explored in Indian postmenopausal females where this disease is rising rampantly. The present study investigated the role and relevance of four pertinent APOE single nucleotide polymorphisms: 5'UTR G/C (rs440446), Int2 G/A (rs769450), Exon4 T/C (rs429358), Exon4C/T (rs7412) in DEXA verified 133 osteoporotic, 57 osteopenic and 83 normal postmenopausal females of India, who were not taking hormone replacement therapy. Minor allele frequencies of rs440446 and rs429358 were higher in osteoporotic females (0.31, 0.18) than osteopenic (0.29, 0.15) and females having normal bone mass (0.16, 0.07). Disease association analysis revealed a susceptibility haplotype CGTC (in order of rs440446, rs769450, rs429358, rs7412) and the carriers of this haplotype has higher risk of osteopenia (OR 3.53, 95% CI 1.21-11.0, P=0.017) and osteoporosis (OR 3.61, 95% CI 1.53-9.48, P=0.002) after adjusting the confounding effect of age, BMI and years since menopause. Females who possess either one copy or two copies of the haplotype have lesser BMD values of lumbar spine (0.88 and 0.85 g/cm(2)) and femoral neck (0.84 and 0.82 g/cm(2)) than those females who possess zero copy (0.9 and 0.87 g/cm(2), respectively). The present study exposed a susceptibility haplotype CGTC, within APOE gene, which was found to be associated with BMD and risk of osteopenia and osteoporosis in postmenopausal females of India. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.

  11. Geographic distribution of haplotype diversity at the bovine casein locus

    PubMed Central

    Jann, Oliver C; Ibeagha-Awemu, Eveline M; Özbeyaz, Ceyhan; Zaragoza, Pilar; Williams, John L; Ajmone-Marsan, Paolo; Lenstra, Johannes A; Moazami-Goudarzi, Katy; Erhardt, Georg

    2004-01-01

    The genetic diversity of the casein locus in cattle was studied on the basis of haplotype analysis. Consideration of recently described genetic variants of the casein genes which to date have not been the subject of diversity studies, allowed the identification of new haplotypes. Genotyping of 30 cattle breeds from four continents revealed a geographically associated distribution of haplotypes, mainly defined by frequencies of alleles at CSN1S1 and CSN3. The genetic diversity within taurine breeds in Europe was found to decrease significantly from the south to the north and from the east to the west. Such geographic patterns of cattle genetic variation at the casein locus may be a result of the domestication process of modern cattle as well as geographically differentiated natural or artificial selection. The comparison of African Bos taurus and Bos indicus breeds allowed the identification of several Bos indicus specific haplotypes (CSN1S1*C-CSN2*A2-CSN3*AI/CSN3*H) that are not found in pure taurine breeds. The occurrence of such haplotypes in southern European breeds also suggests that an introgression of indicine genes into taurine breeds could have contributed to the distribution of the genetic variation observed. PMID:15040901

  12. Analysis of temporal gene expression profiles: clustering by simulated annealing and determining the optimal number of clusters.

    PubMed

    Lukashin, A V; Fuchs, R

    2001-05-01

    Cluster analysis of genome-wide expression data from DNA microarray hybridization studies has proved to be a useful tool for identifying biologically relevant groupings of genes and samples. In the present paper, we focus on several important issues related to clustering algorithms that have not yet been fully studied. We describe a simple and robust algorithm for the clustering of temporal gene expression profiles that is based on the simulated annealing procedure. In general, this algorithm guarantees to eventually find the globally optimal distribution of genes over clusters. We introduce an iterative scheme that serves to evaluate quantitatively the optimal number of clusters for each specific data set. The scheme is based on standard approaches used in regular statistical tests. The basic idea is to organize the search of the optimal number of clusters simultaneously with the optimization of the distribution of genes over clusters. The efficiency of the proposed algorithm has been evaluated by means of a reverse engineering experiment, that is, a situation in which the correct distribution of genes over clusters is known a priori. The employment of this statistically rigorous test has shown that our algorithm places greater than 90% genes into correct clusters. Finally, the algorithm has been tested on real gene expression data (expression changes during yeast cell cycle) for which the fundamental patterns of gene expression and the assignment of genes to clusters are well understood from numerous previous studies.

  13. Descent graphs in pedigree analysis: applications to haplotyping, location scores, and marker-sharing statistics.

    PubMed Central

    Sobel, E.; Lange, K.

    1996-01-01

    The introduction of stochastic methods in pedigree analysis has enabled geneticists to tackle computations intractable by standard deterministic methods. Until now these stochastic techniques have worked by running a Markov chain on the set of genetic descent states of a pedigree. Each descent state specifies the paths of gene flow in the pedigree and the founder alleles dropped down each path. The current paper follows up on a suggestion by Elizabeth Thompson that genetic descent graphs offer a more appropriate space for executing a Markov chain. A descent graph specifies the paths of gene flow but not the particular founder alleles traveling down the paths. This paper explores algorithms for implementing Thompson's suggestion for codominant markers in the context of automatic haplotyping, estimating location scores, and computing gene-clustering statistics for robust linkage analysis. Realistic numerical examples demonstrate the feasibility of the algorithms. PMID:8651310

  14. Ancient mitochondrial haplotypes and evidence for intragenic recombination in a gynodioecious plant.

    PubMed

    Städler, Thomas; Delph, Lynda F

    2002-09-03

    Because of their extremely low nucleotide mutation rates, plant mitochondrial genes are generally not expected to show variation within species. Remarkably, we found nine distinct cytochrome b sequence haplotypes in the gynodioecious alpine plant Silene acaulis, with two or more haplotypes coexisting locally in each of three sampled regions. Moreover, there is evidence for intragenic recombination in the history of the haplotype sample, implying at least transient heteroplasmy of mitochondrial DNA (mtDNA). Heteroplasmy might be achieved by one of two potential mechanisms, either continuous coexistence of subgenomic fragments in low stoichiometry, or occasional paternal leakage of mtDNA. On the basis of levels of synonymous nucleotide substitutions, the average divergence time between haplotypes is estimated to be at least 15 million years. Ancient coalescence of extant haplotypes is further indicated by the paucity of fixed differences in haplotypes obtained from related species, a pattern expected under trans-specific evolution. Our data are consistent with models of frequency-dependent selection on linked cytoplasmic male-sterility factors, the putative molecular basis of females in gynodioecious populations. However, associations between marker loci and the inferred male-sterility genes can be maintained only with very low rates of recombination. Heteroplasmy and recombination between divergent haplotypes imply unexplored consequences for the evolutionary dynamics of gynodioecy, a widespread plant breeding system.

  15. No association between polymorphisms and haplotypes of COL1A1 and COL1A2 genes and osteoporotic fracture in postmenopausal Chinese women

    PubMed Central

    Hu, Wei-wei; He, Jin-wei; Zhang, Hao; Wang, Chun; Gu, Jie-mei; Yue, Hua; Ke, Yao-hua; Hu, Yun-qiu; Fu, Wen-zhen; Li, Miao; Liu, Yu-juan; Zhang, Zhen-lin

    2011-01-01

    Aim: To study whether genetic polymorphisms of COL1A1 and COL1A2 genes affected the onset of fracture in postmenopausal Chinese women. Methods: SNPs in COL1A1 and COL1A2 genes were identified via direct sequencing in 32 unrelated postmenopausal Chinese women. Ten SNPs were genotyped in 1252 postmenopausal Chinese women. The associations were examined using both single-SNP and haplotype tests using logistic regression. Results: Twenty four (4 novel) and 28 (7 novel) SNPs were identified in COL1A1 and COL1A2 gene, respectively. The distribution frequencies of 2 SNPs in COL1A1 (rs2075554 and rs2586494) and 3 SNPs in COL1A2 (rs42517, rs1801182, and rs42524) were significantly different from those documented for the European Caucasian population. No significant difference was observed between fracture and control groups with respect to allele frequency or genotype distribution in 9 selected SNPs and haplotype. No significant association was found between fragility fracture and each SNP or haplotype. The results remained the same after additional corrections for other risk factors such as weight, height, and bone mineral density. Conclusion: Our results show no association between common genetic variations of COL1A1 and COL1A2 genes and fracture, suggesting the complex genetic background of osteoporotic fractures. PMID:21602843

  16. Fast gene ontology based clustering for microarray experiments.

    PubMed

    Ovaska, Kristian; Laakso, Marko; Hautaniemi, Sampsa

    2008-11-21

    Analysis of a microarray experiment often results in a list of hundreds of disease-associated genes. In order to suggest common biological processes and functions for these genes, Gene Ontology annotations with statistical testing are widely used. However, these analyses can produce a very large number of significantly altered biological processes. Thus, it is often challenging to interpret GO results and identify novel testable biological hypotheses. We present fast software for advanced gene annotation using semantic similarity for Gene Ontology terms combined with clustering and heat map visualisation. The methodology allows rapid identification of genes sharing the same Gene Ontology cluster. Our R based semantic similarity open-source package has a speed advantage of over 2000-fold compared to existing implementations. From the resulting hierarchical clustering dendrogram genes sharing a GO term can be identified, and their differences in the gene expression patterns can be seen from the heat map. These methods facilitate advanced annotation of genes resulting from data analysis.

  17. PWHATSHAP: efficient haplotyping for future generation sequencing.

    PubMed

    Bracciali, Andrea; Aldinucci, Marco; Patterson, Murray; Marschall, Tobias; Pisanti, Nadia; Merelli, Ivan; Torquati, Massimo

    2016-09-22

    Haplotype phasing is an important problem in the analysis of genomics information. Given a set of DNA fragments of an individual, it consists of determining which one of the possible alleles (alternative forms of a gene) each fragment comes from. Haplotype information is relevant to gene regulation, epigenetics, genome-wide association studies, evolutionary and population studies, and the study of mutations. Haplotyping is currently addressed as an optimisation problem aiming at solutions that minimise, for instance, error correction costs, where costs are a measure of the confidence in the accuracy of the information acquired from DNA sequencing. Solutions have typically an exponential computational complexity. WHATSHAP is a recent optimal approach which moves computational complexity from DNA fragment length to fragment overlap, i.e., coverage, and is hence of particular interest when considering sequencing technology's current trends that are producing longer fragments. Given the potential relevance of efficient haplotyping in several analysis pipelines, we have designed and engineered PWHATSHAP, a parallel, high-performance version of WHATSHAP. PWHATSHAP is embedded in a toolkit developed in Python and supports genomics datasets in standard file formats. Building on WHATSHAP, PWHATSHAP exhibits the same complexity exploring a number of possible solutions which is exponential in the coverage of the dataset. The parallel implementation on multi-core architectures allows for a relevant reduction of the execution time for haplotyping, while the provided results enjoy the same high accuracy as that provided by WHATSHAP, which increases with coverage. Due to its structure and management of the large datasets, the parallelisation of WHATSHAP posed demanding technical challenges, which have been addressed exploiting a high-level parallel programming framework. The result, PWHATSHAP, is a freely available toolkit that improves the efficiency of the analysis of genomics

  18. Can polymorphisms in the fatty acid desaturase (FADS) gene cluster alter the effects of fish oil supplementation on plasma and erythrocyte fatty acid profiles? An exploratory study.

    PubMed

    Meldrum, Suzanne J; Li, Yuchun; Zhang, Guicheng; Heaton, Alexandra E M; D'Vaz, Nina; Manz, Judith; Reischl, Eva; Koletzko, Berthold V; Prescott, Susan L; Simmer, Karen

    2017-09-19

    The enzymes encoded by fatty acid desaturases (FADS) genes determine the desaturation of long-chain polyunsaturated fatty acids (LCPUFA). We investigated if haplotype and single nucleotide polymorphisms (SNPs) in FADS gene cluster can influence LCPUFA status in infants who received either fish oil or placebo supplementation. Children enrolled in the Infant Fish Oil Supplementation Study (IFOS) were randomly allocated to receive either fish oil or placebo from birth to 6 months of age. Blood was collected at 6 months of age for the measurement of fatty acids and for DNA extraction. A total of 276 participant DNA samples underwent genotyping, and 126 erythrocyte and 133 plasma fatty acid measurements were available for analysis. Twenty-two FADS SNPs were selected on the basis of literature and linkage disequilibrium patterns identified from the HapMap data. Haplotype construction was completed using PHASE. For participants allocated to the fish oil group who had two copies of the FADS1 haplotype consisting of SNP minor alleles, DHA levels were significantly higher compared to other haplotypes. This finding was not observed for the placebo group. Furthermore, for members of the fish oil group only, the minor homozygous carriers of all the FADS1 SNPs investigated had significantly higher DHA than other genotypes (rs174545, rs174546, rs174548, rs174553, rs174556, rs174537, rs174448, and rs174455). Overall results of this preliminary study suggest that supplementation with fish oil may only significantly increase DHA in minor allele carriers of FADS1 SNPs. Further research is required to confirm this novel finding.

  19. Comparative genomics of ParaHox clusters of teleost fishes: gene cluster breakup and the retention of gene sets following whole genome duplications

    PubMed Central

    Siegel, Nicol; Hoegg, Simone; Salzburger, Walter; Braasch, Ingo; Meyer, Axel

    2007-01-01

    Background The evolutionary lineage leading to the teleost fish underwent a whole genome duplication termed FSGD or 3R in addition to two prior genome duplications that took place earlier during vertebrate evolution (termed 1R and 2R). Resulting from the FSGD, additional copies of genes are present in fish, compared to tetrapods whose lineage did not experience the 3R genome duplication. Interestingly, we find that ParaHox genes do not differ in number in extant teleost fishes despite their additional genome duplication from the genomic situation in mammals, but they are distributed over twice as many paralogous regions in fish genomes. Results We determined the DNA sequence of the entire ParaHox C1 paralogon in the East African cichlid fish Astatotilapia burtoni, and compared it to orthologous regions in other vertebrate genomes as well as to the paralogous vertebrate ParaHox D paralogons. Evolutionary relationships among genes from these four chromosomal regions were studied with several phylogenetic algorithms. We provide evidence that the genes of the ParaHox C paralogous cluster are duplicated in teleosts, just as it had been shown previously for the D paralogon genes. Overall, however, synteny and cluster integrity seems to be less conserved in ParaHox gene clusters than in Hox gene clusters. Comparative analyses of non-coding sequences uncovered conserved, possibly co-regulatory elements, which are likely to contain promoter motives of the genes belonging to the ParaHox paralogons. Conclusion There seems to be strong stabilizing selection for gene order as well as gene orientation in the ParaHox C paralogon, since with a few exceptions, only the lengths of the introns and intergenic regions differ between the distantly related species examined. The high degree of evolutionary conservation of this gene cluster's architecture in particular – but possibly clusters of genes more generally – might be linked to the presence of promoter, enhancer or inhibitor

  20. Genetic analysis of autoimmune regulator haplotypes in alopecia areata.

    PubMed

    Wengraf, D A; McDonagh, A J G; Lovewell, T R J; Vasilopoulos, Y; Macdonald-Hull, S P; Cork, M J; Messenger, A G; Tazi-Ahnini, R

    2008-03-01

    Alopecia areata is an immune-mediated disorder, occurring with the highest observed frequency in the rare recessive autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED) syndrome caused by mutations of the autoimmune regulator (AIRE) gene on chromosome 21q22.3. We have previously detected association between alopecia areata and a single nucleotide polymorphism (SNP) in the AIRE gene in patients without APECED, and we now report the findings of an extended examination of the association of alopecia areata with haplotype analysis including six SNPs in the AIRE gene: C-103T, C4144G, T5238C, G6528A, T7215C and T11787C. In Caucasian groups of 295 patients and 363 controls, we found strong association between the AIRE 7215C allele and AA [P = 3.8 x 10(-8), OR (95% CI): 2.69 (1.8-4.0)]. The previously reported association between AA and the AIRE 4144G allele was no longer significant on correction for multiple testing. The AIRE haplotypes CCTGCT and CGTGCC showed a highly significant association with AA [P = 6.05 x 10(-6), 9.47 (2.91-30.8) and P = 0.001, 3.51 (1.55-7.95), respectively]. To select the haplotypes most informative for analysis, we tagged the polymorphisms using SNPTag software. Employing AIRE C-103T, G6528A, T7215C and T11787C as tag SNPs, two haplotypes were associated with AA; AIRE CGCT and AIRE CGCC [P = 3.84 x 10(-7), 11.40 (3.53-36.9) and P = 3.94 x 10(-4), 2.13 (1.39-3.24) respectively]. The AIRE risk haplotypes identified in this study potentially account for a major component of the genetic risk of developing alopecia areata.

  1. Fractal Clustering and Knowledge-driven Validation Assessment for Gene Expression Profiling.

    PubMed

    Wang, Lu-Yong; Balasubramanian, Ammaiappan; Chakraborty, Amit; Comaniciu, Dorin

    2005-01-01

    DNA microarray experiments generate a substantial amount of information about the global gene expression. Gene expression profiles can be represented as points in multi-dimensional space. It is essential to identify relevant groups of genes in biomedical research. Clustering is helpful in pattern recognition in gene expression profiles. A number of clustering techniques have been introduced. However, these traditional methods mainly utilize shape-based assumption or some distance metric to cluster the points in multi-dimension linear Euclidean space. Their results shows poor consistence with the functional annotation of genes in previous validation study. From a novel different perspective, we propose fractal clustering method to cluster genes using intrinsic (fractal) dimension from modern geometry. This method clusters points in such a way that points in the same clusters are more self-affine among themselves than to the points in other clusters. We assess this method using annotation-based validation assessment for gene clusters. It shows that this method is superior in identifying functional related gene groups than other traditional methods.

  2. Hox gene clusters in the Indonesian coelacanth, Latimeria menadoensis

    PubMed Central

    Koh, Esther G. L.; Lam, Kevin; Christoffels, Alan; Erdmann, Mark V.; Brenner, Sydney; Venkatesh, Byrappa

    2003-01-01

    The Hox genes encode transcription factors that play a key role in specifying body plans of metazoans. They are organized into clusters that contain up to 13 paralogue group members. The complex morphology of vertebrates has been attributed to the duplication of Hox clusters during vertebrate evolution. In contrast to the single Hox cluster in the amphioxus (Branchiostoma floridae), an invertebrate-chordate, mammals have four clusters containing 39 Hox genes. Ray-finned fishes (Actinopterygii) such as zebrafish and fugu possess more than four Hox clusters. The coelacanth occupies a basal phylogenetic position among lobe-finned fishes (Sarcopterygii), which gave rise to the tetrapod lineage. The lobe fins of sarcopterygians are considered to be the evolutionary precursors of tetrapod limbs. Thus, the characterization of Hox genes in the coelacanth should provide insights into the origin of tetrapod limbs. We have cloned the complete second exon of 33 Hox genes from the Indonesian coelacanth, Latimeria menadoensis, by extensive PCR survey and genome walking. Phylogenetic analysis shows that 32 of these genes have orthologs in the four mammalian HOX clusters, including three genes (HoxA6, D1, and D8) that are absent in ray-finned fishes. The remaining coelacanth gene is an ortholog of hoxc1 found in zebrafish but absent in mammals. Our results suggest that coelacanths have four Hox clusters bearing a gene complement more similar to mammals than to ray-finned fishes, but with an additional gene, HoxC1, which has been lost during the evolution of mammals from lobe-finned fishes. PMID:12547909

  3. Hox gene clusters in the Indonesian coelacanth, Latimeria menadoensis.

    PubMed

    Koh, Esther G L; Lam, Kevin; Christoffels, Alan; Erdmann, Mark V; Brenner, Sydney; Venkatesh, Byrappa

    2003-02-04

    The Hox genes encode transcription factors that play a key role in specifying body plans of metazoans. They are organized into clusters that contain up to 13 paralogue group members. The complex morphology of vertebrates has been attributed to the duplication of Hox clusters during vertebrate evolution. In contrast to the single Hox cluster in the amphioxus (Branchiostoma floridae), an invertebrate-chordate, mammals have four clusters containing 39 Hox genes. Ray-finned fishes (Actinopterygii) such as zebrafish and fugu possess more than four Hox clusters. The coelacanth occupies a basal phylogenetic position among lobe-finned fishes (Sarcopterygii), which gave rise to the tetrapod lineage. The lobe fins of sarcopterygians are considered to be the evolutionary precursors of tetrapod limbs. Thus, the characterization of Hox genes in the coelacanth should provide insights into the origin of tetrapod limbs. We have cloned the complete second exon of 33 Hox genes from the Indonesian coelacanth, Latimeria menadoensis, by extensive PCR survey and genome walking. Phylogenetic analysis shows that 32 of these genes have orthologs in the four mammalian HOX clusters, including three genes (HoxA6, D1, and D8) that are absent in ray-finned fishes. The remaining coelacanth gene is an ortholog of hoxc1 found in zebrafish but absent in mammals. Our results suggest that coelacanths have four Hox clusters bearing a gene complement more similar to mammals than to ray-finned fishes, but with an additional gene, HoxC1, which has been lost during the evolution of mammals from lobe-finned fishes.

  4. Conditions for the Evolution of Gene Clusters in Bacterial Genomes

    PubMed Central

    Ballouz, Sara; Francis, Andrew R.; Lan, Ruiting; Tanaka, Mark M.

    2010-01-01

    Genes encoding proteins in a common pathway are often found near each other along bacterial chromosomes. Several explanations have been proposed to account for the evolution of these structures. For instance, natural selection may directly favour gene clusters through a variety of mechanisms, such as increased efficiency of coregulation. An alternative and controversial hypothesis is the selfish operon model, which asserts that clustered arrangements of genes are more easily transferred to other species, thus improving the prospects for survival of the cluster. According to another hypothesis (the persistence model), genes that are in close proximity are less likely to be disrupted by deletions. Here we develop computational models to study the conditions under which gene clusters can evolve and persist. First, we examine the selfish operon model by re-implementing the simulation and running it under a wide range of conditions. Second, we introduce and study a Moran process in which there is natural selection for gene clustering and rearrangement occurs by genome inversion events. Finally, we develop and study a model that includes selection and inversion, which tracks the occurrence and fixation of rearrangements. Surprisingly, gene clusters fail to evolve under a wide range of conditions. Factors that promote the evolution of gene clusters include a low number of genes in the pathway, a high population size, and in the case of the selfish operon model, a high horizontal transfer rate. The computational analysis here has shown that the evolution of gene clusters can occur under both direct and indirect selection as long as certain conditions hold. Under these conditions the selfish operon model is still viable as an explanation for the evolution of gene clusters. PMID:20168992

  5. Population genetics analysis of Phlebotomus papatasi sand flies from Egypt and Jordan based on mitochondrial cytochrome b haplotypes.

    PubMed

    Flanley, Catherine M; Ramalho-Ortigao, Marcelo; Coutinho-Abreu, Iliano V; Mukbel, Rami; Hanafi, Hanafi A; El-Hossary, Shabaan S; Fawaz, Emad El-Din Y; Hoel, David F; Bray, Alexander W; Stayback, Gwen; Shoue, Douglas A; Kamhawi, Shaden; Karakuş, Mehmet; Jaouadi, Kaouther; Yaghoobie-Ershadi, Mohammad Reza; Krüger, Andreas; Amro, Ahmad; Kenawy, Mohamed Amin; Dokhan, Mostafa Ramadhan; Warburg, Alon; Hamarsheh, Omar; McDowell, Mary Ann

    2018-03-27

    Phlebotomus papatasi sand flies are major vectors of Leishmania major and phlebovirus infection in North Africa and across the Middle East to the Indian subcontinent. Population genetics is a valuable tool in understanding the level of genetic variability present in vector populations, vector competence, and the development of novel control strategies. This study investigated the genetic differentiation between P. papatasi populations in Egypt and Jordan that inhabit distinct ecotopes and compared this structure to P. papatasi populations from a broader geographical range. A 461 base pair (bp) fragment from the mtDNA cytochrome b (cyt b) gene was PCR amplified and sequenced from 116 individual female sand flies from Aswan and North Sinai, Egypt, as well as Swaimeh and Malka, Jordan. Haplotypes were identified and used to generate a median-joining network, F ST values and isolation-by-distance were also evaluated. Additional sand fly individuals from Afghanistan, Iran, Israel, Jordan, Libya, Tunisia and Turkey were included as well as previously published haplotypes to provide a geographically broad genetic variation analysis. Thirteen haplotypes displaying nine variant sites were identified from P. papatasi collected in Egypt and Jordan. No private haplotypes were identified from samples in North Sinai, Egypt, two were observed in Aswan, Egypt, four from Swaimeh, Jordan and two in Malka, Jordan. The Jordan populations clustered separately from the Egypt populations and produced more private haplotypes than those from Egypt. Pairwise F ST values fall in the range 0.024-0.648. The clustering patterns and pairwise F ST values indicate a strong differentiation between Egyptian and Jordanian populations, although this population structure is not due to isolation-by-distance. Other factors, such as environmental influences and the genetic variability in the circulating Le. major parasites, could possibly contribute to this heterogeneity. The present study aligns with

  6. Insights into HLA-G Genetics Provided by Worldwide Haplotype Diversity

    PubMed Central

    Castelli, Erick C.; Ramalho, Jaqueline; Porto, Iane O. P.; Lima, Thálitta H. A.; Felício, Leandro P.; Sabbagh, Audrey; Donadi, Eduardo A.; Mendes-Junior, Celso T.

    2014-01-01

    Human leukocyte antigen G (HLA-G) belongs to the family of non-classical HLA class I genes, located within the major histocompatibility complex (MHC). HLA-G has been the target of most recent research regarding the function of class I non-classical genes. The main features that distinguish HLA-G from classical class I genes are (a) limited protein variability, (b) alternative splicing generating several membrane bound and soluble isoforms, (c) short cytoplasmic tail, (d) modulation of immune response (immune tolerance), and (e) restricted expression to certain tissues. In the present work, we describe the HLA-G gene structure and address the HLA-G variability and haplotype diversity among several populations around the world, considering each of its major segments [promoter, coding, and 3′ untranslated region (UTR)]. For this purpose, we developed a pipeline to reevaluate the 1000Genomes data and recover miscalled or missing genotypes and haplotypes. It became clear that the overall structure of the HLA-G molecule has been maintained during the evolutionary process and that most of the variation sites found in the HLA-G coding region are either coding synonymous or intronic mutations. In addition, only a few frequent and divergent extended haplotypes are found when the promoter, coding, and 3′UTRs are evaluated together. The divergence is particularly evident for the regulatory regions. The population comparisons confirmed that most of the HLA-G variability has originated before human dispersion from Africa and that the allele and haplotype frequencies have probably been shaped by strong selective pressures. PMID:25339953

  7. Gene flow between wolf and shepherd dog populations in Georgia (Caucasus).

    PubMed

    Kopaliani, Natia; Shakarashvili, Maia; Gurielidze, Zurab; Qurkhuli, Tamar; Tarkhnishvili, David

    2014-01-01

    We studied the distribution of the mitochondrial DNA haplotypes and microsatellite genotypes at 8 loci in 102 gray wolves, 57 livestock guarding dogs, and 9 mongrel dogs from Georgia (Caucasus). Most of the studied dogs had mitochondrial haplotypes clustered with presumably East Asian dog lineages, and most of the studied wolves had the haplotypes clustered with European wolves, but 20% of wolves and 37% of dogs shared the same mitochondrial haplotypes. Bayesian inference with STRUCTURE software suggested that more than 13% of the studied wolves had detectable dog ancestry and more than 10% of the dogs had detectable wolf ancestry. About 2-3% of the sampled wolves and dogs were identified, with a high probability, as first-generation hybrids. These results were supported by the relatedness analysis, which showed that 10% of wolves and 20% of dogs had closest relatives from an opposite group. The results of the study suggest that wolf-dog hybridization is a common event in the areas where large livestock guarding dogs are held in a traditional way, and that gene flow between dogs and gray wolves was an important force influencing gene pool of dogs for millennia since early domestication events. This process may have been terminated 1) in areas outside the natural range of gray wolves and 2) since very recent time, when humans started to more tightly control contacts of purebred dogs.

  8. High-Density SNP Genotyping to Define β-Globin Locus Haplotypes

    PubMed Central

    Liu, Li; Muralidhar, Shalini; Singh, Manisha; Sylvan, Caprice; Kalra, Inderdeep S.; Quinn, Charles T.; Onyekwere, Onyinye C.; Pace, Betty S.

    2014-01-01

    Five major β-globin locus haplotypes have been established in individuals with sickle cell disease (SCD) from the Benin, Bantu, Senegal, Cameroon, and Arab-Indian populations. Historically, β-haplotypes were established using restriction fragment length polymorphism (RFLP) analysis across the β-locus, which consists of five functional β-like globin genes located on chromosome 11. Previous attempts to correlate these haplotypes as robust predictors of clinical phenotypes observed in SCD have not been successful. We speculate that the coverage and distribution of the RFLP sites located proximal to or within the globin genes are not sufficiently dense to accurately reflect the complexity of this region. To test our hypothesis, we performed RFLP analysis and high-density single nucleotide polymorphism (SNP) genotyping across the β-locus using DNA samples from either healthy African Americans with normal hemoglobin A (HbAA) or individuals with homozygous SS (HbSS) disease. Using the genotyping data from 88 SNPs and Haploview analysis, we generated a greater number of haplotypes than that observed with RFLP analysis alone. Furthermore, a unique pattern of long-range linkage disequilibrium between the locus control region and the β-like globin genes was observed in the HbSS group. Interestingly, we observed multiple SNPs within the HindIII restriction site located in the Gγ-globin intervening sequence II which produced the same RFLP pattern. These findings illustrated the inability of RFLP analysis to decipher the complexity of sequence variations that impacts genomic structure in this region. Our data suggest that high density SNP mapping may be required to accurately define β-haplotypes that correlate with the different clinical phenotypes observed in SCD. PMID:18829352

  9. Multiscale mutation clustering algorithm identifies pan-cancer mutational clusters associated with pathway-level changes in gene expression

    PubMed Central

    Poole, William; Leinonen, Kalle; Shmulevich, Ilya

    2017-01-01

    Cancer researchers have long recognized that somatic mutations are not uniformly distributed within genes. However, most approaches for identifying cancer mutations focus on either the entire-gene or single amino-acid level. We have bridged these two methodologies with a multiscale mutation clustering algorithm that identifies variable length mutation clusters in cancer genes. We ran our algorithm on 539 genes using the combined mutation data in 23 cancer types from The Cancer Genome Atlas (TCGA) and identified 1295 mutation clusters. The resulting mutation clusters cover a wide range of scales and often overlap with many kinds of protein features including structured domains, phosphorylation sites, and known single nucleotide variants. We statistically associated these multiscale clusters with gene expression and drug response data to illuminate the functional and clinical consequences of mutations in our clusters. Interestingly, we find multiple clusters within individual genes that have differential functional associations: these include PTEN, FUBP1, and CDH1. This methodology has potential implications in identifying protein regions for drug targets, understanding the biological underpinnings of cancer, and personalizing cancer treatments. Toward this end, we have made the mutation clusters and the clustering algorithm available to the public. Clusters and pathway associations can be interactively browsed at m2c.systemsbiology.net. The multiscale mutation clustering algorithm is available at https://github.com/IlyaLab/M2C. PMID:28170390

  10. Multiscale mutation clustering algorithm identifies pan-cancer mutational clusters associated with pathway-level changes in gene expression.

    PubMed

    Poole, William; Leinonen, Kalle; Shmulevich, Ilya; Knijnenburg, Theo A; Bernard, Brady

    2017-02-01

    Cancer researchers have long recognized that somatic mutations are not uniformly distributed within genes. However, most approaches for identifying cancer mutations focus on either the entire-gene or single amino-acid level. We have bridged these two methodologies with a multiscale mutation clustering algorithm that identifies variable length mutation clusters in cancer genes. We ran our algorithm on 539 genes using the combined mutation data in 23 cancer types from The Cancer Genome Atlas (TCGA) and identified 1295 mutation clusters. The resulting mutation clusters cover a wide range of scales and often overlap with many kinds of protein features including structured domains, phosphorylation sites, and known single nucleotide variants. We statistically associated these multiscale clusters with gene expression and drug response data to illuminate the functional and clinical consequences of mutations in our clusters. Interestingly, we find multiple clusters within individual genes that have differential functional associations: these include PTEN, FUBP1, and CDH1. This methodology has potential implications in identifying protein regions for drug targets, understanding the biological underpinnings of cancer, and personalizing cancer treatments. Toward this end, we have made the mutation clusters and the clustering algorithm available to the public. Clusters and pathway associations can be interactively browsed at m2c.systemsbiology.net. The multiscale mutation clustering algorithm is available at https://github.com/IlyaLab/M2C.

  11. SNP and haplotype analysis of paired box 3 (PAX3) gene provide evidence for association with growth traits in Chinese cattle.

    PubMed

    Xu, Yao; Cai, Hanfang; Zhou, Yang; Shi, Tao; Lan, Xianyong; Zhang, Chunlei; Lei, Chuzhao; Jia, Yutang; Chen, Hong

    2014-07-01

    Paired box 3 (PAX3) belongs to the PAX superfamily of transcription factors and plays essential roles in the embryogenesis and postnatal formation of limb musculature through affecting the survival of muscle progenitor cells. By genetic mapping, PAX3 gene is assigned in the interval of quantitative trait loci for body weight on bovine BTA2. The objectives of this study were to detect polymorphisms of PAX3 gene in 1,241 cattle from five breeds and to investigate their effects on growth traits. Initially, three novel single nucleotide polymorphisms (SNPs) were identified by DNA pool sequencing and aCRS-RFLP methods (AC_000159: g.T-580G, g.A4617C and g.79018Ins/del G), which were located at 5'-UTR, exon 4 and intron 6, respectively. A total of eight haplotypes were constructed and the frequency of the three main haplotypes H1 (TAG), H2 (GCG) and H3 (GAG) accounted for over 81.7 % of the total individuals. Statistical analysis revealed that the three SNPs were associated with body height and body length of Nanyang and Chinese Caoyuan cattle at the age of 6 and/or 12 months old (P < 0.05), and consistently significant effects were also found in the haplotype combination analysis on these traits (P < 0.05). This study presented a complete scan of variations within bovine PAX3 gene, which could provide evidence for improving the economic traits of cattle by using these variations as potentially genetic markers in early marker-assisted selection programs.

  12. A tripartite clustering analysis on microRNA, gene and disease model.

    PubMed

    Shen, Chengcheng; Liu, Ying

    2012-02-01

    Alteration of gene expression in response to regulatory molecules or mutations could lead to different diseases. MicroRNAs (miRNAs) have been discovered to be involved in regulation of gene expression and a wide variety of diseases. In a tripartite biological network of human miRNAs, their predicted target genes and the diseases caused by altered expressions of these genes, valuable knowledge about the pathogenicity of miRNAs, involved genes and related disease classes can be revealed by co-clustering miRNAs, target genes and diseases simultaneously. Tripartite co-clustering can lead to more informative results than traditional co-clustering with only two kinds of members and pass the hidden relational information along the relation chain by considering multi-type members. Here we report a spectral co-clustering algorithm for k-partite graph to find clusters with heterogeneous members. We use the method to explore the potential relationships among miRNAs, genes and diseases. The clusters obtained from the algorithm have significantly higher density than randomly selected clusters, which means members in the same cluster are more likely to have common connections. Results also show that miRNAs in the same family based on the hairpin sequences tend to belong to the same cluster. We also validate the clustering results by checking the correlation of enriched gene functions and disease classes in the same cluster. Finally, widely studied miR-17-92 and its paralogs are analyzed as a case study to reveal that genes and diseases co-clustered with the miRNAs are in accordance with current research findings.

  13. Haplotypes composed of minor frequency single nucleotide polymorphisms of the TNF gene protect from progression into sepsis: A study using the new sepsis classification.

    PubMed

    Retsas, Theodoros; Huse, Klaus; Lazaridis, Lazaros-Dimitrios; Karampela, Niki; Bauer, Michael; Platzer, Matthias; Kolonia, Virginia; Papageorgiou, Eirini; Giamarellos-Bourboulis, Evangelos J; Dimopoulos, George

    2018-02-01

    Several articles have provided conflicting results regarding the role of single nucleotide polymorphisms (SNPs) in the promoter region of the TNF gene in susceptibility to sepsis. Former articles have been based on previous definitions of sepsis. This study investigated the influence of TNF haplotypes on the development of sepsis using the new Sepsis-3 definitions. DNA was isolated from patients suffering from infection and systemic inflammatory response syndrome. Haplotyping was performed for six SNPs of TNF. The serum levels of tumour necrosis factor alpha (TNF-α) of these patients were measured using an enzyme immunosorbent assay. Patients were classified into infection and sepsis categories using the Sepsis-3 definitions. Associations between the TNF haplotypes and the clinical characteristics and serum TNF-α levels of the patients were examined. The most common TNF haplotype h1 was composed of major alleles of the studied SNPs. Carriage of haplotypes composed of minor frequency alleles was associated with a lower risk of developing sepsis (odds ratio 0.41, 95% confidence interval 0.19-0.88, p=0.022), but this did not affect the 28-day outcome. Serum TNF-α levels were significantly higher among patients homozygous for h1 haplotypes who developed sepsis compared to infection (p=0.032); a similar result was not observed for patients carrying other haplotypes. Haplotypes containing minor frequency SNP alleles of TNF protect against the development of sepsis without affecting the outcome. Copyright © 2017 The Author(s). Published by Elsevier Ltd.. All rights reserved.

  14. The distribution of HLA haplotypes in the ethnic groups that make up the Brazilian Bone Marrow Volunteer Donor Registry (REDOME).

    PubMed

    Halagan, Michael; Oliveira, Danielli Cristina; Maiers, Martin; Fabreti-Oliveira, Raquel A; Moraes, Maria Elisa Hue; Visentainer, Jeane Eliete Laguila; Pereira, Noemi Farah; Romero, Matilde; Cardoso, Juliana Fernandes; Porto, Luís Cristóvão

    2018-04-26

    The Registries of Bone Marrow Donors around the world include more than 30 million volunteer donors from 57 different countries, and were responsible for over 17,000 hematopoietic stem cell transplants in 2016. The Brazilian Bone Marrow Volunteer Donor Registry (REDOME) was established in 1993 and is the third largest registry in the world with more than 4.3 million donors. We characterized HLA allele and haplotypes frequencies from REDOME comparing them with the donor self-reported race group classification. Five-locus haplotype frequencies (A~C~B~DRB1~DQB1) were estimated for each of the six race groups, resolving phase and allelic ambiguity using the expectation-maximization (EM) algorithm. The top 100 haplotypes in the race groups were separated into eight clusters of haplotypes, based on haplotype similarity, using CLUTO. We present HLA allele and haplotype frequency data from six race groups from 2,938,259 individuals from REDOME. The most frequent haplotype was the same for all groups: A*01:01g~C*07:01g~B*08:01g~DRB1*03:01g~DQB1*02:01g. Some frequent haplotypes such as A*02:01g~C*16:01g~B*44:03~DRB1*07:01g~DQB1*02:01g was not found in people with Preta (Sub-Saharan African descent). A cluster including Branca (European) and Parda or non-informed (admixed) could be distinguished from both Preta (SubSaharan) and Indígena (Amerindian) groups, and from the Amarela (Asian) ones, which clustered with their original population. These results have implications on cross-population matching and can help in donor searches and population-based recruitment strategies.

  15. Mutation profile of all 49 exons of the human myosin VIIA gene, and haplotype analysis, in Usher 1B families from diverse origins.

    PubMed

    Adato, A; Weil, D; Kalinski, H; Pel-Or, Y; Ayadi, H; Petit, C; Korostishevsky, M; Bonne-Tamir, B

    1997-10-01

    Usher syndrome types I (USH1A-USH1E) are a group of autosomal recessive diseases characterized by profound congenital hearing loss, vestibular areflexia, and progressive visual loss due to retinitis pigmentosa. The human myosin VIIA gene, located on 11q14, has been shown to be responsible for Usher syndrome type 1B (USH1B). Haplotypes were constructed in 28 USH1 families by use of the following polymorphic markers spanning the USH1B locus: D11S787, D11S527, D11S1789, D11S906, D11S4186, and OMP. Affected individuals and members of their families from 12 different ethnic origins were screened for the presence of mutations in all 49 exons of the myosin VIIA gene. In 15 families myosin VIIA mutations were detected, verifying their classification as USH1B. All these mutations are novel, including three missense mutations, one premature stop codon, two splicing mutations, one frameshift, and one deletion of >2 kb comprising exons 47 and 48, a part of exon 49, and the introns between them. Three mutations were shared by more than one family, consistent with haplotype similarities. Altogether, 16 USH1B haplotypes were observed in the 15 families; most haplotypes were population specific. Several exonic and intronic polymorphisms were also detected. None of the 20 known USH1B mutations reported so far in other world populations were identified in our families.

  16. [Spatial Distribution of Intron 2 of nad1 Gene Haplotypes in Populations of Norway and Siberian Spruce (Picea abies-P. obovata) Species Complex].

    PubMed

    Mudrik, E A; Polyakova, T A; Shatokhina, A V; Bondarenko, G N; Politov, D V

    2015-10-01

    The length and sequence variations among intron 2 haplotypes of the mitochondrial DNA nad1 gene have been studied in the Norway and Siberian spruce (Picea abies (L.) H. Karst.-P. obovata Ledeb.) species complex. Twenty-two native populations and 15 provenances were analyzed. The distribution of the northern European haplogroup (haplotypes 721, 755, 789, 823, 857, 891, and 925) is delimited in the west by the Ural region inclusively. Haplotype 712 is widespread in populations of Siberia, in the Far East and in northeastern Russia. A novel variant of the Siberian haplogroup (780) containing three copies of the first minisatellite motif (34 bp) was found for the first time. The absence of an admixture of the northern European and Siberian haplotypes in the zone of spruce species introgression previously marked by morphological traits and nuclear allozyme loci was demonstrated. This may be evidence of the existence of a sharper geographic boundary between the two haplogroups, as compared to a boundary based on phenotypic and allozyme data. A high proportion of the interpopulation component of variation (65%) estimated by AMOVA indicates a substantial genetic subdivision of European and Siberian populations of the Palearctic spruce complex by mtDNA, which can be putatively explained by natural barriers to gene flow with seeds related, for instance, to the woodless regions of the western Siberian Plain in the Pleistocene and the probable floodplains of large rivers.

  17. Influence of haplotypes, gene expression and soluble levels of L-selectin on the risk of acute coronary syndrome.

    PubMed

    Sandoval-Pinto, Elena; Padilla-Gutiérrez, Jorge Ramón; Hernández-Bello, Jorge; Martínez-Fernández, Diana Emilia; Valdés-Alvarado, Emmanuel; Muñoz-Valle, José Francisco; Flores-Salinas, H E; Valle, Yeminia

    2017-08-20

    L-selectin gene (SELL) is a candidate gene for the development of acute coronary syndrome (ACS) that contributes to endothelial dysfunction. The -642C>T (rs2205849) and 725C>T (rs2229569) polymorphisms have been associated with changes in gene expression, ligand affinity and increased risk of cardiovascular disease. The aim of this study was to investigate the association between the haplotypes constructed with the -642C>T and 725C>T polymorphisms of the SELL gene, the expression levels of its mRNA and the serum levels of soluble L-selectin with ACS. We recruited 615 individuals of Mexican origin matched by age, including 342 patients with ACS and 273 individuals without personal history of ischemic cardiopathy as control group (CG). Genotyping was performed by PCR-RFLP. The qPCR technique was used to analyze the expression of mRNA using TaqMan® UPL probes. The levels of soluble L-selectin were measured with ELISA. The allele variants in both polymorphisms were over-represented in the CG compared to the ACS (OR range: 0.371-0.716, p<0.006). The CT and TT haplotypes had a protective effect against the development of ACS (OR=0.401, p<0.0001; OR=0.628, p<0.0001, respectively). SELL expression was 3.076 times higher in the ACS group compared to CG (p<0.001). The levels of soluble L-selectin were similar between ACS and CG. Both polymorphisms had no effect on mRNA expression and soluble protein levels. The polymorphisms -642C>T and 725C>T of the SELL gene are protective factors against the development of ACS. There is an increased gene expression of L-selectin in ACS compared to CG in the population of Western Mexico. Copyright © 2017 Elsevier B.V. All rights reserved.

  18. Genes encoding cuticular proteins are components of the Nimrod gene cluster in Drosophila.

    PubMed

    Cinege, Gyöngyi; Zsámboki, János; Vidal-Quadras, Maite; Uv, Anne; Csordás, Gábor; Honti, Viktor; Gábor, Erika; Hegedűs, Zoltán; Varga, Gergely I B; Kovács, Attila L; Juhász, Gábor; Williams, Michael J; Andó, István; Kurucz, Éva

    2017-08-01

    The Nimrod gene cluster, located on the second chromosome of Drosophila melanogaster, is the largest synthenic unit of the Drosophila genome. Nimrod genes show blood cell specific expression and code for phagocytosis receptors that play a major role in fruit fly innate immune functions. We previously identified three homologous genes (vajk-1, vajk-2 and vajk-3) located within the Nimrod cluster, which are unrelated to the Nimrod genes, but are homologous to a fourth gene (vajk-4) located outside the cluster. Here we show that, unlike the Nimrod candidates, the Vajk proteins are expressed in cuticular structures of the late embryo and the late pupa, indicating that they contribute to cuticular barrier functions. Copyright © 2017 Elsevier Ltd. All rights reserved.

  19. Identification of lethal cluster of genes in the yeast transcription network

    NASA Astrophysics Data System (ADS)

    Rho, K.; Jeong, H.; Kahng, B.

    2006-05-01

    Identification of essential or lethal genes would be one of the ultimate goals in drug designs. Here we introduce an in silico method to select the cluster with a high population of lethal genes, called lethal cluster, through microarray assay. We construct a gene transcription network based on the microarray expression level. Links are added one by one in the descending order of the Pearson correlation coefficients between two genes. As the link density p increases, two meaningful link densities pm and ps are observed. At pm, which is smaller than the percolation threshold, the number of disconnected clusters is maximum, and the lethal genes are highly concentrated in a certain cluster that needs to be identified. Thus the deletion of all genes in that cluster could efficiently lead to a lethal inviable mutant. This lethal cluster can be identified by an in silico method. As p increases further beyond the percolation threshold, the power law behavior in the degree distribution of a giant cluster appears at ps. We measure the degree of each gene at ps. With the information pertaining to the degrees of each gene at ps, we return to the point pm and calculate the mean degree of genes of each cluster. We find that the lethal cluster has the largest mean degree.

  20. Haplotype Analysis of the Melanopsin Gene in Seasonal Affective Disorder and Controls

    DTIC Science & Technology

    2007-06-19

    Cole, P. A. (2002). Serotonin n-acetyltransferase: Mechanism and inhibition. Current Medicinal Chemistry , 9(12), 1187-1199. 152 APPENDIX A STRUCTURED ...such that low light levels fall below this threshold during winter in individuals with SAD. The present study investigated the haplotype structure of...Association Studies 51 Advantages of Population-Based Case-Control Samples 52 Haplotype Structure 53 Linkage Disequilibrium: A Measure of Correlation Between

  1. The ergot alkaloid gene cluster in Claviceps purpurea: extension of the cluster sequence and intra species evolution.

    PubMed

    Haarmann, Thomas; Machado, Caroline; Lübbe, Yvonne; Correia, Telmo; Schardl, Christopher L; Panaccione, Daniel G; Tudzynski, Paul

    2005-06-01

    The genomic region of Claviceps purpurea strain P1 containing the ergot alkaloid gene cluster [Tudzynski, P., Hölter, K., Correia, T., Arntz, C., Grammel, N., Keller, U., 1999. Evidence for an ergot alkaloid gene cluster in Claviceps purpurea. Mol. Gen. Genet. 261, 133-141] was explored by chromosome walking, and additional genes probably involved in the ergot alkaloid biosynthesis have been identified. The putative cluster sequence (extending over 68.5kb) contains 4 different nonribosomal peptide synthetase (NRPS) genes and several putative oxidases. Northern analysis showed that most of the genes were co-regulated (repressed by high phosphate), and identified probable flanking genes by lack of co-regulation. Comparison of the cluster sequences of strain P1, an ergotamine producer, with that of strain ECC93, an ergocristine producer, showed high conservation of most of the cluster genes, but significant variation in the NRPS modules, strongly suggesting that evolution of these chemical races of C. purpurea is determined by evolution of NRPS module specificity.

  2. Large clusters of co-expressed genes in the Drosophila genome.

    PubMed

    Boutanaev, Alexander M; Kalmykova, Alla I; Shevelyov, Yuri Y; Nurminsky, Dmitry I

    2002-12-12

    Clustering of co-expressed, non-homologous genes on chromosomes implies their co-regulation. In lower eukaryotes, co-expressed genes are often found in pairs. Clustering of genes that share aspects of transcriptional regulation has also been reported in higher eukaryotes. To advance our understanding of the mode of coordinated gene regulation in multicellular organisms, we performed a genome-wide analysis of the chromosomal distribution of co-expressed genes in Drosophila. We identified a total of 1,661 testes-specific genes, one-third of which are clustered on chromosomes. The number of clusters of three or more genes is much higher than expected by chance. We observed a similar trend for genes upregulated in the embryo and in the adult head, although the expression pattern of individual genes cannot be predicted on the basis of chromosomal position alone. Our data suggest that the prevalent mechanism of transcriptional co-regulation in higher eukaryotes operates with extensive chromatin domains that comprise multiple genes.

  3. Molecular identification and first report of mitochondrial COI gene haplotypes in the hawksbill turtle Eretmochelys imbricata (Testudines: Cheloniidae) in the Colombian Caribbean nesting colonies.

    PubMed

    Daza-Criado, L; Hernández-Fernández, J

    2014-02-21

    Hawksbill sea turtles Eretmochelys imbricata are found extensively around the world, including the Atlantic, Pacific, and Indian Oceans; the Persian Gulf, and the Red and Mediterranean Seas. Populations of this species are affected by international trafficking of their shields, meat, and eggs, making it a critically endangered animal. We determined the haplotypes of 17 hawksbill foraging turtles of Islas del Rosario (Bolivar) and of the nesting beach Don Diego (Magdalena) in the Colombian Caribbean based on amplification and sequencing of the mitochondrial gene cytochrome oxidase c subunit I (COI). We identified 5 haplotypes, including EI-A1 previously reported in Puerto Rico, which was similar to 10 of the study samples. To our knowledge, the remaining 4 haplotypes have not been described. Samples EICOI11 and EICOI3 showed 0.2% divergence from EI-A1, by a single nucleotide change, and were classified as the EI-A2 haplotype. EICOI6, EICOI14, and EICOI12 samples showed 0.2% divergence from EI-A1 and 0.3% divergence from EI-A2 and were classified as EI-A3 haplotype. Samples EICOI16 and EICOI15 presented 5 nucleotide changes each and were classified as 2 different haplotypes, EI-A4 and EI-A5, respectively. The last 2 haplotypes had higher nucleotide diversity (K2P=1.7%) than that by the first 3 haplotypes. EI-A1 and EI-A2 occurred in nesting individuals, and EI-A2, EI-A3, EI-A4, and EI-A5 occurred in foraging individuals. The description of the haplotypes may be associated with reproductive migrations or foraging and could support the hypothesis of natal homing. Furthermore, they can be used in phylogeographic studies.

  4. Arrangement of the Clostridium baratii F7 Toxin Gene Cluster with Identification of a σ Factor That Recognizes the Botulinum Toxin Gene Cluster Promoters

    DOE PAGES

    Dover, Nir; Barash, Jason R.; Burke, Julianne N.; ...

    2014-05-22

    Botulinum neurotoxin (BoNT) is the most poisonous substances known and its eight toxin types (A to H) are distinguished by the inability of polyclonal antibodies that neutralize one toxin type to neutralize any of the other seven toxin types. Infant botulism, an intestinal toxemia orphan disease, is the most common form of human botulism in the United States. It results from swallowed spores of Clostridium botulinum (or rarely, neurotoxigenic Clostridium butyricum or Clostridium baratii) that germinate and temporarily colonize the lumen of the large intestine, where, as vegetative cells, they produce botulinum toxin. Botulinum neurotoxin is encoded by the bontmore » gene that is part of a toxin gene cluster that includes several accessory genes. In this paper, we sequenced for the first time the complete botulinum neurotoxin gene cluster of nonproteolytic C. baratii type F7. Like the type E and the nonproteolytic type F6 botulinum toxin gene clusters, the C. baratii type F7 had an orfX toxin gene cluster that lacked the regulatory botR gene which is found in proteolytic C. botulinum strains and codes for an alternative σ factor. In the absence of botR, we identified a putative alternative regulatory gene located upstream of the C. baratii type F7 toxin gene cluster. This putative regulatory gene codes for a predicted σ factor that contains DNA-binding-domain homologues to the DNA-binding domains both of BotR and of other members of the TcdR-related group 5 of the σ 70 family that are involved in the regulation of toxin gene expression in clostridia. We showed that this TcdR-related protein in association with RNA polymerase core enzyme specifically binds to the C. baratii type F7 botulinum toxin gene cluster promoters. Finally, this TcdR-related protein may therefore be involved in regulating the expression of the genes of the botulinum toxin gene cluster in neurotoxigenic C. baratii.« less

  5. Differential Retention of Gene Functions in a Secondary Metabolite Cluster.

    PubMed

    Reynolds, Hannah T; Slot, Jason C; Divon, Hege H; Lysøe, Erik; Proctor, Robert H; Brown, Daren W

    2017-08-01

    In fungi, distribution of secondary metabolite (SM) gene clusters is often associated with host- or environment-specific benefits provided by SMs. In the plant pathogen Alternaria brassicicola (Dothideomycetes), the DEP cluster confers an ability to synthesize the SM depudecin, a histone deacetylase inhibitor that contributes weakly to virulence. The DEP cluster includes genes encoding enzymes, a transporter, and a transcription regulator. We investigated the distribution and evolution of the DEP cluster in 585 fungal genomes and found a wide but sporadic distribution among Dothideomycetes, Sordariomycetes, and Eurotiomycetes. We confirmed DEP gene expression and depudecin production in one fungus, Fusarium langsethiae. Phylogenetic analyses suggested 6-10 horizontal gene transfers (HGTs) of the cluster, including a transfer that led to the presence of closely related cluster homologs in Alternaria and Fusarium. The analyses also indicated that HGTs were frequently followed by loss/pseudogenization of one or more DEP genes. Independent cluster inactivation was inferred in at least four fungal classes. Analyses of transitions among functional, pseudogenized, and absent states of DEP genes among Fusarium species suggest enzyme-encoding genes are lost at higher rates than the transporter (DEP3) and regulatory (DEP6) genes. The phenotype of an experimentally-induced DEP3 mutant of Fusarium did not support the hypothesis that selective retention of DEP3 and DEP6 protects fungi from exogenous depudecin. Together, the results suggest that HGT and gene loss have contributed significantly to DEP cluster distribution, and that some DEP genes provide a greater fitness benefit possibly due to a differential tendency to form network connections. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution 2017. This work is written by US Government employees and is in the public domain in the US.

  6. Functional clustering of time series gene expression data by Granger causality

    PubMed Central

    2012-01-01

    Background A common approach for time series gene expression data analysis includes the clustering of genes with similar expression patterns throughout time. Clustered gene expression profiles point to the joint contribution of groups of genes to a particular cellular process. However, since genes belong to intricate networks, other features, besides comparable expression patterns, should provide additional information for the identification of functionally similar genes. Results In this study we perform gene clustering through the identification of Granger causality between and within sets of time series gene expression data. Granger causality is based on the idea that the cause of an event cannot come after its consequence. Conclusions This kind of analysis can be used as a complementary approach for functional clustering, wherein genes would be clustered not solely based on their expression similarity but on their topological proximity built according to the intensity of Granger causality among them. PMID:23107425

  7. Transcription factor clusters regulate genes in eukaryotic cells

    PubMed Central

    Hedlund, Erik G; Friemann, Rosmarie; Hohmann, Stefan

    2017-01-01

    Transcription is regulated through binding factors to gene promoters to activate or repress expression, however, the mechanisms by which factors find targets remain unclear. Using single-molecule fluorescence microscopy, we determined in vivo stoichiometry and spatiotemporal dynamics of a GFP tagged repressor, Mig1, from a paradigm signaling pathway of Saccharomyces cerevisiae. We find the repressor operates in clusters, which upon extracellular signal detection, translocate from the cytoplasm, bind to nuclear targets and turnover. Simulations of Mig1 configuration within a 3D yeast genome model combined with a promoter-specific, fluorescent translation reporter confirmed clusters are the functional unit of gene regulation. In vitro and structural analysis on reconstituted Mig1 suggests that clusters are stabilized by depletion forces between intrinsically disordered sequences. We observed similar clusters of a co-regulatory activator from a different pathway, supporting a generalized cluster model for transcription factors that reduces promoter search times through intersegment transfer while stabilizing gene expression. PMID:28841133

  8. Identification of nitrogen-fixing genes and gene clusters from metagenomic library of acid mine drainage.

    PubMed

    Dai, Zhimin; Guo, Xue; Yin, Huaqun; Liang, Yili; Cong, Jing; Liu, Xueduan

    2014-01-01

    Biological nitrogen fixation is an essential function of acid mine drainage (AMD) microbial communities. However, most acidophiles in AMD environments are uncultured microorganisms and little is known about the diversity of nitrogen-fixing genes and structure of nif gene cluster in AMD microbial communities. In this study, we used metagenomic sequencing to isolate nif genes in the AMD microbial community from Dexing Copper Mine, China. Meanwhile, a metagenome microarray containing 7,776 large-insertion fosmids was constructed to screen novel nif gene clusters. Metagenomic analyses revealed that 742 sequences were identified as nif genes including structural subunit genes nifH, nifD, nifK and various additional genes. The AMD community is massively dominated by the genus Acidithiobacillus. However, the phylogenetic diversity of nitrogen-fixing microorganisms is much higher than previously thought in the AMD community. Furthermore, a 32.5-kb genomic sequence harboring nif, fix and associated genes was screened by metagenome microarray. Comparative genome analysis indicated that most nif genes in this cluster are most similar to those of Herbaspirillum seropedicae, but the organization of the nif gene cluster had significant differences from H. seropedicae. Sequence analysis and reverse transcription PCR also suggested that distinct transcription units of nif genes exist in this gene cluster. nifQ gene falls into the same transcription unit with fixABCX genes, which have not been reported in other diazotrophs before. All of these results indicated that more novel diazotrophs survive in the AMD community.

  9. Identification of Nitrogen-Fixing Genes and Gene Clusters from Metagenomic Library of Acid Mine Drainage

    PubMed Central

    Yin, Huaqun; Liang, Yili; Cong, Jing; Liu, Xueduan

    2014-01-01

    Biological nitrogen fixation is an essential function of acid mine drainage (AMD) microbial communities. However, most acidophiles in AMD environments are uncultured microorganisms and little is known about the diversity of nitrogen-fixing genes and structure of nif gene cluster in AMD microbial communities. In this study, we used metagenomic sequencing to isolate nif genes in the AMD microbial community from Dexing Copper Mine, China. Meanwhile, a metagenome microarray containing 7,776 large-insertion fosmids was constructed to screen novel nif gene clusters. Metagenomic analyses revealed that 742 sequences were identified as nif genes including structural subunit genes nifH, nifD, nifK and various additional genes. The AMD community is massively dominated by the genus Acidithiobacillus. However, the phylogenetic diversity of nitrogen-fixing microorganisms is much higher than previously thought in the AMD community. Furthermore, a 32.5-kb genomic sequence harboring nif, fix and associated genes was screened by metagenome microarray. Comparative genome analysis indicated that most nif genes in this cluster are most similar to those of Herbaspirillum seropedicae, but the organization of the nif gene cluster had significant differences from H. seropedicae. Sequence analysis and reverse transcription PCR also suggested that distinct transcription units of nif genes exist in this gene cluster. nifQ gene falls into the same transcription unit with fixABCX genes, which have not been reported in other diazotrophs before. All of these results indicated that more novel diazotrophs survive in the AMD community. PMID:24498417

  10. Hierarchical Dirichlet process model for gene expression clustering

    PubMed Central

    2013-01-01

    Clustering is an important data processing tool for interpreting microarray data and genomic network inference. In this article, we propose a clustering algorithm based on the hierarchical Dirichlet processes (HDP). The HDP clustering introduces a hierarchical structure in the statistical model which captures the hierarchical features prevalent in biological data such as the gene express data. We develop a Gibbs sampling algorithm based on the Chinese restaurant metaphor for the HDP clustering. We apply the proposed HDP algorithm to both regulatory network segmentation and gene expression clustering. The HDP algorithm is shown to outperform several popular clustering algorithms by revealing the underlying hierarchical structure of the data. For the yeast cell cycle data, we compare the HDP result to the standard result and show that the HDP algorithm provides more information and reduces the unnecessary clustering fragments. PMID:23587447

  11. Addiction Genetics and Pleiotropic Effects of Common Haplotypes that Make Polygenic Contributions to Vulnerability to Substance Dependence

    PubMed Central

    Uhl, George R.; Drgon, Tomas; Johnson, Catherine; Liu, Qing-Rong

    2016-01-01

    Abundant evidence from family, adoption, and twin studies point to large genetic contributions to individual differences in vulnerability to develop dependence on one or more addictive substances. Twin data suggest that most of this genetic vulnerability is shared by individuals who are dependent on a variety of addictive substances. Molecular genetic studies, especially genomewide and candidate gene association studies, have elucidated common haplotypes in dozens of genes that appear to make polygenic contributions to vulnerability to developing dependence. Most genes that harbor currently identified addiction-associated haplotypes are expressed in the brain. Haplotypes in many of the same genes are identified in genomewide association studies that compare allele frequencies in substance dependent vs. control individuals from European, African, and Asian racial/ethnic backgrounds. Many of these addiction-associated haplotypes display pleiotropic influences on a variety of related brain-based phenotypes that display 1) substantial heritability and 2) clinical cooccurence with substance dependence. PMID:19152208

  12. Interleukin-1 gene cluster variants in hemodialysis patients with end stage renal disease: An association and meta-analysis

    PubMed Central

    Tripathi, G.; Rangaswamy, D.; Borkar, M.; Prasad, N.; Sharma, R. K.; Sankhwar, S. N.; Agrawal, S.

    2015-01-01

    We evaluated whether polymorphisms in interleukin (IL-1) gene cluster (IL-1 alpha [IL-1A], IL-1 beta [IL-1B], and IL-1 receptor antagonist [IL-1RN]) are associated with end stage renal disease (ESRD). A total of 258 ESRD patients and 569 ethnicity matched controls were examined for IL-1 gene cluster. These were genotyped for five single-nucleotide gene polymorphisms in the IL-1A, IL-1B and IL-1RN genes and a variable number of tandem repeats (VNTR) in the IL-1RN. The IL-1B − 3953 and IL-1RN + 8006 polymorphism frequencies were significantly different between the two groups. At IL-1B, the T allele of − 3953C/T was increased among ESRD (P = 0.0001). A logistic regression model demonstrated that two repeat (240 base pair [bp]) of the IL-1Ra VNTR polymorphism was associated with ESRD (P = 0.0001). The C/C/C/C/C/1 haplotype was more prevalent in ESRD = 0.007). No linkage disequilibrium (LD) was observed between six loci of IL-1 gene. We further conducted a meta-analysis of existing studies and found that there is a strong association of IL-1 RN VNTR 86 bp repeat polymorphism with susceptibility to ESRD (odds ratio = 2.04, 95% confidence interval = 1.48-2.82; P = 0.000). IL-1B − 5887, +8006 and the IL-1RN VNTR polymorphisms have been implicated as potential risk factors for ESRD. The meta-analysis showed a strong association of IL-1RN 86 bp VNTR polymorphism with susceptibility to ESRD. PMID:25684870

  13. Mitochondrial Haplotype Diversity in Zambian Lions: Bridging a Gap in the Biogeography of an Iconic Species

    PubMed Central

    Curry, Caitlin J.; White, Paula A.; Derr, James N.

    2015-01-01

    Analysis of DNA sequence diversity at the 12S to 16S mitochondrial genes of 165 African lions (Panthera leo) from five main areas in Zambia has uncovered haplotypes which link Southern Africa with East Africa. Phylogenetic analysis suggests Zambia may serve as a bridge connecting the lion populations in southern Africa to eastern Africa, supporting earlier hypotheses that eastern-southern Africa may represent the evolutionary cradle for the species. Overall gene diversity throughout the Zambian lion population was 0.7319 +/- 0.0174 with eight haplotypes found; three haplotypes previously described and the remaining five novel. The addition of these five novel haplotypes, so far only found within Zambia, nearly doubles the number of haplotypes previously reported for any given geographic location of wild lions. However, based on an AMOVA analysis of these haplotypes, there is little to no matrilineal gene flow (Fst = 0.47) when the eastern and western regions of Zambia are considered as two regional sub-populations. Crossover haplotypes (H9, H11, and Z1) appear in both populations as rare in one but common in the other. This pattern is a possible result of the lion mating system in which predominately males disperse, as all individuals with crossover haplotypes were male. The determination and characterization of lion sub-populations, such as done in this study for Zambia, represent a higher-resolution of knowledge regarding both the genetic health and connectivity of lion populations, which can serve to inform conservation and management of this iconic species. PMID:26674533

  14. Mitochondrial Haplotype Diversity in Zambian Lions: Bridging a Gap in the Biogeography of an Iconic Species.

    PubMed

    Curry, Caitlin J; White, Paula A; Derr, James N

    2015-01-01

    Analysis of DNA sequence diversity at the 12S to 16S mitochondrial genes of 165 African lions (Panthera leo) from five main areas in Zambia has uncovered haplotypes which link Southern Africa with East Africa. Phylogenetic analysis suggests Zambia may serve as a bridge connecting the lion populations in southern Africa to eastern Africa, supporting earlier hypotheses that eastern-southern Africa may represent the evolutionary cradle for the species. Overall gene diversity throughout the Zambian lion population was 0.7319 +/- 0.0174 with eight haplotypes found; three haplotypes previously described and the remaining five novel. The addition of these five novel haplotypes, so far only found within Zambia, nearly doubles the number of haplotypes previously reported for any given geographic location of wild lions. However, based on an AMOVA analysis of these haplotypes, there is little to no matrilineal gene flow (Fst = 0.47) when the eastern and western regions of Zambia are considered as two regional sub-populations. Crossover haplotypes (H9, H11, and Z1) appear in both populations as rare in one but common in the other. This pattern is a possible result of the lion mating system in which predominately males disperse, as all individuals with crossover haplotypes were male. The determination and characterization of lion sub-populations, such as done in this study for Zambia, represent a higher-resolution of knowledge regarding both the genetic health and connectivity of lion populations, which can serve to inform conservation and management of this iconic species.

  15. Prediction of operon-like gene clusters in the Arabidopsis thaliana genome based on co-expression analysis of neighboring genes.

    PubMed

    Wada, Masayoshi; Takahashi, Hiroki; Altaf-Ul-Amin, Md; Nakamura, Kensuke; Hirai, Masami Y; Ohta, Daisaku; Kanaya, Shigehiko

    2012-07-15

    Operon-like arrangements of genes occur in eukaryotes ranging from yeasts and filamentous fungi to nematodes, plants, and mammals. In plants, several examples of operon-like gene clusters involved in metabolic pathways have recently been characterized, e.g. the cyclic hydroxamic acid pathways in maize, the avenacin biosynthesis gene clusters in oat, the thalianol pathway in Arabidopsis thaliana, and the diterpenoid momilactone cluster in rice. Such operon-like gene clusters are defined by their co-regulation or neighboring positions within immediate vicinity of chromosomal regions. A comprehensive analysis of the expression of neighboring genes therefore accounts a crucial step to reveal the complete set of operon-like gene clusters within a genome. Genome-wide prediction of operon-like gene clusters should contribute to functional annotation efforts and provide novel insight into evolutionary aspects acquiring certain biological functions as well. We predicted co-expressed gene clusters by comparing the Pearson correlation coefficient of neighboring genes and randomly selected gene pairs, based on a statistical method that takes false discovery rate (FDR) into consideration for 1469 microarray gene expression datasets of A. thaliana. We estimated that A. thaliana contains 100 operon-like gene clusters in total. We predicted 34 statistically significant gene clusters consisting of 3 to 22 genes each, based on a stringent FDR threshold of 0.1. Functional relationships among genes in individual clusters were estimated by sequence similarity and functional annotation of genes. Duplicated gene pairs (determined based on BLAST with a cutoff of E<10(-5)) are included in 27 clusters. Five clusters are associated with metabolism, containing P450 genes restricted to the Brassica family and predicted to be involved in secondary metabolism. Operon-like clusters tend to include genes encoding bio-machinery associated with ribosomes, the ubiquitin/proteasome system, secondary

  16. De novo assembly of a haplotype-resolved human genome.

    PubMed

    Cao, Hongzhi; Wu, Honglong; Luo, Ruibang; Huang, Shujia; Sun, Yuhui; Tong, Xin; Xie, Yinlong; Liu, Binghang; Yang, Hailong; Zheng, Hancheng; Li, Jian; Li, Bo; Wang, Yu; Yang, Fang; Sun, Peng; Liu, Siyang; Gao, Peng; Huang, Haodong; Sun, Jing; Chen, Dan; He, Guangzhu; Huang, Weihua; Huang, Zheng; Li, Yue; Tellier, Laurent C A M; Liu, Xiao; Feng, Qiang; Xu, Xun; Zhang, Xiuqing; Bolund, Lars; Krogh, Anders; Kristiansen, Karsten; Drmanac, Radoje; Drmanac, Snezana; Nielsen, Rasmus; Li, Songgang; Wang, Jian; Yang, Huanming; Li, Yingrui; Wong, Gane Ka-Shu; Wang, Jun

    2015-06-01

    The human genome is diploid, and knowledge of the variants on each chromosome is important for the interpretation of genomic information. Here we report the assembly of a haplotype-resolved diploid genome without using a reference genome. Our pipeline relies on fosmid pooling together with whole-genome shotgun strategies, based solely on next-generation sequencing and hierarchical assembly methods. We applied our sequencing method to the genome of an Asian individual and generated a 5.15-Gb assembled genome with a haplotype N50 of 484 kb. Our analysis identified previously undetected indels and 7.49 Mb of novel coding sequences that could not be aligned to the human reference genome, which include at least six predicted genes. This haplotype-resolved genome represents the most complete de novo human genome assembly to date. Application of our approach to identify individual haplotype differences should aid in translating genotypes to phenotypes for the development of personalized medicine.

  17. Heterologous expression of pikromycin biosynthetic gene cluster using Streptomyces artificial chromosome system.

    PubMed

    Pyeon, Hye-Rim; Nah, Hee-Ju; Kang, Seung-Hoon; Choi, Si-Sun; Kim, Eung-Soo

    2017-05-31

    Heterologous expression of biosynthetic gene clusters of natural microbial products has become an essential strategy for titer improvement and pathway engineering of various potentially-valuable natural products. A Streptomyces artificial chromosomal conjugation vector, pSBAC, was previously successfully applied for precise cloning and tandem integration of a large polyketide tautomycetin (TMC) biosynthetic gene cluster (Nah et al. in Microb Cell Fact 14(1):1, 2015), implying that this strategy could be employed to develop a custom overexpression scheme of natural product pathway clusters present in actinomycetes. To validate the pSBAC system as a generally-applicable heterologous overexpression system for a large-sized polyketide biosynthetic gene cluster in Streptomyces, another model polyketide compound, the pikromycin biosynthetic gene cluster, was preciously cloned and heterologously expressed using the pSBAC system. A unique HindIII restriction site was precisely inserted at one of the border regions of the pikromycin biosynthetic gene cluster within the chromosome of Streptomyces venezuelae, followed by site-specific recombination of pSBAC into the flanking region of the pikromycin gene cluster. Unlike the previous cloning process, one HindIII site integration step was skipped through pSBAC modification. pPik001, a pSBAC containing the pikromycin biosynthetic gene cluster, was directly introduced into two heterologous hosts, Streptomyces lividans and Streptomyces coelicolor, resulting in the production of 10-deoxymethynolide, a major pikromycin derivative. When two entire pikromycin biosynthetic gene clusters were tandemly introduced into the S. lividans chromosome, overproduction of 10-deoxymethynolide and the presence of pikromycin, which was previously not detected, were both confirmed. Moreover, comparative qRT-PCR results confirmed that the transcription of pikromycin biosynthetic genes was significantly upregulated in S. lividans containing tandem

  18. Clustering approaches to identifying gene expression patterns from DNA microarray data.

    PubMed

    Do, Jin Hwan; Choi, Dong-Kug

    2008-04-30

    The analysis of microarray data is essential for large amounts of gene expression data. In this review we focus on clustering techniques. The biological rationale for this approach is the fact that many co-expressed genes are co-regulated, and identifying co-expressed genes could aid in functional annotation of novel genes, de novo identification of transcription factor binding sites and elucidation of complex biological pathways. Co-expressed genes are usually identified in microarray experiments by clustering techniques. There are many such methods, and the results obtained even for the same datasets may vary considerably depending on the algorithms and metrics for dissimilarity measures used, as well as on user-selectable parameters such as desired number of clusters and initial values. Therefore, biologists who want to interpret microarray data should be aware of the weakness and strengths of the clustering methods used. In this review, we survey the basic principles of clustering of DNA microarray data from crisp clustering algorithms such as hierarchical clustering, K-means and self-organizing maps, to complex clustering algorithms like fuzzy clustering.

  19. Kullback-Leibler divergence for detection of rare haplotype common disease association.

    PubMed

    Lin, Shili

    2015-11-01

    Rare haplotypes may tag rare causal variants of common diseases; hence, detection of such rare haplotypes may also contribute to our understanding of complex disease etiology. Because rare haplotypes frequently result from common single-nucleotide polymorphisms (SNPs), focusing on rare haplotypes is much more economical compared with using rare single-nucleotide variants (SNVs) from sequencing, as SNPs are available and 'free' from already amassed genome-wide studies. Further, associated haplotypes may shed light on the underlying disease causal mechanism, a feat unmatched by SNV-based collapsing methods. In recent years, data mining approaches have been adapted to detect rare haplotype association. However, as they rely on an assumed underlying disease model and require the specification of a null haplotype, results can be erroneous if such assumptions are violated. In this paper, we present a haplotype association method based on Kullback-Leibler divergence (hapKL) for case-control samples. The idea is to compare haplotype frequencies for the cases versus the controls by computing symmetrical divergence measures. An important property of such measures is that both the frequencies and logarithms of the frequencies contribute in parallel, thus balancing the contributions from rare and common, and accommodating both deleterious and protective, haplotypes. A simulation study under various scenarios shows that hapKL has well-controlled type I error rates and good power compared with existing data mining methods. Application of hapKL to age-related macular degeneration (AMD) shows a strong association of the complement factor H (CFH) gene with AMD, identifying several individual rare haplotypes with strong signals.

  20. Genetic and physical mapping of homologues of the virus resistance gene Rx1 and the cyst nematode resistance gene Gpa2 in potato.

    PubMed

    Bakker, E; Butterbach, P; Rouppe van der Voort, J; van der Vossen, E; van Vliet, J; Bakker, J; Goverse, A

    2003-05-01

    Nine resistance gene homologues (RGHs) were identified in two diploid potato clones (SH and RH), with a specific primer pair based on conserved motifs in the LRR domain of the potato cyst nematode resistance gene Gpa2 and the potato virus X resistance gene Rx1. A modified AFLP method was used to facilitate the genetic mapping of the RGHs in the four haplotypes under investigation. All nine RGHs appeared to be located in the Gpa2/ Rx1 cluster on chromosome XII. Construction of a physical map using bacterial artificial chromosome (BAC) clones for both the Solanum tuberosum ssp. tuberosum and the S. tuberosum ssp. andigena haplotype of SH showed that the RGHs are located within a stretch of less than 200 kb. Sequence analysis of the RGHs revealed that they are highly similar (93 to 95%) to Gpa2 and Rx1. The sequence identities among all RGHs range from 85 to 100%. Two pairs of RGHs are identical, or nearly so (100 and 99.9%), with each member located in a different genotype. Southern-blot analysis on genomic DNA revealed no evidence for additional homologues outside the Gpa2/ Rx1 cluster on chromosome XII.

  1. High density genotyping of STAT4 gene reveals multiple haplotypic associations with Systemic Lupus Erythematosus in different racial groups

    PubMed Central

    Namjou, Bahram; Sestak, Andrea L.; Armstrong, Don L.; Zidovetzki, Raphael; Kelly, Jennifer A.; Jacob, Noam; Ciobanu, Voicu; Kaufman, Kenneth M.; Ojwang, Joshua O.; Ziegler, Julie; Quismorio, Francesco; Reiff, Andreas; Myones, Barry L.; Guthridge, Joel M.; Nath, Swapan K.; Bruner, Gail R.; Mehrian-Shai, Ruth; Silverman, Earl; Klein-Gitelman, Marisa; McCurdy, Deborah; Wagner-Weiner, Linda; Nocton, James J.; Putterman, Chaim; Bae, Sang-Cheol; Kim, Yun Jung; Petri, Michelle; Reveille, John D.; Vyse, Timothy J.; Gilkeson, Gary S.; Kamen, Diane L.; Alarcón-Riquelme, Marta E.; Gaffney, Patrick M.; Moser, Kathy L; Merrill, Joan T.; Scofield, R. Hal; James, Judith A.; Langefeld, Carl D.; Harley, John B.; Jacob, Chaim O.

    2009-01-01

    Objective Systemic lupus erythematosus (SLE) is the prototypic systemic autoimmune disorder with complex etiology and a strong genetic component. Recently, gene products involved in the interferon pathway have been under intense investigation in SLE pathogenesis. STAT1 and STAT4 are transcription factors that play key roles in the interferon and Th1 signaling pathways, making them attractive candidates for SLE susceptibility. Methods Fifty-six single-nucleotide polymorphisms (SNPs) across STAT1 and STAT4 genes on chromosome 2 were genotyped using Illumina platform as a part of extensive association study in a large collection of 9923 lupus cases and controls from different racial groups. DNA from patients and controls was obtained from peripheral blood. Principal component analyses and population based case-control association analyses were performed and the p values, FDR q values and Odds ratios with 95% confidence intervals (95% CIs) were calculated. Results We observed strong genetic associations with SLE and multiple SNPs located within the STAT4 gene in different ethnicities (Fisher combined p= 7.02×10−25). In addition to strong confirmation of the association in the 3rd intronic region of this gene reported previously, we identified additional haplotypic association across STAT4 gene and in particular a common risk haplotype that is found in multiple racial groups. In contrast, only a relatively weak suggestive association was observed with STAT1, probably due to the proximity to STAT4. Conclusion Our findings indicate that the STAT4 gene is likely to be a crucial component in SLE pathogenesis among multiple racial groups. The functional effects of this association, when revealed, might improve our understanding of the disease and provide new therapeutic targets. PMID:19333953

  2. Introgression of Neandertal- and Denisovan-like Haplotypes Contributes to Adaptive Variation in Human Toll-like Receptors

    PubMed Central

    Dannemann, Michael; Andrés, Aida M.; Kelso, Janet

    2016-01-01

    Pathogens and the diseases they cause have been among the most important selective forces experienced by humans during their evolutionary history. Although adaptive alleles generally arise by mutation, introgression can also be a valuable source of beneficial alleles. Archaic humans, who lived in Europe and Western Asia for more than 200,000 years, were probably well adapted to this environment and its local pathogens. It is therefore conceivable that modern humans entering Europe and Western Asia who admixed with them obtained a substantial immune advantage from the introgression of archaic alleles. Here we document a cluster of three Toll-like receptors (TLR6-TLR1-TLR10) in modern humans that carries three distinct archaic haplotypes, indicating repeated introgression from archaic humans. Two of these haplotypes are most similar to the Neandertal genome, and the third haplotype is most similar to the Denisovan genome. The Toll-like receptors are key components of innate immunity and provide an important first line of immune defense against bacteria, fungi, and parasites. The unusually high allele frequencies and unexpected levels of population differentiation indicate that there has been local positive selection on multiple haplotypes at this locus. We show that the introgressed alleles have clear functional effects in modern humans; archaic-like alleles underlie differences in the expression of the TLR genes and are associated with reduced microbial resistance and increased allergic disease in large cohorts. This provides strong evidence for recurrent adaptive introgression at the TLR6-TLR1-TLR10 locus, resulting in differences in disease phenotypes in modern humans. PMID:26748514

  3. A Stationary Wavelet Entropy-Based Clustering Approach Accurately Predicts Gene Expression

    PubMed Central

    Nguyen, Nha; Vo, An; Choi, Inchan

    2015-01-01

    Abstract Studying epigenetic landscapes is important to understand the condition for gene regulation. Clustering is a useful approach to study epigenetic landscapes by grouping genes based on their epigenetic conditions. However, classical clustering approaches that often use a representative value of the signals in a fixed-sized window do not fully use the information written in the epigenetic landscapes. Clustering approaches to maximize the information of the epigenetic signals are necessary for better understanding gene regulatory environments. For effective clustering of multidimensional epigenetic signals, we developed a method called Dewer, which uses the entropy of stationary wavelet of epigenetic signals inside enriched regions for gene clustering. Interestingly, the gene expression levels were highly correlated with the entropy levels of epigenetic signals. Dewer separates genes better than a window-based approach in the assessment using gene expression and achieved a correlation coefficient above 0.9 without using any training procedure. Our results show that the changes of the epigenetic signals are useful to study gene regulation. PMID:25383910

  4. An effective fuzzy kernel clustering analysis approach for gene expression data.

    PubMed

    Sun, Lin; Xu, Jiucheng; Yin, Jiaojiao

    2015-01-01

    Fuzzy clustering is an important tool for analyzing microarray data. A major problem in applying fuzzy clustering method to microarray gene expression data is the choice of parameters with cluster number and centers. This paper proposes a new approach to fuzzy kernel clustering analysis (FKCA) that identifies desired cluster number and obtains more steady results for gene expression data. First of all, to optimize characteristic differences and estimate optimal cluster number, Gaussian kernel function is introduced to improve spectrum analysis method (SAM). By combining subtractive clustering with max-min distance mean, maximum distance method (MDM) is proposed to determine cluster centers. Then, the corresponding steps of improved SAM (ISAM) and MDM are given respectively, whose superiority and stability are illustrated through performing experimental comparisons on gene expression data. Finally, by introducing ISAM and MDM into FKCA, an effective improved FKCA algorithm is proposed. Experimental results from public gene expression data and UCI database show that the proposed algorithms are feasible for cluster analysis, and the clustering accuracy is higher than the other related clustering algorithms.

  5. Identification of a Fourth Haplotype of Bactericera cockerelli (Hemiptera: Triozidae) in the United States

    PubMed Central

    Swisher, Kylie D.; Henne, Donald C.; Crosslin, James M.

    2014-01-01

    Abstract The potato psyllid, Bactericera cockerelli (Sulc) (Hemiptera: Triozidae), is a pest of potato and other solanaceous crops in North and Central America and New Zealand. Previous genotyping studies have demonstrated the presence of three different haplotypes of B. cockerelli in the United States corresponding to three geographical regions: Central, Western, and Northwestern. These studies utilized psyllids collected in the western and central United States between 1998 and 2011. In an effort to further genotype potato psyllids collected in the 2012 growing season, a fourth B. cockerelli haplotype was discovered corresponding to the Southwestern United States geographical region. High-resolution melting analyses identified this new haplotype using an amplicon generated from a portion of the B. cockerelli mitochondrial cytochrome coxidase subunit I gene. Sequencing of this gene, as well as use of a restriction enzyme assay, confirmed the identification of the novel B. cockerelli haplotype in the United States. PMID:25368079

  6. Distinct genotype distribution and haplotype profiles in MDR1 gene among Chinese Han, Bai, Wa and Tibetan ethnic groups.

    PubMed

    Lai, Yong; Huang, Min; Li, Hui; Wang, Xue-Ding; Li, Jia-Li

    2012-11-01

    P-Glycoprotein (P-gp, encoded by MDR1 gene) plays an important role in determining bioavailability and pharmacologic effects of many drugs. There is increasing evidence that P-gp activity may be genetically determined. In this study, we investigated the genotype distribution and the haplotype profiles of MDR1 gene in Chinese Han, Bai, Wa and Tibetan subjects. Much lower frequencies of the 1236T allele and the 2677T allele were found in Wa subjects than those in other three ethnic groups, while the 2677A allele was found about 6-fold more frequently in Han subjects than in subjects of other three ethnic groups. The Han, Bai and Tibetan subjects share the same three predominant haplotypes (T-T-T, T-G-C and C-G-C), and T-T-T is the highest and accounts for more than one third of the number of haplotypes in the subjects from each ethnic group. However, T-T-T was less common than T-G-C, T-G-T and C-G-C and occurring at only 13.8% in Wa subjects, furthermore, higher frequencies of T-G-T, C-T-C, C-G-T and C-T-T were observed in Wa subjects compared to those in other three ethnic groups. Frequencies of C-A-C and T-A-C in Han subjects were higher than those in other three ethnic groups. The findings of this study will be of some relevance in predicting MDR1 phenotype and pharmacokinetics as well as pharmacodynamic effects of many commonly used drugs that are P-gp substrates in these four Chinese ethnic groups.

  7. Clustering Genes of Common Evolutionary History

    PubMed Central

    Gori, Kevin; Suchan, Tomasz; Alvarez, Nadir; Goldman, Nick; Dessimoz, Christophe

    2016-01-01

    Phylogenetic inference can potentially result in a more accurate tree using data from multiple loci. However, if the loci are incongruent—due to events such as incomplete lineage sorting or horizontal gene transfer—it can be misleading to infer a single tree. To address this, many previous contributions have taken a mechanistic approach, by modeling specific processes. Alternatively, one can cluster loci without assuming how these incongruencies might arise. Such “process-agnostic” approaches typically infer a tree for each locus and cluster these. There are, however, many possible combinations of tree distance and clustering methods; their comparative performance in the context of tree incongruence is largely unknown. Furthermore, because standard model selection criteria such as AIC cannot be applied to problems with a variable number of topologies, the issue of inferring the optimal number of clusters is poorly understood. Here, we perform a large-scale simulation study of phylogenetic distances and clustering methods to infer loci of common evolutionary history. We observe that the best-performing combinations are distances accounting for branch lengths followed by spectral clustering or Ward’s method. We also introduce two statistical tests to infer the optimal number of clusters and show that they strongly outperform the silhouette criterion, a general-purpose heuristic. We illustrate the usefulness of the approach by 1) identifying errors in a previous phylogenetic analysis of yeast species and 2) identifying topological incongruence among newly sequenced loci of the globeflower fly genus Chiastocheta. We release treeCl, a new program to cluster genes of common evolutionary history (http://git.io/treeCl). PMID:26893301

  8. An Association Study of the SLC19A1 Gene Polymorphisms/Haplotypes with Idiopathic Recurrent Pregnancy Loss in an Iranian Population.

    PubMed

    Mohtaram, Shirin; Sheikhha, Mohammad Hasan; Honarvar, Negar; Sazegari, Ali; Maraghechi, Neda; Feizollahi, Zahra; Ghasemi, Nasrin

    2016-05-01

    The genetics of folate metabolism is one of the most significant mechanisms influencing fetal growth and may underlie some cases of unexplained recurrent miscarriage. Reduced folate carrier 1, encoded by the SLC19A1 gene, is a transporter of folate. Folate deficiency and elevated levels of homocysteine could be disadvantageous for the female reproductive system health. Thus, the balance between homocysteine and folate status can be used to measure the risk of recurrent pregnancy loss. The purpose of this study was to determine the association between -43T>C, 80G>A, and 696C>T polymorphisms of the SLC19A1 gene in 147 women who had unexplained recurrent miscarriage in comparison with 150 healthy women. Amplification refractory mutation system-polymerase chain reaction was used to genotype the molecular polymorphisms of this gene. The results indicated that the -43T>C single nucleotide of the SLC19A1 gene was significantly associated with a risk of recurrent miscarriage in Iranian women (p < 0.05). No significant association was observed for the other two polymorphisms. The haplotype frequency distribution of -43C/80G/696C, -;43C/80G/696T, -43C/80G, and 80G/696T was significantly different in patients than controls, which may represent a novel risk factor for idiopathic recurrent pregnancy loss. Polymorphisms and haplotypes of the SLC19A1 gene can be considered risk factors for idiopathic recurrent pregnancy loss.

  9. Association of chemokine receptor gene (CCR2-CCR5) haplotypes with acquisition and control of HIV-1 infection in Zambians

    PubMed Central

    2011-01-01

    Background Polymorphisms in chemokine (C-C motif) receptors 2 and 5 genes (CCR2 and CCR5) have been associated with HIV-1 infection and disease progression. We investigated the impact of CCR2-CCR5 haplotypes on HIV-1 viral load (VL) and heterosexual transmission in an African cohort. Between 1995 and 2006, cohabiting Zambian couples discordant for HIV-1 (index seropositive and HIV-1 exposed seronegative {HESN}) were monitored prospectively to determine the role of host genetic factors in HIV-1 control and heterosexual transmission. Genotyping for eight CCR2 and CCR5 variants resolved nine previously recognized haplotypes. By regression and survival analytic techniques, controlling for non-genetic factors, we estimated the effects of these haplotypic variants on a) index partner VL, b) seroconverter VL, c) HIV-1 transmission by index partners, d) HIV-1 acquisition by HESN partners. Results Among 567 couples, 240 virologically linked transmission events had occurred through 2006. HHF*2 homozygosity was associated with significantly lower VL in seroconverters (mean beta = -0.58, log10 P = 0.027) and the HHD/HHE diplotype was associated with significantly higher VL in the seroconverters (mean beta = 0.54, log10 P = 0.014) adjusted for age and gender in multivariable model. HHD/HHE was associated with more rapid acquisition of infection by the HESNs (HR = 2.0, 95% CI = 1.20-3.43, P = 0.008), after adjustments for index partner VL and the presence of genital ulcer or inflammation in either partner in Cox multivariable models. The HHD/HHE effect was stronger in exposed females (HR = 2.1, 95% CI = 1.14-3.95, P = 0.018). Conclusions Among Zambian discordant couples, HIV-1 coreceptor gene haplotypes and diplotypes appear to modulate HIV-1 VL in seroconverters and alter the rate of HIV-1 acquisition by HESNs. These associations replicate or resemble findings reported in other African and European populations. PMID:21429204

  10. Distribution and Genetic Diversity of Bacteriocin Gene Clusters in Rumen Microbial Genomes.

    PubMed

    Azevedo, Analice C; Bento, Cláudia B P; Ruiz, Jeronimo C; Queiroz, Marisa V; Mantovani, Hilário C

    2015-10-01

    Some species of ruminal bacteria are known to produce antimicrobial peptides, but the screening procedures have mostly been based on in vitro assays using standardized methods. Recent sequencing efforts have made available the genome sequences of hundreds of ruminal microorganisms. In this work, we performed genome mining of the complete and partial genome sequences of 224 ruminal bacteria and 5 ruminal archaea to determine the distribution and diversity of bacteriocin gene clusters. A total of 46 bacteriocin gene clusters were identified in 33 strains of ruminal bacteria. Twenty gene clusters were related to lanthipeptide biosynthesis, while 11 gene clusters were associated with sactipeptide production, 7 gene clusters were associated with class II bacteriocin production, and 8 gene clusters were associated with class III bacteriocin production. The frequency of strains whose genomes encode putative antimicrobial peptide precursors was 14.4%. Clusters related to the production of sactipeptides were identified for the first time among ruminal bacteria. BLAST analysis indicated that the majority of the gene clusters (88%) encoding putative lanthipeptides contained all the essential genes required for lanthipeptide biosynthesis. Most strains of Streptococcus (66.6%) harbored complete lanthipeptide gene clusters, in addition to an open reading frame encoding a putative class II bacteriocin. Albusin B-like proteins were found in 100% of the Ruminococcus albus strains screened in this study. The in silico analysis provided evidence of novel biosynthetic gene clusters in bacterial species not previously related to bacteriocin production, suggesting that the rumen microbiota represents an underexplored source of antimicrobial peptides. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  11. HaploForge: a comprehensive pedigree drawing and haplotype visualization web application.

    PubMed

    Tekman, Mehmet; Medlar, Alan; Mozere, Monika; Kleta, Robert; Stanescu, Horia

    2017-12-15

    Haplotype reconstruction is an important tool for understanding the aetiology of human disease. Haplotyping infers the most likely phase of observed genotypes conditional on constraints imposed by the genotypes of other pedigree members. The results of haplotype reconstruction, when visualized appropriately, show which alleles are identical by descent despite the presence of untyped individuals. When used in concert with linkage analysis, haplotyping can help delineate a locus of interest and provide a succinct explanation for the transmission of the trait locus. Unfortunately, the design choices made by existing haplotype visualization programs do not scale to large numbers of markers. Indeed, following haplotypes from generation to generation requires excessive scrolling back and forth. In addition, the most widely used program for haplotype visualization produces inconsistent recombination artefacts for the X chromosome. To resolve these issues, we developed HaploForge, a novel web application for haplotype visualization and pedigree drawing. HaploForge takes advantage of HTML5 to be fast, portable and avoid the need for local installation. It can accurately visualize autosomal and X-linked haplotypes from both outbred and consanguineous pedigrees. Haplotypes are coloured based on identity by descent using a novel A* search algorithm and we provide a flexible viewing mode to aid visual inspection. HaploForge can currently process haplotype reconstruction output from Allegro, GeneHunter, Merlin and Simwalk. HaploForge is licensed under GPLv3 and is hosted and maintained via GitHub. https://github.com/mtekman/haploforge. r.kleta@ucl.ac.uk. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  12. A TNF region haplotype offers protection from typhoid fever in Vietnamese patients

    PubMed Central

    2009-01-01

    The genomic region surrounding the TNF locus on human chromosome 6 has previously been associated with typhoid fever in Vietnam. We used a haplotypic approach to understand this association further. Eighty single nucleotide polymorphisms (SNPs) spanning a 150 kb region were genotyped in 95 Vietnamese individuals (typhoid case/mother/father trios). A subset of data from 33 SNPs with a minor allele frequency of >4.3% was used to construct haplotypes. Fifteen SNPs, which tagged the 42 constructed haplotypes were selected. The haplotype tagging SNPs (T1-T15) were genotyped in 380 confirmed typhoid cases and 380 Vietnamese ethnically matched controls. Allelic frequencies of seven SNPs (T1, T2, T3, T5, T6, T7, T8) were significantly different between typhoid cases and controls. Logistic regression results support the hypothesis that there is just one signal associated with disease at this locus. Haplotype-based analysis of the tag SNPs provided positive evidence of association with typhoid (posterior probability 0.821). The analysis highlighted a low-risk cluster of haplotypes that each carry the minor allele of T1 or T7, but not both, and otherwise carry the combination of alleles *12122*1111 at T1-T11, further supporting the one associated signal hypothesis. Finally, individuals that carry the typhoid fever protective haplotype *12122*1111 also produce a relatively low TNF-α response to LPS. PMID:17503085

  13. The genetic architecture of 3'untranslated region of the MICA gene: polymorphisms and haplotypes.

    PubMed

    Luo, Jia; Tian, Wei; Liu, Xue Xiang; Yu, JunLong; Li, LiXin; Pan, FengHua

    2013-10-01

    In this study, the 3'untranslated region (3'UTR) of MHC class I chain-related gene A (MICA) were investigated in 104 healthy, unrelated Han individuals recruited from northern China, using PCR-sequencing method. Nine polymorphic sites were detected, which were in very strong linkage disequilibrium with each other .Seven different MICA 3'UTR alleles were identified, among which UTR1 predominated (0.6971),followed by UTR2 (0.2356). Twenty-one extended haplotypes incorporating the 3'UTR and MICA exons 2-5 were observed in this population. Phylogenetic analysis revealed the existence of two MICA lineages, each with multiple subsets. The 2 lineages were primarily linked to UTR1 and UTR2 in the 3'UTR, respectively. Ewens-Watterson homozygosity statistics at MICA coding and 3'UTR regions were consistent with neutral expectations. Our data provided for the first time the data of genetic variation in the 3'UTR of MICA gene in human populations. The findings are valuable for future studies of the mechanisms underlying MICA post-transcriptional regulation, and will inform studies of evolution of the MHC gene complex. Copyright © 2013 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.

  14. Haplotype-based gene-gene interaction of bone morphogenetic protein 4 and interferon regulatory factor 6 in the etiology of non-syndromic cleft lip with or without cleft palate in a Chilean population.

    PubMed

    Blanco, Rafael; Colombo, Alicia; Pardo, Rosa; Suazo, José

    2017-04-01

    Non-syndromic cleft lip with or without cleft palate (NSCL/P) is the most common craniofacial birth defect in humans, the etiology of which can be dependent on the interactions of multiple genes. We previously reported haplotype associations for polymorphic variants of interferon regulatory factor 6 (IRF6), msh homeobox 1 (MSX1), bone morphogenetic protein 4 (BMP4), and transforming growth factor beta 3 (TGFB3) in Chile. Here, we analyzed the haplotype-based gene-gene interaction for markers of these genes and NSCL/P risk in the Chilean population. We genotyped 15 single nucleoptide polymorphisms (SNPs) in 152 Chilean patients and 164 controls. Linkage disequilibrium (LD) blocks were determined using the Haploview software, and phase reconstruction was performed by the Phase program. Haplotype-based interactions were evaluated using the multifactor dimensionality reduction (MDR) method. We detected two LD blocks composed of two SNPs from BMP4 (Block 1) and three SNPs from IRF6 (Block 2). Although MDR showed no statistical significance for the global interaction model involving these blocks, we found four combinations conferring a statistically significantly increased NSCL/P risk (Block 1-Block 2): T-T/T-G C-G-T/G-A-T; T-T/T-G C-G-C/C-G-C; T-T/T-G G-A-T/G-A-T; and T-T/C-G G-A-T/G-A-T. These findings may reflect the presence of a genomic region containing potential causal variants interacting in the etiology of NSCL/P and may contribute to disentangling the complex etiology of this birth defect. © 2017 Eur J Oral Sci.

  15. Assembly and features of secondary metabolite biosynthetic gene clusters in Streptomyces ansochromogenes.

    PubMed

    Zhong, Xingyu; Tian, Yuqing; Niu, Guoqing; Tan, Huarong

    2013-07-01

    A draft genome sequence of Streptomyces ansochromogenes 7100 was generated using 454 sequencing technology. In combination with local BLAST searches and gap filling techniques, a comprehensive antiSMASH-based method was adopted to assemble the secondary metabolite biosynthetic gene clusters in the draft genome of S. ansochromogenes. A total of at least 35 putative gene clusters were identified and assembled. Transcriptional analysis showed that 20 of the 35 gene clusters were expressed in either or all of the three different media tested, whereas the other 15 gene clusters were silent in all three different media. This study provides a comprehensive method to identify and assemble secondary metabolite biosynthetic gene clusters in draft genomes of Streptomyces, and will significantly promote functional studies of these secondary metabolite biosynthetic gene clusters.

  16. Molecular definition of red cell Rh haplotypes by tightly linked SphI RFLPs.

    PubMed

    Huang, C H; Reid, M E; Chen, Y; Coghlan, G; Okubo, Y

    1996-01-01

    The Rh blood group system of human red cells contains five major antigens D, C/c, and E/e (the latter four designated "non-D") that are specified by eight gene complexes known as Rh haplotypes. In this paper, we report on the mapping of RH locus and identification of a set of SphI RFLPs that are tightly linked with the Rh structural genes. Using exon-specific probes, we have localized the SphI cleavage sites resulting in these DNA markers and derived a comprehensive map for the RH locus. It was found that the SphI fragments encompassing exons 4-7 of the Rh genes occur in four banding patterns or frameworks that correspond to the distribution and segregation of the common Rh haplotypes. This linkage disequilibrium allowed a genotype-phenotype correlation and direct determination of Rh zygosity related to the Rh-positive or Rh-negative status (D/D, D/d, and d/d). Studies on the occurrence of SphI RFLPs in a number of rare Rh variants indicated that Rh phenotypic diversity has taken place on different haplotype backgrounds and has arisen by diverse genetic mechanisms. The molecular definition of Rh haplotypes by SphI RFLP frameworks should provide a useful procedure for genetic counseling and prenatal assessment of Rh alloimmunization.

  17. Diploid/Polyploid Syntenic Shuttle Mapping and Haplotype-Specific Chromosome Walking Toward a Rust Resistance Gene (Bru1) in Highly Polyploid Sugarcane (2n ∼ 12x ∼ 115)

    PubMed Central

    Le Cunff, Loïc; Garsmeur, Olivier; Raboin, Louis Marie; Pauquet, Jérome; Telismart, Hugues; Selvi, Athiappan; Grivet, Laurent; Philippe, Romain; Begum, Dilara; Deu, Monique; Costet, Laurent; Wing, Rod; Glaszmann, Jean Christophe; D'Hont, Angélique

    2008-01-01

    The genome of modern sugarcane cultivars is highly polyploid (∼12x), aneuploid, of interspecific origin, and contains 10 Gb of DNA. Its size and complexity represent a major challenge for the isolation of agronomically important genes. Here we report on the first attempt to isolate a gene from sugarcane by map-based cloning, targeting a durable major rust resistance gene (Bru1). We describe the genomic strategies that we have developed to overcome constraints associated with high polyploidy in the successive steps of map-based cloning approaches, including diploid/polyploid syntenic shuttle mapping with two model diploid species (sorghum and rice) and haplotype-specific chromosome walking. Their applications allowed us (i) to develop a high-resolution map including markers at 0.28 and 0.14 cM on both sides and 13 markers cosegregating with Bru1 and (ii) to develop a physical map of the target haplotype that still includes two gaps at this stage due to the discovery of an insertion specific to this haplotype. These approaches will pave the way for the development of future map-based cloning approaches for sugarcane and other complex polyploid species. PMID:18757946

  18. Transcriptome Analysis of Aspergillus flavus Reveals veA-Dependent Regulation of Secondary Metabolite Gene Clusters, Including the Novel Aflavarin Cluster

    PubMed Central

    Cary, J. W.; Han, Z.; Yin, Y.; Lohmar, J. M.; Shantappa, S.; Harris-Coward, P. Y.; Mack, B.; Ehrlich, K. C.; Wei, Q.; Arroyo-Manzanares, N.; Uka, V.; Vanhaecke, L.; Bhatnagar, D.; Yu, J.; Nierman, W. C.; Johns, M. A.; Sorensen, D.; Shen, H.; De Saeger, S.; Diana Di Mavungu, J.

    2015-01-01

    The global regulatory veA gene governs development and secondary metabolism in numerous fungal species, including Aspergillus flavus. This is especially relevant since A. flavus infects crops of agricultural importance worldwide, contaminating them with potent mycotoxins. The most well-known are aflatoxins, which are cytotoxic and carcinogenic polyketide compounds. The production of aflatoxins and the expression of genes implicated in the production of these mycotoxins are veA dependent. The genes responsible for the synthesis of aflatoxins are clustered, a signature common for genes involved in fungal secondary metabolism. Studies of the A. flavus genome revealed many gene clusters possibly connected to the synthesis of secondary metabolites. Many of these metabolites are still unknown, or the association between a known metabolite and a particular gene cluster has not yet been established. In the present transcriptome study, we show that veA is necessary for the expression of a large number of genes. Twenty-eight out of the predicted 56 secondary metabolite gene clusters include at least one gene that is differentially expressed depending on presence or absence of veA. One of the clusters under the influence of veA is cluster 39. The absence of veA results in a downregulation of the five genes found within this cluster. Interestingly, our results indicate that the cluster is expressed mainly in sclerotia. Chemical analysis of sclerotial extracts revealed that cluster 39 is responsible for the production of aflavarin. PMID:26209694

  19. Computational intelligence in bioinformatics: SNP/haplotype data in genetic association study for common diseases.

    PubMed

    Kelemen, Arpad; Vasilakos, Athanasios V; Liang, Yulan

    2009-09-01

    Comprehensive evaluation of common genetic variations through association of single-nucleotide polymorphism (SNP) structure with common complex disease in the genome-wide scale is currently a hot area in human genome research due to the recent development of the Human Genome Project and HapMap Project. Computational science, which includes computational intelligence (CI), has recently become the third method of scientific enquiry besides theory and experimentation. There have been fast growing interests in developing and applying CI in disease mapping using SNP and haplotype data. Some of the recent studies have demonstrated the promise and importance of CI for common complex diseases in genomic association study using SNP/haplotype data, especially for tackling challenges, such as gene-gene and gene-environment interactions, and the notorious "curse of dimensionality" problem. This review provides coverage of recent developments of CI approaches for complex diseases in genetic association study with SNP/haplotype data.

  20. The analysis of APOL1 genetic variation and haplotype diversity provided by 1000 Genomes project.

    PubMed

    Peng, Ting; Wang, Li; Li, Guisen

    2017-08-11

    The APOL1 gene variants has been shown to be associated with an increased risk of multiple kinds of diseases, particularly in African Americans, but not in Caucasians and Asians. In this study, we explored the single nucleotide polymorphism (SNP) and haplotype diversity of APOL1 gene in different races provided by 1000 Genomes project. Variants of APOL1 gene in 1000 Genome Project were obtained and SNPs located in the regulatory region or coding region were selected for genetic variation analysis. Total 2504 individuals from 26 populations were classified as four groups that included Africa, Europe, Asia and Admixed populations. Tag SNPs were selected to evaluate the haplotype diversities in the four populations by HaploStats software. APOL1 gene was surrounded by some of the most polymorphic genes in the human genome, variation of APOL1 gene was common, with up to 613 SNP (1000 Genome Project reported) and 99 of them (16.2%) with MAF ≥ 1%. There were 79 SNPs in the URR and 92 SNPs in 3'UTR. Total 12 SNPs in URR and 24 SNPs in 3'UTR were considered as common variants with MAF ≥ 1%. It is worth noting that URR-1 was presents lower frequencies in European populations, while other three haplotypes taken an opposite pattern; 3'UTR presents several high-frequency variation sites in a short segment, and the differences of its haplotypes among different population were significant (P < 0.01), UTR-1 and UTR-5 presented much higher frequency in African population, while UTR-2, UTR-3 and UTR-4 were much lower. APOL1 coding region showed that two SNP of G1 with higher frequency are actually pull down the haplotype H-1 frequency when considering all populations pooled together, and the diversity among the four populations be widen by the G1 two mutation (P 1  = 3.33E-4 vs P 2  = 3.61E-30). The distributions of APOL1 gene variants and haplotypes were significantly different among the different populations, in either regulatory or coding regions. It could provide

  1. Identifying a gene expression signature of cluster headache in blood

    PubMed Central

    Eising, Else; Pelzer, Nadine; Vijfhuizen, Lisanne S.; Vries, Boukje de; Ferrari, Michel D.; ‘t Hoen, Peter A. C.; Terwindt, Gisela M.; van den Maagdenberg, Arn M. J. M.

    2017-01-01

    Cluster headache is a relatively rare headache disorder, typically characterized by multiple daily, short-lasting attacks of excruciating, unilateral (peri-)orbital or temporal pain associated with autonomic symptoms and restlessness. To better understand the pathophysiology of cluster headache, we used RNA sequencing to identify differentially expressed genes and pathways in whole blood of patients with episodic (n = 19) or chronic (n = 20) cluster headache in comparison with headache-free controls (n = 20). Gene expression data were analysed by gene and by module of co-expressed genes with particular attention to previously implicated disease pathways including hypocretin dysregulation. Only moderate gene expression differences were identified and no associations were found with previously reported pathogenic mechanisms. At the level of functional gene sets, associations were observed for genes involved in several brain-related mechanisms such as GABA receptor function and voltage-gated channels. In addition, genes and modules of co-expressed genes showed a role for intracellular signalling cascades, mitochondria and inflammation. Although larger study samples may be required to identify the full range of involved pathways, these results indicate a role for mitochondria, intracellular signalling and inflammation in cluster headache. PMID:28074859

  2. The human RHOX gene cluster: target genes and functional analysis of gene variants in infertile men.

    PubMed

    Borgmann, Jennifer; Tüttelmann, Frank; Dworniczak, Bernd; Röpke, Albrecht; Song, Hye-Won; Kliesch, Sabine; Wilkinson, Miles F; Laurentino, Sandra; Gromoll, Jörg

    2016-11-15

    The X-linked reproductive homeobox (RHOX) gene cluster encodes transcription factors preferentially expressed in reproductive tissues. This gene cluster has important roles in male fertility based on phenotypic defects of Rhox-mutant mice and the finding that aberrant RHOX promoter methylation is strongly associated with abnormal human sperm parameters. However, little is known about the molecular mechanism of RHOX function in humans. Using gene expression profiling, we identified genes regulated by members of the human RHOX gene cluster. Some genes were uniquely regulated by RHOXF1 or RHOXF2/2B, while others were regulated by both of these transcription factors. Several of these regulated genes encode proteins involved in processes relevant to spermatogenesis; e.g. stress protection and cell survival. One of the target genes of RHOXF2/2B is RHOXF1, suggesting cross-regulation to enhance transcriptional responses. The potential role of RHOX in human infertility was addressed by sequencing all RHOX exons in a group of 250 patients with severe oligozoospermia. This revealed two mutations in RHOXF1 (c.515G > A and c.522C > T) and four in RHOXF2/2B (-73C > G, c.202G > A, c.411C > T and c.679G > A), of which only one (c.202G > A) was found in a control group of men with normal sperm concentration. Functional analysis demonstrated that c.202G > A and c.679G > A significantly impaired the ability of RHOXF2/2B to regulate downstream genes. Molecular modelling suggested that these mutations alter RHOXF2/F2B protein conformation. By combining clinical data with in vitro functional analysis, we demonstrate how the X-linked RHOX gene cluster may function in normal human spermatogenesis and we provide evidence that it is impaired in human male fertility.

  3. Many nonuniversal archaeal ribosomal proteins are found in conserved gene clusters

    PubMed Central

    WANG, JIACHEN; DASGUPTA, INDRANI; FOX, GEORGE E.

    2009-01-01

    The genomic associations of the archaeal ribosomal proteins, (r-proteins), were examined in detail. The archaeal versions of the universal r-protein genes are typically in clusters similar or identical and to those found in bacteria. Of the 35 nonuniversal archaeal r-protein genes examined, the gene encoding L18e was found to be associated with the conserved L13 cluster, whereas the genes for S4e, L32e and L19e were found in the archaeal version of the spc operon. Eleven nonuniversal protein genes were not associated with any common genomic context. Of the remaining 19 protein genes, 17 were convincingly assigned to one of 10 previously unrecognized gene clusters. Examination of the gene content of these clusters revealed multiple associations with genes involved in the initiation of protein synthesis, transcription or other cellular processes. The lack of such associations in the universal clusters suggests that initially the ribosome evolved largely independently of other processes. More recently it likely has evolved in concert with other cellular systems. It was also verified that a second copy of the gene encoding L7ae found in some bacteria is actually a homolog of the gene encoding L30e and should be annotated as such. PMID:19478915

  4. Candidate Gene Approach for Parasite Resistance in Sheep – Variation in Immune Pathway Genes and Association with Fecal Egg Count

    PubMed Central

    Periasamy, Kathiravan; Pichler, Rudolf; Poli, Mario; Cristel, Silvina; Cetrá, Bibiana; Medus, Daniel; Basar, Muladno; A. K., Thiruvenkadan; Ramasamy, Saravanan; Ellahi, Masroor Babbar; Mohammed, Faruque; Teneva, Atanaska; Shamsuddin, Mohammed; Podesta, Mario Garcia; Diallo, Adama

    2014-01-01

    Sheep chromosome 3 (Oar3) has the largest number of QTLs reported to be significantly associated with resistance to gastro-intestinal nematodes. This study aimed to identify single nucleotide polymorphisms (SNPs) within candidate genes located in sheep chromosome 3 as well as genes involved in major immune pathways. A total of 41 SNPs were identified across 38 candidate genes in a panel of unrelated sheep and genotyped in 713 animals belonging to 22 breeds across Asia, Europe and South America. The variations and evolution of immune pathway genes were assessed in sheep populations across these macro-environmental regions that significantly differ in the diversity and load of pathogens. The mean minor allele frequency (MAF) did not vary between Asian and European sheep reflecting the absence of ascertainment bias. Phylogenetic analysis revealed two major clusters with most of South Asian, South East Asian and South West Asian breeds clustering together while European and South American sheep breeds clustered together distinctly. Analysis of molecular variance revealed strong phylogeographic structure at loci located in immune pathway genes, unlike microsatellite and genome wide SNP markers. To understand the influence of natural selection processes, SNP loci located in chromosome 3 were utilized to reconstruct haplotypes, the diversity of which showed significant deviations from selective neutrality. Reduced Median network of reconstructed haplotypes showed balancing selection in force at these loci. Preliminary association of SNP genotypes with phenotypes recorded 42 days post challenge revealed significant differences (P<0.05) in fecal egg count, body weight change and packed cell volume at two, four and six SNP loci respectively. In conclusion, the present study reports strong phylogeographic structure and balancing selection operating at SNP loci located within immune pathway genes. Further, SNP loci identified in the study were found to have potential for

  5. Haplotype specific alteration of diabetes MHC risk by olfactory receptor gene polymorphism.

    PubMed

    Jahromi, Mohamed M

    2012-12-01

    Evidence for genes associated with risk for Type 1 diabetes (T1D) in the extended region of the major histocompatibility complex (MHC) genes is accumulating. The aim of this study was to investigate the association pattern of the extended MHC region with T1D susceptibility to identify effects independent of well established DR/DQ genes. A total of 394 Europid families with T1D were genotyped for the single nucleotide polymorphism (SNP) in the olfactory receptor family 14, subfamily J, member 1 (OR14J1) gene, rs9257691, in the MHC telomeric region. The OR provides "an internal depiction of our external world" through the capture of odorant molecules in the main OR system by several large families of G-protein coupled receptors (GPCR). These receptors transduce and chemosignals into the central nervous system (CNS). This SNP was chosen to identify its association with T1D. Interestingly, OR14J1C allele was significantly associated with T1D that seems to go with DRB1*0401, Χ(2)=10.9, p=0.0003. However, by fixing both genes of DR*0401-DQB1*0302, high risk, the association of T1D with OR14J1C still existed, Χ(2)=7.4, p=0.005. The occurrence of association of the OR14J1C allele with T1D patients with DRB1*401/DQB1*0302 is an independent risk for T1D. As an accumulative report suggests the role of OR in the pathogenesis of diabetic microvascular and other diabetic complications, undoubtedly, this haplotype specific alteration of T1D risk is an independent risk for the disease and can address the promising MHC-linked gene other than DR/DQ. Moreover, there is nothing to hinder for that this might be a signal that identifies the role of OR gene in the pathogenesis of T1D in patients who are prone to diabetic complications. Copyright © 2012. Published by Elsevier B.V.

  6. Alternative haplotypes of antigen processing genes in zebrafish diverged early in vertebrate evolution

    PubMed Central

    McConnell, Sean C.; Hernandez, Kyle M.; Wcisel, Dustin J.; Kettleborough, Ross N.; Stemple, Derek L.; Andrade, Jorge; de Jong, Jill L. O.

    2016-01-01

    Antigen processing and presentation genes found within the MHC are among the most highly polymorphic genes of vertebrate genomes, providing populations with diverse immune responses to a wide array of pathogens. Here, we describe transcriptome, exome, and whole-genome sequencing of clonal zebrafish, uncovering the most extensive diversity within the antigen processing and presentation genes of any species yet examined. Our CG2 clonal zebrafish assembly provides genomic context within a remarkably divergent haplotype of the core MHC region on chromosome 19 for six expressed genes not found in the zebrafish reference genome: mhc1uga, proteasome-β 9b (psmb9b), psmb8f, and previously unknown genes psmb13b, tap2d, and tap2e. We identify ancient lineages for Psmb13 within a proteasome branch previously thought to be monomorphic and provide evidence of substantial lineage diversity within each of three major trifurcations of catalytic-type proteasome subunits in vertebrates: Psmb5/Psmb8/Psmb11, Psmb6/Psmb9/Psmb12, and Psmb7/Psmb10/Psmb13. Strikingly, nearby tap2 and MHC class I genes also retain ancient sequence lineages, indicating that alternative lineages may have been preserved throughout the entire MHC pathway since early diversification of the adaptive immune system ∼500 Mya. Furthermore, polymorphisms within the three MHC pathway steps (antigen cleavage, transport, and presentation) are each predicted to alter peptide specificity. Lastly, comparative analysis shows that antigen processing gene diversity is far more extensive than previously realized (with ancient coelacanth psmb8 lineages, shark psmb13, and tap2t and psmb10 outside the teleost MHC), implying distinct immune functions and conserved roles in shaping MHC pathway evolution throughout vertebrates. PMID:27493218

  7. HLA-E regulatory and coding region variability and haplotypes in a Brazilian population sample.

    PubMed

    Ramalho, Jaqueline; Veiga-Castelli, Luciana C; Donadi, Eduardo A; Mendes-Junior, Celso T; Castelli, Erick C

    2017-11-01

    The HLA-E gene is characterized by low but wide expression on different tissues. HLA-E is considered a conserved gene, being one of the least polymorphic class I HLA genes. The HLA-E molecule interacts with Natural Killer cell receptors and T lymphocytes receptors, and might activate or inhibit immune responses depending on the peptide associated with HLA-E and with which receptors HLA-E interacts to. Variable sites within the HLA-E regulatory and coding segments may influence the gene function by modifying its expression pattern or encoded molecule, thus, influencing its interaction with receptors and the peptide. Here we propose an approach to evaluate the gene structure, haplotype pattern and the complete HLA-E variability, including regulatory (promoter and 3'UTR) and coding segments (with introns), by using massively parallel sequencing. We investigated the variability of 420 samples from a very admixed population such as Brazilians by using this approach. Considering a segment of about 7kb, 63 variable sites were detected, arranged into 75 extended haplotypes. We detected 37 different promoter sequences (but few frequent ones), 27 different coding sequences (15 representing new HLA-E alleles) and 12 haplotypes at the 3'UTR segment, two of them presenting a summed frequency of 90%. Despite the number of coding alleles, they encode mainly two different full-length molecules, known as E*01:01 and E*01:03, which corresponds to about 90% of all. In addition, differently from what has been previously observed for other non classical HLA genes, the relationship among the HLA-E promoter, coding and 3'UTR haplotypes is not straightforward because the same promoter and 3'UTR haplotypes were many times associated with different HLA-E coding haplotypes. This data reinforces the presence of only two main full-length HLA-E molecules encoded by the many HLA-E alleles detected in our population sample. In addition, this data does indicate that the distal HLA-E promoter is by

  8. Hox gene cluster of the ascidian, Halocynthia roretzi, reveals multiple ancient steps of cluster disintegration during ascidian evolution.

    PubMed

    Sekigami, Yuka; Kobayashi, Takuya; Omi, Ai; Nishitsuji, Koki; Ikuta, Tetsuro; Fujiyama, Asao; Satoh, Noriyuki; Saiga, Hidetoshi

    2017-01-01

    Hox gene clusters with at least 13 paralog group (PG) members are common in vertebrate genomes and in that of amphioxus. Ascidians, which belong to the subphylum Tunicata (Urochordata), are phylogenetically positioned between vertebrates and amphioxus, and traditionally divided into two groups: the Pleurogona and the Enterogona. An enterogonan ascidian, Ciona intestinalis ( Ci ), possesses nine Hox genes localized on two chromosomes; thus, the Hox gene cluster is disintegrated. We investigated the Hox gene cluster of a pleurogonan ascidian, Halocynthia roretzi ( Hr ) to investigate whether Hox gene cluster disintegration is common among ascidians, and if so, how such disintegration occurred during ascidian or tunicate evolution. Our phylogenetic analysis reveals that the Hr Hox gene complement comprises nine members, including one with a relatively divergent Hox homeodomain sequence. Eight of nine Hr Hox genes were orthologous to Ci-Hox1 , 2, 3, 4, 5, 10, 12 and 13. Following the phylogenetic classification into 13 PGs, we designated Hr Hox genes as Hox1, 2, 3, 4, 5, 10, 11/12/13.a , 11/12/13.b and HoxX . To address the chromosomal arrangement of the nine Hox genes, we performed two-color chromosomal fluorescent in situ hybridization, which revealed that the nine Hox genes are localized on a single chromosome in Hr , distinct from their arrangement in Ci . We further examined the order of the nine Hox genes on the chromosome by chromosome/scaffold walking. This analysis suggested a gene order of Hox1 , 11/12/13.b, 11/12/13.a, 10, 5, X, followed by either Hox4, 3, 2 or Hox2, 3, 4 on the chromosome. Based on the present results and those previously reported in Ci , we discuss the establishment of the Hox gene complement and disintegration of Hox gene clusters during the course of ascidian or tunicate evolution. The Hox gene cluster and the genome must have experienced extensive reorganization during the course of evolution from the ancestral tunicate to Hr and Ci

  9. The ergot alkaloid gene cluster: functional analyses and evolutionary aspects.

    PubMed

    Lorenz, Nicole; Haarmann, Thomas; Pazoutová, Sylvie; Jung, Manfred; Tudzynski, Paul

    2009-01-01

    Ergot alkaloids and their derivatives have been traditionally used as therapeutic agents in migraine, blood pressure regulation and help in childbirth and abortion. Their production in submerse culture is a long established biotechnological process. Ergot alkaloids are produced mainly by members of the genus Claviceps, with Claviceps purpurea as best investigated species concerning the biochemistry of ergot alkaloid synthesis (EAS). Genes encoding enzymes involved in EAS have been shown to be clustered; functional analyses of EAS cluster genes have allowed to assign specific functions to several gene products. Various Claviceps species differ with respect to their host specificity and their alkaloid content; comparison of the ergot alkaloid clusters in these species (and of clavine alkaloid clusters in other genera) yields interesting insights into the evolution of cluster structure. This review focuses on recently published and also yet unpublished data on the structure and evolution of the EAS gene cluster and on the function and regulation of cluster genes. These analyses have also significant biotechnological implications: the characterization of non-ribosomal peptide synthetases (NRPS) involved in the synthesis of the peptide moiety of ergopeptines opened interesting perspectives for the synthesis of ergot alkaloids; on the other hand, defined mutants could be generated producing interesting intermediates or only single peptide alkaloids (instead of the alkaloid mixtures usually produced by industrial strains).

  10. Detecting novel SNPs and breed-specific haplotypes at calpastatin gene in Iranian fat- and thin-tailed sheep breeds and their effects on protein structure.

    PubMed

    Aali, Mohsen; Moradi-Shahrbabak, Mohammad; Moradi-Shahrbabak, Hosein; Sadeghi, Mostafa

    2014-03-01

    Calpastatin has been introduced as a potential candidate gene for growth and meat quality traits. In this study, genetic variability was investigated in the exon 6 and its intron boundaries of ovine CAST gene by PCR-SSCP analysis and DNA sequencing. Also a protein sequence and structural analysis were performed to predict the possible impact of amino acid substitutions on physicochemical properties and structure of the CAST protein. A total of 487 animals belonging to four ancient Iranian sheep breeds with different fat metabolisms, Lori-Bakhtiari and Chall (fat-tailed), Zel-Atabay cross-bred (medium fat-tailed) and Zel (thin-tailed), were analyzed. Eight unique SSCP patterns, representing eight different sequences or haplotypes, CAST-1, CAST-2 and CAST-6 to CAST-11, were identified. Haplotypes CAST-1 and CAST-2 were most common with frequency of 0.365 and 0.295. The novel haplotype CAST-8 had considerable frequency in Iranian sheep breeds (0.129). All the consensus sequences showed 98-99%, 94-98%, 92-93% and 82-83% similarity to the published ovine, caprine, bovine and porcine CAST locus sequences, respectively. Sequence analysis revealed four SNPs in intron 5 (C24T, G62A, G65T and T69-) and three SNPs in exon 6 (c.197A>T, c.282G>T and c.296C>G). All three SNPs in exon 6 were missense mutations which would result in p.Gln 66 Leu, p.Glu 94 Asp and p.Pro 99 Arg substitutions, respectively, in CAST protein. All three amino acid substitutions affected the physicochemical properties of ovine CAST protein including hydrophobicity, amphiphilicity and net charge and subsequently might influence its structure and effect on the activity of Ca2+ channels; hence, they might regulate calpain activity and afterwards meat tenderness and growth rate. The Lori-Bakhtiari population showed the highest heterozygosity in the ovine CAST locus (0.802). Frequency difference of haplotypes CAST-10 and CAST-8 between Lori-Bakhtiari (fat-tailed) and Zel (thin-tailed) breeds was highly

  11. Genetic polymorphisms in MDR1 and CYP3A4 genes in Asians and the influence of MDR1 haplotypes on cyclosporin disposition in heart transplant recipients.

    PubMed

    Chowbay, Balram; Cumaraswamy, Sivathasan; Cheung, Yin Bun; Zhou, Qingyu; Lee, Edmund J D

    2003-02-01

    Intestinal cytochrome P450 3A4 (CYP3A4) and P-glycoprotein (P-gp) both play a vital role in the metabolism of oral cyclosporine (CsA). We investigated the genetic polymorphisms in CYP3A4(promoter region and exons 5, 7 and 9) and MDR1 (exons 12, 21 and 26) genes and the impact of these polymorphisms on the pharmacokinetics of oral CsA in stable heart transplant patients (n = 14). CYP3A4 polymorphisms were rare in the Asian population and transplant patients. Haplotype analysis revealed 12 haplotypes in the Chinese, eight in the Malays and 10 in the Indians. T-T-T was the most common haplotype in all ethnic groups. The frequency of the homozygous mutant genotype at all three loci (TT-TT-TT) was highest in the Indians (31%) compared to 19% and 15% in the Chinese and Malays, respectively. In heart transplant patients, CsA exposure (AUC(0-4 h), AUC(0-12 h) and C(max)) was high in patients with the T-T-T haplotypes compared to those with C-G-C haplotypes. These findings suggest that haplotypes rather than genotypes influence CsA disposition in transplant patients.

  12. Vitamin K epoxide reductase complex subunit 1 (Vkorc1) haplotype diversity in mouse priority strains

    PubMed Central

    Song, Ying; Vera, Nicole; Kohn, Michael H

    2008-01-01

    Background Polymorphisms in the vitamin K-epoxide reductase complex subunit 1 gene, Vkorc1, could affect blood coagulation and other vitamin K-dependent proteins, such as osteocalcin (bone Gla protein, BGP). Here we sequenced the Vkorc1 gene in 40 mouse priority strains. We analyzed Vkorc1 haplotypes with respect to prothrombin time (PT) and bone mineral density and composition (BMD and BMC); phenotypes expected to be vitamin K-dependent and represented by data in the Mouse Phenome Database (MPD). Findings In the commonly used laboratory strains of Mus musculus domesticus we identified only four haplotypes differing in the intron or 5' region sequence of the Vkorc1. Six haplotypes differing by coding and non-coding polymorphisms were identified in the other subspecies of Mus. We detected no significant association of Vkorc1 haplotypes with PT, BMD and BMC within each subspecies of Mus. Vkorc1 haplotype sequences divergence between subspecies was associated with PT, BMD and BMC. Conclusion Phenotypic variation in PT, BMD and BMC within subspecies of Mus, while substantial, appears to be dominated by genetic variation in genes other than the Vkorc1. This was particularly evident for M. m. domesticus, where a single haplotype was observed in conjunction with virtually the entire range of PT, BMD and BMC values of all 5 subspecies of Mus included in this study. Differences in these phenotypes between subspecies also should not be attributed to Vkorc1 variants, but should be viewed as a result of genome wide genetic divergence. PMID:19046458

  13. Transfusion strategy for weak D type 4.0 based on RHD alleles and RH haplotypes in Tunisia

    PubMed Central

    Ouchari, Mouna; Srivastava, Kshitij; Romdhane, Houda; Yacoub, Saloua Jemni; Flegel, Willy Albert

    2017-01-01

    Background With more than 460 RHD alleles, this gene is the most complex and polymorphic among all blood group systems. The Tunisian population has the largest known prevalence of weak D type 4.0 alleles, occurring in 1 of 105 RH haplotypes. We aimed to establish a rationale for the transfusion strategy of weak D type 4.0 in Tunisia. Study design and methods Donors were randomly screened for the serological weak D phenotype. The RHD coding sequence and parts of the introns were sequenced. To establish the RH haplotype, the RHCE gene was tested for characteristic single nucleotide positions. Results We determined all RHD alleles and the RH haplotypes coding for the serologic weak D phenotype among 13,431 Tunisian donations. A serologic weak D phenotype was found in 67 individuals (0.50%). Among them, 60 carried a weak D type 4 allele: 53 weak D type 4.0, 6 weak D type 4.2.2 (DAR), and 1 weak D type 4.1. Another 4 donors had 1 variant allele each: DVII, weak D type 1, weak D type 3, and weak D type 100, while 3 donors showed a normal RHD sequence. The weak D type 4.0 was most often linked to RHCE*ceVS.04.01, weak D type 4.2.2 to RHCE*ceAR, and weak D type 4.1 to RHCE*ceVS.02, while the other RHD alleles were linked to one of the common RHCE alleles. Conclusions Among the weak D phenotypes in Tunisia, no novel RHD allele was found and almost 90% were caused by alleles of the weak D type 4 cluster, of which 88% represented the weak D type 4.0 allele. Based on established RH haplotypes for variant RHD and RHCE alleles and the lack of adverse clinical reports, we recommend D positive transfusions for patients with weak D type 4.0 in Tunisia. PMID:29193104

  14. Transfusion strategy for weak D Type 4.0 based on RHD alleles and RH haplotypes in Tunisia.

    PubMed

    Ouchari, Mouna; Srivastava, Kshitij; Romdhane, Houda; Jemni Yacoub, Saloua; Flegel, Willy Albert

    2018-02-01

    With more than 460 RHD alleles, this gene is the most complex and polymorphic among all blood group systems. The Tunisian population has the largest known prevalence of weak D Type 4.0 alleles, occurring in one of 105 RH haplotypes. We aimed to establish a rationale for the transfusion strategy of weak D Type 4.0 in Tunisia. Donors were randomly screened for the serologic weak D phenotype. The RHD coding sequence and parts of the introns were sequenced. To establish the RH haplotype, the RHCE gene was tested for characteristic single-nucleotide positions. We determined all RHD alleles and the RH haplotypes coding for the serologic weak D phenotype among 13,431 Tunisian donations. A serologic weak D phenotype was found in 67 individuals (0.50%). Among them, 60 carried a weak D Type 4 allele: 53 weak D Type 4.0, six weak D Type 4.2.2 (DAR), and one weak D Type 4.1. An additional four donors had one variant allele each: DVII, weak D Type 1, weak D Type 3, and weak D type 100, while three donors showed a normal RHD sequence. The weak D Type 4.0 was most often linked to RHCE*ceVS.04.01, weak D Type 4.2.2 to RHCE*ceAR, and weak D Type 4.1 to RHCE*ceVS.02, while the other RHD alleles were linked to one of the common RHCE alleles. Among the weak D phenotypes in Tunisia, no novel RHD allele was found and almost 90% were caused by alleles of the weak D Type 4 cluster, of which 88% represented the weak D Type 4.0 allele. Based on established RH haplotypes for variant RHD and RHCE alleles and the lack of adverse clinical reports, we recommend D+ transfusions for patients with weak D Type 4.0 in Tunisia. © 2017 AABB.

  15. Clustered Genes Involved in Cyclopiazonic Acid Production are Next to the Aflatoxin Biosynthesis Gene Cluster in Aspergillus flavus

    USDA-ARS?s Scientific Manuscript database

    Cyclopiazonic acid (CPA), an indole-tetramic acid toxin, is produced by many species of Aspergillus and Penicillium. In addition to CPA Aspergillus flavus produces polyketide-derived carcinogenic aflatoxins (AFs). AF biosynthesis genes form a gene cluster in a subtelomeric region. Isolates of A. fla...

  16. Unusual Gene Order and Organization of the Sea Urchin Hox Cluster

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Cameron, R A; Rowen, L; Nesbitt, R

    2005-10-11

    The highly consistent gene order and axial colinear expression patterns found in vertebrate hox gene clusters are less well conserved across the rest of bilaterians. We report the first deuterostome instance of an intact hox cluster with a unique gene order where the paralog groups are not expressed in a sequential manner. The finished sequence from BAC clones from the genome of the sea urchin, Strongylocentrotus purpuratus, reveals a gene order wherein the anterior genes (Hox1, Hox2 and Hox3) lie nearest the posterior genes in the cluster such that the most 3 gene is Hox5. (The gene order is :more » 5-Hox1, 2, 3, 11/13c, 11/13b, 11/13a, 9/10, 8, 7, 6, 5 - 3). The finished sequence result is corroborated by restriction mapping evidence and BAC-end scaffold analyses. Comparisons with a putative ancestral deuterostome Hox gene cluster suggest that the rearrangements leading to the sea urchin gene order were many and complex.« less

  17. Unusual Gene Order and Organization of the Sea Urchin HoxCluster

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Richardson, Paul M.; Lucas, Susan; Cameron, R. Andrew

    2005-05-10

    The highly consistent gene order and axial colinear expression patterns found in vertebrate hox gene clusters are less well conserved across the rest of bilaterians. We report the first deuterostome instance of an intact hox cluster with a unique gene order where the paralog groups are not expressed in a sequential manner. The finished sequence from BAC clones from the genome of the sea urchin, Strongylocentrotus purpuratus, reveals a gene order wherein the anterior genes (Hox1, Hox2 and Hox3) lie nearest the posterior genes in the cluster such that the most 3' gene is Hox5. (The gene order is :more » 5'-Hox1,2, 3, 11/13c, 11/13b, '11/13a, 9/10, 8, 7, 6, 5 - 3)'. The finished sequence result is corroborated by restriction mapping evidence and BAC-end scaffold analyses. Comparisons with a putative ancestral deuterostome Hox gene cluster suggest that the rearrangements leading to the sea urchin gene order were many and complex.« less

  18. clusterProfiler: an R package for comparing biological themes among gene clusters.

    PubMed

    Yu, Guangchuang; Wang, Li-Gen; Han, Yanyan; He, Qing-Yu

    2012-05-01

    Increasing quantitative data generated from transcriptomics and proteomics require integrative strategies for analysis. Here, we present an R package, clusterProfiler that automates the process of biological-term classification and the enrichment analysis of gene clusters. The analysis module and visualization module were combined into a reusable workflow. Currently, clusterProfiler supports three species, including humans, mice, and yeast. Methods provided in this package can be easily extended to other species and ontologies. The clusterProfiler package is released under Artistic-2.0 License within Bioconductor project. The source code and vignette are freely available at http://bioconductor.org/packages/release/bioc/html/clusterProfiler.html.

  19. IL7Rα Expression and Upregulation by IFNβ in Dendritic Cell Subsets Is Haplotype-Dependent

    PubMed Central

    McKay, Fiona C.; Hoe, Edwin; Parnell, Grant; Gatt, Prudence; Schibeci, Stephen D.; Stewart, Graeme J.; Booth, David R.

    2013-01-01

    The IL7Rα gene is unequivocally associated with susceptibility to multiple sclerosis (MS). Haplotype 2 (Hap 2) confers protection from MS, and T cells and dendritic cells (DCs) of Hap 2 exhibit reduced splicing of exon 6, resulting in production of relatively less soluble receptor, and potentially more response to ligand. We have previously shown in CD4 T cells that IL7Rα haplotypes 1 and 2, but not 4, respond to interferon beta (IFNβ), the most commonly used immunomodulatory drug in MS, and that haplotype 4 (Hap 4) homozygotes have the highest risk of developing MS. We now show that IL7R expression increases in myeloid cells in response to IFNβ, but that the response is haplotype-dependent, with cells from homozygotes for Hap 4 again showing no response. This was shown using freshly derived monocytes, in vitro cultured immature and mature monocyte-derived dendritic cells, and by comparing homozygotes for the common haplotypes, and relative expression of alleles in heterozygotes (Hap 4 vs not Hap 4). As for T cells, in all myeloid cell subsets examined, Hap 2 homozygotes showed a trend for reduced splicing of exon 6 compared to the other haplotypes, significantly so in most conditions. These data are consistent with increased signaling being protective from MS, constitutively and in response to IFNβ. We also demonstrate significant regulation of immune response, chemokine activity and cytokine biosynthesis pathways by IL7Rα signaling in IFNβ -treated myeloid subsets. IFNβ-responsive genes are over-represented amongst genes associated with MS susceptibility. IL7Rα haplotype may contribute to MS susceptibility through reduced capacity for IL7Rα signalling in myeloid cells, especially in the presence of IFNβ, and is currently under investigation as a predictor of therapeutic response. PMID:24147013

  20. From hormones to secondary metabolism: the emergence of metabolic gene clusters in plants.

    PubMed

    Chu, Hoi Yee; Wegel, Eva; Osbourn, Anne

    2011-04-01

    Gene clusters for the synthesis of secondary metabolites are a common feature of microbial genomes. Well-known examples include clusters for the synthesis of antibiotics in actinomycetes, and also for the synthesis of antibiotics and toxins in filamentous fungi. Until recently it was thought that genes for plant metabolic pathways were not clustered, and this is certainly true in many cases; however, five plant secondary metabolic gene clusters have now been discovered, all of them implicated in synthesis of defence compounds. An obvious assumption might be that these eukaryotic gene clusters have arisen by horizontal gene transfer from microbes, but there is compelling evidence to indicate that this is not the case. This raises intriguing questions about how widespread such clusters are, what the significance of clustering is, why genes for some metabolic pathways are clustered and those for others are not, and how these clusters form. In answering these questions we may hope to learn more about mechanisms of genome plasticity and adaptive evolution in plants. It is noteworthy that for the five plant secondary metabolic gene clusters reported so far, the enzymes for the first committed steps all appear to have been recruited directly or indirectly from primary metabolic pathways involved in hormone synthesis. This may or may not turn out to be a common feature of plant secondary metabolic gene clusters as new clusters emerge. © 2011 The Authors. The Plant Journal © 2011 Blackwell Publishing Ltd.

  1. Drivers of genetic diversity in secondary metabolic gene clusters within a fungal species

    PubMed Central

    Lind, Abigail L.; Wisecaver, Jennifer H.; Lameiras, Catarina; Wiemann, Philipp; Palmer, Jonathan M.; Keller, Nancy P.; Rodrigues, Fernando; Goldman, Gustavo H.

    2017-01-01

    Filamentous fungi produce a diverse array of secondary metabolites (SMs) critical for defense, virulence, and communication. The metabolic pathways that produce SMs are found in contiguous gene clusters in fungal genomes, an atypical arrangement for metabolic pathways in other eukaryotes. Comparative studies of filamentous fungal species have shown that SM gene clusters are often either highly divergent or uniquely present in one or a handful of species, hampering efforts to determine the genetic basis and evolutionary drivers of SM gene cluster divergence. Here, we examined SM variation in 66 cosmopolitan strains of a single species, the opportunistic human pathogen Aspergillus fumigatus. Investigation of genome-wide within-species variation revealed 5 general types of variation in SM gene clusters: nonfunctional gene polymorphisms; gene gain and loss polymorphisms; whole cluster gain and loss polymorphisms; allelic polymorphisms, in which different alleles corresponded to distinct, nonhomologous clusters; and location polymorphisms, in which a cluster was found to differ in its genomic location across strains. These polymorphisms affect the function of representative A. fumigatus SM gene clusters, such as those involved in the production of gliotoxin, fumigaclavine, and helvolic acid as well as the function of clusters with undefined products. In addition to enabling the identification of polymorphisms, the detection of which requires extensive genome-wide synteny conservation (e.g., mobile gene clusters and nonhomologous cluster alleles), our approach also implicated multiple underlying genetic drivers, including point mutations, recombination, and genomic deletion and insertion events as well as horizontal gene transfer from distant fungi. Finally, most of the variants that we uncover within A. fumigatus have been previously hypothesized to contribute to SM gene cluster diversity across entire fungal classes and phyla. We suggest that the drivers of genetic

  2. [Association between CETP polymorphisms and haplotypes with dyslipidemia in Xinjiang Uygur and Kazak residents].

    PubMed

    Hu, Y H; Liu, J M; Zhang, M; He, J; Yan, Y Z; Ma, J L; Ma, R L; Guo, H; Rui, D S; Sun, F; Mu, L L; Niu, Q; Ding, Y S; Zhang, J Y; Li, S G; Guo, S X

    2016-08-24

    To explore the relationship between the polymorphisms and haplotypes in the CETP gene and dyslipidemia among Xinjiang Kazak and Uygur residents. A population status survey was performed from 2010 to 2011 in Kashgar Xinjiang Uygur and Kazak residents, stratified cluster sampling method was used to select Uygur, Kazak residents with abnormal blood lipid values (n=367 and 345, respectively) as the dyslipidemia groups, and to select residents with normal lipid values as control group from the same area (n=374 and 390, respectively). SNaPshot technology was applied to detect the DNA of CETP gene rs3764261, rs1800775, rs708272 and rs5882 loci in all selected residents, and linkage disequilibrium analysis and haplotype construction were performed. (1) In Uygur residents, the dyslipidemia risk of rs708272 CT (OR=0.64, 95%CI 0.46-0.91, P=0.01) and TT genotype (OR=0.60, 95%CI 0.40-0.91, P=0.02) was significantly lower than CC genotype. Dyslipidemia risk of rs3764261 GT (OR=0.55, 95%CI 0.40-0.74, P=0.00) and TT genotype (OR=0.47, 95%CI 0.28-0.78, P<0.01) was significantly lower than GG genetype. Dyslipidemia risk of the rs1800775 CC genotype was higher than AA genotype (OR=1.79, 95%CI 1.17-2.74, P=0.01). There was no statistical significance in CETP gene of the 4 genotype and allele frequency between the dyslipidemia and normal lipid groups in Kazak residents (all P>0.05). (2) In Uighur residents with dyslipidemia, HDL-C level was significantly higher in rs708272 TT genotype carriers than in CC and CT genotypes (all P<0.05) and in rs3764261 TT genotype carriers than in GG genotype carriers (P=0.008), while was significantly lower in rs1800775 CC genotype carriers with AA genotype carriers (P=0.008). (3) Linkage disequilibrium analysis showed that there was strong linkage disequilibrium between rs3764261 and rs708272 (D'=0.869, r(2)=0.869), rs1800775 and rs708272 (D'=0.845, r(2)=0.446) in Uighur residents, and there was strong linkage disequilibrium between rs3764261 and rs

  3. Efficient algorithms for polyploid haplotype phasing.

    PubMed

    He, Dan; Saha, Subrata; Finkers, Richard; Parida, Laxmi

    2018-05-09

    Inference of haplotypes, or the sequence of alleles along the same chromosomes, is a fundamental problem in genetics and is a key component for many analyses including admixture mapping, identifying regions of identity by descent and imputation. Haplotype phasing based on sequencing reads has attracted lots of attentions. Diploid haplotype phasing where the two haplotypes are complimentary have been studied extensively. In this work, we focused on Polyploid haplotype phasing where we aim to phase more than two haplotypes at the same time from sequencing data. The problem is much more complicated as the search space becomes much larger and the haplotypes do not need to be complimentary any more. We proposed two algorithms, (1) Poly-Harsh, a Gibbs Sampling based algorithm which alternatively samples haplotypes and the read assignments to minimize the mismatches between the reads and the phased haplotypes, (2) An efficient algorithm to concatenate haplotype blocks into contiguous haplotypes. Our experiments showed that our method is able to improve the quality of the phased haplotypes over the state-of-the-art methods. To our knowledge, our algorithm for haplotype blocks concatenation is the first algorithm that leverages the shared information across multiple individuals to construct contiguous haplotypes. Our experiments showed that it is both efficient and effective.

  4. Analysis of lamprey clustered Fox genes: insight into Fox gene evolution and expression in vertebrates.

    PubMed

    Wotton, Karl R; Shimeld, Sebastian M

    2011-12-01

    In the human genome, members of the FoxC, FoxF, FoxL1, and FoxQ1 gene families are found in two paralagous clusters. One cluster contains the genes FOXQ1, FOXF2, FOXC1 and the second consists of FOXF1, FOXC2, and FOXL1. In jawed vertebrates these genes are known to be expressed in different pharyngeal tissues and all, except FoxQ1, are involved in patterning the early embryonic mesoderm. We have previously traced the evolution of this cluster in the bony vertebrates, and the gene content is identical in the dogfish, a member of the most basally branching lineage of the jawed vertebrates. Here we extend these analyses to jawless vertebrates. Using genomic searches and molecular approaches we have identified homologues of these genes from lampreys. We identify two FoxC genes, two FoxF genes, two FoxQ1 genes and single FoxL1 gene. We examine the embryonic expression of one predominantly mesodermally expressed gene family, FoxC, and the endodermally expressed member of the cluster, FoxQ1. We identified FoxQ1 transcripts in the pharyngeal endoderm, while the two FoxC genes are differentially expressed in the pharyngeal mesenchyme and ectoderm. Furthermore we identify conserved expression of lamprey FoxC genes in the paraxial and intermediate mesoderms. We interpret our results through a chordate-wide comparison of expression patterns and discuss gene content in the context of theories on the evolution of the vertebrate genome. 2011 Elsevier B.V. All rights reserved.

  5. An unusual haplotype structure on human chromosome 8p23 derived from the inversion polymorphism.

    PubMed

    Deng, Libin; Zhang, Yuezheng; Kang, Jian; Liu, Tao; Zhao, Hongbin; Gao, Yang; Li, Chaohua; Pan, Hao; Tang, Xiaoli; Wang, Dunmei; Niu, Tianhua; Yang, Huanming; Zeng, Changqing

    2008-10-01

    Chromosomal inversion is an important type of genomic variations involved in both evolution and disease pathogenesis. Here, we describe the refined genetic structure of a 3.8-Mb inversion polymorphism at chromosome 8p23. Using HapMap data of 1,073 SNPs generated from 209 unrelated samples from CEPH-Utah residents with ancestry from northern and western Europe (CEU); Yoruba in Ibadan, Nigeria (YRI); and Asian (ASN) samples, which were comprised of Han Chinese from Beijing, China (CHB) and Japanese from Tokyo, Japan (JPT)-we successfully deduced the inversion orientations of all their 418 haplotypes. In particular, distinct haplotype subgroups were identified based on principal component analysis (PCA). Such genetic substructures were consistent with clustering patterns based on neighbor-joining tree reconstruction, which revealed a total of four haplotype clades across all samples. Metaphase fluorescence in situ hybridization (FISH) in a subset of 10 HapMap samples verified their inversion orientations predicted by PCA or phylogenetic tree reconstruction. Positioning of the outgroup haplotype within one of YRI clades suggested that Human NCBI Build 36-inverted order is most likely the ancestral orientation. Furthermore, the population differentiation test and the relative extended haplotype homozygosity (REHH) analysis in this region discovered multiple selection signals, also in a population-specific manner. A positive selection signal was detected at XKR6 in the ASN population. These results revealed the correlation of inversion polymorphisms to population-specific genetic structures, and various selection patterns as possible mechanisms for the maintenance of a large chromosomal rearrangement at 8p23 region during evolution. In addition, our study also showed that haplotype-based clustering methods, such as PCA, can be applied in scanning for cryptic inversion polymorphisms at a genome-wide scale.

  6. Recombinant Rp1 genes confer necrotic or nonspecific resistance phenotypes.

    PubMed

    Smith, Shavannor M; Steinau, Martin; Trick, Harold N; Hulbert, Scot H

    2010-06-01

    Genes at the Rp1 rust resistance locus of maize confer race-specific resistance to the common rust fungus Puccinia sorghi. Three variant genes with nonspecific effects (HRp1 -Kr1N, -D*21 and -MD*19) were found to be generated by intragenic crossing over within the LRR region. The LRR region of most NBS-LRR encoding genes is quite variable and codes for one of the regions in resistance gene proteins that controls specificity. Sequence comparisons demonstrated that the Rp1-Kr1N recombinant gene was identical to the N-terminus of the rp1-kp2 gene and C-terminus of another gene from its HRp1-K grandparent. The Rp1-D*21 recombinant gene consists of the N-terminus of the rp1-dp2 gene and C-terminus of the Rp1-D gene from the parental haplotype. Similarly, a recombinant gene from the Rp1-MD*19 haplotype has the N-terminus of an rp1 gene from the HRp1-M parent and C-terminus of the rp1-D19 gene from the HRp1-D parent. The recombinant Rp1 -Kr1N, -D*21 and -MD*19 genes activated defense responses in the absence of their AVR proteins triggering HR (hypersensitive response) in the absence of the pathogen. The results indicate that the frequent intragenic recombination events that occur in the Rp1 gene cluster not only recombine the genes into novel haplotypes, but also create genes with nonspecific effects. Some of these may contribute to nonspecific quantitative resistance but others have severe consequences for the fitness of the plant.

  7. Identification and Functional Analysis of the Nocardithiocin Gene Cluster in Nocardia pseudobrasiliensis

    PubMed Central

    Sakai, Kanae; Komaki, Hisayuki; Gonoi, Tohru

    2015-01-01

    Nocardithiocin is a thiopeptide compound isolated from the opportunistic pathogen Nocardia pseudobrasiliensis. It shows a strong activity against acid-fast bacteria and is also active against rifampicin-resistant Mycobacterium tuberculosis. Here, we report the identification of the nocardithiocin gene cluster in N. pseudobrasiliensis IFM 0761 based on conserved thiopeptide biosynthesis gene sequence and the whole genome sequence. The predicted gene cluster was confirmed by gene disruption and complementation. As expected, strains containing the disrupted gene did not produce nocardithiocin while gene complementation restored nocardithiocin production in these strains. The predicted cluster was further analyzed using RNA-seq which showed that the nocardithiocin gene cluster contains 12 genes within a 15.2-kb region. This finding will promote the improvement of nocardithiocin productivity and its derivatives production. PMID:26588225

  8. Lampreys, the jawless vertebrates, contain only two ParaHox gene clusters.

    PubMed

    Zhang, Huixian; Ravi, Vydianathan; Tay, Boon-Hui; Tohari, Sumanty; Pillai, Nisha E; Prasad, Aravind; Lin, Qiang; Brenner, Sydney; Venkatesh, Byrappa

    2017-08-22

    ParaHox genes ( Gsx , Pdx , and Cdx ) are an ancient family of developmental genes closely related to the Hox genes. They play critical roles in the patterning of brain and gut. The basal chordate, amphioxus, contains a single ParaHox cluster comprising one member of each family, whereas nonteleost jawed vertebrates contain four ParaHox genomic loci with six or seven ParaHox genes. Teleosts, which have experienced an additional whole-genome duplication, contain six ParaHox genomic loci with six ParaHox genes. Jawless vertebrates, represented by lampreys and hagfish, are the most ancient group of vertebrates and are crucial for understanding the origin and evolution of vertebrate gene families. We have previously shown that lampreys contain six Hox gene loci. Here we report that lampreys contain only two ParaHox gene clusters (designated as α- and β-clusters) bearing five ParaHox genes ( Gsxα , Pdxα , Cdxα , Gsxβ , and Cdxβ ). The order and orientation of the three genes in the α-cluster are identical to that of the single cluster in amphioxus. However, the orientation of Gsxβ in the β-cluster is inverted. Interestingly, Gsxβ is expressed in the eye, unlike its homologs in jawed vertebrates, which are expressed mainly in the brain. The lamprey Pdxα is expressed in the pancreas similar to jawed vertebrate Pdx genes, indicating that the pancreatic expression of Pdx was acquired before the divergence of jawless and jawed vertebrate lineages. It is likely that the lamprey Pdxα plays a crucial role in pancreas specification and insulin production similar to the Pdx of jawed vertebrates.

  9. Genome-Wide Prediction of Metabolic Enzymes, Pathways, and Gene Clusters in Plants

    DOE PAGES

    Schläpfer, Pascal; Zhang, Peifen; Wang, Chuan; ...

    2017-04-01

    Plant metabolism underpins many traits of ecological and agronomic importance. Plants produce numerous compounds to cope with their environments but the biosynthetic pathways for most of these compounds have not yet been elucidated. To engineer and improve metabolic traits, we will need comprehensive and accurate knowledge of the organization and regulation of plant metabolism at the genome scale. Here, we present a computational pipeline to identify metabolic enzymes, pathways, and gene clusters from a sequenced genome. Using this pipeline, we generated metabolic pathway databases for 22 species and identified metabolic gene clusters from 18 species. This unified resource can bemore » used to conduct a wide array of comparative studies of plant metabolism. Using the resource, we discovered a widespread occurrence of metabolic gene clusters in plants: 11,969 clusters from 18 species. The prevalence of metabolic gene clusters offers an intriguing possibility of an untapped source for uncovering new metabolite biosynthesis pathways. For example, more than 1,700 clusters contain enzymes that could generate a specialized metabolite scaffold (signature enzymes) and enzymes that modify the scaffold (tailoring enzymes). In four species with sufficient gene expression data, we identified 43 highly coexpressed clusters that contain signature and tailoring enzymes, of which eight were characterized previously to be functional pathways. Finally, we identified patterns of genome organization that implicate local gene duplication and, to a lesser extent, single gene transposition as having played roles in the evolution of plant metabolic gene clusters.« less

  10. Genome-Wide Prediction of Metabolic Enzymes, Pathways, and Gene Clusters in Plants

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Schläpfer, Pascal; Zhang, Peifen; Wang, Chuan

    Plant metabolism underpins many traits of ecological and agronomic importance. Plants produce numerous compounds to cope with their environments but the biosynthetic pathways for most of these compounds have not yet been elucidated. To engineer and improve metabolic traits, we will need comprehensive and accurate knowledge of the organization and regulation of plant metabolism at the genome scale. Here, we present a computational pipeline to identify metabolic enzymes, pathways, and gene clusters from a sequenced genome. Using this pipeline, we generated metabolic pathway databases for 22 species and identified metabolic gene clusters from 18 species. This unified resource can bemore » used to conduct a wide array of comparative studies of plant metabolism. Using the resource, we discovered a widespread occurrence of metabolic gene clusters in plants: 11,969 clusters from 18 species. The prevalence of metabolic gene clusters offers an intriguing possibility of an untapped source for uncovering new metabolite biosynthesis pathways. For example, more than 1,700 clusters contain enzymes that could generate a specialized metabolite scaffold (signature enzymes) and enzymes that modify the scaffold (tailoring enzymes). In four species with sufficient gene expression data, we identified 43 highly coexpressed clusters that contain signature and tailoring enzymes, of which eight were characterized previously to be functional pathways. Finally, we identified patterns of genome organization that implicate local gene duplication and, to a lesser extent, single gene transposition as having played roles in the evolution of plant metabolic gene clusters.« less

  11. Haplotype Phasing and Inheritance of Copy Number Variants in Nuclear Families

    PubMed Central

    Palta, Priit; Kaplinski, Lauris; Nagirnaja, Liina; Veidenberg, Andres; Möls, Märt; Nelis, Mari; Esko, Tõnu; Metspalu, Andres; Laan, Maris; Remm, Maido

    2015-01-01

    DNA copy number variants (CNVs) that alter the copy number of a particular DNA segment in the genome play an important role in human phenotypic variability and disease susceptibility. A number of CNVs overlapping with genes have been shown to confer risk to a variety of human diseases thus highlighting the relevance of addressing the variability of CNVs at a higher resolution. So far, it has not been possible to deterministically infer the allelic composition of different haplotypes present within the CNV regions. We have developed a novel computational method, called PiCNV, which enables to resolve the haplotype sequence composition within CNV regions in nuclear families based on SNP genotyping microarray data. The algorithm allows to i) phase normal and CNV-carrying haplotypes in the copy number variable regions, ii) resolve the allelic copies of rearranged DNA sequence within the haplotypes and iii) infer the heritability of identified haplotypes in trios or larger nuclear families. To our knowledge this is the first program available that can deterministically phase null, mono-, di-, tri- and tetraploid genotypes in CNV loci. We applied our method to study the composition and inheritance of haplotypes in CNV regions of 30 HapMap Yoruban trios and 34 Estonian families. For 93.6% of the CNV loci, PiCNV enabled to unambiguously phase normal and CNV-carrying haplotypes and follow their transmission in the corresponding families. Furthermore, allelic composition analysis identified the co-occurrence of alternative allelic copies within 66.7% of haplotypes carrying copy number gains. We also observed less frequent transmission of CNV-carrying haplotypes from parents to children compared to normal haplotypes and identified an emergence of several de novo deletions and duplications in the offspring. PMID:25853576

  12. Haplotype phasing and inheritance of copy number variants in nuclear families.

    PubMed

    Palta, Priit; Kaplinski, Lauris; Nagirnaja, Liina; Veidenberg, Andres; Möls, Märt; Nelis, Mari; Esko, Tõnu; Metspalu, Andres; Laan, Maris; Remm, Maido

    2015-01-01

    DNA copy number variants (CNVs) that alter the copy number of a particular DNA segment in the genome play an important role in human phenotypic variability and disease susceptibility. A number of CNVs overlapping with genes have been shown to confer risk to a variety of human diseases thus highlighting the relevance of addressing the variability of CNVs at a higher resolution. So far, it has not been possible to deterministically infer the allelic composition of different haplotypes present within the CNV regions. We have developed a novel computational method, called PiCNV, which enables to resolve the haplotype sequence composition within CNV regions in nuclear families based on SNP genotyping microarray data. The algorithm allows to i) phase normal and CNV-carrying haplotypes in the copy number variable regions, ii) resolve the allelic copies of rearranged DNA sequence within the haplotypes and iii) infer the heritability of identified haplotypes in trios or larger nuclear families. To our knowledge this is the first program available that can deterministically phase null, mono-, di-, tri- and tetraploid genotypes in CNV loci. We applied our method to study the composition and inheritance of haplotypes in CNV regions of 30 HapMap Yoruban trios and 34 Estonian families. For 93.6% of the CNV loci, PiCNV enabled to unambiguously phase normal and CNV-carrying haplotypes and follow their transmission in the corresponding families. Furthermore, allelic composition analysis identified the co-occurrence of alternative allelic copies within 66.7% of haplotypes carrying copy number gains. We also observed less frequent transmission of CNV-carrying haplotypes from parents to children compared to normal haplotypes and identified an emergence of several de novo deletions and duplications in the offspring.

  13. Horizontal transfer of a large and highly toxic secondary metabolic gene cluster between fungi.

    PubMed

    Slot, Jason C; Rokas, Antonis

    2011-01-25

    Genes involved in intermediary and secondary metabolism in fungi are frequently physically linked or clustered. For example, in Aspergillus nidulans the entire pathway for the production of sterigmatocystin (ST), a highly toxic secondary metabolite and a precursor to the aflatoxins (AF), is located in a ∼54 kb, 23 gene cluster. We discovered that a complete ST gene cluster in Podospora anserina was horizontally transferred from Aspergillus. Phylogenetic analysis shows that most Podospora cluster genes are adjacent to or nested within Aspergillus cluster genes, although the two genera belong to different taxonomic classes. Furthermore, the Podospora cluster is highly conserved in content, sequence, and microsynteny with the Aspergillus ST/AF clusters and its intergenic regions contain 14 putative binding sites for AflR, the transcription factor required for activation of the ST/AF biosynthetic genes. Examination of ∼52,000 Podospora expressed sequence tags identified transcripts for 14 genes in the cluster, with several expressed at multiple life cycle stages. The presence of putative AflR-binding sites and the expression evidence for several cluster genes, coupled with the recent independent discovery of ST production in Podospora [1], suggest that this HGT event probably resulted in a functional cluster. Given the abundance of metabolic gene clusters in fungi, our finding that one of the largest known metabolic gene clusters moved intact between species suggests that such transfers might have significantly contributed to fungal metabolic diversity. PAPERFLICK: Copyright © 2011 Elsevier Ltd. All rights reserved.

  14. Determination of haplotypes at structurally complex regions using emulsion haplotype fusion PCR.

    PubMed

    Tyson, Jess; Armour, John A L

    2012-12-11

    Genotyping and massively-parallel sequencing projects result in a vast amount of diploid data that is only rarely resolved into its constituent haplotypes. It is nevertheless this phased information that is transmitted from one generation to the next and is most directly associated with biological function and the genetic causes of biological effects. Despite progress made in genome-wide sequencing and phasing algorithms and methods, problems assembling (and reconstructing linear haplotypes in) regions of repetitive DNA and structural variation remain. These dynamic and structurally complex regions are often poorly understood from a sequence point of view. Regions such as these that are highly similar in their sequence tend to be collapsed onto the genome assembly. This is turn means downstream determination of the true sequence haplotype in these regions poses a particular challenge. For structurally complex regions, a more focussed approach to assembling haplotypes may be required. In order to investigate reconstruction of spatial information at structurally complex regions, we have used an emulsion haplotype fusion PCR approach to reproducibly link sequences of up to 1kb in length to allow phasing of multiple variants from neighbouring loci, using allele-specific PCR and sequencing to detect the phase. By using emulsion systems linking flanking regions to amplicons within the CNV, this led to the reconstruction of a 59kb haplotype across the DEFA1A3 CNV in HapMap individuals. This study has demonstrated a novel use for emulsion haplotype fusion PCR in addressing the issue of reconstructing structural haplotypes at multiallelic copy variable regions, using the DEFA1A3 locus as an example.

  15. Determination of haplotypes at structurally complex regions using emulsion haplotype fusion PCR

    PubMed Central

    2012-01-01

    Background Genotyping and massively-parallel sequencing projects result in a vast amount of diploid data that is only rarely resolved into its constituent haplotypes. It is nevertheless this phased information that is transmitted from one generation to the next and is most directly associated with biological function and the genetic causes of biological effects. Despite progress made in genome-wide sequencing and phasing algorithms and methods, problems assembling (and reconstructing linear haplotypes in) regions of repetitive DNA and structural variation remain. These dynamic and structurally complex regions are often poorly understood from a sequence point of view. Regions such as these that are highly similar in their sequence tend to be collapsed onto the genome assembly. This is turn means downstream determination of the true sequence haplotype in these regions poses a particular challenge. For structurally complex regions, a more focussed approach to assembling haplotypes may be required. Results In order to investigate reconstruction of spatial information at structurally complex regions, we have used an emulsion haplotype fusion PCR approach to reproducibly link sequences of up to 1kb in length to allow phasing of multiple variants from neighbouring loci, using allele-specific PCR and sequencing to detect the phase. By using emulsion systems linking flanking regions to amplicons within the CNV, this led to the reconstruction of a 59kb haplotype across the DEFA1A3 CNV in HapMap individuals. Conclusion This study has demonstrated a novel use for emulsion haplotype fusion PCR in addressing the issue of reconstructing structural haplotypes at multiallelic copy variable regions, using the DEFA1A3 locus as an example. PMID:23231411

  16. Exact algorithms for haplotype assembly from whole-genome sequence data.

    PubMed

    Chen, Zhi-Zhong; Deng, Fei; Wang, Lusheng

    2013-08-15

    Haplotypes play a crucial role in genetic analysis and have many applications such as gene disease diagnoses, association studies, ancestry inference and so forth. The development of DNA sequencing technologies makes it possible to obtain haplotypes from a set of aligned reads originated from both copies of a chromosome of a single individual. This approach is often known as haplotype assembly. Exact algorithms that can give optimal solutions to the haplotype assembly problem are highly demanded. Unfortunately, previous algorithms for this problem either fail to output optimal solutions or take too long time even executed on a PC cluster. We develop an approach to finding optimal solutions for the haplotype assembly problem under the minimum-error-correction (MEC) model. Most of the previous approaches assume that the columns in the input matrix correspond to (putative) heterozygous sites. This all-heterozygous assumption is correct for most columns, but it may be incorrect for a small number of columns. In this article, we consider the MEC model with or without the all-heterozygous assumption. In our approach, we first use new methods to decompose the input read matrix into small independent blocks and then model the problem for each block as an integer linear programming problem, which is then solved by an integer linear programming solver. We have tested our program on a single PC [a Linux (x64) desktop PC with i7-3960X CPU], using the filtered HuRef and the NA 12878 datasets (after applying some variant calling methods). With the all-heterozygous assumption, our approach can optimally solve the whole HuRef data set within a total time of 31 h (26 h for the most difficult block of the 15th chromosome and only 5 h for the other blocks). To our knowledge, this is the first time that MEC optimal solutions are completely obtained for the filtered HuRef dataset. Moreover, in the general case (without the all-heterozygous assumption), for the HuRef dataset our

  17. Further insight into genetic variation and haplotype diversity of Cherry virus A from China

    PubMed Central

    Candresse, Thierry; He, Zhen; Li, Shifang; Ma, Yuxin

    2017-01-01

    Cherry virus A (CVA) infection appears to be prevalent in cherry plantations worldwide. In this study, the diversity of CVA isolates from 31 cherry samples collected from different orchards around Bohai Bay in northeastern China was analyzed. The complete genome of one of these isolates, ChYT52, was found to be 7,434 nt in length excluding the poly (A) tail. It shares between 79.9–98.7% identity with CVA genome sequences in GenBank, while its RdRp core is more divergent (79.1–90.7% nt identity), likely as a consequence of a recombination event. Phylogenetic analysis of ChYT52 genome with CVA genomes in Genbank resulted in at least 7 major clusters plus additional 5 isolates alone at the end of long branches suggesting the existence of further phylogroups diversity in CVA. The genetic diversity of Chinese CVA isolates from 31 samples and GenBank sequences were analyzed in three genomic regions that correspond to the coat protein, the RNA-dependent RNA polymerase core region, and the movement protein genes. With few exceptions likely representing further recombination impact, the trees various trees are largely congruent, indicating that each region provides valuable phylogenetic information. In all cases, the majority of the Chinese CVA isolates clustering in phylogroup I, together with the X82547 reference sequence from Germany. Statistically significant negative values were obtained for Tajima’s D in the three genes for phylogroup I, suggesting that it may be undergoing a period of expansion. There was considerable haplotype diversity in the individual samples and more than half samples contained genetically diverse haplotypes belonging to different phylogroups. In addition, a number of statistically significant recombination events were detected in CVA genomes or in the partial genomic sequences indicating an important contribution of recombination to CVA evolution. This work provides a foundation for elucidation of the epidemiological characteristics and

  18. The sirodesmin biosynthetic gene cluster of the plant pathogenic fungus Leptosphaeria maculans.

    PubMed

    Gardiner, Donald M; Cozijnsen, Anton J; Wilson, Leanne M; Pedras, M Soledade C; Howlett, Barbara J

    2004-09-01

    Sirodesmin PL is a phytotoxin produced by the fungus Leptosphaeria maculans, which causes blackleg disease of canola (Brassica napus). This phytotoxin belongs to the epipolythiodioxopiperazine (ETP) class of toxins produced by fungi including mammalian and plant pathogens. We report the cloning of a cluster of genes with predicted roles in the biosynthesis of sirodesmin PL and show via gene disruption that one of these genes (encoding a two-module non-ribosomal peptide synthetase) is essential for sirodesmin PL biosynthesis. Of the nine genes in the cluster tested, all are co-regulated with the production of sirodesmin PL in culture. A similar cluster is present in the genome of the opportunistic human pathogen Aspergillus fumigatus and is most likely responsible for the production of gliotoxin, which is also an ETP. Homologues of the genes in the cluster were also identified in expressed sequence tags of the ETP producing fungus Chaetomium globosum. Two other fungi with publicly available genome sequences, Magnaporthe grisea and Fusarium graminearum, had similar gene clusters. A comparative analysis of all four clusters is presented. This is the first report of the genes responsible for the biosynthesis of an ETP. Copyright 2004 Blackwell Publishing Ltd

  19. PAX6 Haplotypes Are Associated with High Myopia in Han Chinese

    PubMed Central

    Jiang, Bo; Yap, Maurice K. H.; Leung, Kim Hung; Ng, Po Wah; Fung, Wai Yan; Lam, Wai Wa; Gu, Yang-shun; Yip, Shea Ping

    2011-01-01

    Background The paired box 6 (PAX6) gene is considered as a master gene for eye development. Linkage of myopia to the PAX6 region on chromosome 11p13 was shown in several studies, but the results for association between myopia and PAX6 were inconsistent so far. Methodology/Principal Findings We genotyped 16 single nucleotide polymorphisms (SNPs) in the PAX6 gene and its regulatory regions in an initial study for 300 high myopia cases and 300 controls (Group 1), and successfully replicated the positive results with another independent group of 299 high myopia cases and 299 controls (Group 2). Five SNPs were genotyped in the replication study. The spherical equivalent of subjects with high myopia was ≤−8.0 dioptres. The PLINK package was used for genetic data analysis. No association was found between each of the SNPs and high myopia. However, exhaustive sliding-window haplotype analysis highlighted an important role for rs12421026 because haplotypes containing this SNP were found to be associated with high myopia. The most significant results were given by the 4-SNP haplotype window consisting of rs2071754, rs3026393, rs1506 and rs12421026 (P = 3.54×10−10, 4.06×10−11 and 1.56×10−18 for Group 1, Group 2 and Combined Group, respectively) and the 3-SNP haplotype window composed of rs3026393, rs1506 and rs12421026 (P = 5.48×10−10, 7.93×10−12 and 6.28×10−23 for the three respective groups). The results remained significant after correction for multiple comparisons by permutations. The associated haplotyes found in a previous study were also successfully replicated in this study. Conclusions/Significance PAX6 haplotypes are associated with susceptibility to the development of high myopia in Chinese. The PAX6 locus plays a role in high myopia. PMID:21589860

  20. Genome-Wide Prediction of Metabolic Enzymes, Pathways, and Gene Clusters in Plants.

    PubMed

    Schläpfer, Pascal; Zhang, Peifen; Wang, Chuan; Kim, Taehyong; Banf, Michael; Chae, Lee; Dreher, Kate; Chavali, Arvind K; Nilo-Poyanco, Ricardo; Bernard, Thomas; Kahn, Daniel; Rhee, Seung Y

    2017-04-01

    Plant metabolism underpins many traits of ecological and agronomic importance. Plants produce numerous compounds to cope with their environments but the biosynthetic pathways for most of these compounds have not yet been elucidated. To engineer and improve metabolic traits, we need comprehensive and accurate knowledge of the organization and regulation of plant metabolism at the genome scale. Here, we present a computational pipeline to identify metabolic enzymes, pathways, and gene clusters from a sequenced genome. Using this pipeline, we generated metabolic pathway databases for 22 species and identified metabolic gene clusters from 18 species. This unified resource can be used to conduct a wide array of comparative studies of plant metabolism. Using the resource, we discovered a widespread occurrence of metabolic gene clusters in plants: 11,969 clusters from 18 species. The prevalence of metabolic gene clusters offers an intriguing possibility of an untapped source for uncovering new metabolite biosynthesis pathways. For example, more than 1,700 clusters contain enzymes that could generate a specialized metabolite scaffold (signature enzymes) and enzymes that modify the scaffold (tailoring enzymes). In four species with sufficient gene expression data, we identified 43 highly coexpressed clusters that contain signature and tailoring enzymes, of which eight were characterized previously to be functional pathways. Finally, we identified patterns of genome organization that implicate local gene duplication and, to a lesser extent, single gene transposition as having played roles in the evolution of plant metabolic gene clusters. © 2017 American Society of Plant Biologists. All Rights Reserved.

  1. Fungal secondary metabolites - strategies to activate silent gene clusters.

    PubMed

    Brakhage, Axel A; Schroeckh, Volker

    2011-01-01

    Filamentous fungi produce a multitude of low molecular weight bioactive compounds. The increasing number of fungal genome sequences impressively demonstrated that their biosynthetic potential is far from being exploited. In fungi, the genes required for the biosynthesis of a secondary metabolite are clustered. Many of these bioinformatically newly discovered secondary metabolism gene clusters are silent under standard laboratory conditions. Consequently, no product can be found. This review summarizes the current strategies that have been successfully applied during the last years to activate these silent gene clusters in filamentous fungi, especially in the genus Aspergillus. The techniques take advantage of genome mining, vary from the simple search for compounds with bioinformatically predicted physicochemical properties up to methods that exploit a probable interaction of microorganisms. Until now, the majority of successful approaches have been based on molecular biology like the generation of gene "knock outs", promoter exchange, overexpression of transcription factors or other pleiotropic regulators. Moreover, strategies based on epigenetics opened a new avenue for the elucidation of the regulation of secondary metabolite formation and will certainly continue to play a significant role for the elucidation of cryptic natural products. The conditions under which a given gene cluster is naturally expressed are largely unknown. One technique is to attempt to simulate the natural habitat by co-cultivation of microorganisms from the same ecosystem. This has already led to the activation of silent gene clusters and the identification of novel compounds in Aspergillus nidulans. These simulation strategies will help discover new natural products in the future, and may also provide fundamental new insights into microbial communication. Copyright © 2010 Elsevier Inc. All rights reserved.

  2. Associations of Polymorphisms in the Apolipoprotein APOA1-C3-A5 Gene Cluster with Acute Coronary Syndrome

    PubMed Central

    Ding, Yan; Zhu, Ming An; Wang, Zhi Xiao; Zhu, Jing; Feng, Jing Bo; Li, Dong Sheng

    2012-01-01

    Background. Acute coronary syndromes (ACSs) are clinically cardiovascular events associated with dyslipidemia in common. Single nucleotide polymorphisms (SNPs) and haplotypes in the APOA1/C3/A5 gene cluster are associated with diabetes and familial combined hyperlipidaemia (FCH). Little is known about whether the polymorphisms in these genes affect lipid homeostasis in patients with ACSs. The present paper aimed to examine these associations with 4 SNPs in the APOA1 −75G > A, the APOC3 −455T > C, and APOA5 −1131T > C, c.553G > T variant to ACSs in Chinese Han. Methods. Chinese Han of 229 patients with ACSs and 254 unrelated controls were analyzed. Four SNPs in APOA1/C3/A5 cluster were genotyped and lipid was determined. Results. Our data show that minor allelic frequencies of APOC3 −455T > C, APOA5 −1131T > C, and c.553G > T polymorphisms in patients with ACSs were significantly higher than control group (P < 0.05). Furthermore, the 3 polymorphic sites were strongly of linkage disequilibrium, and minor alleles of 3 SNP sites had higher TG level than wild alleles (P < 0.05), APOC3 −455C and APOA5 c.553T allele carriers also had lower level of HDL-C. Conclusions. The minor alleles of APOC3 −455T > C, APOA5 −1131T > C, and c.553G > T polymorphisms are closely associated with ACSs. PMID:22675253

  3. Next generation haplotyping to decipher nuclear genomic interspecific admixture in Citrus species: analysis of chromosome 2.

    PubMed

    Curk, Franck; Ancillo, Gema; Garcia-Lor, Andres; Luro, François; Perrier, Xavier; Jacquemoud-Collet, Jean-Pierre; Navarro, Luis; Ollitrault, Patrick

    2014-12-29

    The most economically important Citrus species originated by natural interspecific hybridization between four ancestral taxa (Citrus reticulata, Citrus maxima, Citrus medica, and Citrus micrantha) and from limited subsequent interspecific recombination as a result of apomixis and vegetative propagation. Such reticulate evolution coupled with vegetative propagation results in mosaic genomes with large chromosome fragments from the basic taxa in frequent interspecific heterozygosity. Modern breeding of these species is hampered by their complex heterozygous genomic structures that determine species phenotype and are broken by sexual hybridisation. Nevertheless, a large amount of diversity is present in the citrus gene pool, and breeding to allow inclusion of desirable traits is of paramount importance. However, the efficient mobilization of citrus biodiversity in innovative breeding schemes requires previous understanding of Citrus origins and genomic structures. Haplotyping of multiple gene fragments along the whole genome is a powerful approach to reveal the admixture genomic structure of current species and to resolve the evolutionary history of the gene pools. In this study, the efficiency of parallel sequencing with 454 methodology to decipher the hybrid structure of modern citrus species was assessed by analysis of 16 gene fragments on chromosome 2. 454 amplicon libraries were established using the Fluidigm array system for 48 genotypes and 16 gene fragments from chromosome 2. Haplotypes were established from the reads of each accession and phylogenetic analyses were performed using the haplotypic data for each gene fragment. The length of 454 reads and the level of differentiation between the ancestral taxa of modern citrus allowed efficient haplotype phylogenetic assignations for 12 of the 16 gene fragments. The analysis of the mixed genomic structure of modern species and cultivars (i) revealed C. maxima introgressions in modern mandarins, (ii) was

  4. Hb S [β6(A3)Glu→Val, GAG>GTG] and β-globin gene cluster haplotype distribution in Mauritania.

    PubMed

    Veten, Fatimetou M; Abdelhamid, Isselmou O; Meiloud, Ghlana M; Ghaber, Sidi M; Salem, Mohamed L; Abbes, Salem; Houmeida, Ahmed O

    2012-01-01

    Of 1050 Mauritanian blood donors screened from the two main racial groups, i.e., the Moors and Black Africans, 60 were found to carry Hb S [β6(A3)Glu→Val, GAG>GTG], giving a global frequency of 5.71%. The prevalence observed in the Black African Mauritanians (10.69%) is almost five times that found in the Moor group (2.25%). Four of the five main β(S) haplotypes were detected in this study: Senegal (77.8%), Benin (8.8%), Arab-Indian (5.5%) and Bantu (4.4%). These data showed that Hb S is a serious public health problem in Mauritania. They also confirm the ethnic heterogeneity of the Mauritanian population.

  5. Associations of the interleukin-1 gene locus polymorphisms with risk to hip and knee osteoarthritis: gender and subpopulation differences.

    PubMed

    Kaarvatn, M H; Jotanovic, Z; Mihelic, R; Etokebe, G E; Mulac-Jericevic, B; Tijanic, T; Balen, S; Sestan, B; Dembic, Z

    2013-02-01

    Genetic predisposition to the complex hereditary disease like osteoarthritis (OA) of the large joints (hip and knee) includes the interleukin-1 gene (IL-1) cluster on chromosome 2. Using a case-control study with 500 OA patients (240 knee and 260 hip OA patients, all with joint replacement), we analysed frequencies of IL-1 gene cluster polymorphisms in Croatian Caucasian population. The control samples came from 531 healthy individuals including blood donors. We genotyped two single nucleotide polymorphisms in the IL-1 gene locus at IL-1A (-889, C>T, rs1800587) and IL-1B (+3594, C>T, rs1143634) and compared their frequencies between patients and controls. We predicted haplotypes by combining current data with our previous results on gene polymorphisms (IL-1B, rs16944 and the IL-1 receptor antagonist gene [IL-1RN] variable number tandem repeat [VNTR]) for the same population. Haplotype analyses revealed gender disparities and showed that women carriers of the 1-2-1-1 haplotype [IL-1A(rs1800587) - IL-1B(rs1143634) - IL-1B(rs16944) - IL-1RN(VNTR)] had sixfold lower risk to develop knee OA. However, carriers of the 1-1-1-2 haplotype of both sexes had over twofold higher predisposition to hip OA. Our results differ from some earlier studies in Caucasian subpopulations, which may be due to the fact that this is the first study to separate genders in assessing the IL-1-locus genetic risk of OA. The results suggest that inflammatory mediators like IL-1 might be implicated in the pathogenesis of primary OA in large joints and that as yet unidentified gender-specific factors exist in a Croatian Caucasian population. © 2012 The Authors. Scandinavian Journal of Immunology © 2012 Blackwell Publishing Ltd.

  6. Bacillus cereus-type polyhydroxyalkanoate biosynthetic gene cluster contains R-specific enoyl-CoA hydratase gene.

    PubMed

    Kihara, Takahiro; Hiroe, Ayaka; Ishii-Hyakutake, Manami; Mizuno, Kouhei; Tsuge, Takeharu

    2017-08-01

    Bacillus cereus and Bacillus megaterium both accumulate polyhydroxyalkanoate (PHA) but their PHA biosynthetic gene (pha) clusters that code for proteins involved in PHA biosynthesis are different. Namely, a gene encoding MaoC-like protein exists in the B. cereus-type pha cluster but not in the B. megaterium-type pha cluster. MaoC-like protein has an R-specific enoyl-CoA hydratase (R-hydratase) activity and is referred to as PhaJ when involved in PHA metabolism. In this study, the pha cluster of B. cereus YB-4 was characterized in terms of PhaJ's function. In an in vitro assay, PhaJ from B. cereus YB-4 (PhaJ YB4 ) exhibited hydration activity toward crotonyl-CoA. In an in vivo assay using Escherichia coli as a host for PHA accumulation, the recombinant strain expressing PhaJ YB4 and PHA synthase led to increased PHA accumulation, suggesting that PhaJ YB4 functioned as a monomer supplier. The monomer composition of the accumulated PHA reflected the substrate specificity of PhaJ YB4 , which appeared to prefer short chain-length substrates. The pha cluster from B. cereus YB-4 functioned to accumulate PHA in E. coli; however, it did not function when the phaJ YB4 gene was deleted. The B. cereus-type pha cluster represents a new example of a pha cluster that contains the gene encoding PhaJ.

  7. Sequences of 95 human MHC haplotypes reveal extreme coding variation in genes other than highly polymorphic HLA class I and II

    PubMed Central

    Norman, Paul J.; Norberg, Steven J.; Guethlein, Lisbeth A.; Nemat-Gorgani, Neda; Royce, Thomas; Wroblewski, Emily E.; Dunn, Tamsen; Mann, Tobias; Alicata, Claudia; Hollenbach, Jill A.; Chang, Weihua; Shults Won, Melissa; Gunderson, Kevin L.; Abi-Rached, Laurent; Ronaghi, Mostafa; Parham, Peter

    2017-01-01

    The most polymorphic part of the human genome, the MHC, encodes over 160 proteins of diverse function. Half of them, including the HLA class I and II genes, are directly involved in immune responses. Consequently, the MHC region strongly associates with numerous diseases and clinical therapies. Notoriously, the MHC region has been intractable to high-throughput analysis at complete sequence resolution, and current reference haplotypes are inadequate for large-scale studies. To address these challenges, we developed a method that specifically captures and sequences the 4.8-Mbp MHC region from genomic DNA. For 95 MHC homozygous cell lines we assembled, de novo, a set of high-fidelity contigs and a sequence scaffold, representing a mean 98% of the target region. Included are six alternative MHC reference sequences of the human genome that we completed and refined. Characterization of the sequence and structural diversity of the MHC region shows the approach accurately determines the sequences of the highly polymorphic HLA class I and HLA class II genes and the complex structural diversity of complement factor C4A/C4B. It has also uncovered extensive and unexpected diversity in other MHC genes; an example is MUC22, which encodes a lung mucin and exhibits more coding sequence alleles than any HLA class I or II gene studied here. More than 60% of the coding sequence alleles analyzed were previously uncharacterized. We have created a substantial database of robust reference MHC haplotype sequences that will enable future population scale studies of this complicated and clinically important region of the human genome. PMID:28360230

  8. Natural selection on marine carnivores elaborated a diverse family of classical MHC class I genes exhibiting haplotypic gene content variation and allelic polymorphism

    PubMed Central

    Norman, Paul J.; Parham, Peter

    2012-01-01

    Pinnipeds, marine carnivores, diverged from terrestrial carnivores ~45 million years ago, before their adaptation to marine environments. This lifestyle change exposed pinnipeds to different microbiota and pathogens, with probable impact on their MHC class I genes. Investigating this question, genomic sequences were determined for 71 MHC class I variants: 27 from harbor seal and 44 from gray seal. These variants form three MHC class I gene lineages, one comprising a pseudogene. The second, a candidate nonclassical MHC class I gene, comprises a nonpolymorphic transcribed gene related to dog DLA-79 and giant panda Aime-1906. The third is the diversity lineage, which includes 62 of the 71 seal MHC class I variants. All are transcribed, and they minimally represent six harbor and 12 gray seal MHC class I genes. Besides species-specific differences in gene number, seal MHC class I haplotypes exhibit gene content variation and allelic polymorphism. Patterns of sequence variation, and of positions for positively selected sites, indicate the diversity lineage genes are the seals’ classical MHC class I genes. Evidence that expansion of diversity lineage genes began before gray and harbor seals diverged is the presence in both species of two distinctive sublineages of diversity lineage genes. Pointing to further expansion following the divergence are the presence of species-specific genes and greater MHC class I diversity in gray seals than harbor seals. The elaboration of a complex variable family of classical MHC class I genes in pinnipeds contrasts with the single, highly polymorphic classical MHC class I gene of dog and giant panda, terrestrial carnivores. PMID:23001684

  9. Hox cluster polarity in early transcriptional availability: a high order regulatory level of clustered Hox genes in the mouse.

    PubMed

    Roelen, Bernard A J; de Graaff, Wim; Forlani, Sylvie; Deschamps, Jacqueline

    2002-11-01

    The molecular mechanism underlying the 3' to 5' polarity of induction of mouse Hox genes is still elusive. While relief from a cluster-encompassing repression was shown to lead to all Hoxd genes being expressed like the 3'most of them, Hoxd1 (Kondo and Duboule, 1999), the molecular basis of initial activation of this 3'most gene, is not understood yet. We show that, already before primitive streak formation, prior to initial expression of the first Hox gene, a dramatic transcriptional stimulation of the 3'most genes, Hoxb1 and Hoxb2, is observed upon a short pulse of exogenous retinoic acid (RA), whereas it is not in the case for more 5', cluster-internal, RA-responsive Hoxb genes. In contrast, the RA-responding Hoxb1lacZ transgene that faithfully mimics the endogenous gene (Marshall et al., 1994) did not exhibit the sensitivity of Hoxb1 to precocious activation. We conclude that polarity in initial activation of Hoxb genes reflects a greater availability of 3'Hox genes for transcription, suggesting a pre-existing (susceptibility to) opening of the chromatin structure at the 3' extremity of the cluster. We discuss the data in the context of prevailing models involving differential chromatin opening in the directionality of clustered Hox gene transcription, and regarding the importance of the cluster context for correct timing of initial Hox gene expression.Interestingly, Cdx1 manifested the same early transcriptional availability as Hoxb1. Copyright 2002 Elsevier Science Ireland Ltd.

  10. Haplotype-Based Genotyping in Polyploids.

    PubMed

    Clevenger, Josh P; Korani, Walid; Ozias-Akins, Peggy; Jackson, Scott

    2018-01-01

    Accurate identification of polymorphisms from sequence data is crucial to unlocking the potential of high throughput sequencing for genomics. Single nucleotide polymorphisms (SNPs) are difficult to accurately identify in polyploid crops due to the duplicative nature of polyploid genomes leading to low confidence in the true alignment of short reads. Implementing a haplotype-based method in contrasting subgenome-specific sequences leads to higher accuracy of SNP identification in polyploids. To test this method, a large-scale 48K SNP array (Axiom Arachis2) was developed for Arachis hypogaea (peanut), an allotetraploid, in which 1,674 haplotype-based SNPs were included. Results of the array show that 74% of the haplotype-based SNP markers could be validated, which is considerably higher than previous methods used for peanut. The haplotype method has been implemented in a standalone program, HAPLOSWEEP, which takes as input bam files and a vcf file and identifies haplotype-based markers. Haplotype discovery can be made within single reads or span paired reads, and can leverage long read technology by targeting any length of haplotype. Haplotype-based genotyping is applicable in all allopolyploid genomes and provides confidence in marker identification and in silico-based genotyping for polyploid genomics.

  11. A DRD1 haplotype is associated with risk for autism spectrum disorders in male-only affected sib-pair families.

    PubMed

    Hettinger, Joe A; Liu, Xudong; Schwartz, Charles E; Michaelis, Ron C; Holden, Jeanette J A

    2008-07-05

    Individuals with autism spectrum disorders (ASDs) have impairments in executive function and social cognition, with males generally being more severely affected in these areas than females. Because the dopamine D1 receptor (encoded by DRD1) is integral to the neural circuitry mediating these processes, we examined the DRD1 gene for its role in susceptibility to ASDs by performing single marker and haplotype case-control comparisons, family-based association tests, and genotype-phenotype assessments (quantitative transmission disequilibrium tests: QTDT) using three DRD1 polymorphisms, rs265981C/T, rs4532A/G, and rs686T/C. Our previous findings suggested that the dopaminergic system may be more integrally involved in families with affected males only than in other families. We therefore restricted our study to families with two or more affected males (N = 112). There was over-transmission of rs265981-C and rs4532-A in these families (P = 0.040, P = 0.038), with haplotype TDT analysis showing over-transmission of the C-A-T haplotype (P = 0.022) from mothers to affected sons (P = 0.013). In addition, haplotype case-control comparisons revealed an increase of this putative risk haplotype in affected individuals relative to a comparison group (P = 0.004). QTDT analyses showed associations of the rs265981-C, rs4532-A, rs686-T alleles, and the C-A-T haplotype with more severe problems in social interaction, greater difficulties with nonverbal communication and increased stereotypies compared to individuals with other haplotypes. Preferential haplotype transmission of markers at the DRD1 locus and an increased frequency of a specific haplotype support the DRD1 gene as a risk gene for core symptoms of ASD in families having only affected males. Copyright 2008 Wiley-Liss, Inc.

  12. COMT haplotypes modulate associations of antenatal maternal anxiety and neonatal cortical morphology.

    PubMed

    Qiu, Anqi; Tuan, Ta Anh; Ong, Mei Lyn; Li, Yue; Chen, Helen; Rifkin-Graboi, Anne; Broekman, Birit F P; Kwek, Kenneth; Saw, Seang-Mei; Chong, Yap-Seng; Gluckman, Peter D; Fortier, Marielle V; Holbrook, Joanna Dawn; Meaney, Michael J

    2015-02-01

    Exposure to antenatal maternal anxiety and complex genetic variations may shape fetal brain development. In particular, the catechol-O-methyltransferase (COMT) gene, located on chromosome 22q11.2, regulates catecholamine signaling in the prefrontal cortex and is implicated in anxiety, pain, and stress responsivity. This study examined whether individual single-nucleotide polymorphisms (SNPs) of the COMT gene and their haplotypes moderate the association between antenatal maternal anxiety and in utero cortical development. A total of 146 neonates were genotyped and underwent MRI shortly after birth. Neonatal cortical morphology was characterized using cortical thickness. Antenatal maternal anxiety was assessed using the State-Trait Anxiety Inventory at week 26 of pregnancy. Individual COMT SNPs (val158met, rs737865, and rs165599) modulated the association between antenatal maternal anxiety and the prefrontal and parietal cortical thickness in neonates. Based on haplotype trend regression analysis, findings also showed that among rs737865-val158met-rs165599 haplotypes, the A-val-G (AGG) haplotype probabilities modulated positive associations of antenatal maternal anxiety with cortical thickness in the right ventrolateral prefrontal cortex and the right superior parietal cortex and precuneus. In contrast, the G-met-A (GAA) haplotype probabilities modulated negative associations of antenatal maternal anxiety with cortical thickness in bilateral precentral gyrus and the dorsolateral prefrontal cortex. These results suggest that the association between maternal anxiety and in utero neurodevelopment is modified through complex genetic variation in COMT. Such genetic moderation may explain, in part, the variation in phenotypic outcomes in offspring associated with maternal emotional well-being.

  13. A cross-species bi-clustering approach to identifying conserved co-regulated genes.

    PubMed

    Sun, Jiangwen; Jiang, Zongliang; Tian, Xiuchun; Bi, Jinbo

    2016-06-15

    A growing number of studies have explored the process of pre-implantation embryonic development of multiple mammalian species. However, the conservation and variation among different species in their developmental programming are poorly defined due to the lack of effective computational methods for detecting co-regularized genes that are conserved across species. The most sophisticated method to date for identifying conserved co-regulated genes is a two-step approach. This approach first identifies gene clusters for each species by a cluster analysis of gene expression data, and subsequently computes the overlaps of clusters identified from different species to reveal common subgroups. This approach is ineffective to deal with the noise in the expression data introduced by the complicated procedures in quantifying gene expression. Furthermore, due to the sequential nature of the approach, the gene clusters identified in the first step may have little overlap among different species in the second step, thus difficult to detect conserved co-regulated genes. We propose a cross-species bi-clustering approach which first denoises the gene expression data of each species into a data matrix. The rows of the data matrices of different species represent the same set of genes that are characterized by their expression patterns over the developmental stages of each species as columns. A novel bi-clustering method is then developed to cluster genes into subgroups by a joint sparse rank-one factorization of all the data matrices. This method decomposes a data matrix into a product of a column vector and a row vector where the column vector is a consistent indicator across the matrices (species) to identify the same gene cluster and the row vector specifies for each species the developmental stages that the clustered genes co-regulate. Efficient optimization algorithm has been developed with convergence analysis. This approach was first validated on synthetic data and compared

  14. Pichia stipitis genomics, transcriptomics, and gene clusters

    Treesearch

    Thomas W. Jeffries; Jennifer R. Headman Van Vleet

    2009-01-01

    Genome sequencing and subsequent global gene expression studies have advanced our understanding of the lignocellulose-fermenting yeast Pichia stipitis. These studies have provided an insight into its central carbon metabolism, and analysis of its genome has revealed numerous functional gene clusters and tandem repeats. Specialized physiological traits are often the...

  15. Anonymous marker loci within 400 kb of HLA-A generate haplotypes in linkage disequilibrium with the hemochromatosis gene (HFE)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yaouanq, J.; Perichon, M.; Treut, A.L.

    1994-02-01

    The hemochromatosis gene (HFE) maps to 6p21.3 and is less than 1 cM from the HLA class I gene; however, the precise physical location of the gene has remained elusive and controversial. The unambiguous identification of a crossover event within hemochromatosis families is very difficult; it is particularly hampered by the variability of the phenotypic expression as well as by the sex- and age-related penetrance of the disease. For these considerations, traditional linkage analysis could prove of limited value in further refining the extrapolated physical position of HFE. The authors therefore embarked upon a linkage-disequilibrium analysis of HFE and normalmore » chromosomes for the Brittany population. In this report, 66 hemochromatosis families yielding 151 hemochromatosis chromosomes and 182 normal chromosomes were RFLP-typed with a battery of probes, including two newly derived polymorphic markers from the 6.7 and HLA-F loci located 150 and 250 kb telomeric to HLA-A, respectively. The results suggest a strong peak of existing linkage disequilibrium focused within the i82-to-6.7 interval (approximately 250 kb). The zone of linkage disequilibrium is flanked by the i97 locus, positioned 30 kb proximal to i82, and the HLA-F gene, found 250 kb distal to HLA-A, markers of which display no significant association with HFE. These data support the possibility that HFE resides within the 400-kb expanse of DNA between i97 and HLA-F. Alternatively, the very tight association of HLA-A3 and allele 1 of the 6.7 locus, both of which are comprised by the major ancestral or founder HFE haplotype in Brittany, supports the possibility that the disease gene may reside immediately telomeric to the 6.7 locus within the linkage-disequilibrium zone. Additionally, hemochromatosis haplotypes possessing HLA-A11 and the low-frequency HLA-F polymorphism (allele 2) are supportive of a separate founder chromosome containing a second, independently arising mutant allele. 69 refs., 1 fig., 5

  16. References for Haplotype Imputation in the Big Data Era

    PubMed Central

    Li, Wenzhi; Xu, Wei; Li, Qiling; Ma, Li; Song, Qing

    2016-01-01

    Imputation is a powerful in silico approach to fill in those missing values in the big datasets. This process requires a reference panel, which is a collection of big data from which the missing information can be extracted and imputed. Haplotype imputation requires ethnicity-matched references; a mismatched reference panel will significantly reduce the quality of imputation. However, currently existing big datasets cover only a small number of ethnicities, there is a lack of ethnicity-matched references for many ethnic populations in the world, which has hampered the data imputation of haplotypes and its downstream applications. To solve this issue, several approaches have been proposed and explored, including the mixed reference panel, the internal reference panel and genotype-converted reference panel. This review article provides the information and comparison between these approaches. Increasing evidence showed that not just one or two genetic elements dictate the gene activity and functions; instead, cis-interactions of multiple elements dictate gene activity. Cis-interactions require the interacting elements to be on the same chromosome molecule, therefore, haplotype analysis is essential for the investigation of cis-interactions among multiple genetic variants at different loci, and appears to be especially important for studying the common diseases. It will be valuable in a wide spectrum of applications from academic research, to clinical diagnosis, prevention, treatment, and pharmaceutical industry. PMID:27274952

  17. Fetal hemoglobin in sickle cell anemia: The Arab-Indian haplotype and new therapeutic agents.

    PubMed

    Habara, Alawi H; Shaikho, Elmutaz M; Steinberg, Martin H

    2017-11-01

    Fetal hemoglobin (HbF) has well-known tempering effects on the symptoms of sickle cell disease and its levels vary among patients with different haplotypes of the sickle hemoglobin gene. Compared with sickle cell anemia haplotypes found in patients of African descent, HbF levels in Saudi and Indian patients with the Arab-Indian (AI) haplotype exceed that in any other haplotype by nearly twofold. Genetic association studies have identified some loci associated with high HbF in the AI haplotype but these observations require functional confirmation. Saudi patients with the Benin haplotype have HbF levels almost twice as high as African patients with this haplotype but this difference is unexplained. Hydroxyurea is still the only FDA approved drug for HbF induction in sickle cell disease. While most patients treated with hydroxyurea have an increase in HbF and some clinical improvement, 10 to 20% of adults show little response to this agent. We review the genetic basis of HbF regulation focusing on sickle cell anemia in Saudi Arabia and discuss new drugs that can induce increased levels of HbF. © 2017 Wiley Periodicals, Inc.

  18. [Frequency distribution of HLA antigens and haplotypes in newly arrived inhabitants of Magadan].

    PubMed

    Solovenchuk, L L; Pereverzeva, V V; Nevretdinova, Z G

    1994-09-01

    Peculiarities of the frequency distribution of antigens and haplotypes of A, B, and Cw subloci of the HLA system in 924 Slavic inhabitants of Magadan are described. Significant differences in gene and haplotype frequencies between inhabitants of Magadan and those of Moscow, Odessa, Poles'e, Latvia, and England were revealed, which could not be attributed solely to the specificity of migration processes. On the basis of an analysis of gamete associations of the A and B subloci, an attempt was made to explain the specificity of the frequency distribution of HLA system alleles and haplotypes in the investigated sample from an ecological point of view.

  19. Interactions of Environmental Factors and APOA1-APOC3-APOA4-APOA5 Gene Cluster Gene Polymorphisms with Metabolic Syndrome.

    PubMed

    Wu, Yanhua; Yu, Yaqin; Zhao, Tiancheng; Wang, Shibin; Fu, Yingli; Qi, Yue; Yang, Guang; Yao, Wenwang; Su, Yingying; Ma, Yue; Shi, Jieping; Jiang, Jing; Kou, Changgui

    2016-01-01

    positively associated with MetS (P<0.05). A family history of diabetes (OR = 1.630, 95%CI = 1.484-1.791), cardiovascular disease or cerebral diseases (OR = 1.297, 95%CI = 1.211-1.389) was associated with MetS. APOA1 rs670, APOA5 rs662799 and rs651821 revealed significant differences in genotype distributions between the MetS patients and control subjects. The minor alleles of APOA1 rs670, APOA5 rs662799 and rs651821, and APOA5 rs2075291 were associated with MetS (P<0.0016). APOA1 rs5072 and APOC3 rs5128, APOA5 rs651821 and rs662799 were in strong linkage disequilibrium to each other with r2 greater than 0.8. Five haplotypes were associated with an increased risk of MetS (OR = 1.23, 1.58, 1.80, 1.90, and 1.98). When we investigated the interactions of environmental factors and APOA1/C3/A4/A5 gene cluster gene polymorphisms, we found that APOA5 rs662799 had interactions with tobacco use and alcohol consumption (PGE<0.05). There was a high prevalence of MetS in the northeast of China. Male gender, increasing age, mental labor, family history of diabetes, cardiovascular disease or cerebral diseases, current smoking, excess salt intake, fruit and dairy intake less than 2 servings a week, and drinking were associated with MetS. The APOA1/C3/A4/A5 gene cluster was associated with MetS in the Han Chinese. APOA5 rs662799 had interactions with the environmental factors associated with MetS.

  20. IL1B-CGTC haplotype is associated with colorectal cancer in admixed individuals with increased African ancestry

    PubMed Central

    Sanabria-Salas, María Carolina; Hernández-Suárez, Gustavo; Umaña-Pérez, Adriana; Rawlik, Konrad; Tenesa, Albert; Serrano-López, Martha Lucía; Sánchez de Gómez, Myriam; Rojas, Martha Patricia; Bravo, Luis Eduardo; Albis, Rosario; Plata, José Luis; Green, Heather; Borgovan, Theodor; Li, Li; Majumdar, Sumana; Garai, Jone; Lee, Edward; Ashktorab, Hassan; Brim, Hassan; Li, Li; Margolin, David; Fejerman, Laura; Zabaleta, Jovanny

    2017-01-01

    Single-nucleotide polymorphisms (SNPs) in cytokine genes can affect gene expression and thereby modulate inflammation and carcinogenesis. However, the data on the association between SNPs in the interleukin 1 beta gene (IL1B) and colorectal cancer (CRC) are conflicting. We found an association between a 4-SNP haplotype block of the IL1B (-3737C/-1464G/-511T/-31C) and CRC risk, and this association was exclusively observed in individuals with a higher proportion of African ancestry, such as individuals from the Coastal Colombian region (odds ratio, OR 2.06; 95% CI 1.31–3.25; p < 0.01). Moreover, a significant interaction between this CRC risk haplotype and local African ancestry dosage was identified in locus 2q14 (p = 0.03). We conclude that Colombian individuals with high African ancestry proportions at locus 2q14 harbour more IL1B-CGTC copies and are consequently at an increased risk of CRC. This haplotype has been previously found to increase the IL1B promoter activity and is the most frequent haplotype in African Americans. Despite of limitations in the number of samples and the lack of functional analysis to examine the effect of these haplotypes on CRC cell lines, our results suggest that inflammation and ethnicity play a major role in the modulation of CRC risk. PMID:28157220

  1. Huntingtin Haplotypes Provide Prioritized Target Panels for Allele-specific Silencing in Huntington Disease Patients of European Ancestry

    PubMed Central

    Kay, Chris; Collins, Jennifer A; Skotte, Niels H; Southwell, Amber L; Warby, Simon C; Caron, Nicholas S; Doty, Crystal N; Nguyen, Betty; Griguoli, Annamaria; Ross, Colin J; Squitieri, Ferdinando; Hayden, Michael R

    2015-01-01

    Huntington disease (HD) is a dominant neurodegenerative disorder caused by a CAG repeat expansion in the Huntingtin gene (HTT). Heterozygous polymorphisms in cis with the mutation allow for allele-specific suppression of the pathogenic HTT transcript as a therapeutic strategy. To prioritize target selection, precise heterozygosity estimates are needed across diverse HD patient populations. Here we present the first comprehensive investigation of all common target alleles across the HTT gene, using 738 reference haplotypes from the 1000 Genomes Project and 2364 haplotypes from HD patients and relatives in Canada, Sweden, France, and Italy. The most common HD haplotypes (A1, A2, and A3a) define mutually exclusive sets of polymorphisms for allele-specific therapy in the greatest number of patients. Across all four populations, a maximum of 80% are treatable using these three target haplotypes. We identify a novel deletion found exclusively on the A1 haplotype, enabling potent and selective silencing of mutant HTT in approximately 40% of the patients. Antisense oligonucleotides complementary to the deletion reduce mutant A1 HTT mRNA by 78% in patient cells while sparing wild-type HTT expression. By suppressing specific haplotypes on which expanded CAG occurs, we demonstrate a rational approach to the development of allele-specific therapy for a monogenic disorder. PMID:26201449

  2. Clustering change patterns using Fourier transformation with time-course gene expression data.

    PubMed

    Kim, Jaehee

    2011-01-01

    To understand the behavior of genes, it is important to explore how the patterns of gene expression change over a period of time because biologically related gene groups can share the same change patterns. In this study, the problem of finding similar change patterns is induced to clustering with the derivative Fourier coefficients. This work is aimed at discovering gene groups with similar change patterns which share similar biological properties. We developed a statistical model using derivative Fourier coefficients to identify similar change patterns of gene expression. We used a model-based method to cluster the Fourier series estimation of derivatives. We applied our model to cluster change patterns of yeast cell cycle microarray expression data with alpha-factor synchronization. It showed that, as the method clusters with the probability-neighboring data, the model-based clustering with our proposed model yielded biologically interpretable results. We expect that our proposed Fourier analysis with suitably chosen smoothing parameters could serve as a useful tool in classifying genes and interpreting possible biological change patterns.

  3. An ergot alkaloid biosynthesis gene and clustered hypothetical genes from Aspergillus fumigatus.

    PubMed

    Coyle, Christine M; Panaccione, Daniel G

    2005-06-01

    The ergot alkaloids are a family of indole-derived mycotoxins with a variety of significant biological activities. Aspergillus fumigatus, a common airborne fungus and opportunistic human pathogen, and several fungi in the relatively distant taxon Clavicipitaceae (clavicipitaceous fungi) produce different sets of ergot alkaloids. The ergot alkaloids of these divergent fungi share a four-member ergoline ring but differ in the number, type, and position of the side chains. Several genes required for ergot alkaloid production are known in the clavicipitaceous fungi, and these genes are clustered in the genome of the ergot fungus Claviceps purpurea. We investigated whether the ergot alkaloids of A. fumigatus have a common biosynthetic and genetic origin with those of the clavicipitaceous fungi. A homolog of dmaW, the gene controlling the determinant step in the ergot alkaloid pathway of clavicipitaceous fungi, was identified in the A. fumigatus genome. Knockout of dmaW eliminated all known ergot alkaloids from A. fumigatus, and complementation of the mutation restored ergot alkaloid production. Clustered with dmaW in the A. fumigatus genome are sequences corresponding to five genes previously proposed to encode steps in the ergot alkaloid pathway of C. purpurea, as well as additional sequences whose deduced protein products are consistent with their involvement in the ergot alkaloid pathway. The corresponding genes have similarities in their nucleotide sequences, but the orientations and positions within the cluster of several of these genes differ. The data indicate that the ergot alkaloid biosynthetic capabilities in A. fumigatus and the clavicipitaceous fungi had a common origin.

  4. An Ergot Alkaloid Biosynthesis Gene and Clustered Hypothetical Genes from Aspergillus fumigatus†

    PubMed Central

    Coyle, Christine M.; Panaccione, Daniel G.

    2005-01-01

    The ergot alkaloids are a family of indole-derived mycotoxins with a variety of significant biological activities. Aspergillus fumigatus, a common airborne fungus and opportunistic human pathogen, and several fungi in the relatively distant taxon Clavicipitaceae (clavicipitaceous fungi) produce different sets of ergot alkaloids. The ergot alkaloids of these divergent fungi share a four-member ergoline ring but differ in the number, type, and position of the side chains. Several genes required for ergot alkaloid production are known in the clavicipitaceous fungi, and these genes are clustered in the genome of the ergot fungus Claviceps purpurea. We investigated whether the ergot alkaloids of A. fumigatus have a common biosynthetic and genetic origin with those of the clavicipitaceous fungi. A homolog of dmaW, the gene controlling the determinant step in the ergot alkaloid pathway of clavicipitaceous fungi, was identified in the A. fumigatus genome. Knockout of dmaW eliminated all known ergot alkaloids from A. fumigatus, and complementation of the mutation restored ergot alkaloid production. Clustered with dmaW in the A. fumigatus genome are sequences corresponding to five genes previously proposed to encode steps in the ergot alkaloid pathway of C. purpurea, as well as additional sequences whose deduced protein products are consistent with their involvement in the ergot alkaloid pathway. The corresponding genes have similarities in their nucleotide sequences, but the orientations and positions within the cluster of several of these genes differ. The data indicate that the ergot alkaloid biosynthetic capabilities in A. fumigatus and the clavicipitaceous fungi had a common origin. PMID:15933009

  5. [Gene and haplotype frequencies for the loci HLA-A, B and DRB1 in 11755 north Chinese Han bone marrow registry donors].

    PubMed

    Wu, Qiang-Ju; Liu, Meng-Li; Qi, Jun; Liu, Sheng; Zhang, Yan; Wei, Xiao-Qian

    2007-04-01

    disequilibrium. HLA-A30-B13, A33-B58, A1-B37, A30-DR7, A33-DR13, A1-DR10, B37-DR10, B8-DR17, B13-DR7, B58-DR17 were significant positive linkage disequilibrium. It is concluded that this HLA-A, B, DRB1 gene and haplotype frequencies and linkage disequilibrium data with the largest sample size up to now is unique in north Chinese Han population. The study will be helpful to find matched donors for patients and establish the important foundation for further studying of transplantation immunity, HLA-related diseases and population genetics of this area.

  6. Comparison of two schemes for automatic keyword extraction from MEDLINE for functional gene clustering.

    PubMed

    Liu, Ying; Ciliax, Brian J; Borges, Karin; Dasigi, Venu; Ram, Ashwin; Navathe, Shamkant B; Dingledine, Ray

    2004-01-01

    One of the key challenges of microarray studies is to derive biological insights from the unprecedented quatities of data on gene-expression patterns. Clustering genes by functional keyword association can provide direct information about the nature of the functional links among genes within the derived clusters. However, the quality of the keyword lists extracted from biomedical literature for each gene significantly affects the clustering results. We extracted keywords from MEDLINE that describes the most prominent functions of the genes, and used the resulting weights of the keywords as feature vectors for gene clustering. By analyzing the resulting cluster quality, we compared two keyword weighting schemes: normalized z-score and term frequency-inverse document frequency (TFIDF). The best combination of background comparison set, stop list and stemming algorithm was selected based on precision and recall metrics. In a test set of four known gene groups, a hierarchical algorithm correctly assigned 25 of 26 genes to the appropriate clusters based on keywords extracted by the TDFIDF weighting scheme, but only 23 og 26 with the z-score method. To evaluate the effectiveness of the weighting schemes for keyword extraction for gene clusters from microarray profiles, 44 yeast genes that are differentially expressed during the cell cycle were used as a second test set. Using established measures of cluster quality, the results produced from TFIDF-weighted keywords had higher purity, lower entropy, and higher mutual information than those produced from normalized z-score weighted keywords. The optimized algorithms should be useful for sorting genes from microarray lists into functionally discrete clusters.

  7. Gene-environment interactions between smoking and a haplotype of RAI, ASE-1 and ERCC1 polymorphisms among women in relation to risk of lung cancer in a population-based study.

    PubMed

    Vogel, Ulla; Sørensen, Mette; Hansen, Rikke Dalgaard; Tjønneland, Anne; Overvad, Kim; Wallin, Håkan; Nexø, Bjørn A; Raaschou-Nielsen, Ole

    2007-03-08

    Homozygous carriers of a haplotype consisting of ERCC1 Asn118Asn(A), ASE-1 G-21A(G), RAI IVS1 A4364G(A) are at increased risk of lung cancer especially among women. Here, we analyse for gene-environment interactions with the predefined haplotype in a case cohort study including 428 lung cancer cases and a comparison group of 800 persons, all from the prospective Diet, Cancer and Health cohort of 57,000 Danes. At high smoking intensity (>20g tobacco/day), there was only additional risk of smoking intensity among women who were homozygous carriers of the haplotype (IRR=2.03; 95% CI: 1.10-3.73 per 5 additional g tobacco/day).

  8. Haplotypes in the gene encoding protein kinase c-beta (PRKCB1) on chromosome 16 are associated with autism.

    PubMed

    Philippi, A; Roschmann, E; Tores, F; Lindenbaum, P; Benajou, A; Germain-Leclerc, L; Marcaillou, C; Fontaine, K; Vanpeene, M; Roy, S; Maillard, S; Decaulne, V; Saraiva, J P; Brooks, P; Rousseau, F; Hager, J

    2005-10-01

    Autism is a developmental disorder characterized by impairments in social interaction and communication associated with repetitive patterns of interest or behavior. Autism is highly influenced by genetic factors. Genome-wide linkage and candidate gene association approaches have been used to try and identify autism genes. A few loci have repeatedly been reported linked to autism. Several groups reported evidence for linkage to a region on chromosome 16p. We have applied a direct physical identity-by-descent (IBD) mapping approach to perform a high-density (0.85 megabases) genome-wide linkage scan in 116 families from the AGRE collection. Our results confirm linkage to a region on chromosome 16p with autism. High-resolution single-nucleotide polymorphism (SNP) genotyping and analysis of this region show that haplotypes in the protein kinase c-beta gene are strongly associated with autism. An independent replication of the association in a second set of 167 trio families with autism confirmed our initial findings. Overall, our data provide evidence that the PRKCB1 gene on chromosome 16p may be involved in the etiology of autism.

  9. GraphTeams: a method for discovering spatial gene clusters in Hi-C sequencing data.

    PubMed

    Schulz, Tizian; Stoye, Jens; Doerr, Daniel

    2018-05-08

    Hi-C sequencing offers novel, cost-effective means to study the spatial conformation of chromosomes. We use data obtained from Hi-C experiments to provide new evidence for the existence of spatial gene clusters. These are sets of genes with associated functionality that exhibit close proximity to each other in the spatial conformation of chromosomes across several related species. We present the first gene cluster model capable of handling spatial data. Our model generalizes a popular computational model for gene cluster prediction, called δ-teams, from sequences to graphs. Following previous lines of research, we subsequently extend our model to allow for several vertices being associated with the same label. The model, called δ-teams with families, is particular suitable for our application as it enables handling of gene duplicates. We develop algorithmic solutions for both models. We implemented the algorithm for discovering δ-teams with families and integrated it into a fully automated workflow for discovering gene clusters in Hi-C data, called GraphTeams. We applied it to human and mouse data to find intra- and interchromosomal gene cluster candidates. The results include intrachromosomal clusters that seem to exhibit a closer proximity in space than on their chromosomal DNA sequence. We further discovered interchromosomal gene clusters that contain genes from different chromosomes within the human genome, but are located on a single chromosome in mouse. By identifying δ-teams with families, we provide a flexible model to discover gene cluster candidates in Hi-C data. Our analysis of Hi-C data from human and mouse reveals several known gene clusters (thus validating our approach), but also few sparsely studied or possibly unknown gene cluster candidates that could be the source of further experimental investigations.

  10. Clusters of antibiotic resistance genes enriched together stay together in swine agriculture

    DOE PAGES

    Johnson, Timothy A.; Stedtfeld, Robert D.; Wang, Qiong; ...

    2016-04-12

    Antibiotic resistance is a worldwide health risk, but the influence of animal agriculture on the genetic context and enrichment of individual antibiotic resistance alleles remains unclear. Using quantitative PCR followed by amplicon sequencing, we quantified and sequenced 44 genes related to antibiotic resistance, mobile genetic elements, and bacterial phylogeny in microbiomes from U.S. laboratory swine and from swine farms from three Chinese regions. We identified highly abundant resistance clusters: groups of resistance and mobile genetic element alleles that cooccur. For example, the abundance of genes conferring resistance to six classes of antibiotics together with class 1 integrase and the abundancemore » of IS6100-type transposons in three Chinese regions are directly correlated. These resistance cluster genes likely colocalize in microbial genomes in the farms. Resistance cluster alleles were dramatically enriched (up to 1 to 10% as abundant as 16S rRNA) and indicate that multidrug-resistant bacteria are likely the norm rather than an exception in these communities. This enrichment largely occurred independently of phylogenetic composition; thus, resistance clusters are likely present in many bacterial taxa. Furthermore, resistance clusters contain resistance genes that confer resistance to antibiotics independently of their particular use on the farms. Selection for these clusters is likely due to the use of only a subset of the broad range of chemicals to which the clusters confer resistance. The scale of animal agriculture and its wastes, the enrichment and horizontal gene transfer potential of the clusters, and the vicinity of large human populations suggest that managing this resistance reservoir is important for minimizing human risk.Agricultural antibiotic use results in clusters of cooccurring resistance genes that together confer resistance to multiple antibiotics. The use of a single antibiotic could select for an entire suite of resistance

  11. Clusters of antibiotic resistance genes enriched together stay together in swine agriculture

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Johnson, Timothy A.; Stedtfeld, Robert D.; Wang, Qiong

    Antibiotic resistance is a worldwide health risk, but the influence of animal agriculture on the genetic context and enrichment of individual antibiotic resistance alleles remains unclear. Using quantitative PCR followed by amplicon sequencing, we quantified and sequenced 44 genes related to antibiotic resistance, mobile genetic elements, and bacterial phylogeny in microbiomes from U.S. laboratory swine and from swine farms from three Chinese regions. We identified highly abundant resistance clusters: groups of resistance and mobile genetic element alleles that cooccur. For example, the abundance of genes conferring resistance to six classes of antibiotics together with class 1 integrase and the abundancemore » of IS6100-type transposons in three Chinese regions are directly correlated. These resistance cluster genes likely colocalize in microbial genomes in the farms. Resistance cluster alleles were dramatically enriched (up to 1 to 10% as abundant as 16S rRNA) and indicate that multidrug-resistant bacteria are likely the norm rather than an exception in these communities. This enrichment largely occurred independently of phylogenetic composition; thus, resistance clusters are likely present in many bacterial taxa. Furthermore, resistance clusters contain resistance genes that confer resistance to antibiotics independently of their particular use on the farms. Selection for these clusters is likely due to the use of only a subset of the broad range of chemicals to which the clusters confer resistance. The scale of animal agriculture and its wastes, the enrichment and horizontal gene transfer potential of the clusters, and the vicinity of large human populations suggest that managing this resistance reservoir is important for minimizing human risk.Agricultural antibiotic use results in clusters of cooccurring resistance genes that together confer resistance to multiple antibiotics. The use of a single antibiotic could select for an entire suite of resistance

  12. Clusters of Antibiotic Resistance Genes Enriched Together Stay Together in Swine Agriculture.

    PubMed

    Johnson, Timothy A; Stedtfeld, Robert D; Wang, Qiong; Cole, James R; Hashsham, Syed A; Looft, Torey; Zhu, Yong-Guan; Tiedje, James M

    2016-04-12

    Antibiotic resistance is a worldwide health risk, but the influence of animal agriculture on the genetic context and enrichment of individual antibiotic resistance alleles remains unclear. Using quantitative PCR followed by amplicon sequencing, we quantified and sequenced 44 genes related to antibiotic resistance, mobile genetic elements, and bacterial phylogeny in microbiomes from U.S. laboratory swine and from swine farms from three Chinese regions. We identified highly abundant resistance clusters: groups of resistance and mobile genetic element alleles that cooccur. For example, the abundance of genes conferring resistance to six classes of antibiotics together with class 1 integrase and the abundance of IS6100-type transposons in three Chinese regions are directly correlated. These resistance cluster genes likely colocalize in microbial genomes in the farms. Resistance cluster alleles were dramatically enriched (up to 1 to 10% as abundant as 16S rRNA) and indicate that multidrug-resistant bacteria are likely the norm rather than an exception in these communities. This enrichment largely occurred independently of phylogenetic composition; thus, resistance clusters are likely present in many bacterial taxa. Furthermore, resistance clusters contain resistance genes that confer resistance to antibiotics independently of their particular use on the farms. Selection for these clusters is likely due to the use of only a subset of the broad range of chemicals to which the clusters confer resistance. The scale of animal agriculture and its wastes, the enrichment and horizontal gene transfer potential of the clusters, and the vicinity of large human populations suggest that managing this resistance reservoir is important for minimizing human risk. Agricultural antibiotic use results in clusters of cooccurring resistance genes that together confer resistance to multiple antibiotics. The use of a single antibiotic could select for an entire suite of resistance genes if

  13. Modeling coverage gaps in haplotype frequencies via Bayesian inference to improve stem cell donor selection.

    PubMed

    Louzoun, Yoram; Alter, Idan; Gragert, Loren; Albrecht, Mark; Maiers, Martin

    2018-05-01

    Regardless of sampling depth, accurate genotype imputation is limited in regions of high polymorphism which often have a heavy-tailed haplotype frequency distribution. Many rare haplotypes are thus unobserved. Statistical methods to improve imputation by extending reference haplotype distributions using linkage disequilibrium patterns that relate allele and haplotype frequencies have not yet been explored. In the field of unrelated stem cell transplantation, imputation of highly polymorphic human leukocyte antigen (HLA) genes has an important application in identifying the best-matched stem cell donor when searching large registries totaling over 28,000,000 donors worldwide. Despite these large registry sizes, a significant proportion of searched patients present novel HLA haplotypes. Supporting this observation, HLA population genetic models have indicated that many extant HLA haplotypes remain unobserved. The absent haplotypes are a significant cause of error in haplotype matching. We have applied a Bayesian inference methodology for extending haplotype frequency distributions, using a model where new haplotypes are created by recombination of observed alleles. Applications of this joint probability model offer significant improvement in frequency distribution estimates over the best existing alternative methods, as we illustrate using five-locus HLA frequency data from the National Marrow Donor Program registry. Transplant matching algorithms and disease association studies involving phasing and imputation of rare variants may benefit from this statistical inference framework.

  14. Challenges in microarray class discovery: a comprehensive examination of normalization, gene selection and clustering

    PubMed Central

    2010-01-01

    Background Cluster analysis, and in particular hierarchical clustering, is widely used to extract information from gene expression data. The aim is to discover new classes, or sub-classes, of either individuals or genes. Performing a cluster analysis commonly involve decisions on how to; handle missing values, standardize the data and select genes. In addition, pre-processing, involving various types of filtration and normalization procedures, can have an effect on the ability to discover biologically relevant classes. Here we consider cluster analysis in a broad sense and perform a comprehensive evaluation that covers several aspects of cluster analyses, including normalization. Result We evaluated 2780 cluster analysis methods on seven publicly available 2-channel microarray data sets with common reference designs. Each cluster analysis method differed in data normalization (5 normalizations were considered), missing value imputation (2), standardization of data (2), gene selection (19) or clustering method (11). The cluster analyses are evaluated using known classes, such as cancer types, and the adjusted Rand index. The performances of the different analyses vary between the data sets and it is difficult to give general recommendations. However, normalization, gene selection and clustering method are all variables that have a significant impact on the performance. In particular, gene selection is important and it is generally necessary to include a relatively large number of genes in order to get good performance. Selecting genes with high standard deviation or using principal component analysis are shown to be the preferred gene selection methods. Hierarchical clustering using Ward's method, k-means clustering and Mclust are the clustering methods considered in this paper that achieves the highest adjusted Rand. Normalization can have a significant positive impact on the ability to cluster individuals, and there are indications that background correction is

  15. Chromatin organization and global regulation of Hox gene clusters

    PubMed Central

    Montavon, Thomas; Duboule, Denis

    2013-01-01

    During development, a properly coordinated expression of Hox genes, within their different genomic clusters is critical for patterning the body plans of many animals with a bilateral symmetry. The fascinating correspondence between the topological organization of Hox clusters and their transcriptional activation in space and time has served as a paradigm for understanding the relationships between genome structure and function. Here, we review some recent observations, which revealed highly dynamic changes in the structure of chromatin at Hox clusters, in parallel with their activation during embryonic development. We discuss the relevance of these findings for our understanding of large-scale gene regulation. PMID:23650639

  16. The clinical application of single-sperm-based SNP haplotyping for PGD of osteogenesis imperfecta.

    PubMed

    Chen, Linjun; Diao, Zhenyu; Xu, Zhipeng; Zhou, Jianjun; Yan, Guijun; Sun, Haixiang

    2018-05-15

    Osteogenesis imperfecta (OI) is a genetically heterogeneous disorder, presenting either autosomal dominant, autosomal recessive or X-linked inheritance patterns. The majority of OI cases are autosomal dominant and are caused by heterozygous mutations in either the COL1A1 or COL1A2 gene. In these dominant disorders, allele dropout (ADO) can lead to misdiagnosis in preimplantation genetic diagnosis (PGD). Polymorphic markers linked to the mutated genes have been used to establish haplotypes for identifying ADO and ensuring the accuracy of PGD. However, the haplotype of male patients cannot be determined without data from affected relatives. Here, we developed a method for single-sperm-based single-nucleotide polymorphism (SNP) haplotyping via next-generation sequencing (NGS) for the PGD of OI. After NGS, 10 informative polymorphic SNP markers located upstream and downstream of the COL1A1 gene and its pathogenic mutation site were linked to individual alleles in a single sperm from an affected male. After haplotyping, a normal blastocyst was transferred to the uterus for a subsequent frozen embryo transfer cycle. The accuracy of PGD was confirmed by amniocentesis at 19 weeks of gestation. A healthy infant weighing 4,250 g was born via vaginal delivery at the 40th week of gestation. Single-sperm-based SNP haplotyping can be applied for PGD of any monogenic disorders or de novo mutations in males in whom the haplotype of paternal mutations cannot be determined due to a lack of affected relatives. ADO: allele dropout; DI: dentinogenesis imperfect; ESHRE: European Society of Human Reproduction and Embryology; FET: frozen embryo transfer; gDNA: genomic DNA; ICSI: intracytoplasmic sperm injection; IVF: in vitro fertilization; MDA: multiple displacement amplification; NGS: next-generation sequencing; OI: osteogenesis imperfect; PBS: phosphate buffer saline; PCR: polymerase chain reaction; PGD: preimplantation genetic diagnosis; SNP: single-nucleotide polymorphism; STR

  17. The hepcidin gene promoter nc.-1010C > T; -582A > G haplotype modulates serum ferritin in individuals carrying the common H63D mutation in HFE gene.

    PubMed

    Silva, Bruno; Pita, Lina; Gomes, Susana; Gonçalves, João; Faustino, Paula

    2014-12-01

    Hereditary hemochromatosis is an autosomal recessive disorder characterized by severe iron overload. It is usually associated with homozygosity for the HFE gene mutation c.845G > A; p.C282Y. However, in some cases, another HFE mutation (c.187C > G; p.H63D) seems to be associated with the disease. Its penetrance is very low, suggesting the possibility of other iron genetic modulators being involved. In this work, we have screened for HAMP promoter polymorphisms in 409 individuals presenting normal or increased serum ferritin levels together with normal or H63D-mutated HFE genotypes. Our results show that the hepcidin gene promoter TG haplotype, originated by linkage of the nc.-1010C > T and nc.-582A > G polymorphisms, is more frequent in the HFE_H63D individuals presenting serum ferritin levels higher than 300 μg/L than in those presenting the HFE_H63D mutation but with normal serum ferritin levels or in the normal control group.Moreover, it was observed that the TG haplotype was associated to increased serum ferritin levels in the overall pool of HFE_H63D individuals. Thus, our data suggest that screening for these polymorphisms could be of interest in order to explain the phenotype. However, this genetic condition seems to have no clinical significance.

  18. Dimensional Anxiety Mediates Linkage of GABRA2 Haplotypes With Alcoholism

    PubMed Central

    Enoch, Mary-Anne; Schwartz, Lori; Albaugh, Bernard; Virkkunen, Matti; Goldman, David

    2015-01-01

    The GABAAα2 receptor gene (GABRA2) modulates anxiety and stress response. Three recent association studies implicate GABRA2 in alcoholism, however in these papers both common, opposite-configuration haplotypes in the region distal to intron3 predict risk. We have now replicated the GABRA2 association with alcoholism in 331 Plains Indian men and women and 461 Finnish Caucasian men. Using a dimensional measure of anxiety, harm avoidance (HA), we also found that the association with alcoholism is mediated, or moderated, by anxiety. Nine SNPs were genotyped revealing two haplotype blocks. Within the previously implicated block 2 region, we identified the two common, opposite-configuration risk haplotypes, A and B. Their frequencies differed markedly in Finns and Plains Indians. In both populations, most block 2 SNPs were significantly associated with alcoholism. The associations were due to increased frequencies of both homozygotes in alcoholics, indicating the possibility of alcoholic subtypes with opposite genotypes. Congruently, there was no significant haplotype association. Using HA as an indicator variable for anxiety, we found haplotype linkage to alcoholism with high and low dimensional anxiety, and to HA itself, in both populations. High HA alcoholics had the highest frequency of the more abundant haplotype (A in Finns, B in Plains Indians); low HA alcoholics had the highest frequency of the less abundant haplotype (B in Finns, A in Plains Indians) (Finns: P α0.007, OR α2.1, Plains Indians: P α0.040, OR α1.9). Non-alcoholics had intermediate frequencies. Our results suggest that within the distal GABRA2 region is a functional locus or loci that may differ between populations but that alters risk for alcoholism via the mediating action of anxiety. PMID:16874763

  19. Multiple genetic origins of histidine-rich protein 2 gene deletion in Plasmodium falciparum parasites from Peru

    PubMed Central

    Akinyi, Sheila; Hayden, Tonya; Gamboa, Dionicia; Torres, Katherine; Bendezu, Jorge; Abdallah, Joseph F.; Griffing, Sean M.; Quezada, Wilmer Marquiño; Arrospide, Nancy; De Oliveira, Alexandre Macedo; Lucas, Carmen; Magill, Alan J.; Bacon, David J.; Barnwell, John W.; Udhayakumar, Venkatachalam

    2013-01-01

    The majority of malaria rapid diagnostic tests (RDTs) detect Plasmodium falciparum histidine-rich protein 2 (PfHRP2), encoded by the pfhrp2 gene. Recently, P. falciparum isolates from Peru were found to lack pfhrp2 leading to false-negative RDT results. We hypothesized that pfhrp2-deleted parasites in Peru derived from a single genetic event. We evaluated the parasite population structure and pfhrp2 haplotype of samples collected between 1998 and 2005 using seven neutral and seven chromosome 8 microsatellite markers, respectively. Five distinct pfhrp2 haplotypes, corresponding to five neutral microsatellite-based clonal lineages, were detected in 1998-2001; pfhrp2 deletions occurred within four haplotypes. In 2003-2005, outcrossing among the parasite lineages resulted in eight population clusters that inherited the five pfhrp2 haplotypes seen previously and a new haplotype; pfhrp2 deletions occurred within four of these haplotypes. These findings indicate that the genetic origin of pfhrp2 deletion in Peru was not a single event, but likely occurred multiple times. PMID:24077522

  20. Haplotype Frequency Distribution in Northeastern European Saduria entomon (Crustacea: Isopoda) Populations. A Phylogeographic Approach

    NASA Astrophysics Data System (ADS)

    Sell, Jerzy

    2003-11-01

    The distribution pattern of mtDNA haplotypes in distinct populations of the glacial relict crustacean Saduria entomon was examined to assess phylogeographic relationships among them. Populations from the Baltic, the White Sea and the Barents Sea were screened for mtDNA variation using PCR-based RFLP analysis of a 1150 bp fragment containing part of the CO I and CO II genes. Five mtDNA haplotypes were recorded. An analysis of geographical heterogeneity in haplotype frequency distributions revealed significant differences among populations. The isolated populations of S. entomon have diverged since the retreat of the last glaciation. The geographical pattern of variation is most likely the result of stochastic (founder effect, genetic drift) mechanisms and suggests that the haplotype differentiation observed is probably older than the isolation of the Baltic and Arctic seas.

  1. Very long haplotype tracts characterized at high resolution from HLA homozygous cell lines

    PubMed Central

    Norman, Paul J.; Norberg, Steve; Nemat-Gorgani, Neda; Royce, Thomas; Hollenbach, Jill A.; Won, Melissa Shults; Guethlein, Lisbeth A.; Gunderson, Kevin L.; Ronaghi, Mostafa; Parham, Peter

    2015-01-01

    The HLA region of chromosome 6 contains the most polymorphic genes in humans. Spanning ~5Mbp the densely packed region encompasses approximately 175 expressed genes including the highly polymorphic HLA class I and II loci. Most of the other genes and functional elements are also polymorphic, and many of them are directly implicated in immune function or immune-related disease. For these reasons this complex genomic region is subject to intense scrutiny by researchers with the common goal of aiding further understanding and diagnoses of multiple immune-related diseases and syndromes. To aid assay development and characterization of the classical loci, a panel of cell lines partially or fully homozygous for HLA class I and II was assembled over time by the International Histocompatibility Working Group (IHWG). Containing a minimum of 88 unique HLA haplotypes, we show this panel represents a significant proportion of European HLA allelic and haplotype diversity (60–95%). Using a high-density whole genome array that includes 13,331 HLA region SNPs, we analyzed 99 IHWG cells to map the coordinates of the homozygous tracts at a fine scale. The mean homozygous tract length within chromosome 6 from these individuals is 21Mbp. Within HLA the mean haplotype length is 4.3Mbp, and 65% of the cell lines were shown to be homozygous throughout the entire region. In addition, four cell lines are homozygous throughout the complex KIR region of chromosome 19 (~250kbp). The data we describe will provide a valuable resource for characterizing haplotypes, designing and refining imputation algorithms and developing assay controls. PMID:26198775

  2. Haplotypes in the CRP Gene Associated with Increased BMI and Levels of CRP in Subjects with Type 2 Diabetes or Obesity from Southwestern Mexico

    PubMed Central

    Martínez-Calleja, América; Quiróz-Vargas, Irma; Parra-Rojas, Isela; Muñoz-Valle, José Francisco; Leyva-Vázquez, Marco A.; Fernández-Tilapa, Gloria; Vences-Velázquez, Amalia; Cruz, Miguel; Salazar-Martínez, Eduardo; Flores-Alfaro, Eugenia

    2012-01-01

    Objective. We evaluated the association between four polymorphisms in the CRP gene with circulating levels of C-reactive protein (CRP), type 2 diabetes (T2D), obesity, and risk score of coronary heart disease. Methods. We studied 402 individuals and classified them into four groups: healthy, obese, T2D obese, and T2D without obesity, from Guerrero, Southwestern Mexico. Blood levels of CRP, glucose, cholesterol, triglycerides, and leukocytes were measured. Genotyping was performed by PCR/RFLP, and the risk score for coronary heart disease was determined by the Framingham's methodology. Results. The TT genotype of SNP rs1130864 was associated with increased body mass index and T2D patients with obesity. We found that the haplotype 2 (TGAG) was associated with increased levels of CRP (β = 0.3; 95%CI: 0.1, 0.5; P = 0.005) and haplotype 7 (TGGG) with higher body mass index (BMI) (β = 0.2; 95%CI: 0.1, 0.3; P < 0.001). The risk score for coronary heart disease was associated with increased levels of CRP, but not with any polymorphism or haplotype. Conclusions. The association between the TT genotype of SNP rs1130864 with obesity and the haplotype 7 with BMI may explain how obesity and genetic predisposition increase the risk of diseases such as T2D in the population of Southwestern Mexico. PMID:23049543

  3. Haplotypes in the CRP gene associated with increased BMI and levels of CRP in subjects with type 2 diabetes or obesity from Southwestern Mexico.

    PubMed

    Martínez-Calleja, América; Quiróz-Vargas, Irma; Parra-Rojas, Isela; Muñoz-Valle, José Francisco; Leyva-Vázquez, Marco A; Fernández-Tilapa, Gloria; Vences-Velázquez, Amalia; Cruz, Miguel; Salazar-Martínez, Eduardo; Flores-Alfaro, Eugenia

    2012-01-01

    We evaluated the association between four polymorphisms in the CRP gene with circulating levels of C-reactive protein (CRP), type 2 diabetes (T2D), obesity, and risk score of coronary heart disease. We studied 402 individuals and classified them into four groups: healthy, obese, T2D obese, and T2D without obesity, from Guerrero, Southwestern Mexico. Blood levels of CRP, glucose, cholesterol, triglycerides, and leukocytes were measured. Genotyping was performed by PCR/RFLP, and the risk score for coronary heart disease was determined by the Framingham's methodology. The TT genotype of SNP rs1130864 was associated with increased body mass index and T2D patients with obesity. We found that the haplotype 2 (TGAG) was associated with increased levels of CRP (β = 0.3; 95%CI: 0.1, 0.5; P = 0.005) and haplotype 7 (TGGG) with higher body mass index (BMI) (β = 0.2; 95%CI: 0.1, 0.3; P < 0.001). The risk score for coronary heart disease was associated with increased levels of CRP, but not with any polymorphism or haplotype. The association between the TT genotype of SNP rs1130864 with obesity and the haplotype 7 with BMI may explain how obesity and genetic predisposition increase the risk of diseases such as T2D in the population of Southwestern Mexico.

  4. New and Common Haplotypes Shape Genetic Diversity in Asian Tiger Mosquito Populations from Costa Rica and Panamá.

    PubMed

    Futami, K; Valderrama, A; Baldi, M; Minakawa, N; Marín Rodríguez, R; Chaves, L F

    2015-04-01

    The Asian tiger mosquito, Aedes albopictus (Skuse) (Diptera: Culicidae), is a vector of several human pathogens. Ae. albopictus is also an invasive species that, over recent years, has expanded its range out of its native Asia. Ae. albopictus was suspected to be present in Central America since the 1990s, and its presence was confirmed by most Central American nations by 2010. Recently, this species has been regularly found, yet in low numbers, in limited areas of Panamá and Costa Rica (CR). Here, we report that short sequences (∼558 bp) of the mitochondrial cytochrome oxidase subunit 1 (COI) and NADH dehydrogenase subunit 5 genes of Ae. albopictus, had no haplotype diversity. Instead, there was a common haplotype for each gene in both CR and Panamá. In contrast, a long COI sequence (∼1,390 bp) revealed that haplotype diversity (±SD) was relatively high in CR (0.72±0.04) when compared with Panamá (0.33±0.13), below the global estimate for reported samples (0.89±0.01). The long COI sequence allowed us to identify seven (five new) haplotypes in CR and two (one new) in Panamá. A haplotype network for the long COI gene sequence showed that samples from CR and Panamá belong to a single large group. The long COI gene sequences suggest that haplotypes in Panamá and CR, although similar to each other, had a significant geographic differentiation (Kst=1.33; P<0.001). Thus, most of our results suggest a recent range expansion in CR and Panamá. © The Authors 2015. Published by Oxford University Press on behalf of Entomological Society of America. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  5. Fragmentation of an aflatoxin-like gene cluster in a forest pathogen

    USDA-ARS?s Scientific Manuscript database

    Secondary metabolic pathway genes are typically clustered in fungi. An exception to this paradigm is seen for genes required for the production of dothistromin, an aflatoxin-like virulence factor produced by the pine needle pathogen Dothistroma septosporum. In contrast to the tight clustering of gen...

  6. Two families from New England with usher syndrome type IC with distinct haplotypes.

    PubMed

    DeAngelis, M M; McGee, T L; Keats, B J; Slim, R; Berson, E L; Dryja, T P

    2001-03-01

    To search for patients with Usher syndrome type IC among those with Usher syndrome type I who reside in New England. Genotype analysis of microsatellite markers closely linked to the USH1C locus was done using the polymerase chain reaction. We compared the haplotype of our patients who were homozygous in the USH1C region with the haplotypes found in previously reported USH1C Acadian families who reside in southwestern Louisiana and from a single family residing in Lebanon. Of 46 unrelated cases of Usher syndrome type I residing in New England, two were homozygous at genetic markers in the USH1C region. Of these, one carried the Acadian USH1C haplotype and had Acadian ancestors (that is, from Nova Scotia) who did not participate in the 1755 migration of Acadians to Louisiana. The second family had a haplotype that proved to be the same as that of a family with USH1C residing in Lebanon. Each of the two families had haplotypes distinct from the other. This is the first report that some patients residing in New England have Usher syndrome type IC. Patients with Usher syndrome type IC can have the Acadian haplotype or the Lebanese haplotype compatible with the idea that at least two independently arising pathogenic mutations have occurred in the yet-to-be identified USH1C gene.

  7. Polymorphisms in the LPL and CETP Genes and Haplotype in the ESR1 Gene Are Associated with Metabolic Syndrome in Women from Southwestern Mexico.

    PubMed

    Cahua-Pablo, José Ángel; Cruz, Miguel; Méndez-Palacios, Abigail; Antúnez-Ortiz, Diana Lizzete; Vences-Velázquez, Amalia; del Carmen Alarcón-Romero, Luz; Parra, Esteban Juan; Tello-Flores, Vianet Argelia; Leyva-Vázquez, Marco Antonio; Valladares-Salgado, Adán; Pérez-Macedonio, Claudia Paola; Flores-Alfaro, Eugenia

    2015-09-08

    Metabolic syndrome (MetS) is a combination of metabolic disorders associated with an increased risk for cardiovascular disease (CVD). Studies in women reported associations between polymorphisms in ESR1, LPL and CETP genes and MetS. Our aim was to evaluate the association between variants in ESR1, LPL and CETP genes with MetS and its components. Four hundred and eighty women were analyzed, anthropometric features and biochemical profiles were evaluated, and genotyping was performed by real-time PCR. We found an association with elevated glucose levels (odds ratio (OR) = 2.9; p = 0.013) in carrying the AA genotype of rs1884051 in the ESR1 gene compared with the GG genotype, and the CC genotype of rs328 in the LPL gene was associated with MetS compared to the CG or GG genotype (OR = 2.8; p = 0.04). Moreover, the GA genotype of rs708272 in the CETP gene is associated with MetS compared to the GG or AA genotype (OR = 1.8; p = 0.006). In addition the ACTCCG haplotype in the ESR1 gene is associated with a decrease in the risk of MetS (OR = 0.02; p < 0.001). In conclusion, our results show the involvement of the variants of ESR1, LPL and CETP genes in metabolic events related to MetS or some of its features.

  8. Polymorphisms in the LPL and CETP Genes and Haplotype in the ESR1 Gene Are Associated with Metabolic Syndrome in Women from Southwestern Mexico

    PubMed Central

    Cahua-Pablo, José Ángel; Cruz, Miguel; Méndez-Palacios, Abigail; Antúnez-Ortiz, Diana Lizzete; Vences-Velázquez, Amalia; del Carmen Alarcón-Romero, Luz; Parra, Esteban Juan; Tello-Flores, Vianet Argelia; Leyva-Vázquez, Marco Antonio; Valladares-Salgado, Adán; Pérez-Macedonio, Claudia Paola; Flores-Alfaro, Eugenia

    2015-01-01

    Metabolic syndrome (MetS) is a combination of metabolic disorders associated with an increased risk for cardiovascular disease (CVD). Studies in women reported associations between polymorphisms in ESR1, LPL and CETP genes and MetS. Our aim was to evaluate the association between variants in ESR1, LPL and CETP genes with MetS and its components. Four hundred and eighty women were analyzed, anthropometric features and biochemical profiles were evaluated, and genotyping was performed by real-time PCR. We found an association with elevated glucose levels (odds ratio (OR) = 2.9; p = 0.013) in carrying the AA genotype of rs1884051 in the ESR1 gene compared with the GG genotype, and the CC genotype of rs328 in the LPL gene was associated with MetS compared to the CG or GG genotype (OR = 2.8; p = 0.04). Moreover, the GA genotype of rs708272 in the CETP gene is associated with MetS compared to the GG or AA genotype (OR = 1.8; p = 0.006). In addition the ACTCCG haplotype in the ESR1 gene is associated with a decrease in the risk of MetS (OR = 0.02; p < 0.001). In conclusion, our results show the involvement of the variants of ESR1, LPL and CETP genes in metabolic events related to MetS or some of its features. PMID:26370976

  9. HLA-DR2-associated DRB1 and DRB5 alleles and haplotypes in Koreans.

    PubMed

    Song, E Y; Kang, S J; Lee, Y J; Park, M H

    2000-09-01

    There are considerable racial differences in the distribution of HLA-DR2-associated DRB1 and DRB5 alleles and the characteristics of linkage disequilibrium between these alleles. In this study, the frequencies of DR2-associated DRB1 and DRB5 alleles and related haplotypes were analyzed in 186 DR2-positive individuals out of 800 normal Koreans registered for unrelated bone marrow donors. HLA class I antigen typing was performed by the serological method and DRB1 and DRB5 genotyping by the PCR-single strand conformational polymorphism method. Only 3 alleles were detected for DR2-associated DRB1 and DRB5 genes, respectively: DRB1(*)1501 (gene frequency 8.0%), (*)1502 (3.2%), (*)1602 (0.9%); DRB5(*)0101 (8.0%), (*)0102 (3.2%), and (*)0202 (0.9%). DRB1-DRB5 haplotype analysis showed an exclusive association between these alleles: DRB1*1501-DRB5*0101 (haplotype frequency 8.0%), DRB1(*)1502-DRB5(*)0102 (3.2%), and DRB1(*)1602-DRB5(*)0202 (0.9%). The 5 most common DR2-associated A-B-DRB1 haplotypes occurring at frequencies of > or = 0.5% were A24-B52-DRB1(*)1502 (1.8%), A2-B62-DRB1(*)1501, A2-B54-DRB1(*)1501, A26-B61-DRB1(*)1501, and A24-B51-DRB1(*)1501. The remarkable homogeneity in the haplotypic associations between DR2-associated DRB1 and DRB5 alleles in Koreans would be advantageous for organ transplantation compared with other ethnic groups showing considerable heterogeneity in the distribution of DRB1-DRB5 haplotypes.

  10. Meiotic recombination generates rich diversity in NK cell receptor genes, alleles, and haplotypes

    PubMed Central

    Norman, Paul J.; Abi-Rached, Laurent; Gendzekhadze, Ketevan; Hammond, John A.; Moesta, Achim K.; Sharma, Deepti; Graef, Thorsten; McQueen, Karina L.; Guethlein, Lisbeth A.; Carrington, Christine V.F.; Chandanayingyong, Dasdayanee; Chang, Yih-Hsin; Crespí, Catalina; Saruhan-Direskeneli, Güher; Hameed, Kamran; Kamkamidze, Giorgi; Koram, Kwadwo A.; Layrisse, Zulay; Matamoros, Nuria; Milà, Joan; Park, Myoung Hee; Pitchappan, Ramasamy M.; Ramdath, D. Dan; Shiau, Ming-Yuh; Stephens, Henry A.F.; Struik, Siske; Tyan, Dolly; Verity, David H.; Vaughan, Robert W.; Davis, Ronald W.; Fraser, Patricia A.; Riley, Eleanor M.; Ronaghi, Mostafa; Parham, Peter

    2009-01-01

    Natural killer (NK) cells contribute to the essential functions of innate immunity and reproduction. Various genes encode NK cell receptors that recognize the major histocompatibility complex (MHC) Class I molecules expressed by other cells. For primate NK cells, the killer-cell immunoglobulin-like receptors (KIR) are a variable and rapidly evolving family of MHC Class I receptors. Studied here is KIR3DL1/S1, which encodes receptors for highly polymorphic human HLA-A and -B and comprises three ancient allelic lineages that have been preserved by balancing selection throughout human evolution. While the 3DS1 lineage of activating receptors has been conserved, the two 3DL1 lineages of inhibitory receptors were diversified through inter-lineage recombination with each other and with 3DS1. Prominent targets for recombination were D0-domain polymorphisms, which modulate enhancer function, and dimorphism at position 283 in the D2 domain, which influences inhibitory function. In African populations, unequal crossing over between the 3DL1 and 3DL2 genes produced a deleted KIR haplotype in which the telomeric “half” was reduced to a single fusion gene with functional properties distinct from its 3DL1 and 3DL2 parents. Conversely, in Eurasian populations, duplication of the KIR3DL1/S1 locus by unequal crossing over has enabled individuals to carry and express alleles of all three KIR3DL1/S1 lineages. These results demonstrate how meiotic recombination combines with an ancient, preserved diversity to create new KIR phenotypes upon which natural selection acts. A consequence of such recombination is to blur the distinction between alleles and loci in the rapidly evolving human KIR gene family. PMID:19411600

  11. Unique haplotypes of cacao trees as revealed by trnH-psbA chloroplast DNA

    PubMed Central

    Gutiérrez-López, Nidia; Ovando-Medina, Isidro; Salvador-Figueroa, Miguel; Molina-Freaner, Francisco; Avendaño-Arrazate, Carlos H.

    2016-01-01

    Cacao trees have been cultivated in Mesoamerica for at least 4,000 years. In this study, we analyzed sequence variation in the chloroplast DNA trnH-psbA intergenic spacer from 28 cacao trees from different farms in the Soconusco region in southern Mexico. Genetic relationships were established by two analysis approaches based on geographic origin (five populations) and genetic origin (based on a previous study). We identified six polymorphic sites, including five insertion/deletion (indels) types and one transversion. The overall nucleotide diversity was low for both approaches (geographic = 0.0032 and genetic = 0.0038). Conversely, we obtained moderate to high haplotype diversity (0.66 and 0.80) with 10 and 12 haplotypes, respectively. The common haplotype (H1) for both networks included cacao trees from all geographic locations (geographic approach) and four genetic groups (genetic approach). This common haplotype (ancient) derived a set of intermediate haplotypes and singletons interconnected by one or two mutational steps, which suggested directional selection and event purification from the expansion of narrow populations. Cacao trees from Soconusco region were grouped into one cluster without any evidence of subclustering based on AMOVA (FST = 0) and SAMOVA (FST = 0.04393) results. One population (Mazatán) showed a high haplotype frequency; thus, this population could be considered an important reservoir of genetic material. The indels located in the trnH-psbA intergenic spacer of cacao trees could be useful as markers for the development of DNA barcoding. PMID:27076998

  12. Distribution of MICA alleles and haplotypes associated with HLA in the Korean population.

    PubMed

    Pyo, Chul-Woo; Hur, Seong-Suk; Kim, Yang-Kyum; Choi, Hee-Baeg; Kim, Tae-Yoon; Kim, Tai-Gyu

    2003-03-01

    The MICA (MHC class I chain-related gene A) is a polymorphic gene located 46 kb centromeric of the HLA-B gene, and is preferentially expressed in epithelial cells and intestinal mucosa. The MICA gene, similar to human leukocyte antigen (HLA) class I, displays a high degree of genetic polymorphism in exons 2, 3, 4, and 5, amounting to 54 alleles. In this study, we investigated the polymorphisms at exons coding for extracellular domains (exons 2, 3, and 4), and the GCT repeat polymorphism at the transmembrane (exon 5) of MICA in 199 unrelated healthy Koreans. Eight alleles were observed in the Korean population, with allele frequencies for MICA*010, MICA*00201, MICA*027, MICA*004, MICA*012, MICA*00801, MICA*00901, and MICA*00701 being 18.3%, 17.8%, 13.6%, 12.3%, 11.1%, 10.8%, 10.6%, and 3.3%, respectively. Strong linkage disequilibria were also observed between the MICA and HLA-B gene-MICA*00201-B58, MICA*004-B44, MICA*00701-B27, MICA*00801-B60, MICA*00901-B51, MICA*010-B62, MICA*012-B54, and MICA*027-B61. In the analysis of the haplotypes of HLA class I genes (HLA-A, B, and C) and the MICA, the most common haplotype was MICA*004-A33-B44-Cw*07, followed by MICA*00201-A2-B58-Cw*0302 and MICA*012-A2-B54-Cw*0102. The MICA null haplotype might be identified in the HLA-B48 homozygous individual. These results will provide an understanding of the role of MICA in transplantation, disease association, and population analyses in Koreans.

  13. High-throughput platform for the discovery of elicitors of silent bacterial gene clusters.

    PubMed

    Seyedsayamdost, Mohammad R

    2014-05-20

    Over the past decade, bacterial genome sequences have revealed an immense reservoir of biosynthetic gene clusters, sets of contiguous genes that have the potential to produce drugs or drug-like molecules. However, the majority of these gene clusters appear to be inactive for unknown reasons prompting terms such as "cryptic" or "silent" to describe them. Because natural products have been a major source of therapeutic molecules, methods that rationally activate these silent clusters would have a profound impact on drug discovery. Herein, a new strategy is outlined for awakening silent gene clusters using small molecule elicitors. In this method, a genetic reporter construct affords a facile read-out for activation of the silent cluster of interest, while high-throughput screening of small molecule libraries provides potential inducers. This approach was applied to two cryptic gene clusters in the pathogenic model Burkholderia thailandensis. The results not only demonstrate a prominent activation of these two clusters, but also reveal that the majority of elicitors are themselves antibiotics, most in common clinical use. Antibiotics, which kill B. thailandensis at high concentrations, act as inducers of secondary metabolism at low concentrations. One of these antibiotics, trimethoprim, served as a global activator of secondary metabolism by inducing at least five biosynthetic pathways. Further application of this strategy promises to uncover the regulatory networks that activate silent gene clusters while at the same time providing access to the vast array of cryptic molecules found in bacteria.

  14. Molecular definition of red cell Rh haplotypes by tightly linked SphI RFLPs

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Huang, C.H.; Reid, M.E.; Chen, Y.

    The Rh blood group system of human red cells contains five major antigens D, C/c, and E/e (the latter four designated {open_quotes}non-D{close_quotes}) that are specified by eight gene complexes known as Rh haplotypes. In this paper, we report on the mapping of the RH locus and identification of a set of SphI RFLPs that are tightly linked with the Rh structural genes. Using exon-specific probes, we have localized the SphI cleavage sites resulting in these DNA markers and derived a comprehensive map for the RH locus. It was found that the SphI fragments encompassing exons 4-7 of the Rh genesmore » occur in four banding patterns or frameworks that correspond to the distribution and segregation of the common Rh haplotypes. This linkage disequilibrium allowed a genotype-phenotype correlation and direct determination of Rh zygosity related to the Rh-positive or Rh-negative status (D/D, D/d, and d/d). Studies on the occurrence of SphI RFLPs in a number of rare Rh variants indicated that Rh phenotypic diversity has taken place on different haplotype backgrounds and has arisen by diverse genetic mechanisms. The molecular definition of Rh haplotypes by SphI RFLP frameworks should provide a useful procedure for genetic counseling and prenatal assessment of Rh alloimmunization. 32 refs., 7 figs., 3 tabs.« less

  15. Genome-Wide Prediction of Metabolic Enzymes, Pathways, and Gene Clusters in Plants1[OPEN

    PubMed Central

    Zhang, Peifen; Kim, Taehyong; Banf, Michael; Chavali, Arvind K.; Nilo-Poyanco, Ricardo; Bernard, Thomas

    2017-01-01

    Plant metabolism underpins many traits of ecological and agronomic importance. Plants produce numerous compounds to cope with their environments but the biosynthetic pathways for most of these compounds have not yet been elucidated. To engineer and improve metabolic traits, we need comprehensive and accurate knowledge of the organization and regulation of plant metabolism at the genome scale. Here, we present a computational pipeline to identify metabolic enzymes, pathways, and gene clusters from a sequenced genome. Using this pipeline, we generated metabolic pathway databases for 22 species and identified metabolic gene clusters from 18 species. This unified resource can be used to conduct a wide array of comparative studies of plant metabolism. Using the resource, we discovered a widespread occurrence of metabolic gene clusters in plants: 11,969 clusters from 18 species. The prevalence of metabolic gene clusters offers an intriguing possibility of an untapped source for uncovering new metabolite biosynthesis pathways. For example, more than 1,700 clusters contain enzymes that could generate a specialized metabolite scaffold (signature enzymes) and enzymes that modify the scaffold (tailoring enzymes). In four species with sufficient gene expression data, we identified 43 highly coexpressed clusters that contain signature and tailoring enzymes, of which eight were characterized previously to be functional pathways. Finally, we identified patterns of genome organization that implicate local gene duplication and, to a lesser extent, single gene transposition as having played roles in the evolution of plant metabolic gene clusters. PMID:28228535

  16. Investigation of extended Y chromosome STR haplotypes in Sardinia.

    PubMed

    Lacerenza, D; Aneli, S; Di Gaetano, C; Critelli, R; Piazza, A; Matullo, G; Culigioni, C; Robledo, R; Robino, C; Calò, C

    2017-03-01

    Y-chromosomal variation of selected single nucleotide polymorphisms (SNPs) and 32 short tandem repeat (STR) loci was evaluated in Sardinia in three open population groups (Northern Sardinia, n=40; Central Sardinia, n=56; Southern Sardinia, n=91) and three isolates (Desulo, n=34; Benetutti, n=45, Carloforte, n=42). The tested Y-STRs consisted of Yfiler ® Plus markers and the seven rapidly mutating (RM) loci not included in the YFiler ® Plus kit (DYF399S1, DYF403S1ab, DYF404S1, DYS526ab, DYS547, DYS612, and DYS626). As expected, inclusion of additional Y-STR loci increased haplotype diversity (h), though complete differentiation of male lineages was impossible even by means of RM Y-STRs (h=0.99997). Analysis of molecular variance indicated that the three open populations were fairly homogeneous, whereas signs of genetic heterogeneity could be detected when the three isolates were also included in the analysis. Multidimensional scaling analysis showed that, even for extended haplotypes including RM Y-STR markers, Sardinians were clearly differentiated from populations of the Italian peninsula and Sicily. The only exception was represented by the Carloforte sample that, in accordance with its peculiar population history, clustered with Northern/Central Italian populations. The introduction of extended forensic Y-STR panels, including highly variable RM Y-STR markers, is expected to reduce the impact of population structure on haplotype frequency estimations. However, our results show that the availability of geographically detailed reference databases is still important for the assessment of the evidential value of a Y-haplotype match. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  17. Conservation of gene linkage in dispersed vertebrate NK homeobox clusters.

    PubMed

    Wotton, Karl R; Weierud, Frida K; Juárez-Morales, José L; Alvares, Lúcia E; Dietrich, Susanne; Lewis, Katharine E

    2009-10-01

    Nk homeobox genes are important regulators of many different developmental processes including muscle, heart, central nervous system and sensory organ development. They are thought to have arisen as part of the ANTP megacluster, which also gave rise to Hox and ParaHox genes, and at least some NK genes remain tightly linked in all animals examined so far. The protostome-deuterostome ancestor probably contained a cluster of nine Nk genes: (Msx)-(Nk4/tinman)-(Nk3/bagpipe)-(Lbx/ladybird)-(Tlx/c15)-(Nk7)-(Nk6/hgtx)-(Nk1/slouch)-(Nk5/Hmx). Of these genes, only NKX2.6-NKX3.1, LBX1-TLX1 and LBX2-TLX2 remain tightly linked in humans. However, it is currently unclear whether this is unique to the human genome as we do not know which of these Nk genes are clustered in other vertebrates. This makes it difficult to assess whether the remaining linkages are due to selective pressures or because chance rearrangements have "missed" certain genes. In this paper, we identify all of the paralogs of these ancestrally clustered NK genes in several distinct vertebrates. We demonstrate that tight linkages of Lbx1-Tlx1, Lbx2-Tlx2 and Nkx3.1-Nkx2.6 have been widely maintained in both the ray-finned and lobe-finned fish lineages. Moreover, the recently duplicated Hmx2-Hmx3 genes are also tightly linked. Finally, we show that Lbx1-Tlx1 and Hmx2-Hmx3 are flanked by highly conserved noncoding elements, suggesting that shared regulatory regions may have resulted in evolutionary pressure to maintain these linkages. Consistent with this, these pairs of genes have overlapping expression domains. In contrast, Lbx2-Tlx2 and Nkx3.1-Nkx2.6, which do not seem to be coexpressed, are also not associated with conserved noncoding sequences, suggesting that an alternative mechanism may be responsible for the continued clustering of these genes.

  18. A Cluster of Cuticle Protein Genes of Drosophila Melanogaster at 65a: Sequence, Structure and Evolution

    PubMed Central

    Charles, J. P.; Chihara, C.; Nejad, S.; Riddiford, L. M.

    1997-01-01

    A 36-kb genomic DNA segment of the Drosophila melanogaster genome containing 12 clustered cuticle genes has been mapped and partially sequenced. The cluster maps at 65A 5-6 on the left arm of the third chromosome, in agreement with the previously determined location of a putative cluster encompassing the genes for the third instar larval cuticle proteins LCP5, LCP6 and LCP8. This cluster is the largest cuticle gene cluster discovered to date and shows a number of surprising features that explain in part the genetic complexity of the LCP5, LCP6 and LCP8 loci. The genes encoding LCP5 and LCP8 are multiple copy genes and the presence of extensive similarity in their coding regions gives the first evidence for gene conversion in cuticle genes. In addition, five genes in the cluster are intronless. Four of these five have arisen by retroposition. The other genes in the cluster have a single intron located at an unusual location for insect cuticle genes. PMID:9383064

  19. RTEL1 tagging SNPs and haplotypes were associated with glioma development.

    PubMed

    Li, Gang; Jin, Tianbo; Liang, Hongjuan; Zhang, Zhiguo; He, Shiming; Tu, Yanyang; Yang, Haixia; Geng, Tingting; Cui, Guangbin; Chen, Chao; Gao, Guodong

    2013-05-17

    As glioma ranks as the first most prevalent solid tumors in primary central nervous system, certain single-nucleotide polymorphisms (SNPs) may be related to increased glioma risk, and have implications in carcinogenesis. The present case-control study was carried out to elucidate how common variants contribute to glioma susceptibility. Ten candidate tagging SNPs (tSNPs) were selected from seven genes whose polymorphisms have been proven by classical literatures and reliable databases to be tended to relate with gliomas, and with the minor allele frequency (MAF)>5% in the HapMap Asian population. The selected tSNPs were genotyped in 629 glioma patients and 645 controls from a Han Chinese population using the multiplexed SNP MassEXTEND assay calibrated. Two significant tSNPs in RTEL1 gene were observed to be associated with glioma risk (rs6010620, P=0.0016, OR: 1.32, 95% CI: 1.11-1.56; rs2297440, P=0.001, OR: 1.33, 95% CI: 1.12-1.58) by χ2 test. It was identified the genotype "GG" of rs6010620 acted as the protective genotype for glioma (OR, 0.46; 95% CI, 0.31-0.7; P=0.0002), while the genotype "CC" of rs2297440 as the protective genotype in glioma (OR, 0.47; 95% CI, 0.31-0.71; P=0.0003). Furthermore, haplotype "GCT" in RTEL1 gene was found to be associated with risk of glioma (OR, 0.7; 95% CI, 0.57-0.86; Fisher's P=0.0005; Pearson's P=0.0005), and haplotype "ATT" was detected to be associated with risk of glioma (OR, 1.32; 95% CI, 1.12-1.57; Fisher's P=0.0013; Pearson's P=0.0013). Two single variants, the genotypes of "GG" of rs6010620 and "CC" of rs2297440 (rs6010620 and rs2297440) in the RTEL1 gene, together with two haplotypes of GCT and ATT, were identified to be associated with glioma development. And it might be used to evaluate the glioma development risks to screen the above RTEL1 tagging SNPs and haplotypes. The virtual slides for this article can be found here: http://www.diagnosticpathology.diagnomx.eu/vs/1993021136961998.

  20. Genetic Dissection of the Human Leukocyte Antigen Region by Use of Haplotypes of Tasmanians with Multiple Sclerosis

    PubMed Central

    Rubio, Justin P.; Bahlo, Melanie; Butzkueven, Helmut; van der Mei, Ingrid A. F.; Sale, Michèle M.; Dickinson, Joanne L.; Groom, Patricia; Johnson, Laura J.; Simmons, Rex D.; Tait, Brian; Varney, Mike; Taylor, Bruce; Dwyer, Terence; Williamson, Robert; Gough, Nicholas M.; Kilpatrick, Trevor J.; Speed, Terence P.; Foote, Simon J.

    2002-01-01

    Association of multiple sclerosis (MS) with the human leukocyte antigen (HLA) class II haplotype DRB1*1501-DQB1*0602 is the most consistently replicated finding of genetic studies of the disease. However, the high level of linkage disequilibrium (LD) in the HLA region has hindered the identification of other loci that single-marker tests for association are unlikely to resolve. In order to address this issue, we generated haplotypes spanning 14.754 Mb (5 cM) across the entire HLA region. The haplotypes, which were inferred by genotyping relatives of 152 patients with MS and 105 unaffected control subjects of Tasmanian ancestry, define a genomic segment from D6S276 to D6S291, including 13 microsatellite markers integrated with allele-typing data for DRB1 and DQB1. Association to the DRB1*1501-DQB1*0602 haplotype was replicated. In addition, we found that the class I/extended class I region, defined by a genomic segment of ∼400 kb between MOGCA and D6S265, harbors genes that independently increase risk of, or provide protection from, MS. Log-linear modeling analysis of constituent haplotypes that represent genomic regions containing class I (MOGCA-D6S265), class III (TNFa-TNFd-D6S273), and class II (DRB1-DQB1) genes indicated that having class I and class II susceptibility variants on the same haplotype provides an additive effect on risk. Moreover, we found no evidence for a disease locus in the class III region defined by a 150-kb genomic segment containing the TNF locus and 14 other genes. A global overview of LD performed using GOLD identified two discrete blocks of LD in the HLA region that correspond well with previous findings. We propose that the analysis of haplotypes, by use of the types of approaches outlined in the present article, should make it possible to more accurately define the contribution of the HLA to MS. PMID:11923913

  1. Association of KIR genotypes and haplotypes with susceptibility to chronic hepatitis B virus infection in Chinese Han population.

    PubMed

    Lu, Zhiming; Zhang, Bingchang; Chen, Shijun; Gai, Zhongtao; Feng, Zhaolei; Liu, Xiangdong; Liu, Yiqing; Wen, Xin; Li, Li; Jiao, Yulian; Ma, Chunyan; Shao, Song; Cui, Xiangfa; Chen, Guojian; Li, Jianfeng; Zhao, Yueran

    2008-12-01

    Killer immunoglobulin-like receptor (KIR) genes can regulate the activation of NK and T cells upon interaction with HLA class I molecules. Hepatitis B virus (HBV) infection has been regarded as a multi-factorial disorder disease. Previous studies revealed that KIRs were involved in HCV and HIV infection or clearance. The aim of this study was to explore the possibility of the inheritance of KIR genotypes and haplotypes as a candidate for susceptibility to persistent HBV infection or HBV clearance. The sequence specific primer polymerase chain reaction (SSP-PCR) was employed to identify the KIR genes and pseudogenes in 150 chronic hepatitis B (CHB) patients, 251 spontaneously recovered (SR) controls, and 412 healthy controls. The frequencies of genotype G, M, FZ1 increased in CHB patients compared with healthy control subjects. The frequency of genotype AH was higher in SR controls than that in both CHB patients and healthy controls. The carriage frequencies of genotype G and AH were higher; while, the frequencies of AF and AJ were lower in SR controls than those in healthy control subjects. The frequency of A haplotype was lower, whereas, the frequency of B haplotype was higher in CHB patients and SR controls than those in healthy controls. In healthy controls, haplotype 4 was found lower compared with that in CHB patients and SR controls and the frequency of haplotype 5 was higher in SR controls than that in other two groups. Based on these findings, it seems that the genotypes M and FZ1 are HBV susceptive genotypes; AH, on the other hand, may be protective genotypes that facilitate the clearance of HBV. It appears that the haplotype 4 is HBV susceptive haplotype, whereas, haplotype 5 may be the protective haplotype that facilitates the clearance of HBV.

  2. Genetic differentiation of eastern wolves in Algonquin Park despite bridging gene flow between coyotes and grey wolves.

    PubMed

    Rutledge, L Y; Garroway, C J; Loveless, K M; Patterson, B R

    2010-12-01

    Distinguishing genetically differentiated populations within hybrid zones and determining the mechanisms by which introgression occurs are crucial for setting effective conservation policy. Extensive hybridization among grey wolves (Canis lupus), eastern wolves (C. lycaon) and coyotes (C. latrans) in eastern North America has blurred species distinctions, creating a Canis hybrid swarm. Using complementary genetic markers, we tested the hypotheses that eastern wolves have acted as a conduit of sex-biased gene flow between grey wolves and coyotes, and that eastern wolves in Algonquin Provincial Park (APP) have differentiated following a history of introgression. Mitochondrial, Y chromosome and autosomal microsatellite genetic data provided genotypes for 217 canids from three geographic regions in Ontario, Canada: northeastern Ontario, APP and southern Ontario. Coyote mitochondrial DNA (mtDNA) haplotypes were common across regions but coyote-specific Y chromosome haplotypes were absent; grey wolf mtDNA was absent from southern regions, whereas grey wolf Y chromosome haplotypes were present in all three regions. Genetic structuring analyses revealed three distinct clusters within a genetic cline, suggesting some gene flow among species. In APP, however, 78.4% of all breeders and 11 of 15 known breeding pairs had assignment probability of Q0.8 to the Algonquin cluster, and the proportion of eastern wolf Y chromosome haplotypes in APP breeding males was higher than expected from random mating within the park (P<0.02). The data indicate that Algonquin wolves remain genetically distinct despite providing a sex-biased genetic bridge between coyotes and grey wolves. We speculate that ongoing hybridization within the park is limited by pre-mating reproductive barriers.

  3. Modularity of Plant Metabolic Gene Clusters: A Trio of Linked Genes That Are Collectively Required for Acylation of Triterpenes in Oat[W][OA

    PubMed Central

    Mugford, Sam T.; Louveau, Thomas; Melton, Rachel; Qi, Xiaoquan; Bakht, Saleha; Hill, Lionel; Tsurushima, Tetsu; Honkanen, Suvi; Rosser, Susan J.; Lomonossoff, George P.; Osbourn, Anne

    2013-01-01

    Operon-like gene clusters are an emerging phenomenon in the field of plant natural products. The genes encoding some of the best-characterized plant secondary metabolite biosynthetic pathways are scattered across plant genomes. However, an increasing number of gene clusters encoding the synthesis of diverse natural products have recently been reported in plant genomes. These clusters have arisen through the neo-functionalization and relocation of existing genes within the genome, and not by horizontal gene transfer from microbes. The reasons for clustering are not yet clear, although this form of gene organization is likely to facilitate co-inheritance and co-regulation. Oats (Avena spp) synthesize antimicrobial triterpenoids (avenacins) that provide protection against disease. The synthesis of these compounds is encoded by a gene cluster. Here we show that a module of three adjacent genes within the wider biosynthetic gene cluster is required for avenacin acylation. Through the characterization of these genes and their encoded proteins we present a model of the subcellular organization of triterpenoid biosynthesis. PMID:23532069

  4. Occurrence of 15 Haplotypes of Linepithema micans (Hymenoptera: Formicidae) in Southern Brazil.

    PubMed

    Ramalho, Manuela Oliveira; Martins, C; Campos, T; Nondillo, A; Botton, M; Bueno, O C

    2017-08-01

    The ant genus Linepithema is widely known, thanks to the pest species Linepithema humile (Mayr), which is easily mistaken for Linepithema micans (Forel) due to their morphological similarity. Like L. humile, L. micans is associated to the main grapevine pest in Brazil, Eurhizococcus brasiliensis (Wille), also known as ground pearl. Therefore, the present study uses mtDNA fragments to expand the knowledge of haplotype diversity and distribution of L. micans in the state of Rio Grande do Sul (Brazil), to understand the genetic differences of the populations identified in this study. We identified 15 haplotypes of L. micans spread across different localities. Twelve of these haplotypes were new for the species. The high haplotype diversity uncovered in Rio Grande do Sul (Brazil) for this species was predictable, as L. micans is in its native environment. Additional studies that take gene flow into account may reveal interesting aspects of diversity in these populations. © The Authors 2017. Published by Oxford University Press on behalf of Entomological Society of America. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  5. SNP analyses of growth factor genes EGF, TGF{beta}-1, and HGF reveal haplotypic association of EGF with autism

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Toyoda, Takao; Thanseem, Ismail; Kawai, Masayoshi

    Autism is a pervasive neurodevelopmental disorder diagnosed in early childhood. Growth factors have been found to play a key role in the cellular differentiation and proliferation of the central and peripheral nervous systems. Epidermal growth factor (EGF) is detected in several regions of the developing and adult brain, where, it enhances the differentiation, maturation, and survival of a variety of neurons. Transforming growth factor-{beta} (TGF{beta}) isoforms play an important role in neuronal survival, and the hepatocyte growth factor (HGF) has been shown to exhibit neurotrophic activity. We examined the association of EGF, TGF{beta}1, and HGF genes with autism, in amore » trio association study, using DNA samples from families recruited to the Autism Genetic Resource Exchange; 252 trios with a male offspring scored for autism were selected for the study. Transmission disequilibrium test revealed significant haplotypic association of EGF with autism. No significant SNP or haplotypic associations were observed for TGF{beta}1 or HGF. Given the role of EGF in brain and neuronal development, we suggest a possible role of EGF in the pathogenesis of autism.« less

  6. Accurate prediction of secondary metabolite gene clusters in filamentous fungi.

    PubMed

    Andersen, Mikael R; Nielsen, Jakob B; Klitgaard, Andreas; Petersen, Lene M; Zachariasen, Mia; Hansen, Tilde J; Blicher, Lene H; Gotfredsen, Charlotte H; Larsen, Thomas O; Nielsen, Kristian F; Mortensen, Uffe H

    2013-01-02

    Biosynthetic pathways of secondary metabolites from fungi are currently subject to an intense effort to elucidate the genetic basis for these compounds due to their large potential within pharmaceutics and synthetic biochemistry. The preferred method is methodical gene deletions to identify supporting enzymes for key synthases one cluster at a time. In this study, we design and apply a DNA expression array for Aspergillus nidulans in combination with legacy data to form a comprehensive gene expression compendium. We apply a guilt-by-association-based analysis to predict the extent of the biosynthetic clusters for the 58 synthases active in our set of experimental conditions. A comparison with legacy data shows the method to be accurate in 13 of 16 known clusters and nearly accurate for the remaining 3 clusters. Furthermore, we apply a data clustering approach, which identifies cross-chemistry between physically separate gene clusters (superclusters), and validate this both with legacy data and experimentally by prediction and verification of a supercluster consisting of the synthase AN1242 and the prenyltransferase AN11080, as well as identification of the product compound nidulanin A. We have used A. nidulans for our method development and validation due to the wealth of available biochemical data, but the method can be applied to any fungus with a sequenced and assembled genome, thus supporting further secondary metabolite pathway elucidation in the fungal kingdom.

  7. Integrating Data Clustering and Visualization for the Analysis of 3D Gene Expression Data

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Data Analysis and Visualization; nternational Research Training Group ``Visualization of Large and Unstructured Data Sets,'' University of Kaiserslautern, Germany; Computational Research Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA

    2008-05-12

    The recent development of methods for extracting precise measurements of spatial gene expression patterns from three-dimensional (3D) image data opens the way for new analyses of the complex gene regulatory networks controlling animal development. We present an integrated visualization and analysis framework that supports user-guided data clustering to aid exploration of these new complex datasets. The interplay of data visualization and clustering-based data classification leads to improved visualization and enables a more detailed analysis than previously possible. We discuss (i) integration of data clustering and visualization into one framework; (ii) application of data clustering to 3D gene expression data; (iii)more » evaluation of the number of clusters k in the context of 3D gene expression clustering; and (iv) improvement of overall analysis quality via dedicated post-processing of clustering results based on visualization. We discuss the use of this framework to objectively define spatial pattern boundaries and temporal profiles of genes and to analyze how mRNA patterns are controlled by their regulatory transcription factors.« less

  8. Single Marker and Haplotype-Based Association Analysis of Semolina and Pasta Colour in Elite Durum Wheat Breeding Lines Using a High-Density Consensus Map.

    PubMed

    N'Diaye, Amidou; Haile, Jemanesh K; Cory, Aron T; Clarke, Fran R; Clarke, John M; Knox, Ron E; Pozniak, Curtis J

    2017-01-01

    Association mapping is usually performed by testing the correlation between a single marker and phenotypes. However, because patterns of variation within genomes are inherited as blocks, clustering markers into haplotypes for genome-wide scans could be a worthwhile approach to improve statistical power to detect associations. The availability of high-density molecular data allows the possibility to assess the potential of both approaches to identify marker-trait associations in durum wheat. In the present study, we used single marker- and haplotype-based approaches to identify loci associated with semolina and pasta colour in durum wheat, the main objective being to evaluate the potential benefits of haplotype-based analysis for identifying quantitative trait loci. One hundred sixty-nine durum lines were genotyped using the Illumina 90K Infinium iSelect assay, and 12,234 polymorphic single nucleotide polymorphism (SNP) markers were generated and used to assess the population structure and the linkage disequilibrium (LD) patterns. A total of 8,581 SNPs previously localized to a high-density consensus map were clustered into 406 haplotype blocks based on the average LD distance of 5.3 cM. Combining multiple SNPs into haplotype blocks increased the average polymorphism information content (PIC) from 0.27 per SNP to 0.50 per haplotype. The haplotype-based analysis identified 12 loci associated with grain pigment colour traits, including the five loci identified by the single marker-based analysis. Furthermore, the haplotype-based analysis resulted in an increase of the phenotypic variance explained (50.4% on average) and the allelic effect (33.7% on average) when compared to single marker analysis. The presence of multiple allelic combinations within each haplotype locus offers potential for screening the most favorable haplotype series and may facilitate marker-assisted selection of grain pigment colour in durum wheat. These results suggest a benefit of haplotype

  9. Single Marker and Haplotype-Based Association Analysis of Semolina and Pasta Colour in Elite Durum Wheat Breeding Lines Using a High-Density Consensus Map

    PubMed Central

    Haile, Jemanesh K.; Cory, Aron T.; Clarke, Fran R.; Clarke, John M.; Knox, Ron E.; Pozniak, Curtis J.

    2017-01-01

    Association mapping is usually performed by testing the correlation between a single marker and phenotypes. However, because patterns of variation within genomes are inherited as blocks, clustering markers into haplotypes for genome-wide scans could be a worthwhile approach to improve statistical power to detect associations. The availability of high-density molecular data allows the possibility to assess the potential of both approaches to identify marker-trait associations in durum wheat. In the present study, we used single marker- and haplotype-based approaches to identify loci associated with semolina and pasta colour in durum wheat, the main objective being to evaluate the potential benefits of haplotype-based analysis for identifying quantitative trait loci. One hundred sixty-nine durum lines were genotyped using the Illumina 90K Infinium iSelect assay, and 12,234 polymorphic single nucleotide polymorphism (SNP) markers were generated and used to assess the population structure and the linkage disequilibrium (LD) patterns. A total of 8,581 SNPs previously localized to a high-density consensus map were clustered into 406 haplotype blocks based on the average LD distance of 5.3 cM. Combining multiple SNPs into haplotype blocks increased the average polymorphism information content (PIC) from 0.27 per SNP to 0.50 per haplotype. The haplotype-based analysis identified 12 loci associated with grain pigment colour traits, including the five loci identified by the single marker-based analysis. Furthermore, the haplotype-based analysis resulted in an increase of the phenotypic variance explained (50.4% on average) and the allelic effect (33.7% on average) when compared to single marker analysis. The presence of multiple allelic combinations within each haplotype locus offers potential for screening the most favorable haplotype series and may facilitate marker-assisted selection of grain pigment colour in durum wheat. These results suggest a benefit of haplotype

  10. Variation in the fumonisin biosynthetic gene cluster in fumonisin-producing and nonproducing black aspergilli.

    PubMed

    Susca, Antonia; Proctor, Robert H; Butchko, Robert A E; Haidukowski, Miriam; Stea, Gaetano; Logrieco, Antonio; Moretti, Antonio

    2014-12-01

    The ability to produce fumonisin mycotoxins varies among members of the black aspergilli. Previously, analyses of selected genes in the fumonisin biosynthetic gene (fum) cluster in black aspergilli from California grapes indicated that fumonisin-nonproducing isolates of Aspergillus welwitschiae lack six fum genes, but nonproducing isolates of Aspergillus niger do not. In the current study, analyses of black aspergilli from grapes from the Mediterranean Basin indicate that the genomic context of the fum cluster is the same in isolates of A. niger and A. welwitschiae regardless of fumonisin-production ability and that full-length clusters occur in producing isolates of both species and nonproducing isolates of A. niger. In contrast, the cluster has undergone an eight-gene deletion in fumonisin-nonproducing isolates of A. welwitschiae. Phylogenetic analyses suggest each species consists of a mixed population of fumonisin-producing and nonproducing individuals, and that existence of both production phenotypes may provide a selective advantage to these species. Differences in gene content of fum cluster homologues and phylogenetic relationships of fum genes suggest that the mutation(s) responsible for the nonproduction phenotype differs, and therefore arose independently, in the two species. Partial fum cluster homologues were also identified in genome sequences of four other black Aspergillus species. Gene content of these partial clusters and phylogenetic relationships of fum sequences indicate that non-random partial deletion of the cluster has occurred multiple times among the species. This in turn suggests that an intact cluster and fumonisin production were once more widespread among black aspergilli. Copyright © 2014 Elsevier Inc. All rights reserved.

  11. Identification of specific angiotensin-converting enzyme variants and haplotypes that confer risk and protection against type 2 diabetic nephropathy.

    PubMed

    Ezzidi, Intissar; Mtiraoui, Nabil; Kacem, Maha; Chaieb, Molka; Mahjoub, Touhami; Almawi, Wassim Y

    2009-11-01

    Cross-sectional and family studies identified angiotensin-converting enzyme (ACE) gene as a risk factor for diabetic nephropathy (DN). The contribution of ACE gene variants to DN development and progression is controversial and varies among different ethnic/racial groups. We investigated the association of three ACE gene variants with DN, rs1799752 insertion/deletion (I/D), rs1800764T/C and rs12449782A/G in 917 Tunisian type 2 diabetic (T2DM) patients: 515 with (DN) and 402 without (DWN) nephropathy. ACE genotyping was done by PCR-based assays; haplotype estimation was performed using H-Plus software (chi(2)-test based). Genotype frequency distributions of the three studied variants were in Hardy-Weinberg equilibrium. Minor allele frequency of rs1800764 was higher in DN patients than DWN patients or healthy controls, and minor allele frequency of rs1799752 was higher in DN than DWN patients. Higher frequency of rs1799752 and rs1800764 homozygous mutant genotypes was seen in DN compared to DWN patients. Of the three variants, only rs1799752 deletion/deletion (D/D) genotype was associated with a significant increase in albumin to creatinine ratios levels, and D/D carriers had elevated low-density lipoprotein, total cholesterol and urea. Three locus haplotype [rs1799752(I/D)/rs1800764(T/C)/rs12449782(A/G)] analysis revealed that the frequency of DCG haplotype was higher, while that of ITG and ICA haplotypes were lower among unselected type 2 diabetic patients. Taking ITA haplotype as reference, multivariate regression analysis confirmed the negative (ITG), and positive (DCG, DTG, DCA and DTA) association of specific ACE haplotypes with DN, after adjusting for potential nephropathy-linked covariates. Our results support the involvement of specific ACE variants in DN pathogenesis and demonstrate the presence of DN-specific haplotypes at the ACE locus.

  12. Relative extended haplotype homozygosity signals across breeds reveal dairy and beef specific signatures of selection.

    PubMed

    Bomba, Lorenzo; Nicolazzi, Ezequiel L; Milanesi, Marco; Negrini, Riccardo; Mancini, Giordano; Biscarini, Filippo; Stella, Alessandra; Valentini, Alessio; Ajmone-Marsan, Paolo

    2015-04-02

    A number of methods are available to scan a genome for selection signatures by evaluating patterns of diversity within and between breeds. Among these, "extended haplotype homozygosity" (EHH) is a reliable approach to detect genome regions under recent selective pressure. The objective of this study was to use this approach to identify regions that are under recent positive selection and shared by the most representative Italian dairy and beef cattle breeds. A total of 3220 animals from Italian Holstein (2179), Italian Brown (775), Simmental (493), Marchigiana (485) and Piedmontese (379) breeds were genotyped with the Illumina BovineSNP50 BeadChip v.1. After standard quality control procedures, genotypes were phased and core haplotypes were identified. The decay of linkage disequilibrium (LD) for each core haplotype was assessed by measuring the EHH. Since accurate estimates of local recombination rates were not available, relative EHH (rEHH) was calculated for each core haplotype. Genomic regions that carry frequent core haplotypes and with significant rEHH values were considered as candidates for recent positive selection. Candidate regions were aligned across to identify signals shared by dairy or beef cattle breeds. Overall, 82 and 87 common regions were detected among dairy and beef cattle breeds, respectively. Bioinformatic analysis identified 244 and 232 genes in these common genomic regions. Gene annotation and pathway analysis showed that these genes are involved in molecular functions that are biologically related to milk or meat production. Our results suggest that a multi-breed approach can lead to the identification of genomic signatures in breeds of cattle that are selected for the same production goal and thus to the localisation of genomic regions of interest in dairy and beef production.

  13. Wide distribution of O157-antigen biosynthesis gene clusters in Escherichia coli.

    PubMed

    Iguchi, Atsushi; Shirai, Hiroki; Seto, Kazuko; Ooka, Tadasuke; Ogura, Yoshitoshi; Hayashi, Tetsuya; Osawa, Kayo; Osawa, Ro

    2011-01-01

    Most Escherichia coli O157-serogroup strains are classified as enterohemorrhagic E. coli (EHEC), which is known as an important food-borne pathogen for humans. They usually produce Shiga toxin (Stx) 1 and/or Stx2, and express H7-flagella antigen (or nonmotile). However, O157 strains that do not produce Stxs and express H antigens different from H7 are sometimes isolated from clinical and other sources. Multilocus sequence analysis revealed that these 21 O157:non-H7 strains tested in this study belong to multiple evolutionary lineages different from that of EHEC O157:H7 strains, suggesting a wide distribution of the gene set encoding the O157-antigen biosynthesis in multiple lineages. To gain insight into the gene organization and the sequence similarity of the O157-antigen biosynthesis gene clusters, we conducted genomic comparisons of the chromosomal regions (about 59 kb in each strain) covering the O-antigen gene cluster and its flanking regions between six O157:H7/non-H7 strains. Gene organization of the O157-antigen gene cluster was identical among O157:H7/non-H7 strains, but was divided into two distinct types at the nucleotide sequence level. Interestingly, distribution of the two types did not clearly follow the evolutionary lineages of the strains, suggesting that horizontal gene transfer of both types of O157-antigen gene clusters has occurred independently among E. coli strains. Additionally, detailed sequence comparison revealed that some positions of the repetitive extragenic palindromic (REP) sequences in the regions flanking the O-antigen gene clusters were coincident with possible recombination points. From these results, we conclude that the horizontal transfer of the O157-antigen gene clusters induced the emergence of multiple O157 lineages within E. coli and speculate that REP sequences may involve one of the driving forces for exchange and evolution of O-antigen loci.

  14. Delineation of metabolic gene clusters in plant genomes by chromatin signatures

    PubMed Central

    Yu, Nan; Nützmann, Hans-Wilhelm; MacDonald, James T.; Moore, Ben; Field, Ben; Berriri, Souha; Trick, Martin; Rosser, Susan J.; Kumar, S. Vinod; Freemont, Paul S.; Osbourn, Anne

    2016-01-01

    Plants are a tremendous source of diverse chemicals, including many natural product-derived drugs. It has recently become apparent that the genes for the biosynthesis of numerous different types of plant natural products are organized as metabolic gene clusters, thereby unveiling a highly unusual form of plant genome architecture and offering novel avenues for discovery and exploitation of plant specialized metabolism. Here we show that these clustered pathways are characterized by distinct chromatin signatures of histone 3 lysine trimethylation (H3K27me3) and histone 2 variant H2A.Z, associated with cluster repression and activation, respectively, and represent discrete windows of co-regulation in the genome. We further demonstrate that knowledge of these chromatin signatures along with chromatin mutants can be used to mine genomes for cluster discovery. The roles of H3K27me3 and H2A.Z in repression and activation of single genes in plants are well known. However, our discovery of highly localized operon-like co-regulated regions of chromatin modification is unprecedented in plants. Our findings raise intriguing parallels with groups of physically linked multi-gene complexes in animals and with clustered pathways for specialized metabolism in filamentous fungi. PMID:26895889

  15. A conserved gene cluster as a putative functional unit in insect innate immunity.

    PubMed

    Somogyi, Kálmán; Sipos, Botond; Pénzes, Zsolt; Andó, István

    2010-11-05

    The Nimrod gene superfamily is an important component of the innate immune response. The majority of its member genes are located in close proximity within the Drosophila melanogaster genome and they lie in a larger conserved cluster ("Nimrod cluster"), made up of non-related groups (families, superfamilies) of genes. This cluster has been a part of the Arthropod genomes for about 300-350 million years. The available data suggest that the Nimrod cluster is a functional module of the insect innate immune response. Copyright © 2010 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.

  16. Characterization of swine leukocyte antigen alleles and haplotypes on a novel miniature pig line, Microminipig.

    PubMed

    Ando, A; Imaeda, N; Ohshima, S; Miyamoto, A; Kaneko, N; Takasu, M; Shiina, T; Kulski, J K; Inoko, H; Kitagawa, H

    2014-12-01

    Microminipigs are extremely small-sized, novel miniature pigs that were recently developed for medical research. The inbred Microminipigs with defined swine leukocyte antigen (SLA) haplotypes are expected to be useful for allo- and xenotransplantation studies and also for association analyses between SLA haplotypes and immunological traits. To establish SLA-defined Microminipig lines, we characterized the polymorphic SLA alleles for three class I (SLA-1, SLA-2 and SLA-3) and two class II (SLA-DRB1 and SLA-DQB1) genes of 14 parental Microminipigs using a high-resolution nucleotide sequence-based typing method. Eleven class I and II haplotypes, including three recombinant haplotypes, were found in the offspring of the parental Microminipigs. Two class I and class II haplotypes, Hp-31.0 (SLA-1*1502-SLA-3*070102-SLA-2*1601) and Hp-0.37 (SLA-DRB1*0701-SLA-DQB1*0502), are novel and have not so far been reported in other pig breeds. Crossover regions were defined by the analysis of 22 microsatellite markers within the SLA class III region of three recombinant haplotypes. The SLA allele and haplotype information of Microminipigs in this study will be useful to establish SLA homozygous lines including three recombinants for transplantation and immunological studies. © 2014 Stichting International Foundation for Animal Genetics.

  17. Clusters of Antibiotic Resistance Genes Enriched Together Stay Together in Swine Agriculture

    PubMed Central

    Johnson, Timothy A.; Stedtfeld, Robert D.; Wang, Qiong; Cole, James R.; Hashsham, Syed A.; Looft, Torey; Zhu, Yong-Guan

    2016-01-01

    ABSTRACT   Antibiotic resistance is a worldwide health risk, but the influence of animal agriculture on the genetic context and enrichment of individual antibiotic resistance alleles remains unclear. Using quantitative PCR followed by amplicon sequencing, we quantified and sequenced 44 genes related to antibiotic resistance, mobile genetic elements, and bacterial phylogeny in microbiomes from U.S. laboratory swine and from swine farms from three Chinese regions. We identified highly abundant resistance clusters: groups of resistance and mobile genetic element alleles that cooccur. For example, the abundance of genes conferring resistance to six classes of antibiotics together with class 1 integrase and the abundance of IS6100-type transposons in three Chinese regions are directly correlated. These resistance cluster genes likely colocalize in microbial genomes in the farms. Resistance cluster alleles were dramatically enriched (up to 1 to 10% as abundant as 16S rRNA) and indicate that multidrug-resistant bacteria are likely the norm rather than an exception in these communities. This enrichment largely occurred independently of phylogenetic composition; thus, resistance clusters are likely present in many bacterial taxa. Furthermore, resistance clusters contain resistance genes that confer resistance to antibiotics independently of their particular use on the farms. Selection for these clusters is likely due to the use of only a subset of the broad range of chemicals to which the clusters confer resistance. The scale of animal agriculture and its wastes, the enrichment and horizontal gene transfer potential of the clusters, and the vicinity of large human populations suggest that managing this resistance reservoir is important for minimizing human risk. PMID:27073098

  18. Single-molecule dilution and multiple displacement amplification for molecular haplotyping.

    PubMed

    Paul, Philip; Apgar, Josh

    2005-04-01

    Separate haploid analysis is frequently required for heterozygous genotyping to resolve phase ambiguity or confirm allelic sequence. We demonstrate a technique of single-molecule dilution followed by multiple strand displacement amplification to haplotype polymorphic alleles. Dilution of DNA to haploid equivalency, or a single molecule, is a simple method for separating di-allelic DNA. Strand displacement amplification is a robust method for non-specific DNA expansion that employs random hexamers and phage polymerase Phi29 for double-stranded DNA displacement and primer extension, resulting in high processivity and exceptional product length. Single-molecule dilution was followed by strand displacement amplification to expand separated alleles to microgram quantities of DNA for more efficient haplotype analysis of heterozygous genes.

  19. Haplotype-based identification of a microsomal transfer protein marker associated with the human lifespan

    PubMed Central

    Geesaman, Bard J.; Benson, Erica; Brewster, Stephanie J.; Kunkel, Louis M.; Blanché, Hélène; Thomas, Gilles; Perls, Thomas T.; Daly, Mark J.; Puca, Annibale A.

    2003-01-01

    We previously reported a genomewide linkage study for human longevity using 308 long-lived individuals (LLI) (centenarians or near-centenarians) in 137 sibships and identified statistically significant linkage within chromosome 4 near microsatellite D4S1564. This interval spans 12 million bp and contains ≈50 putative genes. To identify the specific gene and gene variants impacting lifespan, we performed a haplotype-based fine-mapping study of the interval. The resulting genetic association study identified a haplotype marker within microsomal transfer protein as a modifier of human lifespan. This same variant was tested in a second cohort of LLI from France, and although the association was not replicated, there was evidence for statistical distortion in the form of Hardy–Weinberg disequilibrium. Microsomal transfer protein has been identified as the rate-limiting step in lipoprotein synthesis and may affect longevity by subtly modulating this pathway. This study provides proof of concept for the feasibility of using the genomes of LLI to identify genes impacting longevity. PMID:14615589

  20. Congruence as a measurement of extended haplotype structure across the genome

    PubMed Central

    2012-01-01

    Background Historically, extended haplotypes have been defined using only a few data points, such as alleles for several HLA genes in the MHC. High-density SNP data, and the increasing affordability of whole genome SNP typing, creates the opportunity to define higher resolution extended haplotypes. This drives the need for new tools that support quantification and visualization of extended haplotypes as defined by as many as 2000 SNPs. Confronted with high-density SNP data across the major histocompatibility complex (MHC) for 2,300 complete families, compiled by the Type 1 Diabetes Genetics Consortium (T1DGC), we developed software for studying extended haplotypes. Methods The software, called ExHap (Extended Haplotype), uses a similarity measurement we term congruence to identify and quantify long-range allele identity. Using ExHap, we analyzed congruence in both the T1DGC data and family-phased data from the International HapMap Project. Results Congruent chromosomes from the T1DGC data have between 96.5% and 99.9% allele identity over 1,818 SNPs spanning 2.64 megabases of the MHC (HLA-DRB1 to HLA-A). Thirty-three of 132 DQ-DR-B-A defined haplotype groups have > 50% congruent chromosomes in this region. For example, 92% of chromosomes within the DR3-B8-A1 haplotype are congruent from HLA-DRB1 to HLA-A (99.8% allele identity). We also applied ExHap to all 22 autosomes for both CEU and YRI cohorts from the International HapMap Project, identifying multiple candidate extended haplotypes. Conclusions Long-range congruence is not unique to the MHC region. Patterns of allele identity on phased chromosomes provide a simple, straightforward approach to visually and quantitatively inspect complex long-range structural patterns in the genome. Such patterns aid the biologist in appreciating genetic similarities and differences across cohorts, and can lead to hypothesis generation for subsequent studies. PMID:22369243

  1. Genetic variation in heat shock protein 70 is associated with septic shock: narrowing the association to a specific haplotype.

    PubMed

    Kee, C; Cheong, K Y; Pham, K; Waterer, G W; Temple, S E L

    2008-12-01

    Heat shock protein 70 (HSP70) plays a major role in immune responses. Polymorphisms within the gene have been associated with development of septic shock. This study refines the region of the HSP70 gene associated with development of septic shock and confirms its functionality. Subjects (n = 31) were grouped into one of three haplotypes based on their HSPA1B-179C>T and HSPA1B1267A>G genotypes. Mononuclear cells from these subjects were stimulated with heat-killed bacteria (10(7 )colony-forming units/mL Escherichia coli or Streptococcus pneumoniae) for 8 and 21 h. HSP70 and tumour necrosis factor (TNF) mRNA and protein levels were measured by reverse transcriptase-polymerase chain reaction and ELISA, respectively. The HSPA1B-179*C:1267*A haplotype was associated with significantly lower levels of HSPA1B mRNA and protein and higher production of TNF mRNA and protein compared to the other haplotypes. Induction of HSP70 was TNF independent. These results suggest that the HSPA1B-179C>T:1267A>G haplotype is functional and may explain the association of the HSP70 gene with development of septic shock.

  2. Spatial and temporal distribution of the neutral polymorphisms in the last ZFX intron: analysis of the haplotype structure and genealogy.

    PubMed Central

    Jaruzelska, J; Zietkiewicz, E; Batzer, M; Cole, D E; Moisan, J P; Scozzari, R; Tavaré, S; Labuda, D

    1999-01-01

    With 10 segregating sites (simple nucleotide polymorphisms) in the last intron (1089 bp) of the ZFX gene we have observed 11 haplotypes in 336 chromosomes representing a worldwide array of 15 human populations. Two haplotypes representing 77% of all chromosomes were distributed almost evenly among four continents. Five of the remaining haplotypes were detected in Africa and 4 others were restricted to Eurasia and the Americas. Using the information about the ancestral state of the segregating positions (inferred from human-great ape comparisons), we applied coalescent analysis to estimate the age of the polymorphisms and the resulting haplotypes. The oldest haplotype, with the ancestral alleles at all the sites, was observed at low frequency only in two groups of African origin. Its estimated age of 740 to 1100 kyr corresponded to the time to the most recent common ancestor. The two most frequent worldwide distributed haplotypes were estimated at 550 to 840 and 260 to 400 kyr, respectively, while the age of the continentally restricted polymorphisms was 120 to 180 kyr and smaller. Comparison of spatial and temporal distribution of the ZFX haplotypes suggests that modern humans diverged from the common ancestral stock in the Middle Paleolithic era. Subsequent range expansion prevented substantial gene flow among continents, separating African groups from populations that colonized Eurasia and the New World. PMID:10388827

  3. Spatial and temporal distribution of the neutral polymorphisms in the last ZFX intron: analysis of the haplotype structure and genealogy.

    PubMed

    Jaruzelska, J; Zietkiewicz, E; Batzer, M; Cole, D E; Moisan, J P; Scozzari, R; Tavaré, S; Labuda, D

    1999-07-01

    With 10 segregating sites (simple nucleotide polymorphisms) in the last intron (1089 bp) of the ZFX gene we have observed 11 haplotypes in 336 chromosomes representing a worldwide array of 15 human populations. Two haplotypes representing 77% of all chromosomes were distributed almost evenly among four continents. Five of the remaining haplotypes were detected in Africa and 4 others were restricted to Eurasia and the Americas. Using the information about the ancestral state of the segregating positions (inferred from human-great ape comparisons), we applied coalescent analysis to estimate the age of the polymorphisms and the resulting haplotypes. The oldest haplotype, with the ancestral alleles at all the sites, was observed at low frequency only in two groups of African origin. Its estimated age of 740 to 1100 kyr corresponded to the time to the most recent common ancestor. The two most frequent worldwide distributed haplotypes were estimated at 550 to 840 and 260 to 400 kyr, respectively, while the age of the continentally restricted polymorphisms was 120 to 180 kyr and smaller. Comparison of spatial and temporal distribution of the ZFX haplotypes suggests that modern humans diverged from the common ancestral stock in the Middle Paleolithic era. Subsequent range expansion prevented substantial gene flow among continents, separating African groups from populations that colonized Eurasia and the New World.

  4. Genetic diversity and geographical structure of the pitcher plant Nepenthes vieillardii in New Caledonia: A chloroplast DNA haplotype analysis.

    PubMed

    Kurata, Kaoruko; Jaffré, Tanguy; Setoguchi, Hiroaki

    2008-12-01

    Among the many species that grow in New Caledonia, the pitcher plant Nepenthes vieillardii (Nepenthaceae) has a high degree of morphological variation. In this study, we present the patterns of genetic differentiation of pitcher plant populations based on chloroplast DNA haplotype analysis using the sequences of five spacers. We analyzed 294 samples from 16 populations covering the entire range of the species, using 4660 bp of sequence. Our analysis identified 17 haplotypes, including one that is widely distributed across the islands, as well as regional and private haplotypes. The greatest haplotype diversity was detected on the eastern coast of the largest island and included several private haplotypes, while haplotype diversity was low in the southern plains region. The parsimony network analysis of the 17 haplotypes suggested that the genetic divergence is the result of long-term isolation of individual populations. Results from a spatial analysis of molecular variance and a cluster analysis suggest that the plants once covered the entire serpentine area of New Caledonia and that subsequent regional fragmentation resulted in the isolation of each population and significantly restricted seed flow. This isolation may have been an important factor in the development of the morphological and genetic variation among pitcher plants in New Caledonia.

  5. Association of HLA haplotype with alopecia areata in Chinese Hans.

    PubMed

    Xiao, F-L; Ye, D-Q; Yang, S; Zhou, F-S; Zhou, S-M; Zhu, Y-G; Liang, Y-H; Ren, Y-Q; Zhang, X-J

    2006-11-01

    Some studies have shown discrepancies in human leucocyte antigen (HLA) associated with alopecia areata (AA) between different ethnic populations. To investigate whether HLA-I, -DQA1 and -DQB1 alleles and the HLA haplotype are associated with AA, and the correlation between the HLA haplotype profile, age of onset and severity of AA in Chinese Hans. The polymerase chain reaction-sequence specific primer (PCR-SSP) method was used to analyse the frequencies of HLA class I, -DQA1 and -DQB1 alleles in 192 patients with AA and 252 controls in Chinese Hans. The linkage disequilibrium was calculated using the 2 x 2 table. The 24 two-locus haplotypes [including A*02-B*18, A*02-B*27, A*02-B*52, A*02-Cw*0704, A*02-DQA1*0104, A*02-DQB1*0604, A*02-DQB1*0606, B*18-Cw*0704, B*18-DQA1*0104, B*18-DQA1*0302, B*18-DQB1*0606, B*27-Cw*0704, B*27-DQA1*0104, B*27-DQA1*0302, B*52-Cw*0704, B*52-DQA1*0104, B*52-DQA1*0302, B52-DQB1*0606, Cw*0704-DQA1*0104, Cw*0704-DQA1*0302, Cw*0704-DQB1*0606, DQA1*0104-DQB1*0604, DQA1*0104-DQB1*0606, DQA1*0302-DQB1*0606 (P<0.05)] were associated with AA, while eight extended haplotypes (A*02-B*18-DQA1*0104, A*02-B*27-DQA1*0104, A*02-B*52-DQA1*0104, A*02-B*52-DQA1*0302, A*02-B*52-DQB1*0606, B*52-Cw*0704-DQA1*0104, B*52-Cw*0704-DQA1*0302, A*02-B*52-DQA1*0302-DQB1*0606) were found to be related to AA in Chinese Hans. Through stratified analysis, we found that the extended haplotype B*52-Cw*0704-DQA1*0302 was related to early onset of AA, and no haplotype was only associated with severe AA. This is the first detailed report to elucidate HLA haplotypes associated with AA and that demonstrates the significant HLA haplotypes in Chinese Hans AA. The haplotype B*52-Cw*0704-DQA1*0302 was identified to be related to early onset of AA. Our results provide some information for future research on predisposing genes in HLA regions in Chinese Hans.

  6. [Genetic ecological monitoring in human populations: heterozygosity, mtDNA haplotype variation, and genetic load].

    PubMed

    Balanovskiĭ, O P; Koshel', S M; Zaporozhchenko, V V; Pshenichnov, A S; Frolova, S A; Kuznetsova, M A; Baranova, E E; Teuchezh, I E; Kuznetsova, A A; Romashkina, M V; Utevskaia, O M; Churnosov, M I; Villems, R; Balanovskaia, E V

    2011-11-01

    Yu. P. Altukhov suggested that heterozygosity is an indicator of the state of the gene pool. The idea and a linked concept of genetic ecological monitoring were applied to a new dataset on mtDNA variation in East European ethnic groups. Haplotype diversity (an analog of the average heterozygosity) was shown to gradually decrease northwards. Since a similar trend is known for population density, interlinked changes were assumed for a set of parameters, which were ordered to form a causative chain: latitude increases, land productivity decreases, population density decreases, effective population size decreases, isolation of subpopulations increases, genetic drift increases, and mtDNA haplotype diversity decreases. An increase in genetic drift increases the random inbreeding rate and, consequently, the genetic load. This was confirmed by a significant correlation observed between the incidence of autosomal recessive hereditary diseases and mtDNA haplotype diversity. Based on the findings, mtDNA was assumed to provide an informative genetic system for genetic ecological monitoring; e.g., analyzing the ecology-driven changes in the gene pool.

  7. Genetic clustering and polymorphism of the merozoite surface protein-3 of Plasmodium knowlesi clinical isolates from Peninsular Malaysia.

    PubMed

    De Silva, Jeremy Ryan; Lau, Yee Ling; Fong, Mun Yik

    2017-01-03

    The simian malaria parasite Plasmodium knowlesi has been reported to cause significant numbers of human infection in South East Asia. Its merozoite surface protein-3 (MSP3) is a protein that belongs to a multi-gene family of proteins first found in Plasmodium falciparum. Several studies have evaluated the potential of P. falciparum MSP3 as a potential vaccine candidate. However, to date no detailed studies have been carried out on P. knowlesi MSP3 gene (pkmsp3). The present study investigates the genetic diversity, and haplotypes groups of pkmsp3 in P. knowlesi clinical samples from Peninsular Malaysia. Blood samples were collected from P. knowlesi malaria patients within a period of 4 years (2008-2012). The pkmsp3 gene of the isolates was amplified via PCR, and subsequently cloned and sequenced. The full length pkmsp3 sequence was divided into Domain A and Domain B. Natural selection, genetic diversity, and haplotypes of pkmsp3 were analysed using MEGA6 and DnaSP ver. 5.10.00 programmes. From 23 samples, 48 pkmsp3 sequences were successfully obtained. At the nucleotide level, 101 synonymous and 238 non-synonymous mutations were observed. Tests of neutrality were not significant for the full length, Domain A or Domain B sequences. However, the dN/dS ratio of Domain B indicates purifying selection for this domain. Analysis of the deduced amino acid sequences revealed 42 different haplotypes. Neighbour Joining phylogenetic tree and haplotype network analyses revealed that the haplotypes clustered into two distinct groups. A moderate level of genetic diversity was observed in the pkmsp3 and only the C-terminal region (Domain B) appeared to be under purifying selection. The separation of the pkmsp3 into two haplotype groups provides further evidence of the existence of two distinct P. knowlesi types or lineages. Future studies should investigate the diversity of pkmsp3 among P. knowlesi isolates in North Borneo, where large numbers of human knowlesi malaria infection

  8. Performance of Single Nucleotide Polymorphisms versus Haplotypes for Genome-Wide Association Analysis in Barley

    PubMed Central

    Jannink, Jean-Luc

    2010-01-01

    Genome-wide association studies (GWAS) may benefit from utilizing haplotype information for making marker-phenotype associations. Several rationales for grouping single nucleotide polymorphisms (SNPs) into haplotype blocks exist, but any advantage may depend on such factors as genetic architecture of traits, patterns of linkage disequilibrium in the study population, and marker density. The objective of this study was to explore the utility of haplotypes for GWAS in barley (Hordeum vulgare) to offer a first detailed look at this approach for identifying agronomically important genes in crops. To accomplish this, we used genotype and phenotype data from the Barley Coordinated Agricultural Project and constructed haplotypes using three different methods. Marker-trait associations were tested by the efficient mixed-model association algorithm (EMMA). When QTL were simulated using single SNPs dropped from the marker dataset, a simple sliding window performed as well or better than single SNPs or the more sophisticated methods of blocking SNPs into haplotypes. Moreover, the haplotype analyses performed better 1) when QTL were simulated as polymorphisms that arose subsequent to marker variants, and 2) in analysis of empirical heading date data. These results demonstrate that the information content of haplotypes is dependent on the particular mutational and recombinational history of the QTL and nearby markers. Analysis of the empirical data also confirmed our intuition that the distribution of QTL alleles in nature is often unlike the distribution of marker variants, and hence utilizing haplotype information could capture associations that would elude single SNPs. We recommend routine use of both single SNP and haplotype markers for GWAS to take advantage of the full information content of the genotype data. PMID:21124933

  9. Mitochondrial DNA haplotype distribution patterns in Pinus ponderosa (Pinaceae): range-wide evolutionary history and implications for conservation.

    PubMed

    Potter, Kevin M; Hipkins, Valerie D; Mahalovich, Mary F; Means, Robert E

    2013-08-01

    Ponderosa pine (Pinus ponderosa Douglas ex P. Lawson & C. Lawson) exhibits complicated patterns of morphological and genetic variation across its range in western North America. This study aims to clarify P. ponderosa evolutionary history and phylogeography using a highly polymorphic mitochondrial DNA marker, with results offering insights into how geographical and climatological processes drove the modern evolutionary structure of tree species in the region. We amplified the mtDNA nad1 second intron minisatellite region for 3,100 trees representing 104 populations, and sequenced all length variants. We estimated population-level haplotypic diversity and determined diversity partitioning among varieties, races and populations. After aligning sequences of minisatellite repeat motifs, we evaluated evolutionary relationships among haplotypes. The geographical structuring of the 10 haplotypes corresponded with division between Pacific and Rocky Mountain varieties. Pacific haplotypes clustered with high bootstrap support, and appear to have descended from Rocky Mountain haplotypes. A greater proportion of diversity was partitioned between Rocky Mountain races than between Pacific races. Areas of highest haplotypic diversity were the southern Sierra Nevada mountain range in California, northwestern California, and southern Nevada. Pinus ponderosa haplotype distribution patterns suggest a complex phylogeographic history not revealed by other genetic and morphological data, or by the sparse paleoecological record. The results appear consistent with long-term divergence between the Pacific and Rocky Mountain varieties, along with more recent divergences not well-associated with race. Pleistocene refugia may have existed in areas of high haplotypic diversity, as well as the Great Basin, Southwestern United States/northern Mexico, and the High Plains.

  10. Iterative local Gaussian clustering for expressed genes identification linked to malignancy of human colorectal carcinoma.

    PubMed

    Wasito, Ito; Hashim, Siti Zaiton M; Sukmaningrum, Sri

    2007-12-30

    Gene expression profiling plays an important role in the identification of biological and clinical properties of human solid tumors such as colorectal carcinoma. Profiling is required to reveal underlying molecular features for diagnostic and therapeutic purposes. A non-parametric density-estimation-based approach called iterative local Gaussian clustering (ILGC), was used to identify clusters of expressed genes. We used experimental data from a previous study by Muro and others consisting of 1,536 genes in 100 colorectal cancer and 11 normal tissues. In this dataset, the ILGC finds three clusters, two large and one small gene clusters, similar to their results which used Gaussian mixture clustering. The correlation of each cluster of genes and clinical properties of malignancy of human colorectal cancer was analysed for the existence of tumor or normal, the existence of distant metastasis and the existence of lymph node metastasis.

  11. The evolutionary life cycle of the polysaccharide biosynthetic gene cluster based on the Sphingomonadaceae.

    PubMed

    Wu, Mengmeng; Huang, Haidong; Li, Guoqiang; Ren, Yi; Shi, Zhong; Li, Xiaoyan; Dai, Xiaohui; Gao, Ge; Ren, Mengnan; Ma, Ting

    2017-04-21

    Although clustering of genes from the same metabolic pathway is a widespread phenomenon, the evolution of the polysaccharide biosynthetic gene cluster remains poorly understood. To determine the evolution of this pathway, we identified a scattered production pathway of the polysaccharide sanxan by Sphingomonas sanxanigenens NX02, and compared the distribution of genes between sphingan-producing and other Sphingomonadaceae strains. This allowed us to determine how the scattered sanxan pathway developed, and how the polysaccharide gene cluster evolved. Our findings suggested that the evolution of microbial polysaccharide biosynthesis gene clusters is a lengthy cyclic process comprising cluster 1 → scatter → cluster 2. The sanxan biosynthetic pathway proved the existence of a dispersive process. We also report the complete genome sequence of NX02, in which we identified many unstable genetic elements and powerful secretion systems. Furthermore, nine enzymes for the formation of activated precursors, four glycosyltransferases, four acyltransferases, and four polymerization and export proteins were identified. These genes were scattered in the NX02 genome, and the positive regulator SpnA of sphingans synthesis could not regulate sanxan production. Finally, we concluded that the evolution of the sanxan pathway was independent. NX02 evolved naturally as a polysaccharide producing strain over a long-time evolution involving gene acquisitions and adaptive mutations.

  12. Co-clustering phenome–genome for phenotype classification and disease gene discovery

    PubMed Central

    Hwang, TaeHyun; Atluri, Gowtham; Xie, MaoQiang; Dey, Sanjoy; Hong, Changjin; Kumar, Vipin; Kuang, Rui

    2012-01-01

    Understanding the categorization of human diseases is critical for reliably identifying disease causal genes. Recently, genome-wide studies of abnormal chromosomal locations related to diseases have mapped >2000 phenotype–gene relations, which provide valuable information for classifying diseases and identifying candidate genes as drug targets. In this article, a regularized non-negative matrix tri-factorization (R-NMTF) algorithm is introduced to co-cluster phenotypes and genes, and simultaneously detect associations between the detected phenotype clusters and gene clusters. The R-NMTF algorithm factorizes the phenotype–gene association matrix under the prior knowledge from phenotype similarity network and protein–protein interaction network, supervised by the label information from known disease classes and biological pathways. In the experiments on disease phenotype–gene associations in OMIM and KEGG disease pathways, R-NMTF significantly improved the classification of disease phenotypes and disease pathway genes compared with support vector machines and Label Propagation in cross-validation on the annotated phenotypes and genes. The newly predicted phenotypes in each disease class are highly consistent with human phenotype ontology annotations. The roles of the new member genes in the disease pathways are examined and validated in the protein–protein interaction subnetworks. Extensive literature review also confirmed many new members of the disease classes and pathways as well as the predicted associations between disease phenotype classes and pathways. PMID:22735708

  13. Global spread and genetic variants of the two CYP9M10 haplotype forms associated with insecticide resistance in Culex quinquefasciatus Say.

    PubMed

    Itokawa, K; Komagata, O; Kasai, S; Kawada, H; Mwatele, C; Dida, G O; Njenga, S M; Mwandawiro, C; Tomita, T

    2013-09-01

    Insecticide resistance develops as a genetic factor (allele) conferring lower susceptibility to insecticides proliferates within a target insect population under strong positive selection. Intriguingly, a resistance allele pre-existing in a population often bears a series of further adaptive allelic variants through new mutations. This phenomenon occasionally results in replacement of the predominating resistance allele by fitter new derivatives, and consequently, development of greater resistance at the population level. The overexpression of the cytochrome P450 gene CYP9M10 is associated with pyrethroid resistance in the southern house mosquito Culex quinquefasciatus. Previously, we have found two genealogically related overexpressing CYP9M10 haplotypes, which differ in gene copy number (duplicated and non-duplicated). The duplicated haplotype was derived from the non-duplicated overproducer probably recently. In the present study, we investigated allelic series of CYP9M10 involved in three C. quinquefasciatus laboratory colonies recently collected from three different localities. Duplicated and non-duplicated overproducing haplotypes coexisted in African and Asian colonies indicating a global distribution of both haplotype lineages. The duplicated haplotypes both in the Asian and African colonies were associated with higher expression levels and stronger resistance than non-duplicated overproducing haplotypes. There were slight variation in expression level among the non-duplicated overproducing haplotypes. The nucleotide sequences in coding and upstream regions among members of this group also showed a little diversity. Non-duplicated overproducing haplotypes with relatively higher expression were genealogically closer to the duplicated haplotypes than the other non-duplicated overproducing haplotypes, suggesting multiple cis-acting mutations before duplication.

  14. Clustered Xenopus keratin genes: A genomic, transcriptomic, and proteomic analysis.

    PubMed

    Suzuki, Ken-Ichi T; Suzuki, Miyuki; Shigeta, Mitsuki; Fortriede, Joshua D; Takahashi, Shuji; Mawaribuchi, Shuuji; Yamamoto, Takashi; Taira, Masanori; Fukui, Akimasa

    2017-06-15

    Keratin genes belong to the intermediate filament superfamily and their expression is altered following morphological and physiological changes in vertebrate epithelial cells. Keratin genes are divided into two groups, type I and II, and are clustered on vertebrate genomes, including those of Xenopus species. Various keratin genes have been identified and characterized by their unique expression patterns throughout ontogeny in Xenopus laevis; however, compilation of previously reported and newly identified keratin genes in two Xenopus species is required for our further understanding of keratin gene evolution, not only in amphibians but also in all terrestrial vertebrates. In this study, 120 putative type I and II keratin genes in total were identified based on the genome data from two Xenopus species. We revealed that most of these genes are highly clustered on two homeologous chromosomes, XLA9_10 and XLA2 in X. laevis, and XTR10 and XTR2 in X. tropicalis, which are orthologous to those of human, showing conserved synteny among tetrapods. RNA-Seq data from various embryonic stages and adult tissues highlighted the unique expression profiles of orthologous and homeologous keratin genes in developmental stage- and tissue-specific manners. Moreover, we identified dozens of epidermal keratin proteins from the whole embryo, larval skin, tail, and adult skin using shotgun proteomics. In light of our results, we discuss the radiation, diversification, and unique expression of the clustered keratin genes, which are closely related to epidermal development and terrestrial adaptation during amphibian evolution, including Xenopus speciation. Copyright © 2016 Elsevier Inc. All rights reserved.

  15. Serpin peptidase inhibitor (SERPINB5) haplotypes are associated with susceptibility to hepatocellular carcinoma

    NASA Astrophysics Data System (ADS)

    Yang, Shun-Fa; Yeh, Chao-Bin; Chou, Ying-Erh; Lee, Hsiang-Lin; Liu, Yu-Fan

    2016-05-01

    Hepatocellular carcinoma (HCC) represents the second leading cause of cancer-related death worldwide. The serpin peptidase inhibitor SERPINB5 is a tumour-suppressor gene that promotes the development of various cancers in humans. However, whether SERPINB5 gene variants play a role in HCC susceptibility remains unknown. In this study, we genotyped 6 SNPs of the SERPINB5 gene in an independent cohort from a replicate population comprising 302 cases and 590 controls. Additionally, patients who had at least one rs2289520 C allele in SERPINB5 tended to exhibit better liver function than patients with genotype GG (Child-Pugh grade A vs. B or C; P = 0.047). Next, haplotype blocks were reconstructed according to the linkage disequilibrium structure of the SERPINB5 gene. A haplotype “C-C-C” (rs17071138 + rs3744941 + rs8089204) in SERPINB5-correlated promoter showed a significant association with an increased HCC risk (AOR = 1.450 P = 0.031). Haplotypes “T-C-A” and “C-C-C” (rs2289519 + rs2289520 + rs1455555) located in the SERPINB5 coding region had a decreased (AOR = 0.744 P = 0.031) and increased (AOR = 1.981 P = 0.001) HCC risk, respectively. Finally, an additional integrated in silico analysis confirmed that these SNPs affected SERPINB5 expression and protein stability, which significantly correlated with tumour expression and subsequently with tumour development and aggressiveness. Taken together, our findings regarding these biomarkers provide a prediction model for risk assessment.

  16. Intrahaplotypic Variants Differentiate Complex Linkage Disequilibrium within Human MHC Haplotypes

    PubMed Central

    Lam, Tze Hau; Tay, Matthew Zirui; Wang, Bei; Xiao, Ziwei; Ren, Ee Chee

    2015-01-01

    Distinct regions of long-range genetic fixation in the human MHC region, known as conserved extended haplotypes (CEHs), possess unique genomic characteristics and are strongly associated with numerous diseases. While CEHs appear to be homogeneous by SNP analysis, the nature of fine variations within their genomic structure is unknown. Using multiple, MHC-homozygous cell lines, we demonstrate extensive sequence conservation in two common Asian MHC haplotypes: A33-B58-DR3 and A2-B46-DR9. However, characterization of phase-resolved MHC haplotypes revealed unique intra-CEH patterns of variation and uncovered 127 single nucleotide variants (SNVs) which are missing from public databases. We further show that the strong linkage disequilibrium structure within the human MHC that typically confounds precise identification of genetic features can be resolved using intra-CEH variants, as evidenced by rs3129063 and rs448489, which affect expression of ZFP57, a gene important in methylation and epigenetic regulation. This study demonstrates an improved strategy that can be used towards genetic dissection of diseases. PMID:26593880

  17. How Have Self-Incompatibility Haplotypes Diversified? Generation of New Haplotypes during the Evolution of Self-Incompatibility from Self-Compatibility.

    PubMed

    Sakai, Satoki

    2016-08-01

    I developed a gametophytic self-incompatibility (SI) model to study the conditions leading to diversification in SI haplotypes. In the model, the SI system is assumed to be incomplete, and the pollen expressing a given specificity is not fully rejected by the pistils expressing the same specificity. I also assumed that mutations can occur that enhance the rejection of pollen by pistils with the same haplotype variant and reduce rejection by pistils with other variants in the same haplotype. I found that if such mutations occur, the new haplotypes (mutant variants) can stably coexist with the ancestral haplotype in which the mutant arose. This is because pollen bearing the new haplotype is most strongly rejected by pistils bearing the same new haplotype among the pistils in the population; hence, negative frequency-dependent selection prevents their fixation. I also performed simulations and found that the nearly complete SI system evolves from completely self-compatible populations and that SI haplotypes can increase to about 40-50 within a few thousand generations. On the basis of my findings, I propose that diversification of SI haplotypes occurred during the evolution of SI from self-compatibility.

  18. Complement factor H gene (CFH) polymorphisms C-257T, G257A and haplotypes are associated with protection against severe dengue phenotype, possible related with high CFH expression

    PubMed Central

    Pastor, André F.; Moura, Laís Rodrigues; Neto, José W.D.; Nascimento, Eduardo J.M.; Calzavara-Silva, Carlos E.; Gomes, Ana Lisa V.; da Silva, Ana Maria; Cordeiro, Marli T.; Braga-Neto, Ulisses; Crovella, Sergio; Gil, Laura H.V.G.; Marques, Ernesto T.A.; Acioli-Santos, Bartolomeu

    2013-01-01

    Four genetic polymorphisms located at the promoter (C-257T) and coding regions of CFH gene (exon 2 G257A, exon 14 A2089G and exon 19 G2881T) were investigated in 121 dengue patients (DENV-3) in order to assess the relationship between allele/haplotypes variants and clinical outcomes. A statistical value was found between the CFH-257T allele (TT/TC genotypes) and reduced susceptibility to severe dengue (SD). Statistical associations indicate that individuals bearing a T allele presented significantly higher protein levels in plasma. The –257T variant is located within a NF-κB binding site, suggesting that this variant might have effect on the ability of the CFH gene to respond to signals via the NF-κB pathway. The G257A allelic variant showed significant protection against severe dengue. When CFH haplotypes effect was considered, the ancestral CG/CG promoter-exon 2 SNP genotype showed significant risk to SD either in a general comparison (ancestral × all variant genotypes), as well as in individual genotypes comparison (ancestral × each variant genotype), where the most prevalent effect was observed in the CG/CG × CA/TG comparison. These findings support the involvement of –257T, 257A allele variants and haplotypes on severe dengue phenotype protection, related with high basal CFH expression. PMID:23747994

  19. Clustering gene expression regulators: new approach to disease subtyping.

    PubMed

    Pyatnitskiy, Mikhail; Mazo, Ilya; Shkrob, Maria; Schwartz, Elena; Kotelnikova, Ekaterina

    2014-01-01

    One of the main challenges in modern medicine is to stratify different patient groups in terms of underlying disease molecular mechanisms as to develop more personalized approach to therapy. Here we propose novel method for disease subtyping based on analysis of activated expression regulators on a sample-by-sample basis. Our approach relies on Sub-Network Enrichment Analysis algorithm (SNEA) which identifies gene subnetworks with significant concordant changes in expression between two conditions. Subnetwork consists of central regulator and downstream genes connected by relations extracted from global literature-extracted regulation database. Regulators found in each patient separately are clustered together and assigned activity scores which are used for final patients grouping. We show that our approach performs well compared to other related methods and at the same time provides researchers with complementary level of understanding of pathway-level biology behind a disease by identification of significant expression regulators. We have observed the reasonable grouping of neuromuscular disorders (triggered by structural damage vs triggered by unknown mechanisms), that was not revealed using standard expression profile clustering. For another experiment we were able to suggest the clusters of regulators, responsible for colorectal carcinoma vs adenoma discrimination and identify frequently genetically changed regulators that could be of specific importance for the individual characteristics of cancer development. Proposed approach can be regarded as biologically meaningful feature selection, reducing tens of thousands of genes down to dozens of clusters of regulators. Obtained clusters of regulators make possible to generate valuable biological hypotheses about molecular mechanisms related to a clinical outcome for individual patient.

  20. Clustering Gene Expression Regulators: New Approach to Disease Subtyping

    PubMed Central

    Pyatnitskiy, Mikhail; Mazo, Ilya; Shkrob, Maria; Schwartz, Elena; Kotelnikova, Ekaterina

    2014-01-01

    One of the main challenges in modern medicine is to stratify different patient groups in terms of underlying disease molecular mechanisms as to develop more personalized approach to therapy. Here we propose novel method for disease subtyping based on analysis of activated expression regulators on a sample-by-sample basis. Our approach relies on Sub-Network Enrichment Analysis algorithm (SNEA) which identifies gene subnetworks with significant concordant changes in expression between two conditions. Subnetwork consists of central regulator and downstream genes connected by relations extracted from global literature-extracted regulation database. Regulators found in each patient separately are clustered together and assigned activity scores which are used for final patients grouping. We show that our approach performs well compared to other related methods and at the same time provides researchers with complementary level of understanding of pathway-level biology behind a disease by identification of significant expression regulators. We have observed the reasonable grouping of neuromuscular disorders (triggered by structural damage vs triggered by unknown mechanisms), that was not revealed using standard expression profile clustering. For another experiment we were able to suggest the clusters of regulators, responsible for colorectal carcinoma vs adenoma discrimination and identify frequently genetically changed regulators that could be of specific importance for the individual characteristics of cancer development. Proposed approach can be regarded as biologically meaningful feature selection, reducing tens of thousands of genes down to dozens of clusters of regulators. Obtained clusters of regulators make possible to generate valuable biological hypotheses about molecular mechanisms related to a clinical outcome for individual patient. PMID:24416320

  1. Delineation of metabolic gene clusters in plant genomes by chromatin signatures.

    PubMed

    Yu, Nan; Nützmann, Hans-Wilhelm; MacDonald, James T; Moore, Ben; Field, Ben; Berriri, Souha; Trick, Martin; Rosser, Susan J; Kumar, S Vinod; Freemont, Paul S; Osbourn, Anne

    2016-03-18

    Plants are a tremendous source of diverse chemicals, including many natural product-derived drugs. It has recently become apparent that the genes for the biosynthesis of numerous different types of plant natural products are organized as metabolic gene clusters, thereby unveiling a highly unusual form of plant genome architecture and offering novel avenues for discovery and exploitation of plant specialized metabolism. Here we show that these clustered pathways are characterized by distinct chromatin signatures of histone 3 lysine trimethylation (H3K27me3) and histone 2 variant H2A.Z, associated with cluster repression and activation, respectively, and represent discrete windows of co-regulation in the genome. We further demonstrate that knowledge of these chromatin signatures along with chromatin mutants can be used to mine genomes for cluster discovery. The roles of H3K27me3 and H2A.Z in repression and activation of single genes in plants are well known. However, our discovery of highly localized operon-like co-regulated regions of chromatin modification is unprecedented in plants. Our findings raise intriguing parallels with groups of physically linked multi-gene complexes in animals and with clustered pathways for specialized metabolism in filamentous fungi. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  2. Global variation in CYP2C8–CYP2C9 functional haplotypes

    PubMed Central

    Speed, William C; Kang, Soonmo Peter; Tuck, David P; Harris, Lyndsay N; Kidd, Kenneth K

    2009-01-01

    We have studied the global frequency distributions of 10 single nucleotide polymorphisms (SNPs) across 132 kb of CYP2C8 and CYP2C9 in ∼2500 individuals representing 45 populations. Five of the SNPs were in noncoding sequences; the other five involved the more common missense variants (four in CYP2C8, one in CYP2C9) that change amino acids in the gene products. One haplotype containing two CYP2C8 coding variants and one CYP2C9 coding variant reaches an average frequency of 10% in Europe; a set of haplotypes with a different CYP2C8 coding variant reaches 17% in Africa. In both cases these haplotypes are found in other regions of the world at <1%. This considerable geographic variation in haplotype frequencies impacts the interpretation of CYP2C8/CYP2C9 association studies, and has pharmacogenomic implications for drug interactions. PMID:19381162

  3. Analysis of Shared Haplotypes amongst Palauans Maps Loci for Psychotic Disorders to 4q28 and 5q23-q31.

    PubMed

    Bodea, Corneliu A; Middleton, Frank A; Melhem, Nadine M; Klei, Lambertus; Song, Youeun; Tiobech, Josepha; Marumoto, Pearl; Yano, Victor; Faraone, Stephen V; Roeder, Kathryn; Myles-Worsley, Marina; Devlin, Bernie; Byerley, William

    2017-02-01

    To localize genetic variation affecting risk for psychotic disorders in the population of Palau, we genotyped DNA samples from 203 Palauan individuals diagnosed with psychotic disorders, broadly defined, and 125 control subjects using a genome-wide single nucleotide polymorphism array. Palau has unique features advantageous for this study: due to its population history, Palauans are substantially interrelated; affected individuals often, but not always, cluster in families; and we have essentially complete ascertainment of affected individuals. To localize risk variants to genomic regions, we evaluated long-shared haplotypes, ≥10 Mb, identifying clusters of affected individuals who share such haplotypes. This extensive sharing, typically identical by descent, was significantly greater in cases than population controls, even after controlling for relatedness. Several regions of the genome exhibited substantial excess of shared haplotypes for affected individuals, including 3p21, 3p12, 4q28, and 5q23-q31. Two of these regions, 4q28 and 5q23-q31, showed significant linkage by traditional LOD score analysis and could harbor variants of more sizeable risk for psychosis or a multiplicity of risk variants. The pattern of haplotype sharing in 4q28 highlights PCDH10 , encoding a cadherin-related neuronal receptor, as possibly involved in risk.

  4. Apolipoprotein A5: A newly identified gene impacting plasmatriglyceride levels in humans and mice

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Pennacchio, Len A.; Rubin, Edward M.

    2002-09-15

    Apolipoprotein A5 (APOA5) is a newly described member of theapolipoprotein gene family whose initial discovery arose from comparativesequence analysis of the mammalian APOA1/C3/A4 gene cluster. Functionalstudies in mice indicated that alteration in the level of APOA5significantly impacted plasma triglyceride concentrations. Miceover-expressing human APOA5 displayed significantly reducedtriglycerides, while mice lacking apoA5 had a large increase in thislipid parameter. Studies in humans have also suggested an important rolefor APOA5 in determining plasma triglyceride concentrations. In theseexperiments, polymorphisms in the human gene were found to define severalcommon haplotypes that were associated with significant changes intriglyceride concentrations in multiple populations. Several separateclinical studies havemore » provided consistent and strong support for theeffect with 24 percent of Caucasians, 35 percent of African-Americans and53 percent of Hispanics carrying APOA5 haplotypes associated withincreased plasma triglyceride levels. In summary, APOA5 represents anewly discovered gene involved in triglyceride metabolism in both humansand mice whose mechanism of action remains to be deciphered.« less

  5. Iterative local Gaussian clustering for expressed genes identification linked to malignancy of human colorectal carcinoma

    PubMed Central

    Wasito, Ito; Hashim, Siti Zaiton M; Sukmaningrum, Sri

    2007-01-01

    Gene expression profiling plays an important role in the identification of biological and clinical properties of human solid tumors such as colorectal carcinoma. Profiling is required to reveal underlying molecular features for diagnostic and therapeutic purposes. A non-parametric density-estimation-based approach called iterative local Gaussian clustering (ILGC), was used to identify clusters of expressed genes. We used experimental data from a previous study by Muro and others consisting of 1,536 genes in 100 colorectal cancer and 11 normal tissues. In this dataset, the ILGC finds three clusters, two large and one small gene clusters, similar to their results which used Gaussian mixture clustering. The correlation of each cluster of genes and clinical properties of malignancy of human colorectal cancer was analysed for the existence of tumor or normal, the existence of distant metastasis and the existence of lymph node metastasis. PMID:18305825

  6. Patterning C. elegans: homeotic cluster genes, cell fates and cell migrations.

    PubMed

    Salser, S J; Kenyon, C

    1994-05-01

    Despite its simple body form, the nematode C. elegans expresses homeotic cluster genes similar to those of insects and vertebrates in the patterning of many cell types and tissues along the anteroposterior axis. In the ventral nerve cord, these genes program spatial patterns of cell death, fusion, division and neurotransmitter production; in migrating cells they regulate the direction and extent of movement. Nematode development permits an analysis at the cellular level of how homeotic cluster genes interact to specify cell fates, and how cell behavior can be regulated to assemble an organism.

  7. A genomics based discovery of secondary metabolite biosynthetic gene clusters in Aspergillus ustus.

    PubMed

    Pi, Borui; Yu, Dongliang; Dai, Fangwei; Song, Xiaoming; Zhu, Congyi; Li, Hongye; Yu, Yunsong

    2015-01-01

    Secondary metabolites (SMs) produced by Aspergillus have been extensively studied for their crucial roles in human health, medicine and industrial production. However, the resulting information is almost exclusively derived from a few model organisms, including A. nidulans and A. fumigatus, but little is known about rare pathogens. In this study, we performed a genomics based discovery of SM biosynthetic gene clusters in Aspergillus ustus, a rare human pathogen. A total of 52 gene clusters were identified in the draft genome of A. ustus 3.3904, such as the sterigmatocystin biosynthesis pathway that was commonly found in Aspergillus species. In addition, several SM biosynthetic gene clusters were firstly identified in Aspergillus that were possibly acquired by horizontal gene transfer, including the vrt cluster that is responsible for viridicatumtoxin production. Comparative genomics revealed that A. ustus shared the largest number of SM biosynthetic gene clusters with A. nidulans, but much fewer with other Aspergilli like A. niger and A. oryzae. These findings would help to understand the diversity and evolution of SM biosynthesis pathways in genus Aspergillus, and we hope they will also promote the development of fungal identification methodology in clinic.

  8. A Genomics Based Discovery of Secondary Metabolite Biosynthetic Gene Clusters in Aspergillus ustus

    PubMed Central

    Pi, Borui; Yu, Dongliang; Dai, Fangwei; Song, Xiaoming; Zhu, Congyi; Li, Hongye; Yu, Yunsong

    2015-01-01

    Secondary metabolites (SMs) produced by Aspergillus have been extensively studied for their crucial roles in human health, medicine and industrial production. However, the resulting information is almost exclusively derived from a few model organisms, including A. nidulans and A. fumigatus, but little is known about rare pathogens. In this study, we performed a genomics based discovery of SM biosynthetic gene clusters in Aspergillus ustus, a rare human pathogen. A total of 52 gene clusters were identified in the draft genome of A. ustus 3.3904, such as the sterigmatocystin biosynthesis pathway that was commonly found in Aspergillus species. In addition, several SM biosynthetic gene clusters were firstly identified in Aspergillus that were possibly acquired by horizontal gene transfer, including the vrt cluster that is responsible for viridicatumtoxin production. Comparative genomics revealed that A. ustus shared the largest number of SM biosynthetic gene clusters with A. nidulans, but much fewer with other Aspergilli like A. niger and A. oryzae. These findings would help to understand the diversity and evolution of SM biosynthesis pathways in genus Aspergillus, and we hope they will also promote the development of fungal identification methodology in clinic. PMID:25706180

  9. Divergence at the casein haplotypes in dairy and meat goat breeds.

    PubMed

    Küpper, Julia; Chessa, Stefania; Rignanese, Daniela; Caroli, Anna; Erhardt, Georg

    2010-02-01

    Casein genes have been proved to have an influence on milk properties, and are in addition appropriate for phylogeny studies. A large number of casein polymorphisms exist in goats, making their analysis quite complex. The four casein loci were analyzed by molecular techniques for genetic polymorphism detection in the two dairy goat breeds Bunte Deutsche Edelziege (BDE; n=96), Weisse Deutsche Edelziege (WDE; n=91), and the meat goat breed Buren (n=75). Of the 35 analyzed alleles, 18 were found in BDE, and 17 in Buren goats and WDE. In addition, a new allele was identified at the CSN1S1 locus in the BDE, showing a frequency of 0.05. This variant, named CSN1S1*A', is characterized by a t-->c transversion in intron 9. Linkage disequilibrium was found at the casein haplotype in all three breeds. A total of 30 haplotypes showed frequencies higher than 0.01. In the Buren breed only one haplotype showed a frequency higher than 0.1. The ancestral haplotype B-A-A-B (in the order: CSN1S1-CSN2-CSN1S2-CSN3) occurred in all three breeds, showing a very high frequency (>0.8) in the Buren.

  10. Discovery of novel MHC-class I alleles and haplotypes in Filipino cynomolgus macaques (Macaca fascicularis) by pyrosequencing and Sanger sequencing: Mafa-class I polymorphism.

    PubMed

    Shiina, Takashi; Yamada, Yukiho; Aarnink, Alice; Suzuki, Shingo; Masuya, Anri; Ito, Sayaka; Ido, Daisuke; Yamanaka, Hisashi; Iwatani, Chizuru; Tsuchiya, Hideaki; Ishigaki, Hirohito; Itoh, Yasushi; Ogasawara, Kazumasa; Kulski, Jerzy K; Blancher, Antoine

    2015-10-01

    Although the low polymorphism of the major histocompatibility complex (MHC) transplantation genes in the Filipino cynomolgus macaque (Macaca fascicularis) is expected to have important implications in the selection and breeding of animals for medical research, detailed polymorphism information is still lacking for many of the duplicated class I genes. To better elucidate the degree and types of MHC polymorphisms and haplotypes in the Filipino macaque population, we genotyped 127 unrelated animals by the Sanger sequencing method and high-resolution pyrosequencing and identified 112 different alleles, 28 at cynomolgus macaque MHC (Mafa)-A, 54 at Mafa-B, 12 at Mafa-I, 11 at Mafa-E, and seven at Mafa-F alleles, of which 56 were newly described. Of them, the newly discovered Mafa-A8*01:01 lineage allele had low nucleotide similarities (<86%) with primate MHC class I genes, and it was also conserved in the Vietnamese and Indonesian populations. In addition, haplotype estimations revealed 17 Mafa-A, 23 Mafa-B, and 12 Mafa-E haplotypes integrated with 84 Mafa-class I haplotypes and Mafa-F alleles. Of these, the two Mafa-class I haplotypes, F/A/E/B-Hp1 and F/A/E/B-Hp2, had the highest haplotype frequencies at 10.6 and 10.2%, respectively. This suggests that large scale genetic screening of the Filipino macaque population would identify these and other high-frequency Mafa-class I haplotypes that could be used as MHC control animals for the benefit of biomedical research.

  11. RTEL1 tagging SNPs and haplotypes were associated with glioma development

    PubMed Central

    2013-01-01

    Abstract As glioma ranks as the first most prevalent solid tumors in primary central nervous system, certain single-nucleotide polymorphisms (SNPs) may be related to increased glioma risk, and have implications in carcinogenesis. The present case–control study was carried out to elucidate how common variants contribute to glioma susceptibility. Ten candidate tagging SNPs (tSNPs) were selected from seven genes whose polymorphisms have been proven by classical literatures and reliable databases to be tended to relate with gliomas, and with the minor allele frequency (MAF) > 5% in the HapMap Asian population. The selected tSNPs were genotyped in 629 glioma patients and 645 controls from a Han Chinese population using the multiplexed SNP MassEXTEND assay calibrated. Two significant tSNPs in RTEL1 gene were observed to be associated with glioma risk (rs6010620, P = 0.0016, OR: 1.32, 95% CI: 1.11-1.56; rs2297440, P = 0.001, OR: 1.33, 95% CI: 1.12-1.58) by χ2 test. It was identified the genotype “GG” of rs6010620 acted as the protective genotype for glioma (OR, 0.46; 95% CI, 0.31-0.7; P = 0.0002), while the genotype “CC” of rs2297440 as the protective genotype in glioma (OR, 0.47; 95% CI, 0.31-0.71; P = 0.0003). Furthermore, haplotype “GCT” in RTEL1 gene was found to be associated with risk of glioma (OR, 0.7; 95% CI, 0.57-0.86; Fisher’s P = 0.0005; Pearson’s P = 0.0005), and haplotype “ATT” was detected to be associated with risk of glioma (OR, 1.32; 95% CI, 1.12-1.57; Fisher’s P = 0.0013; Pearson’s P = 0.0013). Two single variants, the genotypes of “GG” of rs6010620 and “CC” of rs2297440 (rs6010620 and rs2297440) in the RTEL1 gene, together with two haplotypes of GCT and ATT, were identified to be associated with glioma development. And it might be used to evaluate the glioma development risks to screen the above RTEL1 tagging SNPs and haplotypes. Virtual slides The virtual slides for this article

  12. Genome mining-directed activation of a silent angucycline biosynthetic gene cluster in Streptomyces chattanoogensis.

    PubMed

    Zhou, Zhenxing; Xu, Qingqing; Bu, Qingting; Guo, Yuanyang; Liu, Shuiping; Liu, Yu; Du, Yiling; Li, Yongquan

    2015-02-09

    Genomic sequencing of actinomycetes has revealed the presence of numerous gene clusters seemingly capable of natural product biosynthesis, yet most clusters are cryptic under laboratory conditions. Bioinformatics analysis of the completely sequenced genome of Streptomyces chattanoogensis L10 (CGMCC 2644) revealed a silent angucycline biosynthetic gene cluster. The overexpression of a pathway-specific activator gene under the constitutive ermE* promoter successfully triggered the expression of the angucycline biosynthetic genes. Two novel members of the angucycline antibiotic family, chattamycins A and B, were further isolated and elucidated. Biological activity assays demonstrated that chattamycin B possesses good antitumor activities against human cancer cell lines and moderate antibacterial activities. The results presented here provide a feasible method to activate silent angucycline biosynthetic gene clusters to discover potential new drug leads. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  13. DLA-DRB1, DQA1, and DQB1 alleles and haplotypes in North American Gray Wolves.

    PubMed

    Kennedy, Lorna J; Angles, John M; Barnes, Annette; Carmichael, Lindsey E; Radford, Alan D; Ollier, William E R; Happ, George M

    2007-01-01

    The canine major histocompatibility complex contains highly polymorphic genes, many of which are critical in regulating immune response. Since domestic dogs evolved from Gray Wolves (Canis lupus), common DLA class II alleles should exist. Sequencing was used to characterize 175 Gray Wolves for DLA class II alleles, and data from 1856 dogs, covering 85 different breeds of mostly European origin, were available for comparison. Within wolves, 28 new alleles were identified, all occurring in at least 2 individuals. Three DLA-DRB1, 8 DLA-DQA1, and 6 DLA-DQB1 alleles also identified in dogs were present. Twenty-eight haplotypes were identified, of which 2 three-locus haplotypes, and many DLA-DQA1/DQB1 haplotypes, are also found in dogs. The wolves studied had relatively few dog DLA alleles and may therefore represent a remnant population descended from Asian wolves. The single European wolf included carried a haplotype found in both these North American wolves and in many dog breeds. Furthermore, one wolf DQB1 allele has been found in Shih Tzu, a breed of Asian origin. These data suggest that the wolf ancestors of Asian and European dogs may have had different gene pools, currently reflected in the DLA alleles present in dog breeds.

  14. Association and haplotype analysis of the insulin-degrading enzyme (IDE) gene, a strong positional and biological candidate for type 2 diabetes susceptibility.

    PubMed

    Groves, Christopher J; Wiltshire, Steven; Smedley, Damian; Owen, Katherine R; Frayling, Timothy M; Walker, Mark; Hitman, Graham A; Levy, Jonathan C; O'Rahilly, Stephen; Menzel, Stephan; Hattersley, Andrew T; McCarthy, Mark I

    2003-05-01

    The gene for insulin-degrading enzyme (IDE) represents a strong positional and biological candidate for type 2 diabetes susceptibility. IDE maps to chromosome 10q23.3, a region linked to diabetes in several populations; the rat homolog has been directly implicated in diabetes susceptibility; and known functions of IDE support an important role in glucose homeostasis. We sought evidence for association between IDE variation and diabetes by mutation screening, defining local haplotype structure, and genotyping variants delineating common haplotypic diversity. An initial case-control analysis (628 diabetic probands from multiplex sibships and 604 control subjects) found no haplotypic associations, although one variant (IDE2, -179T-->C) showed modest association with diabetes (odds ratio [OR]1.25, P = 0.03). Linkage partitioning analyses failed to support this association, but provided borderline evidence for a different variant (IDE10, IVS20-405A-->G) (P = 0.06). Neither variant was associated with diabetes when replication was sought in 377 early onset diabetic subjects and 825 control subjects, though combined analysis of all typed cohorts indicated a nominally significant effect at IDE2 (OR 1.21 [1.04-1.40], P = 0.013). In the absence of convincing support for this association from linkage partitioning or analyses of continuous measures of glycemia, we conclude that analysis of over 2,400 samples provides no compelling evidence that variation in IDE contributes to diabetes susceptibility in humans.

  15. Haplotype diversity in the equine myostatin gene with focus on variants associated with race distance propensity and muscle fiber type proportions

    PubMed Central

    Petersen, Jessica L; Valberg, Stephanie J; Mickelson, James R; McCue, Molly E

    2014-01-01

    Summary Two variants in the equine myostatin gene (MSTN), including a T/C SNP substitution in the first intron and a 227-bp SINE insertion in the promoter, are associated with muscle fiber type proportions in the Quarter Horse (QH) and with the prediction of race distance propensity in the Thoroughbred (TB). Genotypes from these loci, along with 18 additional variants surrounding MSTN, were examined in 301 horses of 14 breeds to evaluate haplotype relationships and diversity. The C allele of intron 1 was found in 12 of 14 breeds at a frequency of 0.27; the SINE was observed in five breeds, but common in only the TB and QH (0.73 and 0.48 respectively). Haplotype data suggest the SINE insertion is contemporary to and arose upon a haplotype containing the intron 1 C allele. Gluteal muscle biopsies of TBs showed a significant association of the intron 1 C allele and SINE with a higher proportion of Type 2B and lower proportion of Type 1 fibers. However, in the Belgian horse, in which the SINE is not present, the intron 1 SNP was not associated with fiber type proportions, and evaluation of fiber type proportions across the Belgian, TB and QH breeds shows the significant effect of breed on fiber type proportions is negated when evaluating horses without the SINE variant. These data suggest the SINE, rather than the intron 1 SNP, is driving the observed muscle fiber type characteristics and is the variant targeted by selection for short-distance racing. PMID:25160752

  16. Two independent apolipoprotein A5 haplotypes influence human plasma triglyceride levels.

    PubMed

    Pennacchio, Len A; Olivier, Michael; Hubacek, Jaroslav A; Krauss, Ronald M; Rubin, Edward M; Cohen, Jonathan C

    2002-11-15

    The recently identified apolipoprotein A5 gene (APOA5) has been shown to play an important role in determining plasma triglyceride concentrations in humans and mice. We previously identified an APOA5 haplotype (designated APOA5*2) that is present in approximately 16% of Caucasians and is associated with increased plasma triglyceride concentrations. In this report we describe another APOA5 haplotype (APOA5*3) containing the rare allele of the single nucleotide polymorphism c.56C>G that changes serine to tryptophan at codon 19 and is independently associated with high plasma triglyceride levels in three different populations. In a sample of 264 Caucasian men and women with plasma triglyceride concentrations above the 90th percentile or below the 10th percentile, the APOA5*3 haplotype was more than three-fold more common in the group with high plasma triglyceride levels. In a second independently ascertained sample of Caucasian men and women (n=419) who were studied while consuming their self-selected diets as well as after high-carbohydrate diets and high-fat diets, the APOA5*3 haplotype was associated with increased plasma triglyceride levels on all three dietary regimens. In a third population comprising 2660 randomly selected individuals, the APOA5*3 haplotype was found in 12% of Caucasians, 14% of African-Americans and 28% of Hispanics and was associated with increased plasma triglyceride levels in both men and women in each ethnic group. These findings establish that the APOA5 locus contributes significantly to inter-individual variation in plasma triglyceride levels in humans. Together, the APOA5*2 and APOA5*3 haplotypes are found in 25-50% of African-Americans, Hispanics and Caucasians and support the contribution of common human variation to quantitative phenotypes in the general population.

  17. Two independent apolipoprotein a5 Haplotypes influence human plasma triglyceride levels

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Pennacchio, Len A.; Olivier, Michael; Hubacek, Jaroslav A.

    2002-09-16

    The recently identified apolipoprotein A5 gene (APOA5) has been shown to play an important role in determining plasma triglyceride concentrations in humans and mice. We previously identified an APOA5 haplotype (designated APOA5*2) that is present in {approx}16 percent of Caucasians and is associated with increased plasma triglyceride concentrations. In this report we describe another APOA5 haplotype (APOA5*3) containing the rare allele of the single nucleotide polymorphism c.56C>G that changes serine to tryptophan at codon 19 and is independently associated with high plasma triglyceride levels in three different populations. In a sample of 264 Caucasian men and women with plasma triglyceridemore » concentrations above the 90th percentile or below the 10th percentile, the APOA5*3 haplotype was more than three-fold more common in the group with high plasma triglyceride levels. In a second independently ascertained sample of Caucasian men and women (n 1/4 419) who were studied while consuming their self-selected diets as well as after high-carbohydrate diets and high-fat diets, the APOA5*3 haplotype was associated with increased plasma triglyceride levels on all three dietary regimens. In a third population comprising 2660 randomly selected individuals, the APOA5*3 haplotype was found in 12 percent of Caucasians, 14 percent of African-Americans and 28 percent of Hispanics and was associated with increased plasma triglyceride levels in both men and women in each ethnic group. These findings establish that the APOA5 locus contributes significantly to inter-individual variation in plasma triglyceride levels in humans. Together, the APOA5*2 and APOA5*3 haplotypes are found in 25 50 percent of African-Americans, Hispanics and Caucasians and support the contribution of common human variation to quantitative phenotypes in the general population.« less

  18. Three Novel Haplotypes of Theileria bicornis in Black and White Rhinoceros in Kenya.

    PubMed

    Otiende, M Y; Kivata, M W; Jowers, M J; Makumi, J N; Runo, S; Obanda, V; Gakuya, F; Mutinda, M; Kariuki, L; Alasaad, S

    2016-02-01

    Piroplasms, especially those in the genera Babesia and Theileria, have been found to naturally infect rhinoceros. Due to natural or human-induced stress factors such as capture and translocations, animals often develop fatal clinical piroplasmosis, which causes death if not treated. This study examines the genetic diversity and occurrence of novel Theileria species infecting both black and white rhinoceros in Kenya. Samples collected opportunistically during routine translocations and clinical interventions from 15 rhinoceros were analysed by polymerase chain reaction (PCR) using a nested amplification of the small subunit ribosomal RNA (18S rRNA) gene fragments of Babesia and Theileria. Our study revealed for the first time in Kenya the presence of Theileria bicornis in white (Ceratotherium simum simum) and black (Diceros bicornis michaeli) rhinoceros and the existence of three new haplotypes: haplotypes H1 and H3 were present in white rhinoceros, while H2 was present in black rhinoceros. No specific haplotype was correlated to any specific geographical location. The Bayesian inference 50% consensus phylogram recovered the three haplotypes monophyleticly, and Theileria bicornis had very high support (BPP: 0.98). Furthermore, the genetic p-uncorrected distances and substitutions between T. bicornis and the three haplotypes were the same in all three haplotypes, indicating a very close genetic affinity. This is the first report of the occurrence of Theileria species in white and black rhinoceros from Kenya. The three new haplotypes reported here for the first time have important ecological and conservational implications, especially for population management and translocation programs and as a means of avoiding the transport of infected animals into non-affected areas. © 2014 Blackwell Verlag GmbH.

  19. Haplotype-based association analysis of general cognitive ability in Generation Scotland, the English Longitudinal Study of Ageing, and UK Biobank.

    PubMed

    Howard, David M; Adams, Mark J; Clarke, Toni-Kim; Wigmore, Eleanor M; Zeng, Yanni; Hagenaars, Saskia P; Lyall, Donald M; Thomson, Pippa A; Evans, Kathryn L; Porteous, David J; Nagy, Reka; Hayward, Caroline; Haley, Chris S; Smith, Blair H; Murray, Alison D; Batty, G David; Deary, Ian J; McIntosh, Andrew M

    2017-01-01

    Cognitive ability is a heritable trait with a polygenic architecture, for which several associated variants have been identified using genotype-based and candidate gene approaches. Haplotype-based analyses are a complementary technique that take phased genotype data into account, and potentially provide greater statistical power to detect lower frequency variants. In the present analysis, three cohort studies (n total = 48,002) were utilised: Generation Scotland: Scottish Family Health Study (GS:SFHS), the English Longitudinal Study of Ageing (ELSA), and the UK Biobank. A genome-wide haplotype-based meta-analysis of cognitive ability was performed, as well as a targeted meta-analysis of several gene coding regions. None of the assessed haplotypes provided evidence of a statistically significant association with cognitive ability in either the individual cohorts or the meta-analysis. Within the meta-analysis, the haplotype with the lowest observed P -value overlapped with the D-amino acid oxidase activator ( DAOA ) gene coding region. This coding region has previously been associated with bipolar disorder, schizophrenia and Alzheimer's disease, which have all been shown to impact upon cognitive ability. Another potentially interesting region highlighted within the current genome-wide association analysis (GS:SFHS: P = 4.09 x 10 -7 ), was the butyrylcholinesterase ( BCHE ) gene coding region. The protein encoded by BCHE has been shown to influence the progression of Alzheimer's disease and its role in cognitive ability merits further investigation. Although no evidence was found for any haplotypes with a statistically significant association with cognitive ability, our results did provide further evidence that the genetic variants contributing to the variance of cognitive ability are likely to be of small effect.

  20. Global Identification of Genes Affecting Iron-Sulfur Cluster Biogenesis and Iron Homeostasis

    PubMed Central

    Hidese, Ryota; Kurihara, Tatsuo; Esaki, Nobuyoshi

    2014-01-01

    Iron-sulfur (Fe-S) clusters are ubiquitous cofactors that are crucial for many physiological processes in all organisms. In Escherichia coli, assembly of Fe-S clusters depends on the activity of the iron-sulfur cluster (ISC) assembly and sulfur mobilization (SUF) apparatus. However, the underlying molecular mechanisms and the mechanisms that control Fe-S cluster biogenesis and iron homeostasis are still poorly defined. In this study, we performed a global screen to identify the factors affecting Fe-S cluster biogenesis and iron homeostasis using the Keio collection, which is a library of 3,815 single-gene E. coli knockout mutants. The approach was based on radiolabeling of the cells with [2-14C]dihydrouracil, which entirely depends on the activity of an Fe-S enzyme, dihydropyrimidine dehydrogenase. We identified 49 genes affecting Fe-S cluster biogenesis and/or iron homeostasis, including 23 genes important only under microaerobic/anaerobic conditions. This study defines key proteins associated with Fe-S cluster biogenesis and iron homeostasis, which will aid further understanding of the cellular mechanisms that coordinate the processes. In addition, we applied the [2-14C]dihydrouracil-labeling method to analyze the role of amino acid residues of an Fe-S cluster assembly scaffold (IscU) as a model of the Fe-S cluster assembly apparatus. The analysis showed that Cys37, Cys63, His105, and Cys106 are essential for the function of IscU in vivo, demonstrating the potential of the method to investigate in vivo function of proteins involved in Fe-S cluster assembly. PMID:24415728

  1. Effects of IL-10 haplotype and atomic bomb radiation exposure on gastric cancer risk.

    PubMed

    Hayashi, Tomonori; Ito, Reiko; Cologne, John; Maki, Mayumi; Morishita, Yukari; Nagamura, Hiroko; Sasaki, Keiko; Hayashi, Ikue; Imai, Kazue; Yoshida, Kengo; Kajimura, Junko; Kyoizumi, Seishi; Kusunoki, Yoichiro; Ohishi, Waka; Fujiwara, Saeko; Akahoshi, Masazumi; Nakachi, Kei

    2013-07-01

    Gastric cancer (GC) is one of the cancers that reveal increased risk of mortality and incidence in atomic bomb survivors. The incidence of gastric cancer in the Life Span Study cohort of the Radiation Effects Research Foundation (RERF) increased with radiation dose (gender-averaged excess relative risk per Gy = 0.28) and remains high more than 65 years after exposure. To assess a possible role of gene-environment interaction, we examined the dose response for gastric cancer incidence based on immunosuppression-related IL-10 genotype, in a cohort study with 200 cancer cases (93 intestinal, 96 diffuse and 11 other types) among 4,690 atomic bomb survivors participating in an immunological substudy. Using a single haplotype block composed of four haplotype-tagging SNPs (comprising the major haplotype allele IL-10-ATTA and the minor haplotype allele IL-10-GGCG, which are categorized by IL-10 polymorphisms at -819A>G and -592T>G, +1177T>C and +1589A>G), multiplicative and additive models for joint effects of radiation and this IL-10 haplotyping were examined. The IL-10 minor haplotype allele(s) was a risk factor for intestinal type gastric cancer but not for diffuse type gastric cancer. Radiation was not associated with intestinal type gastric cancer. In diffuse type gastric cancer, the haplotype-specific excess relative risk (ERR) for radiation was statistically significant only in the major homozygote category of IL-10 (ERR = 0.46/Gy, P = 0.037), whereas estimated ERR for radiation with the minor IL-10 homozygotes was close to 0 and nonsignificant. Thus, the minor IL-10 haplotype might act to reduce the radiation related risk of diffuse-type gastric cancer. The results suggest that this IL-10 haplotyping might be involved in development of radiation-associated gastric cancer of the diffuse type, and that IL-10 haplotypes may explain individual differences in the radiation-related risk of gastric cancer. © 2013 by Radiation Research Society

  2. Haplotypes of the IL10 Gene as Potential Protection Factors in Leprosy Patients

    PubMed Central

    Garcia, Patricia; Alencar, Dayse; Pinto, Pablo; Santos, Ney; Salgado, Claudio; Sortica, Vinicius A.; Hutz, Mara H.; Santos, Sidney

    2013-01-01

    Leprosy is an infectious disease caused by Mycobacterium leprae characterized by dermatoneurological signs and symptoms that has a large number of new cases worldwide. Several studies have associated interleukin 10 with susceptibility/resistance to several diseases. We investigated haplotypes formed by three single nucleotide polymorphisms (SNPs) located in the IL10 gene (A-1082G, C-819T, and C-592A) in order to better understand the susceptibility to and severity of leprosy in an admixed northern Brazil population, taking into account estimates of interethnic admixture. We observed the genotypes ACC/ACC (P = 0.021, odds ratio [OR] [95% confidence interval (CI)] = 0.290 [0.085 to 0823]) and ACC/GCC (P = 0.003, OR [95% CI] = 0.220 [0.504 to 0.040]) presenting significant results for protection against leprosy development, framed in the profiles of low and medium interleukin production, respectively. Therefore, we suggest that genotypes A-1082G, C-819T, and C-592A formed by interleukin-10 polymorphisms are closely related to protection of the leprosy development in an admixed northern Brazil population, in particular ACC/ACC and ACC/GCC genotypes. PMID:23966553

  3. Genomics-driven discovery of the pneumocandin biosynthetic gene cluster in the fungus Glarea lozoyensis

    PubMed Central

    2013-01-01

    Background The antifungal therapy caspofungin is a semi-synthetic derivative of pneumocandin B0, a lipohexapeptide produced by the fungus Glarea lozoyensis, and was the first member of the echinocandin class approved for human therapy. The nonribosomal peptide synthetase (NRPS)-polyketide synthases (PKS) gene cluster responsible for pneumocandin biosynthesis from G. lozoyensis has not been elucidated to date. In this study, we report the elucidation of the pneumocandin biosynthetic gene cluster by whole genome sequencing of the G. lozoyensis wild-type strain ATCC 20868. Results The pneumocandin biosynthetic gene cluster contains a NRPS (GLNRPS4) and a PKS (GLPKS4) arranged in tandem, two cytochrome P450 monooxygenases, seven other modifying enzymes, and genes for L-homotyrosine biosynthesis, a component of the peptide core. Thus, the pneumocandin biosynthetic gene cluster is significantly more autonomous and organized than that of the recently characterized echinocandin B gene cluster. Disruption mutants of GLNRPS4 and GLPKS4 no longer produced the pneumocandins (A0 and B0), and the Δglnrps4 and Δglpks4 mutants lost antifungal activity against the human pathogenic fungus Candida albicans. In addition to pneumocandins, the G. lozoyensis genome encodes a rich repertoire of natural product-encoding genes including 24 PKSs, six NRPSs, five PKS-NRPS hybrids, two dimethylallyl tryptophan synthases, and 14 terpene synthases. Conclusions Characterization of the gene cluster provides a blueprint for engineering new pneumocandin derivatives with improved pharmacological properties. Whole genome estimation of the secondary metabolite-encoding genes from G. lozoyensis provides yet another example of the huge potential for drug discovery from natural products from the fungal kingdom. PMID:23688303

  4. Tryptophan Hydroxylase 2 haplotype association with borderline personality disorder and aggression in a sample of patients with personality disorders and healthy controls

    PubMed Central

    Perez-Rodriguez, M. Mercedes; Weinstein, Shauna; New, Antonia S.; Bevilacqua, Laura; Yuan, Qiaoping; Zhou, Zhifeng; Hodgkinson, Colin; Goodman, Marianne; Koenigsberg, Harold W.; Goldman, David; Siever, Larry J.

    2010-01-01

    Background There is decreased serotonergic function in impulsive aggression and borderline personality disorder (BPD), and genetic association studies suggest a role of serotonergic genes in impulsive aggression and BPD. Only one study has analyzed the association between the tryptophan-hydroxylase 2 (TPH2) gene and BPD. A TPH2 “risk” haplotype has been described that is associated with anxiety, depression and suicidal behavior. Methods We assessed the relationship between the previously identified “risk” haplotype at the TPH2 locus and BPD diagnosis, impulsive aggression, affective lability, and suicidal/parasuicidal behaviors, in a well-characterized clinical sample of 103 healthy controls (HCs) and 251 patients with personality disorders (109 with BPD). A logistic regression including measures of depression, affective lability and aggression scores in predicting “risk” haplotype was conducted. Results The prevalence of the “risk” haplotype was significantly higher in patients with BPD compared to HCs. Those with the “risk” haplotype have higher aggression and affect lability scores and more suicidal/parasuicidal behaviors than those without it. In the logistic regression model, affect lability was the only significant predictor and it correctly classified 83.1% of the subjects as “risk” or “non-risk” haplotype carriers. Conclusions We found an association between the previously described TPH2 “risk” haplotype and BPD diagnosis, affective lability, suicidal/parasuicidal behavior, and aggression scores. PMID:20451217

  5. Haplotyping for disease association: a combinatorial approach.

    PubMed

    Lancia, Giuseppe; Ravi, R; Rizzi, Romeo

    2008-01-01

    We consider a combinatorial problem derived from haplotyping a population with respect to a genetic disease, either recessive or dominant. Given a set of individuals, partitioned into healthy and diseased, and the corresponding sets of genotypes, we want to infer "bad'' and "good'' haplotypes to account for these genotypes and for the disease. Assume e.g. the disease is recessive. Then, the resolving haplotypes must consist of bad and good haplotypes, so that (i) each genotype belonging to a diseased individual is explained by a pair of bad haplotypes and (ii) each genotype belonging to a healthy individual is explained by a pair of haplotypes of which at least one is good. We prove that the associated decision problem is NP-complete. However, we also prove that there is a simple solution, provided the data satisfy a very weak requirement.

  6. A Nomadic Subtelomeric Disease Resistance Gene Cluster in Common Bean1[W

    PubMed Central

    David, Perrine; Chen, Nicolas W.G.; Pedrosa-Harand, Andrea; Thareau, Vincent; Sévignac, Mireille; Cannon, Steven B.; Debouck, Daniel; Langin, Thierry; Geffroy, Valérie

    2009-01-01

    The B4 resistance (R) gene cluster is one of the largest clusters known in common bean (Phaseolus vulgaris [Pv]). It is located in a peculiar genomic environment in the subtelomeric region of the short arm of chromosome 4, adjacent to two heterochromatic blocks (knobs). We sequenced 650 kb spanning this locus and annotated 97 genes, 26 of which correspond to Coiled-Coil-Nucleotide-Binding-Site-Leucine-Rich-Repeat (CNL). Conserved microsynteny was observed between the Pv B4 locus and corresponding regions of Medicago truncatula and Lotus japonicus in chromosomes Mt6 and Lj2, respectively. The notable exception was the CNL sequences, which were completely absent in these regions. The origin of the Pv B4-CNL sequences was investigated through phylogenetic analysis, which reveals that, in the Pv genome, paralogous CNL genes are shared among nonhomologous chromosomes (4 and 11). Together, our results suggest that Pv B4-CNL was derived from CNL sequences from another cluster, the Co-2 cluster, through an ectopic recombination event. Integration of the soybean (Glycine max) genome data enables us to date more precisely this event and also to infer that a single CNL moved from the Co-2 to the B4 cluster. Moreover, we identified a new 528-bp satellite repeat, referred to as khipu, specific to the Phaseolus genus, present both between B4-CNL sequences and in the two knobs identified at the B4 R gene cluster. The khipu repeat is present on most chromosomal termini, indicating the existence of frequent ectopic recombination events in Pv subtelomeric regions. Our results highlight the importance of ectopic recombination in R gene evolution. PMID:19776165

  7. Evolutionary conservation of regulatory elements in vertebrate HOX gene clusters

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Santini, Simona; Boore, Jeffrey L.; Meyer, Axel

    2003-12-31

    Due to their high degree of conservation, comparisons of DNA sequences among evolutionarily distantly-related genomes permit to identify functional regions in noncoding DNA. Hox genes are optimal candidate sequences for comparative genome analyses, because they are extremely conserved in vertebrates and occur in clusters. We aligned (Pipmaker) the nucleotide sequences of HoxA clusters of tilapia, pufferfish, striped bass, zebrafish, horn shark, human and mouse (over 500 million years of evolutionary distance). We identified several highly conserved intergenic sequences, likely to be important in gene regulation. Only a few of these putative regulatory elements have been previously described as being involvedmore » in the regulation of Hox genes, while several others are new elements that might have regulatory functions. The majority of these newly identified putative regulatory elements contain short fragments that are almost completely conserved and are identical to known binding sites for regulatory proteins (Transfac). The conserved intergenic regions located between the most rostrally expressed genes in the developing embryo are longer and better retained through evolution. We document that presumed regulatory sequences are retained differentially in either A or A clusters resulting from a genome duplication in the fish lineage. This observation supports both the hypothesis that the conserved elements are involved in gene regulation and the Duplication-Deletion-Complementation model.« less

  8. Improved efficiency in amplification of Escherichia coli o-antigen gene clusters using genome-wide sequence comparison

    USDA-ARS?s Scientific Manuscript database

    Background: In many bacteria including E. coli, genes encoding O-antigens are clustered in the chromosome, with a 39-bp JUMPstart sequence and gnd gene located upstream and downstream of the cluster, respectively. For determining the DNA sequence of the E. coli O-antigen gene cluster, one set of P...

  9. Global selection on sucrose synthase haplotypes during a century of wheat breeding.

    PubMed

    Hou, Jian; Jiang, Qiyan; Hao, Chenyang; Wang, Yuquan; Zhang, Hongna; Zhang, Xueyong

    2014-04-01

    Spike number per unit area, number of grains per spike, and thousand kernel weight (TKW) are important yield components. In China, increases in wheat (Triticum aestivum) yields are mainly due to increases in grain number per spike and TKW. TKW mainly depends on starch content, as starch accounts for about 70% of the grain endosperm. Sucrose synthase catalysis is the first step in the conversion of sucrose to starch, that is, the conversion of sucrose to fructose and UDP-glucose by the wheat sucrose synthase genes (TaSus1 and TaSus2) that are located on chromosomes 7A/7B/7D and 2A/2B/2D, respectively. A total of 1,520 wheat accessions were genotyped at the six loci. Two, two, five, and two haplotypes were identified at the TaSus2-2A, TaSus2-2B, TaSus1-7A, and TaSus1-7B loci, respectively. Their main variations were detected within the introns. Significant differences between the haplotypes correlated with TKW differences among 348 modern Chinese cultivars from the core collection. Frequency changes for favored haplotypes showed gradual increases in cultivars released since beginning of the last century in China, Europe, and North America. Geographic distributions and time changes of favored haplotypes were characterized in six major wheat production regions worldwide. Strong selection bottlenecks to haplotype variations occurred at polyploidization and domestication and during breeding of wheat. Genetic-effect differences between haplotypes at the same locus influence the selection time and intensity. This work shows that the endosperm starch synthesis pathway is a major target of indirect selection in global wheat breeding for higher yield.

  10. Genome Patterns of Selection and Introgression of Haplotypes in Natural Populations of the House Mouse (Mus musculus)

    PubMed Central

    Staubach, Fabian; Lorenc, Anna; Messer, Philipp W.; Tang, Kun; Petrov, Dmitri A.; Tautz, Diethard

    2012-01-01

    General parameters of selection, such as the frequency and strength of positive selection in natural populations or the role of introgression, are still insufficiently understood. The house mouse (Mus musculus) is a particularly well-suited model system to approach such questions, since it has a defined history of splits into subspecies and populations and since extensive genome information is available. We have used high-density single-nucleotide polymorphism (SNP) typing arrays to assess genomic patterns of positive selection and introgression of alleles in two natural populations of each of the subspecies M. m. domesticus and M. m. musculus. Applying different statistical procedures, we find a large number of regions subject to apparent selective sweeps, indicating frequent positive selection on rare alleles or novel mutations. Genes in the regions include well-studied imprinted loci (e.g. Plagl1/Zac1), homologues of human genes involved in adaptations (e.g. alpha-amylase genes) or in genetic diseases (e.g. Huntingtin and Parkin). Haplotype matching between the two subspecies reveals a large number of haplotypes that show patterns of introgression from specific populations of the respective other subspecies, with at least 10% of the genome being affected by partial or full introgression. Using neutral simulations for comparison, we find that the size and the fraction of introgressed haplotypes are not compatible with a pure migration or incomplete lineage sorting model. Hence, it appears that introgressed haplotypes can rise in frequency due to positive selection and thus can contribute to the adaptive genomic landscape of natural populations. Our data support the notion that natural genomes are subject to complex adaptive processes, including the introgression of haplotypes from other differentiated populations or species at a larger scale than previously assumed for animals. This implies that some of the admixture found in inbred strains of mice may also have

  11. Common PCSK1 haplotypes are associated with obesity in the Chinese population.

    PubMed

    Chang, Yi-Cheng; Chiu, Yen-Feng; Shih, Kuang-Chung; Lin, Ming-Wei; Sheu, Wayne Huey-Herng; Donlon, Timothy; Curb, Jess David; Jou, Yuh-Shan; Chang, Tien-Jyun; Li, Hung-Yuan; Chuang, Lee-Ming

    2010-07-01

    Prohormone convertase subtilisin/kexin type 1 (PCSK1) genetic polymorphisms have recently been associated with obesity in European populations. This study aimed to examine whether common PCSK1 genetic variation is associated with obesity and related metabolic phenotypes in the Chinese population. We genotyped nine common tag single-nucleotide polymorphisms (tagSNP) of the PCSK1 gene in 1,094 subjects of Chinese origin from the Stanford Asia-Pacific Program for Hypertension and Insulin Resistance (SAPPHIRe) family study. One SNP in the PCSK1 gene (rs155971) were nominally associated with risk of obesity in the SAPPHIRe cohort (P = 0.01). A common protective haplotype was associated with reduced risk of obesity (23.79% vs. 32.89%, P = 0.01) and smaller waist circumference (81.71 +/- 10.22 vs. 84.75 +/- 10.48 cm, P = 0.02). Another common haplotype was significantly associated with increased risk of obesity (37.07% vs. 23.84%, P = 0.005). The global P value for haplotype association with obesity was 0.02. We also identified a suggestive association of another PCSK1 SNP (rs3811951) with fasting glucose, fasting insulin, homeostasis model assessment of insulin resistance (HOMA(IR)), triglycerides, and high-density lipoprotein cholesterol (P = 0.05, 0.003, 0.001, 0.04, and 0.04, respectively). These data indicate common PCSK1 genetic variants are associated with obesity in the Chinese population.

  12. Complement factor H gene (CFH) polymorphisms C-257T, G257A and haplotypes are associated with protection against severe dengue phenotype, possible related with high CFH expression.

    PubMed

    Pastor, André F; Rodrigues Moura, Laís; Neto, José W D; Nascimento, Eduardo J M; Calzavara-Silva, Carlos E; Gomes, Ana Lisa V; Silva, Ana Maria da; Cordeiro, Marli T; Braga-Neto, Ulisses; Crovella, Sergio; Gil, Laura H V G; Marques, Ernesto T A; Acioli-Santos, Bartolomeu

    2013-09-01

    Four genetic polymorphisms located at the promoter (C-257T) and coding regions of CFH gene (exon 2 G257A, exon 14 A2089G and exon 19 G2881T) were investigated in 121 dengue patients (DENV-3) in order to assess the relationship between allele/haplotypes variants and clinical outcomes. A statistical value was found between the CFH-257T allele (TT/TC genotypes) and reduced susceptibility to severe dengue (SD). Statistical associations indicate that individuals bearing a T allele presented significantly higher protein levels in plasma. The -257T variant is located within a NF-κB binding site, suggesting that this variant might have effect on the ability of the CFH gene to respond to signals via the NF-κB pathway. The G257A allelic variant showed significant protection against severe dengue. When CFH haplotypes effect was considered, the ancestral CG/CG promoter-exon 2 SNP genotype showed significant risk to SD either in a general comparison (ancestral × all variant genotypes), as well as in individual genotypes comparison (ancestral × each variant genotype), where the most prevalent effect was observed in the CG/CG × CA/TG comparison. These findings support the involvement of -257T, 257A allele variants and haplotypes on severe dengue phenotype protection, related with high basal CFH expression. Copyright © 2013 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.

  13. A ground truth based comparative study on clustering of gene expression data.

    PubMed

    Zhu, Yitan; Wang, Zuyi; Miller, David J; Clarke, Robert; Xuan, Jianhua; Hoffman, Eric P; Wang, Yue

    2008-05-01

    Given the variety of available clustering methods for gene expression data analysis, it is important to develop an appropriate and rigorous validation scheme to assess the performance and limitations of the most widely used clustering algorithms. In this paper, we present a ground truth based comparative study on the functionality, accuracy, and stability of five data clustering methods, namely hierarchical clustering, K-means clustering, self-organizing maps, standard finite normal mixture fitting, and a caBIG toolkit (VIsual Statistical Data Analyzer--VISDA), tested on sample clustering of seven published microarray gene expression datasets and one synthetic dataset. We examined the performance of these algorithms in both data-sufficient and data-insufficient cases using quantitative performance measures, including cluster number detection accuracy and mean and standard deviation of partition accuracy. The experimental results showed that VISDA, an interactive coarse-to-fine maximum likelihood fitting algorithm, is a solid performer on most of the datasets, while K-means clustering and self-organizing maps optimized by the mean squared compactness criterion generally produce more stable solutions than the other methods.

  14. Haplotypes of CYP3A4 and their close linkage with CYP3A5 haplotypes in a Japanese population.

    PubMed

    Fukushima-Uesaka, Hiromi; Saito, Yoshiro; Watanabe, Hidemi; Shiseki, Kisho; Saeki, Mayumi; Nakamura, Takahiro; Kurose, Kouichi; Sai, Kimie; Komamura, Kazuo; Ueno, Kazuyuki; Kamakura, Shiro; Kitakaze, Masafumi; Hanai, Sotaro; Nakajima, Toshiharu; Matsumoto, Kenji; Saito, Hirohisa; Goto, Yu-ichi; Kimura, Hideo; Katoh, Masaaki; Sugai, Kenji; Minami, Narihiro; Shirao, Kuniaki; Tamura, Tomohide; Yamamoto, Noboru; Minami, Hironobu; Ohtsu, Atsushi; Yoshida, Teruhiko; Saijo, Nagahiro; Kitamura, Yutaka; Kamatani, Naoyuki; Ozawa, Shogo; Sawada, Jun-ichi

    2004-01-01

    In order to identify single nucleotide polymorphisms (SNPs) and haplotype frequencies of CYP3A4 in a Japanese population, the distal enhancer and proximal promoter regions, all exons, and the surrounding introns were sequenced from genomic DNA of 416 Japanese subjects. We found 24 SNPs, including 17 novel ones: two in the distal enhancer, four in the proximal promoter, one in the 5'-untranslated region (UTR), seven in the introns, and three in the 3'-UTR. The most common SNP was c.1026+12G>A (IVS10+12G>A), with a 0.249 frequency. Four non-synonymous SNPs, c.554C>G (p.T185S, CYP3A4(*)16), c.830_831insA (p.E277fsX8, (*)6), c.878T>C (p.L293P, (*)18), and c.1088 C>T (p.T363M, (*)11) were found with frequencies of 0.014, 0.001, 0.028, and 0.002, respectively. No SNP was found in the known nuclear transcriptional factor-binding sites in the enhancer and promoter regions. Using these 24 SNPs, 16 haplotypes were unambiguously identified, and nine haplotypes were inferred by aid of an expectation-maximization-based program. In addition, using data from 186 subjects enabled a close linkage to be found between CYP3A4 and CYP3A5 SNPs, especially among the SNPs at c.1026+12 in CYP3A4 and c.219-237 (IVS3-237, a key SNP site for CYP3A5(*)3), c.865+77 (IVS9+77) and c.1523 in CYP3A5. This result suggested that CYP3A4 and CYP3A5 are within the same gene block. Haplotype analysis between CYP3A4 and CYP3A5 revealed several major haplotype combinations in the CYP3A4-CYP3A5 block. Our findings provide fundamental and useful information for genotyping CYP3A4 (and CYP3A5) in the Japanese, and probably Asian populations. Copyright 2003 Wiley-Liss, Inc.

  15. The Chloroplast atpA Gene Cluster in Chlamydomonas reinhardtii1

    PubMed Central

    Drapier, Dominique; Suzuki, Hideki; Levy, Haim; Rimbault, Blandine; Kindle, Karen L.; Stern, David B.; Wollman, Francis-André

    1998-01-01

    Most chloroplast genes in vascular plants are organized into polycistronic transcription units, which generate a complex pattern of mono-, di-, and polycistronic transcripts. In contrast, most Chlamydomonas reinhardtii chloroplast transcripts characterized to date have been monocistronic. This paper describes the atpA gene cluster in the C. reinhardtii chloroplast genome, which includes the atpA, psbI, cemA, and atpH genes, encoding the α-subunit of the coupling-factor-1 (CF1) ATP synthase, a small photosystem II polypeptide, a chloroplast envelope membrane protein, and subunit III of the CF0 ATP synthase, respectively. We show that promoters precede the atpA, psbI, and atpH genes, but not the cemA gene, and that cemA mRNA is present only as part of di-, tri-, or tetracistronic transcripts. Deletions introduced into the gene cluster reveal, first, that CF1-α can be translated from di- or polycistronic transcripts, and, second, that substantial reductions in mRNA quantity have minimal effects on protein synthesis rates. We suggest that posttranscriptional mRNA processing is common in C. reinhardtii chloroplasts, permitting the expression of multiple genes from a single promoter. PMID:9625716

  16. Evolution and dispersal of emmer wheat (Triticum sp.) from novel haplotypes of Ppd-1 (photoperiod response) genes and their surrounding DNA sequences.

    PubMed

    Takenaka, Shotaro; Kawahara, Taihachi

    2012-09-01

    The sequence data from 5' UTR, intronic, coding and 3' UTR regions of Ppd-A1 and Ppd-B1 were investigated for a total of 158 accessions of emmer wheat landraces comprising 19 of wild emmer wheat (Triticum dicoccoides), 45 of hulled emmer wheat (T. dicoccum) and 94 of free-threshing (FT) emmer wheat (T. durum etc.). We detected some novel types of deletions in the coding regions from 22 hulled emmer accessions and 20 FT emmer accessions. Emmer wheat accessions with these deletions could produce predicted proteins likely to lack function. We also observed some novel mutations in Ppd-B1. Sixty-seven and forty-one haplotypes were found in Ppd-A1 and Ppd-B1, respectively. Some mutations found in this study have not been known, so they have potential for useful genetic resources for wheat breeding. On the basis of sequence data from the 5' UTR region, both Ppd-A1 and Ppd-B1 haplotypes were divided into two groups (Type AI/AII and Type BI/BII). Types AI and AII of Ppd-A1 suggested gene flow between wild and hulled emmer. On the other hand, Types BI and BII of Ppd-B1 suggested gene flow between wild and FT emmer. More than half of hulled emmer accessions were Type AII/BI but few FT emmer accessions were of this type. Therefore, over half of the hulled emmer did not contribute to evolution of FT emmer.

  17. Resistance gene candidates identified by PCR with degenerate oligonucleotide primers map to clusters of resistance genes in lettuce.

    PubMed

    Shen, K A; Meyers, B C; Islam-Faridi, M N; Chin, D B; Stelly, D M; Michelmore, R W

    1998-08-01

    The recent cloning of genes for resistance against diverse pathogens from a variety of plants has revealed that many share conserved sequence motifs. This provides the possibility of isolating numerous additional resistance genes by polymerase chain reaction (PCR) with degenerate oligonucleotide primers. We amplified resistance gene candidates (RGCs) from lettuce with multiple combinations of primers with low degeneracy designed from motifs in the nucleotide binding sites (NBSs) of RPS2 of Arabidopsis thaliana and N of tobacco. Genomic DNA, cDNA, and bacterial artificial chromosome (BAC) clones were successfully used as templates. Four families of sequences were identified that had the same similarity to each other as to resistance genes from other species. The relationship of the amplified products to resistance genes was evaluated by several sequence and genetic criteria. The amplified products contained open reading frames with additional sequences characteristic of NBSs. Hybridization of RGCs to genomic DNA and to BAC clones revealed large numbers of related sequences. Genetic analysis demonstrated the existence of clustered multigene families for each of the four RGC sequences. This parallels classical genetic data on clustering of disease resistance genes. Two of the four families mapped to known clusters of resistance genes; these two families were therefore studied in greater detail. Additional evidence that these RGCs could be resistance genes was gained by the identification of leucine-rich repeat (LRR) regions in sequences adjoining the NBS similar to those in RPM1 and RPS2 of A. thaliana. Fluorescent in situ hybridization confirmed the clustered genomic distribution of these sequences. The use of PCR with degenerate oligonucleotide primers is therefore an efficient method to identify numerous RGCs in plants.

  18. Two Gene Clusters Coordinate Galactose and Lactose Metabolism in Streptococcus gordonii

    PubMed Central

    Zeng, Lin; Martino, Nicole C.

    2012-01-01

    Streptococcus gordonii is an early colonizer of the human oral cavity and an abundant constituent of oral biofilms. Two tandemly arranged gene clusters, designated lac and gal, were identified in the S. gordonii DL1 genome, which encode genes of the tagatose pathway (lacABCD) and sugar phosphotransferase system (PTS) enzyme II permeases. Genes encoding a predicted phospho-β-galactosidase (LacG), a DeoR family transcriptional regulator (LacR), and a transcriptional antiterminator (LacT) were also present in the clusters. Growth and PTS assays supported that the permease designated EIILac transports lactose and galactose, whereas EIIGal transports galactose. The expression of the gene for EIIGal was markedly upregulated in cells growing on galactose. Using promoter-cat fusions, a role for LacR in the regulation of the expressions of both gene clusters was demonstrated, and the gal cluster was also shown to be sensitive to repression by CcpA. The deletion of lacT caused an inability to grow on lactose, apparently because of its role in the regulation of the expression of the genes for EIILac, but had little effect on galactose utilization. S. gordonii maintained a selective advantage over Streptococcus mutans in a mixed-species competition assay, associated with its possession of a high-affinity galactose PTS, although S. mutans could persist better at low pHs. Collectively, these results support the concept that the galactose and lactose systems of S. gordonii are subject to complex regulation and that a high-affinity galactose PTS may be advantageous when S. gordonii is competing against the caries pathogen S. mutans in oral biofilms. PMID:22660715

  19. The evolution of resistance genes in multi-protein plant resistance systems.

    PubMed

    Friedman, Aaron R; Baker, Barbara J

    2007-12-01

    The genomic perspective aids in integrating the analysis of single resistance (R-) genes into a higher order model of complex plant resistance systems. The majority of R-genes encode a class of proteins with nucleotide binding (NB) and leucine-rich repeat (LRR) domains. Several R-proteins act in multi-protein R-complexes that mediate interaction with pathogen effectors to induce resistance signaling. The complexity of these systems seems to have resulted from multiple rounds of plant-pathogen co-evolution. R-gene evolution is thought to be facilitated by the formation of R-gene clusters, which permit sequence exchanges via recombinatorial mispairing and generate high haplotypic diversity. This pattern of evolution may also generate diversity at other loci that contribute to the R-complex. The rate of recombination at R-clusters is not necessarily homogeneous or consistent over evolutionary time: recent evidence suggests that recombination at R-clusters is increased following pathogen infection, suggesting a mechanism that induces temporary genome instability in response to extreme stress. DNA methylation and chromatin modifications may allow this instability to be conditionally regulated and targeted to specific genome regions. Knowledge of natural R-gene evolution may contribute to strategies for artificial evolution of novel resistance specificities.

  20. Functional and genetic analysis of haplotypic sequence variation at the nicastrin genomic locus

    PubMed Central

    Hamilton, Gillian; Killick, Richard; Lambert, Jean-Charles; Amouyel, Philippe; Carrasquillo, Minerva M.; Pankratz, V. Shane; Graff-Radford, Neill R.; Dickson, Dennis W.; Petersen, Ronald C.; Younkin, Steven G.; Powell, John F.; Wade-Martins, Richard

    2013-01-01

    Nicastrin (NCSTN) is a component of the γ-secretase complex and therefore potentially a candidate risk gene for Alzheimer's disease. Here, we have developed a novel functional genomics methodology to express common locus haplotypes to assess functional differences. DNA recombination was used to engineer 5 bacterial artificial chromosomes (BACs) to each express a different haplotype of the NCSTN locus. Each NCSTN-BAC was delivered to knockout nicastrin (Ncstn−/−) cells and clonal NCSTN-BAC+/Ncstn−/− cell lines were created for functional analyses. We showed that all NCSTN-BAC haplotypes expressed nicastrin protein and rescued γ-secretase activity and amyloid beta (Aβ) production in NCSTN-BAC+/Ncstn−/− lines. We then showed that genetic variation at the NCSTN locus affected alternative splicing in human postmortem brain tissue. However, there was no robust functional difference between clonal cell lines rescued by each of the 5 different haplotypes. Finally, there was no statistically significant association of NCSTN with disease risk in the 4 cohorts. We therefore conclude that it is unlikely that common variation at the NCSTN locus is a risk factor for Alzheimer's disease. PMID:22405046

  1. The Genetic and Molecular Organization of the Dopa Decarboxylase Gene Cluster of Drosophila Melanogaster

    PubMed Central

    Stathakis, D. G.; Pentz, E. S.; Freeman, M. E.; Kullman, J.; Hankins, G. R.; Pearlson, N. J.; Wright, TRF.

    1995-01-01

    We report the complete molecular organization of the Dopa decarboxylase gene cluster. Mutagenesis screens recovered 77 new Df(2L)TW130 recessive lethal mutations. These new alleles combined with 263 previously isolated mutations in the cluster to define 18 essential genes. In addition, seven new deficiencies were isolated and characterized. Deficiency mapping, restriction fragment length polymorphism (RFLP) analysis and P-element-mediated germline transformation experiments determined the gene order for all 18 loci. Genomic and cDNA restriction endonuclease mapping, Northern blot analysis and DNA sequencing provided information on exact gene location, mRNA size and transcriptional direction for most of these loci. In addition, this analysis identified two transcription units that had not previously been identified by extensive mutagenesis screening. Most of the loci are contained within two dense subclusters. We discuss the effectiveness of mutagens and strategies used in our screens, the variable mutability of loci within the genome of Drosophila melanogaster, the cytological and molecular organization of the Ddc gene cluster, the validity of the one band-one gene hypothesis and a possible purpose for the clustering of genes in the Ddc region. PMID:8647399

  2. Sequencing rare marine actinomycete genomes reveals high density of unique natural product biosynthetic gene clusters.

    PubMed

    Schorn, Michelle A; Alanjary, Mohammad M; Aguinaldo, Kristen; Korobeynikov, Anton; Podell, Sheila; Patin, Nastassia; Lincecum, Tommie; Jensen, Paul R; Ziemert, Nadine; Moore, Bradley S

    2016-12-01

    Traditional natural product discovery methods have nearly exhausted the accessible diversity of microbial chemicals, making new sources and techniques paramount in the search for new molecules. Marine actinomycete bacteria have recently come into the spotlight as fruitful producers of structurally diverse secondary metabolites, and remain relatively untapped. In this study, we sequenced 21 marine-derived actinomycete strains, rarely studied for their secondary metabolite potential and under-represented in current genomic databases. We found that genome size and phylogeny were good predictors of biosynthetic gene cluster diversity, with larger genomes rivalling the well-known marine producers in the Streptomyces and Salinispora genera. Genomes in the Micrococcineae suborder, however, had consistently the lowest number of biosynthetic gene clusters. By networking individual gene clusters into gene cluster families, we were able to computationally estimate the degree of novelty each genus contributed to the current sequence databases. Based on the similarity measures between all actinobacteria in the Joint Genome Institute's Atlas of Biosynthetic gene Clusters database, rare marine genera show a high degree of novelty and diversity, with Corynebacterium, Gordonia, Nocardiopsis, Saccharomonospora and Pseudonocardia genera representing the highest gene cluster diversity. This research validates that rare marine actinomycetes are important candidates for exploration, as they are relatively unstudied, and their relatives are historically rich in secondary metabolites.

  3. Sequencing rare marine actinomycete genomes reveals high density of unique natural product biosynthetic gene clusters

    PubMed Central

    Schorn, Michelle A.; Alanjary, Mohammad M.; Aguinaldo, Kristen; Korobeynikov, Anton; Podell, Sheila; Patin, Nastassia; Lincecum, Tommie; Jensen, Paul R.; Ziemert, Nadine

    2016-01-01

    Traditional natural product discovery methods have nearly exhausted the accessible diversity of microbial chemicals, making new sources and techniques paramount in the search for new molecules. Marine actinomycete bacteria have recently come into the spotlight as fruitful producers of structurally diverse secondary metabolites, and remain relatively untapped. In this study, we sequenced 21 marine-derived actinomycete strains, rarely studied for their secondary metabolite potential and under-represented in current genomic databases. We found that genome size and phylogeny were good predictors of biosynthetic gene cluster diversity, with larger genomes rivalling the well-known marine producers in the Streptomyces and Salinispora genera. Genomes in the Micrococcineae suborder, however, had consistently the lowest number of biosynthetic gene clusters. By networking individual gene clusters into gene cluster families, we were able to computationally estimate the degree of novelty each genus contributed to the current sequence databases. Based on the similarity measures between all actinobacteria in the Joint Genome Institute's Atlas of Biosynthetic gene Clusters database, rare marine genera show a high degree of novelty and diversity, with Corynebacterium, Gordonia, Nocardiopsis, Saccharomonospora and Pseudonocardia genera representing the highest gene cluster diversity. This research validates that rare marine actinomycetes are important candidates for exploration, as they are relatively unstudied, and their relatives are historically rich in secondary metabolites. PMID:27902408

  4. A Putative Gene Cluster from a Lyngbya wollei Bloom that Encodes Paralytic Shellfish Toxin Biosynthesis

    PubMed Central

    Mihali, Troco K.; Carmichael, Wayne W.; Neilan, Brett A.

    2011-01-01

    Saxitoxin and its analogs cause the paralytic shellfish-poisoning syndrome, adversely affecting human health and coastal shellfish industries worldwide. Here we report the isolation, sequencing, annotation, and predicted pathway of the saxitoxin biosynthetic gene cluster in the cyanobacterium Lyngbya wollei. The gene cluster spans 36 kb and encodes enzymes for the biosynthesis and export of the toxins. The Lyngbya wollei saxitoxin gene cluster differs from previously identified saxitoxin clusters as it contains genes that are unique to this cluster, whereby the carbamoyltransferase is truncated and replaced by an acyltransferase, explaining the unique toxin profile presented by Lyngbya wollei. These findings will enable the creation of toxin probes, for water monitoring purposes, as well as proof-of-concept for the combinatorial biosynthesis of these natural occurring alkaloids for the production of novel, biologically active compounds. PMID:21347365

  5. Statistical indicators of collective behavior and functional clusters in gene networks of yeast

    NASA Astrophysics Data System (ADS)

    Živković, J.; Tadić, B.; Wick, N.; Thurner, S.

    2006-03-01

    We analyze gene expression time-series data of yeast (S. cerevisiae) measured along two full cell-cycles. We quantify these data by using q-exponentials, gene expression ranking and a temporal mean-variance analysis. We construct gene interaction networks based on correlation coefficients and study the formation of the corresponding giant components and minimum spanning trees. By coloring genes according to their cell function we find functional clusters in the correlation networks and functional branches in the associated trees. Our results suggest that a percolation point of functional clusters can be identified on these gene expression correlation networks.

  6. Genome-wide identification of physically clustered genes suggests chromatin-level co-regulation in male reproductive development in Arabidopsis thaliana

    PubMed Central

    Reimegård, Johan; Kundu, Snehangshu; Pendle, Ali; Irish, Vivian F.; Shaw, Peter

    2017-01-01

    Abstract Co-expression of physically linked genes occurs surprisingly frequently in eukaryotes. Such chromosomal clustering may confer a selective advantage as it enables coordinated gene regulation at the chromatin level. We studied the chromosomal organization of genes involved in male reproductive development in Arabidopsis thaliana. We developed an in-silico tool to identify physical clusters of co-regulated genes from gene expression data. We identified 17 clusters (96 genes) involved in stamen development and acting downstream of the transcriptional activator MS1 (MALE STERILITY 1), which contains a PHD domain associated with chromatin re-organization. The clusters exhibited little gene homology or promoter element similarity, and largely overlapped with reported repressive histone marks. Experiments on a subset of the clusters suggested a link between expression activation and chromatin conformation: qRT-PCR and mRNA in situ hybridization showed that the clustered genes were up-regulated within 48 h after MS1 induction; out of 14 chromatin-remodeling mutants studied, expression of clustered genes was consistently down-regulated only in hta9/hta11, previously associated with metabolic cluster activation; DNA fluorescence in situ hybridization confirmed that transcriptional activation of the clustered genes was correlated with open chromatin conformation. Stamen development thus appears to involve transcriptional activation of physically clustered genes through chromatin de-condensation. PMID:28175342

  7. Identifying conserved gene clusters in the presence of homology families.

    PubMed

    He, Xin; Goldwasser, Michael H

    2005-01-01

    The study of conserved gene clusters is important for understanding the forces behind genome organization and evolution, as well as the function of individual genes or gene groups. In this paper, we present a new model and algorithm for identifying conserved gene clusters from pairwise genome comparison. This generalizes a recent model called "gene teams." A gene team is a set of genes that appear homologously in two or more species, possibly in a different order yet with the distance of adjacent genes in the team for each chromosome always no more than a certain threshold. We remove the constraint in the original model that each gene must have a unique occurrence in each chromosome and thus allow the analysis on complex prokaryotic or eukaryotic genomes with extensive paralogs. Our algorithm analyzes a pair of chromosomes in O(mn) time and uses O(m+n) space, where m and n are the number of genes in the respective chromosomes. We demonstrate the utility of our methods by studying two bacterial genomes, E. coli K-12 and B. subtilis. Many of the teams identified by our algorithm correlate with documented E. coli operons, while several others match predicted operons, previously suggested by computational techniques. Our implementation and data are publicly available at euler.slu.edu/ approximately goldwasser/homologyteams/.

  8. Detecting structure of haplotypes and local ancestry

    USDA-ARS?s Scientific Manuscript database

    We present a two-layer hidden Markov model to detect the structure of haplotypes for unrelated individuals. This allows us to model two scales of linkage disequilibrium (one within a group of haplotypes and one between groups), thereby taking advantage of rich haplotype information to infer local an...

  9. Extended Islands of Tractability for Parsimony Haplotyping

    NASA Astrophysics Data System (ADS)

    Fleischer, Rudolf; Guo, Jiong; Niedermeier, Rolf; Uhlmann, Johannes; Wang, Yihui; Weller, Mathias; Wu, Xi

    Parsimony haplotyping is the problem of finding a smallest size set of haplotypes that can explain a given set of genotypes. The problem is NP-hard, and many heuristic and approximation algorithms as well as polynomial-time solvable special cases have been discovered. We propose improved fixed-parameter tractability results with respect to the parameter "size of the target haplotype set" k by presenting an O *(k 4k )-time algorithm. This also applies to the practically important constrained case, where we can only use haplotypes from a given set. Furthermore, we show that the problem becomes polynomial-time solvable if the given set of genotypes is complete, i.e., contains all possible genotypes that can be explained by the set of haplotypes.

  10. Molecular analysis of SCARECROW genes expressed in white lupin cluster roots

    PubMed Central

    Sbabou, Laila; Bucciarelli, Bruna; Miller, Susan; Liu, Junqi; Berhada, Fatiha; Filali-Maltouf, Abdelkarim; Allan, Deborah; Vance, Carroll

    2010-01-01

    The Scarecrow (SCR) transcription factor plays a crucial role in root cell radial patterning and is required for maintenance of the quiescent centre and differentiation of the endodermis. In response to phosphorus (P) deficiency, white lupin (Lupinus albus L.) root surface area increases some 50-fold to 70-fold due to the development of cluster (proteoid) roots. Previously it was reported that SCR-like expressed sequence tags (ESTs) were expressed during early cluster root development. Here the cloning of two white lupin SCR genes, LaSCR1 and LaSCR2, is reported. The predicted amino acid sequences of both LaSCR gene products are highly similar to AtSCR and contain C-terminal conserved GRAS family domains. LaSCR1 and LaSCR2 transcript accumulation localized to the endodermis of both normal and cluster roots as shown by in situ hybridization and gene promoter::reporter staining. Transcript analysis as evaluated by quantitative real-time-PCR (qRT-PCR) and RNA gel hybridization indicated that the two LaSCR genes are expressed predominantly in roots. Expression of LaSCR genes was not directly responsive to the P status of the plant but was a function of cluster root development. Suppression of LaSCR1 in transformed roots of lupin and Medicago via RNAi (RNA interference) delivered through Agrobacterium rhizogenes resulted in decreased root numbers, reflecting the potential role of LaSCR1 in maintaining root growth in these species. The results suggest that the functional orthologues of AtSCR have been characterized. PMID:20167612

  11. A homeotic gene cluster patterns the anteroposterior body axis of C. elegans.

    PubMed

    Wang, B B; Müller-Immergluck, M M; Austin, J; Robinson, N T; Chisholm, A; Kenyon, C

    1993-07-16

    In insects and vertebrates, clusters of Antennapedia class homeobox (HOM-C) genes specify anteroposterior body pattern. The nematode C. elegans also contains a small cluster of HOM-C genes, one of which has been shown to specify positional identity. Here we show that two additional C. elegans HOM-C genes also specify positional identity and that together these three HOM-C genes function along the anteroposterior axis in the same order as their homologs in other organisms. Thus, HOM-C-based pattern formation has been conserved in nematodes despite the many differences in morphology and embryology that distinguish them from other phyla. Each C. elegans HOM-C gene is responsible for a distinct body region; however, where their domains overlap, two HOM-C genes can act together to specify the fates of individual cells.

  12. Clustering gene expression data based on predicted differential effects of GV interaction.

    PubMed

    Pan, Hai-Yan; Zhu, Jun; Han, Dan-Fu

    2005-02-01

    Microarray has become a popular biotechnology in biological and medical research. However, systematic and stochastic variabilities in microarray data are expected and unavoidable, resulting in the problem that the raw measurements have inherent "noise" within microarray experiments. Currently, logarithmic ratios are usually analyzed by various clustering methods directly, which may introduce bias interpretation in identifying groups of genes or samples. In this paper, a statistical method based on mixed model approaches was proposed for microarray data cluster analysis. The underlying rationale of this method is to partition the observed total gene expression level into various variations caused by different factors using an ANOVA model, and to predict the differential effects of GV (gene by variety) interaction using the adjusted unbiased prediction (AUP) method. The predicted GV interaction effects can then be used as the inputs of cluster analysis. We illustrated the application of our method with a gene expression dataset and elucidated the utility of our approach using an external validation.

  13. APOC3/A5 haplotypes, lipid levels, and risk of myocardial infarction in the Central Valley of Costa Rica.

    PubMed

    Ruiz-Narváez, Edward A; Yang, Yadong; Nakanishi, Yukiko; Kirchdorfer, Jill; Campos, Hannia

    2005-12-01

    Genetic variation in the APOC3 and APOA5 genes has been associated with plasma triglyceride concentrations and may affect the risk of myocardial infarction (MI). To assess whether APOC3/A5 haplotypes are associated with risk of MI, we examined three single-nucleotide polymorphisms (SNPs) in APOC3 (3238C>G, -455T>C, and -482C>T) and six SNPs in the APOA5 gene (-1131T>C, c.-3A>G, c.56C>G, IVS3+476G>A, c.553G>T, and c.1259T>C) in incident cases (n = 1,703) of a first nonfatal MI matched for gender, age, and area of residence with population-based controls (n = 1,703). Conditional logistic regression models, adjusted for potential environmental confounders, were used for analysis. The common APOC3*222 haplotype was more frequent in cases than in controls (17.4% and 13.7%, respectively, P < 0.001) and was associated with increased risk of MI [odds ratio (OR) = 1.27; 95% confidence interval (95% CI), 1.09, 1.48] compared with APOC3*111 wild-type haplotype. This association was independent of the APOA5 SNPs. Although the APOC3 3238G, APOA5 -1131C, APOA5 c.-3G, and APOA5 c.1259C alleles were associated with higher triglyceride plasma concentrations, these effects could not explain the associations with MI in this population. In summary, this study supports the hypothesis that haplotypes in the APOC3 gene but not in the APOA5 gene increase susceptibility to MI.

  14. A Near-Complete Haplotype-Phased Genome of the Dikaryotic Wheat Stripe Rust Fungus Puccinia striiformis f. sp. tritici Reveals High Interhaplotype Diversity

    PubMed Central

    Sperschneider, Jana; Garnica, Diana P.; Miller, Marisa E.; Taylor, Jennifer M.; Dodds, Peter N.; Park, Robert F.

    2018-01-01

    ABSTRACT A long-standing biological question is how evolution has shaped the genomic architecture of dikaryotic fungi. To answer this, high-quality genomic resources that enable haplotype comparisons are essential. Short-read genome assemblies for dikaryotic fungi are highly fragmented and lack haplotype-specific information due to the high heterozygosity and repeat content of these genomes. Here, we present a diploid-aware assembly of the wheat stripe rust fungus Puccinia striiformis f. sp. tritici based on long reads using the FALCON-Unzip assembler. Transcriptome sequencing data sets were used to infer high-quality gene models and identify virulence genes involved in plant infection referred to as effectors. This represents the most complete Puccinia striiformis f. sp. tritici genome assembly to date (83 Mb, 156 contigs, N50 of 1.5 Mb) and provides phased haplotype information for over 92% of the genome. Comparisons of the phase blocks revealed high interhaplotype diversity of over 6%. More than 25% of all genes lack a clear allelic counterpart. When we investigated genome features that potentially promote the rapid evolution of virulence, we found that candidate effector genes are spatially associated with conserved genes commonly found in basidiomycetes. Yet, candidate effectors that lack an allelic counterpart are more distant from conserved genes than allelic candidate effectors and are less likely to be evolutionarily conserved within the P. striiformis species complex and Pucciniales. In summary, this haplotype-phased assembly enabled us to discover novel genome features of a dikaryotic plant-pathogenic fungus previously hidden in collapsed and fragmented genome assemblies. PMID:29463659

  15. The targetable A1 Huntington disease haplotype has distinct Amerindian and European origins in Latin America

    PubMed Central

    Kay, Chris; Tirado-Hurtado, Indira; Cornejo-Olivas, Mario; Collins, Jennifer A; Wright, Galen; Inca-Martinez, Miguel; Veliz-Otani, Diego; Ketelaar, Maria E; Slama, Ramy A; Ross, Colin J; Mazzetti, Pilar; Hayden, Michael R

    2017-01-01

    Huntington disease (HD) is a dominant neurodegenerative disorder caused by a CAG repeat expansion in the Huntingtin (HTT) gene. HD occurs worldwide, but the causative mutation is found on different HTT haplotypes in distinct ethnic groups. In Latin America, HD is thought to have European origins, but indigenous Amerindian ancestry has not been investigated. Here, we report dense HTT haplotypes in 62 mestizo Peruvian HD families, 17 HD families from across Latin America, and 42 controls of defined Peruvian Amerindian ethnicity to determine the origin of HD in populations of admixed Amerindian and European descent. HD in Peru occurs most frequently on the A1 HTT haplotype (73%), as in Europe, but on an unexpected indigenous variant also found in Amerindian controls. This Amerindian A1 HTT haplotype predominates over the European A1 variant among geographically disparate Latin American controls and in HD families from across Latin America, supporting an indigenous origin of the HD mutation in mestizo American populations. We also show that a proportion of HD mutations in Peru occur on a C1 HTT haplotype of putative Amerindian origin (14%). The majority of HD mutations in Latin America may therefore occur on haplotypes of Amerindian ancestry rather than on haplotypes resulting from European admixture. Despite the distinct ethnic ancestry of Amerindian and European A1 HTT, alleles on the parent A1 HTT haplotype allow for development of identical antisense molecules to selectively silence the HD mutation in the greatest proportion of patients in both Latin American and European populations. PMID:28000697

  16. Transcriptional regulation of gene expression clusters in motor neurons following spinal cord injury

    PubMed Central

    2010-01-01

    Background Spinal cord injury leads to neurological dysfunctions affecting the motor, sensory as well as the autonomic systems. Increased excitability of motor neurons has been implicated in injury-induced spasticity, where the reappearance of self-sustained plateau potentials in the absence of modulatory inputs from the brain correlates with the development of spasticity. Results Here we examine the dynamic transcriptional response of motor neurons to spinal cord injury as it evolves over time to unravel common gene expression patterns and their underlying regulatory mechanisms. For this we use a rat-tail-model with complete spinal cord transection causing injury-induced spasticity, where gene expression profiles are obtained from labeled motor neurons extracted with laser microdissection 0, 2, 7, 21 and 60 days post injury. Consensus clustering identifies 12 gene clusters with distinct time expression profiles. Analysis of these gene clusters identifies early immunological/inflammatory and late developmental responses as well as a regulation of genes relating to neuron excitability that support the development of motor neuron hyper-excitability and the reappearance of plateau potentials in the late phase of the injury response. Transcription factor motif analysis identifies differentially expressed transcription factors involved in the regulation of each gene cluster, shaping the expression of the identified biological processes and their associated genes underlying the changes in motor neuron excitability. Conclusions This analysis provides important clues to the underlying mechanisms of transcriptional regulation responsible for the increased excitability observed in motor neurons in the late chronic phase of spinal cord injury suggesting alternative targets for treatment of spinal cord injury. Several transcription factors were identified as potential regulators of gene clusters containing elements related to motor neuron hyper-excitability, the manipulation

  17. Transcriptional regulation of gene expression clusters in motor neurons following spinal cord injury.

    PubMed

    Ryge, Jesper; Winther, Ole; Wienecke, Jacob; Sandelin, Albin; Westerdahl, Ann-Charlotte; Hultborn, Hans; Kiehn, Ole

    2010-06-09

    Spinal cord injury leads to neurological dysfunctions affecting the motor, sensory as well as the autonomic systems. Increased excitability of motor neurons has been implicated in injury-induced spasticity, where the reappearance of self-sustained plateau potentials in the absence of modulatory inputs from the brain correlates with the development of spasticity. Here we examine the dynamic transcriptional response of motor neurons to spinal cord injury as it evolves over time to unravel common gene expression patterns and their underlying regulatory mechanisms. For this we use a rat-tail-model with complete spinal cord transection causing injury-induced spasticity, where gene expression profiles are obtained from labeled motor neurons extracted with laser microdissection 0, 2, 7, 21 and 60 days post injury. Consensus clustering identifies 12 gene clusters with distinct time expression profiles. Analysis of these gene clusters identifies early immunological/inflammatory and late developmental responses as well as a regulation of genes relating to neuron excitability that support the development of motor neuron hyper-excitability and the reappearance of plateau potentials in the late phase of the injury response. Transcription factor motif analysis identifies differentially expressed transcription factors involved in the regulation of each gene cluster, shaping the expression of the identified biological processes and their associated genes underlying the changes in motor neuron excitability. This analysis provides important clues to the underlying mechanisms of transcriptional regulation responsible for the increased excitability observed in motor neurons in the late chronic phase of spinal cord injury suggesting alternative targets for treatment of spinal cord injury. Several transcription factors were identified as potential regulators of gene clusters containing elements related to motor neuron hyper-excitability, the manipulation of which potentially could be

  18. Transcriptional analysis of exopolysaccharides biosynthesis gene clusters in Lactobacillus plantarum.

    PubMed

    Vastano, Valeria; Perrone, Filomena; Marasco, Rosangela; Sacco, Margherita; Muscariello, Lidia

    2016-04-01

    Exopolysaccharides (EPS) from lactic acid bacteria contribute to specific rheology and texture of fermented milk products and find applications also in non-dairy foods and in therapeutics. Recently, four clusters of genes (cps) associated with surface polysaccharide production have been identified in Lactobacillus plantarum WCFS1, a probiotic and food-associated lactobacillus. These clusters are involved in cell surface architecture and probably in release and/or exposure of immunomodulating bacterial molecules. Here we show a transcriptional analysis of these clusters. Indeed, RT-PCR experiments revealed that the cps loci are organized in five operons. Moreover, by reverse transcription-qPCR analysis performed on L. plantarum WCFS1 (wild type) and WCFS1-2 (ΔccpA), we demonstrated that expression of three cps clusters is under the control of the global regulator CcpA. These results, together with the identification of putative CcpA target sequences (catabolite responsive element CRE) in the regulatory region of four out of five transcriptional units, strongly suggest for the first time a role of the master regulator CcpA in EPS gene transcription among lactobacilli.

  19. RubisCO Gene Clusters Found in a Metagenome Microarray from Acid Mine Drainage

    PubMed Central

    Guo, Xue; Yin, Huaqun; Cong, Jing; Dai, Zhimin; Liang, Yili

    2013-01-01

    The enzyme responsible for carbon dioxide fixation in the Calvin cycle, ribulose-1,5-bisphosphate carboxylase/oxygenase (RubisCO), is always detected as a phylogenetic marker to analyze the distribution and activity of autotrophic bacteria. However, such an approach provides no indication as to the significance of genomic content and organization. Horizontal transfers of RubisCO genes occurring in eubacteria and plastids may seriously affect the credibility of this approach. Here, we presented a new method to analyze the diversity and genomic content of RubisCO genes in acid mine drainage (AMD). A metagenome microarray containing 7,776 large-insertion fosmids was constructed to quickly screen genome fragments containing RubisCO form I large-subunit genes (cbbL). Forty-six cbbL-containing fosmids were detected, and six fosmids were fully sequenced. To evaluate the reliability of the metagenome microarray and understand the microbial community in AMD, the diversities of cbbL and the 16S rRNA gene were analyzed. Fosmid sequences revealed that the form I RubisCO gene cluster could be subdivided into form IA and IB RubisCO gene clusters in AMD, because of significant divergences in molecular phylogenetics and conservative genomic organization. Interestingly, the form I RubisCO gene cluster coexisted with the form II RubisCO gene cluster in one fosmid genomic fragment. Phylogenetic analyses revealed that horizontal transfers of RubisCO genes may occur widely in AMD, which makes the evolutionary history of RubisCO difficult to reconcile with organismal phylogeny. PMID:23335778

  20. Association analysis of the IL-1 gene cluster polymorphisms with aggressive and chronic periodontitis in the Algerian population.

    PubMed

    Boukortt, Kawther Nourelhouda; Saidi-Ouahrani, Nadjia; Boukerzaza, Boubaker; Ouhaibi-Djellouli, Hadjira; Hachmaoui, Khalida; Benaissa, Fatima Zohra; Taleb, Leila; Drabla-Ouahrani, Hayet; Deba, Tahria; Ouledhamou, Sid Ahmed; Mehtar, Nadhera; Boudjema, Abdellah

    2015-10-01

    There is strong evidence that genetic as well as environmental factors affect the development of periodontitis. Various studies suggest that genetic polymorphisms of the interleukin-1 (IL-1) genes are associated with an increased risk of developing the pathogenesis. The aim of the present study was to investigate the possible relationship between two polymorphisms of IL-1 gene cluster IL-1B (C+3954T) (rs1143634) and IL-1A (C-889T) (rs1800587) SNPs and the aggressive and chronic periodontitis risk in a case control study in Algerian population. 279 subjects were recruited and received a periodontal examination: 128 healthy controls and 151 cases. From cases, 91 patients were having a chronic disease whereas 60 subjects with aggressive form. All these subjects were genotyped for IL-1A (C-889T) and IL-1B (C+3954T) polymorphisms using TaqMan real time PCR technology. Frequencies of IL-1 alleles, genotypes and the haplotypes were also examined. Significant differences were found in the carriage rate of both minor alleles of the IL-1A (C-889T) and IL-1B (C+3954T) polymorphisms of aggressive periodontitis cases compared with healthy controls (OR [95%CI]=1.61 [1.03-2.49], p=0.03), (OR [95%CI]=1.69 [1.09-2.63], p=0.01), respectively. The result did not reach significance with the chronic form. The studied polymorphisms of the IL-1 genes appear to be associated with susceptibility to aggressive periodontitis (AgP) in the Algerian population. Copyright © 2015 Elsevier Ltd. All rights reserved.

  1. Assessment of pfcrt 72-76 haplotypes eight years after chloroquine withdrawal in Kinshasa, Democratic Republic of Congo.

    PubMed

    Mvumbi, Dieudonné Makaba; Boreux, Raphael; Sacheli, Rosalie; Lelo, Mvumbi; Lengu, Bobanga; Nani-Tuma, Situakibanza; Melin, Pierrette; Ntumba, Kayembe; Lunganza, Kalala; DeMol, Patrick; Hayette, Marie-Pierre

    2013-12-20

    In 2001, the World Health Organization (WHO) has recommended the use of artemisinin-based combination therapy (ACT) as the first-line treatment of uncomplicated malaria cases, as monotherapies had become ineffective in many parts of the world. As a result, the Democratic Republic of Congo (DRC) withdrew chloroquine (CQ) from its malaria treatment policy in 2002 and an artesunate (AS)-amodiaquine (AQ) combination became the ACT of choice in DRC in 2005. AQ-resistance (AQR) has been reported in several parts of the world and mutations in codons 72-76 of the Plasmodium falciparum chloroquine-resistance transporter (pfcrt) gene have been strongly correlated with resistance, especially mutations encoding the SVMNT haplotype. This haplotype was first identified in Southeast Asia and South America but was recently reported in two African countries neighbouring DRC. These facts raised two questions: the first about the evolution of CQ resistance (CQR) in DRC and the second about the presence of the SVMNT haplotype, which would compromise the use of AQ as a partner drug for ACT. A total of 213 thick blood films were randomly collected in 2010 from a paediatric clinic in Kinshasa, DRC. Microscopy controls and real-time polymerase chain reaction (RT-PCR) were performed for Plasmodium species identification. Haplotypes of the pfcrt gene were determined by sequencing. The K76T mutation was detected in 145 out of 198 P. falciparum-positive samples (73.2%). In these 145 resistant strains, only the CVIET haplotype was detected. This study is the first to assess the molecular markers of resistance to CQ and AQ after the introduction of ACT in DRC. The results suggest first that CQR is decreasing, as wild-type pfcrt haplotypes were found in only 26.8% of the samples and secondly that the SVMNT haplotype is not yet present in Kinshasa, suggesting that AQ remains valid as a partner drug for ACT in this region.

  2. Identification and characterization of the ergochrome gene cluster in the plant pathogenic fungus Claviceps purpurea.

    PubMed

    Neubauer, Lisa; Dopstadt, Julian; Humpf, Hans-Ulrich; Tudzynski, Paul

    2016-01-01

    Claviceps purpurea is a phytopathogenic fungus infecting a broad range of grasses including economically important cereal crop plants. The infection cycle ends with the formation of the typical purple-black pigmented sclerotia containing the toxic ergot alkaloids. Besides these ergot alkaloids little is known about the secondary metabolism of the fungus. Red anthraquinone derivatives and yellow xanthone dimers (ergochromes) have been isolated from sclerotia and described as ergot pigments, but the corresponding gene cluster has remained unknown. Fungal pigments gain increasing interest for example as environmentally friendly alternatives to existing dyes. Furthermore, several pigments show biological activities and may have some pharmaceutical value. This study identified the gene cluster responsible for the synthesis of the ergot pigments. Overexpression of the cluster-specific transcription factor led to activation of the gene cluster and to the production of several known ergot pigments. Knock out of the cluster key enzyme, a nonreducing polyketide synthase, clearly showed that this cluster is responsible for the production of red anthraquinones as well as yellow ergochromes. Furthermore, a tentative biosynthetic pathway for the ergot pigments is proposed. By changing the culture conditions, pigment production was activated in axenic culture so that high concentration of phosphate and low concentration of sucrose induced pigment syntheses. This is the first functional analysis of a secondary metabolite gene cluster in the ergot fungus besides that for the classical ergot alkaloids. We demonstrated that this gene cluster is responsible for the typical purple-black color of the ergot sclerotia and showed that the red and yellow ergot pigments are products of the same biosynthetic pathway. Activation of the gene cluster in axenic culture opened up new possibilities for biotechnological applications like the dye production or the development of new pharmaceuticals.

  3. Clustering of time-course gene expression profiles using normal mixture models with autoregressive random effects

    PubMed Central

    2012-01-01

    Background Time-course gene expression data such as yeast cell cycle data may be periodically expressed. To cluster such data, currently used Fourier series approximations of periodic gene expressions have been found not to be sufficiently adequate to model the complexity of the time-course data, partly due to their ignoring the dependence between the expression measurements over time and the correlation among gene expression profiles. We further investigate the advantages and limitations of available models in the literature and propose a new mixture model with autoregressive random effects of the first order for the clustering of time-course gene-expression profiles. Some simulations and real examples are given to demonstrate the usefulness of the proposed models. Results We illustrate the applicability of our new model using synthetic and real time-course datasets. We show that our model outperforms existing models to provide more reliable and robust clustering of time-course data. Our model provides superior results when genetic profiles are correlated. It also gives comparable results when the correlation between the gene profiles is weak. In the applications to real time-course data, relevant clusters of coregulated genes are obtained, which are supported by gene-function annotation databases. Conclusions Our new model under our extension of the EMMIX-WIRE procedure is more reliable and robust for clustering time-course data because it adopts a random effects model that allows for the correlation among observations at different time points. It postulates gene-specific random effects with an autocorrelation variance structure that models coregulation within the clusters. The developed R package is flexible in its specification of the random effects through user-input parameters that enables improved modelling and consequent clustering of time-course data. PMID:23151154

  4. Two extended haplotype blocks are associated with adaptation to high altitude habitats in East African honey bees

    PubMed Central

    Schöning, Caspar

    2017-01-01

    Understanding the genetic basis of adaption is a central task in biology. Populations of the honey bee Apis mellifera that inhabit the mountain forests of East Africa differ in behavior and morphology from those inhabiting the surrounding lowland savannahs, which likely reflects adaptation to these habitats. We performed whole genome sequencing on 39 samples of highland and lowland bees from two pairs of populations to determine their evolutionary affinities and identify the genetic basis of these putative adaptations. We find that in general, levels of genetic differentiation between highland and lowland populations are very low, consistent with them being a single panmictic population. However, we identify two loci on chromosomes 7 and 9, each several hundred kilobases in length, which exhibit near fixation for different haplotypes between highland and lowland populations. The highland haplotypes at these loci are extremely rare in samples from the rest of the world. Patterns of segregation of genetic variants suggest that recombination between haplotypes at each locus is suppressed, indicating that they comprise independent structural variants. The haplotype on chromosome 7 harbors nearly all octopamine receptor genes in the honey bee genome. These have a role in learning and foraging behavior in honey bees and are strong candidates for adaptation to highland habitats. Molecular analysis of a putative breakpoint indicates that it may disrupt the coding sequence of one of these genes. Divergence between the highland and lowland haplotypes at both loci is extremely high suggesting that they are ancient balanced polymorphisms that greatly predate divergence between the extant honey bee subspecies. PMID:28542163

  5. The Genome of Tolypocladium inflatum: Evolution, Organization, and Expression of the Cyclosporin Biosynthetic Gene Cluster

    PubMed Central

    Bushley, Kathryn E.; Raja, Rajani; Jaiswal, Pankaj; Cumbie, Jason S.; Nonogaki, Mariko; Boyd, Alexander E.; Owensby, C. Alisha; Knaus, Brian J.; Elser, Justin; Miller, Daniel; Di, Yanming; McPhail, Kerry L.; Spatafora, Joseph W.

    2013-01-01

    The ascomycete fungus Tolypocladium inflatum, a pathogen of beetle larvae, is best known as the producer of the immunosuppressant drug cyclosporin. The draft genome of T. inflatum strain NRRL 8044 (ATCC 34921), the isolate from which cyclosporin was first isolated, is presented along with comparative analyses of the biosynthesis of cyclosporin and other secondary metabolites in T. inflatum and related taxa. Phylogenomic analyses reveal previously undetected and complex patterns of homology between the nonribosomal peptide synthetase (NRPS) that encodes for cyclosporin synthetase (simA) and those of other secondary metabolites with activities against insects (e.g., beauvericin, destruxins, etc.), and demonstrate the roles of module duplication and gene fusion in diversification of NRPSs. The secondary metabolite gene cluster responsible for cyclosporin biosynthesis is described. In addition to genes necessary for cyclosporin biosynthesis, it harbors a gene for a cyclophilin, which is a member of a family of immunophilins known to bind cyclosporin. Comparative analyses support a lineage specific origin of the cyclosporin gene cluster rather than horizontal gene transfer from bacteria or other fungi. RNA-Seq transcriptome analyses in a cyclosporin-inducing medium delineate the boundaries of the cyclosporin cluster and reveal high levels of expression of the gene cluster cyclophilin. In medium containing insect hemolymph, weaker but significant upregulation of several genes within the cyclosporin cluster, including the highly expressed cyclophilin gene, was observed. T. inflatum also represents the first reference draft genome of Ophiocordycipitaceae, a third family of insect pathogenic fungi within the fungal order Hypocreales, and supports parallel and qualitatively distinct radiations of insect pathogens. The T. inflatum genome provides additional insight into the evolution and biosynthesis of cyclosporin and lays a foundation for further investigations of the role

  6. Gene cluster conservation provides insight into cercosporin biosynthesis and extends production to the genus Colletotrichum.

    PubMed

    de Jonge, Ronnie; Ebert, Malaika K; Huitt-Roehl, Callie R; Pal, Paramita; Suttle, Jeffrey C; Spanner, Rebecca E; Neubauer, Jonathan D; Jurick, Wayne M; Stott, Karina A; Secor, Gary A; Thomma, Bart P H J; Van de Peer, Yves; Townsend, Craig A; Bolton, Melvin D

    2018-06-12

    Species in the genus Cercospora cause economically devastating diseases in sugar beet, maize, rice, soy bean, and other major food crops. Here, we sequenced the genome of the sugar beet pathogen Cercospora beticola and found it encodes 63 putative secondary metabolite gene clusters, including the cercosporin toxin biosynthesis ( CTB ) cluster. We show that the CTB gene cluster has experienced multiple duplications and horizontal transfers across a spectrum of plant pathogenic fungi, including the wide-host range Colletotrichum genus as well as the rice pathogen Magnaporthe oryzae Although cercosporin biosynthesis has been thought to rely on an eight-gene CTB cluster, our phylogenomic analysis revealed gene collinearity adjacent to the established cluster in all CTB cluster-harboring species. We demonstrate that the CTB cluster is larger than previously recognized and includes cercosporin facilitator protein, previously shown to be involved with cercosporin autoresistance, and four additional genes required for cercosporin biosynthesis, including the final pathway enzymes that install the unusual cercosporin methylenedioxy bridge. Lastly, we demonstrate production of cercosporin by Colletotrichum fioriniae , the first known cercosporin producer within this agriculturally important genus. Thus, our results provide insight into the intricate evolution and biology of a toxin critical to agriculture and broaden the production of cercosporin to another fungal genus containing many plant pathogens of important crops worldwide. Copyright © 2018 the Author(s). Published by PNAS.

  7. A scan statistic to extract causal gene clusters from case-control genome-wide rare CNV data.

    PubMed

    Nishiyama, Takeshi; Takahashi, Kunihiko; Tango, Toshiro; Pinto, Dalila; Scherer, Stephen W; Takami, Satoshi; Kishino, Hirohisa

    2011-05-26

    Several statistical tests have been developed for analyzing genome-wide association data by incorporating gene pathway information in terms of gene sets. Using these methods, hundreds of gene sets are typically tested, and the tested gene sets often overlap. This overlapping greatly increases the probability of generating false positives, and the results obtained are difficult to interpret, particularly when many gene sets show statistical significance. We propose a flexible statistical framework to circumvent these problems. Inspired by spatial scan statistics for detecting clustering of disease occurrence in the field of epidemiology, we developed a scan statistic to extract disease-associated gene clusters from a whole gene pathway. Extracting one or a few significant gene clusters from a global pathway limits the overall false positive probability, which results in increased statistical power, and facilitates the interpretation of test results. In the present study, we applied our method to genome-wide association data for rare copy-number variations, which have been strongly implicated in common diseases. Application of our method to a simulated dataset demonstrated the high accuracy of this method in detecting disease-associated gene clusters in a whole gene pathway. The scan statistic approach proposed here shows a high level of accuracy in detecting gene clusters in a whole gene pathway. This study has provided a sound statistical framework for analyzing genome-wide rare CNV data by incorporating topological information on the gene pathway.

  8. Identification of the Coumermycin A1 Biosynthetic Gene Cluster of Streptomyces rishiriensis DSM 40489

    PubMed Central

    Wang, Zhao-Xin; Li, Shu-Ming; Heide, Lutz

    2000-01-01

    The biosynthetic gene cluster of the aminocoumarin antibiotic coumermycin A1 was cloned by screening of a cosmid library of Streptomyces rishiriensis DSM 40489 with heterologous probes from a dTDP-glucose 4,6-dehydratase gene, involved in deoxysugar biosynthesis, and from the aminocoumarin resistance gyrase gene gyrBr. Sequence analysis of a 30.8-kb region upstream of gyrBr revealed the presence of 28 complete open reading frames (ORFs). Fifteen of the identified ORFs showed, on average, 84% identity to corresponding ORFs in the biosynthetic gene cluster of novobiocin, another aminocoumarin antibiotic. Possible functions of 17 ORFs in the biosynthesis of coumermycin A1 could be assigned by comparison with sequences in GenBank. Experimental proof for the function of the identified gene cluster was provided by an insertional gene inactivation experiment, which resulted in an abolishment of coumermycin A1 production. PMID:11036020

  9. Haplotype Sharing Provides Insights into Fine-Scale Population History and Disease in Finland.

    PubMed

    Martin, Alicia R; Karczewski, Konrad J; Kerminen, Sini; Kurki, Mitja I; Sarin, Antti-Pekka; Artomov, Mykyta; Eriksson, Johan G; Esko, Tõnu; Genovese, Giulio; Havulinna, Aki S; Kaprio, Jaakko; Konradi, Alexandra; Korányi, László; Kostareva, Anna; Männikkö, Minna; Metspalu, Andres; Perola, Markus; Prasad, Rashmi B; Raitakari, Olli; Rotar, Oxana; Salomaa, Veikko; Groop, Leif; Palotie, Aarno; Neale, Benjamin M; Ripatti, Samuli; Pirinen, Matti; Daly, Mark J

    2018-05-03

    Finland provides unique opportunities to investigate population and medical genomics because of its adoption of unified national electronic health records, detailed historical and birth records, and serial population bottlenecks. We assembled a comprehensive view of recent population history (≤100 generations), the timespan during which most rare-disease-causing alleles arose, by comparing pairwise haplotype sharing from 43,254 Finns to that of 16,060 Swedes, Estonians, Russians, and Hungarians from geographically and linguistically adjacent countries with different population histories. We find much more extensive sharing in Finns, with at least one ≥ 5 cM tract on average between pairs of unrelated individuals. By coupling haplotype sharing with fine-scale birth records from more than 25,000 individuals, we find that although haplotype sharing broadly decays with geographical distance, there are pockets of excess haplotype sharing; individuals from northeast Finland typically share several-fold more of their genome in identity-by-descent segments than individuals from southwest regions. We estimate recent effective population-size changes through time across regions of Finland, and we find that there was more continuous gene flow as Finns migrated from southwest to northeast between the early- and late-settlement regions than was dichotomously described previously. Lastly, we show that haplotype sharing is locally enriched by an order of magnitude among pairs of individuals sharing rare alleles and especially among pairs sharing rare disease-causing variants. Our work provides a general framework for using haplotype sharing to reconstruct an integrative view of recent population history and gain insight into the evolutionary origins of rare variants contributing to disease. Copyright © 2018 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  10. Allelic and haplotypic diversity of HLA-A, -B, -C, -DRB1, and -DQB1 genes in the Korean population.

    PubMed

    Lee, K W; Oh, D H; Lee, C; Yang, S Y

    2005-05-01

    High-resolution human leukocyte antigen (HLA) typing exposes the unique patterns of HLA allele and haplotype frequencies in each population. In this study, HLA-A, -B, -C, -DRB1, and -DQB1 genotypes were analyzed in 485 apparently unrelated healthy Korean individuals. A total of 20 HLA-A, 43 HLA-B, 21 HLA-C, 31 HLA-DRB1, and 14 HLA-DQB1 alleles were identified. Eleven alleles (A*0201, A*1101, A*2402, A*3303, B*1501, Cw*0102, Cw*0302, Cw*0303, DQB1*0301, DQB1*0302, and DQB1*0303) were found in more than 10% of the population. In each serologic group, a maximum of three alleles were found with several exceptions (A2, B62, DR4, DR14, and DQ6). In each serologic group exhibiting multiple alleles, two major alleles were present at 62-96% (i.e. A*0201 and A*0206 comprise 85% of A2-positive alleles). Multiple-locus haplotypes estimated by the maximum likelihood method revealed 51 A-C, 43 C-B, 52 B-DRB1, 34 DRB1-DQB1, 48 A-C-B, 42 C-B-DRB1, 46 B-DRB1-DQB1, and 30 A-C-B-DRB1-DQB1 haplotypes with frequencies of more than 0.5%. In spite of their high polymorphism in B and DRB1, identification of relatively small numbers of two-locus (B-C and DRB1-DQB1) haplotypes suggested strong associations of those two loci, respectively. Five-locus haplotypes defined by high-resolution DNA typing correlated well with previously identified serology-based haplotypes in the population. The five most frequent haplotypes were: A*3303-Cw*1403-B*4403-DRB1*1302-DQB1*0604 (4.2%), A*3303-Cw*0701/6-B*4403-DRB1*0701-DQB1*0201/2 (3.0%), A*3303-Cw*0302-B*5801-DRB1*1302-DQB1*0609 (3.0%), A*2402-Cw*0702-B*0702-DRB1*0101-DQB1*0501 (2.9%), and A*3001-Cw*0602-B*1302-DRB1*0701-DQB1*0201/2 (2.7%). Several sets of allele level haplotypes that could not be discriminated by routine HLA-A, -B, and -DRB1 low-resolution typing originated from allelic diversity of A2, B61, DR4, and DR8 serologic groups. Information obtained in this study will be useful for medical and forensic applications as well as in anthropology.

  11. Genetic variants in a haplotype block spanning IDE are significantly associated with plasma Abeta42 levels and risk for Alzheimer disease.

    PubMed

    Ertekin-Taner, Nilüfer; Allen, Mariet; Fadale, Daniel; Scanlin, Leah; Younkin, Linda; Petersen, Ronald C; Graff-Radford, Neill; Younkin, Steven G

    2004-04-01

    Risk for late onset Alzheimer disease (LOAD) and plasma amyloid beta levels (Abeta42; encoded by APP), an intermediate phenotype for LOAD, show linkage to chromosome 10q. Several strong candidate genes (VR22, PLAU, IDE) lie within the 1-lod support interval for linkage. Others have independently identified haplotypes in the chromosome 10q region harboring IDE that show highly significant association with intermediate AD phenotypes and with risk for AD. To pursue these associations, we analyzed the same haplotypes for association with plasma Abeta42 in 24 extended LOAD families and for association with LOAD in two independent case-control series. One series (MCR, 188 age-matched case-control pairs) did not show association (p=0.64) with the six haplotypes in the 276-kb region spanning three genes (IDE, KNSL1, and HHEX) previously shown to associate with LOAD. The other series (MCJ, 109 age-matched case-control pairs) showed significant (p=0.003) association with these haplotypes. In the MCJ series, the H4 (odds ratio [OR]=5.1, p=0.003) and H2(H7) haplotypes (OR=0.60, p=0.04) had the same effects previously reported. In this series, the H8 haplotype (OR=2.7, p=0.098) also had an effect similar as in one previous case control series but not in others. In the extended families, the H8 haplotype was associated with significantly elevated plasma Abeta42 (p=0.02). In addition, the H5(H10) haplotype, which is associated with reduced risk for AD in the other study is associated with reduced plasma Abeta42 (p=0.007) in our family series. These results provide strong evidence for pathogenic variant(s) in the 276-kb region harboring IDE that influence intermediate AD phenotypes and risk for AD. Copyright 2004 Wiley-Liss, Inc.

  12. Gene structure and expression characteristic of a novel odorant receptor gene cluster in the parasitoid wasp Microplitis mediator (Hymenoptera: Braconidae).

    PubMed

    Wang, S-N; Shan, S; Zheng, Y; Peng, Y; Lu, Z-Y; Yang, Y-Q; Li, R-J; Zhang, Y-J; Guo, Y-Y

    2017-08-01

    Odorant receptors (ORs) expressed in the antennae of parasitoid wasps are responsible for detection of various lipophilic airborne molecules. In the present study, 107 novel OR genes were identified from Microplitis mediator antennal transcriptome data. Phylogenetic analysis of the set of OR genes from M. mediator and Microplitis demolitor revealed that M. mediator OR (MmedOR) genes can be classified into different subfamilies, and the majority of MmedORs in each subfamily shared high sequence identities and clear orthologous relationships to M. demolitor ORs. Within a subfamily, six MmedOR genes, MmedOR98, 124, 125, 126, 131 and 155, shared a similar gene structure and were tightly linked in the genome. To evaluate whether the clustered MmedOR genes share common regulatory features, the transcription profile and expression characteristics of the six closely related OR genes were investigated in M. mediator. Rapid amplification of cDNA ends-PCR experiments revealed that the OR genes within the cluster were transcribed as single mRNAs, and a bicistronic mRNA for two adjacent genes (MmedOR124 and MmedOR98) was also detected in female antennae by reverse transcription PCR. In situ hybridization experiments indicated that each OR gene within the cluster was expressed in a different number of cells. Moreover, there was no co-expression of the two highly related OR genes, MmedOR124 and MmedOR98, which appeared to be individually expressed in a distinct population of neurons. Overall, there were distinct expression profiles of closely related MmedOR genes from the same cluster in M. mediator. These data provide a basic understanding of the olfactory coding in parasitoid wasps. © 2017 The Royal Entomological Society.

  13. Polymorphisms and linkage analysis for ICAM-1 and the selectin gene cluster

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Vora, D.K.; Rosenbloom, C.L.; Cottingham, R.W.

    1994-06-01

    Genetic polymorphisms in leukocyte and endothelial cell adhesion molecules may be important variables with regard to susceptibility to multifactorial disease processes that include an inflammatory component. For this reason, polymorphisms were sought for intercellular adhesion molecule-1 (ICAM-1; gene symbol ICAM1) and for the three genes in the selectin cluster, P-selectin, L-selectin, and E-selectin (gene symbols SELP, SELL, and SELE, respectively). Two amino acid polymorphisms were identified for ICAM-1; Gly or Arg at codon 241 and Lys or Glu at codon 469. Dinucleotide repeat polymorphisms were identified in the 3{prime}-untranslated region for ICAM-1 and in intron 9 for P-selectin. Restriction fragmentmore » length polymorphisms were found using cDNAs for each of the three selectin genes as probes; E-selectin with BglII, P-selectin with ScaI, and L-selectin with HincII. Linkage analysis was performed for the selectin gene cluster and for ICAM-1 using the CEPH families; ICAM-1 is very tightly linked to the LDL receptor on chromosome 19, and the selectin cluster is linked to markers at chromosome 1q23. 41 refs., 2 tabs.« less

  14. Reconstruction of Haplotype-Blocks Selected during Experimental Evolution.

    PubMed

    Franssen, Susanne U; Barton, Nicholas H; Schlötterer, Christian

    2017-01-01

    The genetic analysis of experimentally evolving populations typically relies on short reads from pooled individuals (Pool-Seq). While this method provides reliable allele frequency estimates, the underlying haplotype structure remains poorly characterized. With small population sizes and adaptive variants that start from low frequencies, the interpretation of selection signatures in most Evolve and Resequencing studies remains challenging. To facilitate the characterization of selection targets, we propose a new approach that reconstructs selected haplotypes from replicated time series, using Pool-Seq data. We identify selected haplotypes through the correlated frequencies of alleles carried by them. Computer simulations indicate that selected haplotype-blocks of several Mb can be reconstructed with high confidence and low error rates, even when allele frequencies change only by 20% across three replicates. Applying this method to real data from D. melanogaster populations adapting to a hot environment, we identify a selected haplotype-block of 6.93 Mb. We confirm the presence of this haplotype-block in evolved populations by experimental haplotyping, demonstrating the power and accuracy of our haplotype reconstruction from Pool-Seq data. We propose that the combination of allele frequency estimates with haplotype information will provide the key to understanding the dynamics of adaptive alleles. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  15. Spatial enhancer clustering and regulation of enhancer-proximal genes by cohesin

    PubMed Central

    Ing-Simmons, Elizabeth; Seitan, Vlad C.; Faure, Andre J.; Flicek, Paul; Carroll, Thomas; Dekker, Job; Fisher, Amanda G.; Lenhard, Boris

    2015-01-01

    In addition to mediating sister chromatid cohesion during the cell cycle, the cohesin complex associates with CTCF and with active gene regulatory elements to form long-range interactions between its binding sites. Genome-wide chromosome conformation capture had shown that cohesin's main role in interphase genome organization is in mediating interactions within architectural chromosome compartments, rather than specifying compartments per se. However, it remains unclear how cohesin-mediated interactions contribute to the regulation of gene expression. We have found that the binding of CTCF and cohesin is highly enriched at enhancers and in particular at enhancer arrays or “super-enhancers” in mouse thymocytes. Using local and global chromosome conformation capture, we demonstrate that enhancer elements associate not just in linear sequence, but also in 3D, and that spatial enhancer clustering is facilitated by cohesin. The conditional deletion of cohesin from noncycling thymocytes preserved enhancer position, H3K27ac, H4K4me1, and enhancer transcription, but weakened interactions between enhancers. Interestingly, ∼50% of deregulated genes reside in the vicinity of enhancer elements, suggesting that cohesin regulates gene expression through spatial clustering of enhancer elements. We propose a model for cohesin-dependent gene regulation in which spatial clustering of enhancer elements acts as a unified mechanism for both enhancer-promoter “connections” and “insulation.” PMID:25677180

  16. Two Horizontally Transferred Xenobiotic Resistance Gene Clusters Associated with Detoxification of Benzoxazolinones by Fusarium Species

    PubMed Central

    Glenn, Anthony E.; Davis, C. Britton; Gao, Minglu; Gold, Scott E.; Mitchell, Trevor R.; Proctor, Robert H.; Stewart, Jane E.; Snook, Maurice E.

    2016-01-01

    Microbes encounter a broad spectrum of antimicrobial compounds in their environments and often possess metabolic strategies to detoxify such xenobiotics. We have previously shown that Fusarium verticillioides, a fungal pathogen of maize known for its production of fumonisin mycotoxins, possesses two unlinked loci, FDB1 and FDB2, necessary for detoxification of antimicrobial compounds produced by maize, including the γ-lactam 2-benzoxazolinone (BOA). In support of these earlier studies, microarray analysis of F. verticillioides exposed to BOA identified the induction of multiple genes at FDB1 and FDB2, indicating the loci consist of gene clusters. One of the FDB1 cluster genes encoded a protein having domain homology to the metallo-β-lactamase (MBL) superfamily. Deletion of this gene (MBL1) rendered F. verticillioides incapable of metabolizing BOA and thus unable to grow on BOA-amended media. Deletion of other FDB1 cluster genes, in particular AMD1 and DLH1, did not affect BOA degradation. Phylogenetic analyses and topology testing of the FDB1 and FDB2 cluster genes suggested two horizontal transfer events among fungi, one being transfer of FDB1 from Fusarium to Colletotrichum, and the second being transfer of the FDB2 cluster from Fusarium to Aspergillus. Together, the results suggest that plant-derived xenobiotics have exerted evolutionary pressure on these fungi, leading to horizontal transfer of genes that enhance fitness or virulence. PMID:26808652

  17. Genome-wide DNA methylation analysis reveals estrogen-mediated epigenetic repression of metallothionein-1 gene cluster in breast cancer.

    PubMed

    Jadhav, Rohit R; Ye, Zhenqing; Huang, Rui-Lan; Liu, Joseph; Hsu, Pei-Yin; Huang, Yi-Wen; Rangel, Leticia B; Lai, Hung-Cheng; Roa, Juan Carlos; Kirma, Nameer B; Huang, Tim Hui-Ming; Jin, Victor X

    2015-01-01

    Recent genome-wide analysis has shown that DNA methylation spans long stretches of chromosome regions consisting of clusters of contiguous CpG islands or gene families. Hypermethylation of various gene clusters has been reported in many types of cancer. In this study, we conducted methyl-binding domain capture (MBDCap) sequencing (MBD-seq) analysis on a breast cancer cohort consisting of 77 patients and 10 normal controls, as well as a panel of 38 breast cancer cell lines. Bioinformatics analysis determined seven gene clusters with a significant difference in overall survival (OS) and further revealed a distinct feature that the conservation of a large gene cluster (approximately 70 kb) metallothionein-1 (MT1) among 45 species is much lower than the average of all RefSeq genes. Furthermore, we found that DNA methylation is an important epigenetic regulator contributing to gene repression of MT1 gene cluster in both ERα positive (ERα+) and ERα negative (ERα-) breast tumors. In silico analysis revealed much lower gene expression of this cluster in The Cancer Genome Atlas (TCGA) cohort for ERα + tumors. To further investigate the role of estrogen, we conducted 17β-estradiol (E2) and demethylating agent 5-aza-2'-deoxycytidine (DAC) treatment in various breast cancer cell types. Cell proliferation and invasion assays suggested MT1F and MT1M may play an anti-oncogenic role in breast cancer. Our data suggests that DNA methylation in large contiguous gene clusters can be potential prognostic markers of breast cancer. Further investigation of these clusters revealed that estrogen mediates epigenetic repression of MT1 cluster in ERα + breast cancer cell lines. In all, our studies identify thousands of breast tumor hypermethylated regions for the first time, in particular, discovering seven large contiguous hypermethylated gene clusters.

  18. Cis-acting mutation and duplication: History of molecular evolution in a P450 haplotype responsible for insecticide resistance in Culex quinquefasciatus.

    PubMed

    Itokawa, Kentaro; Komagata, Osamu; Kasai, Shinji; Masada, Masahiro; Tomita, Takashi

    2011-07-01

    A cytochrome P450 gene, Cyp9m10, is more than 200-fold overexpressed in a pyrethroid resistant strain of Culex quinquefasciatus, JPal-per. The haplotype of this strain contains two copies of Cyp9m10 resulted from recent tandem duplication. In this study, we discovered and isolated a Cyp9m10 haplotype closely related to this duplicated Cyp9m10 haplotype from JHB, a strain used for the recent genome project for this mosquito species. The isolated haplotype (JHB-NIID-B haplotype) shared the same insertion of a transposable element upstream of the coding region with JPal-per strain but not duplicated. The JHB-NIID-B haplotype was considered to have diverged from the JPal-per lineage just before the duplication event. Cyp9m10 was moderately overexpressed in larvae with the JHB-NIID-B haplotype. The overexpressions in JHB-NIID-B and JPal-per haplotypes were developmentally regulated in similar pattern indicating both haplotypes share a common cis-acting mutation responsible for the overexpressions. The isolated moderately overexpressed haplotype conferred resistance, however, its efficacy was relatively small. We hypothesized that the first cis-acting mutation modified the consequence of the subsequent duplication in JPal-per lineage to confer stronger phenotypic effect than that if it occurred before the first cis-acting mutation. Copyright © 2011 Elsevier Ltd. All rights reserved.

  19. Patterns of haplotypes for 92 cystic fibrosis mutations: Variability, association and recurrence

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Morral, N.; Llevadot, R.; Estivill, X.

    1994-09-01

    Most CFTR mutations are very uncommon among the cystic fibrosis population, with frequencies of less than 1%, and many are found only in specific areas. We have analyzed 92 CF mutations for several markers (4 microsatellites and 3 other polymorphisms) scattered in the CFTR gene. Haplotypes associated with these mutations can be used as a framework in the screening of chromosomes carrying unknown mutations. The association between mutation and haplotype reduces the number of mutations it is necessary to search for to a maximum of 16 for the same haplotype. Only mutations {triangle}F508, G542X and N1303K are associated with moremore » than one haplotype as a result of slippage at more than one microsatellite loci, suggesting that these three are the most ancient CF mutations. Recurrence has been found for at least 7 mutations: H199Y, R347P, L558S, R553X, 2184insA, 3272-26A{r_arrow}G, 3849+10kbC{r_arrow}T and R1162X. Also microsatellite analysis of chromosomes of several ethnic origins (Czech, Italian, Russian, Slovac and Spanish) suggested that possibility of three or more independent origins for mutations R334W, R347P, R1162X, and 3849+10kbC{r_arrow}T, which was confirmed by analysis of markers flanking these mutations.« less

  20. Regulatory Feedback Loop of Two phz Gene Clusters through 5′-Untranslated Regions in Pseudomonas sp. M18

    PubMed Central

    Li, Yaqian; Du, Xilin; Lu, Zhi John; Wu, Daqiang; Zhao, Yilei; Ren, Bin; Huang, Jiaofang; Huang, Xianqing; Xu, Yuhong; Xu, Yuquan

    2011-01-01

    Background Phenazines are important compounds produced by pseudomonads and other bacteria. Two phz gene clusters called phzA1-G1 and phzA2-G2, respectively, were found in the genome of Pseudomonas sp. M18, an effective biocontrol agent, which is highly homologous to the opportunistic human pathogen P. aeruginosa PAO1, however little is known about the correlation between the expressions of two phz gene clusters. Methodology/Principal Findings Two chromosomal insertion inactivated mutants for the two gene clusters were constructed respectively and the correlation between the expressions of two phz gene clusters was investigated in strain M18. Phenazine-1-carboxylic acid (PCA) molecules produced from phzA2-G2 gene cluster are able to auto-regulate expression itself and activate the expression of phzA1-G1 gene cluster in a circulated amplification pattern. However, the post-transcriptional expression of phzA1-G1 transcript was blocked principally through 5′-untranslated region (UTR). In contrast, the phzA2-G2 gene cluster was transcribed to a lesser extent and translated efficiently and was negatively regulated by the GacA signal transduction pathway, mainly at a post-transcriptional level. Conclusions/Significance A single molecule, PCA, produced in different quantities by the two phz gene clusters acted as the functional mediator and the two phz gene clusters developed a specific regulatory mechanism which acts through 5′-UTR to transfer a single, but complex bacterial signaling event in Pseudomonas sp. strain M18. PMID:21559370

  1. Suggestive evidence for association between L-type voltage-gated calcium channel (CACNA1C) gene haplotypes and bipolar disorder in Latinos: a family-based association study

    PubMed Central

    Gonzalez, Suzanne; Xu, Chun; Ramirez, Mercedes; Zavala, Juan; Armas, Regina; Contreras, Salvador A; Contreras, Javier; Dassori, Albana; Leach, Robin J; Flores, Deborah; Jerez, Alvaro; Raventós, Henriette; Ontiveros, Alfonso; Nicolini, Humberto; Escamilla, Michael

    2013-01-01

    Objectives Through recent genome-wide association studies (GWAS), several groups have reported significant association between variants in the alpha 1C subunit of the L-type voltage-gated calcium channel (CACNA1C) and bipolar disorder (BP) in European and European-American cohorts. We performed a family-based association study to determine whether CACNA1C is associated with BP in the Latino population. Methods This study consisted of 913 individuals from 215 Latino pedigrees recruited from the United States, Mexico, Guatemala, and Costa Rica. The Illumina GoldenGate Genotyping Assay was used to genotype 58 single-nucleotide polymorphisms (SNPs) that spanned a 602.9 kb region encompassing the CACNA1C gene including two SNPs (rs7297582 and rs1006737) previously shown to associate with BP. Individual SNP and haplotype association analyses were performed using Family-Based Association Test (version 2.0.3) and Haploview (version 4.2) software. Results An eight-locus haplotype block that included these two markers showed significant association with BP (global marker permuted p = 0.0018) in the Latino population. For individual SNPs, this sample had insufficient power (10%) to detect associations with SNPs with minor effect (odds ratio = 1.15). Conclusions Although we were not able to replicate findings of association between individual CACNA1C SNPs rs7297582 and rs1006737 and BP, we were able to replicate the GWAS signal reported for CACNA1C through a haplotype analysis that encompassed these previously reported significant SNPs. These results provide additional evidence that CACNA1C is associated with BP and provides the first evidence that variations in this gene might play a role in the pathogenesis of this disorder in the Latino population. PMID:23437964

  2. Frequency distribution of interleukin-10 haplotypes (-1082 A>G, -819 C>T, and -592 C>A) in a Mexican population.

    PubMed

    Vázquez-Villamar, M; Palafox-Sánchez, C A; Hernández-Bello, J; Muñoz-Valle, J F; Valle, Y; Cruz, A; Alatorre-Meza, A I; Oregon-Romero, E

    2016-11-03

    Interleukin 10 (IL-10) is an immunoregulatory cytokine with multiple roles in the immune system. Three single nucleotide polymorphisms at positions -1082 (A>G), -819 (C>T), and -592 (C>A) in the promoter region of the IL10 gene are believed to be associated with different inflammatory, infectious, and autoimmune diseases. These polymorphisms exhibit a strong linkage disequilibrium (LD) and form three principal haplotypes (GCC, ACC, and ATA). The GCC and ATA haplotypes have been associated with high and low levels of IL-10 production, respectively. The aim of this study was to establish the allele and haplotype frequencies of the IL10 polymorphisms in Mestizos from western Mexico. SNPs were analyzed in 340 healthy unrelated Mestizos from western Mexico by polymerase chain reaction-restriction fragment length polymorphism. The studied population presented significant differences, in the distribution of IL10 polymorphisms, from the Asian, African, and European populations. We also observed a strong LD within -1082 A>G, -819 C>T, and -592 C>A (100% pc = 7.735 x 10 -18 ). The haplotypes ACC (45.4%), ATA (22.0%), GTA (14.9%), and GCC (13.9%) were most frequently observed in this population. The haplotype frequencies, however, differed from those reported previously in Mestizos from central Mexico, Asians, Africans, and European Caucasians, suggesting a differential gene flow in the Mexican Mestizo population. This could account for the genetic variability between Mexicans and populations of other ethnicities. The study of these polymorphisms and their haplotypes could help in expanding our knowledge to design future disease-risk studies on the western Mexican population.

  3. Influence of promoter/enhancer region haplotypes on MGMT transcriptional regulation: a potential biomarker for human sensitivity to alkylating agents.

    PubMed

    Xu, Meixiang; Nekhayeva, Ilona; Cross, Courtney E; Rondelli, Catherine M; Wickliffe, Jeffrey K; Abdel-Rahman, Sherif Z

    2014-03-01

    The O6-methylguanine-DNA methyltransferase gene (MGMT) encodes the direct reversal DNA repair protein that removes alkyl adducts from the O6 position of guanine. Several single-nucleotide polymorphisms (SNPs) exist in the MGMT promoter/enhancer (P/E) region. However, the haplotype structure encompassing these SNPs and their functional/biological significance are currently unknown. We hypothesized that MGMT P/E haplotypes, rather than individual SNPs, alter MGMT transcription and can thus alter human sensitivity to alkylating agents. To identify the haplotype structure encompassing the MGMT P/E region SNPs, we sequenced 104 DNA samples from healthy individuals and inferred the haplotypes using the data generated. We identified eight SNPs in this region, namely T7C (rs180989103), T135G (rs1711646), G290A (rs61859810), C485A (rs1625649), C575A (rs113813075), G666A (rs34180180), C777A (rs34138162) and C1099T (rs16906252). Phylogenetics and Sequence Evolution analysis predicted 21 potential haplotypes that encompass these SNPs ranging in frequencies from 0.000048 to 0.39. Of these, 10 were identified in our study population as 20 paired haplotype combinations. To determine the functional significance of these haplotypes, luciferase reporter constructs representing these haplotypes were transfected into glioblastoma cells and their effect on MGMT promoter activity was determined. Compared with the most common (reference) haplotype 1, seven haplotypes significantly upregulated MGMT promoter activity (18-119% increase; P < 0.05), six significantly downregulated MGMT promoter activity (29-97% decrease; P < 0.05) and one haplotype had no effect. Mechanistic studies conducted support the conclusion that MGMT P/E haplotypes, rather than individual SNPs, differentially regulate MGMT transcription and could thus play a significant role in human sensitivity to environmental and therapeutic alkylating agents.

  4. Founder haplotype analysis of Fanconi anemia in the Korean population finds common ancestral haplotypes for a FANCG variant.

    PubMed

    Park, Joonhong; Kim, Myungshin; Jang, Woori; Chae, Hyojin; Kim, Yonggoo; Chung, Nack-Gyun; Lee, Jae-Wook; Cho, Bin; Jeong, Dae-Chul; Park, In Yang; Park, Mi Sun

    2015-05-01

    A common ancestral haplotype is strongly suggested in the Korean and Japanese patients with Fanconi anemia (FA), because common mutations have been frequently found: c.2546delC and c.3720_3724delAAACA of FANCA; c.307+1G>C, c.1066C>T, and c.1589_1591delATA of FANCG. Our aim in this study was to investigate the origin of these common mutations of FANCA and FANCG. We genotyped 13 FA patients consisting of five FA-A patients and eight FA-G patients from the Korean FA population. Microsatellite markers used for haplotype analysis included four CA repeat markers which are closely linked with FANCA and eight CA repeat markers which are contiguous with FANCG. As a result, Korean FA-A patients carrying c.2546delC or c.3720_3724delAAACA did not share the same haplotypes. However, three unique haplotypes carrying c.307+1G>C, c.1066C > T, or c.1589_1591delATA, that consisted of eight polymorphic loci covering a flanking region were strongly associated with Korean FA-G, consistent with founder haplotypes reported previously in the Japanese FA-G population. Our finding confirmed the common ancestral haplotypes on the origins of the East Asian FA-G patients, which will improve our understanding of the molecular population genetics of FA-G. To the best of our knowledge, this is the first report on the association between disease-linked mutations and common ancestral haplotypes in the Korean FA population. © 2015 John Wiley & Sons Ltd/University College London.

  5. Haplotype frequency distribution for 7 microsatellites in chromosome 8 and 11 in relation to the metabolic syndrome in four ethnic groups: Tehran Lipid and Glucose Study.

    PubMed

    Daneshpour, Maryam Sadat; Hosseinzadeh, Nima; Zarkesh, Maryam; Azizi, Fereidoun

    2012-03-01

    Different variants of haplotype frequencies may lead to various frequencies of the same variants in individuals with drug resistance and disease susceptibility at the population level. In this study, the haplotype frequencies of 4 STR loci including the D8S1132, D8S1779, D8S514 and D8S1743, and 3 STR loci including D11S1304, D11S1998 and D11S934 were investigated in 563 individuals of four Iranian ethnic groups in the capital city of Iran, Tehran. One hundred thirty subjects had the metabolic syndrome. Haplotype frequencies of all markers were calculated. There were significant differences in the haplotype frequencies in short and long alleles between the metabolic affected subjects and controls. In addition, haplotype frequencies were significant in the four ethnic groups in both chromosomes 8 and 11. Our findings show a relation between the short allele of D8S1743 in all related haplotype frequencies of subjects with metabolic syndrome. These findings may require more studies of some candidate genes, including the lipoprotein lipase gene, in this chromosomal region. Copyright © 2011. Published by Elsevier B.V.

  6. BMP4 and FGF3 haplotypes increase the risk of tendinopathy in volleyball athletes.

    PubMed

    Salles, José Inácio; Amaral, Marcus Vinícius; Aguiar, Diego Pinheiro; Lira, Daisy Anne; Quinelato, Valquiria; Bonato, Letícia Ladeira; Duarte, Maria Eugenia Leite; Vieira, Alexandre Rezende; Casado, Priscila Ladeira

    2015-03-01

    To investigate whether genetic variants can be correlated with tendinopathy in elite male volleyball athletes. Case-control study. Fifteen single nucleotide polymorphisms within BMP4, FGF3, FGF10, FGFR1 genes were investigated in 138 elite volleyball athletes, aged between 18 and 35 years, who undergo 4-5h of training per day: 52 with tendinopathy and 86 with no history of pain suggestive of tendinopathy in patellar, Achilles, shoulder, and hip abductors tendons. The clinical diagnostic criterion was progressive pain during training, confirmed by magnetic resonance image. Genomic DNA was obtained from saliva samples. Genetic markers were genotyped using TaqMan real-time PCR. Chi-square test compared genotypes and haplotype differences between groups. Multivariate logistic regression analyzed the significance of covariates and incidence of tendinopathy. Statistical analysis revealed participant age (p=0.005) and years of practice (p=0.004) were risk factors for tendinopathy. A significant association between BMP4 rs2761884 (p=0.03) and tendinopathy was observed. Athletes with a polymorphic genotype have 2.4 times more susceptibility to tendinopathy (OR=2.39; 95%CI=1.10-5.19). Also, association between disease and haplotype TTGGA in BMP4 (p=0.01) was observed. The FGF3 TGGTA haplotype showed a tendency of association with tendinopathy (p=0.05), and so did FGF10 rs900379. FGFR1 showed no association with disease. These findings indicate that haplotypes in BMP4 and FGF3 genes may contribute to the tendon disease process in elite volleyball athletes. Copyright © 2014 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.

  7. Performance Assessment of Kernel Density Clustering for Gene Expression Profile Data

    PubMed Central

    Zeng, Beiyan; Chen, Yiping P.; Smith, Oscar H.

    2003-01-01

    Kernel density smoothing techniques have been used in classification or supervised learning of gene expression profile (GEP) data, but their applications to clustering or unsupervised learning of those data have not been explored and assessed. Here we report a kernel density clustering method for analysing GEP data and compare its performance with the three most widely-used clustering methods: hierarchical clustering, K-means clustering, and multivariate mixture model-based clustering. Using several methods to measure agreement, between-cluster isolation, and withincluster coherence, such as the Adjusted Rand Index, the Pseudo F test, the r2 test, and the profile plot, we have assessed the effectiveness of kernel density clustering for recovering clusters, and its robustness against noise on clustering both simulated and real GEP data. Our results show that the kernel density clustering method has excellent performance in recovering clusters from simulated data and in grouping large real expression profile data sets into compact and well-isolated clusters, and that it is the most robust clustering method for analysing noisy expression profile data compared to the other three methods assessed. PMID:18629292

  8. F8 haplotype and inhibitor risk: results from the Hemophilia Inhibitor Genetics Study (HIGS) Combined Cohort

    PubMed Central

    Schwarz, John; Astermark, Jan; Menius, Erika D.; Carrington, Mary; Donfield, Sharyne M.; Gomperts, Edward D.; Nelson, George W.; Oldenburg, Johannes; Pavlova, Anna; Shapiro, Amy D.; Winkler, Cheryl A.; Berntorp, Erik

    2012-01-01

    Background Ancestral background, specifically African descent, confers higher risk for development of inhibitory antibodies to factor VIII (FVIII) in hemophilia A. It has been suggested that differences in the distribution of factor VIII gene (F8) haplotypes, and mismatch between endogenous F8 haplotypes and those comprising products used for treatment could contribute to risk. Design and Methods Data from the HIGS Combined Cohort were used to determine the association between F8 haplotype 3 (H3) vs. haplotypes 1 and 2 (H1+H2) and inhibitor risk among individuals of genetically-determined African descent. Other variables known to affect inhibitor risk including type of F8 mutation and HLA were included in the analysis. A second research question regarding risk related to mismatch in endogenous F8 haplotype and recombinant FVIII products used for treatment was addressed. Results H3 was associated with higher inhibitor risk among those genetically-identified (N=49) as of African ancestry, but the association did not remain significant after adjustment for F8 mutation type and the HLA variables. Among subjects of all racial ancestries enrolled in HIGS who reported early use of recombinant products (N=223), mismatch in endogenous haplotype and the FVIII proteins constituting the products used did not confer greater risk for inhibitor development. Conclusion H3 was not an independent predictor of inhibitor risk. Further, our findings did not support a higher risk of inhibitors in the presence of a haplotype mismatch between the FVIII molecule infused and that of the individual. PMID:22958194

  9. Origins and Domestication of Cultivated Banana Inferred from Chloroplast and Nuclear Genes

    PubMed Central

    Zhang, Cui; Wang, Xin-Feng; Shi, Feng-Xue; Chen, Wen-Na; Ge, Xue-Jun

    2013-01-01

    Background Cultivated bananas are large, vegetatively-propagated members of the genus Musa. More than 1,000 cultivars are grown worldwide and they are major economic and food resources in numerous developing countries. It has been suggested that cultivated bananas originated from the islands of Southeast Asia (ISEA) and have been developed through complex geodomestication pathways. However, the maternal and parental donors of most cultivars are unknown, and the pattern of nucleotide diversity in domesticated banana has not been fully resolved. Methodology/Principal Findings We studied the genetics of 16 cultivated and 18 wild Musa accessions using two single-copy nuclear (granule-bound starch synthase I, GBSS I, also known as Waxy, and alcohol dehydrogenase 1, Adh1) and two chloroplast (maturase K, matK, and the trnL-F gene cluster) genes. The results of phylogenetic analyses showed that all A-genome haplotypes of cultivated bananas were grouped together with those of ISEA subspecies of M. acuminata (A-genome). Similarly, the B- and S-genome haplotypes of cultivated bananas clustered with the wild species M. balbisiana (B-genome) and M. schizocarpa (S-genome), respectively. Notably, it has been shown that distinct haplotypes of each cultivar (A-genome group) were nested together to different ISEA subspecies M. acuminata. Analyses of nucleotide polymorphism in the Waxy and Adh1 genes revealed that, in comparison to the wild relatives, cultivated banana exhibited slightly lower nucleotide diversity both across all sites and specifically at silent sites. However, dramatically reduced nucleotide diversity was found at nonsynonymous sites for cultivated bananas. Conclusions/Significance Our study not only confirmed the origin of cultivated banana as arising from multiple intra- and inter-specific hybridization events, but also showed that cultivated banana may have not suffered a severe genetic bottleneck during the domestication process. Importantly, our findings

  10. Decreased calcium pump expression in human erythrocytes is connected to a minor haplotype in the ATP2B4 gene.

    PubMed

    Zámbó, Boglárka; Várady, György; Padányi, Rita; Szabó, Edit; Németh, Adrienn; Langó, Tamás; Enyedi, Ágnes; Sarkadi, Balázs

    2017-07-01

    Plasma membrane Ca 2+ -ATPases are key calcium exporter proteins in most tissues, and PMCA4b is the main calcium transporter in the human red blood cells (RBCs). In order to assess the expression level of PMCA4b, we have developed a flow cytometry and specific antibody binding method to quantitatively detect this protein in the erythrocyte membrane. Interestingly, we found several healthy volunteers showing significantly reduced expression of RBC-PMCA4b. Western blot analysis of isolated RBC membranes confirmed this observation, and indicated that there are no compensatory alterations in other PMCA isoforms. In addition, reduced PMCA4b levels correlated with a lower calcium extrusion capacity in these erythrocytes. When exploring the potential genetic background of the reduced PMCA4b levels, we found no missense mutations in the ATP2B4 coding regions, while a formerly unrecognized minor haplotype in the predicted second promoter region closely correlated with lower erythrocyte PMCA4b protein levels. In recent GWA studies, SNPs in this ATP2B4 haplotype have been linked to reduced mean corpuscular hemoglobin concentrations (MCHC), and to protection against malaria infection. Our data suggest that an altered regulation of gene expression is responsible for the reduced RBC-PMCA4b levels that is probably linked to the development of human disease-related phenotypes. Copyright © 2017 Elsevier Ltd. All rights reserved.

  11. Evolution of coding and non-coding genes in HOX clusters of a marsupial.

    PubMed

    Yu, Hongshi; Lindsay, James; Feng, Zhi-Ping; Frankenberg, Stephen; Hu, Yanqiu; Carone, Dawn; Shaw, Geoff; Pask, Andrew J; O'Neill, Rachel; Papenfuss, Anthony T; Renfree, Marilyn B

    2012-06-18

    The HOX gene clusters are thought to be highly conserved amongst mammals and other vertebrates, but the long non-coding RNAs have only been studied in detail in human and mouse. The sequencing of the kangaroo genome provides an opportunity to use comparative analyses to compare the HOX clusters of a mammal with a distinct body plan to those of other mammals. Here we report a comparative analysis of HOX gene clusters between an Australian marsupial of the kangaroo family and the eutherians. There was a strikingly high level of conservation of HOX gene sequence and structure and non-protein coding genes including the microRNAs miR-196a, miR-196b, miR-10a and miR-10b and the long non-coding RNAs HOTAIR, HOTAIRM1 and HOXA11AS that play critical roles in regulating gene expression and controlling development. By microRNA deep sequencing and comparative genomic analyses, two conserved microRNAs (miR-10a and miR-10b) were identified and one new candidate microRNA with typical hairpin precursor structure that is expressed in both fibroblasts and testes was found. The prediction of microRNA target analysis showed that several known microRNA targets, such as miR-10, miR-414 and miR-464, were found in the tammar HOX clusters. In addition, several novel and putative miRNAs were identified that originated from elsewhere in the tammar genome and that target the tammar HOXB and HOXD clusters. This study confirms that the emergence of known long non-coding RNAs in the HOX clusters clearly predate the marsupial-eutherian divergence 160 Ma ago. It also identified a new potentially functional microRNA as well as conserved miRNAs. These non-coding RNAs may participate in the regulation of HOX genes to influence the body plan of this marsupial.

  12. Evolution of coding and non-coding genes in HOX clusters of a marsupial

    PubMed Central

    2012-01-01

    Background The HOX gene clusters are thought to be highly conserved amongst mammals and other vertebrates, but the long non-coding RNAs have only been studied in detail in human and mouse. The sequencing of the kangaroo genome provides an opportunity to use comparative analyses to compare the HOX clusters of a mammal with a distinct body plan to those of other mammals. Results Here we report a comparative analysis of HOX gene clusters between an Australian marsupial of the kangaroo family and the eutherians. There was a strikingly high level of conservation of HOX gene sequence and structure and non-protein coding genes including the microRNAs miR-196a, miR-196b, miR-10a and miR-10b and the long non-coding RNAs HOTAIR, HOTAIRM1 and HOXA11AS that play critical roles in regulating gene expression and controlling development. By microRNA deep sequencing and comparative genomic analyses, two conserved microRNAs (miR-10a and miR-10b) were identified and one new candidate microRNA with typical hairpin precursor structure that is expressed in both fibroblasts and testes was found. The prediction of microRNA target analysis showed that several known microRNA targets, such as miR-10, miR-414 and miR-464, were found in the tammar HOX clusters. In addition, several novel and putative miRNAs were identified that originated from elsewhere in the tammar genome and that target the tammar HOXB and HOXD clusters. Conclusions This study confirms that the emergence of known long non-coding RNAs in the HOX clusters clearly predate the marsupial-eutherian divergence 160 Ma ago. It also identified a new potentially functional microRNA as well as conserved miRNAs. These non-coding RNAs may participate in the regulation of HOX genes to influence the body plan of this marsupial. PMID:22708672

  13. Identification of the Regulator Gene Responsible for the Acetone-Responsive Expression of the Binuclear Iron Monooxygenase Gene Cluster in Mycobacteria ▿

    PubMed Central

    Furuya, Toshiki; Hirose, Satomi; Semba, Hisashi; Kino, Kuniki

    2011-01-01

    The mimABCD gene cluster encodes the binuclear iron monooxygenase that oxidizes propane and phenol in Mycobacterium smegmatis strain MC2 155 and Mycobacterium goodii strain 12523. Interestingly, expression of the mimABCD gene cluster is induced by acetone. In this study, we investigated the regulator gene responsible for this acetone-responsive expression. In the genome sequence of M. smegmatis strain MC2 155, the mimABCD gene cluster is preceded by a gene designated mimR, which is divergently transcribed. Sequence analysis revealed that MimR exhibits amino acid similarity with the NtrC family of transcriptional activators, including AcxR and AcoR, which are involved in acetone and acetoin metabolism, respectively. Unexpectedly, many homologs of the mimR gene were also found in the sequenced genomes of actinomycetes. A plasmid carrying a transcriptional fusion of the intergenic region between the mimR and mimA genes with a promoterless green fluorescent protein (GFP) gene was constructed and introduced into M. smegmatis strain MC2 155. Using a GFP reporter system, we confirmed by deletion and complementation analyses that the mimR gene product is the positive regulator of the mimABCD gene cluster expression that is responsive to acetone. M. goodii strain 12523 also utilized the same regulatory system as M. smegmatis strain MC2 155. Although transcriptional activators of the NtrC family generally control transcription using the σ54 factor, a gene encoding the σ54 factor was absent from the genome sequence of M. smegmatis strain MC2 155. These results suggest the presence of a novel regulatory system in actinomycetes, including mycobacteria. PMID:21856847

  14. Analysis of the cytochrome c oxidase subunit II (COX2) gene in giant panda, Ailuropoda melanoleuca.

    PubMed

    Ling, S S; Zhu, Y; Lan, D; Li, D S; Pang, H Z; Wang, Y; Li, D Y; Wei, R P; Zhang, H M; Wang, C D; Hu, Y D

    2017-01-23

    The giant panda, Ailuropoda melanoleuca (Ursidae), has a unique bamboo-based diet; however, this low-energy intake has been sufficient to maintain the metabolic processes of this species since the fourth ice age. As mitochondria are the main sites for energy metabolism in animals, the protein-coding genes involved in mitochondrial respiratory chains, particularly cytochrome c oxidase subunit II (COX2), which is the rate-limiting enzyme in electron transfer, could play an important role in giant panda metabolism. Therefore, the present study aimed to isolate, sequence, and analyze the COX2 DNA from individuals kept at the Giant Panda Protection and Research Center, China, and compare these sequences with those of the other Ursidae family members. Multiple sequence alignment showed that the COX2 gene had three point mutations that defined three haplotypes, with 60% of the sequences corresponding to haplotype I. The neutrality tests revealed that the COX2 gene was conserved throughout evolution, and the maximum likelihood phylogenetic analysis, using homologous sequences from other Ursidae species, showed clustering of the COX2 sequences of giant pandas, suggesting that this gene evolved differently in them.

  15. Phylogeny and Haplotype Analysis of Fungi Within the Fusarium incarnatum-equiseti Species Complex.

    PubMed

    Ramdial, H; Latchoo, R K; Hosein, F N; Rampersad, S N

    2017-01-01

    Fusarium spp. are ranked among the top 10 most economically and scientifically important plant-pathogenic fungi in the world and are associated with plant diseases that include fruit decay of a number of crops. Fusarium isolates infecting bell pepper in Trinidad were identified based on sequence comparisons of the translation elongation factor gene (EF-1a) with sequences of Fusarium incarnatum-equiseti species complex (FIESC) verified in the FUSARIUM-ID database. Eighty-two isolates were identified as belonging to one of four phylogenetic species within the subclades FIESC-1, FIESC-15, FIESC-16, and FIESC-26, with the majority of isolates belonging to FIESC-15. A comparison of the level of DNA polymorphism and phylogenetic inference for sequences of the internal transcribed spacer region (ITS1-5.8S-ITS2) and EF-1a sequences for Trinidad and FUSARIUM-ID type species was carried out. The ITS sequences were less informative, had lower haplotype diversity and restricted haplotype distribution, and resulted in poor resolution and taxa placement in the consensus maximum-likelihood tree. EF-1a sequences enabled strongly supported phylogenetic inference with highly resolved branching patterns of the 30 phylogenetic species within the FIESC and placement of representative Trinidad isolates. Therefore, global phylogeny was inferred from EF-1a sequences representing 11 countries, and separation into distinct Incarnatum and Equiseti clades was again evident. In total, 42 haplotypes were identified: 12 were shared and the remaining were unique haplotypes. The most diverse haplotype was represented by sequences from China, Indonesia, Malaysia, and Trinidad and consisted exclusively of F. incarnatum isolates. Spain had the highest haplotype diversity, perhaps because both F. equiseti and F. incarnatum sequences were represented; followed by the United States, which contributed both F. equiseti and F. incarnatum sequences to the data set; then by countries representing Southeast

  16. The Glucuronic Acid Utilization Gene Cluster from Bacillus stearothermophilus T-6

    PubMed Central

    Shulami, Smadar; Gat, Orit; Sonenshein, Abraham L.; Shoham, Yuval

    1999-01-01

    A λ-EMBL3 genomic library of Bacillus stearothermophilus T-6 was screened for hemicellulolytic activities, and five independent clones exhibiting β-xylosidase activity were isolated. The clones overlap each other and together represent a 23.5-kb chromosomal segment. The segment contains a cluster of xylan utilization genes, which are organized in at least three transcriptional units. These include the gene for the extracellular xylanase, xylanase T-6; part of an operon coding for an intracellular xylanase and a β-xylosidase; and a putative 15.5-kb-long transcriptional unit, consisting of 12 genes involved in the utilization of α-d-glucuronic acid (GlcUA). The first four genes in the potential GlcUA operon (orf1, -2, -3, and -4) code for a putative sugar transport system with characteristic components of the binding-protein-dependent transport systems. The most likely natural substrate for this transport system is aldotetraouronic acid [2-O-α-(4-O-methyl-α-d-glucuronosyl)-xylotriose] (MeGlcUAXyl3). The following two genes code for an intracellular α-glucuronidase (aguA) and a β-xylosidase (xynB). Five more genes (kdgK, kdgA, uxaC, uxuA, and uxuB) encode proteins that are homologous to enzymes involved in galacturonate and glucuronate catabolism. The gene cluster also includes a potential regulatory gene, uxuR, the product of which resembles repressors of the GntR family. The apparent transcriptional start point of the cluster was determined by primer extension analysis and is located 349 bp from the initial ATG codon. The potential operator site is a perfect 12-bp inverted repeat located downstream from the promoter between nucleotides +170 and +181. Gel retardation assays indicated that UxuR binds specifically to this sequence and that this binding is efficiently prevented in vitro by MeGlcUAXyl3, the most likely molecular inducer. PMID:10368143

  17. Distribution of Suicin Gene Clusters in Streptococcus suis Serotype 2 Belonging to Sequence Types 25 and 28.

    PubMed

    Athey, Taryn B T; Vaillancourt, Katy; Frenette, Michel; Fittipaldi, Nahuel; Gottschalk, Marcelo; Grenier, Daniel

    2016-01-01

    Recently, we reported the purification and characterization of three distinct lantibiotics (named suicin 90-1330, suicin 3908, and suicin 65) produced by Streptococcus suis . In this study, we investigated the distribution of the three suicin lantibiotic gene clusters among serotype 2 S. suis strains belonging to sequence type (ST) 25 and ST28, the two dominant STs identified in North America. The genomes of 102 strains were interrogated for the presence of suicin gene clusters encoding suicins 90-1330, 3908, and 65. The gene cluster encoding suicin 65 was the most prevalent and mainly found among ST25 strains. In contrast, none of the genes related to suicin 90-1330 production were identified in 51 ST25 strains nor in 35/51 ST28 strains. However, the complete suicin 90-1330 gene cluster was found in ten ST28 strains, although some genes in the cluster were truncated in three of these isolates. The vast majority (101/102) of S. suis strains did not possess any of the genes encoding suicin 3908. In conclusion, this study indicates heterogeneous distribution of suicin genes in S. suis .

  18. Form gene clustering method about pan-ethnic-group products based on emotional semantic

    NASA Astrophysics Data System (ADS)

    Chen, Dengkai; Ding, Jingjing; Gao, Minzhuo; Ma, Danping; Liu, Donghui

    2016-09-01

    The use of pan-ethnic-group products form knowledge primarily depends on a designer's subjective experience without user participation. The majority of studies primarily focus on the detection of the perceptual demands of consumers from the target product category. A pan-ethnic-group products form gene clustering method based on emotional semantic is constructed. Consumers' perceptual images of the pan-ethnic-group products are obtained by means of product form gene extraction and coding and computer aided product form clustering technology. A case of form gene clustering about the typical pan-ethnic-group products is investigated which indicates that the method is feasible. This paper opens up a new direction for the future development of product form design which improves the agility of product design process in the era of Industry 4.0.

  19. Family-based association study of matrix metalloproteinase-3 and -9 haplotypes with susceptibility to ischemic white matter injury.

    PubMed

    Fornage, Myriam; Mosley, Thomas H; Jack, Clifford R; de Andrade, Mariza; Kardia, Sharon L R; Boerwinkle, Eric; Turner, Stephen T

    2007-01-01

    Susceptibility to ischemic damage to the subcortical white matter of the brain has a strong genetic basis. Dysregulation of matrix metalloproteinases (MMPs) contributes to loss of cerebrovascular integrity and white matter injury. We investigated whether sequence variation in the genes encoding MMP3 and MMP9 is associated with variation in leukoaraiosis volume, determined by magnetic resonance imaging, in non-Hispanic whites and African-Americans using family-based association tests. Seven hundred and fifty-six white and 671 African-American individuals from sibships ascertained through two or more siblings with hypertension were genotyped for 7 and 8 haplotype-tagging polymorphisms in the MMP3 and MMP9 genes, respectively. MMP3 sequence variation was significantly associated with variation in leukoaraiosis volume in Whites. Two common haplotypes with opposing relationships to leukoaraiosis volume were identified. MMP9 sequence variation was also significantly associated with variation in leukoaraiosis volume in both African-Americans and Whites. Different haplotypes contributed to these associations in the two racial groups. These findings add to the growing body of evidence from animal models and human clinical studies suggesting a role of MMPs in ischemic white matter injury. They provide the basis for further investigation of the role of these genes in susceptibility and/or progression to clinical disease.

  20. Intact cluster and chordate-like expression of ParaHox genes in a sea star

    PubMed Central

    2013-01-01

    Background The ParaHox genes are thought to be major players in patterning the gut of several bilaterian taxa. Though this is a fundamental role that these transcription factors play, their activities are not limited to the endoderm and extend to both ectodermal and mesodermal tissues. Three genes compose the ParaHox group: Gsx, Xlox and Cdx. In some taxa (mostly chordates but to some degree also in protostomes) the three genes are arranged into a genomic cluster, in a similar fashion to what has been shown for the better-known Hox genes. Sea urchins possess the full complement of ParaHox genes but they are all dispersed throughout the genome, an arrangement that, perhaps, represented the primitive condition for all echinoderms. In order to understand the evolutionary history of this group of genes we cloned and characterized all ParaHox genes, studied their expression patterns and identified their genomic loci in a member of an earlier branching group of echinoderms, the asteroid Patiria miniata. Results We identified the three ParaHox orthologs in the genome of P. miniata. While one of them, PmGsx is provided as maternal message, with no zygotic activation afterwards, the other two, PmLox and PmCdx are expressed during embryogenesis, within restricted domains of both endoderm and ectoderm. Screening of a Patiria bacterial artificial chromosome (BAC) library led to the identification of a clone containing the three genes. The transcriptional directions of PmGsx and PmLox are opposed to that of the PmCdx gene within the cluster. Conclusions The identification of P. miniata ParaHox genes has revealed the fact that these genes are clustered in the genome, in contrast to what has been reported for echinoids. Since the presence of an intact cluster, or at least a partial cluster, has been reported in chordates and polychaetes respectively, it becomes clear that within echinoderms, sea urchins have modified the original bilaterian arrangement. Moreover, the sea star

  1. Comparison of expression of secondary metabolite biosynthesis cluster genes in Aspergillus flavus, A. parasiticus, and A. oryzae.

    PubMed

    Ehrlich, Kenneth C; Mack, Brian M

    2014-06-23

    Fifty six secondary metabolite biosynthesis gene clusters are predicted to be in the Aspergillus flavus genome. In spite of this, the biosyntheses of only seven metabolites, including the aflatoxins, kojic acid, cyclopiazonic acid and aflatrem, have been assigned to a particular gene cluster. We used RNA-seq to compare expression of secondary metabolite genes in gene clusters for the closely related fungi A. parasiticus, A. oryzae, and A. flavus S and L sclerotial morphotypes. The data help to refine the identification of probable functional gene clusters within these species. Our results suggest that A. flavus, a prevalent contaminant of maize, cottonseed, peanuts and tree nuts, is capable of producing metabolites which, besides aflatoxin, could be an underappreciated contributor to its toxicity.

  2. Comparison of Expression of Secondary Metabolite Biosynthesis Cluster Genes in Aspergillus flavus, A. parasiticus, and A. oryzae

    PubMed Central

    Ehrlich, Kenneth C.; Mack, Brian M.

    2014-01-01

    Fifty six secondary metabolite biosynthesis gene clusters are predicted to be in the Aspergillus flavus genome. In spite of this, the biosyntheses of only seven metabolites, including the aflatoxins, kojic acid, cyclopiazonic acid and aflatrem, have been assigned to a particular gene cluster. We used RNA-seq to compare expression of secondary metabolite genes in gene clusters for the closely related fungi A. parasiticus, A. oryzae, and A. flavus S and L sclerotial morphotypes. The data help to refine the identification of probable functional gene clusters within these species. Our results suggest that A. flavus, a prevalent contaminant of maize, cottonseed, peanuts and tree nuts, is capable of producing metabolites which, besides aflatoxin, could be an underappreciated contributor to its toxicity. PMID:24960201

  3. A recently transferred cluster of bacterial genes in Trichomonas vaginalis - lateral gene transfer and the fate of acquired genes

    PubMed Central

    2014-01-01

    Background Lateral Gene Transfer (LGT) has recently gained recognition as an important contributor to some eukaryote proteomes, but the mechanisms of acquisition and fixation in eukaryotic genomes are still uncertain. A previously defined norm for LGTs in microbial eukaryotes states that the majority are genes involved in metabolism, the LGTs are typically localized one by one, surrounded by vertically inherited genes on the chromosome, and phylogenetics shows that a broad collection of bacterial lineages have contributed to the transferome. Results A unique 34 kbp long fragment with 27 clustered genes (TvLF) of prokaryote origin was identified in the sequenced genome of the protozoan parasite Trichomonas vaginalis. Using a PCR based approach we confirmed the presence of the orthologous fragment in four additional T. vaginalis strains. Detailed sequence analyses unambiguously suggest that TvLF is the result of one single, recent LGT event. The proposed donor is a close relative to the firmicute bacterium Peptoniphilus harei. High nucleotide sequence similarity between T. vaginalis strains, as well as to P. harei, and the absence of homologs in other Trichomonas species, suggests that the transfer event took place after the radiation of the genus Trichomonas. Some genes have undergone pseudogenization and degradation, indicating that they may not be retained in the future. Functional annotations reveal that genes involved in informational processes are particularly prone to degradation. Conclusions We conclude that, although the majority of eukaryote LGTs are single gene occurrences, they may be acquired in clusters of several genes that are subsequently cleansed of evolutionarily less advantageous genes. PMID:24898731

  4. Clustering by soft-constraint affinity propagation: applications to gene-expression data.

    PubMed

    Leone, Michele; Sumedha; Weigt, Martin

    2007-10-15

    Similarity-measure-based clustering is a crucial problem appearing throughout scientific data analysis. Recently, a powerful new algorithm called Affinity Propagation (AP) based on message-passing techniques was proposed by Frey and Dueck (2007a). In AP, each cluster is identified by a common exemplar all other data points of the same cluster refer to, and exemplars have to refer to themselves. Albeit its proved power, AP in its present form suffers from a number of drawbacks. The hard constraint of having exactly one exemplar per cluster restricts AP to classes of regularly shaped clusters, and leads to suboptimal performance, e.g. in analyzing gene expression data. This limitation can be overcome by relaxing the AP hard constraints. A new parameter controls the importance of the constraints compared to the aim of maximizing the overall similarity, and allows to interpolate between the simple case where each data point selects its closest neighbor as an exemplar and the original AP. The resulting soft-constraint affinity propagation (SCAP) becomes more informative, accurate and leads to more stable clustering. Even though a new a priori free parameter is introduced, the overall dependence of the algorithm on external tuning is reduced, as robustness is increased and an optimal strategy for parameter selection emerges more naturally. SCAP is tested on biological benchmark data, including in particular microarray data related to various cancer types. We show that the algorithm efficiently unveils the hierarchical cluster structure present in the data sets. Further on, it allows to extract sparse gene expression signatures for each cluster.

  5. No Association between Personality and Candidate Gene Polymorphisms in a Wild Bird Population

    PubMed Central

    Durieux, Gillian; Burke, Terry; Dugdale, Hannah L.

    2015-01-01

    Consistency of between-individual differences in behaviour or personality is a phenomenon in populations that can have ecological consequences and evolutionary potential. One way that behaviour can evolve is to have a genetic basis. Identifying the molecular genetic basis of personality could therefore provide insight into how and why such variation is maintained, particularly in natural populations. Previously identified candidate genes for personality in birds include the dopamine receptor D4 (DRD4), and serotonin transporter (SERT). Studies of wild bird populations have shown that exploratory and bold behaviours are associated with polymorphisms in both DRD4 and SERT. Here we tested for polymorphisms in DRD4 and SERT in the Seychelles warbler (Acrocephalus sechellensis) population on Cousin Island, Seychelles, and then investigated correlations between personality and polymorphisms in these genes. We found no genetic variation in DRD4, but identified four polymorphisms in SERT that clustered into five haplotypes. There was no correlation between bold or exploratory behaviours and SERT polymorphisms/haplotypes. The null result was not due to lack of power, and indicates that there was no association between these behaviours and variation in the candidate genes tested in this population. These null findings provide important data to facilitate representative future meta-analyses on candidate personality genes. PMID:26473495

  6. Population genetics of Southern Hemisphere tope shark (Galeorhinus galeus): Intercontinental divergence and constrained gene flow at different geographical scales.

    PubMed

    Bester-van der Merwe, Aletta E; Bitalo, Daphne; Cuevas, Juan M; Ovenden, Jennifer; Hernández, Sebastián; da Silva, Charlene; McCord, Meaghen; Roodt-Wilding, Rouvay

    2017-01-01

    The tope shark (Galeorhinus galeus Linnaeus, 1758) is a temperate, coastal hound shark found in the Atlantic and Indo-Pacific oceans. In this study, the population structure of Galeorhinus galeus was determined across the entire Southern Hemisphere, where the species is heavily targeted by commercial fisheries, as well as locally, along the South African coastline. Analysis was conducted on a total of 185 samples using 19 microsatellite markers and a 671 bp fragment of the NADH dehydrogenase subunit 2 (ND2) gene. Across the Southern Hemisphere, three geographically distinct clades were recovered, including one from South America (Argentina, Chile), one from Africa (all the South African collections) and an Australia-New Zealand clade. Nuclear data revealed significant population subdivisions (FST = 0.192 to 0.376, p<0.05) indicating limited gene flow for tope sharks across ocean basins. Marked population connectivity was however evident across the Indian Ocean based on Bayesian clustering analysis. More locally in South Africa, F-statistics and multivariate analysis supported moderate to high gene flow across the Atlantic/Indian Ocean boundary (FST = 0.035 to 0.044, p<0.05), with exception of samples from Struisbaai and Port Elizabeth which differed significantly from the rest. Discriminant and Bayesian clustering analysis indicated admixture in all sampling populations, decreasing from west to east, corroborating possible restriction to gene flow across regional oceanographic barriers. Mitochondrial sequence data recovered seven haplotypes (h = 0.216, π = 0.001) for South Africa, with one major haplotype shared by 87% of the individuals and at least one private haplotype for each sampling location except Port Elizabeth. As with many other coastal shark species with cosmopolitan distribution, this study confirms the lack of both historical dispersal and inter-oceanic gene flow while also implicating contemporary factors such as oceanic currents and thermal fronts

  7. Population genetics of Southern Hemisphere tope shark (Galeorhinus galeus): Intercontinental divergence and constrained gene flow at different geographical scales

    PubMed Central

    Cuevas, Juan M.; Ovenden, Jennifer; Hernández, Sebastián; da Silva, Charlene; McCord, Meaghen; Roodt-Wilding, Rouvay

    2017-01-01

    The tope shark (Galeorhinus galeus Linnaeus, 1758) is a temperate, coastal hound shark found in the Atlantic and Indo-Pacific oceans. In this study, the population structure of Galeorhinus galeus was determined across the entire Southern Hemisphere, where the species is heavily targeted by commercial fisheries, as well as locally, along the South African coastline. Analysis was conducted on a total of 185 samples using 19 microsatellite markers and a 671 bp fragment of the NADH dehydrogenase subunit 2 (ND2) gene. Across the Southern Hemisphere, three geographically distinct clades were recovered, including one from South America (Argentina, Chile), one from Africa (all the South African collections) and an Australia-New Zealand clade. Nuclear data revealed significant population subdivisions (FST = 0.192 to 0.376, p<0.05) indicating limited gene flow for tope sharks across ocean basins. Marked population connectivity was however evident across the Indian Ocean based on Bayesian clustering analysis. More locally in South Africa, F-statistics and multivariate analysis supported moderate to high gene flow across the Atlantic/Indian Ocean boundary (FST = 0.035 to 0.044, p<0.05), with exception of samples from Struisbaai and Port Elizabeth which differed significantly from the rest. Discriminant and Bayesian clustering analysis indicated admixture in all sampling populations, decreasing from west to east, corroborating possible restriction to gene flow across regional oceanographic barriers. Mitochondrial sequence data recovered seven haplotypes (h = 0.216, π = 0.001) for South Africa, with one major haplotype shared by 87% of the individuals and at least one private haplotype for each sampling location except Port Elizabeth. As with many other coastal shark species with cosmopolitan distribution, this study confirms the lack of both historical dispersal and inter-oceanic gene flow while also implicating contemporary factors such as oceanic currents and thermal fronts

  8. Identification of new genes in a cell envelope-cell division gene cluster of Escherichia coli: cell envelope gene murG.

    PubMed Central

    Salmond, G P; Lutkenhaus, J F; Donachie, W D

    1980-01-01

    We report the identification, cloning, and mapping of a new cell envelope gene, murG. This lies in a group of five genes of similar phenotype (in the order murE murF murG murC ddl) all concerned with peptidoglycan biosynthesis. This group is in a larger cluster of at least 10 genes, all of which are involved in some way with cell envelope growth. Images PMID:6998962

  9. Population Structure of Pseudocercospora fijiensis in Costa Rica Reveals Shared Haplotype Diversity with Southeast Asian Populations.

    PubMed

    Saville, Amanda; Charles, Melodi; Chavan, Suchitra; Muñoz, Miguel; Gómez-Alpizar, Luis; Ristaino, Jean Beagle

    2017-12-01

    Pseudocercospora fijiensis is the causal pathogen of black Sigatoka, a devastating disease of banana that can cause 20 to 80% yield loss in the absence of fungicides in banana crops. The genetic structure of populations of P. fijiensis in Costa Rica was examined and compared with Honduran and global populations to better understand migration patterns and inform management strategies. In total, 118 isolates of P. fijiensis collected from Costa Rica and Honduras from 2010 to 2014 were analyzed using multilocus genotyping of six loci and compared with a previously published global dataset of populations of P. fijiensis. The Costa Rican and Honduran populations shared haplotype diversity with haplotypes from Southeast Asia, Oceania, and the Americas but not Africa for all but one of the six loci studied. Gene flow and shared haplotype diversity was found in Honduran and Costa Rican populations of the pathogen. The data indicate that the haplotypic diversity observed in Costa Rican populations of P. fijiensis is derived from dispersal from initial outbreak sources in Honduras and admixtures between genetically differentiated sources from Southeast Asia, Oceania, and the Americas.

  10. Using haplotypes to unravel the inheritance of Holstein coat color for a larger audience

    USDA-ARS?s Scientific Manuscript database

    Haplotype testing identifies single-nucleotide polymorphisms that bracket a group of alleles from several different genes located on a specific chromosomal section of DNA. For a trait with a limited number of genotypes and phenotypes, the rules of inheritance can be determined by matching up certain...

  11. ASSOCIATION BETWEEN GAB2 HAPLOTYPE AND HIGHER GLUCOSE METABOLISM IN ALZHEIMER'S DISEASE-AFFECTED BRAIN REGIONS IN COGNITIVELY NORMAL APOEε4 CARRIERS

    PubMed Central

    Liang, Winnie S.; Chen, Kewei; Lee, Wendy; Sidhar, Kunal; Corneveaux, Jason J.; Allen, April N.; Myers, Amanda; Villa, Stephen; Meechoovet, Bessie; Pruzin, Jeremy; Bandy, Daniel; Fleisher, Adam S.; Langbaum, Jessica B.S.; Huentelman, Matthew J.; Jensen, Kendall; Dunckley, Travis; Caselli, Richard J.; Kaib, Susan; Reiman, Eric M.

    2010-01-01

    In a genome-wide association study (GWAS) of late-onset Alzheimer's disease (AD), we found an association between common haplotypes of the GAB2 gene and AD risk in carriers of the apolipoprotein E (APOE) ε4 allele, the major late-onset AD susceptibility gene. We previously proposed the use of fluorodeoxyglucose positron emission tomography (FDG-PET) measurements as a quantitative presymptomatic endophenotype, more closely related to disease risk than the clinical syndrome itself, to help evaluate putative genetic and non-genetic modifiers of AD risk. In this study, we examined the relationship between the presence or absence of the relatively protective GAB2 haplotype and PET measurements of regional-to-whole brain FDG uptake in several AD-affected brain regions in 158 cognitively normal late-middle-aged APOEε4 homozygotes, heterozygotes, and non-carriers. GAB2 haplotypes were characterized using Affymetrix Genome-Wide Human SNP 6.0 Array data from each of these subjects. As predicted, the possibly protective GAB2 haplotype was associated with higher regional-to-whole brain FDG uptake in AD-affected brain regions in APOEε4 carriers. While additional studies are needed, this study supports the association between the possibly protective GAB2 haplotype and the risk of late-onset AD in APOEε4 carriers. It also supports the use of brain-imaging endophenotypes to help assess possible modifiers of AD risk. PMID:20888920

  12. antiSMASH 3.0-a comprehensive resource for the genome mining of biosynthetic gene clusters.

    PubMed

    Weber, Tilmann; Blin, Kai; Duddela, Srikanth; Krug, Daniel; Kim, Hyun Uk; Bruccoleri, Robert; Lee, Sang Yup; Fischbach, Michael A; Müller, Rolf; Wohlleben, Wolfgang; Breitling, Rainer; Takano, Eriko; Medema, Marnix H

    2015-07-01

    Microbial secondary metabolism constitutes a rich source of antibiotics, chemotherapeutics, insecticides and other high-value chemicals. Genome mining of gene clusters that encode the biosynthetic pathways for these metabolites has become a key methodology for novel compound discovery. In 2011, we introduced antiSMASH, a web server and stand-alone tool for the automatic genomic identification and analysis of biosynthetic gene clusters, available at http://antismash.secondarymetabolites.org. Here, we present version 3.0 of antiSMASH, which has undergone major improvements. A full integration of the recently published ClusterFinder algorithm now allows using this probabilistic algorithm to detect putative gene clusters of unknown types. Also, a new dereplication variant of the ClusterBlast module now identifies similarities of identified clusters to any of 1172 clusters with known end products. At the enzyme level, active sites of key biosynthetic enzymes are now pinpointed through a curated pattern-matching procedure and Enzyme Commission numbers are assigned to functionally classify all enzyme-coding genes. Additionally, chemical structure prediction has been improved by incorporating polyketide reduction states. Finally, in order for users to be able to organize and analyze multiple antiSMASH outputs in a private setting, a new XML output module allows offline editing of antiSMASH annotations within the Geneious software. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  13. H2 haplotype at chromosome 17q21.31 protects against childhood sexual abuse-associated risk for alcohol consumption and dependence

    PubMed Central

    Nelson, Elliot C.; Agrawal, Arpana; Pergadia, Michele L.; Wang, Jen C.; Whitfield, John B.; Saccone, F. Scott; Kern, Jason; Grant, Julia D.; Schrage, Andrew J.; Rice, John P.; Montgomery, Grant W.; Heath, Andrew C.; Goate, Alison M.; Martin, Nicholas G.; Madden, Pamela A.F.

    2011-01-01

    Animal research supports a central role for corticotropin releasing factor (CRF) in actions of ethanol on brain function. An examination of alcohol consumption in adolescents reported a significant genotype × environment (G × E) interaction involving rs1876831, a CRHR1 polymorphism, and negative events. CRHR1 and at least 4 other genes are located at 17q21.31 in an extremely large block of high linkage disequilibrium resulting from a local chromosomal inversion; the minor allele of rs1876831 is contained within the H2 haplotype. Here we examine whether G × E interactions involving this haplotype and childhood sexual abuse (CSA) are associated with risk for alcohol consumption and dependence in Australian participants (N=1128 respondents from 476 families) of the Nicotine Addiction Genetics project. Telephone interviews provided data on DSM-IV alcohol dependence diagnosis and CSA and enabled calculation of lifetime alcohol consumption factor score (ACFS) from 4 indices of alcohol consumption. Individuals reporting a history of CSA had significantly higher ACFS and increased risk for alcohol dependence. A significant G × E interaction was found for ACFS involving the H2 haplotype and CSA (p<0.017). A similar G × E interaction was associated with protective effects against alcohol dependence risk (odds ratio 0.42; 95%CI 0.20 – 0.89). For each outcome, no significant CSA-associated risk was observed in H2 haplotype carriers. These findings support conducting further investigation of the H2 haplotype to determine the gene(s) responsible. Our results also suggest that severe early trauma may prove to be an important clinical covariate in the treatment of alcohol dependence. PMID:19878140

  14. Organization of the Escherichia coli K-12 gene cluster responsible for production of the extracellular polysaccharide colanic acid.

    PubMed Central

    Stevenson, G; Andrianopoulos, K; Hobbs, M; Reeves, P R

    1996-01-01

    Colanic acid (CA) is an extracellular polysaccharide produced by most Escherichia coli strains as well as by other species of the family Enterobacteriaceae. We have determined the sequence of a 23-kb segment of the E. coli K-12 chromosome which includes the cluster of genes necessary for production of CA. The CA cluster comprises 19 genes. Two other sequenced genes (orf1.3 and galF), which are situated between the CA cluster and the O-antigen cluster, were shown to be unnecessary for CA production. The CA cluster includes genes for synthesis of GDP-L-fucose, one of the precursors of CA, and the gene for one of the enzymes in this pathway (GDP-D-mannose 4,6-dehydratase) was identified by biochemical assay. Six of the inferred proteins show sequence similarity to glycosyl transferases, and two others have sequence similarity to acetyl transferases. Another gene (wzx) is predicted to encode a protein with multiple transmembrane segments and may function in export of the CA repeat unit from the cytoplasm into the periplasm in a process analogous to O-unit export. The first three genes of the cluster are predicted to encode an outer membrane lipoprotein, a phosphatase, and an inner membrane protein with an ATP-binding domain. Since homologs of these genes are found in other extracellular polysaccharide gene clusters, they may have a common function, such as export of polysaccharide from the cell. PMID:8759852

  15. Huntington disease in the South African population occurs on diverse and ethnically distinct genetic haplotypes

    PubMed Central

    Baine, Fiona K; Kay, Chris; Ketelaar, Maria E; Collins, Jennifer A; Semaka, Alicia; Doty, Crystal N; Krause, Amanda; Jacquie Greenberg, L; Hayden, Michael R

    2013-01-01

    Huntington disease (HD) is a neurodegenerative disorder resulting from the expansion of a CAG trinucleotide repeat in the huntingtin (HTT) gene. Worldwide prevalence varies geographically with the highest figures reported in populations of European ancestry. HD in South Africa has been reported in Caucasian, black and mixed subpopulations, with similar estimated prevalence in the Caucasian and mixed groups and a lower estimate in the black subpopulation. Recent studies have associated specific HTT haplotypes with HD in distinct populations. Expanded HD alleles in Europe occur predominantly on haplogroup A (specifically high-risk variants A1/A2), whereas in East Asian populations, HD alleles are associated with haplogroup C. Whether specific HTT haplotypes associate with HD in black Africans and how these compare with haplotypes found in European and East Asian populations remains unknown. The current study genotyped the HTT region in unaffected individuals and HD patients from each of the South African subpopulations, and haplotypes were constructed. CAG repeat sizes were determined and phased to haplotype. Results indicate that HD alleles from Caucasian and mixed patients are predominantly associated with haplogroup A, signifying a similar European origin for HD. However, in black patients, HD occurs predominantly on haplogroup B, suggesting several distinct origins of the mutation in South Africa. The absence of high-risk variants (A1/A2) in the black subpopulation may also explain the reported low prevalence of HD. Identification of haplotypes associated with HD-expanded alleles is particularly relevant to the development of population-specific therapeutic targets for selective suppression of the expanded HTT transcript. PMID:23463025

  16. New HLA haplotype frequency reference standards: high-resolution and large sample typing of HLA DR-DQ haplotypes in a sample of European Americans.

    PubMed

    Klitz, W; Maiers, M; Spellman, S; Baxter-Lowe, L A; Schmeckpeper, B; Williams, T M; Fernandez-Viña, M

    2003-10-01

    A collaborative study involving a large sample of European Americans was typed for the histocompatibility loci of the HLA DR-DQ region and subjected to intensive typing validation measures in order to accurately determine haplotype composition and frequency. The resulting tables have immediate application to HLA typing and allogeneic transplantation. The loci within the DR-DQ region are especially valuable for such an undertaking because of their tight linkage and high linkage disequilibrium. The 3798 haplotypes, derived from 1899 unrelated individuals, had a total of 75 distinct DRB1-DQA1-DQB1 haplotypes. The frequency distribution of the haplotypes was right skewed with haplotypes occurring at a frequency of less than 1% numbering 59 and yet constituting less than 12% of the total sample. Given DRB1 typing, it was possible to infer the exact DQA1 and DQB1 composition of a haplotype with high confidence (>90% likelihood) in 21 of the 35 high-resolution DRB1 alleles present in the sample. Of the DRB1 alleles without high reliability for DQ haplotype inference, only *0401, *0701 and *1302 were common, the remaining 11 DRB1 alleles constituting less than 5% of the total sample. This approach failed for the 13 serologically equivalent DR alleles in which only 33% of DQ haplotypes could be reliably inferred. The 36 DQA1-DQB1 haplotypes present in the total sample conformed to the known pattern of permissible heterodimers. Four DQA1-DQB1 haplotypes, all rare, are reported here for the first time. The haplotype frequency tables are suitable as a reference standard for HLA typing of the DR and DQ loci in European Americans.

  17. Novel Tenascin-C Haplotype Modifies the Risk for a Failure to Heal After Rotator Cuff Repair.

    PubMed

    Kluger, Rainer; Huber, Klaus R; Seely, Philipp G; Berger, Christian E; Frommlet, Florian

    2017-11-01

    Several single-nucleotide polymorphisms (SNPs) in the TNC gene have recently been found to be associated with degenerative rotator cuff tears. Exonic SNPs in the TNC gene are related to the risk for a failure to heal after rotator cuff repair. Case-control study; Level of evidence, 3. A total of 302 patients from the Vienna area and European Caucasian ancestry underwent mini-open rotator cuff repair for a full-thickness superior or posterosuperior tear and were assessed for the integrity of the repair 1 year postoperatively with a real-time 7.5- to 10-MHz ultrasound linear array transducer. Outcomes were classified as intact (complete footprint coverage), small (<200 mm 2 ), or large (≥200 mm 2 ) recurrent defect. Patients were genotyped for 15 previously identified risk SNPs within a 49-kbp segment of the TNC gene with the KASP genotyping technology or the Ion-Torrent Personal Genome Machine System. All recurrent defects were atraumatic failures, and the overall failure rate was 39.7%. Of the traditional risk factors, only the initial tear size was significantly associated with a failure to heal. In a multinomial logistic regression model, the T allele at rs1138545 [C>T] was protective for a large recurrent defect (odds ratio = 0.16; 95% CI, 0.09-0.31). The role of rs1138545 was further backed by haplotype analysis, which showed that the combination of the C allele at rs1138545 [C>T], the A allele at rs2104772 [A>T], and the G allele at rs10759752 [A>G] formed the risk-related haplotype [CAG]. The CAG haplotype was associated with large recurrent defects ( P < .0001; haplotype frequency, 0.394; haplotype score, 4.518). Exonic marker rs1138545 transcribed into all isoforms of the TNC protein, whereas exonic marker rs2104772, which has been associated with Achilles tendinopathy before, transcribed only into large isoforms of the TNC protein. Recurrent defects after rotator cuff repairs are clinically relevant, and a heritable component of the disorder is plausible

  18. Methylenetetrahydrofolate reductase gene haplotypes affect toxicity during maintenance therapy for childhood acute lymphoblastic leukemia in Japanese patients.

    PubMed

    Tanaka, Yoichi; Manabe, Atsushi; Nakadate, Hisaya; Kondoh, Kensuke; Nakamura, Kozue; Koh, Katsuyoshi; Kikuchi, Akira; Komiyama, Takako

    2014-05-01

    Abstract The aim of this study was to investigate the influence of daily 6-mercaptopurine (6-MP) and low-dose weekly methotrexate (MTX) combination treatment and methylenetetrahydrofolate reductase (MTHFR) haplotypes on toxicity during maintenance therapy in Japanese childhood acute lymphoblastic leukemia (ALL). We retrospectively analyzed the MTHFR C677T and A1298C polymorphisms and influence of haplotypes on toxicity in 73 patients. Patients with the MTHFR 677TT and 677CT + 1298AC were associated with severe liver toxicity (p = 0.014, odds ratio [OR] = 3.82, 95% confidence interval [CI] = 1.27-11.46) and more rapid onset of liver toxicity (p = 0.010). Patients with MTHFR 677TT and 677CT + 1298AC were associated with lower frequency of 6-MP and MTX dose reduction due to leukopenia (p < 0.05). No difference was observed in average drug doses in the MTHFR genotypes. In conclusion, the MTHFR C677T and A1298C haplotypes might be useful for monitoring adverse effects in childhood ALL maintenance therapy in Japanese patients.

  19. Characterization of a Major Cluster of nif, fix, and Associated Genes in a Sugarcane Endophyte, Acetobacter diazotrophicus

    PubMed Central

    Lee, Sunhee; Reth, Alexander; Meletzus, Dietmar; Sevilla, Myrna; Kennedy, Christina

    2000-01-01

    A major 30.5-kb cluster of nif and associated genes of Acetobacter diazotrophicus (syn. Gluconacetobacter diazotrophicus), a nitrogen-fixing endophyte of sugarcane, was sequenced and analyzed. This cluster represents the largest assembly of contiguous nif-fix and associated genes so far characterized in any diazotrophic bacterial species. Northern blots and promoter sequence analysis indicated that the genes are organized into eight transcriptional units. The overall arrangement of genes is most like that of the nif-fix cluster in Azospirillum brasilense, while the individual gene products are more similar to those in species of Rhizobiaceae or in Rhodobacter capsulatus. PMID:11092875

  20. A founder haplotype of APOE-Sendai mutation associated with lipoprotein glomerulopathy.

    PubMed

    Toyota, Kentaro; Hashimoto, Taeko; Ogino, Daisuke; Matsunaga, Akira; Ito, Minoru; Masakane, Ikuto; Degawa, Noriyuki; Sato, Hiroshi; Shirai, Sayuri; Umetsu, Kazuo; Tamiya, Gen; Saito, Takao; Hayasaka, Kiyoshi

    2013-05-01

    Lipoprotein glomerulopathy (LPG) is a hereditary disease characterized by lipoprotein thrombi in the glomerulus, hyperlipoproteinemia, and a marked increase in serum apolipoprotein E (APOE). More than 12 APOE mutations have been identified as causes of LPG, and APOE-Sendai (Arg145Pro) mutation was frequently detected in patients from the eastern part of Japan including Yamagata prefecture. Recently, effective therapy with intensive lipid-lowering agents was established, and epidemiologic data are required for early diagnosis. We determined the haplotype structure of APOE-Sendai in 13 patients from 9 unrelated families with LPG, and found that the haplotype of all APOE-Sendai mutations was identical, suggesting that APOE-Sendai mutation is common in Japanese patients probably through a founder effect. We also studied the gene frequency of APOE-Sendai in 2023 control subjects and 418 patients receiving hemodialysis in Yamagata prefecture using the TaqMan method, but did not identify any subjects carrying the mutation, indicating that it is very rare in the general population even in the eastern part of Japan. In addition to APOE mutation, other genetic and/or epigenetic factors are considered to be involved in the pathogenesis of LPG because of its low penetrance. The patients did not have a common haplotype of the counterpart APOE allele, and some patients had the same haplotype of the counterpart APOE allele as the asymptomatic carriers. These results suggest that the counterpart APOE allele is not likely associated with the onset of LPG. Further study is required to clarify the pathogenesis of LPG.